<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Steve Tjoa</title>
	<atom:link href="http://stevetjoa.com/feed" rel="self" type="application/rss+xml" />
	<link>http://stevetjoa.com</link>
	<description>Imagine Research, Inc.</description>
	<lastBuildDate>Sun, 29 Jan 2012 09:14:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Fourier at a Glance: Continuous vs. Discrete, Series vs. Transform</title>
		<link>http://stevetjoa.com/633</link>
		<comments>http://stevetjoa.com/633#comments</comments>
		<pubDate>Tue, 15 Nov 2011 05:56:18 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[fourier]]></category>
		<category><![CDATA[signal-processing]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=633</guid>
		<description><![CDATA[When I co-lectured ENEE322: Signal and System Theory, I wrote this handout as a complement to Oppenheim and Willsky. In particular, Table 4.1 on page 14, pictured above, shows the relationships between the continuous-time Fourier series (CTFS), discrete-time Fourier series (DTFS), continuous-time Fourier transform (CTFT), and discrete-time Fourier transform (DTFT). Note the similarities and differences [...]]]></description>
			<content:encoded><![CDATA[<p>When I co-lectured <a href="http://www.ece.umd.edu/Academic/Under/ucourses1.htm#ENEE%20322" title="ENEE322: Signal and System Theory" target="_blank">ENEE322: Signal and System Theory</a>, I wrote <a href="http://up.stevetjoa.com/notes322_20091119.pdf" title="ENEE322: Notes to Oppenheim and Willsky" target="_blank">this handout</a> as a complement to <em>Oppenheim and Willsky</em>. </p>
<p><a href="http://up.stevetjoa.com/fourier_table41.png"><img alt="Table 4.1: Comparison of Fourier Operations" src="http://up.stevetjoa.com/fourier_table41.png" title="Table 4.1: Comparison of Fourier Operations" class="alignnone" width="679" height="277" /></a><span id="more-633"></span></p>
<p>In particular, <strong>Table 4.1</strong> on page 14, pictured above, shows the relationships between the continuous-time Fourier series (CTFS), discrete-time Fourier series (DTFS), continuous-time Fourier transform (CTFT), and discrete-time Fourier transform (DTFT). Note the similarities and differences among the four operations:</p>
<ul>
<li>&#8220;Series&#8221;: periodic in time, discrete in frequency</li>
<li>&#8220;Transform&#8221;: aperiodic in time, continuous in frequency</li>
<li>&#8220;Continuous Time&#8221;: continuous in time, aperiodic in frequency</li>
<li>&#8220;Discrete Time&#8221;: discrete in time, periodic in frequency</li>
</ul>
<p>My motive behind writing <a href="http://up.stevetjoa.com/notes322_20091119.pdf" title="ENEE322: Notes to Oppenheim and Willsky" target="_blank">these notes</a> was to present the most important concepts from <em>Oppenheim and Willsky</em> as concisely as possible. Often, textbooks sacrifice clarity for depth. Here, I present the main concepts from separate chapters as close together as possible so that students can more easily discover the <em>relationships</em> between concepts rather than studying each chapter in isolation.</p>
<p>For anyone taking or teaching signal processing, please feel free to use these notes as you wish, and let me know what you think!</p>
<p>(This post was inspired by <a href="http://dsp.stackexchange.com/questions/646/what-is-the-most-lucid-intuitive-explanation-for-the-various-fts-cft-dft-dt/657#657" target="_blank">this answer</a> that I posted to dsp.stackexchange.com.)</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/633/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Factorization of Overlapping Harmonic Sounds Using Approximate Matching Pursuit</title>
		<link>http://stevetjoa.com/611</link>
		<comments>http://stevetjoa.com/611#comments</comments>
		<pubDate>Sat, 29 Oct 2011 05:29:50 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dictionary-learning]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=611</guid>
		<description><![CDATA[Steven K. Tjoa and K. J. Ray Liu ISMIR, October 2011 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2011ismir, title = "Factorization of Overlapping Harmonic Sounds Using Approximate Matching Pursuit", author = "Steven K. Tjoa and K. J. Ray Liu", booktitle = "Proc. Int. Soc. Music Information Retrieval Conf.", address = "Miami, FL", year = "2011", month = [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Steven K. Tjoa and K. J. Ray Liu</li>
<li>ISMIR, October 2011</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2011ismir.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/poster20111025ismir.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2011ismir,
  title = "Factorization of Overlapping Harmonic Sounds Using Approximate Matching Pursuit",
  author = "Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. Int. Soc. Music Information Retrieval Conf.",
  address = "Miami, FL",
  year = "2011",
  month = oct,
  pages = "257--262"
};</pre>
</li>
</ul>
<p><span id="more-611"></span></p>
<p>Sparse coding and nonnegative matrix factorization (NMF) have revolutionized the area of music transcription. These methods usually decompose spectra from input signals as linear combinations of dictionary spectra. Often, the dictionary is either learned (as in NMF) or synthetic (e.g., harmonic templates, Gabor atoms, etc.).</p>
<p>Instead, what if you <em>already have</em> a massive overcomplete dictionary containing <em>millions</em> of well-labeled atoms from real-world sounds? Assuming that the dictionary is large and representative enough, there is no longer a need to learn. However, coding (i.e., finding the coefficients of the input with respect to the dictionary) remains a problem. With such a large dictionary, complexity becomes an issue.</p>
<p>We propose a method called <strong>Approximate Matching Pursuit</strong> (AMP) which allows you to efficiently obtain a sparse decomposition of an input vector using a massive overcomplete dictionary. AMP is a simple variant of matching pursuit or orthogonal matching pursuit. However, instead of finding an exact match between the input (or residue) and the dictionary at each iteration, we only find an <em>approximate</em> match. In doing so, we sacrifice a bit of accuracy to obtain significant savings in computation.</p>
<p>Using AMP, we decomposed input spectra of containing overlapping harmonic sounds, where the amount of polyphony varies. Our results showed that, while retrieval performance is similar to orthogonal matching pursuit based upon F-measure, execution time and computation is reduced significantly.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/611/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Super-resolution of Musical Signals Using Approximate Matching Pursuit</title>
		<link>http://stevetjoa.com/592</link>
		<comments>http://stevetjoa.com/592#comments</comments>
		<pubDate>Thu, 20 Oct 2011 20:19:04 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[bp-keegan]]></category>
		<category><![CDATA[ieee-waspaa]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[s-tjoa]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=592</guid>
		<description><![CDATA[Brennan P. Keegan, Steven K. Tjoa, and K. J. Ray Liu IEEE WASPAA, October 2011 Download: Paper, Poster, BibTeX @INPROCEEDINGS{keegan2011waspaa, title = "Super-resolution of Musical Signals Using Approximate Matching Pursuit", author = "Brennan P. Keegan and Steven K. Tjoa and K. J. Ray Liu", booktitle = "Proc. IEEE Workshop on Applications of Signal Processing to [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Brennan P. Keegan, Steven K. Tjoa, and K. J. Ray Liu</li>
<li>IEEE WASPAA, October 2011</li>
<li>Download: <a href='http://up.stevetjoa.com/keegan2011waspaa.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/poster20111017waspaa.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{keegan2011waspaa,
  title = "Super-resolution of Musical Signals Using Approximate Matching Pursuit",
  author = "Brennan P. Keegan and Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics",
  address = "New Paltz, NY",
  year = "2011",
  month = oct,
  pages = "81--84"
};</pre>
</li>
</ul>
<p><span id="more-592"></span></p>
<p>Super-resolution is the act of increasing the resolution (or sampling rate) of a signal in an intelligent manner that preserves the signal&#8217;s ground truth information, i.e., not through some parametric method like linear or sinusoidal interpolation. For example, when you use Adobe Photoshop or GIMP to make an image larger, the new image will likely be blurry because the program does not know how to restore sharp edges.</p>
<p>Super-resolution of images and video has been studied extensively, but not as much for audio. When an audio signal is upsampled from 4 kHz to 44.1 kHz using traditional sinc interpolation, the upsampled signal will not contain any new information that is not already present in the 4 kHz version. As a result, high-frequency components will be absent, and the signal will sound dull and muffled.</p>
<p>There are a few methods for audio super-resolution. For example, Smaragdis et al. have proposed a method whose basic idea is to take the low-resolution input signal and <em>project</em> it upon a low-resolution basis. Using that projection (or the <em>coefficients</em> with respect to the low-resolution basis), a high-resolution signal is constructed by multiplying the coefficients with components from a high-resolution basis.</p>
<p>What basis do you use? In the literature, that basis (or overcomplete dictionary &#8212; perhaps not orthogonal) is usually obtained in one of two ways: use a synthetic basis (Gabor, etc.), or learn it from real data (NMF, K-SVD, etc.).</p>
<p>We ask this question: What if you already have a massive overcomplete dictionary containing <em>millions</em> of well-labeled atoms from real-world sounds? Can you make use of it? How? With such a large dictionary, complexity becomes an issue. Using ordinary matching pursuit, each iteration has complexity that is linear in the size of the dictionary. This is not scalable.</p>
<p>We propose a method called <strong>Approximate Matching Pursuit</strong> which allows you to efficiently obtain a sparse decomposition of an input vector using a massive overcomplete dictionary. Using it, we perform super-resolution of signals containing piano music. Using a dictionary composed of sounds from the University of Iowa data set, we were able to accurately estimate missing high-frequency information using low-resolution inputs sampled as low as 2-4 kHz. For more details, please see the paper above.</p>
<p>For more about Approximate Matching Pursuit, please see our upcoming paper in ISMIR 2011.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/592/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>My Academic Genealogy Since Helmholtz</title>
		<link>http://stevetjoa.com/547</link>
		<comments>http://stevetjoa.com/547#comments</comments>
		<pubDate>Tue, 23 Aug 2011 04:58:33 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=547</guid>
		<description><![CDATA[The Mathematics Genealogy Project is an online service that lets you explore the academic lineage of mathematicians and those in related areas such as engineering. Using it, I discovered that I descend from Hermann von Helmholtz, famous for his work in acoustics (and a whole lot of other stuff). It is interesting to see how [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://genealogy.math.ndsu.nodak.edu/index.php" target="_blank">Mathematics Genealogy Project</a> is an online service that lets you explore the academic lineage of mathematicians and those in related areas such as engineering. Using it, I discovered that I descend from <a href="http://en.wikipedia.org/wiki/Hermann_von_Helmholtz" target="_blank">Hermann von Helmholtz</a>, famous for his work in acoustics (and a whole lot of other stuff). <span id="more-547"></span></p>
<p>It is interesting to see how thesis topics evolve from advisor to advisee, yet my research area &#8212; music information retrieval &#8212; is not so removed from Helmholtz&#8217;s work on acoustics; in fact, MIR scholars still cite and recognize his contributions!</p>
<p>Here are the people that make up my lineage since Helmholtz, each person&#8217;s dissertation title, place of graduate study, and graduation year.</p>
<ol>
<li>Hermann von Helmholtz, <em>De fabrica systematis nervosi evertebratorum</em>, Universität Berlin, 1842.</li>
<li>Edward Leamington Nichols, <em>Ueber das von glühendem Platin ausgestrahlte Licht: Ein Beitrag zur allgemeinen Ausstrahlungslehr</em>, Georg-August-Universität Göttingen, 1879.</li>
<li>Ernest Fox Nichols, <em>Radiometric Researches in the Remote Infra-Red Spectrum</em>, Cornell, 1897.</li>
<li>Frederic Columbus Blake, <em>The Reflection and Transmission of Electric Waves by Screens of Resonators and by Grids</em>, Columbia, 1906.</li>
<li>William Littell Everitt, <em>The Calculation and Design of Alternating Current Networks Employing Triodes Operating During a Portion of a Cycle</em>, Ohio State, 1933.</li>
<li>Karl Ralph Spangenberg, <em>The Effect of Grid Current Flow Upon the Dynamic Characteristics of Vacuum Tube Power</em>, Ohio State, 1937.</li>
<li>Willis Harman, <em>Tunable Waveguide Cavity Resonators for Broadband Operation of Reflex Klystrons</em>, Stanford, 1948.</li>
<li>John Bowman Thomas, <em>On the Statistical Design of Demodulation Systems for Signals in Additive Noise</em>, Stanford, 1955.</li>
<li>Kung Yao, <em>On Some Representations and Sampling Expansions for Band Limited Signals</em>, Princeton, 1965.</li>
<li>K. J. Ray Liu, <i>Efficient and Reliable Parallel Processing Algorithms and Architectures for Modern Signal Processing</i>, UCLA, 1990.</li>
<li>Steve Tjoa, <i>Sparse and Nonnegative Factorizations for Music Understanding</i>, Maryland, 2011.</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/547/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CCRMA MIR Workshop 2011</title>
		<link>http://stevetjoa.com/568</link>
		<comments>http://stevetjoa.com/568#comments</comments>
		<pubDate>Mon, 04 Jul 2011 23:17:10 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ccrma]]></category>
		<category><![CDATA[imagine-research]]></category>
		<category><![CDATA[music]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=568</guid>
		<description><![CDATA[We recently hosted a one-week long workshop on music information retrieval at the Center for Computer Research in Music and Acoustics (CCRMA) at Stanford University. Led by Imagine Research CEO Jay LeBoeuf, this workshop included other notable scholars and instructors from the MIR community such as Rebecca Fiebrink, Stephen Pope, Leigh Smith, George Tzanetakis, and [...]]]></description>
			<content:encoded><![CDATA[<p>We recently hosted a one-week long <a href="https://ccrma.stanford.edu/workshops/intelligent-audio-systems-foundations-and-applications-of-music-information-retrieval-mir" target="_blank">workshop on music information retrieval</a> at the <a href="http://ccrma.stanford.edu" target="_blank">Center for Computer Research in Music and Acoustics</a> (CCRMA) at <a href="http://stanford.edu" target="_blank">Stanford University</a>. <span id="more-568"></span></p>
<p>Led by <a href="http://imagine-research.com" target="_blank">Imagine Research</a> CEO <a href="http://www.linkedin.com/in/jayleboeuf" target="_blank">Jay LeBoeuf</a>, this workshop included other notable scholars and instructors from the MIR community such as <a href="http://www.cs.princeton.edu/~fiebrink/Rebecca_Fiebrink/welcome.html" target="_blank">Rebecca Fiebrink</a>, <a href="http://heaveneverywhere.com/stp/" target="_blank">Stephen Pope</a>, <a href="http://www.leighsmith.com/" target="_blank">Leigh Smith</a>, <a href="http://webhome.cs.uvic.ca/~gtzan/" target="_blank">George Tzanetakis</a>, and <a href="http://www.iro.umontreal.ca/~eckdoug/" target="_blank">Doug Eck</a>.</p>
<p>We discussed MIR tools and applications such as machine learning, feature extraction, real-time processing, rhythm/tempo estimation, pitch detection, psychoacoustics, cover song retrieval, recommendation, collaborative filtering, and more. Participants included both students and industry practitioners. I presented a few things related to sparse coding and nonnegative matrix factorization for music transcription and source separation. This year, labs included basic programming tasks in Matlab, C++, and Python. For more details about the workshop, please see the <a href="https://ccrma.stanford.edu/wiki/MIR_workshop_2011" target="_blank">workshop wiki</a>.</p>
<p>We hope to see <em>you</em> at the workshop next year! Feel free to email me if you have any questions.</p>
<p>Here is a photo taken on the last day of the workshop right before we departed. (Click to see larger photo.)</p>
<div id="attachment_570" class="wp-caption alignnone" style="width: 490px"><a href="http://up.stevetjoa.com/ccrma2011mir.jpg"><img src="http://up.stevetjoa.com/ccrma2011mir-480x320.jpg" alt="CCRMA MIR 2011 Workshop Participants" title="ccrma2011mir" width="480" height="320" class="size-large wp-image-570" /></a><p class="wp-caption-text">From left: Chris Colatos, Jeff Albert, Kamlesh Lakshminarayanan, Sean Zhang, Doug Eck, Eli Stine, David Bird, Gina Collecchia, Stephen Pope, Steve Tjoa. Not pictured: Jay LeBoeuf, Rebecca Fiebrink, George Tzanetakis, Leigh Smith, Dekun Zou, Bill Paseman, John Amuedo.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/568/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Status Update</title>
		<link>http://stevetjoa.com/561</link>
		<comments>http://stevetjoa.com/561#comments</comments>
		<pubDate>Sun, 03 Jul 2011 17:31:22 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ccrma]]></category>
		<category><![CDATA[imagine-research]]></category>
		<category><![CDATA[music]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=561</guid>
		<description><![CDATA[This page has been inactive since February. Here&#8217;s why. On May 12, 2011, I defended my dissertation entitled &#8220;Sparse and Nonnegative Factorizations for Music Understanding&#8220;. Matrix factorizations have become popular for performing music transcription and source separation. However, the basic factorization algorithms have trouble decomposing highly polyphonic music. In this dissertation, I proposed factorization methods [...]]]></description>
			<content:encoded><![CDATA[<p>This page has been inactive since February. Here&#8217;s why. <span id="more-561"></span></p>
<p>On May 12, 2011, I defended my dissertation entitled &#8220;<a href="http://www.ece.umd.edu/events/index.php?mode=4&#038;id=6160" target="_blank">Sparse and Nonnegative Factorizations for Music Understanding</a>&#8220;. Matrix factorizations have become popular for performing music transcription and source separation. However, the basic  factorization algorithms have trouble decomposing highly polyphonic music. In this dissertation, I proposed factorization methods that are specifically suited for analyzing musical signals, including a method that can efficiently use a musical spectral dictionary if one already exists.</p>
<p>On May 20, I graduated.</p>
<p>On May 28, I boarded a one-way flight to San Francisco. I left my 1999 Saturn SC2 in New Jersey, shipped nineteen boxes to San Francisco, and sold everything else.</p>
<p>On June 6, I began work at <a href="http://imagine-research.com" target="_blank">Imagine Research, Inc.</a>, in San Francisco.  We build software that understands sound. My position is sponsored by a <a href="http://www.ece.umd.edu/News/news_story.php?id=5744" target="_blank">National Science Foundation (NSF) Postdoctoral Research Fellowship</a> and administered by the American Society for Engineering Education (ASEE).</p>
<p>On June 14, I signed a lease for my new apartment. After two weeks of apartment hunting, I found a wonderful place in the Sunset district of San Francisco for only $800 per month with views of Golden Gate Park right across the street.</p>
<p>From June 27 to July 1, I attended and helped instruct a <a href="https://ccrma.stanford.edu/workshops/intelligent-audio-systems-foundations-and-applications-of-music-information-retrieval-mir" target="_blank">workshop on music information retrieval</a> at the Center for Computer Research in Music and Acoustics (CCRMA) at Stanford. We all had a great time talking about music, signal processing, and machine learning.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/561/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>LaTeX At A Glance, v. 1.0</title>
		<link>http://stevetjoa.com/540</link>
		<comments>http://stevetjoa.com/540#comments</comments>
		<pubDate>Thu, 10 Feb 2011 04:49:11 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[latex]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=540</guid>
		<description><![CDATA[Yesterday, I led a workshop on LaTeX for engineering students. As a supplement, I wrote the following document to help people get started with LaTeX. Your feedback is welcome. Enjoy. [PDF]]]></description>
			<content:encoded><![CDATA[<p>Yesterday, I led a workshop on LaTeX for engineering students. As a supplement, I wrote the following document to help people get started with LaTeX. Your feedback is welcome. Enjoy. [<a href="http://up.stevetjoa.com/latex.pdf">PDF</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/540/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thirty Tips For Dissertation Writing</title>
		<link>http://stevetjoa.com/492</link>
		<comments>http://stevetjoa.com/492#comments</comments>
		<pubDate>Sat, 05 Feb 2011 02:44:20 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[grad-school]]></category>
		<category><![CDATA[writing]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=492</guid>
		<description><![CDATA[Earlier today, I attended an excellent workshop by Dr. Rachna Jain on writing the dissertation. About 150-200 students attended. Here is some of her advice. Writing Strategies Writing is not revising. When you are writing, just write! Don&#8217;t stop, don&#8217;t backspace, don&#8217;t correct. Keep moving forward. Forward, forward, forward. Revise later. Perfectionism is your worst [...]]]></description>
			<content:encoded><![CDATA[<p>Earlier today, I attended an excellent workshop by <a href="http://completeyourdissertation.com" target="_blank">Dr. Rachna Jain</a> on writing the dissertation. About 150-200 students attended. Here is some of her advice.<span id="more-492"></span></p>
<ol>
<h3>Writing Strategies</h3>
<li><strong>Writing is not revising</strong>. When you are writing, just write! Don&#8217;t stop, don&#8217;t backspace, don&#8217;t correct. Keep moving forward. Forward, forward, forward. Revise later. Perfectionism is your worst enemy, particularly at early stages.</li>
<li>Write in <strong>layers</strong>. First, crude main ideas. Then, fill in the gaps. Finally, citations.</li>
<li><strong>Start with what you already know</strong>. &#8220;First drafts come from the heart.&#8221; In verbal conversation, you effortlessly tell the other person what you already know. When writing, pretend to tell someone the story. Then write the story. You may actually find it useful to record your own voice as you tell yourself the story so you can play it back as you write.</li>
<li>Write <strong>at least one page of your dissertation per day</strong>. Often more, but at least one page. And it doesn&#8217;t have to be in order. Any one page is fine.</li>
<li>Allow <strong>rough drafts</strong>. <a href="http://www.google.com/url?sa=t&#038;source=web&#038;cd=1&#038;ved=0CBMQFjAA&#038;url=http%3A%2F%2Fwww.openforum.com%2Fidea-hub%2Ftopics%2Ftechnology%2Farticle%2Fthe-iterate-fast-and-release-often-philosophy-of-entrepreneurship-ben-parr&#038;rct=j&#038;q=launch%20quickly%20iterate%20rapidly&#038;ei=j7ZMTfCDOoa8lQfpprQP&#038;usg=AFQjCNHh2XteBeOMs0RIvdXBmgmcQ-cENA&#038;sig2=fUw2_K9KVMgJ9ehhpS8xYA&#038;cad=rja" target="_blank">Launch quickly and iterate rapidly</a>. The rough draft should be very rough. Don&#8217;t worry about errors. Just write, baby. But also turn around revisions quickly.</li>
<li>Use an <strong>outline</strong>. It helps keep a coherent flow throughout your dissertation. Try constructing a mindmap &#8212; a &#8220;tree&#8221; of ideas with the core (root) idea in the center and branch ideas around the root, and so on.</li>
<li>&#8220;<strong>Powerpoint</strong>&#8221; your ideas. Then turn your slides into prose.</li>
<li>Seek <strong>feedback</strong> regularly. Feedback helps turn revisions around quickly, too.</li>
<li>At first, <strong>ignore your audience</strong>. Write for yourself, first. Then, once you have written a fair amount, consider your audience, and revise. Write outward, not inward.</li>
<li><strong>Writing out of order is fine</strong>, perhaps even preferable. I made the mistake of writing my early research papers in order from introduction to conclusion. However, research is always so unpredictable and amorphous that the important middle sections would always change. Subsequently, so would the introduction and conclusion. Now, I begin the middle sections first, then write the introduction, conclusion, and abstract last.</li>
<h3>Goals and Planning</h3>
<li><strong>Set many small goals</strong>. Break writing tasks into small sections. Incorporate transitions later.</li>
<li>Use <strong>organic goal setting</strong>. Only set goals for each week. Believe it or not, do not set rigid goals for a long period, e.g., semester. Shifting the finish line is deadly. Shift it once, and you get comfortable with it.</li>
<li>Set <strong>specific, measurable goals</strong>. &#8220;Finish dissertation&#8221; is too broad. &#8220;Finish chapter 1&#8243; is still too broad. &#8220;Write five pages of subsection 2.1 from 1 pm to 1:45 pm&#8221; is better. Think in terms of pages per chapter or pages per section.</li>
<li><strong>Plan your progress</strong>. Keep track, e.g., with a calendar.</li>
<li><strong>Time yourself</strong>! You can use <a href="http://www.google.com/search?q=timer" target="_blank">website timers</a>. I am timing myself right now. I started this article at 8:53 pm and completely ended it at 9:39 pm. (Small typo corrections came after.)</li>
<li>The optimal chunk of time for contiguous writing is <strong>45 minutes</strong>.</li>
<li>Aim for <strong>5-7 hours of writing</strong> on a writing day. Do not exceed 7 hours. And not every day should be a dedicated writing day.</li>
<li><strong>Take fifteen-minute breaks</strong>. Get up and walk around. Observe your surroundings. &#8220;To be a good writer, one needs to be a good observer.&#8221;</li>
<h3>Environment</h3>
<li>Work in a sparse, <strong>uncluttered space</strong>. Visual clutter tends to clutter the mind.</li>
<li>Write as <strong>early in the day</strong> as possible. That is when you are freshest. If you must procrastinate, do not, under any circumstances, do it in the morning.</li>
<li><strong>Prepare the night before.</strong> Set your goals for the next day. That will make the following morning more productive.</li>
<li><strong>Change physical environments</strong>. I have heard this advice often. Sometimes, after working in the same place for so long, you grow too comfortable and complacent around it. Try the library or the cafe. Be around others who are also working. Of course, avoid areas that are too disruptive. Classical or soft instrumental music is fine. Music with words is naturally distracting to humans, whether we consciously realize it or not.</li>
<li><strong>Write with others</strong>. Being around others united in the same general goal is energizing. That type of work ethic becomes contagious.</li>
<li>But <strong>don&#8217;t compare with others</strong>. Focus on yourself.</li>
<h3>Internal Attitudes and Self-Improvement</h3>
<li><strong>Write something every day,</strong> even if it is not on the dissertation. Every skill takes <a href="http://alainsaffel.com/10000-hours/" target="_blank">10,000 hours</a> to master. That includes writing. Writing is like exercising a muscle. You only get good at it through practice. That can include a personal journal, emails, a blog post like this one, etc.</li>
<li><strong>Feel good first</strong>. Then write. A burdened mind is an unproductive mind. Negative thoughts are blocking. Get everything else in your life in order to the greatest extent possible. Think positive. Then write.</li>
<li>If you cannot get started, <strong>write about what troubles you</strong>. Notice what you keep thinking about.</li>
<li>If you still cannot get started, ask yourself, &#8220;<strong>Why did I care?</strong>&#8221; What made you start this project in the first place? Writer&#8217;s block comes from a lack of emotional engagement with one&#8217;s work. As time passes, it is natural to become more emotionally detached from the work. Step back, and remind yourself why you care and what makes your project so awesome.</li>
<li><strong>Reward yourself</strong>. Reward positive behavior. Reward your accomplishments, but only when you actually reach a milestone. Otherwise, don&#8217;t. (I asked, &#8220;But when I have momentum, I hate to disrupt my momentum with something frivolous like a reward. Is that okay to skip rewards?&#8221; Her answer: &#8220;No.&#8221; Because if you rely on momentum, then something must be wrong with the system. Therefore, improve the system. Ask yourself why you are so reliant on momentum in the first place. Otherwise, writer&#8217;s block will inevitably occur again.)</li>
<h3>&#8230; and a final reminder</h3>
<li><strong>Activity is not necessarily productivity</strong>. Output is what counts.</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/492/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Engineers Build Computerized Beauty Contest Judges. Swell.</title>
		<link>http://stevetjoa.com/470</link>
		<comments>http://stevetjoa.com/470#comments</comments>
		<pubDate>Wed, 19 Jan 2011 20:59:45 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[machine-learning]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=470</guid>
		<description><![CDATA[I was browsing Stack Overflow when I came across this question that asks how to use image processing and machine learning to measure the beauty of a human face. As this answer explains, papers have already been written on the topic, including this one in ECCV 2010. At AdMIRe 2009, I remember hearing renowned researcher [...]]]></description>
			<content:encoded><![CDATA[<p>I was browsing Stack Overflow when I came across <a href="http://stackoverflow.com/questions/4734602/image-recognition-using-supervised-or-unsupervised-learning">this question</a> that asks how to use image processing and machine learning to <em>measure the beauty of a human face</em>. As <a href="http://stackoverflow.com/questions/4734602/image-recognition-using-supervised-or-unsupervised-learning/4735893#4735893">this answer</a> explains, papers have already been written on the topic, including <a href="http://users.soe.ucsc.edu/~dgray/dgray_eccv2010final.pdf">this one</a> in ECCV 2010. <span id="more-470"></span></p>
<p>At <a href="http://www.cp.jku.at/conferences/admire2009/">AdMIRe 2009</a>, I remember hearing renowned researcher <a href="http://research.yahoo.com/Malcolm_Slaney">Malcolm Slaney</a> say the following, paraphrased: &#8220;Computers should solve computer problems, and humans should solve human problems.&#8221; I agree 100 percent. Yet, in the exciting and sometimes amusing world of machine learning, this principle is violated all the time. The ECCV paper is just one example.</p>
<p>I can understand why this paper was accepted. To be fair, the paper is well written. (Having reviewed many papers, I find that proper English can go a <em>long</em> way.) It uses standard machine learning approaches that have been applied to similar problems such as face recognition and gender recognition, e.g., PCA, neural networks, and gradient descent. Papers that ask provocative questions and address undersaturated problems tend to be accepted, as they should. Then again, as a reviewer, I would claim that there exists no ground truth for beauty, and so the proposed system can spit out whatever nonsense it wants, and there would be no way to validate or invalidate it.</p>
<p>More importantly, this research is just plain <em>mean</em>. People are sensitive about their appearance. To propose a computerized system that can quantify and rank the beauty of human faces is provocative at best and unprofessional at worst. Oh, and all of the test subjects are female. Draw your own conclusions.</p>
<p>Here&#8217;s one clever piece of marketing: the authors derive two mathematical operators, <em>beautify</em> and <em>beastify</em>, where one is simply the negative of the other. (Beastify? Seriously??) Applying the beautify operator to an image makes the person more beautiful, and the beastify operator does the opposite. How cute.</p>
<p>The practitioner in me can never imagine such a system being used for any practical purpose. I can find utility &#8212; at least an <em>iota</em> of utility &#8212; in related problems such as face recognition, speaker recognition, gender recognition, genre recognition (for music), ethnicity recognition, language recognition, and gesture recognition. But where would beauty measurement be used? The Miss Universe Pageant? American Idol? Hollywood auditions? Facebook? (Actually, I wouldn&#8217;t put that last one outside the realm of possibility.)</p>
<p>Computers should solve computer problems. &#8220;Multiply 9832473 by 29834&#8243; is a computer problem. &#8220;Compress this 12-megapixel image&#8221; is a computer problem. &#8220;Encode this bitstream so that it is cryptographically secure and robust to channel errors&#8221; is a computer problem. Let&#8217;s leave the human problems for humans.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/470/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Resumé Template in LaTeX</title>
		<link>http://stevetjoa.com/449</link>
		<comments>http://stevetjoa.com/449#comments</comments>
		<pubDate>Thu, 02 Dec 2010 23:33:59 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[latex]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=449</guid>
		<description><![CDATA[People have been asking for the LaTeX template that I used to build my resumé/CV. Here it is: template (tex) and output (PDF, 90KB). I use pdflatex to compile. I wrote the template myself. A lot of it is self-explanatory. The template relies mainly on section*, subparagraph, and itemize. Comments: The fancyhdr package allows custom [...]]]></description>
			<content:encoded><![CDATA[<p>People have been asking for the LaTeX template that I used to build my resumé/CV. Here it is: <span id="more-449"></span> <a href="http://up.stevetjoa.com/cvtemplate20101202.tex">template (tex)</a> and <a href="http://up.stevetjoa.com/cvtemplate20101202.pdf">output (PDF, 90KB)</a>. I use <tt>pdflatex</tt> to compile.</p>
<p>I wrote the template myself. A lot of it is self-explanatory. The template relies mainly on <tt>section*</tt>, <tt>subparagraph</tt>, and <tt>itemize</tt>. Comments:</p>
<ol>
<li>The <tt>fancyhdr</tt> package allows custom headers and footers. That&#8217;s how I made the three-part footer.</li>
<li>I used <tt>renewcommand</tt> to remove item labels.</li>
<li>The Publications section uses the <tt>multibib</tt> package to embed a bibliography (or multiple bibliographies) in the middle of a document. With <tt>multibib</tt>, here are the commands for an entire build:<br />
<code><br />
pdflatex cv.tex<br />
bibtex c.aux<br />
bibtex j.aux<br />
pdflatex cv.tex<br />
pdflatex cv.tex<br />
</code></li>
</ol>
<p>Feel free to modify. I would appreciate at least some modification so your resumé doesn&#8217;t look exactly the same as mine. Let me know how it goes. Good luck.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/449/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The One-Step Build for Academic Researchers</title>
		<link>http://stevetjoa.com/318</link>
		<comments>http://stevetjoa.com/318#comments</comments>
		<pubDate>Sun, 17 Oct 2010 23:00:51 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[build]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=318</guid>
		<description><![CDATA[The Joel Test is a set of twelve simple yes/no questions written by Joel Spolsky that is supposed to measure how good a software team is. Some of these questions include &#8220;Do you have a spec?&#8221;, &#8220;Do you have testers?&#8221;, and &#8220;Do you use source control?&#8221;. A software team that can answer &#8220;yes&#8221; to all [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.joelonsoftware.com/articles/fog0000000043.html" target="_blank">The Joel Test</a> is a set of twelve simple yes/no questions written by <a href="http://www.joelonsoftware.com/AboutMe.html" target="_blank">Joel Spolsky</a> that is supposed to measure how good a software team is. Some of these questions include &#8220;Do you have a spec?&#8221;, &#8220;Do you have testers?&#8221;, and &#8220;Do you use source control?&#8221;. A software team that can answer &#8220;yes&#8221; to all twelve questions probably produces excellent software, according to Spolsky. <span id="more-318"></span>Since being published in 2000, this test has achieved quite a bit of fame among software engineers in industry.</p>
<p>But what about academia? For example, how well does my research group perform on the Joel Test? After all, we write software, too. Should academic researchers be held to the same standard as a full-time software engineer in industry? I <a href="http://stackoverflow.com/questions/2734159/workflow-for-academic-research-projects-one-step-builds-and-the-joel-test" target="_blank">asked Stack Overflow</a> and got <a href="http://stackoverflow.com/questions/2734159/workflow-for-academic-research-projects-one-step-builds-and-the-joel-test/2734434#2734434" target="_blank">one response</a> that says <em>no</em>, and I agree. Academia and industry have different goals and different resources. Most research groups I know don&#8217;t have full-time testers. (Duh.) They don&#8217;t make daily builds, and they certainly don&#8217;t have the best tools money can buy.</p>
<p>Nevertheless, item 2 of this test reads, &#8220;<em>Can you make a build in one step?</em>&#8221; Spolsky elaborates:</p>
<blockquote><p>&#8230; how many steps does it take to make a shipping build from the latest source snapshot? &#8230; If the process takes any more than one step, it is prone to errors. And when you get closer to shipping, you want to have a very fast cycle of fixing the &#8220;last&#8221; bug, making the final EXEs, etc. If it takes 20 steps to compile the code, run the installation builder, etc., you&#8217;re going to go crazy and you&#8217;re going to make silly mistakes.</p></blockquote>
<p>I definitely did not have a one-step build for my research projects. And as predicted by Spolsky, without a one-step build, I went crazy. I could not remember the entire process to produce my results and incorporate them into my paper, especially after a few days of programming inactivity. When my numerical results changed, I had to repopulate the tables in the paper. All of these things could have been avoided by using a one-step build.</p>
<p><em>So I wrote one in Python.</em> It is a single file named <code>build.py</code>. As shown in Figure 1, this build accepts three inputs: source code, data, and a paper template. The primary output is a paper &#8212; the ultimate goal of any research project.<br />
<img src="http://up.stevetjoa.com/onestepbuild.jpg" border=0 alt="one-step build" />
</p>
<p>The build has three internal components:</p>
<ol>
<li>Compile the source code to generate an executable. This step is most important for languages such as C or C++. In languages such as Python or Matlab, you don&#8217;t need to worry about this step.</li>
<li>Feed data into the executable to generate numerical results. I save these results to a file either as text or a serialized format (e.g., using <code>pickle</code> in Python).</li>
<li>Feed results into a paper template to generate the paper. I have a <code>paper_template.tex</code> file that contains placeholders to be filled in. Then, I use the <code>string.Template</code> module in Python to fill in the results. (Perl is a good choice, too.) Finally, I use <code>pdflatex</code> to compile the <code>tex</code> file into a PDF document.</li>
</ol>
<p>To run the build, I simply type <code>python build.py</code> at the command line. That is the beauty of the one-step build: at the single push of a button, you generate a new program, new results, and a new paper. If you are really lazy, you could even tell Linux to automatically run this script when you turn on the computer.</p>
<p>If you are having trouble managing the software for your research projects, write a one-step build. Good luck, and let me know how it goes.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/318/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I used Matlab. Now I use Python.</title>
		<link>http://stevetjoa.com/305</link>
		<comments>http://stevetjoa.com/305#comments</comments>
		<pubDate>Sat, 25 Sep 2010 21:26:57 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[matlab]]></category>
		<category><![CDATA[python]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=305</guid>
		<description><![CDATA[Colleagues have asked me why I changed from Matlab to Python, and what makes Python so great. For example, a friend recently asked the following: I noticed that the use of python to do signal processing tasks is slowly increasing (saw it mentioned in some papers). I am curious to know what are the advantages/disadvantages [...]]]></description>
			<content:encoded><![CDATA[<p>Colleagues have asked me why I changed from Matlab to Python, and what makes Python so great. For example, a friend recently asked the following: <span id="more-305"></span></p>
<blockquote><p>I noticed that the use of python to do signal processing tasks is slowly increasing (saw it mentioned in some papers). I am curious to know what are the advantages/disadvantages of python vis-a-vis C or Matlab. What do you think are the main pros and cons that have led you to use python?</p></blockquote>
<p>Because this precise question &#8212; why Python over Matlab? &#8212; has become so popular, I thought I would offer my reasons for switching.</p>
<ol>
<li>Python is totally <b>free</b>. Matlab is not. Undergrads living on campus are able to get Matlab for free through&#8230; ahem&#8230; alternative means. As a grad student disconnected from on-campus life, it is hard for me to use such&#8230; ahem&#8230; alternative means to get a copy of Matlab, especially for Linux.</li>
<li>Python is a <strong>general-purpose language</strong>, i.e., it is used for scientific computing, enterprise software, web design, back end, front end, and everything in between. Matlab is purely for scientific computing, although it does a great job at it.</li>
<li>Python <strong>syntax is beautiful</strong>. Once you get over the use of meaningful whitespace, you realize how much it makes sense. Famous entrepreneur and investor <a href="http://paulgraham.com" target="_blank">Paul Graham</a> mentions a friend who uses Python because &#8220;<a href="http://paulgraham.com/pypar.html" target="_blank">he likes the way source code looks.</a>&#8221; Graham explains further:<br />
<blockquote><p>That may seem a frivolous reason to choose one language over another. But it is not so frivolous as it sounds: when you program, you spend more time reading code than writing it. You push blobs of source code around the way a sculptor does blobs of clay. So a language that makes source code ugly is maddening to an exacting programmer, as clay full of lumps would be to a sculptor.</p></blockquote>
</li>
<li>Python is <strong>inherently object oriented</strong>. Almost everything is an object: strings, lists, dictionaries, tuples, functions, classes, and more. The implied usefulness is that these things each have their own members and methods that encapsulate its functionality and information.</li>
<li>Python is <strong>high level, easy to learn, and fast to develop</strong>. (So is Matlab.) Program scripts are easily profiled and debugged. Although C++ may save you one minute of computation, Python may save you one month of development.</li>
<li>Python has so many <strong>cool features</strong> that makes tasks that are difficult in other languages easy. For example, see <a href="http://stackoverflow.com/questions/101268/hidden-features-of-python" target="_blank">these features</a>.</li>
<li>Python is <strong>fast enough</strong>. Bottlenecks can be addressed by writing a bit of C (something I have not yet needed to do), or you can parallelize your code easily in Python. For what it&#8217;s worth, Python was written in C.</li>
<li>Python is <strong>popular and has a great community</strong> among programmers, scientists, mathematicians, and engineers, making it easy to find help on the Internet. At the moment, &#8220;Python&#8221; is the <a href="http://stackoverflow.com/tags" target="_blank">tenth most popular tag on StackOverflow</a> (which, by the way, may be my favorite site on the Internet), and Python is the second most popular language among <a href="http://projecteuler.net/index.php?section=statistics" target="_blank">Project Euler members</a>.</li>
<li>Python has <strong>great libraries</strong>. It also makes any library you want already available. Want something that does hierarchical clustering? There are a bunch of Python packages that do exactly that.</li>
<li>Python packages <strong>can do nearly everything Matlab can do for signal processing</strong>.  <a href="http://numpy.scipy.org/" target="_blank">NumPy</a> is like the Matlab core, <a href="http://www.scipy.org/SciPy" target="_blank">SciPy</a> is like Matlab toolboxes, <a href="http://matplotlib.sourceforge.net/" target="_blank">Matplotlib</a> lets you print pretty graphs, and <a target="_blank" href="http://ipython.scipy.org/">IPython</a> emulates the Matlab desktop environment. Over the years, additions and revisions to NumPy/SciPy were written to make Matlab users comfortable, e.g., <a href="http://www.scipy.org/PyLab" target="_blank">PyLab</a>. Many sites offer help regarding the transition from Matlab to NumPy/SciPy, such as <a href="http://www.scipy.org/NumPy_for_Matlab_Users" target="_blank">this</a>.</li>
<li>The <strong>demand for Python programmers is increasing</strong>. Many large companies (e.g., <a href="http://www.google.com/intl/en/jobs/students/us/technical/software-engineer-new-grad-north-america-locations/" target="_blank">Google</a>, <a href="http://www.facebook.com/careers/department.php?dept=engineering" target="_blank">Facebook</a>) and small startups are hiring people with Python experience. Even the finance industry is interested. The U.S. Securities and Exchange Commission <a href="http://www.sec.gov/rules/proposed/2010/33-9117.pdf">recently proposed a mandate</a> that would require securities issuers to submit a computer program that maps the logic flow of funds, and that &#8220;this computer program be filed&#8230; in Python&#8221;.</li>
</ol>
<p>Disadvantages? Python is probably easier to learn from scratch than C++, but it still takes time. If you are already comfortable with Matlab, and you <em>have</em> Matlab, then you should keep using Matlab. But if you want a language that is free, flexible, popular, and may make you more marketable, then give Python a try.  Believe me, I like Matlab. I just like Python better.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/305/feed</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Summary: Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms</title>
		<link>http://stevetjoa.com/334</link>
		<comments>http://stevetjoa.com/334#comments</comments>
		<pubDate>Sat, 31 Jul 2010 23:49:31 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[instrument-rec]]></category>
		<category><![CDATA[ismir]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=334</guid>
		<description><![CDATA[Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms Steven K. Tjoa and K. J. Ray Liu Int. Soc. Music Information Retrieval Conf., August 2010 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2010ismir, title = "Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms", author = "Steven K. Tjoa and K. J. Ray Liu", [...]]]></description>
			<content:encoded><![CDATA[<h3>Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms</h3>
<ul>
<li>Steven K. Tjoa and K. J. Ray Liu</li>
<li>Int. Soc. Music Information Retrieval Conf., August 2010</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2010ismir.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa20100812ismir.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2010ismir,
  title = "Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms",
  author = "Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. Int. Soc. Music Information Retrieval Conf.",
  address = "Utrecht, Netherlands",
  year = "2010",
  month = aug,
  pages = "435--440"
};</pre>
</li>
</ul>
<p>Most musical instrument recognition systems rely upon spectral information to classify sounds. Can <em>temporal</em> information improve classification accuracy even further? <span id="more-334"></span></p>
<p>Evidence from the psychoacoustic literature suggests that both spectral and temporal content carry information about acoustic timbre through the human auditory system. In other words, because humans recognize musical instruments so effortlessly, then perhaps machines would also benefit from a combination of spectral and temporal information. Researchers use a model known as the cortical representation to emulate the information output from the middle stage of the human auditory system. Although some engineers have tried to build music information retrieval (MIR) systems that use the cortical representation, its high dimensionality makes it difficult to employ in practical systems.</p>
<p>But what if we can embody the traits of the cortical representation into another representation that also includes spectral and temporal content? One such candidate is nonnegative matrix factorization (NMF), a tool that can extract spectral and temporal information from spectrograms.</p>
<p>In this paper, we test the usefulness of temporal information extracted using NMF in instrument recognition. We mimic the multiresolution aspect of the cortical representation by using a multiresolution gamma filterbank that parameterizes the shape of temporal envelopes. Our results show that this method of temporal processing can classify among isolated sounds among 24 instrument classes with 92.3% accuracy.<br />
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/334/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints</title>
		<link>http://stevetjoa.com/269</link>
		<comments>http://stevetjoa.com/269#comments</comments>
		<pubDate>Mon, 22 Mar 2010 07:50:51 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[nmf]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[source-separation]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=269</guid>
		<description><![CDATA[Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints Steven K. Tjoa and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2010icassp_cooccurrence, title = "Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints", author = "Steven K. Tjoa and K. J. Ray Liu", [...]]]></description>
			<content:encoded><![CDATA[<h3>Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints</h3>
<ul>
<li>Steven K. Tjoa and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2010icassp_cooccurrence.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2010icassp_poster1.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2010icassp_cooccurrence,
  title = "Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints",
  author = "Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  address = "Dallas, TX",
  year = "2010",
  month = mar,
  pages = "449--452"
};</pre>
</li>
</ul>
<p>Nonnegative matrix factorization (NMF) has become a popular tool for discovering structure in a variety of signals. When applied to a musical audio signal, NMF builds a set of <em>dictionary atoms</em> that represent the individual musical sources in the signal. To perform music transcription, we map the learned dictionary atoms to musical notes and beats.<span id="more-269"></span></p>
<p>The basic formulation of NMF has the notable disadvantage that sources may require more than a single dictionary atom in order to be approximated accurately. For example, one note played by a violin may require multiple dictionary atoms to be accurately represented due to the vibrato induced by the performer. In the presence of many other musical sources, the correspondence between atoms becomes unclear.</p>
<p>In this paper, we introduce <em>three new update rules</em> to enforce dependence among dictionary atoms by incorporating <em>co-occurrence constraints</em> into NMF.  These co-occurrence constraints have shown to be useful for describing sources with multiple, co-occurring dictionary atoms by grouping similar atoms into sets. The proposed rules are conceptually simple, easy to implement, and effective for describing sources using multiple dictionary atoms.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/269/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Harmonic Variable-Size Dictionary Learning for Music Source Separation</title>
		<link>http://stevetjoa.com/278</link>
		<comments>http://stevetjoa.com/278#comments</comments>
		<pubDate>Mon, 22 Mar 2010 07:20:19 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dictionary-learning]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[k-svd]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[mc-stamm]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[nmf]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[source-separation]]></category>
		<category><![CDATA[transcription]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=278</guid>
		<description><![CDATA[Harmonic Variable-Size Dictionary Learning for Music Source Separation Steven K. Tjoa, Matthew C. Stamm, W. Sabrina Lin, and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2010icassp_harmonic, title = "Harmonic Variable-Size Dictionary Learning for Music Source Separation", author = "Steven K. Tjoa and Matthew C. [...]]]></description>
			<content:encoded><![CDATA[<h3>Harmonic Variable-Size Dictionary Learning for Music Source Separation</h3>
<ul>
<li>Steven K. Tjoa, Matthew C. Stamm, W. Sabrina Lin, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2010icassp_harmonic.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2010icassp_poster2.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2010icassp_harmonic,
  title = "Harmonic Variable-Size Dictionary Learning for Music Source Separation",
  author = "Steven K. Tjoa and Matthew C. Stamm and W. Sabrina Lin and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  address = "Dallas, TX",
  year = "2010",
  month = mar,
  pages = "413--416"
};</pre>
</li>
</ul>
<p>Methods that employ dictionary learning and sparse coding have become popular for discovering structure in acoustic signals. Unfortunately, these methods also share a common limitation. When there is significant spectral-temporal overlap among the dictionary atoms present in a signal, it becomes difficult for these methods to learn atoms properly. <span id="more-278"></span>Often, information from multiple atoms is represented as a single atom by the learning procedure. If an atom in the output dictionary contains musical information from multiple sources, transcription and source separation cannot be accurately performed.</p>
<p>In this paper, we propose a novel dictionary learning method that performs well despite the presence of spectral-temporal overlap among dictionary atoms. Our method imposes a <em>harmonic constraint</em> that restricts each atom to represent at most one pitch. Furthermore, our method is <em>flexible</em> by allowing the size of the dictionary to grow based upon the complexity of the input signal. Our method consistently achieves higher recall and precision than other well-known dictionary learning algorithms.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/278/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Anti-Forensics of JPEG Compression</title>
		<link>http://stevetjoa.com/294</link>
		<comments>http://stevetjoa.com/294#comments</comments>
		<pubDate>Mon, 22 Mar 2010 06:05:30 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[mc-stamm]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=294</guid>
		<description><![CDATA[Anti-Forensics of JPEG Compression Matthew C. Stamm, Steven K. Tjoa, W. Sabrina Lin, and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010 Download: Paper, BibTeX @INPROCEEDINGS{stamm2010icassp_antiforensics, title = "Anti-Forensics of {JPEG} Compression", author = "Matthew C. Stamm and Steven K. Tjoa and W. Sabrina Lin and K. J. Ray [...]]]></description>
			<content:encoded><![CDATA[<h3>Anti-Forensics of JPEG Compression</h3>
<ul>
<li>Matthew C. Stamm, Steven K. Tjoa, W. Sabrina Lin, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010</li>
<li>Download: <a href="http://up.stevetjoa.com/stamm2010icassp_antiforensics.pdf">Paper</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{stamm2010icassp_antiforensics,
  title = "Anti-Forensics of {JPEG} Compression",
  author = "Matthew C. Stamm and Steven K. Tjoa and W. Sabrina Lin and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  address = "Dallas, TX",
  year = "2010",
  month = mar,
  pages = "1694--1697"
};</pre>
</li>
</ul>
<p>Digital image forensics is a relatively new field that has only become popular within the last seven years. A newer body of research involves <em>anti-forensics</em> &#8212; methods of processing images that disguise evidence of earlier image processing or tampering. By altering the forensically significant properties in an image, existing image forensic systems may be rendered useless. <span id="more-294"></span>Like many areas of security, we benefit from the study of anti-forensics by understanding the attacks that are most likely to occur before they actually do. Then, we can refine our image forensic systems further.</p>
<p>Earlier, our group proposed the <a href="http://stevetjoa.com/110">first forensic system</a> that identifies which compression method was used upon a digital image. In this paper, we propose an anti-forensic method for <em>defeating the identification of JPEG compression</em> in an image. Our method alters the intrinsic fingerprint of JPEG-compressed images in the transform domain such that the attacked image is perceptually similar to the image before attack. To our knowledge, this work is the <em>first</em> to propose a method specifically designed to defeat compression identification systems, and we anticipate that more work will follow.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/294/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Digital Image Source Coder Forensics via Intrinsic Fingerprints</title>
		<link>http://stevetjoa.com/110</link>
		<comments>http://stevetjoa.com/110#comments</comments>
		<pubDate>Mon, 01 Feb 2010 03:04:52 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[hv-zhao]]></category>
		<category><![CDATA[ieee-tifs]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=110</guid>
		<description><![CDATA[Digital Image Source Coder Forensics via Intrinsic Fingerprints W. Sabrina Lin, Steven K. Tjoa, H. Vicky Zhao, and K. J. Ray Liu IEEE Trans. Information Forensics and Security, September 2009 Download: Paper, BibTeX @ARTICLE{lin2009tifs, title = "Digital Image Source Coder Forensics via Intrinsic Fingerprints", author = "W. Sabrina Lin and Steven K. Tjoa and H. [...]]]></description>
			<content:encoded><![CDATA[<h3>Digital Image Source Coder Forensics via Intrinsic Fingerprints</h3>
<ul>
<li>W. Sabrina Lin, Steven K. Tjoa, H. Vicky Zhao, and K. J. Ray Liu</li>
<li>IEEE Trans. Information Forensics and Security, September 2009</li>
<li>Download: <a href='http://up.stevetjoa.com/lin2009tifs.pdf'>Paper</a>, <a class="bibtex">BibTeX</a>
<pre>@ARTICLE{lin2009tifs,
  title = "Digital Image Source Coder Forensics via Intrinsic Fingerprints",
  author = "W. Sabrina Lin and Steven K. Tjoa and H. Vicky Zhao and K. J. Ray Liu",
  journal = "IEEE Trans. Information Forensics and Security",
  year = "2009",
  month = sep,
  volume = "4",
  number = "3",
  pages = "460--475"
};</pre>
</li>
</ul>
<p>Multimedia forensic methods allow us to verify and maintain the integrity of our multimedia data. For example, we can embed a watermark into a digital image to bind the identity of its owner to the image itself. However, traditional forensic approaches such as watermarking are not applicable in many real-world scenarios, for example, when we do not have access to the original data. <span id="more-110"></span></p>
<p>In this paper, we investigate the use of <em>intrinsic fingerprints</em> &#8212; subsets of data which are, or have become, an intrinsic part of the data in question &#8212;  for formulating a forensic methodology that can identify the compression history of a digital image. By examining the intrinsic fingerprints in an image, we can tell what compression method (e.g., JPEG, etc.) was used in order to determine the origin of the image and thereby assess its authenticity.</p>
<p>Although there exist methods that estimate parameters of individual compression schemes, to our knowledge, our work is the <em>first</em> to address the problem of image compression <em>identification</em> and offer a unified forensic framework incorporating multiple encoders.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/110/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Transform Coder Classification for Digital Image Forensics</title>
		<link>http://stevetjoa.com/123</link>
		<comments>http://stevetjoa.com/123#comments</comments>
		<pubDate>Sun, 31 Jan 2010 21:39:26 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[ieee-icip]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=123</guid>
		<description><![CDATA[Transform Coder Classification for Digital Image Forensics Steven Tjoa, W. Sabrina Lin, and K. J. Ray Liu IEEE Int. Conf. Image Processing, September 2007 Download: Paper, Presentation, BibTeX @INPROCEEDINGS{tjoa2007icip, title = "Transform Coder Classification for Digital Image Forensics", author = "Steven Tjoa and W. Sabrina Lin and K. J. Ray Liu", booktitle = "Proc. IEEE [...]]]></description>
			<content:encoded><![CDATA[<h3>Transform Coder Classification for Digital Image Forensics</h3>
<ul>
<li>Steven Tjoa, W. Sabrina Lin, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Image Processing, September 2007</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2007icip.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2007icip_presentation.pdf'>Presentation</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2007icip,
  title = "Transform Coder Classification for Digital Image Forensics",
  author = "Steven Tjoa and W. Sabrina Lin and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. on Image Processing",
  address = "San Antonio, TX",
  year = "2007",
  month = sep,
  volume = "6",
  pages = {VI-105--VI-108},
};</pre>
</li>
</ul>
<p>Our <a href="http://stevetjoa.com/110">TIFS paper</a> was the first work to offer a unified forensic framework that can identify the compression method used upon an image. Among these compression methods are <em>transform coders</em> &#8212; compression methods that apply some form of entropy coding in the transform domain (e.g., via discrete cosine transform, wavelet transform, etc.). <span id="more-123"></span> Transform coding is undoubtedly the most popular category of image coders in use today. Therefore, the forensic problem of transform coder classification deserves attention.</p>
<p>In this paper, we propose a method that can identify <em>which transform</em> was used upon an image during the compression process. By analyzing the transform coefficient histogram obtained after applying a candidate transform, we can determine if the image was compressed using the transform tested. Results show that forensic classification accuracy is high when testing from among six well-known transforms. </p>
<p>To our knowledge, this work is <em>the first</em> to address the problem of transform coder classification and propose a working solution.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/123/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Image Source Coding Forensics Via Intrinsic Fingerprints</title>
		<link>http://stevetjoa.com/131</link>
		<comments>http://stevetjoa.com/131#comments</comments>
		<pubDate>Sun, 31 Jan 2010 10:52:33 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[hv-zhao]]></category>
		<category><![CDATA[ieee-icme]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=131</guid>
		<description><![CDATA[Image Source Coding Forensics Via Intrinsic Fingerprints W. Sabrina Lin, Steven Tjoa, H. Vicky Zhao, and K. J. Ray Liu IEEE Int. Conf. Multimedia and Expo, July 2007 Download: Paper, BibTeX @INPROCEEDINGS{lin2007icme, title = "Image Source Coding Forensics via Intrinsic Fingerprints", author = "W. Sabrina Lin and Steven Tjoa and H. Vicky Zhao and K. [...]]]></description>
			<content:encoded><![CDATA[<h3>Image Source Coding Forensics Via Intrinsic Fingerprints</h3>
<ul>
<li>W. Sabrina Lin, Steven Tjoa, H. Vicky Zhao, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Multimedia and Expo, July 2007</li>
<li>Download: <a href='http://up.stevetjoa.com/lin2007icme.pdf'>Paper</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{lin2007icme,
  title = "Image Source Coding Forensics via Intrinsic Fingerprints",
  author = "W. Sabrina Lin and Steven Tjoa and H. Vicky Zhao and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Multimedia and Expo",
  address = "Beijing, China",
  year = "2007",
  month = jul,
  pages = "1127--1130",
};</pre>
</li>
</ul>
<p>The most popular image compression schemes in use today are lossy, i.e., compression imposes some irreversible distortion in the image in order to achieve a smaller file size. Because each compression method imposes different kinds of distortion, the distortion can act as a <em>fingerprint</em> of the compression method. <span id="more-131"></span> Existing image compression methods can be grouped into a few categories &#8212; e.g., transform coding, vector quantization, subband coding, linear predictive coding, embedded coding &#8212; where all methods in the same category leave behind the same type of fingerprint in an image during compression.</p>
<p>In this paper, we analyze the <em>intrinsic fingerprints</em> of different types of image compression methods and propose a forensic system that identifies the type of the compression and provides a confidence measure in the system&#8217;s decision. Results show that the system can achieve a probability of detection of 0.82 for an image PSNR of 40 dB and even higher accuracy for lower PSNRs.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/131/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Block Size Forensic Analysis in Digital Images</title>
		<link>http://stevetjoa.com/136</link>
		<comments>http://stevetjoa.com/136#comments</comments>
		<pubDate>Sun, 31 Jan 2010 06:09:41 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[hv-zhao]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=136</guid>
		<description><![CDATA[Block Size Forensic Analysis in Digital Images Steven Tjoa, W. Sabrina Lin, H. Vicky Zhao, and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, April 2007 Download: Paper, Presentation, BibTeX @INPROCEEDINGS{tjoa2007icassp, title = "Block Size Forensic Analysis in Digital Images", author = "Steven Tjoa and W. Sabrina Lin and H. Vicky [...]]]></description>
			<content:encoded><![CDATA[<h3>Block Size Forensic Analysis in Digital Images</h3>
<ul>
<li>Steven Tjoa, W. Sabrina Lin, H. Vicky Zhao, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, April 2007</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2007icassp.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2007icassp_presentation.pdf'>Presentation</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2007icassp,
  title = "Block Size Forensic Analysis in Digital Images",
  author = "Steven Tjoa and W. Sabrina Lin and H. Vicky Zhao and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  year = "2007",
  address = "Honolulu, HI",
  month = apr,
  volume = "1",
  pages = {I-633--I-636}
};</pre>
</li>
</ul>
<p>Our work on image compression forensics attempts to identify the compression method used upon an image by analyzing its intrinsic fingerprint. Many popular image compression methods such as JPEG employ block processing. Therefore, to even begin forensic analysis for digital images, we must first address the presence of block processing on our image data. <span id="more-136"></span>For forensic analysis of block-based coding schemes, estimating the block size is an obvious and crucial first step, because inaccurate block size estimation can possibly invalidate subsequent forensic tests.</p>
<p>Block artifact measurement is a well-established research area, with purposes primarily related to image restoration and distortion measurement. However, artifact measurement for the purpose of <em>forensic analysis</em> has not been explored. Here, we pose the following question: given a compressed image, can we detect the presence of block processing? If so, can we estimate the block size? Existing work in block artifact measurement is not tailored to answer this question due to strong assumptions placed upon the input data.</p>
<p>In this paper, we propose a novel scheme to detect the presence of block processing along with estimation of the block size. We first obtain a block artifact signature and then estimate the block size through maximum-likelihood estimation. Finally, we propose a binary hypothesis test that identifies the presence or absence of block processing with high detection rates and low false alarm rates.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/136/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

