<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Steve Tjoa &#187; kjr-liu</title>
	<atom:link href="http://stevetjoa.com/tag/kjr-liu/feed" rel="self" type="application/rss+xml" />
	<link>http://stevetjoa.com</link>
	<description>iZotope, Inc.</description>
	<lastBuildDate>Mon, 07 May 2012 07:44:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Factorization of Overlapping Harmonic Sounds Using Approximate Matching Pursuit</title>
		<link>http://stevetjoa.com/611</link>
		<comments>http://stevetjoa.com/611#comments</comments>
		<pubDate>Sat, 29 Oct 2011 05:29:50 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dictionary-learning]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=611</guid>
		<description><![CDATA[Steven K. Tjoa and K. J. Ray Liu ISMIR, October 2011 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2011ismir, title = "Factorization of Overlapping Harmonic Sounds Using Approximate Matching Pursuit", author = "Steven K. Tjoa and K. J. Ray Liu", booktitle = "Proc. Int. Soc. Music Information Retrieval Conf.", address = "Miami, FL", year = "2011", month = [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Steven K. Tjoa and K. J. Ray Liu</li>
<li>ISMIR, October 2011</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2011ismir.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/poster20111025ismir.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2011ismir,
  title = "Factorization of Overlapping Harmonic Sounds Using Approximate Matching Pursuit",
  author = "Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. Int. Soc. Music Information Retrieval Conf.",
  address = "Miami, FL",
  year = "2011",
  month = oct,
  pages = "257--262"
};</pre>
</li>
</ul>
<p><span id="more-611"></span></p>
<p>Sparse coding and nonnegative matrix factorization (NMF) have revolutionized the area of music transcription. These methods usually decompose spectra from input signals as linear combinations of dictionary spectra. Often, the dictionary is either learned (as in NMF) or synthetic (e.g., harmonic templates, Gabor atoms, etc.).</p>
<p>Instead, what if you <em>already have</em> a massive overcomplete dictionary containing <em>millions</em> of well-labeled atoms from real-world sounds? Assuming that the dictionary is large and representative enough, there is no longer a need to learn. However, coding (i.e., finding the coefficients of the input with respect to the dictionary) remains a problem. With such a large dictionary, complexity becomes an issue.</p>
<p>We propose a method called <strong>Approximate Matching Pursuit</strong> (AMP) which allows you to efficiently obtain a sparse decomposition of an input vector using a massive overcomplete dictionary. AMP is a simple variant of matching pursuit or orthogonal matching pursuit. However, instead of finding an exact match between the input (or residue) and the dictionary at each iteration, we only find an <em>approximate</em> match. In doing so, we sacrifice a bit of accuracy to obtain significant savings in computation.</p>
<p>Using AMP, we decomposed input spectra of containing overlapping harmonic sounds, where the amount of polyphony varies. Our results showed that, while retrieval performance is similar to orthogonal matching pursuit based upon F-measure, execution time and computation is reduced significantly.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/611/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Super-resolution of Musical Signals Using Approximate Matching Pursuit</title>
		<link>http://stevetjoa.com/592</link>
		<comments>http://stevetjoa.com/592#comments</comments>
		<pubDate>Thu, 20 Oct 2011 20:19:04 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[bp-keegan]]></category>
		<category><![CDATA[ieee-waspaa]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[s-tjoa]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=592</guid>
		<description><![CDATA[Brennan P. Keegan, Steven K. Tjoa, and K. J. Ray Liu IEEE WASPAA, October 2011 Download: Paper, Poster, BibTeX @INPROCEEDINGS{keegan2011waspaa, title = "Super-resolution of Musical Signals Using Approximate Matching Pursuit", author = "Brennan P. Keegan and Steven K. Tjoa and K. J. Ray Liu", booktitle = "Proc. IEEE Workshop on Applications of Signal Processing to [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li>Brennan P. Keegan, Steven K. Tjoa, and K. J. Ray Liu</li>
<li>IEEE WASPAA, October 2011</li>
<li>Download: <a href='http://up.stevetjoa.com/keegan2011waspaa.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/poster20111017waspaa.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{keegan2011waspaa,
  title = "Super-resolution of Musical Signals Using Approximate Matching Pursuit",
  author = "Brennan P. Keegan and Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics",
  address = "New Paltz, NY",
  year = "2011",
  month = oct,
  pages = "81--84"
};</pre>
</li>
</ul>
<p><span id="more-592"></span></p>
<p>Super-resolution is the act of increasing the resolution (or sampling rate) of a signal in an intelligent manner that preserves the signal&#8217;s ground truth information, i.e., not through some parametric method like linear or sinusoidal interpolation. For example, when you use Adobe Photoshop or GIMP to make an image larger, the new image will likely be blurry because the program does not know how to restore sharp edges.</p>
<p>Super-resolution of images and video has been studied extensively, but not as much for audio. When an audio signal is upsampled from 4 kHz to 44.1 kHz using traditional sinc interpolation, the upsampled signal will not contain any new information that is not already present in the 4 kHz version. As a result, high-frequency components will be absent, and the signal will sound dull and muffled.</p>
<p>There are a few methods for audio super-resolution. For example, Smaragdis et al. have proposed a method whose basic idea is to take the low-resolution input signal and <em>project</em> it upon a low-resolution basis. Using that projection (or the <em>coefficients</em> with respect to the low-resolution basis), a high-resolution signal is constructed by multiplying the coefficients with components from a high-resolution basis.</p>
<p>What basis do you use? In the literature, that basis (or overcomplete dictionary &#8212; perhaps not orthogonal) is usually obtained in one of two ways: use a synthetic basis (Gabor, etc.), or learn it from real data (NMF, K-SVD, etc.).</p>
<p>We ask this question: What if you already have a massive overcomplete dictionary containing <em>millions</em> of well-labeled atoms from real-world sounds? Can you make use of it? How? With such a large dictionary, complexity becomes an issue. Using ordinary matching pursuit, each iteration has complexity that is linear in the size of the dictionary. This is not scalable.</p>
<p>We propose a method called <strong>Approximate Matching Pursuit</strong> which allows you to efficiently obtain a sparse decomposition of an input vector using a massive overcomplete dictionary. Using it, we perform super-resolution of signals containing piano music. Using a dictionary composed of sounds from the University of Iowa data set, we were able to accurately estimate missing high-frequency information using low-resolution inputs sampled as low as 2-4 kHz. For more details, please see the paper above.</p>
<p>For more about Approximate Matching Pursuit, please see our upcoming paper in ISMIR 2011.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/592/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>My Academic Genealogy Since Helmholtz</title>
		<link>http://stevetjoa.com/547</link>
		<comments>http://stevetjoa.com/547#comments</comments>
		<pubDate>Tue, 23 Aug 2011 04:58:33 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=547</guid>
		<description><![CDATA[The Mathematics Genealogy Project is an online service that lets you explore the academic lineage of mathematicians and those in related areas such as engineering. Using it, I discovered that I descend from Hermann von Helmholtz, famous for his work in acoustics (and a whole lot of other stuff). It is interesting to see how [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://genealogy.math.ndsu.nodak.edu/index.php" target="_blank">Mathematics Genealogy Project</a> is an online service that lets you explore the academic lineage of mathematicians and those in related areas such as engineering. Using it, I discovered that I descend from <a href="http://en.wikipedia.org/wiki/Hermann_von_Helmholtz" target="_blank">Hermann von Helmholtz</a>, famous for his work in acoustics (and a whole lot of other stuff). <span id="more-547"></span></p>
<p>It is interesting to see how thesis topics evolve from advisor to advisee, yet my research area &#8212; music information retrieval &#8212; is not so removed from Helmholtz&#8217;s work on acoustics; in fact, MIR scholars still cite and recognize his contributions!</p>
<p>Here are the people that make up my lineage since Helmholtz, each person&#8217;s dissertation title, place of graduate study, and graduation year.</p>
<ol>
<li>Hermann von Helmholtz, <em>De fabrica systematis nervosi evertebratorum</em>, Universität Berlin, 1842.</li>
<li>Edward Leamington Nichols, <em>Ueber das von glühendem Platin ausgestrahlte Licht: Ein Beitrag zur allgemeinen Ausstrahlungslehr</em>, Georg-August-Universität Göttingen, 1879.</li>
<li>Ernest Fox Nichols, <em>Radiometric Researches in the Remote Infra-Red Spectrum</em>, Cornell, 1897.</li>
<li>Frederic Columbus Blake, <em>The Reflection and Transmission of Electric Waves by Screens of Resonators and by Grids</em>, Columbia, 1906.</li>
<li>William Littell Everitt, <em>The Calculation and Design of Alternating Current Networks Employing Triodes Operating During a Portion of a Cycle</em>, Ohio State, 1933.</li>
<li>Karl Ralph Spangenberg, <em>The Effect of Grid Current Flow Upon the Dynamic Characteristics of Vacuum Tube Power</em>, Ohio State, 1937.</li>
<li>Willis Harman, <em>Tunable Waveguide Cavity Resonators for Broadband Operation of Reflex Klystrons</em>, Stanford, 1948.</li>
<li>John Bowman Thomas, <em>On the Statistical Design of Demodulation Systems for Signals in Additive Noise</em>, Stanford, 1955.</li>
<li>Kung Yao, <em>On Some Representations and Sampling Expansions for Band Limited Signals</em>, Princeton, 1965.</li>
<li>K. J. Ray Liu, <i>Efficient and Reliable Parallel Processing Algorithms and Architectures for Modern Signal Processing</i>, UCLA, 1990.</li>
<li>Steve Tjoa, <i>Sparse and Nonnegative Factorizations for Music Understanding</i>, Maryland, 2011.</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/547/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms</title>
		<link>http://stevetjoa.com/334</link>
		<comments>http://stevetjoa.com/334#comments</comments>
		<pubDate>Sat, 31 Jul 2010 23:49:31 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[instrument-rec]]></category>
		<category><![CDATA[ismir]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=334</guid>
		<description><![CDATA[Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms Steven K. Tjoa and K. J. Ray Liu Int. Soc. Music Information Retrieval Conf., August 2010 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2010ismir, title = "Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms", author = "Steven K. Tjoa and K. J. Ray Liu", [...]]]></description>
			<content:encoded><![CDATA[<h3>Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms</h3>
<ul>
<li>Steven K. Tjoa and K. J. Ray Liu</li>
<li>Int. Soc. Music Information Retrieval Conf., August 2010</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2010ismir.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa20100812ismir.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2010ismir,
  title = "Musical Instrument Recognition Using Biologically Inspired Filtering of Temporal Dictionary Atoms",
  author = "Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. Int. Soc. Music Information Retrieval Conf.",
  address = "Utrecht, Netherlands",
  year = "2010",
  month = aug,
  pages = "435--440"
};</pre>
</li>
</ul>
<p>Most musical instrument recognition systems rely upon spectral information to classify sounds. Can <em>temporal</em> information improve classification accuracy even further? <span id="more-334"></span></p>
<p>Evidence from the psychoacoustic literature suggests that both spectral and temporal content carry information about acoustic timbre through the human auditory system. In other words, because humans recognize musical instruments so effortlessly, then perhaps machines would also benefit from a combination of spectral and temporal information. Researchers use a model known as the cortical representation to emulate the information output from the middle stage of the human auditory system. Although some engineers have tried to build music information retrieval (MIR) systems that use the cortical representation, its high dimensionality makes it difficult to employ in practical systems.</p>
<p>But what if we can embody the traits of the cortical representation into another representation that also includes spectral and temporal content? One such candidate is nonnegative matrix factorization (NMF), a tool that can extract spectral and temporal information from spectrograms.</p>
<p>In this paper, we test the usefulness of temporal information extracted using NMF in instrument recognition. We mimic the multiresolution aspect of the cortical representation by using a multiresolution gamma filterbank that parameterizes the shape of temporal envelopes. Our results show that this method of temporal processing can classify among isolated sounds among 24 instrument classes with 92.3% accuracy.<br />
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/334/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints</title>
		<link>http://stevetjoa.com/269</link>
		<comments>http://stevetjoa.com/269#comments</comments>
		<pubDate>Mon, 22 Mar 2010 07:50:51 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[nmf]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[source-separation]]></category>
		<category><![CDATA[transcription]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=269</guid>
		<description><![CDATA[Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints Steven K. Tjoa and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2010icassp_cooccurrence, title = "Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints", author = "Steven K. Tjoa and K. J. Ray Liu", [...]]]></description>
			<content:encoded><![CDATA[<h3>Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints</h3>
<ul>
<li>Steven K. Tjoa and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2010icassp_cooccurrence.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2010icassp_poster1.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2010icassp_cooccurrence,
  title = "Multiplicative Update Rules for Nonnegative Matrix Factorization with Co-occurrence Constraints",
  author = "Steven K. Tjoa and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  address = "Dallas, TX",
  year = "2010",
  month = mar,
  pages = "449--452"
};</pre>
</li>
</ul>
<p>Nonnegative matrix factorization (NMF) has become a popular tool for discovering structure in a variety of signals. When applied to a musical audio signal, NMF builds a set of <em>dictionary atoms</em> that represent the individual musical sources in the signal. To perform music transcription, we map the learned dictionary atoms to musical notes and beats.<span id="more-269"></span></p>
<p>The basic formulation of NMF has the notable disadvantage that sources may require more than a single dictionary atom in order to be approximated accurately. For example, one note played by a violin may require multiple dictionary atoms to be accurately represented due to the vibrato induced by the performer. In the presence of many other musical sources, the correspondence between atoms becomes unclear.</p>
<p>In this paper, we introduce <em>three new update rules</em> to enforce dependence among dictionary atoms by incorporating <em>co-occurrence constraints</em> into NMF.  These co-occurrence constraints have shown to be useful for describing sources with multiple, co-occurring dictionary atoms by grouping similar atoms into sets. The proposed rules are conceptually simple, easy to implement, and effective for describing sources using multiple dictionary atoms.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/269/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Harmonic Variable-Size Dictionary Learning for Music Source Separation</title>
		<link>http://stevetjoa.com/278</link>
		<comments>http://stevetjoa.com/278#comments</comments>
		<pubDate>Mon, 22 Mar 2010 07:20:19 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dictionary-learning]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[k-svd]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[mc-stamm]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[nmf]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[source-separation]]></category>
		<category><![CDATA[transcription]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=278</guid>
		<description><![CDATA[Harmonic Variable-Size Dictionary Learning for Music Source Separation Steven K. Tjoa, Matthew C. Stamm, W. Sabrina Lin, and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010 Download: Paper, Poster, BibTeX @INPROCEEDINGS{tjoa2010icassp_harmonic, title = "Harmonic Variable-Size Dictionary Learning for Music Source Separation", author = "Steven K. Tjoa and Matthew C. [...]]]></description>
			<content:encoded><![CDATA[<h3>Harmonic Variable-Size Dictionary Learning for Music Source Separation</h3>
<ul>
<li>Steven K. Tjoa, Matthew C. Stamm, W. Sabrina Lin, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2010icassp_harmonic.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2010icassp_poster2.pdf'>Poster</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2010icassp_harmonic,
  title = "Harmonic Variable-Size Dictionary Learning for Music Source Separation",
  author = "Steven K. Tjoa and Matthew C. Stamm and W. Sabrina Lin and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  address = "Dallas, TX",
  year = "2010",
  month = mar,
  pages = "413--416"
};</pre>
</li>
</ul>
<p>Methods that employ dictionary learning and sparse coding have become popular for discovering structure in acoustic signals. Unfortunately, these methods also share a common limitation. When there is significant spectral-temporal overlap among the dictionary atoms present in a signal, it becomes difficult for these methods to learn atoms properly. <span id="more-278"></span>Often, information from multiple atoms is represented as a single atom by the learning procedure. If an atom in the output dictionary contains musical information from multiple sources, transcription and source separation cannot be accurately performed.</p>
<p>In this paper, we propose a novel dictionary learning method that performs well despite the presence of spectral-temporal overlap among dictionary atoms. Our method imposes a <em>harmonic constraint</em> that restricts each atom to represent at most one pitch. Furthermore, our method is <em>flexible</em> by allowing the size of the dictionary to grow based upon the complexity of the input signal. Our method consistently achieves higher recall and precision than other well-known dictionary learning algorithms.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/278/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Anti-Forensics of JPEG Compression</title>
		<link>http://stevetjoa.com/294</link>
		<comments>http://stevetjoa.com/294#comments</comments>
		<pubDate>Mon, 22 Mar 2010 06:05:30 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[mc-stamm]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=294</guid>
		<description><![CDATA[Anti-Forensics of JPEG Compression Matthew C. Stamm, Steven K. Tjoa, W. Sabrina Lin, and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010 Download: Paper, BibTeX @INPROCEEDINGS{stamm2010icassp_antiforensics, title = "Anti-Forensics of {JPEG} Compression", author = "Matthew C. Stamm and Steven K. Tjoa and W. Sabrina Lin and K. J. Ray [...]]]></description>
			<content:encoded><![CDATA[<h3>Anti-Forensics of JPEG Compression</h3>
<ul>
<li>Matthew C. Stamm, Steven K. Tjoa, W. Sabrina Lin, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, March 2010</li>
<li>Download: <a href="http://up.stevetjoa.com/stamm2010icassp_antiforensics.pdf">Paper</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{stamm2010icassp_antiforensics,
  title = "Anti-Forensics of {JPEG} Compression",
  author = "Matthew C. Stamm and Steven K. Tjoa and W. Sabrina Lin and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  address = "Dallas, TX",
  year = "2010",
  month = mar,
  pages = "1694--1697"
};</pre>
</li>
</ul>
<p>Digital image forensics is a relatively new field that has only become popular within the last seven years. A newer body of research involves <em>anti-forensics</em> &#8212; methods of processing images that disguise evidence of earlier image processing or tampering. By altering the forensically significant properties in an image, existing image forensic systems may be rendered useless. <span id="more-294"></span>Like many areas of security, we benefit from the study of anti-forensics by understanding the attacks that are most likely to occur before they actually do. Then, we can refine our image forensic systems further.</p>
<p>Earlier, our group proposed the <a href="http://stevetjoa.com/110">first forensic system</a> that identifies which compression method was used upon a digital image. In this paper, we propose an anti-forensic method for <em>defeating the identification of JPEG compression</em> in an image. Our method alters the intrinsic fingerprint of JPEG-compressed images in the transform domain such that the attacked image is perceptually similar to the image before attack. To our knowledge, this work is the <em>first</em> to propose a method specifically designed to defeat compression identification systems, and we anticipate that more work will follow.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/294/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Digital Image Source Coder Forensics via Intrinsic Fingerprints</title>
		<link>http://stevetjoa.com/110</link>
		<comments>http://stevetjoa.com/110#comments</comments>
		<pubDate>Mon, 01 Feb 2010 03:04:52 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[hv-zhao]]></category>
		<category><![CDATA[ieee-tifs]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=110</guid>
		<description><![CDATA[Digital Image Source Coder Forensics via Intrinsic Fingerprints W. Sabrina Lin, Steven K. Tjoa, H. Vicky Zhao, and K. J. Ray Liu IEEE Trans. Information Forensics and Security, September 2009 Download: Paper, BibTeX @ARTICLE{lin2009tifs, title = "Digital Image Source Coder Forensics via Intrinsic Fingerprints", author = "W. Sabrina Lin and Steven K. Tjoa and H. [...]]]></description>
			<content:encoded><![CDATA[<h3>Digital Image Source Coder Forensics via Intrinsic Fingerprints</h3>
<ul>
<li>W. Sabrina Lin, Steven K. Tjoa, H. Vicky Zhao, and K. J. Ray Liu</li>
<li>IEEE Trans. Information Forensics and Security, September 2009</li>
<li>Download: <a href='http://up.stevetjoa.com/lin2009tifs.pdf'>Paper</a>, <a class="bibtex">BibTeX</a>
<pre>@ARTICLE{lin2009tifs,
  title = "Digital Image Source Coder Forensics via Intrinsic Fingerprints",
  author = "W. Sabrina Lin and Steven K. Tjoa and H. Vicky Zhao and K. J. Ray Liu",
  journal = "IEEE Trans. Information Forensics and Security",
  year = "2009",
  month = sep,
  volume = "4",
  number = "3",
  pages = "460--475"
};</pre>
</li>
</ul>
<p>Multimedia forensic methods allow us to verify and maintain the integrity of our multimedia data. For example, we can embed a watermark into a digital image to bind the identity of its owner to the image itself. However, traditional forensic approaches such as watermarking are not applicable in many real-world scenarios, for example, when we do not have access to the original data. <span id="more-110"></span></p>
<p>In this paper, we investigate the use of <em>intrinsic fingerprints</em> &#8212; subsets of data which are, or have become, an intrinsic part of the data in question &#8212;  for formulating a forensic methodology that can identify the compression history of a digital image. By examining the intrinsic fingerprints in an image, we can tell what compression method (e.g., JPEG, etc.) was used in order to determine the origin of the image and thereby assess its authenticity.</p>
<p>Although there exist methods that estimate parameters of individual compression schemes, to our knowledge, our work is the <em>first</em> to address the problem of image compression <em>identification</em> and offer a unified forensic framework incorporating multiple encoders.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/110/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Transform Coder Classification for Digital Image Forensics</title>
		<link>http://stevetjoa.com/123</link>
		<comments>http://stevetjoa.com/123#comments</comments>
		<pubDate>Sun, 31 Jan 2010 21:39:26 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[ieee-icip]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=123</guid>
		<description><![CDATA[Transform Coder Classification for Digital Image Forensics Steven Tjoa, W. Sabrina Lin, and K. J. Ray Liu IEEE Int. Conf. Image Processing, September 2007 Download: Paper, Presentation, BibTeX @INPROCEEDINGS{tjoa2007icip, title = "Transform Coder Classification for Digital Image Forensics", author = "Steven Tjoa and W. Sabrina Lin and K. J. Ray Liu", booktitle = "Proc. IEEE [...]]]></description>
			<content:encoded><![CDATA[<h3>Transform Coder Classification for Digital Image Forensics</h3>
<ul>
<li>Steven Tjoa, W. Sabrina Lin, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Image Processing, September 2007</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2007icip.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2007icip_presentation.pdf'>Presentation</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2007icip,
  title = "Transform Coder Classification for Digital Image Forensics",
  author = "Steven Tjoa and W. Sabrina Lin and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. on Image Processing",
  address = "San Antonio, TX",
  year = "2007",
  month = sep,
  volume = "6",
  pages = {VI-105--VI-108},
};</pre>
</li>
</ul>
<p>Our <a href="http://stevetjoa.com/110">TIFS paper</a> was the first work to offer a unified forensic framework that can identify the compression method used upon an image. Among these compression methods are <em>transform coders</em> &#8212; compression methods that apply some form of entropy coding in the transform domain (e.g., via discrete cosine transform, wavelet transform, etc.). <span id="more-123"></span> Transform coding is undoubtedly the most popular category of image coders in use today. Therefore, the forensic problem of transform coder classification deserves attention.</p>
<p>In this paper, we propose a method that can identify <em>which transform</em> was used upon an image during the compression process. By analyzing the transform coefficient histogram obtained after applying a candidate transform, we can determine if the image was compressed using the transform tested. Results show that forensic classification accuracy is high when testing from among six well-known transforms. </p>
<p>To our knowledge, this work is <em>the first</em> to address the problem of transform coder classification and propose a working solution.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/123/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Image Source Coding Forensics Via Intrinsic Fingerprints</title>
		<link>http://stevetjoa.com/131</link>
		<comments>http://stevetjoa.com/131#comments</comments>
		<pubDate>Sun, 31 Jan 2010 10:52:33 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[hv-zhao]]></category>
		<category><![CDATA[ieee-icme]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=131</guid>
		<description><![CDATA[Image Source Coding Forensics Via Intrinsic Fingerprints W. Sabrina Lin, Steven Tjoa, H. Vicky Zhao, and K. J. Ray Liu IEEE Int. Conf. Multimedia and Expo, July 2007 Download: Paper, BibTeX @INPROCEEDINGS{lin2007icme, title = "Image Source Coding Forensics via Intrinsic Fingerprints", author = "W. Sabrina Lin and Steven Tjoa and H. Vicky Zhao and K. [...]]]></description>
			<content:encoded><![CDATA[<h3>Image Source Coding Forensics Via Intrinsic Fingerprints</h3>
<ul>
<li>W. Sabrina Lin, Steven Tjoa, H. Vicky Zhao, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Multimedia and Expo, July 2007</li>
<li>Download: <a href='http://up.stevetjoa.com/lin2007icme.pdf'>Paper</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{lin2007icme,
  title = "Image Source Coding Forensics via Intrinsic Fingerprints",
  author = "W. Sabrina Lin and Steven Tjoa and H. Vicky Zhao and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Multimedia and Expo",
  address = "Beijing, China",
  year = "2007",
  month = jul,
  pages = "1127--1130",
};</pre>
</li>
</ul>
<p>The most popular image compression schemes in use today are lossy, i.e., compression imposes some irreversible distortion in the image in order to achieve a smaller file size. Because each compression method imposes different kinds of distortion, the distortion can act as a <em>fingerprint</em> of the compression method. <span id="more-131"></span> Existing image compression methods can be grouped into a few categories &#8212; e.g., transform coding, vector quantization, subband coding, linear predictive coding, embedded coding &#8212; where all methods in the same category leave behind the same type of fingerprint in an image during compression.</p>
<p>In this paper, we analyze the <em>intrinsic fingerprints</em> of different types of image compression methods and propose a forensic system that identifies the type of the compression and provides a confidence measure in the system&#8217;s decision. Results show that the system can achieve a probability of detection of 0.82 for an image PSNR of 40 dB and even higher accuracy for lower PSNRs.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/131/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Summary: Block Size Forensic Analysis in Digital Images</title>
		<link>http://stevetjoa.com/136</link>
		<comments>http://stevetjoa.com/136#comments</comments>
		<pubDate>Sun, 31 Jan 2010 06:09:41 +0000</pubDate>
		<dc:creator>Steve</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[forensics]]></category>
		<category><![CDATA[hv-zhao]]></category>
		<category><![CDATA[ieee-icassp]]></category>
		<category><![CDATA[image]]></category>
		<category><![CDATA[kjr-liu]]></category>
		<category><![CDATA[s-tjoa]]></category>
		<category><![CDATA[ws-lin]]></category>

		<guid isPermaLink="false">http://stevetjoa.com/?p=136</guid>
		<description><![CDATA[Block Size Forensic Analysis in Digital Images Steven Tjoa, W. Sabrina Lin, H. Vicky Zhao, and K. J. Ray Liu IEEE Int. Conf. Acoustics, Speech, and Signal Processing, April 2007 Download: Paper, Presentation, BibTeX @INPROCEEDINGS{tjoa2007icassp, title = "Block Size Forensic Analysis in Digital Images", author = "Steven Tjoa and W. Sabrina Lin and H. Vicky [...]]]></description>
			<content:encoded><![CDATA[<h3>Block Size Forensic Analysis in Digital Images</h3>
<ul>
<li>Steven Tjoa, W. Sabrina Lin, H. Vicky Zhao, and K. J. Ray Liu</li>
<li>IEEE Int. Conf. Acoustics, Speech, and Signal Processing, April 2007</li>
<li>Download: <a href='http://up.stevetjoa.com/tjoa2007icassp.pdf'>Paper</a>, <a href='http://up.stevetjoa.com/tjoa2007icassp_presentation.pdf'>Presentation</a>, <a class="bibtex">BibTeX</a>
<pre>@INPROCEEDINGS{tjoa2007icassp,
  title = "Block Size Forensic Analysis in Digital Images",
  author = "Steven Tjoa and W. Sabrina Lin and H. Vicky Zhao and K. J. Ray Liu",
  booktitle = "Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing",
  year = "2007",
  address = "Honolulu, HI",
  month = apr,
  volume = "1",
  pages = {I-633--I-636}
};</pre>
</li>
</ul>
<p>Our work on image compression forensics attempts to identify the compression method used upon an image by analyzing its intrinsic fingerprint. Many popular image compression methods such as JPEG employ block processing. Therefore, to even begin forensic analysis for digital images, we must first address the presence of block processing on our image data. <span id="more-136"></span>For forensic analysis of block-based coding schemes, estimating the block size is an obvious and crucial first step, because inaccurate block size estimation can possibly invalidate subsequent forensic tests.</p>
<p>Block artifact measurement is a well-established research area, with purposes primarily related to image restoration and distortion measurement. However, artifact measurement for the purpose of <em>forensic analysis</em> has not been explored. Here, we pose the following question: given a compressed image, can we detect the presence of block processing? If so, can we estimate the block size? Existing work in block artifact measurement is not tailored to answer this question due to strong assumptions placed upon the input data.</p>
<p>In this paper, we propose a novel scheme to detect the presence of block processing along with estimation of the block size. We first obtain a block artifact signature and then estimate the block size through maximum-likelihood estimation. Finally, we propose a binary hypothesis test that identifies the presence or absence of block processing with high detection rates and low false alarm rates.</p>
]]></content:encoded>
			<wfw:commentRss>http://stevetjoa.com/136/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

