Underdetermined blind source separation is a key application in audio where it is desirable to extract multiple sources from a stereo recording. A new variant on the stereo matching pursuit, the dual matching pursuit, is presented whereby independent matching pursuits are run on both channels of a stereo mixture of greater than two sources. By identifying correlating atoms from each decomposition, a histogram plot is applied to identify the position of each source in the stereo image and the atoms grouped to recover the original signals. To improve the atomic correlation between channels, a fixed overcomplete representation for each of the signal types present in a mixture is obtained by applying a learning algorithm to existing sources of that type and reducing the redundancy in the resulting basis set via a correlation-based algorithm. The resulting dictionaries are then used as a time-frequency basis for the independent matching pursuits. The results show improved separation quality compared to the dual matching pursuit with mathematical time-frequency dictionaries. The noise immunity of this method due to the use of overcomplete representations is also demonstrated showing that the system can withstand mixture signal-to-noise ratios down to 30 dB.
This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Bristol's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to email@example.com.
By choosing to view this document, you agree to all provisions of the copyright laws protecting it.
Name of Conference: International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Sugden IEEE ICASSP 2004
Author final version (often known as postprint) , 126 KB, PDF-document
Author final version (often known as postprint) , 4 KB, text/plain
Author final version (often known as postprint) , 15 KB, text/plain