Independent vector analysis using non-spherical joint densities for the separation of speech signals

Gil Jin Jang, Intae Lee, Te Won Lee

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

We propose a new blind source separation approach that models the inherent signal dependencies such as those observed in speech signals in order to solve the problem of separating convolved sources. The frequency domain methods for the convolved mixture problem require a solution to the wellknown permutation problem. Our approach is based on assuming a vector representation of the source signal where its multidimensional joint densities are non-spherical. Spherical distributions may be adequate for signals that exhibit uniform dependencies across frequencies but in case of speech signals we can observe stronger dependencies for neighboring frequency bins and almost no dependency for frequency bins that are far apart. The non-spherical joint density model takes into account this phenomenon. For the separation of convolved sources, the proposed method demonstrates consistent performance over previous methods and improved performance over the spherical joint density representations.

Original languageEnglish
Title of host publication2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
PagesII629-II632
DOIs
StatePublished - 2007
Event2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 - Honolulu, HI, United States
Duration: 15 Apr 200720 Apr 2007

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2
ISSN (Print)1520-6149

Conference

Conference2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
Country/TerritoryUnited States
CityHonolulu, HI
Period15/04/0720/04/07

Keywords

  • Speech processing

Fingerprint

Dive into the research topics of 'Independent vector analysis using non-spherical joint densities for the separation of speech signals'. Together they form a unique fingerprint.

Cite this