Sparse Representation in Speech Signal Processing

Te Won Lee, Gil Jin Jang, Oh Wook Kwon

Research output: Contribution to journalConference articlepeer-review

Abstract

We review the sparse representation principle for processing speech signals. A transformation for encoding the speech signals is learned such that the resulting coefficients are as independent as possible. We use independent component analysis with an exponential prior to learn a statistical representation for speech signals. This representation leads to extremely sparse priors that can be used for encoding speech signals for a variety of purposes. We review applications of this method for speech feature extraction, automatic speech recognition and speaker identification. Furthermore, this method is also suited for tackling the difficult problem of separating two sounds given only a single microphone.

Original languageEnglish
Pages (from-to)311-320
Number of pages10
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume5207
Issue number1
DOIs
StatePublished - 2003
EventWavelets: Applications in Signal and Image Processing X - San Diego, CA, United States
Duration: 4 Aug 20038 Aug 2003

Keywords

  • ICA
  • Independent component analysis
  • Sparse representation
  • Speech signal coding

Fingerprint

Dive into the research topics of 'Sparse Representation in Speech Signal Processing'. Together they form a unique fingerprint.

Cite this