Abstract
We review the sparse representation principle for processing speech signals. A transformation for encoding the speech signals is learned such that the resulting coefficients are as independent as possible. We use independent component analysis with an exponential prior to learn a statistical representation for speech signals. This representation leads to extremely sparse priors that can be used for encoding speech signals for a variety of purposes. We review applications of this method for speech feature extraction, automatic speech recognition and speaker identification. Furthermore, this method is also suited for tackling the difficult problem of separating two sounds given only a single microphone.
Original language | English |
---|---|
Pages (from-to) | 311-320 |
Number of pages | 10 |
Journal | Proceedings of SPIE - The International Society for Optical Engineering |
Volume | 5207 |
Issue number | 1 |
DOIs | |
State | Published - 2003 |
Event | Wavelets: Applications in Signal and Image Processing X - San Diego, CA, United States Duration: 4 Aug 2003 → 8 Aug 2003 |
Keywords
- ICA
- Independent component analysis
- Sparse representation
- Speech signal coding