Abstract
We propose a novel feature processing technique which can provide a cepstral liftering effect in the log-spectral domain. Cepstral liftering aims at the equalization of variance of cepstral coefficients for the distance-based speech recognizer, and as a result, provides the robustness for additive noise and speaker variability. However, in the popular hidden Markov model based framework, cepstral liftering has no effect in recognition performance. We derive a filtering method in log-spectral domain corresponding to the cepstral liftering. The proposed method performs a high-pass filtering based on the decorrelation of filter-bank energies. We show that in noisy speech recognition, the proposed method reduces the error rate by 52.7% to conventional feature.
Original language | English |
---|---|
Pages (from-to) | 273-276 |
Number of pages | 4 |
Journal | ETRI Journal |
Volume | 26 |
Issue number | 3 |
DOIs | |
State | Published - Jun 2004 |
Keywords
- Robust feature extraction
- Speech recognition