Abstract
This paper presents a statistical model-based noise suppression approach for voice recognition in a car environment. In order to alleviate the spectral whitening and signal distortion problem in the traditional decisiondirected Wiener filter, we combine a decision-directed method with an original spectrum reconstruction method and develop a new two-stage noise reduction filter estimation scheme. When a tradeoff between the performance and computational efficiency under resource-constrained automotive devices is considered, ETSI standard advance distributed speech recognition font-end (ETSI-AFE) can be an effective solution, and ETSI-AFE is also based on the decision-directed Wiener filter. Thus, a series of voice recognition and computational complexity tests are conducted by comparing the proposed approach with ETSI-AFE. The experimental results show that the proposed approach is superior to the conventional method in terms of speech recognition accuracy, while the computational cost and frame latency are significantly reduced.
Original language | English |
---|---|
Pages (from-to) | 801-809 |
Number of pages | 9 |
Journal | ETRI Journal |
Volume | 32 |
Issue number | 5 |
DOIs | |
State | Published - Oct 2010 |
Keywords
- Clean spectrum reconstruction
- ETSI standard aurora advanced front-end
- Gaussian mixture model
- Speech enhancement
- Speech recognition
- Two-stage mel-warped wiener filter