Abstract
We propose a new bandpass filter (BPF)-based online channel normalization method to dynamically suppress channel distortion when the speech and channel noise components are unknown. In this method, an adaptive modulation frequency filter is used to perform channel normalization, whereas conventional modulation filtering methods apply the same filter form to each utterance. In this paper, we only normalize the two mel frequency cepstral coefficients (C0 and C1) with large dynamic ranges; the computational complexity is thus decreased, and channel normalization accuracy is improved. Additionally, to update the filter weights dynamically, we normalize the learning rates using the dimensional power of each frame. Our speech recognition experiments using the proposed BPF-based blind channel normalization method show that this approach effectively removes channel distortion and results in only a minor decline in accuracy when online channel normalization processing is used instead of batch processing.
Original language | English |
---|---|
Pages (from-to) | 1190-1196 |
Number of pages | 7 |
Journal | ETRI Journal |
Volume | 38 |
Issue number | 6 |
DOIs | |
State | Published - Dec 2016 |
Keywords
- Adaptive filter modeling
- Channel normalization
- Modulation frequency filtering
- Speech recognition