Abstract
This paper proposes a new data-driven method for high-pass approaches, which suppresses slow-varying noise components. Conventional high-pass approaches are based on the idea of decorrelating the feature vector sequence, and are trying for adaptability to various conditions. The proposed method is based on temporal local decorrelation using the information-maximization theory for each utterance. This is performed on an utterance-by-utterance basis, which provides an adaptive channel normalization filter for each condition. The performance of the proposed method is evaluated by isolated-word recognition experiments with channel distortion. Experimental results show that the proposed method yields outstanding improvement for channel-distorted speech recognition.
Original language | English |
---|---|
Pages (from-to) | 300-304 |
Number of pages | 5 |
Journal | ETRI Journal |
Volume | 29 |
Issue number | 3 |
DOIs | |
State | Published - Jun 2007 |
Keywords
- Adaptive channel normalization
- Blind decorrelation
- Information-maximization method
- RASTA-like filtering
- Robust speech recognition