Abstract
Voice activity detection (VAD) is a main process of speech recognition tasks in which every voice region is detected to extract acoustic feature parameters from the region. This paper proposes an efficient VAD approach for applying to real-time voice interface systems. Even though diverse VAD approaches have been successfully applied for speech applications, they may operate inefficiently according to environmental conditions. In this study, we attempt to enhance the conventional VAD method based on signal energy within time and spectral domain. In addition, an efficient end-point detection method is also proposed. We successfully verified the efficiency of the proposed approach via a set of VAD experiments, comparing with the performance of some conventional VAD methods including zero crossing rate.
Original language | English |
---|---|
Pages (from-to) | 4304-4312 |
Number of pages | 9 |
Journal | Journal of Theoretical and Applied Information Technology |
Volume | 95 |
Issue number | 17 |
State | Published - 15 Sep 2017 |
Keywords
- End-point Detection
- Spectral Domain
- Spectral Energy
- Voice Activity Detection
- Voice Interface