Spectral energy based voice activity detection for real-time voice interface

Jeong Sik Park, Jung Seok Yoon, Yong Ho Seo, Gil Jin Jang

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Voice activity detection (VAD) is a main process of speech recognition tasks in which every voice region is detected to extract acoustic feature parameters from the region. This paper proposes an efficient VAD approach for applying to real-time voice interface systems. Even though diverse VAD approaches have been successfully applied for speech applications, they may operate inefficiently according to environmental conditions. In this study, we attempt to enhance the conventional VAD method based on signal energy within time and spectral domain. In addition, an efficient end-point detection method is also proposed. We successfully verified the efficiency of the proposed approach via a set of VAD experiments, comparing with the performance of some conventional VAD methods including zero crossing rate.

Original languageEnglish
Pages (from-to)4304-4312
Number of pages9
JournalJournal of Theoretical and Applied Information Technology
Volume95
Issue number17
StatePublished - 15 Sep 2017

Keywords

  • End-point Detection
  • Spectral Domain
  • Spectral Energy
  • Voice Activity Detection
  • Voice Interface

Fingerprint

Dive into the research topics of 'Spectral energy based voice activity detection for real-time voice interface'. Together they form a unique fingerprint.

Cite this