Variable quantile level based noise suppression for robust speech recognition

Kangyeoul Lee, Gil Jin Jang, Jeong Sik Park, Ji Hwan Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper addresses the issues of single microphone based noise estimation technique for speech recognition in noisy environments. Many researches have been performed on the environmental noise estimation; however, most of them require voice activity detection (VAD) for accurate estimation of noise characteristics. We propose an approach for efficient noise estimation without VAD, aiming at improving the conventional quantile-based noise estimation (QBNE). We fostered the QBNE by adjusting the quantile level according to the relative amount of added noise to the target speech. From the observation that the power spectral density (PSD) of noise is close to the Gaussian distribution, while that of speech is more narrowly populated, the level of additive noise is measured by the selected Gaussianity functions. We compared the proposed method with the conventional QBNE and minimum statistics based method on a simple speech recognition task in various SNR levels. The experimental results show that the proposed method is superior to the conventional methods.

Original languageEnglish
Title of host publicationInformation Technology Convergence
Subtitle of host publicationSecurity, Robotics, Automations and Communication
PublisherSpringer Verlag
Pages1037-1045
Number of pages9
ISBN (Print)9789400769953
DOIs
StatePublished - 2013
Event5th FTRA International Conference on Information Technology Convergence and Services, ITCS 2013 and the 3rd International Conference on Intelligent Robotics, Automations, Telecommunication Facilities, and Applications, IRoA 2013 - Fukuoka, Japan
Duration: 8 Jul 201310 Jul 2013

Publication series

NameLecture Notes in Electrical Engineering
Volume253 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference5th FTRA International Conference on Information Technology Convergence and Services, ITCS 2013 and the 3rd International Conference on Intelligent Robotics, Automations, Telecommunication Facilities, and Applications, IRoA 2013
Country/TerritoryJapan
CityFukuoka
Period8/07/1310/07/13

Keywords

  • Gaussianity
  • Quantile-based noise estimation
  • Speech recognition

Fingerprint

Dive into the research topics of 'Variable quantile level based noise suppression for robust speech recognition'. Together they form a unique fingerprint.

Cite this