Noise robust spontaneous speech recognition using multi-space GMM

Byung Ok Kang, Ho Young Jung, Oh Wook Kwon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In this paper, we propose a new approach using a multi-space Gaussian mixture model (GMM) for a large-scale spontaneous speech recognition system that is robust to the acoustic environmental noise. Current speech recognition systems based on a hidden Markov model (HMM) perform well in matched conditions, but their performance is degraded by mismatch conditions, such as mobile environments with diverse additive noise. In the case of mobile voice search services, the real noise environment is reflected in rich speech log data, and using speech logs, performance improvement is achieved in the growing matched condition. However, because most of this speech data is short with a limited pattern, when it is used for large-scale spontaneous speech recognition tasks like voice SMS, the performance improvement is limited and degradation is even observed in a quiet environment. Therefore, this paper proposes a new approach which, using rich voice search speech data, constructs a multi- Acoustic space GMM with distributions of speech corrupted by diverse environment noise and reflects these statistics in an acoustic model for a speech recognition system with a distinct domain like dictation speech. The evaluation results obtained from the voice SMS task show that the proposed method provides meaningful improvements over conventional adaptive training methods to handle multi-style training data. Copyright

Original languageEnglish
Title of host publication42nd International Congress and Exposition on Noise Control Engineering 2013, INTER-NOISE 2013
Subtitle of host publicationNoise Control for Quality of Life
PublisherOAL-Osterreichischer Arbeitsring fur Larmbekampfung
Pages3682-3685
Number of pages4
ISBN (Print)9781632662675
StatePublished - 2013
Event42nd International Congress and Exposition on Noise Control Engineering 2013: Noise Control for Quality of Life, INTER-NOISE 2013 - Innsbruck, Austria
Duration: 15 Sep 201318 Sep 2013

Publication series

Name42nd International Congress and Exposition on Noise Control Engineering 2013, INTER-NOISE 2013: Noise Control for Quality of Life
Volume5

Conference

Conference42nd International Congress and Exposition on Noise Control Engineering 2013: Noise Control for Quality of Life, INTER-NOISE 2013
Country/TerritoryAustria
CityInnsbruck
Period15/09/1318/09/13

Keywords

  • Acoustic model
  • Multi-space GMM
  • Noise robustness
  • Speech recognition

Fingerprint

Dive into the research topics of 'Noise robust spontaneous speech recognition using multi-space GMM'. Together they form a unique fingerprint.

Cite this