Monaural speech segregation based on pitch track correction using an ensemble kalman filter

Han Gyu Kim, Gil Jin Jang, Jeong Sik Park, Yung Hwan Oh

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

We propose a novel method of pitch track correction that uses an ensemble Kalman filter to improve the performance of monau- ral speech segregation. The proposed method considers all reliable pitch streaks for pitch track correction, whereas the conventional segregation approach relies on only the longest streak in a given speech stream. In addition, unreliable pitch streaks are corrected with an ensemble Kalman filter that uses autocorrelation functions as noisy observations for the hidden true pitch values. Our proposed approach provides more accu- rate pitch estimation, thus improving speech segregation perfor- mance for various types of noises, in particular, colored noise. In speech segregation experiments on mixtures of speech and various competing noises, the proposed method demonstrated superior performance to the conventional approach.

Original languageEnglish
Pages (from-to)813-816
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
StatePublished - 2013
Event14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, France
Duration: 25 Aug 201329 Aug 2013

Keywords

  • En- semble kalman filter
  • Pitch track correction
  • Speech segregation

Fingerprint

Dive into the research topics of 'Monaural speech segregation based on pitch track correction using an ensemble kalman filter'. Together they form a unique fingerprint.

Cite this