Learning statistically efficient features for speaker recognition

G. J. Jang, T. W. Lee, Y. H. Oh

Research output: Contribution to journalConference articlepeer-review

20 Scopus citations

Abstract

We apply independent component analysis (ICA) for extracting an optimal basis to the problem of finding efficient features for a speaker. The basis functions learned by the algorithm are oriented and localized in both space and frequency, bearing a resemblance to Gabor functions. The speech segments are assumed to be generated by a linear combination of the basis functions, thus the distribution of speech segments of a speaker is modeled by a basis, which is calculated so that each component should be independent upon others on the given training data. The speaker distribution is modeled by the basis functions. To asses the efficiency of the basis functions, we performed speaker classification experiments and compared our results with the conventional Fourier-basis. Our results show that the proposed method is more efficient than the conventional Fourier-based features, in that they can obtain a higher classification rate.

Original languageEnglish
Pages (from-to)437-440
Number of pages4
JournalProceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Volume1
StatePublished - 2001
Event2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing - Salt Lake, UT, United States
Duration: 7 May 200111 May 2001

Fingerprint

Dive into the research topics of 'Learning statistically efficient features for speaker recognition'. Together they form a unique fingerprint.

Cite this