TY - GEN
T1 - Improvement of speaker recognition system by individual information weighting
AU - Kim, Se Hyun
AU - Jang, Gil Jin
AU - Oh, Yung Hwan
PY - 2000
Y1 - 2000
N2 - In speaker recognition, it is very important to use individual information extracted from speech waves. Most of the speaker recognition methods assume that each part of speech has equal amount of information to represent a speaker, although it differently contribute to speaker recognition. The aim of this paper is to suggest a new scoring method of the HMM, which applies different importance to all the basic portions of a sampled speech waveform. we first define the quantity of the importance of speech frames, propose how to measure it and apply to speaker recognition. The performance of the proposed method was compared to non-weighting HMM based speaker recognition system. In speaker verification experiments, the proposed method reduced equal error rates considerably as compared to a conventional method which treats all speech segments to have the same importance. In speaker identification experiments, the proposed method marked relatively 28% higher recognition rate than the baseline system, and was more robust in long-term variation. These results demonstrate that the proposed method is efficient in measuring speaker information and more appropriate for speaker recognition.
AB - In speaker recognition, it is very important to use individual information extracted from speech waves. Most of the speaker recognition methods assume that each part of speech has equal amount of information to represent a speaker, although it differently contribute to speaker recognition. The aim of this paper is to suggest a new scoring method of the HMM, which applies different importance to all the basic portions of a sampled speech waveform. we first define the quantity of the importance of speech frames, propose how to measure it and apply to speaker recognition. The performance of the proposed method was compared to non-weighting HMM based speaker recognition system. In speaker verification experiments, the proposed method reduced equal error rates considerably as compared to a conventional method which treats all speech segments to have the same importance. In speaker identification experiments, the proposed method marked relatively 28% higher recognition rate than the baseline system, and was more robust in long-term variation. These results demonstrate that the proposed method is efficient in measuring speaker information and more appropriate for speaker recognition.
UR - http://www.scopus.com/inward/record.url?scp=85009061089&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85009061089
T3 - 6th International Conference on Spoken Language Processing, ICSLP 2000
BT - 6th International Conference on Spoken Language Processing, ICSLP 2000
PB - International Speech Communication Association
T2 - 6th International Conference on Spoken Language Processing, ICSLP 2000
Y2 - 16 October 2000 through 20 October 2000
ER -