Abstract
The goal of this paper is to learn or adapt statistical features of gender specific speech signals. The adaptation is performed by finding basis functions that encode the speech signal such that the resulting coefficients are statistically independent and the information redundancy is minimized. We use a flexible independent component analysis (ICA) algorithm to adapt the basis functions as well as the source coefficients for male and female speakers respectively. The learned features show significant differences in frequency and time span. Our results suggest that the male speech features can be described by Gabor-like wavelet filters whereas the female speech signal has a much longer time span. We present a detailed time-frequency analysis strongly suggesting that those features can be used to qualify and quantify gender-specific speech signal differences.
Original language | English |
---|---|
Pages (from-to) | 105-108 |
Number of pages | 4 |
Journal | Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing |
Volume | 1 |
DOIs | |
State | Published - 2001 |