|
Speaker Recognition from Excitation Source Perspective

Keywords: Excitation source, Glottal wave, Pitch, Residual, Speaker recognition.

Abstract: This paper surveys studies that use the speaker information present in the excitation source of speech for speaker recognition. It begins with an overview of the speaker recognition task, followed by a discussion of the kinds of speaker information present in speech, feature extraction methods, and the types of excitation source involved in speech production. Detailed descriptions of the approaches that exploit speaker information in the excitation source are then given. These include methods based on pitch contour, jitter, shimmer, glottal flow derivative, linear prediction (LP) residual, LP residual phase, LP residual cepstrum, harmonic structure of the LP residual spectrum, and time-frequency analysis of the LP residual. A comparative study of these methods then highlights their merits and demerits. The paper concludes by outlining a future direction for speaker recognition from the excitation source perspective.
|