CPL - Chalmers Publication Library

A pitch synchronous feature extraction method for speaker recognition

Samuel Kim ; Thomas Eriksson (Department of Signals and Systems, Communication Systems) ; Hong-Goo Kang ; Chungyong Lee
Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing; Montreal, Que; Canada; 17 May 2004 through 21 May 2004 (15206149). Vol. 1 (2004), p. I405-I408.
[Conference paper, peer-reviewed]

This paper presents a novel feature extraction method to improve the performance of speaker identification systems. The proposed feature has the same form as a conventional feature, mel-frequency cepstral coefficients (MFCC), but uses flexible segmentation to reduce the spectral mismatch between training and testing. Specifically, the length and shift of the analysis frame are determined pitch-synchronously; the resulting feature is called pitch synchronous MFCC (PSMFCC). To verify the performance of the new feature, we measure the cepstral distortion between training and testing and also perform closed-set speaker identification tests. In text-independent and text-dependent experiments, the proposed algorithm provides 44.3 % and 26.7 % relative improvement, respectively.
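The segmentation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how pitch marks are found or how many pitch periods a frame covers, so the function name `pitch_synchronous_frames`, the parameter `periods_per_frame`, and the toy pitch marks below are all assumptions made for illustration. The sketch only shows how frame length and shift can follow the local pitch instead of being fixed; MFCCs would then be computed on each frame in the usual way.

```python
def pitch_synchronous_frames(signal, pitch_marks, periods_per_frame=2):
    """Cut `signal` into frames whose boundaries fall on pitch marks.

    Assumption (not from the paper): each frame spans `periods_per_frame`
    consecutive pitch periods, and the next frame starts one pitch period
    later. Both frame length and shift therefore vary with the pitch.
    """
    frames = []
    for i in range(len(pitch_marks) - periods_per_frame):
        start = pitch_marks[i]                    # frame begins at a pitch mark
        end = pitch_marks[i + periods_per_frame]  # and ends a few periods later
        frames.append(signal[start:end])
    return frames


# Toy signal with hypothetical, unevenly spaced pitch marks (sample indices).
signal = list(range(40))
pitch_marks = [0, 8, 17, 27, 36]

frames = pitch_synchronous_frames(signal, pitch_marks)
lengths = [len(f) for f in frames]
shifts = [b - a for a, b in zip(pitch_marks, pitch_marks[1:])][:len(frames)]
print(lengths, shifts)
```

With a fixed-frame front end, `lengths` and `shifts` would be constants; here they track the spacing of the pitch marks, which is the property the abstract attributes to PSMFCC.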

This record was created 2006-09-12. Last modified 2016-06-10.
CPL Pubid: 15277


Departments (Chalmers)

Department of Signals and Systems, Communication Systems (1900-2017)


Information Technology

Chalmers infrastructure