首页|Application of formant instantaneous characteristics to speech recognition and speaker identification

Application of formant instantaneous characteristics to speech recognition and speaker identification

扫码查看
This paper proposes a new phase feature derived from the formant instantaneous characteristics for speech recognition (SR) and speaker identification (SI) systems.Using Hilbert transform (HT), the formant characteristics can be represented by instantaneous frequency (IF) and instantaneous bandwidth, namely formant instantaneous characteristics (FIC).In order to explore the importance of FIC both in SR and SI, this paper proposes different features from FIC used for SR and SI systems.When combing these new features with conventional parameters, higher identification rate can be achieved than that of using Mel-frequency cepstral coefficients (MFCC) parameters only.The experiment results show that the new features are effective characteristic parameters and can be treated as the compensation of conventional parameters for SR and SI.

instantaneous frequency (IF)Hilbert transform (HT)speech recognitionspeaker identificationMel-frequency cepstral coefficients (MFCC)

HOU Li-min、HU Xiao-ning、XIE Juan-min

展开 >

Key laboratory of Specialty Fiber Optics and Optical Access Networks, School of Communication and Information Engineering, Shanghai University, Shanghai 200072, P. R. China

国家自然科学基金Shanghai Leading Academic Discipline Project

60903186J50104

2011

上海大学学报(英文版)
上海大学

上海大学学报(英文版)

影响因子:0.196
ISSN:1007-6417
年,卷(期):2011.15(2)
  • 1