首页|Application of formant instantaneous characteristics to speech recognition and speaker identification
Application of formant instantaneous characteristics to speech recognition and speaker identification
扫码查看
点击上方二维码区域,可以放大扫码查看
原文链接
NETL
NSTL
万方数据
This paper proposes a new phase feature derived from the formant instantaneous characteristics for speech recognition (SR) and speaker identification (SI) systems.Using Hilbert transform (HT), the formant characteristics can be represented by instantaneous frequency (IF) and instantaneous bandwidth, namely formant instantaneous characteristics (FIC).In order to explore the importance of FIC both in SR and SI, this paper proposes different features from FIC used for SR and SI systems.When combing these new features with conventional parameters, higher identification rate can be achieved than that of using Mel-frequency cepstral coefficients (MFCC) parameters only.The experiment results show that the new features are effective characteristic parameters and can be treated as the compensation of conventional parameters for SR and SI.
instantaneous frequency (IF)Hilbert transform (HT)speech recognitionspeaker identificationMel-frequency cepstral coefficients (MFCC)
HOU Li-min、HU Xiao-ning、XIE Juan-min
展开 >
Key laboratory of Specialty Fiber Optics and Optical Access Networks, School of Communication and Information Engineering, Shanghai University, Shanghai 200072, P. R. China
国家自然科学基金Shanghai Leading Academic Discipline Project