Extraction of novel features for emotion recognition
The Hilbert-Huang transform has been widely used since its inception because of its advantages in a variety of areas. The Hilbert spectrum it produces accurately reflects the distribution of signal energy across a number of scales. In this paper, a novel feature called ECC is proposed, extracted from the Hilbert energy spectrum, which describes the distribution of the instantaneous energy. The experimental results demonstrate that ECC outperforms the traditional short-term average energy. Combining ECC with mel frequency cepstral coefficients (MFCC) describes the distribution of energy in both the time domain and the frequency domain, and this feature group achieves better recognition than the combination of short-term average energy, pitch and MFCC. Two further improvements of ECC are then developed: TECC is obtained by combining ECC with the Teager energy operator, and EFCC is obtained by introducing the instantaneous frequency into the energy. In the experiments, seven emotional states are selected for recognition; the highest recognition rate achieved is 83.57%, with the classification accuracy for boredom reaching 100%. The numerical results indicate that the proposed features ECC, TECC and EFCC can substantially improve the performance of speech emotion recognition.
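The abstract does not define ECC, TECC or EFCC, so they are not reproduced here. As a rough illustration of two standard quantities it refers to, the sketch below computes the short-term average energy (the baseline feature) and the discrete Teager energy operator in its common textbook form, psi[x(n)] = x(n)^2 - x(n-1)x(n+1). The function names, frame length and hop size are assumptions chosen for illustration, not values taken from the paper.

```python
import numpy as np


def short_term_average_energy(x, frame_len=256, hop=128):
    """Mean squared amplitude per frame (frame_len and hop are assumed values)."""
    x = np.asarray(x, dtype=float)
    # Number of complete frames that fit in the signal (0 if it is too short).
    n_frames = max(0, 1 + (len(x) - frame_len) // hop)
    return np.array([
        np.mean(x[i * hop:i * hop + frame_len] ** 2)
        for i in range(n_frames)
    ])


def teager_energy(x):
    """Discrete Teager energy operator: x(n)^2 - x(n-1) * x(n+1).

    The output is two samples shorter than the input because the first
    and last samples have no neighbour on one side.
    """
    x = np.asarray(x, dtype=float)
    return x[1:-1] ** 2 - x[:-2] * x[2:]


if __name__ == "__main__":
    # Hypothetical test tone: amplitude 0.5, digital frequency 0.2 rad/sample.
    n = np.arange(1024)
    tone = 0.5 * np.cos(0.2 * n)
    print(short_term_average_energy(tone)[:3])
    print(teager_energy(tone)[:3])
```

For a pure tone A cos(Ωn + φ), the Teager operator returns the constant A² sin²(Ω), so it tracks amplitude and frequency together, whereas the short-term average energy depends on amplitude only.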
emotion recognition; mel frequency cepstral coefficients (MFCC); feature extraction
LI Xiang, ZHENG Yu, LI Xin
School of Mechatronics Engineering and Automation, Shanghai University, Shanghai 200072, P.R.China
School of Computer Engineering and Science, Shanghai University, Shanghai 200072, P.R.China
State Key Laboratory of Robotics and System
Shanghai Leading Academic Discipline Project