基于改进深度学习算法的乐器音色识别
Musical Instrument Timbre Recognition Based on Improved Deep Learning Algorithm
陈曙光 1栗超2
作者信息
- 1. 崇左幼儿师范高等专科学校 学前教育系,广西 崇左 532200
- 2. 广西科技大学 电子工程学院,广西 柳州 545005
- 折叠
摘要
为保持乐器音色时间序列基础上实现乐器音色的准确识别,提出基于改进深度学习算法的乐器音色识别新方法.该方法首先采用基于梅尔滤波器组能量对数和梅尔频率倒谱系数的一维卷积神经网络提取乐器音色特征;其次,将乐器音色特征输入基于长短期记忆和深度神经网络的乐器音色分类器进行乐器音色识别;最后,对来自5 个乐器声音数据库的乐器音色进行了仿真测试.对于5 个数据库中乐器音色的识别测试结果表明,该乐器音色识别比基于卷积神经网络的乐器音色识别方法、基于卷积神经网络与深度置信网络的乐器音色识别方法分别提高了2.49%和2.02%.
Abstract
To achieve accurate identification of musical instrument timbres while preserving the temporal char-acteristics of their sound,a new approach based on an improved deep learning algorithm is proposed.This method commences by utilizing a one-dimensional convolutional neural network to extract features of musical instrument timbres,which leverages the combination of Mel-scaled filter bank log energies and Mel-Fre-quency Cepstral Coefficients.Then input the musical instrument timbre features into the musical instrument timbre classifier based on long short-term memory and deep neural network for musical instrument timbre rec-ognition.The results of the recognition tests conducted on the timbres of instruments from these five databases reveal that the proposed approach outperforms both traditional convolutional neural network-based timbre rec-ognition methods and hybrid convolutional neural network deep belief network approaches.Specifically,it a-chieves an improvement of 2.49%over convolutional neural network methods and 2.02%over hybrid convo-lutional neural network deep belief network methods,demonstrating its effectiveness in accurately identifying musical instrument timbres while preserving the inherent temporal dynamics of their sounds.
关键词
乐器音色识别/深度学习算法/长短期记忆/一维卷积神经网络Key words
musical instrument timbre recognition/deep learning algorithm/long short-term memory/one-dimensional convolutional neural network引用本文复制引用
基金项目
广西壮族自治区教育厅自然科学研究项目(20GX2374307)
出版年
2024