基于PANNs-CNN的环境声音分类算法研究及应用

Research and application of environmental sound classification algorithm based on PANNs-CNN

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：环境声音分类(ESC)技术主要涉及声音特征提取和分类器算法的选择.为了探索最佳的特征提取方法和分类器组合,文章对深度学习模型PANNs-CNN进行了研究和分析,对不同的特征提取方法进行了实验对比.实验结果表明,在与同类模型对比中,选用预训练且更深层的CNN模型可以提高ESC的预测性能;Log-Mel特征可以更好地保留声音信号高维度特征及特征相关性,有助于提升模型分类准确率.文章研究的基于Log-Mel特征提取方式和PANNs-CNN 14 的环境声音分类算法在ESC-50 数据集上的分类准确率最好,并且在实际应用中验证了该算法的有效性.

外文摘要：Environmental sound classification(ESC)technology mainly involves sound feature extraction and the selection of classifier algorithms.In order to explore the best feature extraction methods and classifier combinations,this article studies and analyzes the deep learning model PANNs-CNN,and compares different feature extraction methods through experiments.The experimental results show that compared with similar models,selecting pretrained and deeper CNN models can improve the predictive performance of ESC.Log-Mel features can better preserve high-dimensional features and feature correlations of sound signals,which helps improve the accuracy of model classification.The environmental sound classification algorithm based on Log-Mel feature extraction method and PANNs-CNN14 studied in the article has the best classification accuracy on the ESC-50 dataset,and its effectiveness has been verified in practical applications.

外文关键词：

ESCPANNsCNNLog-MelMel frequency cepstrum coefficient

作者：

关志广

展开 >

作者单位：

南宁职业技术大学,广西南宁 530008

关键词：

环境声音分类预训练音频神经网络卷积神经网络 Log-Mel Mel频率倒谱系数

基金：

广西壮族自治区教育科学规划课题专项(十四五)(2023)

项目编号：

2023ZJY1841

出版年：

2024

无线互联科技

江苏省科学技术情报研究所

无线互联科技

影响因子：0.263

ISSN：1672-6944

年,卷(期)：2024.21(16)