计算机工程与设计2024,Vol.45Issue(12) :3681-3687.DOI:10.16208/j.issn1000-7024.2024.12.021

基于双特征时频注意力的声音事件检测算法

Sound event detection algorithm based on dual feature time-frequency attention

郭梦溪 马建芬 降爱莲 王炜欣
计算机工程与设计2024,Vol.45Issue(12) :3681-3687.DOI:10.16208/j.issn1000-7024.2024.12.021

基于双特征时频注意力的声音事件检测算法

Sound event detection algorithm based on dual feature time-frequency attention

郭梦溪 1马建芬 1降爱莲 1王炜欣2
扫码查看

作者信息

  • 1. 太原理工大学计算机科学与技术学院,山西晋中 030600;太原理工大学新型传感器与智能控制教育部(山西省)重点实验室,山西太原 030000
  • 2. 太原理工大学电子信息与光学工程学院,山西晋中 030600;太原理工大学新型传感器与智能控制教育部(山西省)重点实验室,山西太原 030000
  • 折叠

摘要

针对现有声音事件检测方法中对不同时间和频带信息关注不够,且传统的单一特征无法表征时频重叠声音事件的空间相位信息问题,提出一种基于双特征输入的时频注意力算法,将对数梅尔谱图、相位变换的广义互相关作为输入,分别从时间和频率两个维度使用注意力机制捕捉更有效的时频特征.为提高算法的多分辨率处理能力,设计一种基于注意力的特征金字塔模型,学习多尺度特征,帮助模型识别不同声音事件.实验结果表明,所提算法能够有效提取关键特,进行多分辨率处理,提高了声音事件检测性能.

Abstract

Aiming at the fact that the existing sound event detection methods do not pay enough attention to different time and frequency band information,and the traditional single feature cannot characterize the spatial phase information of time-frequency overlapping sound events,a time-frequency attention algorithm based on dual feature input was proposed,in which the logarith-mic Mel spectrum and the generalized cross-correlation with phase transformation were used as input,and the attention mecha-nism was used to capture more effective time-frequency features from the two dimensions of time and frequency,respectively.To improve the multi-resolution processing ability of the algorithm,the feature pyramid model based on attention was designed to learn multi-scale features and to help the model identify different sound events.Experimental results show that the proposed algorithm can effectively extract key features and perform multi resolution processing,improving the performance of sound event detection.

关键词

声音事件检测/声学事件/时频注意力/特征金字塔/特征融合/深度学习/多分辨率

Key words

sound event detection/acoustic event/time-frequency attention/feature pyramid/feature fusion/deep learning/multi-resolution

引用本文复制引用

出版年

2024
计算机工程与设计
中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心
影响因子:0.617
ISSN:1000-7024
段落导航相关论文