基于深度学习的多声音事件检测研究综述

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：多声音事件检测是当前语音处理的研究热点之一,本文对近年来基于深度学习的多声音事件检测模型进行综述.首先介绍了 4种监督学习模型和13种弱监督学习模型,弱监督学习模型包括基于平均教师的模型、基于注意力的模型、基于源分离的模型、基于自训练的模型以及其他模型,分析了各模型的特征、结构和性能;然后对各种模型使用的数据集及评价指标进行简要介绍;最后讨论了该领域未来的研究方向.

外文标题：Review of deep learning based polyphonic sound event detection

外文摘要：Polyphonic sound event detection is one of the research hotspots in speech processing.The polyphonic sound event detection models based on deep learning in recent years are reviewed.Firstly,four supervised learning models and 13 weakly supervised learning models are introduced.Weakly supervised learning models include mean-teacher-based model,attention-based model,source separation-based model,self-training model and other models.Then,the data sets and evaluation indexes used in each model are briefly introduced.Finally,the future research direction in this field is discussed.

外文关键词：

deep learningpolyphonic sound event detectionweakly supervised learningsemi-supervised learning

作者：

张珑、张恒远、魏育华、杨烁祯

展开 >

作者单位：

天津师范大学计算机与信息工程学院,天津 300387

广州华立科技职业学院计算机信息工程学院,广州 511325

关键词：

深度学习多声音事件检测弱监督学习半监督学习

出版年：

2024

DOI：

10.19638/j.issn1671-1114.20240601

天津师范大学学报(自然科学版)

天津师范大学

天津师范大学学报(自然科学版)

CSTPCD北大核心

影响因子：0.311

ISSN：1671-1114

年,卷(期)：2024.44(6)