Review of deep-learning-based polyphonic sound event detection
Polyphonic sound event detection is an active research topic in speech processing. This paper reviews the deep-learning-based polyphonic sound event detection models proposed in recent years. First, four supervised learning models and 13 weakly supervised learning models are introduced. The weakly supervised learning models include mean-teacher-based models, attention-based models, source-separation-based models, self-training models, and other models. Then, the datasets and evaluation metrics used by each model are briefly introduced. Finally, future research directions in this field are discussed.
deep learning; polyphonic sound event detection; weakly supervised learning; semi-supervised learning