Research on the Application of Deep Learning Technology in Automatic Audio Tagging
Automatic audio tagging aims to generate a passage of text that describes an audio input. Current audio tagging models perform poorly, and preloading models have rarely been applied to improve tagging quality. Because the task requires generating appropriate descriptive statements for audio segments, a model must be able to process both audio and text modal data. This work therefore studies preloading models for the audio and text modalities and proposes automatic tagging methods based on the audio modality and the text modality, addressing the inconsistency between training and testing objectives in traditional tagging methods.
Audio tagging; Automatic tagging; Deep learning; Preloading model