低资源语音识别研究进展

Research Progress of Low-Resource Speech Recognition

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：探讨低资源语音识别领域最新研究进展,旨在为未来研究和应用提供有益参考.首先,简要回顾了语音识别的发展过程,并介绍了当前主流端到端语音识别框架的基本原理.其次,针对低资源语音识别面临的问题,详细分析了在语音数据增强、自监督语音表征学习、多语言联合学习、结合大语言模型以及语言知识增强5 个方面的相关研究工作.最后,对低资源语音识别未来的研究方向进行了展望.

外文摘要：This paper explores the latest research advancements in low-resource speech recognition,aiming to provide valuable references for future research and applications.It first briefly reviews the development process of speech recognition and introduces the basic principles of the current mainstream end-to-end speech recogni-tion frameworks.Addressing the challenges faced in low-resource speech recognition,the paper provides a de-tailed analysis of related research in five areas:speech data augmentation,self-supervised speech representation learning,multilingual joint learning,integration of large language models,and enhancement of language knowl-edge.Finally,it outlines the future research directions of low-resource speech recognition.

外文关键词：

speech recognitionlow-resource languagesdata augmentationspeech representation learninglarge language modellanguage knowledge

作者：

余正涛、董凌、高盛祥

展开 >

作者单位：

昆明理工大学信息工程与自动化学院,云南昆明 650500

云南省人工智能重点实验室,云南昆明 650500

关键词：

语音识别低资源语言数据增强语音表征学习大语言模型语言知识

基金：

国家自然科学基金项目国家自然科学基金项目云南省基础研究重大项目云南省重点研发计划项目

项目编号：

62376111U23A20388202401BC070021202303AP140008

出版年：

2024

DOI：

10.16112/j.cnki.53-1223/n.2024.03.231

昆明理工大学学报(自然科学版)

昆明理工大学

昆明理工大学学报(自然科学版)

CSTPCD北大核心

影响因子：0.516

ISSN：1007-855X

年,卷(期)：2024.49(3)

参考文献量97