LSTM网络在语音转文字应用中的优化方法

扫码查看

原文链接

万方数据
维普

中文摘要：为研究基于长短期记忆(Long Short-Term Memory,LSTM)网络的语音转文字系统的优化方法,首先说明LSTM在语音转文字任务中的基本原理和架构,其次分析自适应矩估计(Adaptive Moment Estimation,Adam)优化算法的核心机制及其在LSTM网络中的应用,最后在Mozilla DeepSpeech框架中嵌入基于Adam优化的LSTM模型,并使用THCHS-30数据集进行实验.实验结果表明,基于Adam优化的LSTM模型在词错率和F1分数上均表现出显著的优越性.

外文标题：Optimization Method of LSTM Network in Speech-to-Text Application

外文摘要：In order to study the optimization method of speech-to-text system based on Long Short-Term Memory (LSTM) network,the basic principle and architecture of LSTM in speech-to-text task are first explained,and then the core mechanism of Adaptive Moment Estimation (Adam) optimization algorithm and its application in LSTM network are analyzed. Finally,the LSTM model based on Adam optimization is embedded in the Mozilla DeepSpeech framework,and the experiment is carried out using the THCHS-30 dataset. The experimental results show that the LSTM model based on Adam optimization has obvious advantages in terms of word error rate and F1 score.

外文关键词：

Long Short-Term Memory (LSTM)Adaptive Moment Estimation (Adam)speech recognitiontraining optimization

作者：

张乾

展开 >

作者单位：

江海职业技术学院,江苏扬州 225101

关键词：

长短期记忆(LSTM) 自适应矩估计(Adam) 语音识别训练优化

出版年：

2024

DOI：

10.16311/j.audioe.2024.09.027

电声技术

电视电声研究所(中国电子科技集团公司第三研究所)

电声技术

影响因子：0.259

ISSN：1002-8684

年,卷(期)：2024.48(9)

参考文献量9