为研究基于长短期记忆(Long Short-Term Memory,LSTM)网络的语音转文字系统的优化方法,首先说明LSTM在语音转文字任务中的基本原理和架构,其次分析自适应矩估计(Adaptive Moment Estimation,Adam)优化算法的核心机制及其在LSTM网络中的应用,最后在Mozilla DeepSpeech框架中嵌入基于Adam优化的LSTM模型,并使用THCHS-30数据集进行实验.实验结果表明,基于Adam优化的LSTM模型在词错率和F1分数上均表现出显著的优越性.
Optimization Method of LSTM Network in Speech-to-Text Application
In order to study the optimization method of speech-to-text system based on Long Short-Term Memory (LSTM) network,the basic principle and architecture of LSTM in speech-to-text task are first explained,and then the core mechanism of Adaptive Moment Estimation (Adam) optimization algorithm and its application in LSTM network are analyzed. Finally,the LSTM model based on Adam optimization is embedded in the Mozilla DeepSpeech framework,and the experiment is carried out using the THCHS-30 dataset. The experimental results show that the LSTM model based on Adam optimization has obvious advantages in terms of word error rate and F1 score.
Long Short-Term Memory (LSTM)Adaptive Moment Estimation (Adam)speech recognitiontraining optimization