电子技术2024,Vol.53Issue(8) :400-401.

端到端语音识别模型的设计与实现

Design and Implementation of an End to End Speech Recognition Model

刘帅
电子技术2024,Vol.53Issue(8) :400-401.

端到端语音识别模型的设计与实现

Design and Implementation of an End to End Speech Recognition Model

刘帅1
扫码查看

作者信息

  • 1. 山东凌然智能科技有限公司,山东 264006
  • 折叠

摘要

阐述一种基于注意力机制的端到端语音识别模型,采用编码器-解码器架构,可直接将语音信号转换为文本.在Librispeech数据集中,该模型的字错误率低于5.8%,优于大多数传统语音识别系统.

Abstract

This paper describes an end-to-end speech recognition model based on attention mechanism,which adopts an encoder decoder architecture and can directly convert speech signals into text.In the Librispeech dataset,the model achieved a word error rate of 5.8%,which is better than most traditional speech recognition systems.

关键词

语音识别/端到端模型/注意力机制/深度学习/编码器-解码器架构

Key words

speech recognition/end-to-end model/attention mechanism/deep learning/encoder decoder architecture

引用本文复制引用

出版年

2024
电子技术
上海市电子学会,上海市通信学会

电子技术

影响因子:0.296
ISSN:1000-0755
段落导航相关论文