联合CTC和Transformer的轮式移动机器人语音控制研究
Research on voice control of wheeled mobile robot combined with CTC and Transformer
唐咸荣 1高瑞贞1
作者信息
- 1. 河北工程大学机械与装备工程学院,河北邯郸 056038
- 折叠
摘要
针对人机交互过程中手动控制轮式移动机器人步骤繁琐且双手受到限制的问题,提出并实现基于深度学习算法的轮式移动机器人语音控制系统.系统选取树莓派 4B开发板作为主控制器,科大讯飞 6阵列语音模块作为语音采集器,STM32单片机作为底层轮式移动机器人的控制器.语音识别算法部分设计基于Transformer的端到端语音识别模型,并加入连续时序分类算法来辅助模型的训练,模型的收敛速度和鲁棒性都得到相应的提升.模型在AISHELL-1语音数据集上测试的字错率为 5.57%,相对于Transformer单独训练,字错率相对下降 5.1%.经过平台搭建和实验,实现轮式移动机器人根据用户语音指令完成相对应动作的目的,有利于提高工作效率和解放用户的双手.
Abstract
Aiming at the problem that the manual control of wheeled mobile robot in the process of human-computer interaction is cumbersome and hands are limited,a voice control system of wheeled mobile robot based on deep learning algorithm is proposed and implemented.The system chooses Raspberry PI 4B development board as the main controller,iFLYTEK voice module with 6 array as the voice collector and STM32 MCU as the controller of the bottom wheeled mobile robot.The speech recognition part designs an end-to-end speech recognition model based on Transformer,and connectionist temporal classification algorithm is added to assist the training of the model.The convergence rate of the model is accelerated and the robustness of the model is improved.The word error rate of the model tested on the speech dataset AISHELL-1 is 5.57%,separately decreased by 5.1%compared to Transformer training alone.Through the platform construction and experiments,the wheeled mobile robot can complete corresponding actions according to the user's voice command,which is beneficial to improve work efficiency and liberate the user's hands.
关键词
轮式移动机器人/语音控制/Transformer/连接时序分类Key words
wheeled mobile robot/voice control/Transformer/connectionist temporal classification引用本文复制引用
基金项目
河北省高校科技攻关项目(ZD2018207)
出版年
2024