Robotics & Machine Learning Daily News2024,Issue(Jun.24) :13-13.

Researcher from Harbin Institute of Technology Reports Details of New Studies an d Findings in the Area of Robotics (A Soft Actor-Critic Deep Reinforcement-Learn ing-Based Robot Navigation Method Using LiDAR)

哈尔滨工业大学的研究员报告了机器人领域的新研究和发现的细节(一种软演员-批评深度强化-学习基于激光雷达的机器人导航方法)

Robotics & Machine Learning Daily News2024,Issue(Jun.24) :13-13.

Researcher from Harbin Institute of Technology Reports Details of New Studies an d Findings in the Area of Robotics (A Soft Actor-Critic Deep Reinforcement-Learn ing-Based Robot Navigation Method Using LiDAR)

哈尔滨工业大学的研究员报告了机器人领域的新研究和发现的细节(一种软演员-批评深度强化-学习基于激光雷达的机器人导航方法)

扫码查看

摘要

由一名新闻记者-机器人与机器学习的工作人员新闻编辑每日新闻-机器人的新研究是一份新报告的主旨。根据NewsRx记者在哈尔滨的新闻报道,研究表明:“当环境中存在动态障碍物时,传统的路径生成算法很难达到预期的避障效果。”本研究的资金来源包括黑龙江省重点研发项目的重点专项项目。为了解决这一问题,本文提出了一种基于SAC(Soft Actor-Critic)深度强化学习的机器人导航控制方法,首先利用快速路径生成算法控制机器人在遇到危险和接近目标时产生专家轨迹;将SAC强化学习与基于EXP ERT轨迹的模拟学习相结合,提高了训练的安全性,针对Agent数据和专家数据的混合数据,采用改进的优先经验重放方法提高了策略的学习效率.

Abstract

By a News Reporter-Staff News Editor at Robotics & Machine Learning Daily News Daily News-New research on robotics is the subjec t of a new report. According to news reporting from Harbin, People's Republic of China, by NewsRx journalists, research stated, "When there are dynamic obstacle s in the environment, it is difficult for traditional path-generation algorithms to achieve desired obstacle-avoidance results." Financial supporters for this research include Key Special Projects of Heilongji ang Province's Key R &D Program. The news correspondents obtained a quote from the research from Harbin Institute of Technology: "To solve this problem, we propose a robot navigation control me thod based on SAC (Soft Actor-Critic) Deep Reinforcement Learning. Firstly, we u se a fast path-generation algorithm to control the robot to generate expert traj ectories when the robot encounters danger as well as when it approaches a target , and we combine SAC reinforcement learning with imitation learning based on exp ert trajectories to improve the safety of training. Then, for the hybrid data co nsisting of agent data and expert data, we use an improved prioritized experienc e replay method to improve the learning efficiency of the policies."

Key words

Harbin Institute of Technology/Harbin/People's Republic of China/Asia/Emerging Technologies/Machine Learning/Reinf orcement Learning/Robot/Robotics

引用本文复制引用

出版年

2024
Robotics & Machine Learning Daily News

Robotics & Machine Learning Daily News

ISSN:
段落导航相关论文