融合经验知识与深度强化学习的久棋Alpha-Beta算法优化研究

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：藏族久棋作为一种传统的棋类博弈游戏,具备高度复杂的规则体系以及变幻莫测的棋局演变.传统的博弈策略在面对不同对手和棋局时不稳定,性能差,需要新的方法提高藏族久棋AI的博弈水平.以藏族久棋为研究对象,针对布局阶段,改进传统Alpha-Beta剪枝搜索算法,并结合经验知识,融入深度强化学习算法完成棋盘布局合理性的落子选择,以此为后续阶段铺路.在行棋阶段与飞子阶段,结合经验知识使用Alpha-Beta算法,完成行棋路径.最后,将所提算法和策略集成于久棋AI程序,在中国计算机博弈锦标赛中取得了良好的成绩,验证了该方法的有效性.

外文标题：Optimization of Alpha-Beta algorithm of Jiu Chess by combining empirical knowledge and deep reinforcement learning

外文摘要：Zangzu Jiu Chess, as a traditional board game, has highly complex rule systems and ever-changing board configurations.Traditional gaming strategies are unstable and perform poorly when faced with different opponents and situations, necessitating the development of new approaches to enhance the gaming capabilities of Zangzu Jiu Chess AI.This paper focuses on Zangzu Jiu Chess and, during the board layout phase, improves the traditional Alpha-Beta pruning search algorithm.It integrates empirical knowledge with deep reinforcement learning algorithms to make informed choices for rational piece placement on the chessboard, thereby paving the way for subsequent stages.During the chess stage and the flying stage, the Alpha-Beta algorithm is employed in conjunction with empirical knowledge to determine movement paths.Finally, the previously mentioned algorithms and strategies are integrated into a Zangzu Jiu Chess AI program, which achieves favorable results in the China Computer Game Championship, validating the effectiveness of this approach.

外文关键词：

Zangzu Jiu Chessempirical knowledgeAlpha-Beta algorithmdeep reinforcement learningcomputer game

作者：

张小川、杨小漫、涂飞、王鑫、严明珠、梁渝卓

展开 >

作者单位：

重庆理工大学两江人工智能学院, 重庆 401135

关键词：

藏族久棋经验知识 Alpha-Beta算法深度强化学习计算机博弈

基金：

国家自然科学基金项目重庆市技术创新与应用发展专项项目

项目编号：

60443004cstc2021jscxdxwtBX0019

出版年：

2024

DOI：

10.3969/j.issn.1674-8425(z).2024.05.015

重庆理工大学学报

重庆理工大学

重庆理工大学学报

CSTPCD北大核心

影响因子：0.567

ISSN：1674-8425

年,卷(期)：2024.38(9)

参考文献量18