Science China Information Sciences (English edition), 2024, Vol. 67, Issue 7: 129-155. DOI: 10.1007/s11432-023-3955-x

Learning in games: a systematic review

Rong-Jun QIN 1, Yang YU 1

Author information

  • 1. National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China; Polixir Technologies, Nanjing 210023, China

Abstract

Game theory studies mathematical models of interaction among self-interested individuals. Nash equilibrium is arguably the most central solution concept in game theory. While finding a Nash equilibrium is, in general, PPAD-complete (PPAD: polynomial parity arguments on directed graphs), learning in games offers an alternative way to approximate a Nash equilibrium: each player iteratively updates its strategy through interactions with the other players. Rules and models have been developed for learning in games, such as fictitious play and no-regret learning. In particular, recent advances in online learning and deep reinforcement learning have driven breakthroughs in learning in games, from theory to application. As a result, we have witnessed many superhuman game AI systems. The techniques used in these systems have evolved from conventional search and learning to purely reinforcement learning (RL)-style methods, gradually removing the reliance on domain knowledge. In this article, we systematically review the above techniques, discuss the trend of basic learning rules towards a unified framework, and recap applications in large games. Finally, we discuss some future directions and offer an outlook on future game AI systems. We hope this article will give some insights into designing novel approaches.
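
To make the idea of iteratively updating strategies concrete, the following is a minimal sketch of fictitious play, one of the learning rules named in the abstract. It is not taken from the article: the choice of rock-paper-scissors, the payoff matrix, the function name `fictitious_play`, and the step count are all illustrative assumptions. Each player repeatedly best-responds to the opponent's empirical action frequencies; in two-player zero-sum games these frequencies are known to converge to a Nash equilibrium, which for rock-paper-scissors is the uniform mixture.

```python
import numpy as np

# Illustrative assumption: row player's payoffs for rock-paper-scissors.
# The game is zero-sum, so the column player receives the negative.
PAYOFF = np.array([[0.0, -1.0, 1.0],
                   [1.0, 0.0, -1.0],
                   [-1.0, 1.0, 0.0]])


def fictitious_play(payoff, steps=20000):
    """Run simultaneous fictitious play and return both empirical mixtures."""
    n_rows, n_cols = payoff.shape
    row_counts = np.zeros(n_rows)
    col_counts = np.zeros(n_cols)
    # Arbitrary initial actions so the first best response is well defined.
    row_counts[0] += 1
    col_counts[0] += 1
    for _ in range(steps):
        # Row player maximizes payoff against the column player's history.
        row_action = int(np.argmax(payoff @ col_counts))
        # Column player minimizes the row player's payoff (zero-sum).
        col_action = int(np.argmin(row_counts @ payoff))
        row_counts[row_action] += 1
        col_counts[col_action] += 1
    return row_counts / row_counts.sum(), col_counts / col_counts.sum()


if __name__ == "__main__":
    row_mix, col_mix = fictitious_play(PAYOFF)
    print("row empirical strategy:", row_mix)
    print("col empirical strategy:", col_mix)
```

Note that only the time-averaged (empirical) strategies approach the equilibrium here; the action actually played at each step keeps cycling through rock, paper, and scissors.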

Key words

non-cooperative games/learning in games/no-regret learning/reinforcement learning/superhuman AI

Funding

National Key Research and Development Program of China (2020AAA0107200)

National Natural Science Foundation of China (61921006)

Publication year

2024
Science China Information Sciences (English edition)
Chinese Academy of Sciences

Indexed in: CSTPCD, EI
Impact factor: 0.715
ISSN: 1674-733X