首页|The Tong Test:Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions

The Tong Test:Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions

扫码查看
The release of the generative pre-trained transformer(GPT)series has brought artificial general intelli-gence(AGI)to the forefront of the artificial intelligence(AI)field once again.However,the questions of how to define and evaluate AGI remain unclear.This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions(DEPSI).More specifically,we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system.The Tong test describes a value-and ability-oriented testing system that delin-eates five levels of AGI milestones through a virtual environment with DEPSI,allowing for infinite task generation.We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized,quantitative,and objective bench-marks and evaluation of AGI.

Artificial general intelligenceArtificial intelligence benchmarkArtificial intelligence evaluationEmbodied artificial intelligenceValue alignmentTuring testCausality

Yujia Peng、Jiaheng Han、Zhenliang Zhang、Lifeng Fan、Tengyu Liu、Siyuan Qi、Xue Feng、Yuxi Ma、Yizhou Wang、Song-Chun Zhu

展开 >

National Key Laboratory of General Artificial Intelligence,Beijing Institute for General Artificial Intelligence,Beijing 100086,China

Institute for Artificial Intelligence,Peking University,Beijing 100871,China

Beijing Key Laboratory of Behavior and Mental Health,School of Psychological and Cognitive Sciences,Peking University,Beijing 100871,China

School of Intelligence Science and Technology,Peking University,Beijing 100871,China

School of Computer Science,Peking University,Beijing 100871,China

展开 >

National Key Research and Development Program of China

2022ZD0114900

2024

工程(英文)

工程(英文)

CSTPCDEI
ISSN:2095-8099
年,卷(期):2024.34(3)