offline reinforcement learning; online fine-tuning; task generalization; successor representations; ensembles
National Science Fund for Distinguished Young Scholars; National Natural Science Foundation of China; National Natural Science Foundation of China; National Natural Science Foundation of China; Fok Ying-Tong Education Foundation China; Tencent Foundation, XPLORER PRIZE; Science Center Program of National Natural Science Foundation of China; Heilongjiang Touyan Innovation Team Program
62025602; 62306242; U22B2036; 11931015; 171105; 62188101
2024