全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Learning of Soccer Player Agents Using a Policy Gradient Method : Coordination Between Kicker and Receiver During Free Kicks

Keywords: RoboCup , Soccer Simulation , Multiagents , Policy-Gradient methods , Reinforcement Learning

Full-Text   Cite this paper   Add to My Lib

Abstract:

As an example of multi-agent learning in soccer games of the RoboCup 2D Soccer SimulationLeague, we dealt with a learning problem between a kicker and a receiver when a direct free kickis awarded just outside the opponent’s penalty area. We propose how to use a heuristic functionto evaluate an advantageous target point for safely sending/receiving a pass and scoring. Theheuristics include an interaction term between a kicker and a receiver to intensify theircoordination. To calculate the interaction term, we let a kicker/receiver agent have areceiver’s/kicker’s action decision model to predict a receiver’s/kicker’s action. Parameters in theheuristic function can be learned by a kind of reinforcement learning called the policy gradientmethod. Our experiments show that if the two agents do not have the same type of heuristics, theinteraction term based on prediction of a teammate’s decision model leads to learning a masterservantrelation between a kicker and a receiver, where a receiver is a master and a kicker is aservant.

Full-Text

comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413