OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

计算机科学 2010

Study of Temporal Difference Learning in Computer Games
面向机器博弈的即时差分学习研究

XU Chang-ming,MA Zong-min,XU Xin-he,LI Xin-xing,
徐长明,马宗民,徐心和,李新星

Keywords: Computer games,Temporal difference learning,Connect6
机器博弈,即时差分学习,六子棋

Full-Text Cite this paper Add to My Lib

Abstract:

Temporal Difference (Abbr. TD) learning algorithm was used to adjust weights of evaluation function by using Connect6 game as testbed in this paper,which makes the weights adjustment process can be done automatically. A new evaluation scheme was proposed,which can solve the difficult to combine the prior knowledge and multi-layer neural network organically. On account of the specific application,the method selecting part of the whole TD sectuence to learn was proposed, by which the interference of useless states is prevented to a certain extent. After 10020 self-learning training, the winning rate is increased with 8 % around against the same Connect6-playing program, which is a good result.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Study of Temporal Difference Learning in Computer Games面向机器博弈的即时差分学习研究

Study of Temporal Difference Learning in Computer Games
面向机器博弈的即时差分学习研究