|
计算机应用研究 2010
Reinforcement learning: survey of recent work
|
Abstract:
The problem of agent learning to act in an unknown world is both challenging and interesting. Reinforcement lear-ning has been successful at finding optimal control policies through trial-and-error interaction with dynamic environment. Its properties of self-improving and online learning make reinforcement learning become one of most important machine learning methods. The goal of this paper was to provide a comprehensive review of reinforcement learning about theory, algorithms and applications. First of all, this paper surveyed the foundation, model of environment of reinforcement learning. Discussed the convergence and generalization of the algorithms in the next. Then deeply discussed two representative selection of these algorithm, including discounted reward and average reward. Finally, provided some applications of reinforcement learning, and pointed out some challenges and problems of reinforcement learning.