OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

自动化学报 2012

Learning Control of Dynamical Systems Based on Markov Decision Processes: Research Frontiers and Outlooks
基于马氏决策过程模型的动态系统学习控制:研究前沿与展望

XU Xin,SHEN Dong,GAO Yan-Qing,WANG Kai,
徐昕,沈栋,高岩青,王凯

Keywords: Learning control,Markov decision processes (MDP),reinforcement learning (RL),approximate dynamic programming (ADP),machine learning,adaptive control
学习控制,Markov决策过程,增强学习,近似动态规划,机器学习,自适应控制

Full-Text Cite this paper Add to My Lib

Abstract:

Learning control of dynamical systems based on Markov decision processes (MDPs) is an interdisciplinary research area of machine learning, control theory, and operations research. The main objective in this research area is to realize data-driven multi-stage optimal control for complex or uncertain dynamical systems. This paper presents a comprehensive survey on the theory, algorithms, and applications of MDP-based learning control of dynamical systems. Emphases are put on recent advances in the theory and methods of reinforcement learning (RL) and adaptive/approximate dynamic programming (ADP), including temporal-difference learning theory, value function approximation for continuous state and action spaces, direct policy search, approximate policy iteration, and adaptive critic designs. Applications and the trends for future research and developments in related fields are also discussed.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Learning Control of Dynamical Systems Based on Markov Decision Processes: Research Frontiers and Outlooks基于马氏决策过程模型的动态系统学习控制:研究前沿与展望

Learning Control of Dynamical Systems Based on Markov Decision Processes: Research Frontiers and Outlooks
基于马氏决策过程模型的动态系统学习控制:研究前沿与展望