全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

THE EXPECTED AVERAGE CRITERION FOR NONSTATIONARY MDP
非平稳MDP的期望平均准则

Keywords: Markov decision processes,expected average criterion,optimal equation,optimal policy
马氏决策过程(MDP)
,期望平均准则,最优方程,最优策略

Full-Text   Cite this paper   Add to My Lib

Abstract:

In this paper, we consider the expected average criterion for nonstationary MDP. By probability and martingale method, we prove the existence of #epsilon#(>=0)-optimal Markov policies under weakly ergodic conditions. As a typical example of this paper, we solve the open problem posed by Feinberg and Park again.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133