Non-trivial two-armed partial-monitoring games are bandits

Abstract:

We consider online learning in partial-monitoring games against an oblivious adversary. We show that when the learner has exactly two actions available and the game is nontrivial, the game is reducible to a bandit-like game, and thus the minimax regret is $\Theta(\sqrt{T})$.
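
For reference, the regret in this claim can be written out under the usual conventions for finite partial-monitoring games. The symbols $L$, $I_t$, $J_t$, and $R_T$ are assumed notation for illustration and do not appear in the abstract. With a loss function $L$, learner actions $I_t \in \{1, 2\}$, and an outcome sequence $J_1, \dots, J_T$ fixed in advance by the oblivious adversary, the expected regret is

$$R_T = \mathbb{E}\left[\sum_{t=1}^{T} L(I_t, J_t)\right] - \min_{i \in \{1, 2\}} \sum_{t=1}^{T} L(i, J_t).$$

In this notation, the stated result is that the minimax value $\inf_{\text{learner}} \sup_{J_1, \dots, J_T} R_T$ grows as $\Theta(\sqrt{T})$, the same rate as in adversarial bandit problems.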
