全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
电子学报  2006 

基于用户偏好的智能业务选取研究

, PP. 2537-2540

Keywords: 效用理论,用户偏好,马尔可夫判决过程,强化学习

Full-Text   Cite this paper   Add to My Lib

Abstract:

将马尔可夫判决过程和强化学习算法相结合,给出了异构无线网络环境下用户业务偏好评估模型的技术框架.为动态环境下用户需求的感知、量化和适配特征的研究提供了基本的数学描述,对解决用户体验的评价问题和业务与业务环境的适配问题提供了新的研究思路.仿真结果表明构建的模型能够在满足用户偏好的前提下智能选择业务.

References

[1]  B Liver,J Altmann.Social carder recommendation for selecting services in electronic telecommunication markets:A preliminary report[R].Technical Report TR-97-033,ICSI,Berkeley,CA,USA,1997.
[2]  U Chajewska,D Koller,R Parr.Making rational decisions during adaptive utility elicitation[A].In Proceedings of the Seventeenth National Conference on Artificial Intelligence[C].Austin,TX,USA,2000.363-369.
[3]  Daniel P Boulet,Niall M Fraser.Improving preference elicitation for decision support systems[J].IEEE Trans,1995.1574-1579.
[4]  R S Sutton,A G Barto.Reinforcement learning[M].MIT Press,Cambridge,MA,1998.
[5]  刘克.实用马尔可夫决策过程[M].北京:清华大学出版社,2004.
[6]  Craig Boutilier.A POMDP formulation of preference eficitation problems[A].In Proceedings of American Association of Artificial Intelligence[C].Edmonton,Alberta,Canada,2002.239-246.
[7]  Simon French.Decision Theory:An Introduction to the Mathematics of Rationality[M].New York,USA,Halsted Press,1986.
[8]  Leslie Pack Kaelbling,Andrew W Moore.Reinforcement learning:a survey[J].Journal of Artificial Intelligence Research,1996,237-285.
[9]  胡奇英,刘建庸.马尔可夫决策过程引论[M].西安:西安电子科技大学出版社,2000.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133