B Liver,J Altmann.Social carder recommendation for selecting services in electronic telecommunication markets:A preliminary report[R].Technical Report TR-97-033,ICSI,Berkeley,CA,USA,1997.
[2]
U Chajewska,D Koller,R Parr.Making rational decisions during adaptive utility elicitation[A].In Proceedings of the Seventeenth National Conference on Artificial Intelligence[C].Austin,TX,USA,2000.363-369.
[3]
Daniel P Boulet,Niall M Fraser.Improving preference elicitation for decision support systems[J].IEEE Trans,1995.1574-1579.
[4]
R S Sutton,A G Barto.Reinforcement learning[M].MIT Press,Cambridge,MA,1998.
[5]
刘克.实用马尔可夫决策过程[M].北京:清华大学出版社,2004.
[6]
Craig Boutilier.A POMDP formulation of preference eficitation problems[A].In Proceedings of American Association of Artificial Intelligence[C].Edmonton,Alberta,Canada,2002.239-246.
[7]
Simon French.Decision Theory:An Introduction to the Mathematics of Rationality[M].New York,USA,Halsted Press,1986.
[8]
Leslie Pack Kaelbling,Andrew W Moore.Reinforcement learning:a survey[J].Journal of Artificial Intelligence Research,1996,237-285.