OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

模式识别与人工智能 2006

基于最先策略增强学习的ART2神经网络*

, PP. 428-432

樊建,吴耿锋

Keywords: 增强学习,ART2神经网络,最先策略,避碰撞

Full-Text Cite this paper Add to My Lib

Abstract:

提出一种基于最先策略增强学习的ART2神经网络FPRLART2(ForemostPolicyReinforcementLearningbasedART2neuralnetwork),并介绍其学习算法.为了达到在线学习的目的,在FPRLART2中,从状态到行为值之间的映射中,选择第一个得到奖励的行为,而不是选择诸如1stepQLearning中具有最优行为值的行为.ART2神经网络用于存储分类模式,其权重通过增强学习增强或减弱,达到学习的目的.并将FPRLART2运用到移动机器人避碰撞问题的研究中.仿真实验表明,引入FPRLART2后减少移动机器人与障碍物发生碰撞的次数,具有良好的避碰效果.

References

[1]	Carpenter G A, Grossberg S. ART2: Stable Self-Organization of Category Recognition Codes for Analog Input Patterns. Applied Optics, 1987, 26(23): 4919-4930
[2]	Liu X H, Yu Z Z, Duan J, et al. Face Recognition Using Adaptive Resonance Theory. In: Proc of the International Conference on Machine Learning and Cybernetics. Xi’an, China, 2003, Ⅴ: 3167-3171
[3]	Fan J, Wu G F, et al. Reinforcement Learning and ART2 Neural Network Based Collision Avoidance System of Mobile Robot. In: Yin F L, Wang J, Guo C G, eds. Lecture Notes in Computer Science. 2004, 3174: 35-40
[4]	Li M, Yan C H, Liu G H. ART2 Neural Networks with More Vigorous Vigilance Test Criterion. Journal of Image and Graphics, 2001, 6(1): 81-85 (in Chinese) (黎明,严超华,刘高航.具有更严格警戒测试准则的ART2神经网络.中国图象图形学报, 2001, 6(1): 81-85)
[5]	Whitehead S D, Sutton R S, Ballard D H. Advances in Reinforcement Learning and Their Implications for Intelligent Control. In: Proc of the 5th IEEE International Symposium on Intelligent Control. Philadelphia, USA ,1990, Ⅱ: 1289-1297
[6]	Suwimonteerabuth D, Chongstitvatana P. Online Robot Learning by Reward and Punishment for a Mobile Robot. In: Proc of the IEEE/RSJ International Conference on Intelligent Robots and Systems. Lausanne, Switzerland, 2002, Ⅰ: 921-926
[7]	Fujimori A, Tani S. A Navigation of Mobile Robot with Collision Avoidance for Moving Obstacles. In: Proc of the IEEE International Conference on Industrial Technology. Bangkok, Thailand, 2002, Ⅰ: 1-6
[8]	Grossberg S. Adaptive Pattern Classification and Universal Recoding, I: Parallel Development and Coding of Neural Feature Detectors. Biological Cybernetics, 1976, 23(3): 121-134
[9]	Grossberg S. Adaptive Pattern Classification and Universal Recoding, II: Feedback, Expectation, Olfaction, Illusions. Biological Cybernetics, 1976, 23(4): 187-202
[10]	Sutton R S, Barto A G. Reinforcement Learning: An Introduction. Cambridge, USA: MIT Press, 1998
[11]	Xiao N F, Nahavandi S. A Reinforcement Learning Approach for Robot Control in an Unknown Environment. In: Proc of the IEEE International Conference on Industrial Technology. Bangkok, Thailand, 2002, Ⅱ: 1096-1099

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133