Control and Decision (控制与决策), 2014

A Learning Model Based on the Principle of Operant Conditioning

DOI: 10.13195/j.kzyjc.2013.0522, PP. 1016-1020

Ruan Xiaogang (阮晓钢), Huang Jing (黄静), Fan Qingwu (范青武), Wei Ruoyan (魏若岩)

Keywords: learning model, operant conditioning, self-learning, bionics, obstacle avoidance


Abstract:

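To address the problem of autonomous learning in cognitive robots, a learning model based on the principle of operant conditioning (OCLM) is proposed. The model is described in terms of a state space, an operant behavior space, a probability distribution function, a bionic learning mechanism, and system entropy. The concept of the "negative ideal degree" of a state is introduced, and a method for computing the orientation function is defined. The model is applied in simulation experiments on robot obstacle-avoidance navigation, and the choice of parameter settings is discussed. The experimental results show that a robot based on OCLM acquires cognition through interaction with its environment, avoids obstacles, and reaches its destination. It therefore exhibits a degree of self-learning ability, which demonstrates the effectiveness of the model.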
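The abstract names the model's ingredients but does not give its formulas, so the following is only an illustrative sketch of how a probability distribution over operant behaviors and a system entropy could interact. It is not the OCLM update. The update rule (a linear reward-inaction scheme), the 0/1 reward standing in for the orientation function, and the names `reinforce`, `train`, and `good_action` are all assumptions made for illustration.

```python
import math
import random


def entropy(probs):
    """Shannon entropy (in nats) of a discrete probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def reinforce(probs, action, reward, rate=0.1):
    """Shift probability mass toward `action` when it was rewarded.

    Assumed rule (linear reward-inaction), not the paper's update:
    the chosen action moves toward 1 and the others shrink by the same
    factor, so the result still sums to 1. A zero reward leaves the
    distribution unchanged. `rate` is expected to lie in [0, 1].
    """
    step = rate * max(0.0, min(1.0, reward))  # clamp reward to [0, 1]
    return [
        p + step * (1.0 - p) if i == action else p * (1.0 - step)
        for i, p in enumerate(probs)
    ]


def train(n_actions=3, good_action=2, trials=200, seed=0):
    """Run a toy trial-and-error loop in a single state.

    `good_action` is a hypothetical stand-in for the behavior the
    environment favors (for example, steering away from an obstacle).
    Returns the final distribution and the entropy after each trial.
    """
    rng = random.Random(seed)
    probs = [1.0 / n_actions] * n_actions  # start with no preference
    history = [entropy(probs)]
    for _ in range(trials):
        # Sample a behavior from the current distribution.
        action = rng.choices(range(n_actions), weights=probs)[0]
        reward = 1.0 if action == good_action else 0.0
        probs = reinforce(probs, action, reward)
        history.append(entropy(probs))
    return probs, history
```

Calling `train()` returns the final behavior probabilities together with the entropy after each trial. Under this assumed rule, starting from a uniform distribution, the entropy never rises, because only the favored behavior is ever reinforced. That is one simple way to read falling entropy as a sign that the agent has learned a preference.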
