%0 Journal Article
%T Event-Triggered Reinforcement Learning for Multi-Agent Formation Control
%A 徐鹏
%A 谢广明
%A 文家燕
%A 高远
%J 智能系统学报
%D 2019
%R 10.11992/tis.201807010
%X Classical reinforcement learning for multi-agent formation consumes substantial communication and computing resources. To address this, this paper introduces an event-triggered control mechanism: an agent's action decisions need not be made at a fixed period; instead, its action is updated only when an event-triggered condition is met. The event-triggered condition accounts for both the agent's cumulative reward and the deviation between the agent's reward and those of its neighbors, and the agents interact with one another to seek an optimal joint policy that achieves the formation. Numerical simulation results show that the event-triggered reinforcement learning algorithm for multi-agent formation control effectively reduces the frequency of the agents' action decisions and their resource consumption while maintaining system performance.
%K reinforcement learning
%K multi-agent
%K event-triggered
%K formation control
%K Markov decision processes
%K swarm intelligence
%K action decisions
%K particle swarm optimization
%U http://tis.hrbeu.edu.cn/oa/darticle.aspx?type=view&id=201807010