%0 Journal Article
%T 基于强化学习的多智能体路径规划研究与应用<br>Research and Application of Multi-Agent Path Planning Based on Reinforcement Learning
%A 陈天润
%A 高大威
%J Modeling and Simulation
%P 5272-5283
%@ 2324-870X
%D 2023
%I Hans Publishing
%R 10.12677/MOS.2023.126479
%X 研究聚焦于智能仓储中AGV的路径规划问题，构建了基于强化学习的多智能体寻路算法。每个AGV能从环境和以往经验中学习，利用不同行为产生的奖励机制，训练智能体自主选择更高效策略，以达到预定目标。本研究在ACTOR-CRITIC算法基础上加入经验回放机制并采用了中心化训练和去中心化决策的方法，以提高智能体的路径规划效率。同时，将ACTOR-CRITIC算法在智能仓的环境下进行模拟训练，验证AGV的路径规划效果。<br />
The research focuses on the path planning problem of AGV in intelligent warehousing, constructs a multi-agent path finding algorithm based on reinforcement learning. Each AGV can learn from the environment and previous experience, use the reward mechanism generated by different behaviors to train the agent to independently choose more efficient strategies to achieve the predetermined goal. In this research, an experience playback mechanism is added to the ACTOR-CRITIC algorithm, centralized training and decentralized decision-making methods are adopted to improve the path planning efficiency of the agent. At the same time, the ACTOR-CRITIC algorithm is simulated and trained in the environment of the smart warehouse to verify the path planning effect of the AGV.
%K 智能仓储，多智能体强化学习，路径规划<br>Intelligent Warehousing
%K Multi-Agent Reinforcement Learning
%K Path Planning
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=75287