马寿峰,李英,刘豹.一种基于Agent的单路口交通信号学习控制方法[J].系统工程学报,2002,17(6):526-530.MA Shou-feng,LI Ying,LIU Bao.Agent-based learning control method for urban traffic signal intersection[J].Journal of Control Theory and Applications,2002,17(6):526-530.(in Chinese)
[2]
LEE Jee-hyong,HYUNG Lee-kwang.Distributed and cooperative fuzzy controllers for traffic intersections group[J].IEEE Trans Syst Man Cybem C,1999,29(2):263-271.
[3]
SUTTON R S,BARTO A.Reinforcement learning:an introduction[M].Cambridge,Massachusetts:MIT Press,1998.
[4]
SUTTON R S.Introduction:the challenge of reinforcement learning[J].Machine Learning,1992(8):225-227.
蒋国飞,高慧琪,吴沧浦.Q学习算法中网格离散化方法的收敛性分析[J].控制理论与应用.1999,16(2):194-198.JIANG Guo-fei,GAO Hui-qi,WU Cang-pu.Convergence of discretization procedure in Q learning[J].Journal of Control Theory and Applications,1999,16(2):194-198.(in Chinese)
[7]
蒋国飞,吴沧浦.基于Q学习算法和BP神经网络的倒立摆控制[J].自动化学报,1998,24(5):662-666.JIANG Guo-fei,WU Cang-pu.Learning to control an inverted pendulum using Q-lemming and neural networks[J].Acta Automatica Sinica,1998,24(5):662-666.(in Chinese)