Variable Speed Limit Control Model Based on Finite-Stage Markov Decision Processes

pp. 109-114

Keywords: traffic information engineering, variable speed limit control, Markov decision process, reinforcement learning, freeway mainline


Abstract:

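This paper analyzes the role of variable speed limit (VSL) control on freeway mainlines and reviews existing speed-limit methods. Treating mainline VSL control as a discrete-time Markov decision process, it proposes a VSL control model based on reinforcement learning and finite-stage Markov decision processes, in which the model is adjusted dynamically through interactive learning with the traffic environment. The model is solved with a finite-stage backward recursive iteration algorithm, and the full length of the Changji Expressway is simulated in Paramics. The simulation results show that, with the average speed limit 6.25% below the design speed, the average flow did not decrease but instead increased by 3.20%. The model can therefore effectively increase traffic flow and improve traffic conditions on the freeway mainline.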
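To illustrate the finite-stage backward recursion mentioned in the abstract, the following is a minimal sketch of generic backward induction for a finite-horizon Markov decision process. It is not the paper's model: the three traffic states, the two speed-limit actions, the transition probabilities, and the flow-like rewards are all hypothetical values chosen only for illustration, as are the names `backward_induction`, `P`, and `R`.

```python
import numpy as np


def backward_induction(P, R, horizon, terminal_value=None):
    """Solve a finite-horizon MDP by backward recursion.

    P[a, s, s2] is the probability of moving from state s to s2 under action a.
    R[a, s] is the expected one-step reward for taking action a in state s.
    Returns V with shape (horizon + 1, n_states) and
    policy with shape (horizon, n_states).
    """
    n_actions, n_states, _ = P.shape
    V = np.zeros((horizon + 1, n_states))
    if terminal_value is not None:
        V[horizon] = terminal_value
    policy = np.zeros((horizon, n_states), dtype=int)

    # Walk backwards from the last decision stage to the first.
    for t in range(horizon - 1, -1, -1):
        # Q[a, s] = immediate reward + expected value of the next stage.
        Q = R + P @ V[t + 1]
        policy[t] = Q.argmax(axis=0)
        V[t] = Q.max(axis=0)
    return V, policy


# Hypothetical toy problem (all numbers are assumptions, not from the paper).
# States: 0 = free flow, 1 = dense, 2 = congested.
# Actions: 0 = higher speed limit, 1 = lower speed limit.
P = np.array([
    [[0.7, 0.3, 0.0],   # higher limit
     [0.1, 0.5, 0.4],
     [0.0, 0.2, 0.8]],
    [[0.8, 0.2, 0.0],   # lower limit
     [0.3, 0.6, 0.1],
     [0.0, 0.4, 0.6]],
])
# Normalized flow used as the reward (hypothetical).
R = np.array([
    [1.00, 0.80, 0.30],  # higher limit
    [0.90, 0.85, 0.40],  # lower limit
])

V, policy = backward_induction(P, R, horizon=4)
print("Stage-0 values:", V[0])
print("Stage-wise policy (rows = stages, columns = states):")
print(policy)
```

Each row of `policy` gives the action chosen at that control stage for every traffic state, so the chosen limit may change over the horizon. In a reinforcement-learning setting such as the one the abstract describes, `P` and `R` would presumably be estimated from interaction with the traffic environment rather than fixed in advance.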
