|
强化学习方法的理论与应用研究
|
Abstract:
[1] | Sutton, R.S. and Barto, A.G. (2018) Reinforcement Learning: An Introduction. MIT Press, Cambridge, 54-93. |
[2] | Konda, V.R. and Tsitsiklis, J.N. (2000) Actor-Critic Algorithms. Advances in Neural Information Pro-cessing Systems. NIPS Conference, Denver, Colorado, 29 November-4 December 1999. |
[3] | Silver, D., Lever, G., Heess, N., et al. (2014) Deterministic Policy Gradient Algorithms. International Conference on Machine Learning, Bei-jing, 21-26 June 2014, 387-395. |
[4] | Watkins, C.J.C.H. and Dayan, P. (1992) Q-Learning. Machine Learning, 8, 279-292. |
[5] | Fujimoto, S., Hoof, H. and Meger, D. (2018) Addressing Function Approximation Error in Actor-Critic Methods. International Conference on Machine Learning, Stockholm, 10-15 July 2018, 1587-1596. |
[6] | Heuillet, A., Couthouis, F. and Díaz-Rodríguez, N. (2021) Explainability in Deep Reinforcement Learning. Knowledge-Based Sys-tems, 214, Article ID: 106685. https://doi.org/10.1016/j.knosys.2020.106685 |
[7] | Madumal, P., Miller, T., Sonenberg, L., et al. (2020) Explainable Reinforcement Learning through a Causal Lens. Proceedings of the AAAI Con-ference on Artificial Intelligence, New York, 7-12 February 2020, 2493-2500. |
[8] | Sequeira, P. and Gervasio, M. (2020) Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents’ Capabilities and Limi-tations. Artificial Intelligence, 288, Article ID: 103367.
https://doi.org/10.1016/j.artint.2020.103367 |
[9] | Madumal, P., Miller, T., Sonenberg, L., et al. (2020) Distal Ex-planations for Explainable Reinforcement Learning Agents. arXiv:2001.10284. |
[10] | Liventsev, V., H?rm?, A. and Petkovi?, M. (2021) Neurogenetic Programming Framework for Explainable Reinforcement Learning. Proceedings of the Genetic and Evolutionary Computation Conference Companion, Lille, 10-14 July 2021, 329-330. |
[11] | Cruz, F., Daze-ley, R., Vamplew, P., et al. (2021) Explainable Robotic Systems: Understanding Goal-Driven Actions in a Reinforcement Learning Scenario. arXiv:2006.13615. |
[12] | Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, 4-9 December 2017, 11 p. |
[13] | Parisotto, E., Song, F., Rae, J., et al. (2020) Stabilizing Transformers for Reinforcement Learning. International Conference on Machine Learning, Virtual, 12-18 July 2020, 7487-7498. |
[14] | Janner, M., Li, Q. and Levine, S. (2021) Offline Reinforcement Learning as One Big Sequence Modeling Problem. arXiv:2106.02039. |
[15] | Chen, L., Lu, K., Rajeswaran, A., et al. (2021) Decision Transformer: Reinforcement Learning via Sequence Modeling. arXiv:2106.01345. |
[16] | Yarats, D., Fergus, R., Lazaric, A., et al. (2021) Reinforcement Learning with Prototypical Representations. International Conference on Machine Learning, Virtual, 18-24 July 2021, 11920-11931. |
[17] | Schwarzer, M., Rajkumar, N., Noukhovitch, M., et al. (2021) Pretraining Representations for Da-ta-Efficient Reinforcement Learning. 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Vir-tual, 6-14 December 2021, 14 p. |
[18] | Hansen, N. and Wang, X. (2021) Generalization in Reinforcement Learning by Soft Data Augmentation. 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, 30 May-5 June 2021, 13611-13617.
https://doi.org/10.1109/ICRA48506.2021.9561103 |
[19] | Brockman, G., Cheung, V., Pettersson, L., et al. (2016) OpenAI Gym. arXiv:1606.01540. |
[20] | Schrittwieser, J., Antonoglou, I., Hubert, T., et al. (2020) Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model. Nature, 588, 604-609. |
[21] | Gu, B. and Sung, Y. (2021) Enhanced Reinforcement Learning Method Combining One-Hot Encoding-Based Vectors for CNN-Based Alternative High-Level Decisions. Applied Sciences, 11, Article No. 1291.
https://doi.org/10.3390/app11031291 |
[22] | Johannink, T., Bahl, S., Nair, A., et al. (2019) Residual Reinforcement Learning for Robot Control. 2019 International Conference on Robotics and Automation (ICRA), Montreal, 20-24 May 2019, 6023-6029.
https://doi.org/10.1109/ICRA.2019.8794127 |
[23] | Zhang, R., Lv, Q., Li, J., et al. (2022) A Reinforcement Learn-ing Method for Human-Robot Collaboration in Assembly Tasks. Robotics and Computer-Integrated Manufacturing, 73, Article ID: 102227.
https://doi.org/10.1016/j.rcim.2021.102227 |
[24] | Kiran, B.R., Sobh, I., Talpaert, V., et al. (2021) Deep Reinforce-ment Learning for Autonomous Driving: A Survey. IEEE Transactions on Intelligent Transportation Systems, 1-18. https://doi.org/10.1109/TITS.2021.3054625 |
[25] | Ma, X., Li, J., Kochenderfer, M.J., et al. (2021) Reinforcement Learning for Autonomous Driving with Latent State Inference and Spatial-Temporal Relationships. 2021 IEEE Interna-tional Conference on Robotics and Automation (ICRA), Xi’an, 30 May-5 June 2021, 6064-6071. https://doi.org/10.1109/ICRA48506.2021.9562006 |
[26] | Chen, J., Li, S.E. and Tomizuka, M. (2021) Interpretable End-to-End Urban Autonomous Driving with Latent Deep Reinforcement Learning. IEEE Transactions on Intelligent Transportation Systems, 1-11.
https://doi.org/10.1109/TITS.2020.3046646 |