|
基于DeepSeek微调和动态建模的交通流预测
|
Abstract:
交通流预测是智能交通系统中的关键任务,对城市规划和交通管理具有重要意义。传统深度学习方法虽然提升了预测精度,但其可解释性较差,且常规大语言模型难以捕捉交通流的复杂时空动态特性。为此,本文提出了一种基于DeepSeek大语言模型微调与动态建模相结合的交通流预测方法(DynaSeek)。将动态建模技术和数据修正机制嵌入交通流预测任务中,量化时空因素(如地区、天气)对交通流的影响,采用Lora微调策略对DeepSeek模型进行优化,最后模型在预测时结合历史数据和实时时空信息进行动态修正。结果表明,本文方法在加利福尼亚州多模态数据集上表现优于基线模型,且在可解释性方面提供了更清晰的交通流变化规律。本文为交通流预测提供了一种新的多维度数据融合与动态建模框架,显著提升了模型的实用性和可靠性。
Traffic flow prediction is a key task in intelligent transportation systems and is of great significance to urban planning and traffic management. Although traditional deep learning methods have improved prediction accuracy, their interpretability is poor, and conventional large language models are difficult to capture the complex spatiotemporal dynamic characteristics of traffic flow. To this end, this paper proposes a traffic flow prediction method (DynaSeek) based on the fine-tuning of the DeepSeek large language model and dynamic modeling. The dynamic modeling technology and data correction mechanism are embedded in the traffic flow prediction task, and the impact of spatiotemporal factors (such as region and weather) on traffic flow is quantified. The DeepSeek model is optimized using the Lora fine-tuning strategy. Finally, the model is dynamically corrected by combining historical data and real-time spatiotemporal information during prediction. The results show that the proposed method performs better than the baseline model on the California multimodal dataset and provides a clearer law of traffic flow changes in terms of interpretability. This paper provides a new multi-dimensional data fusion and dynamic modeling framework for traffic flow prediction, which significantly improves the practicality and reliability of the model.
[1] | 刘明宇, 吴建平, 王钰博, 等. 基于深度学习的交通流量预测[J]. 系统仿真学报, 2018, 30(11): 4100-4105. |
[2] | 汪维泰, 王晓强, 李雷孝, 等. 时空图神经网络在交通流预测研究中的构建与应用综述[J]. 计算机工程与应用, 2024, 60(8): 31-45. |
[3] | Min, B., Ross, H., Sulem, E., Veyseh, A.P.B., Nguyen, T.H., Sainz, O., et al. (2023) Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey. ACM Computing Surveys, 56, 1-40. https://doi.org/10.1145/3605943 |
[4] | Zhang, Q., Huang, C., Xia, L., Wang, Z., Li, Z. and Yiu, S. (2023) Automated Spatio-Temporal Graph Contrastive Learning. Proceedings of the ACM Web Conference 2023, Austin, 30 April-4 May 2023, 295-305. https://doi.org/10.1145/3543507.3583304 |
[5] | Wang, C. and Kantarcioglu, M. (2025) A Review of DeepSeek Models’ Key Innovative Techniques. |
[6] | Fu, R., Zhang, Z. and Li, L. (2016) Using LSTM and GRU Neural Network Methods for Traffic Flow Prediction. 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC), Wuhan, 11-13 November 2016, 324-328. https://doi.org/10.1109/yac.2016.7804912 |
[7] | Zhao, L., Song, Y., Zhang, C., Liu, Y., Wang, P., Lin, T., et al. (2020) T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction. IEEE Transactions on Intelligent Transportation Systems, 21, 3848-3858. https://doi.org/10.1109/tits.2019.2935152 |
[8] | Jiang, J., Han, C., Zhao, W.X. and Wang, J. (2023) Pdformer: Propagation Delay-Aware Dynamic Long-Range Transformer for Traffic Flow Prediction. Proceedings of the AAAI Conference on Artificial Intelligence, 37, 4365-4373. https://doi.org/10.1609/aaai.v37i4.25556 |
[9] | Ji, J., Wang, J., Huang, C., Wu, J., Xu, B., Wu, Z., et al. (2023) Spatio-temporal Self-Supervised Learning for Traffic Flow Prediction. Proceedings of the AAAI Conference on Artificial Intelligence, 37, 4356-4364. https://doi.org/10.1609/aaai.v37i4.25555 |
[10] | Ren, Y., Chen, Y., Liu, S., et al. (2024) TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Models. |
[11] | Zhang, S., Fu, D., Liang, W., Zhang, Z., Yu, B., Cai, P., et al. (2024) Traffic GPT: Viewing, Processing and Interacting with Traffic Foundation Models. Transport Policy, 150, 95-105. https://doi.org/10.1016/j.tranpol.2024.03.006 |
[12] | Guo, X., Zhang, Q., Jiang, J., Peng, M., Zhu, M. and Yang, H.F. (2024) Towards Explainable Traffic Flow Prediction with Large Language Models. Communications in Transportation Research, 4, Article 100150. https://doi.org/10.1016/j.commtr.2024.100150 |
[13] | Bi, X., Chen, D., Chen, G., et al. (2024) DeepSeek LLM: Scaling Open-Source Language Models with Long Termism. |
[14] | Liu, A., Feng, B., Xue, B., et al. (2024) DeepSeek-v3 Technical Report. |
[15] | Guo, D., Yang, D., Zhang, H., et al. (2025) DeepSeek-r1: Incentivizing Reasoning Capability in LLMS via Reinforcement Learning. |
[16] | Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780. https://doi.org/10.1162/neco.1997.9.8.1735 |
[17] | Yu, B., Yin, H. and Zhu, Z. (2018) Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, Sweden, 13-19 July 2018, 3634-3640. https://doi.org/10.24963/ijcai.2018/505 |
[18] | Guo, S., Lin, Y., Feng, N., Song, C. and Wan, H. (2019) Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 33, 922-929. https://doi.org/10.1609/aaai.v33i01.3301922 |
[19] | Wu, Z., Pan, S., Long, G., Jiang, J. and Zhang, C. (2019) Graph Wavenet for Deep Spatial-Temporal Graph Modeling. Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, Macao, 10-16 August 2019, 1907-1913. https://doi.org/10.24963/ijcai.2019/264 |
[20] | Bai, L., Yao, L., Li, C., et al. (2020) Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting. Advances in Neural Information Processing Systems, 33, 17804-17815. |
[21] | Zeng, Y., Fu, J. and Chao, H. (2020) Learning Joint Spatial-Temporal Transformations for Video Inpainting. In: Lecture Notes in Computer Science, Springer, 528-543. https://doi.org/10.1007/978-3-030-58517-4_31 |
[22] | Lan, S., Ma, Y., Huang, W., et al. (2022) Dstagnn: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting. International Conference on Machine Learning, Baltimore, 17-23 July 2022, 11906-11917. |