To improve the efficiency of air quality analysis and the accuracy of predictions, this paper proposes a composite method based on Vector Autoregressive (VAR) and Random Forest (RF) models. In the theoretical section, the model introduction and estimation algorithms are provided. In the empirical analysis section, global air quality data from 2022 to 2024 are used, and the proposed method is applied. Specifically, principal component analysis (PCA) is first conducted, and then VAR and Random Forest methods are used for prediction on the reduced-dimensional data. The results show that the RMSE of the hybrid model is 45.27, significantly lower than the 49.11 of the VAR model alone, verifying its superiority. The stability and predictive performance of the model are effectively enhanced.
References
[1]
World Health Organization (2021) Global Air Quality Guidelines. WHO Publications.
[2]
Liu, J., He, C., Si, Y., Li, B., Wu, Q., Ni, J., et al. (2024) Toward Better and Healthier Air Quality: Global PM2.5 and O3 Pollution Status and Risk Assessment Based on the New WHO Air Quality Guidelines for 2021. Global Challenges, 8, Article ID: 2300258. https://doi.org/10.1002/gch2.202300258
[3]
Sims, C.A. (1980) Macroeconomics and Reality. Econometrica, 48, 1-48. https://doi.org/10.2307/1912017
[4]
Kou, L., Liao, J., Li, X., et al. (2022) Climate Change Prediction in Canada Based on VAR Model. ComputerandModernization, 10, 13-18.
[5]
Breiman, L. (2001) Random Forests. MachineLearning, 45, 5-32. https://doi.org/10.1023/a:1010933404324
[6]
Qiu, W. and Lu, D. (2019) Analysis of Factors Affecting Agricultural Carbon Emission Based on VAR Model and Its Dynamic Response Mechanism. Hubei AgriculturalSciences, 58, 271-276.
[7]
Hotelling, H. (1933) Analysis of a Complex of Statistical Variables into Principal Components. JournalofEducationalPsychology, 24, 417-441. https://doi.org/10.1037/h0071325
[8]
Liu, H. and Zhang, H. (2019) Atmospheric Environmental Quality Evaluation of City Based on Principal Component Analysis. ChinaResourceComprehensiveUtilization, 37, 141-143. (In Chinese)
Richards, L.E. and Jolliffe, I.T. (1988) Book Review: Principal Component Analysis. JournalofMarketingResearch, 25, 410-410. https://doi.org/10.1177/002224378802500410
[11]
Hou, P.S., Fadzil, L.M., Manickam, S. and Al-Shareeda, M.A. (2023) Vector Autoregression Model-Based Forecasting of Reference Evapotranspiration in Malaysia. Sustainability, 15, Article No. 3675. https://doi.org/10.3390/su15043675
[12]
Nachouki, M., Mohamed, E.A., Mehdi, R. and Abou Naaj, M. (2023) Student Course Grade Prediction Using the Random Forest Algorithm: Analysis of Predictors’ Importance. TrendsinNeuroscienceandEducation, 33, Article ID: 100214. https://doi.org/10.1016/j.tine.2023.100214
[13]
Lin, J. and He, J. (2022) Parallel Random Forest Prediction Algorithm Based on PCA Stratified Sampling in the Big Data Environment. China Management Informationization, 25, 172-176. (In Chinese)
[14]
Huang, S. and Zhang, Z. (2023) Study on the Dynamic Relationship between Energy Consumption and Environmental Pollution in Chongqing City: Empirical Analysis Based on VAR Model. China-ArabStatesScienceandTechnologyForum (ChineseandEnglish), 2023, 23-27. (In Chinese)
[15]
Wei, X. (2023) Treatment and Application of Outlier in VAR Model. ScienceandTechnologyandEconomy, 36, 101-105. (In Chinese) https://doi.org/10.14059/j.cnki.cn32-1276n.2023.06.021