|
基于协同训练半监督学习的干旱灾害天气预测
|
Abstract:
近年来,气象研究领域中存在大量高价值信息的无标签数据,然而对这些无标签数据进行高置信度标记是非常困难的,同时这些数据对于建立准确的气象预测模型又是十分重要的。基于此,本文研究了一种基于协同训练的半监督学习方法并用于干旱灾害天气分析预测,该方法使用重复标记策略,根据训练过程中的数据变化动态的计算置信度阈值作为约束条件,并提出了一种增强协同训练方法来评估气象领域中的无标签样本数据的置信度。为了评估所提出方法的性能进行了实验分析,结果表明,该方法的性能优于原始协同训练方法,有效地提高了分类器的分类精度,并验证了该方法用于干旱灾害天气预测的有效性和显著性。
In recent years, a large amount of unlabeled data with high-value information exists in the field of meteorological research. However, it is very difficult to mark these unlabeled data with high confidence, and these data are very important for establishing an accurate meteorological prediction model. Based on the situation, this paper studies a semi-supervised learning method based on co-training and used for drought disaster weather analysis and prediction. This method uses a repeated labeling strategy to calculate the confidence threshold dynamically according to the data changes during the training process as a constraint condition, and proposes an enhanced co-training method to evaluate the confidence of unlabeled sample data in the meteorological field. In order to evaluate the performance of the proposed method, experimental analysis is carried out. The results show that the performance of this method is better than that of the original co-training method, which effectively improves the classification accuracy of the classifier, and verifies the effectiveness and significance of this method for drought disaster weather prediction.
[1] | Mayuravaani, M. and Manivannan, S. (2021) A Semi-Supervised Deep Learning Approach for the Classification of Steel Surface Defects. 2021 10th International Conference on Information and Automation for Sustainability (ICIAfS), Negambo, 11-13 August 2021, 179-184. https://doi.org/10.1109/ICIAfS52090.2021.9606143 |
[2] | Vale, K.M.O., Canuto, A.M.P., Gorg?nio, F.L., et al. (2019) A Data Stratification Process for Instances Selection in Semi-Supervised Learning. 2019 International Joint Conference on Neural Networks (IJCNN). Budapest, 14-19 July 2019, 1-8. https://doi.org/10.1109/IJCNN.2019.8851946 |
[3] | 王宇, 李延晖. 一种基于协同训练半监督的分类算法[J]. 华中师范大学学报(自然科学版), 2021, 55(6): 1020-1029. |
[4] | Li, Z., Hong, Z. and Zheng, H. (2021) Aggressive Growing Mixup: A Faster and Better Semi-Supervised Learning Approach. 2021 IEEE 3rd International Conference on Frontiers Technology of Information and Computer (ICFTIC), Greenville, 12-14 November 2021, 278-284. https://doi.org/10.1109/ICFTIC54370.2021.9647353 |
[5] | Li, S., Zhang, Y., Yang, F., et al. (2021) Fault Classification in Transmission Network with Semi-supervised Learning Method. 2021 IEEE PES Innovative Smart Grid Technologies Europe (ISGT Europe), Espoo, 18-21 October 2021, 1-6.
https://doi.org/10.1109/ISGTEurope52324.2021.9640006 |
[6] | Ngo, B.H., Kim, J.H., Chae, Y.J., et al. (2021) Multi-View Collaborative Learning for Semi-Supervised Domain Adaptation. IEEE Access, 9, 166488-166501. https://doi.org/10.1109/ACCESS.2021.3136567 |
[7] | Gao, L., Shah, S., Assery, N., et al. (2021) Semi-Supervised Self Training to Assess the Credibility of Tweets. 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), New York City, 30 September 2021-3 October 2021, 1532-1537.
https://doi.org/10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00206 |
[8] | Vale, K.M.O., Gorg?nio, F.L., Araújo, Y.N., et al. (2020) A Co-Training-Based Algorithm Using Confidence Values to Select Instances. 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, 19-24 July 2020, 1-7.
https://doi.org/10.1109/IJCNN48605.2020.9206621 |
[9] | Lu, J. and Gong, Y. (2021) A Co-Training Method Based on Entropy and Multi-Criteria. Applied Intelligence, 51, 3212-3225. https://doi.org/10.1007/s10489-020-02014-6 |
[10] | Vale, K.M.O., Gorg?nio, A.C., Flavius Da Luz, E.G., et al. (2021) An Efficient Approach to Select Instances in Self-Training and Co-Training Semi-Supervised Methods. IEEE Access, 10, 7254-7276.
https://doi.org/10.1109/ACCESS.2021.3138682 |
[11] | Blum, A. and Mitchell, T. (1998) Combining Labeled and Unlabeled Data with Co-Training. Proceedings of the Eleventh Annual Conference on Computational Learning Theory, 92-100. https://doi.org/10.1145/279943.279962 |
[12] | Chawla, N.V., Bowyer, K.W., Hall, L.O., et al. (2002) SMOTE: Synthetic Minority Over-Sampling Technique. Journal of Artificial Intelligence Research, 16, 321-357. https://doi.org/10.1613/jair.953 |
[13] | Tomek, I. (1976) Two Modifications of CNN. IEEE Transactions on Systems Man & Cybernetics, SMC-6, 769-772.
https://doi.org/10.1109/TSMC.1976.4309452 |