%0 Journal Article %T 面向不平衡甲状腺眼病数据集的分类算法对比研究及应用
Comparative Study and Application of Classification Algorithms for Unbalanced Thyroid Eye Disease Datasets %A 张天凤 %A 赵廉 %A 邓浩然 %A 朱沁汶 %A 陈钱 %A 宋诗淳 %A 宋雪霏 %A 周雷 %J Software Engineering and Applications %P 495-505 %@ 2325-2278 %D 2023 %I Hans Publishing %R 10.12677/SEA.2023.123049 %X 对不同数据进行分类是机器学习的研究热点,然而在各大领域,数据不平衡现象是普遍存在的。现有的许多机器学习算法虽然取得了良好的效果,但他们都是在默认数据集分布均衡的前提下进行的,并且认为不同类别的误分代价一致,这导致它们在不平衡数据集上表现很差。本文针对用于甲状腺眼病诊断的数据集出现的正负样本不平衡现象,选择了WCE loss,LDAM-loss,Focal-loss,Minimax四种面向不平衡数据的优化方法进行了对比实验。实验结果表明,用不平衡优化方法训练的分类模型相对于原始模型具有更好的分类性能。实验还发现随正负样本比例的不同,各方法对结果的提升存在一定差异,在重度不平衡条件下,LDAM loss和Minimax表现出更好的鲁棒性,尤其是Minimax方法,它对于少数类的分类性能更好。总结而言,本论文所展示的对比实验能在不平衡甲状腺眼病诊断数据的条件下,对分类算法的选取提供指导。
Data classification is a prominent area of machine learning, but data imbalance is a common issue across major fields. Although many machine learning algorithms have produced favorable outcomes, they rely on the assumption that the default dataset is uniformly distributed and the cost of false separation for different categories is consistent. Consequently, they exhibit poor performance on unbalanced datasets. In this study, four optimization methods were selected to address the issue of unbalanced data in the diagnosis of thyroid eye disease. These methods, including WCE loss, LDAM-loss, Focal-loss, and Minimax, were used to compare the positive and negative sample imbalance in the dataset. The experimental results demonstrate that the unbalanced optimization method produced a better classification performance than the original model. Furthermore, the experiments revealed that the improvement of results varied with the proportion of positive and negative samples, with LDAM loss and Minimax exhibiting better robustness under severe imbalance conditions. The Minimax method, in particular, demonstrated superior classification performance for minority classes. In conclusion, the comparative experiment presented in this study can offer valuable insights for the selection of classification algorithms under the condition of unbalanced thyroid eye disease diagnostic data. %K 不平衡数据,机器学习,甲状腺相关眼病,分类
Unbalanced Data %K Machine Learning %K Thyroid-Associated Ophthalmopathy %K Classification %U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=67515