|
基于机器学习的帕金森病进展预测
|
Abstract:
帕金森病作为一种慢性神经系统疾病严重影响中老年人群的生活质量,其运动功能评分(motor-UPDRS)对于评估患者疾病严重程度和治疗效果至关重要。本文基于UCI的帕金森病远程语音数据集,首先采用lasso变量选择方法筛选出影响运动功能评分的15个重要特征,分别构建高斯过程回归、支持向量回归、随机森林和XGBoost四种机器学习模型。经过训练、贝叶斯调参优化和性能评估,发现XGBoost模型在预测帕金森病患者的运动功能评分方面表现最优,模型的RMSE、MAE和R2分别为1.18,0.63和0.98。最后对XGBoost模型的特征重要性进行分析,探寻影响帕金森病预测的关键特征,为帕金森病的诊疗提供更加合理的理论和技术支撑。
As a chronic neurological disease, Parkinson’s disease seriously affects the quality of middle-aged and elderly people’s life. Its motor function score (motor-UPDRS) is crucial to assess the severity of the disease and the treatment effect of patients. Based on the remote voice dataset of Parkinson’s disease from UCI, this paper first used the lasso variable selection method to screen out 15 important features that affect motor function scores, and constructed four machine learning models, namely Gaussian process regression, support vector regression, random forest and XGBoost respectively. After training, parameter tuning and performance evaluation, it was found that the XGBoost model had the best performance in predicting the motor function score of patients with Parkinson’s disease, with RMSE, MAE and R2 of the model being 1.18, 0.63 and 0.98 respectively. Finally, the feature importance of the XGBoost model was analyzed, and the key features affecting the prediction of Parkinson’s disease were explored, so as to provide more reasonable theoretical and technical support for the diagnosis and treatment of Parkinson’s disease.
[1] | Dauer, W. and Przedborski, S. (2003) Parkinson’s Disease: Mechanisms and Models. Neuron, 39, 889-909. https://doi.org/10.1016/s0896-6273(03)00568-3 |
[2] | 王岚, 孙圣刚. 帕金森病发病机制研究进展[J]. 临床内科杂志, 2006, 23(6): 365-368. |
[3] | 王祥翔, 司志华, 柴湘婷, 等. 帕金森病的临床特征评估[J]. 山东医药, 2014, 54(22): 101-103. |
[4] | Tsanas, A., Little, M.A., McSharry, P.E. and Ramig, L.O. (2010) Accurate Telemonitoring of Parkinson’s Disease Progression by Noninvasive Speech Tests. IEEE Transactions on Biomedical Engineering, 57, 884-893. https://doi.org/10.1109/tbme.2009.2036000 |
[5] | Asgari, M. and Shafran, I. (2010) Predicting Severity of Parkinson’s Disease from Speech. 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Buenos Aires, 31 August-4 September 2010, 5201-5204. https://doi.org/10.1109/iembs.2010.5626104 |
[6] | Eskidere, Ö., Ertaş, F. and Hanilçi, C. (2012) A Comparison of Regression Methods for Remote Tracking of Parkinson’s Disease Progression. Expert Systems with Applications, 39, 5523-5528. https://doi.org/10.1016/j.eswa.2011.11.067 |
[7] | Nilashi, M., Ibrahim, O. and Ahani, A. (2016) Accuracy Improvement for Predicting Parkinson’s Disease Progression. Scientific Reports, 6, Article No. 34181. https://doi.org/10.1038/srep34181 |
[8] | Nilashi, M., Ibrahim, O., Ahmadi, H., Shahmoradi, L. and Farahmand, M. (2018) A Hybrid Intelligent System for the Prediction of Parkinson’s Disease Progression Using Machine Learning Techniques. Biocybernetics and Biomedical Engineering, 38, 1-15. https://doi.org/10.1016/j.bbe.2017.09.002 |
[9] | Shahid, A.H. and Singh, M.P. (2020) A Deep Learning Approach for Prediction of Parkinson’s Disease Progression. Biomedical Engineering Letters, 10, 227-239. https://doi.org/10.1007/s13534-020-00156-7 |
[10] | Liu, F., Ji, W. and Li, Y. (2018) Prediction of Parkinson’s Disease Based on Multi-Task Regression of Model Filtering. Journal of Computer Applications, 38, 3221-3224. |
[11] | Anter, A.M., Mohamed, A.W., Zhang, M. and Zhang, Z. (2023) A Robust Intelligence Regression Model for Monitoring Parkinson’s Disease Based on Speech Signals. Future Generation Computer Systems, 147, 316-327. https://doi.org/10.1016/j.future.2023.05.012 |
[12] | 李焱, 贾雅君, 李磊, 等. 基于随机森林算法的短期电力负荷预测[J]. 电力系统保护与控制, 2020, 48(21): 117-124. |
[13] | 王露. 基于集成学习的异常数据检测及其在网络流量数据中的应用[D]: [硕士学位论文]. 武汉: 华中师范大学, 2021. |