全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

基于机器学习方法的早期糖尿病风险预测
Machine Learning-Based Approach to Early Diabetes Risk Prediction

DOI: 10.12677/SA.2023.124101, PP. 974-984

Keywords: 朴素贝叶斯,决策树,随机森林,逻辑斯蒂回归,R语言
Naive Bayes
, Decision Trees, Random Forest, Logistic Regression, R Language

Full-Text   Cite this paper   Add to My Lib

Abstract:

糖尿病疾病是一个日益严重的医学问题,它是一种代谢疾病,身体内的葡萄糖长期处于一个高水平的状态,会产生尿频、口渴、饥饿程度加剧等症状,从而导致肾衰竭、中风、视力受损等并发症的产生。糖尿病的识别往往是病人询问医生或者是到诊断中心询问,会使诊断过程过于繁琐。但是逐步上升的机器学习方法解决了这一问题。本次研究的目的是采用机器学习方法,预测患者患糖尿病的可能性。因此采用四个机器学习分类算法,即朴素贝叶斯、决策树、随机森林及逻辑斯蒂回归,来检测早期糖尿病。实验采用的是UCI机器学习库中,从孟加拉国锡尔赫特的锡尔赫特医院患者那里收集的直接问卷。这四个算法的性能评估采用准确率来进行评估。实验显示随机森林的精度优于其他算法,达到了98.07%。
Diabetic disease is a growing medical problem. It is a metabolic disease in which glucose in the body remains at a high level for a long time, producing symptoms such as frequent urination, thirst and increased hunger levels, which can lead to complications such as kidney failure, stroke and impaired vision. Diabetes is often identified by the patient asking a doctor or visiting a diagnostic centre, which can make the diagnosis process too cumbersome. But progressively increasing machine learning methods solve this problem. The aim of this study was to use machine learning methods to predict the likelihood of a patient developing diabetes. Four machine learning classification algorithms, namely, plain Bayesian, decision tree, random forest and logistic regression, were therefore used to detect early diabetes. The experiments were conducted using direct questionnaires collected from patients at Sylhet Hospital, Sylhet, Bangladesh, from the UCI Machine Learning Library. The performance of these four algorithms was evaluated using accuracy. The experiments showed that Random Forest outperformed the other algorithms with an accuracy of 98.07%.

References

[1]  Kitabchi, A.E., Umpierrez, G.E., Miles, J.M. and Fisher, J.N. (2009) Hyperglycemic Crises in Adult Patients with Diabetes. Diabetes Care, 32, 1335-1343.
https://doi.org/10.2337/dc09-9032
[2]  World Health Organization (2023) World Health Statistics 2023: Monitoring Health for the SDGs, Sustainable Development Goals. Geneva.
[3]  Chandalia, H.B. and Tripathy, B.E. (2023) RSSDI Textbook of Diabetes Mellitus.
[4]  Vijayan, V.V. and Anjali, C. (2016) Prediction and Diagnosis of Diabetes Mellitus—A Machine Learning Approach. 2015 IEEE Recent Advances in Intelligent Computational Systems (RAICS), Trivandrum, 10-12 December 2015, 122-127.
https://doi.org/10.1109/RAICS.2015.7488400
[5]  Huang, S., Cai, N., Pacheco, P.P., et al. (2018) Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genomics & Proteomics, 15, 41-51.
https://doi.org/10.21873/cgp.20063
[6]  Boyles, S., Fajardo, D. and Waller, S.T. (2007) Naive Bayesian Classifier for Incident Duration Prediction. Transportation Research Board 86th Annual Meeting, Washington DC, 21-25 January 2007.
[7]  Song, Y.Y. and Lu, Y. (2015) Decision Tree Methods: Applications for Classification and Prediction. Shanghai Archives of Psychiatry, 27, 130-135.
[8]  La Valley, M.P. (2008) Logistic Regression. Circulation, 117, 2395-2399.
https://doi.org/10.1161/CIRCULATIONAHA.106.682658
[9]  Bishop, C.M. (1994) Neural Networks and Their Applications. Review of Scientific Instruments, 65, 1803-1832.
https://doi.org/10.1063/1.1144830
[10]  Bai, B.G.M., Nalini, B.M. and Majumdar, J. (2019) Analysis and Detection of Diabetes Using Data Mining Techniques—A Big Data Application in Health Care. Conference Proceedings Emerging Research in Computing, Information, Communication and Applications, Vol. 1, 443-455.
[11]  Fatima, M. and Pasha, M. (2017) Survey of Machine Learning Algorithms for Disease Diagnostic. Journal of Intelligent Learning Systems and Applications, 9, 1-16.
https://doi.org/10.4236/jilsa.2017.91001
[12]  Nai-Arun, N. and Sittidech, P. (2014) Ensemble Learning Model for Diabetes Classification. Advanced Materials Research, 931-932, 1427-1431.
https://doi.org/10.4028/www.scientific.net/AMR.931-932.1427
[13]  Ying, W.X., Hong, H.S., Quan, L.Z., et al. (2009) The Predictive Role of Diabetes Mellitus with Erectile Dysfunction on Cardiovascular Risk. Chinese Circulation Journal.
[14]  Pradhan, M. and Bamnote, G.R. (2015) Design of Classifier for Detection of Diabetes Mellitus Using Genetic Programming. Advances in Intelligent Systems & Computing, 327, 763-770.
https://doi.org/10.1007/978-3-319-11933-5_86
[15]  Nai-Arun, N. and Moungmai, R. (2015) Comparison of Classifiers for the Risk of Diabetes Prediction. Procedia Computer Science, 69, 132-142.
https://doi.org/10.1016/j.procs.2015.10.014
[16]  Vijiya Kumar, K., Lavanya, B., Nirmala, I. and Sofia Caroline, S. (2019) Random Forest Algorithm for the Prediction of Diabetes. 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN), Pondicherry, 29-30 March 2019, 1-5.
https://doi.org/10.1109/ICSCAN.2019.8878802
[17]  Sisodia, D. and Sisodia, D.S. (2018) Prediction of Diabetes Using Classification Algorithms. Procedia Computer Science, 132, 1578-1585.
https://doi.org/10.1016/j.procs.2018.05.122

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133