全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

Optimization of Malaria Diagnosis by Machine Learning According to the CRISP-DM Model Applied to the University Teaching Hospital Clinics of Lubumbashi (DRC)

DOI: 10.4236/oalib.1114143, PP. 1-23

Subject Areas: Artificial Intelligence

Keywords: Malaria, Machine Learning, Artificial Intelligence, CRISP-DM, Expert System, Medical Diagnosis, Public Health, DRC, Lubumbashi

Full-Text   Cite this paper   Add to My Lib

Abstract

Malaria remains a major public health challenge in the Democratic Republic of Congo (DRC), particularly in Lubumbashi, where traditional diagnostic methods are struggling to meet growing demand. The study was conducted at the University Clinics of Lubumbashi (UCL), the teaching hospital affiliated with the University of Lubumbashi. This work proposes an expert system based on artificial intelligence (AI) and the Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology to optimize malaria diagnosis in this setting. By leveraging a decision tree classifier trained on local clinical data, the system achieved an accuracy of 90.4%, a recall of 88%, and a specificity of 92%. The results demonstrate a substantial improvement in the speed and reliability of diagnosis, providing a transparent and interpretable decision-support tool suitable for resource-limited healthcare environments.

Cite this paper

Mazunze, B. , Vicky, L. M. , Franck, K. N. , Pierre-Stéphane, M. M. , Patrice, K. M. E. , Desiré, K. D. and Eddy, M. S. (2025). Optimization of Malaria Diagnosis by Machine Learning According to the CRISP-DM Model Applied to the University Teaching Hospital Clinics of Lubumbashi (DRC). Open Access Library Journal, 12, e14143. doi: http://dx.doi.org/10.4236/oalib.1114143.

References

[1]  Daneluzzo, L., Daneluzzo, M., Thellier, F., et al. (2025) “Severe Imported Malaria in Children in Metropolitan France, 2011-2023.” Medecine et Maladies Infectieuses, 4, S24.
[2]  Mbanefo, A. and Kumar, N. (2020) Evaluation of Malaria Diagnostic Methods as a Key for Successful Control and Elimination Programs. Tropical Medicine and Infectious Disease, 5, Article 102. https://doi.org/10.3390/tropicalmed5020102
[3]  World Health Organization (2025) World Malaria Report 2024. https://www.who.int/teams/global-malaria-programme/reports/world-malaria-report-2024
[4]  World Health Organization (2025) World Malaria Report 2023. https://www.who.int/teams/global-malaria-programme/reports/world-malaria-report-2023
[5]  Acherar, I.A. (2024) Detection and Identification of Plasmodium falciparum on Microscopic Images. PhD Thesis, Sorbonne University. https://theses.hal.science/tel-04698420/
[6]  Rajaraman, S., Antani, S.K., Poostchi, M., Silamut, K., Hossain, M.A., Maude, R.J., et al. (2018) Pre-Trained Convolutional Neural Networks as Feature Extractors toward Improved Malaria Parasite De-tection in Thin Blood Smear Images. Peer Journal, 6, e4568. https://doi.org/10.7717/peerj.4568
[7]  Liang, H.Y., Tsui, B.Y., Ni, H., Valentim, S., et al. (2019) Evaluation and Accurate Diagnoses of Pediatric Diseases Using Artificial Intelligence. Nature Medicine, 25, 433-438.
[8]  Kermany, D.S., Goldbaum, M., Cai, W.J., et al. (2018) Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning. Cell, 172, 1122-1131.
[9]  Nsoesie, E.O., Buckeridge, D.L. and Brownstein, J.S. (2014) Guess Who’s Not Coming to Dinner? Evaluating Online Restaurant Reservations for Disease Surveil-lance. Journal of Medical Internet Research, 16, e22. https://doi.org/10.2196/jmir.2998
[10]  Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., et al. (2011) Scikit-Learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825-2830.
[11]  Shearer, C. (2000) The CRISP-DM Model: The New Blueprint for Data Mining. Journal of Data Ware-hous, 5, 13-22.
[12]  Marbn, S., Mariscal, G. and Segovi, J. (2009) A Data Mining & Knowledge Discovery Process Model. In: Data Mining and Knowledge Discovery in Real Life Applications, I-Tech Education and Publishing. https://doi.org/10.5772/6438
[13]  Garcia-Rios, V., Marres-Salhuana, M., Sierra-Liñan, F. and Cabanillas-Carbonell, M. (2023) Predictive Machine Learning Applying Cross Industry Standard Process for Data Mining for the Diagnosis of Diabe-tes Mellitus Type 2. IAES International Journal of Artificial Intelligence, 12, Article 1713. https://doi.org/10.11591/ijai.v12.i4.pp1713-1726
[14]  Amato, F., López, A., Peña-Méndez, E.M., Vaňhara, P., Hampl, A. and Havel, J. (2013) Artificial Neural Networks in Medical Diagnosis. Journal of Applied Biomedicine, 11, 47-58. https://doi.org/10.2478/v10136-012-0031-x
[15]  Chapman, P. (2000) CRISP-DM .0: Step-by-Step Data Mining Guide”. SPSS Inc.
[16]  Jakobsen, J.C., Gluud, C., Wetterslev, J. and Winkel, P. (2017) When and How Should Multiple Imputation Be Used for Handling Missing Data in Randomized Clinical Trials—A Practical Guide with Flowcharts. BMC Medical Research Methodology, 17, Article No. 162.
[17]  Tan, P.-N., Steinbach, M. and Kumar, V. (2016) Introduction to Data Mining. Pear-son Education India.
[18]  Kursa, M.B. and Rudnicki, W.R. (2010) Feature Selection with the Boruta Package. Journal of Statistical Software, 36, 1-13. https://doi.org/10.18637/jss.v036.i11
[19]  Altman, D.G. and Bland, J.M. (1994) Statistics Notes: Diagnostic Tests 1: Sensitivity and Specificity. British Medical Journal, 308, Article 1552. https://doi.org/10.1136/bmj.308.6943.1552
[20]  Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F. and Pedreschi, D. (2019) A Survey of Methods for Explaining Black Box Models. ACM Computing Surveys, 51, 1-42. https://doi.org/10.1145/3236009
[21]  Shortliffe, E.H. and Sepúlveda, M.J. (2018) Clinical Decision Support in the Era of Artificial Intelligence. Journal of the American Medical Association, 320, 2199-2200. https://doi.org/10.1001/jama.2018.17163
[22]  Gambetti, A., Han, Q., Shen, H. and Soares, C. (2025) A Survey on Hu-man-Centered Evaluation of Explainable AI Methods in Clinical Decision Support Systems.
[23]  Nahm, F.S. (2022) Receiv-er Operating Characteristic Curve: Overview and Practical Use for Clinicians. Korean Journal of Anesthesiology, 75, 25-36. https://doi.org/10.4097/kja.21209
[24]  Lasko, T.A., Bhagwat, J.G., Zou, K.H. and Ohno-Machado, L. (2005) The Use of Receiver Operating Characteristic Curves in Biomedical Informatics. Journal of Biomedical Informatics, 38, 404-415. https://doi.org/10.1016/j.jbi.2005.02.008
[25]  Hanley, J.A. and McNeil, B.J. (1982) “The Meaning and Use of the Area under a Receiver Operating Characteristic (ROC) Curve.” Radiology, 143, 29-36.
[26]  Corbacioglu, S.K. and Aksel, G. (2023) Receiver Operating Characteristic Curve Analysis in Diagnostic Accuracy Studies: A Guide to Interpreting the Area under the Curve Value. Turkish Journal of Emergency Medicine, 23, 195-198. https://doi.org/10.4103/tjem.tjem_182_23
[27]  Sathyanarayanan, S. and Tantri, B.R. (2024) Confusion Matrix-Based Performance Evaluation Metrics. African Journal of Biomedical Research, 27, 4023-4031.
[28]  Lasko, T.A., Bhagwat, J.G., Zou, K.H. and Ohno-Machado, L. (2005) The Use of Receiver Operating Characteristic Curves in Biomedical Informatics. Journal of Biomedical Informatics, 38, 404-415. https://doi.org/10.1016/j.jbi.2005.02.008
[29]  Flach, P. (2012) Machine Learning: The Art and Science of Algorithms That Make Sense of Data. Cambridge University Press. https://books.google.com/books?hl=fr&lr=&id=Ofp4h_oXsZ4C& oi=fnd&pg=PR15&dq=P. Flach, *Machine Learning: The Art and Sc ience of Algorithms that Make Sense of Data*, Cambridge University Press, 2012.&ots=XMtWjqdpSM&sig=ofDk26bPb6y_EDEqgA1Zit2euNo
[30]  Davis, J. and Goadrich, M. (2006) The Relationship between Preci-sion-Recall and ROC Curves. Proceedings of the 23rd International Conference on Machine Learning, Pittsburgh, 25-29 June 2006, 233-240. https://doi.org/10.1145/1143844.1143874
[31]  Sokolova, M. and Lapalme, G. (2009) A System-atic Analysis of Performance Measures for Classification Tasks. Information Processing & Management, 45, 427-437. https://doi.org/10.1016/j.ipm.2009.03.002
[32]  Bent, B., Goldstein, B.A., Kibbe, W.A. and Dunn, J.P. (2020) Investigat-ing Sources of Inaccuracy in Wearable Optical Heart Rate Sensors. NPJ Digital Medicine, 3, Article 18. https://doi.org/10.1038/s41746-020-0226-6

Full-Text


Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133