Background: Osteoporosis is a serious health issue that can lead to severe clinical diseases, involving fractures. The resulting fracture can be a major risk factor for disability or even death for the elderly. A well-timed diagnosis of osteoporosis disease can help identify and prevent such fractures and improve patient outcomes. Objective: The aim of this study is to explore a novel hybrid approach for the characterization of osteoporotic patients into different subtypes, leading to enhanced classification of the condition. Methods: We examined a cohort of 10,000 patients based on nationwide chronic disease data in Germany, which included 1293 osteoporotic patients. We included various medical variables such as chronic kidney disease, cancer, stroke, hypertension, and diabetes. We deployed a hybrid approach that used HDBSCAN clustering to stratify patients into distinct subtypes. We constructed the predictive models for each subtype using seven different classification methods. Results: We identified seven distinct subtypes, each linked with different conditions such as cancer, cardiovascular diseases, and chronic obstructive pulmonary disease (COPD). Logistic Regression showed the highest subtype-level prediction performance, reaching an accuracy score of 87.8% compared to other predictive models based on the original dataset without clustering. Unsupervised learning approach improved prediction using all classification methods, emphasizing the impact of deploying subtype analysis to complex data. Conclusion: This research revealed that deploying a hybrid methodology is important for the discovery of patient subtypes and for making predictions more precise. The choice of the methods in this research was critical in ensuring robust prediction performance. The predictive model is vital for finding patients at high risk for osteoporosis disease and enabling early intervention and prevention strategies. This approach holds potential for the study of other complex clinical diseases using any data source.
References
[1]
Kemmak, A.R., Rezapour, A., Jahangiri, R., Nikjoo, S., Farabi, H. and Soleimanpour, S. (2020) Economic Burden of Osteoporosis in the World: A Systematic Review. Medical Journal of the Islamic Republic of Iran, 34, 154.
[2]
Aydin Ozturk, P., Arac, E., Ozturk, U. and Arac, S. (2021) Estimation of Bone Mineral Density with Hounsfield Unit Measurement. British Journal of Neurosurgery, 38, 464-467. https://doi.org/10.1080/02688697.2021.1888877
[3]
Lu, B. and Zhang, L. (2023) Association of a History of Gestational Diabetes Mellitus with Osteoporosis, Bone Mineral Density, and Trabecular Bone Score in Postmenopausal Women. Diabetology & Metabolic Syndrome, 15, Article No. 215. https://doi.org/10.1186/s13098-023-01194-8
[4]
Wu, Y., Chao, J., Bao, M. and Zhang, N. (2023) Predictive Value of Machine Learning on Fracture Risk in Osteoporosis: A Systematic Review and Meta-Analysis. BMJ Open, 13, e071430. https://doi.org/10.1136/bmjopen-2022-071430
[5]
Kong, S.H. (2024) Sex/Gender Differences in Osteoporosis. In: Sex/Gender-Specific Medicine in Clinical Areas, Springer, 277-286. https://doi.org/10.1007/978-981-97-0130-8_13
[6]
Xu, P., Ge, J., Jiang, H., Lin, Y., Ye, Y., Huang, X., et al. (2023) Gastrointestinal Disease Is an Important Influencing Factor of Osteoporosis Fracture: A Retrospective Study in Chinese Postmenopausal Women. BMC Musculoskeletal Disorders, 24, Article No. 659. https://doi.org/10.1186/s12891-023-06765-4
[7]
Rahim, F., Zaki Zadeh, A., Javanmardi, P., Emmanuel Komolafe, T., Khalafi, M., Arjomandi, A., et al. (2023) Machine Learning Algorithms for Diagnosis of Hip Bone Osteoporosis: A Systematic Review and Meta-Analysis Study. BioMedical Engineering Online, 22, Article No. 68. https://doi.org/10.1186/s12938-023-01132-9
[8]
Albuquerque, G.A., Carvalho, D.D.A., Cruz, A.S., Santos, J.P.Q., Machado, G.M., Gendriz, I.S., et al. (2023) Osteoporosis Screening Using Machine Learning and Electromagnetic Waves. Scientific Reports, 13, Article No. 12865. https://doi.org/10.1038/s41598-023-40104-w
[9]
Xiao, P.-L., Cui, A.-Y., Hsu, C.-J., Peng, R., Jiang, N., et al. (2022) Global, Regional Prevalence, and Risk Factors of Osteoporosis According to the World Health Organization Diagnostic Criteria: A Systematic Review and Meta-Analysis. Osteoporosis International, 33, 2137-2153. https://doi.org/10.1007/s00198-022-06454-3
[10]
Ramchand, S.K. and Leder, B.Z. (2023) Sequential Therapy for the Long-Term Treatment of Postmenopausal Osteoporosis. The Journal of Clinical Endocrinology & Metabolism, 109, 303-311. https://doi.org/10.1210/clinem/dgad496
[11]
Al-Moaibed, G., Al Hamam, N., Alfayez, E., Alfayez, E., Al-Mubaddil, M. and Alramadhan, N. (2020) Prevalence and Risk Factors for Osteoporotic Fracture among Adults with Comorbidities in Al-Ahsaa, Saudi Arabia. Journal of Family Medicine and Primary Care, 9, 877-882. https://doi.org/10.4103/jfmpc.jfmpc_982_19
[12]
Sing, C.W., Lin, T.C., Bartholomew, S., Bell, J.S., Bennett, C., Beyene, K., Bosco‐Levy, P., Bradbury, B.D., Chan, A.H.Y., Chandran, M. and Cooper, C. (2023) Global Epidemiology of Hip Fractures: Secular Trends in Incidence Rate, Post-Fracture Treatment, and All-Cause Mortality. Journal of Bone and Mineral Research, 38, 1064-1075.
[13]
Moayyeri, A., Warden, J., Han, S., Suh, H.S., Pinedo-Villanueva, R., Harvey, N.C., etal. (2023) Estimating the Economic Burden of Osteoporotic Fractures in a Multinational Study: A Real-World Data Perspective. Osteoporosis International, 34, 2121-2132. https://doi.org/10.1007/s00198-023-06895-4
[14]
Tu, J., Liao, W., Liu, W. and Gao, X. (2024) Using Machine Learning Techniques to Predict the Risk of Osteoporosis Based on Nationwide Chronic Disease Data. Scientific Reports, 14, Article No. 5245. https://doi.org/10.1038/s41598-024-56114-1
[15]
Garg, U., Shukla, A.K., Singh, H., Sharma, N. and Kaur, E.S. (2020) Use of Machine Learning in the Pattern Finding. International Journal of Recent Technology and Engineering, 9, 527-531. https://doi.org/10.35940/ijrte.a1237.059120
[16]
Khanna, V.V., Chadaga, K., Sampathila, N., Chadaga, R., Prabhu, S., K S, S., et al. (2023) A Decision Support System for Osteoporosis Risk Prediction Using Machine Learning and Explainable Artificial Intelligence. Heliyon, 9, e22456. https://doi.org/10.1016/j.heliyon.2023.e22456
[17]
Nam, K.H., Seo, I., Kim, D.H., Lee, J.I., Choi, B.K. and Han, I.H. (2019) Machine Learning Model to Predict Osteoporotic Spine with Hounsfield Units on Lumbar Computed Tomography. Journal of Korean Neurosurgical Society, 62, 442-449. https://doi.org/10.3340/jkns.2018.0178
[18]
Huang, C., Hu, J., Tan, K., Zhang, W., Xu, T. and Yang, L. (2022) Application of Machine Learning Model to Predict Osteoporosis Based on Abdominal Computed Tomography Images of the Psoas Muscle: A Retrospective Study. BMC Geriatrics, 22, Article No. 796. https://doi.org/10.1186/s12877-022-03502-9
[19]
Qiu, C., Su, K., Luo, Z., Tian, Q., Zhao, L., Wu, L., et al. (2024) Developing and Comparing Deep Learning and Machine Learning Algorithms for Osteoporosis Risk Prediction. Frontiers in Artificial Intelligence, 7, Article 1355287. https://doi.org/10.3389/frai.2024.1355287
[20]
Lin, Y., Chu, C., Hung, K., Lu, C., Bednarczyk, E.M. and Chen, H. (2022) Can Machine Learning Predict Pharmacotherapy Outcomes? An Application Study in Osteoporosis. Computer Methods and Programs in Biomedicine, 225, Article 107028. https://doi.org/10.1016/j.cmpb.2022.107028
[21]
Irmawati, I., Herdit Juningsih, E. and Yanto, Y. (2024) Predictive Modeling of Osteoporosis Risk Factors Using XGBoost and Bagging Ensemble Technique. Journal Medical Informatics Technology, 2, 6-10. https://doi.org/10.37034/medinftech.v2i1.27
[22]
Alyousef, A.A., Nihtyanova, S., Denton, C., Bosoni, P., Bellazzi, R. and Tucker, A. (2018) Nearest Consensus Clustering Classification to Identify Subclasses and Predict Disease. Journal of Healthcare Informatics Research, 2, 402-422. https://doi.org/10.1007/s41666-018-0029-6
[23]
Zaman, S.M.M., Qureshi, W.M., Raihan, M.M.S., Shams, A.B. and Sultana, S. (2021) Survival Prediction of Heart Failure Patients Using Stacked Ensemble Machine Learning Algorithm. 2021 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE), Dhaka, 4-5 December 2021, 117-120. https://doi.org/10.1109/wiecon-ece54711.2021.9829577
[24]
Karel, K. (2016) Data from: Prevalence of Chronic Diseases among Older Patients in German General Practices. https://datadryad.org/stash/dataset/doi:10.5061/dryad.qh0h1
[25]
Huang, Y. and Ye, J. (2024) Association between Hypertension and Osteoporosis: A Population-Based Cross-Sectional Study. BMC Musculoskeletal Disorders, 25, Article No. 434. https://doi.org/10.1186/s12891-024-07553-4