全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

An Analysis and Prediction of Health Insurance Costs Using Machine Learning-Based Regressor Techniques

DOI: 10.4236/jdaip.2024.124031, PP. 581-596

Keywords: Medical Cost, Health Insurance, Cost Prediction, Medical Cost Personal Datasets, Machine Learning

Full-Text   Cite this paper   Add to My Lib

Abstract:

One of the most significant annual expenses that a person has is their health insurance coverage. Health insurance accounts for one-third of GDP, and everyone needs medical treatment to varying degrees. Changes in medicine, pharmaceutical trends, and political factors are only a few of the many factors that cause annual fluctuations in healthcare costs. This paper describes how a system may analyse a person’s medical history to display their insurance plans and make predictions about their health insurance premiums. The performance of four ML models—XGBoost, Lasso, KNN, and Ridge—is evaluated using R2-score and RMSE. The analysis of medical health insurance cost prediction using Lasso regression, Ridge regression, and K-Nearest Neighbours (KNN), and XGBoost (XGB) highlights notable differences in performance. KNN has the lowest R2-score of 55.21 and an RMSE of 4431.1, indicating limited predictive ability. Ridge Regression improves on this by an R2-score of 78.38 but has a higher RMSE of 4652.06. Lasso Regression slightly edges out Ridge with an R2-score of 79.78, yet it suffers from an advanced RMSE of 5671.6. In contrast, XGBoost excels with the highest R2-score of 86.81 and the lowermost RMSE of 4450.4, demonstrating superior predictive accuracy and making it the most effective model for this task. The best method for accurately predicting health insurance premiums was XGBoost Regression. The findings beneficial for policymakers, insurers, and healthcare providers as they can use this information to allocate resources more efficiently and enhance cost-effectiveness in the healthcare industry.

References

[1]  Kumar, V.V., Sahoo, A., Balasubramanian, S.K. and Gholston, S. (2024) Mitigating Healthcare Supply Chain Challenges under Disaster Conditions: A Holistic AI-Based Analysis of Social Media Data. International Journal of Production Research.
https://doi.org/10.1080/00207543.2024.2316884
[2]  Sommers, B.D. (2020) Health Insurance Coverage: What Comes after the Aca? Health Affairs, 39, 502-508.
https://doi.org/10.1377/hlthaff.2019.01416
[3]  Milovic, B. (2012) Prediction and Decision Making in Health Care Using Data Mining. International Journal of Public Health Science (IJPHS), 1, 126-136.
https://doi.org/10.11591/ijphs.v1i2.1380
[4]  Mathur, S. and Gupta, S. (2023) Classification and Detection of Automated Facial Mask to COVID-19 Based on Deep CNN Model. 2023 IEEE 7th Conference on Information and Communication Technology (CICT), Jabalpur, 15-17 December 2023, 1-6.
https://doi.org/10.1109/cict59886.2023.10455699
[5]  Morid, M.A., Kawamoto, K., Ault, T., Dorius, J. and Abdelrahman, S. (2017) Supervised Learning Methods for Predicting Healthcare Costs: Systematic Literature Review and Empirical Evaluation. AMIA Annual Symposium Proceedings, 2017, 1312-1321.
[6]  Basile, L.J., Carbonara, N., Pellegrino, R. and Panniello, U. (2023) Business Intelligence in the Healthcare Industry: The Utilization of a Data-Driven Approach to Support Clinical Decision Making. Technovation, 120, Article ID: 102482.
https://doi.org/10.1016/j.technovation.2022.102482
[7]  Ramya, D., Manigandan, S.K. and Deepa, J. (2022) Health Insurance Cost Prediction Using Machine Learning Algorithms. 2022 International Conference on Edge Computing and Applications (ICECAA), Tamilnadu, 13-15 October 2022, 1381-1384.
https://doi.org/10.1109/icecaa55415.2022.9936153
[8]  Thejeshwar, T., Sai Harsha, T., Vamsi Krishna, V. and Kaladevi, R. (2023) Medical Insurance Cost Analysis and Prediction Using Machine Learning. 2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA), Uttarakhand, 14-16 March 2023, 113-117.
https://doi.org/10.1109/icidca56705.2023.10100057
[9]  Duijvestijn, M., de Wit, G.A., van Gils, P.F. and Wendel-Vos, G.C.W. (2023) Impact of Physical Activity on Healthcare Costs: A Systematic Review. BMC Health Services Research, 23, Article No. 572.
https://doi.org/10.1186/s12913-023-09556-8
[10]  Gabriel, J. (2024) A Machine Learning-Based Web Application for Heart Disease Prediction. Intelligent Control and Automation, 15, 9-27.
https://doi.org/10.4236/ica.2024.151002
[11]  Nori, N. (2024) Machine Learning Based Virtual Screening for Biodegradable Polyesters. Journal of Materials Science and Chemical Engineering, 12, 1-11.
https://doi.org/10.4236/msce.2024.128001
[12]  Anumandla, S.K.R., Yarlagadda, V.K., Vennapusa, S.C.R. and Kothapalli, K.R.V. (2020) Unveiling the Influence of Artificial Intelligence on Resource Management and Sustainable Development: A Comprehensive Investigation. Technology & Management Review, 5, 45-65.
[13]  Yang, C., Delcher, C., Shenkman, E. and Ranka, S. (2018) Machine Learning Approaches for Predicting High Cost High Need Patient Expenditures in Health Care. BioMedical Engineering OnLine, 17, Article No. 131.
https://doi.org/10.1186/s12938-018-0568-3
[14]  Iorliam, I.B., Ikyo, B.A., Iorliam, A., Okube, E.O., Kwaghtyo, K.D. and Shehu, Y.I. (2021) Application of Machine Learning Techniques for Okra Shelf Life Prediction. Journal of Data Analysis and Information Processing, 9, 136-150.
https://doi.org/10.4236/jdaip.2021.93009
[15]  Vijayalakshmi, V., Selvakumar, A. and Panimalar, K. (2023) Implementation of Medical Insurance Price Prediction System Using Regression Algorithms. 2023 5th International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, 23-25 January 2023, 1529-1534.
https://doi.org/10.1109/icssit55814.2023.10060926
[16]  Marinova, G. and Todorova, M. (2023) Regression Analysis for Predicting Health Insurance. 2023 4th International Conference on Communications, Information, Electronic and Energy Systems (CIEES), Plovdiv, 23-25 November 2023, 1-4.
https://doi.org/10.1109/ciees58940.2023.10378755
[17]  Dutta, K., Chandra, S., Gourisaria, M.K. and GM, H. (2021) A Data Mining Based Target Regression-Oriented Approach to Modelling of Health Insurance Claims. 2021 5th International Conference on Computing Methodologies and Communication (ICCMC), Erode, 8-10 April 2021, 1168-1175.
https://doi.org/10.1109/iccmc51019.2021.9418038
[18]  Baro, E.F., Oliveira, L.S. and de Souza Britto Junior, A. (2022) Predicting Hospitalization from Health Insurance Data. 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Prague, 9-12 October 2022, 2790-2795.
https://doi.org/10.1109/smc53654.2022.9945601
[19]  Luo, L., Yu, X., Yong, Z., Li, C. and Gu, Y. (2021) Design Comorbidity Portfolios to Improve Treatment Cost Prediction of Asthma Using Machine Learning. IEEE Journal of Biomedical and Health Informatics, 25, 2237-2247.
https://doi.org/10.1109/jbhi.2020.3034092
[20]  Farhud, D.D. and Zokaei, S. (2021) Ethical Issues of Artificial Intelligence in Medicine and Healthcare. Iranian Journal of Public Health, 50, i-v.
https://doi.org/10.18502/ijph.v50i11.7600
[21]  Mondal, H. and Mondal, S. (2024) Ethical and Social Issues Related to AI in Healthcare. Methods in Microbiology, 55, 247-281.
https://doi.org/10.1016/bs.mim.2024.05.009
[22]  Nill, A., Laczniak, G. and Thistle, P. (2017) The Use of Genetic Testing Information in the Insurance Industry: An Ethical and Societal Analysis of Public Policy Options. Journal of Business Ethics, 156, 105-121.
https://doi.org/10.1007/s10551-017-3554-y
[23]  Rohilla, V., Chakraborty, S. and Kumar, R. (2022) Deep Learning Based Feature Extraction and a Bidirectional Hybrid Optimized Model for Location Based Advertising. Multimedia Tools and Applications, 81, 16067-16095.
https://doi.org/10.1007/s11042-022-12457-3
[24]  Okfalisa, Gazalba, I., Mustakim, and Reza, N.G.I. (2017) Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification. 2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE), Yogyakarta, 1-2 November 2017, 294-298.
https://doi.org/10.1109/icitisee.2017.8285514
[25]  Chen, T. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 13-17 August 2016, 785-794.
https://doi.org/10.1145/2939672.2939785
[26]  Anmol, Aggarwal, S. and Jahan Badhon, A. (2022) Medical Insurance Cost Prediction Using Machine Learning Algorithms. In: Maurya, S., Peddoju, S.K., Ahmad, B. and Chihi, I., Eds., Cyber Technologies and Emerging Sciences, Springer, 271-281.
https://doi.org/10.1007/978-981-19-2538-2_27
[27]  Ul Hassan, C.A., Iqbal, J., Hussain, S., AlSalman, H., Mosleh, M.A.A. and Sajid Ullah, S. (2021) A Computational Intelligence Approach for Predicting Medical Insurance Cost. Mathematical Problems in Engineering, 2021, Article ID: 1162553.
https://doi.org/10.1155/2021/1162553
[28]  Christobel, Y.A. and Subramanian, S. (2022) An Empirical Study of Machine Learning Regression Models to Predict Health Insurance Cost. Webology, 19, 1677-1685.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133