|
基于因果推断的慢性肾脏病分析
|
Abstract:
慢性肾脏病(CKD)是一种慢性健康状况疾病,在CKD的不同阶段,其临床表现也各不相同。所以对于慢性肾脏病患者的相关指标进行详细的分析具有重要的意义。XGBoost (Extreme Gradient Boosting)是一个优化的分布式梯度增强库,它在梯度提升(Gradient Boosting)框架下实现机器学习算法。本文提出了一种基于机器学习模型XGBoost与因果推断相结合的思想,建立了一个新模型XGBoost-CI (Extreme Gradient Boosting-Casual Inference),并利用该模型对导致慢性肾脏病的因素进行分析,最后通过与随机森林模型(Random Forest),逻辑回归模型(Logistic Regression)和LightGBM (Light Gradient Boosting Machine)三个模型进行精确度对比,证实了本文模型的有效性。
Chronic kidney disease (CKD) is a chronic health condition that has a different clinical presentation at different stages of CKD. Therefore, it is of great significance to conduct a detailed analysis of the relevant indicators of patients with chronic kidney disease. XGBoost (Extreme Gradient Boosting) is an optimized distributed gradient boosting library that implements machine learning algorithms under the Gradient Boosting framework. In this paper, we propose a new model XGBoost-CI (Extreme Gradient Boosting-Casual Inference) based on the combination of machine learning model XGBoost and causal inference, and use the model to analyze the factors leading to chronic kidney disease) and Light GBM (Light Gradient Boosting Machine) models were compared to confirm the effectiveness of the proposed model.
[1] | 张兰, 马迎春. 慢性肾脏病不同分期患者糖代谢异常及相关影响因素的调查研究[J]. 首都医科大学学报, 2024, 45(6): 1106-1110. |
[2] | 彭红蕾, 王恒, 翟朝阳, 等. 老年慢性肾脏病患者心理痛苦在体力活动量与生活质量间的中介效应[J]. 中华老年多器官疾病杂志, 2024, 23(10): 734-738. |
[3] | 吴飞, 李清丽, 肖振卫. 孟德尔随机化探究细胞因子与慢性肾脏病的因果关系[J]. 山东大学学报(医学版), 2024, 62(11): 85-95. |
[4] | 陈思军, 齐贞铭, 杨芳玮. 不同年龄慢性肾脏病患者血肌酐、血尿素氮及β2微球蛋白的差异性分析[J]. 包头医学, 2024, 48(3): 4-6. |
[5] | 马蓓佳, 李红哲, 代静. LEARNS框架下的健康教育对慢性肾脏病患者的影响[J]. 生物医学工程学进展, 2024, 45(3): 244-249. |
[6] | 吴飞, 李清丽, 肖振卫. 孟德尔随机化探究细胞因子与慢性肾脏病的因果关系[J]. 山东大学学报(医学版), 2024, 62(11): 85-95. |
[7] | Chronic Kidney Disease Dataset. https://www.kaggle.com/datasets/rabieelkharoua/chronic-kidney-disease-dataset-analysis |
[8] | 李家宁, 熊睿彬, 兰艳艳, 庞亮, 郭嘉丰, 程学旗. 因果机器学习的前沿进展综述[J]. 计算机研究与发展, 2023, 60(1): 59-84. |
[9] | 王小曼, 韩梦琦, 张晓林, 俞鹏飞, 罗颢文, 刘建模, 刘松, 易应萍. 基于XGboost学习算法构建模型对缺血性脑卒中合并房颤患者并发肺部感染的预测价值[J]. 中华医院感染学杂志, 2024, 34(3): 460-464. |
[10] | 李家宁, 熊睿彬, 兰艳艳, 庞亮, 郭嘉丰, 程学旗. 因果机器学习的前沿进展综述[J]. 计算机研究与发展, 2023, 60(1): 59-84. |
[11] | Pearl, J. (2009) Causal Inference in Statistics: An Overview. Statistics Surveys, 3, 96-146. |
[12] | Cheng, X., Kuang, M. and Yang, H. (2024) Missing Data Imputation Based on Causal Inference to Enhance Advanced Persistent Threat Attack Prediction. Symmetry, 16, Article No. 1551. https://doi.org/10.3390/sym16111551 |
[13] | Kiriakidou, N., Ballas, A., Hernando, C.M., Miralles, A., Stamati, T., Anagnostopoulos, D., et al. (2024) A Causal Inference Methodology to Support Research on Osteopenia for Breast Cancer Patients. Applied Sciences, 14, 9700. https://doi.org/10.3390/app14219700 |
[14] | Moccia, C., Moirano, G., Popovic, M., Pizzi, C., Fariselli, P., Richiardi, L., et al. (2024) Machine Learning in Causal Inference for Epidemiology. European Journal of Epidemiology, 39, 1097-1108. https://doi.org/10.1007/s10654-024-01173-x |
[15] | Chen, C., Zhang, J., Ye, T., Roth, D. and Zhang, B. (2024) Causal Inference with Textual Data: A Quasi-Experimental Design Assessing the Association between Author Metadata and Acceptance among ICLR Submissions from 2017 to 2022. Journal of Causal Inference, 12, Article ID: 20230052. https://doi.org/10.1515/jci-2023-0052 |
[16] | Li, R.X., J Prastein, D. and Choi, B.G. (2024) The Impact of Preoperative Depression on In-Hospital Outcomes in Coronary Artery Bypass Grafting: A Propensity-Matched Analysis of National Inpatient Sample from 2015-2020. The American Journal of the Medical Sciences. |
[17] | Liao, J., Xie, Y., Zhao, P., Xia, K., Xu, B., Wang, H., et al. (2024) Probabilistic Assessment of the Thermal Performance of Low-Enthalpy Geothermal System under Impact of Spatially Correlated Heterogeneity by Using Xgboost Algorithms. Energy, 313, Article ID: 133947. https://doi.org/10.1016/j.energy.2024.133947 |
[18] | Huang, M., Zhao, H. and Chen, Y. (2024) Research on SAR Image Quality Evaluation Method Based on Improved Harris Hawk Optimization Algorithm and XGBoost. Scientific Reports, 14, Article No. 28364. https://doi.org/10.1038/s41598-024-79674-8 |