全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

基于Stacking模型融合的车辆保险反欺诈识别
Anti-Fraud Identification of Vehicle Insurance Based on Stacking Model Fusion

DOI: 10.12677/sa.2024.132048, PP. 486-495

Keywords: 模型融合,XGboost,LightGBM,随机森林,车险欺诈
Model Fusion
, XGBoost, Lightgbm, Random Forest, Automobile Insurance Fraud

Full-Text   Cite this paper   Add to My Lib

Abstract:

基于Kaggle公开数据集,首先对其进行了数据清洗、数据均衡化等预处理工作,确保数据的质量和适用性。在预处理完成后,使用随机森林、XGBoost和LightGBM三种不同的机器学习算法构建了保险欺诈识别模型,并通过网格搜索法调整参数来提高其性能。采用分类模型的评估方法,对这三个模型的Pression、Recall等指标进行了对比分析,结果显示LightGBM模型在整体的分类效果上表现最好。最后,我们引入了Stacking技术,将三个单一模型作为初级分类器,Logistic Regression作为元分类器,得到了融合的的车险欺诈识别模型。这一模型结合了三个单一模型的优点,不仅具有较高的稳定性,还能提高整体的预测精度。通过模型融合,我们得到了一个更加全面、准确的欺诈识别系统,为保险公司提供了更可靠的风险管理工具。
Based on the Kaggle public dataset, we first conducted preprocessing such as data cleaning and balancing to ensure the quality and applicability of the data. After preprocessing, we constructed insurance fraud detection models using three different machine learning algorithms: Random Forest, XGBoost, and LightGBM. We used grid search to tune the parameters to improve their performance. By evaluating the models using classification metrics such as Precision and Recall, we found that the LightGBM model performed the best overall. Finally, we introduced the Stacking technique, combining the three individual models as base classifiers and Logistic Regression as the meta-classifier, to obtain a fused auto insurance fraud detection model. This model combines the advantages of the three individual models, exhibiting not only high stability but also improved overall prediction accuracy. Through model fusion, we obtained a more comprehensive and accurate fraud detection system, providing insurance companies with more reliable risk management tools.

References

[1]  中国银保监会. 中国银保监会关于印发实施车险综合改革指导意见的通知[EB/OL].
https://www.gov.cn/zhengce/zhengceku/2020-09/04/content_5540321.htm, 2024-02-25.
[2]  中国银保监会. 银保监会、公安部大数据反保险欺诈试点初显成效[EB/OL].
http://www.cbirc.gov.cn/cn/view/pages/ItemDetail.html?docId=947691&itemId=915&generaltype=0, 2024-02-25.
[3]  Wang, Y.B. and Wei, X. (2018) Leveraging Deep Learning with LDA-Based Text Analytics to Detect Automobile Insurance Fraud. Decision Support Systems, 105, 87-95.
https://doi.org/10.1016/j.dss.2017.11.001
[4]  Manuel, A., Mercedes, A. and Montserrat G. (1999) Modelling Different Types of Automobile Insurance Fraud Behaviour in the Spanish Market. Insurance: Mathematics and Economics, 24, 67-81.
[5]  Stijn, V., Richard, A.D., Bart, B. and Guido, D. (2002) A Comparison of State-of-the-Art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection. Journal of Risk and Insurance, 69, 373-421.
https://doi.org/10.1111/1539-6975.00023
[6]  Clifton, P., Daniel, D., Vincent, L. (2002) Skewed Data in Fraud Detection: Classification of Skewed Data. ACM SIGKDD Explorations Newsletter, 6, 1-19.
[7]  Jesús, M.P., Javier, M., Olatz, A., Ibai, G. and José, M. (2005) Consolidated Tree Classifier Learning in a Car Insurance Fraud Detection Domain with Class Imbalance. Pattern Recognition and Data Mining, 3686, 381-389.
https://doi.org/10.1007/11551188_41
[8]  搨思杰. A公司机动车辆保险欺诈识别指标研究[D]: [硕士学位论文]. 长沙: 湖南大学, 2016.
[9]  李亚琪. 基于数据挖掘技术的汽车保险欺诈识别研究[D]: [硕士学位论文]. 青岛: 山东科技大学, 2018.
[10]  杜小雨. 汽车保险欺诈索赔的关联分析[D]: [硕士学位论文]. 兰州: 兰州大学, 2019.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133