%0 Journal Article
%T 基于模型融合的上市公司财务造假的预测
Prediction of Financial Fraud of Listed Companies Based on Model Fusion
%A 仪梦
%A 吴丽丽
%J E-Commerce Letters
%P 1991-2006
%@ 2168-5851
%D 2024
%I Hans Publishing
%R 10.12677/ecl.2024.132241
%X 我国上市公司财务报告造假的问题一直伴随着市场的发展。针对此问题,构造了基于分类模型的上市公司财务造假的预测研究。通过数据的预处理和机器学习算法模型,以及统计学的一些方法,建立了一套完整的分析预测模型。首先按照行业分为大类,将年数据中缺失率达到50%以上的指标剔除,剩下缺失的数据选用0来填充。对于日数据提取每股指标,并按年进行均值化,将均值化后的日数据整合为年数据并且提取出特征因子,最终通过降维的思想筛选出对上市财务造假有较大影响的因子。通过确定的因子,将特征因子初步处理,并且进行标准化,通过使用三大类特征选择的方法,使系统的特定指标进一步优化,接着用主成分降维,正则化特征提取,最终用决策树分类模型、线性判别模型、梯度提升分类模型、支持向量机模型四种分类模型进行分类预测。
The problem of financial report fraud of listed companies in our country has been accompanied by the development of the market. In order to solve this problem, this paper constructs a forecasting research on financial fraud of listed companies based on classification model. Through the data preprocessing and machine learning algorithm model, as well as some statistical methods, we established a complete set of analysis and prediction model. First of all, according to the industry it is divided into large categories. The annual data missing rate of more than 50% of the indicators removed. The remaining missing data selected to fill 0. For the daily data, we extract the per-share index and average it every year, then integrate the average daily data into the number of years. We integrate the averaged daily data into annual data and extract the characteristic factors. Finally, through the thought of dimensionality reduction, the factors that have a greater impact on listed financial fraud are screened out. By using the method of feature selection of three categories, the specific index of the system is further optimized, and then the principal component is used to reduce the dimension. Finally, decision tree classification model, linear discriminant model, gradient promotion classification model and support vector machine model were used to predict the classification.
%K 财务造假,随机森林,组合决策树,逻辑回归模型,曲线下面积
Financial Fraud
%K Random Forest Model
%K XGBoost
%K Logistic Regression Model
%K AUC
%U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=87113