%0 Journal Article %T 基于随机森林方法的国产电影票房研究
Research on Box Office of Domestic Films Based on Random Forest Method %A 郭萱 %A 鲍勇 %A 于水源 %J Hans Journal of Data Mining %P 11-20 %@ 2163-1468 %D 2021 %I Hans Publishing %R 10.12677/HJDM.2021.112002 %X 随着经济的发展和人民生活水平的提高,电影行业迅速发展。电影票房影响因素的研究及预测,对提高国产电影质量十分必要。目前学者们多采用神经网络方法对电影票房进行研究,神经网络方法未给出变量重要性排序,预测结果不够稳健。本文依据2014~2018年225部国产影片的相关数据,采用随机森林方法建立电影票房预测模型。得到了影响我国国产电影票房的因素主要有首周末票房、首映日票房、百度指数、豆瓣评分和点映票房。同时本文采用线性回归模型和神经网络模型建立电影票房的预测模型,应用三种方法对2019年12部国产电影票房进行预测。结果表明:随机森林在电影票房预测方面更加精确稳健,对《飞驰人生》、《银河补习班》等八部影片的预测误差在10%左右。神经网络和线性回归模型预测误差较大。
With the development of economy and the improvement of people’s living standard, the film indus-try develops rapidly. It is necessary to study and forecast the influencing factors of film box office to improve the quality of domestic films. At present, most scholars use the neural network method to study the box office of films. The neural network method does not give the order of importance of variables, and the prediction results are not robust enough. Based on the relevant data of 225 do-mestic films from 2014 to 2018, this paper adopts the random forest method to establish the box office prediction model. The main factors that influence the box office of domestic films in China are the box office of the first weekend, the first day box office, baidu index, douban score and the ad-vance screenings box office. At the same time, this paper adopts linear regression model and neural network model to establish the box office prediction model, and applies three methods to predict the box office of 12 domestic films in 2019. The results show that the random forest is more accu-rate and stable in the prediction of box office, and the prediction error of eight films such as “Pega-sus” and “Looking Up” is around 10%. The prediction error of neural network and linear regression model is large. %K 电影票房预测,影响因素,随机森林,神经网络,线性回归
Box Office Forecast %K Affecting Factors %K Random Forests %K Neural Network %K Linear Regression %U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=41262