%0 Journal Article %T 基于Python爬虫的职位信息数据分析和可视化系统实现
Job Information Data Analysis and Visualization System Implementation Based on Python Crawler %A 刘娟 %A 管希东 %J Software Engineering and Applications %P 317-325 %@ 2325-2278 %D 2020 %I Hans Publishing %R 10.12677/SEA.2020.94036 %X 为了能更加直观地了解到国内大数据有关的职业对学历和工作经验的具体要求以及不同性质企业地区分布等情况,采用Python的数据分析和处理功能,通过Python爬虫技术爬取前程无忧网大量职位信息。按照删除有空值的信息、与大数据无关的职业、信息错位的数据清洗方法,对数据进行预处理,然后将清洗后的数据存入数据库,再利用Pyecharts对数据进行可视化分析,用Flask作为Web框架开发Web应用程序,将可视化的数据展示在网页,提高了用户查询信息的速度,方便求职者找到适合且满意的职位。
In order to have a more intuitive understanding of the domestic big data-related professions spe-cific requirements for education and work experience and the regional distribution of different kinds of enterprises and so on, using Python’s data analysis and processing functions, we crawl a large number of position information from the 51 Job network through the Python crawler tech-nology. We delete the null value information, the irrelevant job information and the mismatched information according to the data cleaning method to preprocess the data, and save the clean data to the database, then use Pyecharts for visualization of data analysis, with Flask as Web framework for Web application development, display the visual data on the web page. It improves the speed of users to query information and facilitates job seekers to find suitable and satisfying professions.  %K Python爬虫,职位信息,数据清洗,可视化,Flask
Python Crawler %K Position Information %K Data Cleaning %K Visualization %K Flask %U http://www.hanspub.org/journal/PaperInformation.aspx?PaperID=37323