全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

Research on Political Trend of North Korea Based on Big Data Text Mining Method

DOI: 10.4236/oalib.1105893, PP. 1-10

Subject Areas: Statistics

Keywords: Text Mining, K-Means Algorithm, LDA Topic Model, Public Opinion Analysis

Full-Text   Cite this paper   Add to My Lib

Abstract

With the method of text mining, this paper takes the related data of Rodong Sinmun in the recent ten years as the research object, extracts the hot topics and carries on the trend analysis. With the special attribute of his speech media, this paper analyzes the political issue. By extracting nearly one million news text data, their topic content is analyzed, combining with LDA topic model, and using K-means clustering algorithm. Aiming at the limitations of the traditional K-means algorithm, it is solved on the pre-built big data analysis platform, and the structure and content of the theme are analyzed in detail. In the end, the political theme and the trend of public opinion in recent years are derived. In terms of application, it is of great significance to analyze and study Korean big data text.

Cite this paper

Li, H. and Jin, Z. (2019). Research on Political Trend of North Korea Based on Big Data Text Mining Method. Open Access Library Journal, 6, e5893. doi: http://dx.doi.org/10.4236/oalib.1105893.

References

[1]  Likas, A., Vlassis, N.J. and Verbeek, J. (2003) The Global K-Means Clustering Algorithm. Pattern Recognition, 36, 451-461.
https://doi.org/10.1016/S0031-3203(02)00060-2
[2]  Noh, Y., Kim, T., Jeong, D.-K. and Lee, K. (2019) Trend Analysis of Convergence Research Based on Social Big Data. Journal of The Korea Contents Association, 19, 135-146.
[3]  Kim, M., Koo, C. and Sohn, B. (2019) A Study on the Effectiveness of Education Welfare Priority Support Program through Text Mining. Korean Journal of Youth Studies, 26, 313-332. https://doi.org/10.21509/KJYS.2019.02.26.2.313
[4]  Dhillon, I.S. and Modha, D.S. (2001) Concept Decompositions for Large Sparse Text Data Using Clustering. Machine Learning, 42, 143-175.
https://doi.org/10.1023/A:1007612920971
[5]  Juan, Z., Xiong, Z.Y. and Zhang, Y.F. (2006) Multi-Center Clustering Algorithm Based on Max-Min Distance Method. Journal of Computer Applications, 26, 1425-1427.
[6]  Wang, Y. and Tang, J. (2014) High-Efficiency K-Means Optimal Clustering Number Determination Algorithm. Journal of Computer Applications, 34, 1331-1335.
[7]  Sun, J.G., Liu, J. and Zhao, L.Y. (2008) Clustering Algorithms Research. Journal of Software, 19, 48-61. https://doi.org/10.3724/SP.J.1001.2008.00048
[8]  Wei, J. (2018) Research on Improved Algorithm Based on K-Means Clustering Algorithm. Information and Communications, 2018, 14-15.
[9]  An, J.Y., An, G.G. and Shi, Z.Q. (2015) An Improved K-Means Text Clustering Algorithm. Transducer and Microsystem Technologies, 34, 130-133.
[10]  Wang, C.L. and Zhang, J.X. (2014) Ap-plication of Improved K-Means Algorithm Based on LDA in Text Clustering. Journal of Computer Applications, 34, 249-254.

Full-Text


comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413