The rise of social media paves way for unprecedented benefits or risks to several organisations depending on how they adapt to its changes. This rise comes with a great challenge of gaining insights from these big data for effective and efficient decision making that can improve quality, profitability, productivity, competitiveness and customer satisfaction. Sentiment analysis is the field that is concerned with the classification and analysis of user generated text under defined polarities. Despite the upsurge of research in sentiment analysis in recent years, there is a dearth in literature on sentiment analysis applied to banks social media data and mostly on African datasets. Against this background, this study applied machine learning technique (support vector machine) for sentiment analysis of Nigerian banks twitter data within a 2-year period, from 1st January 2017 to 31st December 2018. After crawling and preprocessing of the data, LibSVM algorithm in WEKA was used to build the sentiment classification model based on the training data. The performance of this model was evaluated on a pre-labelled test dataset generated from the five banks. The results show that the accuracy of the classifier was 71.8367%. The precision for both the positive and negative classes was above 0.7, the recall for the negative class was 0.696 and that of the positive class was 0.741 which shows the prediction did better than chance in addition to other measures. Applying the model in predicting the sentiments of the five Nigerian banks twitter data reveals that the number of positive tweets within this period was slightly greater than the number of negative tweets. The scatter plots for the sentiments series indicated that, majority of the data falls between 0 and 100 sentiments per day, with few outliers above this range.
References
[1]
Statista (2017) Number of Monthly Active Twitter Users Worldwide from 1st Quarter 2010 to 1st Quarter 2019 (in Millions). https://www.statista.com/statistics/282087/number-of-monthly-active-twitter-users/
[2]
Pang, B., Lee, L., et al. (2008) Opinion Mining and Sentiment Analysis. Foundations and Trends® in Information Retrieval, 2, 1-135. https://doi.org/10.1561/1500000011
[3]
Jadav, B.M. and Vaghela, V.B. (2016) Sentiment Analysis Using Support Vector Machine Based on Feature Selection and Semantic Analysis. International Journal of Computer Applications, 146, 26-30. https://doi.org/10.5120/ijca2016910921
[4]
Tripathy, A., Agrawal, A. and Rath, S.K. (2016) Classification of Sentiment Reviews Using n-Gram Machine Learning Approach. Expert Systems with Applications, 57, 117-126.
[5]
Taboada, M., Brooke, J., Tofiloski, M., Voll, K. and Stede, M. (2011) Lexicon-Based Methods for Sentiment Analysis. Computational Linguistics, 37, 267-307. https://doi.org/10.1162/COLI_a_00049
[6]
Al-Ayyoub, M., Essa, S.B. and Alsmadi, I. (2015) Lexicon-Based Sentiment Analysis of Arabic Tweets. International Journal of Social Network Mining, 2, 101-114. https://doi.org/10.1504/IJSNM.2015.072280
[7]
Khoo, C.S. and Johnkhan, S.B. (2018) Lexicon-Based Sentiment Analysis: Comparative Evaluation of Six Sentiment Lexicons. Journal of Information Science, 44, 491-511. https://doi.org/10.1177/0165551517703514
[8]
Prabowo, R. and Thelwall, M. (2009) Sentiment Analysis: A Combined Approach. Journal of Informetrics, 3, 143-157. https://doi.org/10.1016/j.joi.2009.01.003
[9]
Appel, O., Chiclana, F., Carter, J. and Fujita, H. (2016) A Hybrid Approach to the Sentiment Analysis Problem at the Sentence Level. Knowledge-Based Systems, 108, 110-124. https://doi.org/10.1016/j.knosys.2016.05.040
[10]
Cardie, C., Wiebe, J., Wilson, T. and Litman, D.J. (2003) Combining Low-Level and Summary Representations of Opinions for Multi-Perspective Question Answering. In: New Directions in Question Answering, 20-27.
[11]
Dave, K., Lawrence, S. and Pennock, D.M. (2003) Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews. In: Proceedings of the 12th International Conference on World Wide Web, ACM, New York, 519-528. https://doi.org/10.1145/775152.775226
[12]
Mishne, G. and Glance, N.S. (2006) Predicting Movie Sales from Blogger Sentiment. AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs, 155-158.
[13]
Tetlock, P.C. (2007) Giving Content to Investor Sentiment: The Role of Media in the Stock Market. The Journal of Finance, 62, 1139-1168. https://doi.org/10.1111/j.1540-6261.2007.01232.x
[14]
Nofsinger, J.R. (2005) Social Mood and Financial Economics. The Journal of Behavioral Finance, 6, 144-160. https://doi.org/10.1207/s15427579jpfm0603_4
[15]
Bollen, J., Mao, H. and Zeng, X. (2011) Twitter Mood Predicts the Stock Market. Journal of Computational Science, 2, 1-8. https://doi.org/10.1016/j.jocs.2010.12.007
[16]
Nofer, M. and Hinz, O. (2015) Using Twitter to Predict the Stock Market. Business & Information Systems Engineering, 57, 229-242. https://doi.org/10.1007/s12599-015-0390-4
[17]
Nisar, T.M. and Yeung, M. (2018) Twitter as a Tool for Forecasting Stock Market Movements: A Shortwindow Event Study. The Journal of Finance and Data Science, 4, 101-119. https://doi.org/10.1016/j.jfds.2017.11.002
[18]
Agaian, S. and Kolm, P. (2017) Financial Sentiment Analysis Using Machine Learning Techniques. International Journal of Investment Management and Financial Innovations, 3, 1-9.
[19]
Dokoohaki, N., Zikou, F., Gillblad, D. and Matskin, M. (2015) Predicting Swedish elections with Twitter: A Case for Stochastic Link Structure Analysis. 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Paris, 25-28 August 2015, 1269-1276. https://doi.org/10.1145/2808797.2808915
[20]
Hasan, A., Moin, S., Karim, A. and Shamshirband, S. (2018) Machine Learning-Based Sentiment Analysis for Twitter Accounts. Mathematical and Computational Applications, 23, 11. https://doi.org/10.3390/mca23010011
[21]
Almeida, J.M., Pappa, G.L., et al. (2015) Twitter Population Sample Bias and Its Impact on Predictive Outcomes: A Case Study on Elections. Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Paris, 25-28 August 2015, 1254-1261. https://doi.org/10.1145/2808797.2809328
[22]
Relbanks (2017) Top Ten Banks in Nigeria Ranked by Their Total Assets. https://www.relbanks.com/africa/nigeria
[23]
Go, A., Bhayani, R. and Huang, L. (2009) Twitter Sentiment Classification Using Distant Supervision. CS224N Project Report.
[24]
Angiani, G., Ferrari, L., Fontanini, T., Fornacciari, P., Iotti, E., Magliani, F. and Manicardi, S. (2016) A Comparison between Preprocessing Techniques for Sentiment Analysis in Twitter. KDWeb.
[25]
Vapnik, V.N. (1995) The Nature of Statistical Learning Theory. Springer-Verlag, Berlin. https://doi.org/10.1007/978-1-4757-2440-0
[26]
Burges, C.J. (1998) A Tutorial on Support Vector Machine for Pattern Recognition. Data Mining and Knowledge Discovery, 2, 955-974. https://doi.org/10.1023/A:1009715923555
[27]
Cortes, C. and Vapnik, V. (1995) Support-Vector Networks. Machine Learning, 20, 273-297. https://doi.org/10.1007/BF00994018
[28]
Chang, C.-C. and Lin, C.-J. (2011) LIBSVM: A Library for Support Vector Machines. ACM Transactions on Intelligent Systems and Technology, 2, Article No. 27. https://doi.org/10.1145/1961189.1961199 https://dl.acm.org/citation.cfm?doid=1961189.1961199
[29]
Platt, J. (1998) Sequential Minimal Optimization: A Fast Algorithm for Training Support Vector Machines.