The growing need to use Artificial Intelligence (AI) technologies in addressing challenges in education sectors of developing countries is undermined by low awareness, limited skill and poor data quality. One particular persisting challenge, which can be addressed by AI, is school dropouts whereby hundreds of thousands of children drop annually in Africa. This article presents a data-driven approach to proactively predict likelihood of dropping from schools and enable effective management of dropouts. The approach is guided by a carefully crafted conceptual framework and new concepts of average absenteeism, current cumulative absenteeism and dropout risk appetite. In this study, a typical scenario of missing quality data is considered and for which synthetic data is generated to enable development of a functioning prediction model using neural network. The results show that, using the proposed approach, the levels of risk of dropping out of schools can be practically determined using data that is largely available in schools. Potentially, the study will inspire further research, encourage deployment of the technologies in real life, and inform processes of formulating or improving policies.
References
[1]
Raaijmakers, S. (2019) Artificial Intelligence for Law Enforcement: Challenges and Opportunities. IEEE Security and Privacy, 17, 74-77. https://doi.org/10.1109/MSEC.2019.2925649
[2]
du Boulay, B. (2016) Artificial Intelligence as an Effective Classroom Assistant. IEEE Intelligent Systems, 31, 76-81. https://doi.org/10.1109/MIS.2016.93
[3]
Cai, L. and Zhu, Y. (2015) The Challenges of Data Quality and Data Quality Assessment in the Big Data Era. Data Science Journal, 14, 2. https://doi.org/10.5334/dsj-2015-002
[4]
Theunissen, M.-J., Bosma, H., Verdonk, P. and Feron, F. (2015) Why Wait? Early Determinants of School Dropout in Preventive Pediatric Primary Care. PLoS ONE, 10, e0142315. https://doi.org/10.1371/journal.pone.0142315
[5]
Lee, S. and Chung, J. (2019) The Machine Learning-Based Dropout Early Warning System for Improving the Performance of Dropout Prediction. Applied Sciences, 9, 3093. https://doi.org/10.3390/app9153093
[6]
Tan, M. and Shao, P. (2015) Prediction of Student Dropout in E-Learning Program Through the Use of Machine Learning Method. International Journal of Emerging Technologies in Learning (iJET), 10, 11-17. https://doi.org/10.3991/ijet.v10i1.4189
[7]
Sivakumar, S., Venkataraman, S. and Selvaraj, R. (2016) Predictive Modeling of Student Dropout Indicators in Educational Data Mining using Improved Decision Tree. Indian Journal of Science and Technology, 9, 1-5. https://doi.org/10.17485/ijst/2016/v9i4/87032
[8]
Kalinga, T. (2013) Causes of the Dropout in Secondary School in Tanzania: The Case Study of Mbeya, Dar es Salaam and Kilimanjaro Regions. Master’s Thesis, The Open University of Tanzania, Dar es Salaam.
[9]
Cortez, P. and Silva, A. (2008) Using Data Mining to Predict Secondary School Student Performance. Proceedings of 5th Annual Future Business Technology Conference, Porto, 9-11 April 2008, 5-12.
[10]
Mundell, J. (2017) Africa’s Data Revolution: Accelerating Development through Data-Driven Decision-Making. https://www.inonafrica.com/2017/10/02/africas-data-revolution-accelerating-development-data-driven-decision-making/
[11]
PORALG (2016) Pre-Primary, Primary and Secondary Education Statistics in Brief. The Government of the United Republic of Tanzania, Dodoma.
[12]
Jaderberg, M., Simonyan, K., Vedaldi, A. and Zisserman, A. (2014) Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition.
[13]
Gupta, A., Vedaldi, A. and Zisserman, A. (2016) Synthetic Data for Text Localisation in Natural Images. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 27-30 June 2016, 2315-2324. https://doi.org/10.1109/CVPR.2016.254
[14]
Le, T., Baydin, A., Zinkov, R. and Wood, F. (2017) Using Synthetic Data to Train Neural Networks Is Model-Based Reasoning. 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 14-19 May 2017, 3514-3521. https://doi.org/10.1109/IJCNN.2017.7966298
[15]
Nicholson, C. (2019) A.I. Wiki-Open Datasets. https://pathmind.com/wiki/open-datasets
[16]
Gonfalonieri, A. (2019) Do You Need Synthetic Data for Your AI Project? https://towardsdatascience.com/do-you-need-synthetic-data-for-your-ai-project-e7ecc2072d6b