According to the World Health Organization, tuberculosis (TB) is the leading cause of death among infectious diseases. Given the high prevalence of tuberculosis infection and the high number of deaths among these patients, this prospective study aimed to categorize patients and to find relationships among their clinical and demographic characteristics. The study was conducted on 600 patients from the Masih-e-Daneshvari tuberculosis research center during 2015-2016. The K-Means clustering algorithm and decision trees were used to perform the categorization and to determine common indicators among patients. Two clusters were chosen as optimal according to the Dunn index. Common factors between clusters are described in detail in the findings section. The most important factors identified by the clustering include hemoglobin, age, sex, smoking, alcohol consumption and creatinine. The RBF neural network model achieved 98% accuracy, and the most important factors it identified are sex, smoking, alcohol consumption, white blood cell count (WBC) and albumin.
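To illustrate how a Dunn index can favor one number of clusters over another, the following is a minimal sketch, not the study's implementation. The function name `dunn_index`, the toy points and both labelings are hypothetical; in practice the labelings would come from K-Means runs with different numbers of clusters, and the points would be patient feature vectors. The sketch uses one common definition of the index: the smallest distance between points in different clusters divided by the largest within-cluster diameter, so a higher value indicates compact, well-separated clusters.

```python
from itertools import combinations
from math import dist


def dunn_index(points, labels):
    """Smallest between-cluster distance divided by largest cluster diameter."""
    # Group the points by their cluster label.
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("Dunn index needs at least two clusters")

    # Largest pairwise distance inside any single cluster (its diameter).
    max_diameter = max(
        (dist(a, b)
         for members in clusters.values()
         for a, b in combinations(members, 2)),
        default=0.0,
    )
    # Smallest distance between two points that belong to different clusters.
    min_separation = min(
        dist(a, b)
        for c1, c2 in combinations(clusters.values(), 2)
        for a in c1
        for b in c2
    )
    if max_diameter == 0.0:
        return float("inf")  # every cluster is a single point
    return min_separation / max_diameter


# Hypothetical toy data: two visibly separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
two_clusters = [0, 0, 0, 1, 1, 1]    # assumed output of K-Means with k = 2
three_clusters = [0, 0, 1, 2, 2, 2]  # assumed output of K-Means with k = 3

scores = {
    2: dunn_index(points, two_clusters),
    3: dunn_index(points, three_clusters),
}
best_k = max(scores, key=scores.get)
print(scores, "best k:", best_k)
```

On this toy data the two-cluster labeling scores higher, because splitting one tight group into two pieces shrinks the smallest between-cluster distance without reducing the largest diameter.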