Mood disorders—particularly bipolar spectrum conditions—pose enduring diagnostic and therapeutic challenges due to their episodic nature, heterogeneous presentations, and reliance on subjective assessments. Conventional frameworks often delay accurate diagnosis and appropriate treatment. This study proposes an interpretable machine learning (ML) framework that integrates biological and behavioural data to support mood state classification and treatment response prediction in bipolar disorder. We constructed a multimodal dataset combining seven genetic polymorphisms (SNPs), ten serum biomarkers, and clinically relevant behavioural features such as stress levels, sleep deviation, and lithium adherence. Using this dataset, we developed and evaluated an ensemble-based ML pipeline incorporating Random Forest, XGBoost, and neural network classifiers, enhanced with SHAP-based interpretability and dimensionality reduction for decision visualization. The model demonstrated robust performance, achieving a macro-average ROC-AUC of 0.81, with the manic class yielding an AUC of 0.87. While manic states were well classified, euthymic states were frequently misclassified, reflecting the clinical ambiguity and low separability of remission features. Feature attribution consistently highlighted stress, lithium adherence, and sleep deviation as top predictors. Dimensionality reduction techniques (PCA, t-SNE, UMAP) revealed substantial class overlap, but improved separability for manic clusters. Stratification of biomarker distributions by treatment response further revealed clinical relevance in BDNF, DLPFC connectivity, and IL-6 levels. This work demonstrates the feasibility of using interpretable ML to enhance psychiatric diagnostics. By providing transparent, biologically grounded predictions, such systems may complement clinical judgment, enable early interventions, and support the development of precision psychiatry.
Cite this paper
Filippis, R. D. and Foysal, A. A. (2025). Interpretable Machine Learning for Mood State Classification and Treatment Response Analysis Using Clinical and Biomarker Data. Open Access Library Journal, 12, e13868. doi: http://dx.doi.org/10.4236/oalib.1113868.
Engelbrecht, H., Dalvie, S., Agenbag, G., Stein, D.J. and Ramesar, R.S. (2020) Whole-exome Sequencing in an Afrikaner Family with Bipolar Disorder. Journal of Affective Disorders, 276, 69-75. https://doi.org/10.1016/j.jad.2020.06.045
Perugi, G., Hantouche, E., Vannucchi, G. and Pinto, O. (2015) Cyclothymia Reloaded: A Reappraisal of the Most Misconceived Affective Disorder. Journal of Affective Disorders, 183, 119-133. https://doi.org/10.1016/j.jad.2015.05.004
Newman-Toker, D.E., Wang, Z., Zhu, Y., Nassery, N., Saber Tehrani, A.S., Schaffer, A.C., et al. (2020) Rate of Diagnostic Errors and Serious Misdiagnosis-Related Harms for Major Vascular Events, Infections, and Cancers: Toward a National Incidence Estimate Using the “Big Three”. Diagnosis, 8, 67-84. https://doi.org/10.1515/dx-2019-0104
Singh, H., Schiff, G.D., Graber, M.L., Onakpoya, I. and Thompson, M.J. (2016) The Global Burden of Diagnostic Errors in Primary Care. BMJ Quality & Safety, 26, 484-494. https://doi.org/10.1136/bmjqs-2016-005401
Matza, L.S., Rajagopalan, K.S., Thompson, C.L. and de Lissovoy, G. (2005) Misdiagnosed Patients with Bipolar Disorder: Comorbidities, Treatment Patterns, and Direct Treatment Costs. The Journal of Clinical Psychiatry, 66, 1432-1440. https://doi.org/10.4088/jcp.v66n1114
Malhi, G.S., Green, M., Fagiolini, A., Peselow, E.D. and Kumari, V. (2008) Schizoaffective Disorder: Diagnostic Issues and Future Recommendations. Bipolar Disorders, 10, 215-230. https://doi.org/10.1111/j.1399-5618.2007.00564.x
Tanaka, M. (2025) From Serendipity to Precision: Integrating AI, Multi-Omics, and Human-Specific Models for Personalized Neuropsychiatric Care. Biomedicines, 13, Article No. 167. https://doi.org/10.3390/biomedicines13010167
Lin, E., Lin, C. and Lane, H. (2020) Precision Psychiatry Applica-tions with Pharmacogenomics: Artificial Intelligence and Machine Learning Approaches. International Journal of Molecular Sciences, 21, Article No. 969. https://doi.org/10.3390/ijms21030969
Wang, L., Li, S. and Jin, X. (2024) Revolution-izing Brain Disease Diagnosis: The Convergence of AI, Genetic Screening, and Neuroimaging. Proceedings of the 2024 In-ternational Conference on Smart Healthcare and Wearable Intelligent Devices, Guangzhou, 18-20 October 2024, 10-17. https://doi.org/10.1145/3703847.3703850
Kalani, M. and Anjankar, A. (2024) Revolutionizing Neurology: The Role of Artificial Intelligence in Advancing Diagnosis and Treatment. Cureus, 16, e61706. https://doi.org/10.7759/cureus.61706
Onciul, R., Tataru, C., Dumitru, A.V., Crivoi, C., Serban, M., Covache-Busuioc, R., et al. (2025) Artificial Intelligence and Neuroscience: Transformative Synergies in Brain Research and Clinical Applica-tions. Journal of Clinical Medicine, 14, Article No. 550. https://doi.org/10.3390/jcm14020550
Saraf, S., De, A. and Tripathy, B.K. (2024) Effective Use of Computational Biology and Artificial Intelligence in the Domain of Medical Oncology. In: Computational Intelligence for Oncology and Neurological Disorders, CRC Press, 228-252. https://doi.org/10.1201/9781003450153-17
Nedic Erjavec, G., Svob Strac, D., Tudor, L., Konjevod, M., Sagud, M. and Pivac, N. (2019) Genetic Markers in Psychiatry. In: Kim, Y.-K., Ed., Frontiers in Psychiatry: Artificial Intelligence, Preci-sion Medicine, and Other Paradigm Shifts, Springer, 53-93. https://doi.org/10.1007/978-981-32-9721-0_4
Frey, B.N., Andreazza, A.C., Houenou, J., Jamain, S., Goldstein, B.I., Frye, M.A., et al. (2013) Biomarkers in Bipolar Disorder: A Positional Paper from the International Society for Bipolar Dis-orders Biomarkers Task Force. Australian & New Zealand Journal of Psychiatry, 47, 321-332. https://doi.org/10.1177/0004867413478217
Scola, G., Duong, A. and Syed, B. (2015) Biomarkers for Bipolar Dis-order: Current Insights. Current Biomarker Findings, 5, 79-92. https://doi.org/10.2147/cbf.s79138
Horgan, D., Cili-berto, G., Conte, P., Baldwin, D., Seijo, L., Montuenga, L.M., et al. (2020) Bringing Greater Accuracy to Europe’s Healthcare Systems: The Unexploited Potential of Biomarker Testing in Oncology. Biomedicine Hub, 5, 1-42. https://doi.org/10.1159/000511209
Serelli-Lee, V., Ito, K., Koibuchi, A., Tanigawa, T., Ueno, T., Matsushima, N., et al. (2022) A State-of-the-Art Roadmap for Biomarker-Driven Drug Development in the Era of Personalized Therapies. Journal of Personalized Medicine, 12, Article No. 669. https://doi.org/10.3390/jpm12050669
Razavi, M., Ziyadidegan, S., Jahromi, R., et al. (2023) Machine Learning, Deep Learning and Data Preprocessing Techniques for Detection, Prediction, and Monitoring of Stress and Stress-Related Mental Disorders: A Scoping Review.
Sükei, E., Norbury, A., Perez-Rodriguez, M.M., Olmos, P.M. and Artés, A. (2021) Predicting Emotional States Using Behavioral Markers Derived from Passively Sensed Data: Data-Driven Machine Learning Approach. JMIR mHealth and uHealth, 9, e24465. https://doi.org/10.2196/24465
Washington, P., Mutlu, C.O., Kline, A., et al. (2022) Challenges and Opportunities for Machine Learning Classification of Behavior and Mental State from Images.
Nasarian, E., Alizadehsani, R., Acharya, U.R. and Tsui, K. (2024) Designing Interpretable ML System to En-hance Trust in Healthcare: A Systematic Review to Proposed Responsible Clinician-AI-Collaboration Framework. Information Fusion, 108, Article ID: 102412. https://doi.org/10.1016/j.inffus.2024.102412
Rasheed, K., Qayyum, A., Ghaly, M., Al-Fuqaha, A., Razi, A. and Qadir, J. (2022) Explainable, Trustworthy, and Ethical Machine Learning for Healthcare: A Sur-vey. Computers in Biology and Medicine, 149, Article ID: 106043. https://doi.org/10.1016/j.compbiomed.2022.106043
Quinn, T.P., Jacobs, S., Senadeera, M., Le, V. and Coghlan, S. (2022) The Three Ghosts of Medical AI: Can the Black-Box Present Deliver? Artificial Intelligence in Medicine, 124, Article ID: 102158. https://doi.org/10.1016/j.artmed.2021.102158
Viering, T. and Loog, M. (2023) The Shape of Learning Curves: A Review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 7799-7819. https://doi.org/10.1109/tpami.2022.3220744
Li, S. and Hoefler, T. (2021) Chimera. Proceedings of the Internation-al Conference for High Performance Computing, Networking, Storage and Analysis, New York, 1-14. https://doi.org/10.1145/3458817.3476145
Allen-Zhu, Z. and Li, Y. (2022). Feature Purification: How Adversarial Training Performs Robust Deep Learning. 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS), Denver, 7-10 February 2022. https://doi.org/10.1109/focs52979.2021.00098
Baba, H., Kubota, Y., Suzuki, T. and Arai, H. (2007) Seven Cases of Late-Life Depression Treated with Cilostazol-Augmented Therapy. Journal of Clinical Psychopharmacology, 27, 727-728. https://doi.org/10.1097/jcp.0b013e31815a585e
Palanisamy, K., Krishnasamy, K. and Venkadasamy, P. (2024) Cardiovascular Disease Detection through Innovative Imbalanced Learning and AUC Optimization. Advances in Engineering and Intelligence Systems, 3, 68-84.
Birchwood, M. and Spencer, E. (2001) Early Intervention in Psychotic Relapse. Clinical Psychology Review, 21, 1211-1226. https://doi.org/10.1016/s0272-7358(01)00105-2
Adler, D.A., Ben-Zeev, D., Tseng, V.W., Kane, J.M., Brian, R., Campbell, A.T., et al. (2020) Predicting Early Warning Signs of Psychotic Relapse from Passive Sensing Data: An Approach Using Encoder-Decoder Neural Networks. JMIR mHealth and uHealth, 8, e19962. https://doi.org/10.2196/19962
Eisner, E., Drake, R. and Barrowclough, C. (2013) Assessing Early Signs of Relapse in Psychosis: Review and Future Directions. Clinical Psychology Review, 33, 637-653. https://doi.org/10.1016/j.cpr.2013.04.001
McGorry, P.D., Hickie, I.B., Yung, A.R., Pantelis, C. and Jackson, H.J. (2006) Clinical Staging of Psychiatric Disorders: A Heuristic Framework for Choosing Earlier, Safer and More Effective In-terventions. Australian & New Zealand Journal of Psychiatry, 40, 616-622. https://doi.org/10.1080/j.1440-1614.2006.01860.x
Owusu-Adjei, M., Ben Hayfron-Acquah, J., Frimpong, T. and Ab-dul-Salaam, G. (2023) Imbalanced Class Distribution and Performance Evaluation Metrics: A Systematic Review of Predic-tion Accuracy for Determining Model Performance in Healthcare Systems. PLOS Digital Health, 2, e0000290. https://doi.org/10.1371/journal.pdig.0000290
Ghavidel, A. and Pazos, P. (2023) Machine Learning (ML) Tech-niques to Predict Breast Cancer in Imbalanced Datasets: A Systematic Review. Journal of Cancer Survivorship, 19, 270-294. https://doi.org/10.1007/s11764-023-01465-3
Altalhan, M., Algarni, A. and Turki-Hadj Alouane, M. (2025) Imbal-anced Data Problem in Machine Learning: A Review. IEEE Access, 13, 13686-13699. https://doi.org/10.1109/access.2025.3531662
Depp, C., Torous, J. and Thompson, W. (2016) Technology-Based Early Warning Systems for Bipolar Disorder: A Conceptual Framework. JMIR Mental Health, 3, e42. https://doi.org/10.2196/mental.5798
Stephenson, L.A., Gergel, T., Gieselmann, A., Scholten, M., Keene, A.R., Rifkin, L., et al. (2020) Advance Decision Making in Bipolar: A Systematic Review. Frontiers in Psychiatry, 11, Article ID: 538107. https://doi.org/10.3389/fpsyt.2020.538107
Goodwin, G., Haddad, P., Ferrier, I., Aronson, J., Barnes, T., Cipriani, A., et al. (2016) Evidence-Based Guidelines for Treating Bipolar Disorder: Revised Third Edition Recommendations from the British Association for Psychopharmacology. Journal of Psychopharmacology, 30, 495-553. https://doi.org/10.1177/0269881116636545
Trevethan, R. (2017) Sensitivity, Specificity, and Predictive Values: Foundations, Pliabilities, and Pitfalls in Research and Practice. Frontiers in Public Health, 5, Article No. 307. https://doi.org/10.3389/fpubh.2017.00307
Lobo, J.M., Jiménez‐Valverde, A. and Real, R. (2007) AUC: A Misleading Measure of the Performance of Predictive Distribution Models. Global Ecology and Biogeography, 17, 145-151. https://doi.org/10.1111/j.1466-8238.2007.00358.x
Olowolayemo, A., Souag, A. and Sirlantzis, K. (2024) Evaluating the Impact of Machine Learning Platforms on Cancer Classification Model Performance: A Cross-Platform Comparative Study. International Journal on Advanced in Life Sciences, 16, 96-111.
Harpaz, R., DuMouchel, W., LePendu, P., Bau-er-Mehren, A., Ryan, P. and Shah, N.H. (2013) Performance of Pharmacovigilance Signal-Detection Algorithms for the FDA Adverse Event Reporting System. Clinical Pharmacology & Therapeutics, 93, 539-546. https://doi.org/10.1038/clpt.2013.24
Benazzi, F. (2008) Defining Mixed Depression. Progress in Neu-ro-Psychopharmacology and Biological Psychiatry, 32, 932-939. https://doi.org/10.1016/j.pnpbp.2007.12.019
Chouinard, G. and Margolese, H.C. (2005) Manual for the Extrapy-ramidal Symptom Rating Scale (ESRS). Schizophrenia Research, 76, 247-265. https://doi.org/10.1016/j.schres.2005.02.013
Abi‐Dargham, A., Moeller, S.J., Ali, F., DeLorenzo, C., Domschke, K., Horga, G., et al. (2023) Candidate Biomarkers in Psychiatric Disorders: State of the Field. World Psychiatry, 22, 236-262. https://doi.org/10.1002/wps.21078
McGraw, C.M., Ward, C.S. and Samaco, R.C. (2017) Genetic Rodent Models of Brain Disorders: Perspectives on Experimental Approaches and Therapeutic Strategies. American Journal of Medical Genet-ics Part C: Seminars in Medical Genetics, 175, 368-379. https://doi.org/10.1002/ajmg.c.31570
Dadi, K., Varoquaux, G., Houenou, J., Bzdok, D., Thirion, B. and Engemann, D. (2021) Population Modeling with Machine Learning Can Enhance Measures of Mental Health GigaScience, 10, giab071. https://doi.org/10.1093/gigascience/giab071
Liu, X.S., Xu, X., Xu, X., Li, X. and Xie, G.T. (2021) Representation Learning for Multi-Omics Data with Heterogeneous Gene Regulatory Net-work. 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Houston, 9-12 December 2021. https://doi.org/10.1109/bibm52615.2021.9669359
Sun, C. and Liu, Z. (2024) Discovering Explainable Biomarkers for Breast Cancer Anti-Pd1 Response via Network Shapley Value Analysis. Computer Methods and Programs in Biomedicine, 257, Article ID: 108481. https://doi.org/10.1016/j.cmpb.2024.108481
Omiyefa, S. (2025) Artificial Intelligence and Machine Learning in Precision Mental Health Diagnostics and Predictive Treatment Models. International Journal of Research Publication and Reviews, 6, 85-99. https://doi.org/10.55248/gengpi.6.0325.1107
Lemaitre, F., Florentin, V., Blin, O., Bayle, A., Benito, S., Chauny, J., et al. (2024) Can Precision Medicine Be Integrated into Routine Therapeutic Decisions at the Bedside of Patients? Therapies, 79, 13-22. https://doi.org/10.1016/j.therap.2023.11.007