Federated Learning for Privacy-Preserving Psychiatric Decision Support: A Simulation Proof-of-Concept for Multi-Institutional Collaborative Risk Prediction
Psychiatric decision support systems hold promise for improving clinical outcomes, yet their development is hindered by data privacy regulations and institutional silos that prevent aggregation of sensitive patient information across healthcare facilities. This proof-of-concept simulation demonstrates that privacy-preserving federated learning can match centralized training performance under synthetic non-IID conditions; real-world validation on operational electronic health record data is required before clinical or regulatory conclusions can be drawn, enabling collaborative training of psychiatric readmission prediction models without centralizing raw patient data. Five simulated hospitals with non-independent and identically distributed data participated in federated training of neural network models over 20 communication rounds. We compared standard Federated Averaging (FedAvg) with differentially private federated learning (DP-FL, ε = 1.0) against a centralized baseline. The federated model achieved mean AUC-ROC of 0.800 (95% CI: 0.795 - 0.805), statistically equivalent to the centralized approach (AUC = 0.802, p = 0.42) while preserving data locality. DP-FL maintained strong performance (AUC = 0.806) with formal privacy guarantees. Per-hospital performance varied substantially (AUC range: 0.761 - 0.822), reflecting real-world data heterogeneity. Feature importance analysis identified medication adherence, PHQ-9 depression scores, and prior hospitalizations as top predictors. Communication costs were reduced 500-fold compared to raw data centralization. This federated learning framework demonstrates that privacy-preserving collaborative machine learning can achieve centralized-level predictive accuracy for psychiatric risk stratification while maintaining institutional data sovereignty and regulatory compliance.Subject AreasPsychiatry & Psychology
Cite this paper
Filippis, R. D. and Foysal, A. A. (2026). Federated Learning for Privacy-Preserving Psychiatric Decision Support: A Simulation Proof-of-Concept for Multi-Institutional Collaborative Risk Prediction. Open Access Library Journal, 13, e15138. doi: http://dx.doi.org/10.4236/oalib.1115138.
GBD 2019 Mental Disorders Collaborators (2022) Global, Regional, and National Burden of 12 Mental Disorders in 204 Countries and Territories, 1990-2019: A Systematic Analysis for the Global Burden of Disease Study 2019. <i>The </i><i>Lancet Psychiatry</i>, 9, 137-150.
Trautmann, S., Rehm, J. and Wittchen, H. (2016) The Economic Costs of Mental Disorders: Do Our Societies React Appropriately to the Burden of Mental Disorders? <i>The EMBO Reports</i>, 17, 1245-1249. <br>https://doi.org/10.15252/embr.201642951
Vigo, D., Thornicroft, G. and Atun, R. (2016) Estimating the True Global Burden of Mental Illness. <i>The Lancet Psychiatry</i>, 3, 171-178. <br>https://doi.org/10.1016/s2215-0366(15)00505-2
Rittenhouse, D.R., Shortell, S.M. and Fisher, E.S. (2009) Primary Care and Accountable Care—Two Essential Elements of Delivery-System Reform. <i>New England Journal of Medicine</i>, 361, 2301-2303. <br>https://doi.org/10.1056/nejmp0909327
Busch, A.B., Huskamp, H.A. and McWilliams, J.M. (2016) Early Efforts by Medicare Accountable Care Organizations Have Limited Effect on Mental Illness Care and Management. <i>Health Affairs</i>, 35, 1247-1256. <br>https://doi.org/10.1377/hlthaff.2015.1669
Kessler, R.C., Hwang, I., Hoffmire, C.A., McCarthy, J.F., Petukhova, M.V., Rosellini, A.J., <i>et al</i>. (2017) Developing a Practical Suicide Risk Prediction Model for Targeting High‐risk Patients in the Veterans Health Administration. <i>International Journal of Methods in Psychiatric Research</i>, 26, e1575. <br>https://doi.org/10.1002/mpr.1575
Rajpurkar, P., Chen, E., Banerjee, O. and Topol, E.J. (2022) AI in Health and Medicine. <i>Nature Medicine</i>, 28, 31-38. <br>https://doi.org/10.1038/s41591-021-01614-0
Beam, A.L. and Kohane, I.S. (2018) Big Data and Machine Learning in Health Care. <i>JAMA</i>, 319, 1317-1318. <br>https://doi.org/10.1001/jama.2017.18391
Price, W.N. and Cohen, I.G. (2019) Privacy in the Age of Medical Big Data. <i>Nature Medicine</i>, 25, 37-43. <br>https://doi.org/10.1038/s41591-018-0272-7
Mittelstadt, B.D. (2017) Ethics of the Health-Related Internet of Things: A Narrative Review. <i>Ethics and Information Technology</i>, 19, 157-175. <br>https://doi.org/10.1007/s10676-017-9426-4
Henderson, C., Evans-Lacko, S. and Thornicroft, G. (2013) Mental Illness Stigma, Help Seeking, and Public Health Programs. <i>American Journal of Public Health</i>, 103, 777-780. <br>https://doi.org/10.2105/ajph.2012.301056
Corrigan, P.W., Druss, B.G. and Perlick, D.A. (2014) The Impact of Mental Illness Stigma on Seeking and Participating in Mental Health Care. <i>Psychological Science in the Public Interest</i>, 15, 37-70. <br>https://doi.org/10.1177/1529100614531398
Chen, J.H. and Asch, S.M. (2017) Machine Learning and Prediction in Medicine—Beyond the Peak of Inflated Expectations. <i>New England Journal of Medicine</i>, 376, 2507-2509. <br>https://doi.org/10.1056/nejmp1702071
Sendak, M., Gao, M., Nichols, C., <i>et al</i>. (2020) “Human-Compatible” Machine Learning as a Step toward Safe Clinical AI. <i>NPJ Digital Medicine</i>, 3, 141.
McMahan, B., Moore, E., Ramage, D., <i>et al</i>. (2017) Communication-Efficient Learning of Deep Networks from Decentralized Data. <i>Proceedings of the </i>20<i>th International Conference on Artificial Intelligence and Statistics</i>, Volume 54, 1273-1282.
Rieke, N., Hancox, J., Li, W., Milletarì, F., Roth, H.R., Albarqouni, S., <i>et al</i>. (2020) The Future of Digital Health with Federated Learning. <i>NPJ Digital Medicine</i>, 3, Article No. 119. <br>https://doi.org/10.1038/s41746-020-00323-1
Yang, Q., Liu, Y., Chen, T. and Tong, Y. (2019) Federated Machine Learning: Concept and Applications. <i>ACM Transactions on Intelligent Systems and Technology</i>, 10, 1-19. <br>https://doi.org/10.1145/3298981
Kairouz, P. and McMahan, H.B. (2021) Advances and Open Problems in Federated Learning. <i>Foundations and Trends® in Machine Learning</i>, 14, 1-210. <br>https://doi.org/10.1561/2200000083
Dayan, I., Roth, H.R., Zhong, A., Harouni, A., Gentili, A., Abidin, A.Z., <i>et al</i>. (2021) Federated Learning for Predicting Clinical Outcomes in Patients with Covid-19. <i>Nature Medicine</i>, 27, 1735-1743. <br>https://doi.org/10.1038/s41591-021-01506-3
Li, T., Sahu, A.K., Talwalkar, A. and Smith, V. (2020) Federated Learning: Challenges, Methods, and Future Directions. <i>IEEE Signal Processing Magazine</i>, 37, 50-60. <br>https://doi.org/10.1109/msp.2020.2975749
Alonso, J., Vilagut, G., Adroher, N.D., <i>et al</i>. (2013) Disability-Adjusted Life Years Attributable to Mental and Substance Use Disorders: Findings from the Global Burden of Disease Study 2010. <i>PLOS ONE</i>, 8, e66392.
Kessler, R.C., Angermeyer, M., Anthony, J.C., <i>et al</i>. (2007) Lifetime Prevalence and Age-of-Onset Distributions of Mental Disorders in the World Health Organization’s World Mental Health Survey Initiative. <i>World Psychiatry</i>, 6, 168-176.
Dwork, C. and Roth, A. (2014) The Algorithmic Foundations of Differential Privacy. <i>Foundations and Trends® in Theoretical Computer Science</i>, 9, 211-487. <br>https://doi.org/10.1561/0400000042
Abadi, M., Chu, A., Goodfellow, I., McMahan, H.B., Mironov, I., Talwar, K., <i>et al</i>. (2016) Deep Learning with Differential Privacy. <i>Proceedings of the </i>2016<i> ACM SIGSAC Conference on Computer and Communications Security</i>, Vienna, 24-28 October 2016, 308-318. <br>https://doi.org/10.1145/2976749.2978318
Chen, T. and Guestrin, C. (2016) XGBoost: A Scalable Tree Boosting System. <i>Proceedings of the </i>22<i>nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</i>, San Francisco, 13-17 August 2016, 785-794. <br>https://doi.org/10.1145/2939672.2939785
Kessler, R.C., Berglund, P., Demler, O., Jin, R., Merikangas, K.R. and Walters, E.E. (2005) Lifetime Prevalence and Age-of-Onset Distributions of DSM-IV Disorders in the National Comorbidity Survey Replication. <i>Archives of General Psychiatry</i>, 62, 593-602. <br>https://doi.org/10.1001/archpsyc.62.6.593
Blanco, C., Compton, W.M., Saha, T.D., Goldstein, B.I., Ruan, W.J., Huang, B., <i>et al</i>. (2017) Epidemiology of DSM-5 Bipolar I Disorder: Results from the National Epidemiologic Survey on Alcohol and Related Conditions-III. <i>Journal of Psychiatric Research</i>, 84, 310-317. <br>https://doi.org/10.1016/j.jpsychires.2016.10.003
Grande, I., Berk, M., Birmaher, B. and Vieta, E. (2016) Bipolar Disorder. <i>The Lancet</i>, 387, 1561-1572. <br>https://doi.org/10.1016/s0140-6736(15)00241-x
Malhi, G.S., Bassett, D., Boyce, P., Bryant, R., Fitzgerald, P.B., Fritz, K., <i>et al</i>. (2015) Royal Australian and New Zealand College of Psychiatrists Clinical Practice Guidelines for Mood Disorders. <i>Australian & New Zealand Journal of Psychiatry</i>, 49, 1087-1206. <br>https://doi.org/10.1177/0004867415617657
Kroenke, K., Spitzer, R.L. and Williams, J.B.W. (2001) The PHQ-9: Validity of a Brief Depression Severity Measure. <i>Journal of General Internal Medicine</i>, 16, 606-613. <br>https://doi.org/10.1046/j.1525-1497.2001.016009606.x
Spitzer, R.L., Kroenke, K., Williams, J.B.W. and Löwe, B. (2006) A Brief Measure for Assessing Generalized Anxiety Disorder: The GAD-7. <i>Archives of Internal Medicine</i>, 166, 1092-1097. <br>https://doi.org/10.1001/archinte.166.10.1092
Velligan, D.I., Weiden, P.J., Sajatovic, M., Scott, J., Carpenter, D., Ross, R., <i>et al</i>. (2010) Strategies for Addressing Adherence Problems in Patients with Serious and Persistent Mental Illness: Recommendations from the Expert Consensus Guidelines. <i>Journal of Psychiatric Practice</i>, 16, 306-324. <br>https://doi.org/10.1097/01.pra.0000388626.98662.a0
Bickman, L., Andrade, A.R. and Lambert, E.W. (2002) Dose Response in Child and Adolescent Mental Health Services. <i>Mental Health Services Research</i>, 4, 57-70. <br>https://doi.org/10.1023/a:1015210332175
Garland, A.F., Haine, R.A. and Lewczyk Boxmeyer, C. (2007) Correlates of Adolescents’ Satisfaction with Mental Health Services. <i>Mental Health Services Research</i>, 9, 263-270.
Nguyen, L.M., Scheinberg, K., and Takáč, M. (2021) Inexact SARAH Algorithm for Stochastic Optimization. <i>Optimization Methods and Software</i>, 36, 237-258. <br>https://doi.org/10.1080/10556788.2020.1818081
Srivastava, N., Hinton, G., Krizhevsky, A., <i>et al</i>. (2014) Dropout: A Simple Way to Prevent Neural Networks from Overfitting. <i>Journal of Machine Learning Research</i>, 15, 1929-1958.
Ioffe, S. and Szegedy, C. (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. <i>International Conference on Machine Learning</i>, Lille, 6-11 July 2015, 448-456.
Chen, Y.J., Yue, N., Slawski, M., and Rangwala, H. (2020) Asynchronous Online Federated Learning for Edge Devices with Non-IID Data. 2020 <i>IEEE International Conference on Big Data</i> (<i>Big Data</i>), Atlanta, 10-13 December 2020, 15-24.<br>https://doi.org/10.1109/BigData50022.2020.9378161
Lin, T., Kong, L., Stich, S.U. and Jaggi, M. (2020) Ensemble Distillation for Robust Model Fusion in Federated Learning. <i>Advances in Neural Information Processing Systems</i>, 6-12 December 2020, 2351-2363.
Hsu, T.M., Qi, H. and Brown, M. (2019) Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification. <br>https://arxiv.org/abs/1909.06335
Dwork, C., Kenthapadi, K., McSherry, F., Mironov, I. and Naor, M. (2006) Our Data, Ourselves: Privacy via Distributed Noise Generation. In: <i>Annual International Conference on the Theory and Applications of Cryptographic Techniques</i>, Springer, 486-503. <br>https://doi.org/10.1007/11761679_29
Hanley, J.A. and McNeil, B.J. (1982) The Meaning and Use of the Area under a Receiver Operating Characteristic (ROC) Curve. <i>Radiology</i>, 143, 29-36. <br>https://doi.org/10.1148/radiology.143.1.7063747
Steyerberg, E.W., Vickers, A.J., Cook, N.R., Gerds, T., Gonen, M., Obuchowski, N., <i>et al</i>. (2010) Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures. <i>Epidemiology</i>, 21, 128-138. <br>https://doi.org/10.1097/ede.0b013e3181c30fb2
Carpenter, J. and Bithell, J. (2000) Bootstrap Confidence Intervals: When, Which, What? A Practical Guide for Medical Statisticians. <i>Statistics in Medicine</i>, 19, 1141-1164. <br>https://doi.org/10.1002/(sici)1097-0258(20000515)19:9<1141::aid-sim479>3.0.co;2-f
Lundberg, S.M. and Lee, S.I. (2017) A Unified Approach to Interpreting Model Predictions. <i>Proceedings of the </i>31<i>st International Conference on Neural Information Processing Systems</i>, Long Beach, 4-9 December 2017, 4768-4777.
Lundberg, S.M., Erion, G., Chen, H., DeGrave, A., Prutkin, J.M., Nair, B., <i>et al</i>. (2020) From Local Explanations to Global Understanding with Explainable AI for Trees. <i>Nature Machine Intelligence</i>, 2, 56-67. <br>https://doi.org/10.1038/s42256-019-0138-9
Sattler, F., Wiedemann, S., Muller, K. and Samek, W. (2020) Robust and Communication-Efficient Federated Learning from Non-IID Data. <i>IEEE Transactions on Neural Networks and Learning Systems</i>, 31, 3400-3413. <br>https://doi.org/10.1109/tnnls.2019.2944481
DeLong, E.R., DeLong, D.M. and Clarke-Pearson, D.L. (1988) Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach. <i>Biometrics</i>, 44, 837-845. <br>https://doi.org/10.2307/2531595
Sun, X. and Xu, W. (2014) Fast Implementation of Delong’s Algorithm for Comparing the Areas under Correlated Receiver Operating Characteristic Curves. <i>IEEE Signal Processing Letters</i>, 21, 1389-1393. <br>https://doi.org/10.1109/lsp.2014.2337313
Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., <i>et al</i>. (2020) SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. <i>Nature Methods</i>, 17, 261-272. <br>https://doi.org/10.1038/s41592-019-0686-2
Lin, W.C., Zhang, J., Lake, A.J., <i>et al</i>. (2019) Early Readmission after Hospital Discharge among Patients with Mental Disorders in the United States. <i>Psychiatric Services</i>, 70, 685-688.
Collins, G.S. and Altman, D.G. (2012) Predicting the 10 Year Risk of Cardiovascular Disease in the United Kingdom: Independent and External Validation of an Updated Version of QRISK2. <i>BMJ</i>, 344, e4181. <br>https://doi.org/10.1136/bmj.e4181
Nicholson, J., Krishnamurthy, R. and Lucas, P. (2021) Sharing Data for Public Health Research: A Systematic Review of Contextual Determinants. <i>International Journal of Environmental Research and Public Health</i>, 18, 3217.
Carter, P., Laurie, G.T. and Dixon-Woods, M. (2015) The Social Licence for Research: Why <i>care</i>.<i>data</i> Ran into Trouble. <i>Journal of Medical Ethics</i>, 41, 404-409. <br>https://doi.org/10.1136/medethics-2014-102374
Shojania, K.G. and Forster, A.J. (2008) Hospital Mortality: When Failure Is Not a Good Measure of Success. <i>Canadian Medical Association Journal</i>, 179, 153-157. <br>https://doi.org/10.1503/cmaj.080010
Wood, A., Altman, M., Bembenek, A., Bun, M., Gaboardi, M., Honaker, J., <i>et al</i>. (2018) Differential Privacy: A Primer for a Non-Technical Audience. <i>Vanderbilt Journal of Entertainment & Technology Law</i>, 21, 17.
Gostin, L.O. and Hodge, J.G.H. (2002) Personal Privacy and Common Goods: A Framework for Balancing under the National Health Information Privacy Rule. <i>Minnesota Law Review</i>, 86, Article No. 1439. <br>https://doi.org/10.24926/265535.2691
Mello, M.M., Francer, J.K., Wilenzick, M., Teden, P., Bierer, B.E. and Barnes, M. (2013) Preparing for Responsible Sharing of Clinical Trial Data. <i>New England Journal of Medicine</i>, 369, 1651-1658. <br>https://doi.org/10.1056/nejmhle1309073
Guo, Y., Liu, Y., Oerlemans, A., Lao, S., Wu, S. and Lew, M.S. (2016) Deep Learning for Visual Understanding: A Review. <i>Neurocomputing</i>, 187, 27-48. <br>https://doi.org/10.1016/j.neucom.2015.09.116
Oakden-Rayner, L., Dunnmon, J., Carneiro, G. and Re, C. (2020). Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging. <i>Proceedings of the ACM Conference on Health</i>,<i> Inference</i>,<i> and Learning</i>, Toronto, 23-25 July 2020, 151-159. <br>https://doi.org/10.1145/3368555.3384468
Bonawitz, K., Eichner, H., Grieskamp, W., <i>et al</i>. (2019) Towards Federated Learning at Scale: System Design. <i>Proceedings of Machine Learning and Systems</i>, Vol. 1, 374-388.
Lai, F., Dai, Y., Zhu, X., <i>et al</i>. (2021) FedScale: Benchmarking Model and System Performance of Federated Learning. <i>Proceedings of the </i>1<i>st </i><i>Workshop on Distributed Machine Learning</i>, 25 October 2021, 1-3.
Chung, D.T., Ryan, C.J., Hadzi-Pavlovic, D., Singh, S.P., Stanton, C. and Large, M.M. (2017) Suicide Rates after Discharge from Psychiatric Facilities: A Systematic Review and Meta-Analysis. <i>JAMA Psychiatry</i>, 74, 694-702. <br>https://doi.org/10.1001/jamapsychiatry.2017.1044
So, J., Guler, B. and Avestimehr, A.S. (2021) Turbo-aggregate: Breaking the Quadratic Aggregation Barrier in Secure Federated Learning. <i>IEEE Journal on Selected Areas in Information Theory</i>, 2, 479-489. <br>https://doi.org/10.1109/jsait.2021.3054610
Bell, J.H., Bonawitz, K.A., Gascón, A., Lepoint, T. and Raykova, M. (2020) Secure Single-Server Aggregation with (Poly)logarithmic Overhead. <i>Proceedings of the </i>2020<i> ACM SIGSAC Conference on Computer and Communications Security</i>, 9-13 November 2020, 1253-1269. <br>https://doi.org/10.1145/3372297.3417885
Zhuang, W., Gan, X., Wen, Y., Zhang, S. and Yi, S. (2021) Collaborative Unsupervised Visual Representation Learning from Decentralized Data. 2021 <i>IEEE</i>/<i>CVF International Conference on Computer Vision</i> (<i>ICCV</i>), Montreal, 10-17 October 2021, 4912-4921. <br>https://doi.org/10.1109/iccv48922.2021.00487
Yao, D., Pan, W., Dai, Y., <i>et al</i>. (2019) FedLearn: Federated Machine Learning with Model Exchange. <i>Proceedings of the </i>2019<i> SIAM International Conference on Data Mining</i>, Calgary, 2-4 May 2019, 608-610.