Suicide risk classification with machine learning techniques in a large Brazilian community sample

•We developed machine learning models for the classification of suicide risk in Brazil.•The Random Forests model achieved the best AUC ROC (0.814).•The Naive Bayes classifier presented the best sensitivity (0.922).•Features representing depression were the most relevant for the classification task.•...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Psychiatry research 2023-07, Vol.325, p.115258-115258, Article 115258
Hauptverfasser: Roza, Thiago Henrique, Seibel, Gabriel de Souza, Recamonde-Mendoza, Mariana, Lotufo, Paulo A., Benseñor, Isabela M., Passos, Ives Cavalcante, Brunoni, Andre Russowsky
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•We developed machine learning models for the classification of suicide risk in Brazil.•The Random Forests model achieved the best AUC ROC (0.814).•The Naive Bayes classifier presented the best sensitivity (0.922).•Features representing depression were the most relevant for the classification task.•Objective suicide risk stratification can provide the means to early preventive interventions. Even though suicide is a relatively preventable poor outcome, its prediction remains an elusive task. The main goal of this study was to develop machine learning classifiers to identify increased suicide risk in Brazilians with common mental disorders. With the use of clinical and sociodemographic baseline data (n = 4039 adult participants) from a large Brazilian community sample, we developed several models (Elastic Net, Random Forests, Naïve Bayes, and ensemble) for the classification of increased suicide risk among individuals with common mental disorders. 1120 participants (27.7%) presented increased suicide risk. The Random Forests model achieved the best AUC ROC (0.814), followed by Naive Bayes (0.798) and Elastic Net (0.773). Sensitivity varied from 0.922 (Naive Bayes) to 0.630 (Random Forests), while specificity varied from 0.792 (Random Forests) to 0.473 (Naive Bayes). The ensemble model presented an AUC ROC of 0.811, sensitivity of 0.899, and specificity of 0.510. Features representing depression symptoms were the most relevant for the classification of increased suicide risk. Some of our models presented good performance metrics in the classification of increased suicide risk in the investigated sample, which can provide the means to early preventive interventions.
ISSN:0165-1781
1872-7123
DOI:10.1016/j.psychres.2023.115258