Combining automatic speech recognition with semantic natural language processing in schizophrenia

•The application of Natural Language Processing (NLP) tools to classify psychiatric disorders is hampered by the manual transcription of patient interviews.•By combining Automatic Speech Recognition (ASR) with a semantic NLP model, 77% accuracy was reached in classifying patients with schizophrenia....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Psychiatry research 2023-07, Vol.325, p.115252-115252, Article 115252
Hauptverfasser: Ciampelli, S., Voppel, A.E., de Boer, J.N., Koops, S., Sommer, I.E.C.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•The application of Natural Language Processing (NLP) tools to classify psychiatric disorders is hampered by the manual transcription of patient interviews.•By combining Automatic Speech Recognition (ASR) with a semantic NLP model, 77% accuracy was reached in classifying patients with schizophrenia.•No significant difference in performance between the NLP model based on automatic transcripts vs. the model based on manual transcripts was found.•Combining ASR technology with semantic NLP models qualifies as a robust and efficient method for diagnosing schizophrenia. Natural language processing (NLP) tools are increasingly used to quantify semantic anomalies in schizophrenia. Automatic speech recognition (ASR) technology, if robust enough, could significantly speed up the NLP research process. In this study, we assessed the performance of a state-of-the-art ASR tool and its impact on diagnostic classification accuracy based on a NLP model. We compared ASR to human transcripts quantitatively (Word Error Rate (WER)) and qualitatively by analyzing error type and position. Subsequently, we evaluated the impact of ASR on classification accuracy using semantic similarity measures. Two random forest classifiers were trained with similarity measures derived from automatic and manual transcriptions, and their performance was compared. The ASR tool had a mean WER of 30.4%. Pronouns and words in sentence-final position had the highest WERs. The classification accuracy was 76.7% (sensitivity 70%; specificity 86%) using automated transcriptions and 79.8% (sensitivity 75%; specificity 86%) for manual transcriptions. The difference in performance between the models was not significant. These findings demonstrate that using ASR for semantic analysis is associated with only a small decrease in accuracy in classifying schizophrenia, compared to manual transcripts. Thus, combining ASR technology with semantic NLP models qualifies as a robust and efficient method for diagnosing schizophrenia.
ISSN:0165-1781
1872-7123
DOI:10.1016/j.psychres.2023.115252