Comparison of automated deep neural network against manual sleep stage scoring in clinical data
To compare the accuracy and generalizability of an automated deep neural network and the Philip Sleepware G3™ Somnolyzer system (Somnolyzer) for sleep stage scoring using American Academy of Sleep Medicine (AASM) guidelines. Sleep recordings from 104 participants were analyzed by a convolutional neu...
Gespeichert in:
Veröffentlicht in: | Computers in biology and medicine 2024-09, Vol.179, p.108855, Article 108855 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | To compare the accuracy and generalizability of an automated deep neural network and the Philip Sleepware G3™ Somnolyzer system (Somnolyzer) for sleep stage scoring using American Academy of Sleep Medicine (AASM) guidelines.
Sleep recordings from 104 participants were analyzed by a convolutional neural network (CNN), the Somnolyzer and skillful technicians. Evaluation metrics were derived for different combinations of sleep stages. A further comparison between the Somnolyzer and the CNN model using a single-channel signal as input was also performed. Sleep recordings from 263 participants with a lower prevalence of OSA served as a cross-validation dataset to validate the generalizability of the CNN model.
The overall agreement between automated and manual scoring for sleep staging in 104 participants outperformed that of the Somnolyzer according to various metrics (accuracy: 81.81 % vs. 77.07 %; F1: 76.36 % vs. 73.80 %; Cohen's kappa: 0.7403 vs. 0.6848). The results showed that the left electrooculography (EOG) single-channel model had minor advantages over the Somnolyzer. In terms of consistency with manual sleep staging, the CNN model demonstrated superior performance in identifying more pronounced sleep transitions, particularly in the N2 stage and sleep latency metrics. Conversely, the Somnolyzer showed enhanced proficiency in the analysis of REM stages, notably in measuring REM latency. The accuracy in the cross-validation set of 263 participants was also above 80 %.
The CNN-based automated deep neural network outperformed the Somnolyzer and is sufficiently accurate for sleep study analyses using the AASM classification criteria.
•The validity of a convolutional neural network and the Somnolyzer was investigated.•This is the first study to apply the Somnolyzer in an Asian population dataset.•The practicality of using different channel signals as input was verified. |
---|---|
ISSN: | 0010-4825 1879-0534 1879-0534 |
DOI: | 10.1016/j.compbiomed.2024.108855 |