Audio-based snore detection using deep neural networks

•We proposed an end-to-end deep neural network model (CNN+LSTM) combined with constant Q transformation to detect snore on audio data.•We used audio data recorded in a hospital (sleep lab) to train and validate our model.•We investigated the influence of microphone placement on snore detection perfo...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computer methods and programs in biomedicine 2021-03, Vol.200, p.105917-105917, Article 105917
Hauptverfasser:	Xie, Jiali, Aubert, Xavier, Long, Xi, van Dijk, Johannes, Arsenali, Bruno, Fonseca, Pedro, Overeem, Sebastiaan
Format:	Artikel
Sprache:	eng
Schlagworte:	Audio signal processing Body-position in sleep Constant Q transformation Convolutional neural network Humans Neural Networks, Computer Polysomnography Recurrent neural network Sleep Apnea, Obstructive - diagnosis Snore detection Snoring - diagnosis Sound
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•We proposed an end-to-end deep neural network model (CNN+LSTM) combined with constant Q transformation to detect snore on audio data.•We used audio data recorded in a hospital (sleep lab) to train and validate our model.•We investigated the influence of microphone placement on snore detection performance. Background and Objective: Snoring is a prevalent phenomenon. It may be benign, but can also be a symptom of obstructive sleep apnea (OSA) a prevalent sleep disorder. Accurate detection of snoring may help with screening and diagnosis of OSA. Methods: We introduce a snore detection algorithm based on the combination of a convolutional neural network (CNN) and a recurrent neural network (RNN). We obtained audio recordings of 38 subjects referred to a clinical center for a sleep study. All subjects were recorded by a total of 5 microphones placed at strategic positions around the bed. The CNN was used to extract features from the sound spectrogram, while the RNN was used to process the sequential CNN output and to classify the audio events to snore and non-snore events. We also addressed the impact of microphone placement on the performance of the algorithm. Results: The algorithm achieved an accuracy of 95.3 ± 0.5%, a sensitivity of 92.2 ± 0.9%, and a specificity of 97.7 ± 0.4% over all microphones in snore detection on our data set including 18412 sound events. The best accuracy (95.9%) was observed from the microphone placed about 70 cm above the subject's head and the worst (94.4%) was observed from the microphone placed about 130 cm above the subject's head. Conclusion: Our results suggest that our method detects snore events from audio recordings with high accuracy and that microphone placement does not have a major impact on detection performance.
ISSN:	0169-2607 1872-7565
DOI:	10.1016/j.cmpb.2020.105917