Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings

Objective: This paper proposes a novel framework for lung sound event detection, segmenting continuous lung sound recordings into discrete events and performing recognition of each event. Methods: We propose the use of a multi-branch TCN architecture and exploit a novel fusion strategy to combine th...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE journal of biomedical and health informatics 2022-07, Vol.26 (7), p.2898-2908
Hauptverfasser:	Fernando, Tharindu, Sridharan, Sridha, Denman, Simon, Ghaemmaghami, Houman, Fookes, Clinton
Format:	Artikel
Sprache:	eng
Schlagworte:	adventitious sound Auscultation Benchmarks Computer architecture Convolution Convolutional neural networks electronic stethoscope Event detection Feature extraction Inhalation Lung Lung sound event detection Lungs Respiration Respiratory diseases respiratory monitor Robustness Segmentation Sound Sound recordings Spectrogram temporal convolution networks
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Objective: This paper proposes a novel framework for lung sound event detection, segmenting continuous lung sound recordings into discrete events and performing recognition of each event. Methods: We propose the use of a multi-branch TCN architecture and exploit a novel fusion strategy to combine the resultant features from these branches. This not only allows the network to retain the most salient information across different temporal granularities and disregards irrelevant information, but also allows our network to process recordings of arbitrary length. Results: The proposed method is evaluated on multiple public and in-house benchmarks, containing irregular and noisy recordings of the respiratory auscultation process for the identification of auscultation events including inhalation, crackles, and rhonchi. Moreover, we provide an end-to-end model interpretation pipeline. Conclusion: Our analysis of different feature fusion strategies shows that the proposed feature concatenation method leads to better suppression of non-informative features, which drastically reduces the classifier overhead resulting in a robust lightweight network. Significance: Lung sound event detection is a primary diagnostic step for numerous respiratory diseases. The proposed method provides a cost-effective and efficient alternative to exhaustive manual segmentation, and provides more accurate segmentation than existing methods. The end-to-end model interpretability helps to build the required trust in the system for use in clinical settings.
ISSN:	2168-2194 2168-2208
DOI:	10.1109/JBHI.2022.3144314