Severity prediction of pulmonary diseases using chest CT scans via cost-sensitive label multi-kernel distribution learning

The progression of pulmonary diseases is a complex progress. Timely predicting whether the patients will progress to the severe stage or not in its early stage is critical to take appropriate hospital treatment. However, this task suffers from the “insufficient and incomplete” data issue since it is...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers in biology and medicine 2023-06, Vol.159, p.106890-106890, Article 106890
Hauptverfasser: Wang, Xin, Wang, Jun, Shan, Fei, Zhan, Yiqiang, Shi, Jun, Shen, Dinggang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The progression of pulmonary diseases is a complex progress. Timely predicting whether the patients will progress to the severe stage or not in its early stage is critical to take appropriate hospital treatment. However, this task suffers from the “insufficient and incomplete” data issue since it is clinically impossible to have adequate training samples for one patient at each day. Besides, the training samples are extremely imbalanced since the patients who will progress to the severe stage is far less than those who will not progress to the non-severe stage. We consider the severity prediction of pulmonary diseases as a time estimation problem based on CT scans. To handle the issue of “insufficient and incomplete” training samples, we introduced label distribution learning (LDL). Specifically, we generate a label distribution for each patient, making a CT image contribute to not only the learning of its chronological day, but also the learning of its neighboring days. In addition, a cost-sensitive mechanism is introduced to explore the imbalance data issue. To identify the importance of pulmonary segments in pulmonary disease severity prediction, multi-kernel learning in composite kernel space is further incorporated and particle swarm optimization (PSO) is used to find the optimal kernel weights. We compare the performance of the proposed CS-LD-MKSVR algorithm with several classical machine learning algorithms and deep learning (DL) algorithms. The proposed method has obtained the best classification results on the in-house data, fully indicating its effectiveness in pulmonary disease severity prediction. The severity prediction of pulmonary diseases is considered as a time estimation problem, and label distribution is introduced to describe the conversion time from non-severe stage to severe stage. The cost-sensitive mechanism is also introduced to handle the data imbalance issue to further improve the classification performance. •The severity prediction of pulmonary diseases was considered as a time estimation problem, and label distribution was introduced to describe the conversion time from non-severe stage to severe stage.•CS-LD-MKSVR is proposed to deal with the problem of insufficient and incomplete data in the prediction of the severity prediction.•The cost-sensitive mechanism is introduced in CS-LD-MKSVR to handle the data imbalance issue.
ISSN:0010-4825
1879-0534
DOI:10.1016/j.compbiomed.2023.106890