FDTR: Weakening feature disparity transformer for accurate multicategory computed tomography image segmentation
Accurately segmenting object structures in computed tomography (CT) images is crucial for computer-aided surgery, diagnosis, and other interdisciplinary applications. Most state-of-the-art CT segmentation methods employ a classical jump structure to integrate shallow and deep feature information. Ho...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2024-02, Vol.236, p.121297, Article 121297 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Accurately segmenting object structures in computed tomography (CT) images is crucial for computer-aided surgery, diagnosis, and other interdisciplinary applications. Most state-of-the-art CT segmentation methods employ a classical jump structure to integrate shallow and deep feature information. However, the effectiveness of the current CT image segmentation and related subtasks must still be improved due to feature disparity between different layers and information transfer loss. To alleviate the problem, we propose a novel three-dimensional multicategory segmentation model, the weakening feature disparity transformer (FDTR), based on the transformer structure for CT imaging. Firstly, to effectively capture global features, we design a vision transformer-based encoder. Secondly, to enhance the model’s information representation in a lightweight manner, we devise a nested structure of dense compression connections. Lastly, to mitigate the disparity in semantic features at shallow layers, we propose supervised semantic signals. By employing the vision transformer-based encoder and the dense connection structure, we enhance the detailed information of deep features. Additionally, the semantic supervisory signal introduces deep semantic information to the shallow features, enriching the overall feature representation. We conducted extensive experiments on the KIPA2022 dataset for multiorgan segmentation and the YC2022 dataset for volumetric trait segmentation in broiler breeding. The proposed method, FDTR, effectively enhances feature transfer between shallow and deep features, achieving optimal results compared to other advanced models and demonstrating broad prospects for application.
•We propose a multiclass 3D CT image segmentation model•We design a novel dense compression connectivity strategy.•We design a novel semantic supervised signal for quantified feature disparity.•We conducted extensive experiments on CT datasets from two different domains. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2023.121297 |