An attention-guided network for surgical instrument segmentation from endoscopic images

Accurate surgical instrument segmentation can provide the precise location and pose information to the surgeons, assisting the surgeon to accurately judge the follow-up operation during the robot-assisted surgery procedures. Due to strong context extraction ability, there have been significant advan...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers in biology and medicine 2022-12, Vol.151 (Pt A), p.106216-106216, Article 106216
Hauptverfasser: Yang, Lei, Gu, Yuge, Bian, Guibin, Liu, Yanhong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Accurate surgical instrument segmentation can provide the precise location and pose information to the surgeons, assisting the surgeon to accurately judge the follow-up operation during the robot-assisted surgery procedures. Due to strong context extraction ability, there have been significant advances in research of automatic surgical instrument segmentation, especially U-Net and its variant networks. However, there are still some problems to affect segmentation accuracy, like insufficient processing of local features, class imbalance issue, etc. To deal with these problems, with the typical encoder–decoder structure, an effective surgical instrument segmentation network is proposed for providing an end-to-end detection scheme. Specifically, aimed at the problem of insufficient processing of local features, the residual path is introduced for the full feature extraction to strengthen the backward propagation of low-level features. Further, to achieve feature enhancement of local feature maps, a non-local attention block is introduced to insert into the bottleneck layer to acquire global contexts. Besides, to highlight the pixel areas of the surgical instruments, a dual-attention module (DAM) is introduced to make full use of the high-level features extracted from decoder unit and the low-level features delivered by the encoder unit to acquire the attention features and suppress the irrelevant features. To prove the effectiveness and superiority of the proposed segmentation model, experiments are conducted on two public surgical instrument segmentation data sets, including Kvasir-instrument set and Endovis2017 set, which could acquire a 95.77% Dice score and 92.13% mIOU value on Kvasir-instrument set, and simultaneously reach 95.60% Dice score and 92.74% mIOU value on Endovis2017 set respectively. Experimental results show that the proposed segmentation model realizes a superior performance on surgical instruments in comparison to other advanced models, which could provide a good reference for further development of intelligent surgical robots. The source code is provided at https://github.com/lyangucas92/Surg_Net. •An attention-guided network is proposed for surgical instrument segmentation.•A residual path is proposed to realize effective feature representation.•A dual-attention block is proposed to highlight features of surgical instruments.•A non-local attention block is introduced to acquire global contexts.
ISSN:0010-4825
1879-0534
DOI:10.1016/j.compbiomed.2022.106216