MIST: Multi-instance selective transformer for histopathological subtype prediction

Accurate histopathological subtype prediction is clinically significant for cancer diagnosis and tumor microenvironment analysis. However, achieving accurate histopathological subtype prediction is a challenging task due to (1) instance-level discrimination of histopathological images, (2) low inter...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Medical image analysis 2024-10, Vol.97, p.103251, Article 103251
Hauptverfasser:	Zhao, Rongchang, Xi, Zijun, Liu, Huanchi, Jian, Xiangkun, Zhang, Jian, Zhang, Zijian, Li, Shuo
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Feature decoupling Histopathological subtype prediction Humans Image Interpretation, Computer-Assisted - methods Information bottleneck Machine Learning Multi-instance learning Neoplasms - diagnostic imaging Neoplasms - pathology Self-attention Tumor Microenvironment
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Accurate histopathological subtype prediction is clinically significant for cancer diagnosis and tumor microenvironment analysis. However, achieving accurate histopathological subtype prediction is a challenging task due to (1) instance-level discrimination of histopathological images, (2) low inter-class and large intra-class variances among histopathological images in their shape and chromatin texture, and (3) heterogeneous feature distribution over different images. In this paper, we formulate subtype prediction as fine-grained representation learning and propose a novel multi-instance selective transformer (MIST) framework, effectively achieving accurate histopathological subtype prediction. The proposed MIST designs an effective selective self-attention mechanism with multi-instance learning (MIL) and vision transformer (ViT) to adaptive identify informative instances for fine-grained representation. Innovatively, the MIST entrusts each instance with different contributions to the bag representation based on its interactions with instances and bags. Specifically, a SiT module with selective multi-head self-attention (S-MSA) is well-designed to identify the representative instances by modeling the instance-to-instance interactions. On the contrary, a MIFD module with the information bottleneck is proposed to learn the discriminative fine-grained representation for histopathological images by modeling instance-to-bag interactions with the selected instances. Substantial experiments on five clinical benchmarks demonstrate that the MIST achieves accurate histopathological subtype prediction and obtains state-of-the-art performance with an accuracy of 0.936. The MIST shows great potential to handle fine-grained medical image analysis, such as histopathological subtype prediction in clinical applications. [Display omitted] •A novel multi-instance selective transformer is proposed for the first time to formulate histopathological subtype prediction as fine-grained representation learning.•A selective instance transformer (SiT) is proposed to learn the instance- level fine-grained representation in histopathological subtype prediction by selecting the representative instances with a self-attention learning paradigm.•A multiple instance feature decoupling (MIFD) is proposed to leverage information bottleneck into the fine-grained representation learning and conduct accurate histopathological subtype prediction.
ISSN:	1361-8415 1361-8423 1361-8423
DOI:	10.1016/j.media.2024.103251