CellViT: Vision Transformers for precise cell segmentation and classification

Bibliographic Details
Published in: Medical Image Analysis, 2024-05, Vol. 94, Article 103143
Main authors: Hörst, Fabian, Rempe, Moritz, Heine, Lukas, Seibold, Constantin, Keyl, Julius, Baldini, Giulia, Ugurel, Selma, Siveke, Jens, Grünwald, Barbara, Egger, Jan, Kleesiek, Jens
Format: Article
Language: English
Online access: Full text
Description
Summary: Nuclei detection and segmentation in hematoxylin and eosin-stained (H&E) tissue images are important clinical tasks and crucial for a wide range of applications. However, the task is challenging due to variability in nuclei staining and size, overlapping boundaries, and nuclei clustering. While convolutional neural networks have been used extensively for this task, we explore the potential of Transformer-based networks combined with large-scale pre-training in this domain. We therefore introduce CellViT, a new method for automated instance segmentation of cell nuclei in digitized tissue samples, based on a Vision Transformer deep learning architecture. CellViT is trained and evaluated on the PanNuke dataset, one of the most challenging nuclei instance segmentation datasets, consisting of nearly 200,000 nuclei annotated into 5 clinically important classes across 19 tissue types. We demonstrate the superiority of large-scale in-domain and out-of-domain pre-trained Vision Transformers by leveraging the recently published Segment Anything Model and a ViT encoder pre-trained on 104 million histological image patches, achieving state-of-the-art nuclei detection and instance segmentation performance on the PanNuke dataset with a mean panoptic quality of 0.50 and an F1 detection score of 0.83. The code is publicly available at https://github.com/TIO-IKIM/CellViT.

Highlights:
•Novel U-Net-style network for nuclei segmentation using Vision Transformers (CellViT)
•Our method outperforms existing techniques and is state-of-the-art on PanNuke
•First to embed pre-trained transformer-based foundation models for nuclei segmentation
•We demonstrate generalizability on the MoNuSeg dataset without fine-tuning
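The first highlight names a U-Net-style network built on a Vision Transformer encoder. As a rough illustration of that general pattern (not the authors' implementation; the real code is in the linked repository), the PyTorch sketch below routes features from intermediate transformer stages into a convolutional upsampling decoder via skip connections. The class name ViTUNetSketch, all layer sizes, the four-stage split, and the single segmentation head are illustrative assumptions; CellViT's actual decoder branches, pre-trained encoders, and training losses differ.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class ViTUNetSketch(nn.Module):
        """Toy U-Net-style ViT: transformer encoder stages feed a
        convolutional upsampling decoder through skip connections."""

        def __init__(self, img_size=256, patch_size=16, dim=256,
                     heads=8, num_classes=6):  # e.g. 5 nuclei classes + background
            super().__init__()
            self.grid = img_size // patch_size            # tokens per side
            self.embed = nn.Conv2d(3, dim, patch_size, patch_size)
            self.pos = nn.Parameter(torch.zeros(1, self.grid ** 2, dim))
            layer = nn.TransformerEncoderLayer(dim, heads, dim * 4,
                                               batch_first=True)
            # Four encoder stages; each one contributes a skip feature map.
            self.stages = nn.ModuleList(
                nn.TransformerEncoder(layer, num_layers=2) for _ in range(4))
            self.stem = nn.Conv2d(3, dim, 3, padding=1)   # full-resolution skip
            self.up = nn.ModuleList(
                nn.ConvTranspose2d(dim, dim, 2, 2) for _ in range(4))
            self.fuse = nn.ModuleList(
                nn.Conv2d(dim * 2, dim, 3, padding=1) for _ in range(4))
            self.head = nn.Conv2d(dim, num_classes, 1)

        def tokens_to_map(self, t):
            b, n, d = t.shape                             # (batch, tokens, dim)
            return t.transpose(1, 2).reshape(b, d, self.grid, self.grid)

        def forward(self, x):
            t = self.embed(x).flatten(2).transpose(1, 2) + self.pos
            feats = []
            for stage in self.stages:                     # encoder pass
                t = stage(t)
                feats.append(self.tokens_to_map(t))
            # Decode from the deepest stage, fusing shallower skips
            # (and a full-resolution stem) while upsampling 16x.
            skips = [feats[2], feats[1], feats[0], self.stem(x)]
            f = feats[3]
            for up, fuse, s in zip(self.up, self.fuse, skips):
                f = up(f)                                 # double the resolution
                s = F.interpolate(s, size=f.shape[-2:], mode="bilinear",
                                  align_corners=False)
                f = torch.relu(fuse(torch.cat([f, s], dim=1)))
            return self.head(f)                           # per-pixel class logits

    model = ViTUNetSketch()
    logits = model(torch.randn(1, 3, 256, 256))           # -> (1, 6, 256, 256)

Fusing features from several encoder depths, rather than only the final tokens, is what makes the design U-Net-style: the decoder can recover sharp nuclei boundaries that coarse 16x16-pixel patch tokens alone would blur.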
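For context on the reported numbers: panoptic quality (PQ) is the standard instance segmentation metric on this benchmark and jointly scores how well instances are detected and how well matched instances are segmented. Predicted and ground-truth nuclei are matched as true positives (TP) when their intersection-over-union exceeds 0.5:

    \[
    \mathrm{PQ} =
      \underbrace{\frac{\sum_{(p,g)\in \mathit{TP}} \mathrm{IoU}(p,g)}{|\mathit{TP}|}}_{\text{segmentation quality}}
      \times
      \underbrace{\frac{|\mathit{TP}|}{|\mathit{TP}| + \tfrac{1}{2}|\mathit{FP}| + \tfrac{1}{2}|\mathit{FN}|}}_{\text{detection quality}}
    \]

The F1 detection score of 0.83 is the usual \( F_1 = 2\,\mathit{TP} / (2\,\mathit{TP} + \mathit{FP} + \mathit{FN}) \) computed over matched nuclei, independent of mask quality.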
ISSN: 1361-8415, 1361-8423
DOI: 10.1016/j.media.2024.103143