Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection

In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ghamsarian, Negin, El-Shabrawi, Yosuf, Nasirihaghighi, Sahar, Putzgruber-Adamitsch, Doris, Zinkernagel, Martin, Wolf, Sebastian, Schoeffmann, Klaus, Sznitman, Raphael
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Ghamsarian, Negin
El-Shabrawi, Yosuf
Nasirihaghighi, Sahar
Putzgruber-Adamitsch, Doris
Zinkernagel, Martin
Wolf, Sebastian
Schoeffmann, Klaus
Sznitman, Raphael
description In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However, the progression of deep-learning-powered surgical technologies is profoundly reliant on large-scale datasets and annotations. Particularly, surgical scene understanding and phase recognition stand as pivotal pillars within the realm of computer-assisted surgery and post-operative assessment of cataract surgery videos. In this context, we present the largest cataract surgery video dataset that addresses diverse requisites for constructing computerized surgical workflow analysis and detecting post-operative irregularities in cataract surgery. We validate the quality of annotations by benchmarking the performance of several state-of-the-art neural network architectures for phase recognition and surgical scene segmentation. Besides, we initiate the research on domain adaptation for instrument segmentation in cataract surgery by evaluating cross-domain instrument segmentation performance in cataract surgery videos. The dataset and annotations will be publicly available upon acceptance of the paper.
doi_str_mv 10.48550/arxiv.2312.06295
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2312_06295</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312_06295</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-a6662394b07ddd8e8d3f293a37c54cca8a6bbe1d01d6932d334ac3fcb03f1bf03</originalsourceid><addsrcrecordid>eNo1j81OwzAQhH3hgAoPwAk_AAm2N3ESbij8Va0EIr1HG3sdIloHuS6ib0_awmVGMyON9DF2JUWalXkubjH8DN-pAqlSoVWVn7PPGiMGNDGRizv-H3izCz2FPX-Yii1F7sbAG0OeeEP9hnzEOIz-hr99TDN_JzP2fjhV6C2fh0D9bo1hiNMHRTKH7YKdOVxv6fLPZ2z19LiqX5Ll6_O8vl8mqIt8Eq0VVFknCmttSaUFpypAKEyeGYMl6q4jaYW0ugJlATI04EwnwMnOCZix69Ptkbb9CsMGw749ULdHavgFiXVTnw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</title><source>arXiv.org</source><creator>Ghamsarian, Negin ; El-Shabrawi, Yosuf ; Nasirihaghighi, Sahar ; Putzgruber-Adamitsch, Doris ; Zinkernagel, Martin ; Wolf, Sebastian ; Schoeffmann, Klaus ; Sznitman, Raphael</creator><creatorcontrib>Ghamsarian, Negin ; El-Shabrawi, Yosuf ; Nasirihaghighi, Sahar ; Putzgruber-Adamitsch, Doris ; Zinkernagel, Martin ; Wolf, Sebastian ; Schoeffmann, Klaus ; Sznitman, Raphael</creatorcontrib><description>In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However, the progression of deep-learning-powered surgical technologies is profoundly reliant on large-scale datasets and annotations. Particularly, surgical scene understanding and phase recognition stand as pivotal pillars within the realm of computer-assisted surgery and post-operative assessment of cataract surgery videos. In this context, we present the largest cataract surgery video dataset that addresses diverse requisites for constructing computerized surgical workflow analysis and detecting post-operative irregularities in cataract surgery. We validate the quality of annotations by benchmarking the performance of several state-of-the-art neural network architectures for phase recognition and surgical scene segmentation. Besides, we initiate the research on domain adaptation for instrument segmentation in cataract surgery by evaluating cross-domain instrument segmentation performance in cataract surgery videos. The dataset and annotations will be publicly available upon acceptance of the paper.</description><identifier>DOI: 10.48550/arxiv.2312.06295</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-12</creationdate><rights>http://creativecommons.org/licenses/by-nc-nd/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2312.06295$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2312.06295$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Ghamsarian, Negin</creatorcontrib><creatorcontrib>El-Shabrawi, Yosuf</creatorcontrib><creatorcontrib>Nasirihaghighi, Sahar</creatorcontrib><creatorcontrib>Putzgruber-Adamitsch, Doris</creatorcontrib><creatorcontrib>Zinkernagel, Martin</creatorcontrib><creatorcontrib>Wolf, Sebastian</creatorcontrib><creatorcontrib>Schoeffmann, Klaus</creatorcontrib><creatorcontrib>Sznitman, Raphael</creatorcontrib><title>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</title><description>In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However, the progression of deep-learning-powered surgical technologies is profoundly reliant on large-scale datasets and annotations. Particularly, surgical scene understanding and phase recognition stand as pivotal pillars within the realm of computer-assisted surgery and post-operative assessment of cataract surgery videos. In this context, we present the largest cataract surgery video dataset that addresses diverse requisites for constructing computerized surgical workflow analysis and detecting post-operative irregularities in cataract surgery. We validate the quality of annotations by benchmarking the performance of several state-of-the-art neural network architectures for phase recognition and surgical scene segmentation. Besides, we initiate the research on domain adaptation for instrument segmentation in cataract surgery by evaluating cross-domain instrument segmentation performance in cataract surgery videos. The dataset and annotations will be publicly available upon acceptance of the paper.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNo1j81OwzAQhH3hgAoPwAk_AAm2N3ESbij8Va0EIr1HG3sdIloHuS6ib0_awmVGMyON9DF2JUWalXkubjH8DN-pAqlSoVWVn7PPGiMGNDGRizv-H3izCz2FPX-Yii1F7sbAG0OeeEP9hnzEOIz-hr99TDN_JzP2fjhV6C2fh0D9bo1hiNMHRTKH7YKdOVxv6fLPZ2z19LiqX5Ll6_O8vl8mqIt8Eq0VVFknCmttSaUFpypAKEyeGYMl6q4jaYW0ugJlATI04EwnwMnOCZix69Ptkbb9CsMGw749ULdHavgFiXVTnw</recordid><startdate>20231211</startdate><enddate>20231211</enddate><creator>Ghamsarian, Negin</creator><creator>El-Shabrawi, Yosuf</creator><creator>Nasirihaghighi, Sahar</creator><creator>Putzgruber-Adamitsch, Doris</creator><creator>Zinkernagel, Martin</creator><creator>Wolf, Sebastian</creator><creator>Schoeffmann, Klaus</creator><creator>Sznitman, Raphael</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231211</creationdate><title>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</title><author>Ghamsarian, Negin ; El-Shabrawi, Yosuf ; Nasirihaghighi, Sahar ; Putzgruber-Adamitsch, Doris ; Zinkernagel, Martin ; Wolf, Sebastian ; Schoeffmann, Klaus ; Sznitman, Raphael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-a6662394b07ddd8e8d3f293a37c54cca8a6bbe1d01d6932d334ac3fcb03f1bf03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Ghamsarian, Negin</creatorcontrib><creatorcontrib>El-Shabrawi, Yosuf</creatorcontrib><creatorcontrib>Nasirihaghighi, Sahar</creatorcontrib><creatorcontrib>Putzgruber-Adamitsch, Doris</creatorcontrib><creatorcontrib>Zinkernagel, Martin</creatorcontrib><creatorcontrib>Wolf, Sebastian</creatorcontrib><creatorcontrib>Schoeffmann, Klaus</creatorcontrib><creatorcontrib>Sznitman, Raphael</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghamsarian, Negin</au><au>El-Shabrawi, Yosuf</au><au>Nasirihaghighi, Sahar</au><au>Putzgruber-Adamitsch, Doris</au><au>Zinkernagel, Martin</au><au>Wolf, Sebastian</au><au>Schoeffmann, Klaus</au><au>Sznitman, Raphael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</atitle><date>2023-12-11</date><risdate>2023</risdate><abstract>In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However, the progression of deep-learning-powered surgical technologies is profoundly reliant on large-scale datasets and annotations. Particularly, surgical scene understanding and phase recognition stand as pivotal pillars within the realm of computer-assisted surgery and post-operative assessment of cataract surgery videos. In this context, we present the largest cataract surgery video dataset that addresses diverse requisites for constructing computerized surgical workflow analysis and detecting post-operative irregularities in cataract surgery. We validate the quality of annotations by benchmarking the performance of several state-of-the-art neural network architectures for phase recognition and surgical scene segmentation. Besides, we initiate the research on domain adaptation for instrument segmentation in cataract surgery by evaluating cross-domain instrument segmentation performance in cataract surgery videos. The dataset and annotations will be publicly available upon acceptance of the paper.</abstract><doi>10.48550/arxiv.2312.06295</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2312.06295
ispartof
issn
language eng
recordid cdi_arxiv_primary_2312_06295
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T18%3A31%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cataract-1K:%20Cataract%20Surgery%20Dataset%20for%20Scene%20Segmentation,%20Phase%20Recognition,%20and%20Irregularity%20Detection&rft.au=Ghamsarian,%20Negin&rft.date=2023-12-11&rft_id=info:doi/10.48550/arxiv.2312.06295&rft_dat=%3Carxiv_GOX%3E2312_06295%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true