Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection
In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However,...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Ghamsarian, Negin El-Shabrawi, Yosuf Nasirihaghighi, Sahar Putzgruber-Adamitsch, Doris Zinkernagel, Martin Wolf, Sebastian Schoeffmann, Klaus Sznitman, Raphael |
description | In recent years, the landscape of computer-assisted interventions and
post-operative surgical video analysis has been dramatically reshaped by
deep-learning techniques, resulting in significant advancements in surgeons'
skills, operation room management, and overall surgical outcomes. However, the
progression of deep-learning-powered surgical technologies is profoundly
reliant on large-scale datasets and annotations. Particularly, surgical scene
understanding and phase recognition stand as pivotal pillars within the realm
of computer-assisted surgery and post-operative assessment of cataract surgery
videos. In this context, we present the largest cataract surgery video dataset
that addresses diverse requisites for constructing computerized surgical
workflow analysis and detecting post-operative irregularities in cataract
surgery. We validate the quality of annotations by benchmarking the performance
of several state-of-the-art neural network architectures for phase recognition
and surgical scene segmentation. Besides, we initiate the research on domain
adaptation for instrument segmentation in cataract surgery by evaluating
cross-domain instrument segmentation performance in cataract surgery videos.
The dataset and annotations will be publicly available upon acceptance of the
paper. |
doi_str_mv | 10.48550/arxiv.2312.06295 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2312_06295</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312_06295</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-a6662394b07ddd8e8d3f293a37c54cca8a6bbe1d01d6932d334ac3fcb03f1bf03</originalsourceid><addsrcrecordid>eNo1j81OwzAQhH3hgAoPwAk_AAm2N3ESbij8Va0EIr1HG3sdIloHuS6ib0_awmVGMyON9DF2JUWalXkubjH8DN-pAqlSoVWVn7PPGiMGNDGRizv-H3izCz2FPX-Yii1F7sbAG0OeeEP9hnzEOIz-hr99TDN_JzP2fjhV6C2fh0D9bo1hiNMHRTKH7YKdOVxv6fLPZ2z19LiqX5Ll6_O8vl8mqIt8Eq0VVFknCmttSaUFpypAKEyeGYMl6q4jaYW0ugJlATI04EwnwMnOCZix69Ptkbb9CsMGw749ULdHavgFiXVTnw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</title><source>arXiv.org</source><creator>Ghamsarian, Negin ; El-Shabrawi, Yosuf ; Nasirihaghighi, Sahar ; Putzgruber-Adamitsch, Doris ; Zinkernagel, Martin ; Wolf, Sebastian ; Schoeffmann, Klaus ; Sznitman, Raphael</creator><creatorcontrib>Ghamsarian, Negin ; El-Shabrawi, Yosuf ; Nasirihaghighi, Sahar ; Putzgruber-Adamitsch, Doris ; Zinkernagel, Martin ; Wolf, Sebastian ; Schoeffmann, Klaus ; Sznitman, Raphael</creatorcontrib><description>In recent years, the landscape of computer-assisted interventions and
post-operative surgical video analysis has been dramatically reshaped by
deep-learning techniques, resulting in significant advancements in surgeons'
skills, operation room management, and overall surgical outcomes. However, the
progression of deep-learning-powered surgical technologies is profoundly
reliant on large-scale datasets and annotations. Particularly, surgical scene
understanding and phase recognition stand as pivotal pillars within the realm
of computer-assisted surgery and post-operative assessment of cataract surgery
videos. In this context, we present the largest cataract surgery video dataset
that addresses diverse requisites for constructing computerized surgical
workflow analysis and detecting post-operative irregularities in cataract
surgery. We validate the quality of annotations by benchmarking the performance
of several state-of-the-art neural network architectures for phase recognition
and surgical scene segmentation. Besides, we initiate the research on domain
adaptation for instrument segmentation in cataract surgery by evaluating
cross-domain instrument segmentation performance in cataract surgery videos.
The dataset and annotations will be publicly available upon acceptance of the
paper.</description><identifier>DOI: 10.48550/arxiv.2312.06295</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-12</creationdate><rights>http://creativecommons.org/licenses/by-nc-nd/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2312.06295$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2312.06295$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Ghamsarian, Negin</creatorcontrib><creatorcontrib>El-Shabrawi, Yosuf</creatorcontrib><creatorcontrib>Nasirihaghighi, Sahar</creatorcontrib><creatorcontrib>Putzgruber-Adamitsch, Doris</creatorcontrib><creatorcontrib>Zinkernagel, Martin</creatorcontrib><creatorcontrib>Wolf, Sebastian</creatorcontrib><creatorcontrib>Schoeffmann, Klaus</creatorcontrib><creatorcontrib>Sznitman, Raphael</creatorcontrib><title>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</title><description>In recent years, the landscape of computer-assisted interventions and
post-operative surgical video analysis has been dramatically reshaped by
deep-learning techniques, resulting in significant advancements in surgeons'
skills, operation room management, and overall surgical outcomes. However, the
progression of deep-learning-powered surgical technologies is profoundly
reliant on large-scale datasets and annotations. Particularly, surgical scene
understanding and phase recognition stand as pivotal pillars within the realm
of computer-assisted surgery and post-operative assessment of cataract surgery
videos. In this context, we present the largest cataract surgery video dataset
that addresses diverse requisites for constructing computerized surgical
workflow analysis and detecting post-operative irregularities in cataract
surgery. We validate the quality of annotations by benchmarking the performance
of several state-of-the-art neural network architectures for phase recognition
and surgical scene segmentation. Besides, we initiate the research on domain
adaptation for instrument segmentation in cataract surgery by evaluating
cross-domain instrument segmentation performance in cataract surgery videos.
The dataset and annotations will be publicly available upon acceptance of the
paper.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNo1j81OwzAQhH3hgAoPwAk_AAm2N3ESbij8Va0EIr1HG3sdIloHuS6ib0_awmVGMyON9DF2JUWalXkubjH8DN-pAqlSoVWVn7PPGiMGNDGRizv-H3izCz2FPX-Yii1F7sbAG0OeeEP9hnzEOIz-hr99TDN_JzP2fjhV6C2fh0D9bo1hiNMHRTKH7YKdOVxv6fLPZ2z19LiqX5Ll6_O8vl8mqIt8Eq0VVFknCmttSaUFpypAKEyeGYMl6q4jaYW0ugJlATI04EwnwMnOCZix69Ptkbb9CsMGw749ULdHavgFiXVTnw</recordid><startdate>20231211</startdate><enddate>20231211</enddate><creator>Ghamsarian, Negin</creator><creator>El-Shabrawi, Yosuf</creator><creator>Nasirihaghighi, Sahar</creator><creator>Putzgruber-Adamitsch, Doris</creator><creator>Zinkernagel, Martin</creator><creator>Wolf, Sebastian</creator><creator>Schoeffmann, Klaus</creator><creator>Sznitman, Raphael</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231211</creationdate><title>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</title><author>Ghamsarian, Negin ; El-Shabrawi, Yosuf ; Nasirihaghighi, Sahar ; Putzgruber-Adamitsch, Doris ; Zinkernagel, Martin ; Wolf, Sebastian ; Schoeffmann, Klaus ; Sznitman, Raphael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-a6662394b07ddd8e8d3f293a37c54cca8a6bbe1d01d6932d334ac3fcb03f1bf03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Ghamsarian, Negin</creatorcontrib><creatorcontrib>El-Shabrawi, Yosuf</creatorcontrib><creatorcontrib>Nasirihaghighi, Sahar</creatorcontrib><creatorcontrib>Putzgruber-Adamitsch, Doris</creatorcontrib><creatorcontrib>Zinkernagel, Martin</creatorcontrib><creatorcontrib>Wolf, Sebastian</creatorcontrib><creatorcontrib>Schoeffmann, Klaus</creatorcontrib><creatorcontrib>Sznitman, Raphael</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghamsarian, Negin</au><au>El-Shabrawi, Yosuf</au><au>Nasirihaghighi, Sahar</au><au>Putzgruber-Adamitsch, Doris</au><au>Zinkernagel, Martin</au><au>Wolf, Sebastian</au><au>Schoeffmann, Klaus</au><au>Sznitman, Raphael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection</atitle><date>2023-12-11</date><risdate>2023</risdate><abstract>In recent years, the landscape of computer-assisted interventions and
post-operative surgical video analysis has been dramatically reshaped by
deep-learning techniques, resulting in significant advancements in surgeons'
skills, operation room management, and overall surgical outcomes. However, the
progression of deep-learning-powered surgical technologies is profoundly
reliant on large-scale datasets and annotations. Particularly, surgical scene
understanding and phase recognition stand as pivotal pillars within the realm
of computer-assisted surgery and post-operative assessment of cataract surgery
videos. In this context, we present the largest cataract surgery video dataset
that addresses diverse requisites for constructing computerized surgical
workflow analysis and detecting post-operative irregularities in cataract
surgery. We validate the quality of annotations by benchmarking the performance
of several state-of-the-art neural network architectures for phase recognition
and surgical scene segmentation. Besides, we initiate the research on domain
adaptation for instrument segmentation in cataract surgery by evaluating
cross-domain instrument segmentation performance in cataract surgery videos.
The dataset and annotations will be publicly available upon acceptance of the
paper.</abstract><doi>10.48550/arxiv.2312.06295</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2312.06295 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2312_06295 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T18%3A31%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cataract-1K:%20Cataract%20Surgery%20Dataset%20for%20Scene%20Segmentation,%20Phase%20Recognition,%20and%20Irregularity%20Detection&rft.au=Ghamsarian,%20Negin&rft.date=2023-12-11&rft_id=info:doi/10.48550/arxiv.2312.06295&rft_dat=%3Carxiv_GOX%3E2312_06295%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |