Ladder Fine-tuning approach for SAM integrating complementary network

Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these gene...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-06
Hauptverfasser: Chai, Shurong, Jain, Rahul Kumar, Teng, Shiyu, Liu, Jiaqing, Li, Yinhao, Tateyama, Tomoko, Yen-wei, Chen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Chai, Shurong
Jain, Rahul Kumar
Teng, Shiyu
Liu, Jiaqing
Li, Yinhao
Tateyama, Tomoko
Yen-wei, Chen
description Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2828969037</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2828969037</sourcerecordid><originalsourceid>FETCH-proquest_journals_28289690373</originalsourceid><addsrcrecordid>eNqNjMsKgkAUQIcgSMp_GGgtTHfytYxQWtSq9jLo1TS9YzMj0d9n0Ae0OotzOAvmgZS7INkDrJhvbSeEgCiGMJQey86qqtDwvCUM3EQtNVyNo9GqvPNaG349XHhLDhuj3FeWehh7HJCcMm9O6F7aPDZsWaveov_jmm3z7HY8BfPoOaF1RacnQ7MqIIEkjVIhY_lf9QGH-ztC</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2828969037</pqid></control><display><type>article</type><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><source>Free E- Journals</source><creator>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</creator><creatorcontrib>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</creatorcontrib><description>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Business competition ; Computer vision ; Datasets ; Image segmentation ; Medical imaging</subject><ispartof>arXiv.org, 2023-06</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>781,785</link.rule.ids></links><search><creatorcontrib>Chai, Shurong</creatorcontrib><creatorcontrib>Jain, Rahul Kumar</creatorcontrib><creatorcontrib>Teng, Shiyu</creatorcontrib><creatorcontrib>Liu, Jiaqing</creatorcontrib><creatorcontrib>Li, Yinhao</creatorcontrib><creatorcontrib>Tateyama, Tomoko</creatorcontrib><creatorcontrib>Yen-wei, Chen</creatorcontrib><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><title>arXiv.org</title><description>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</description><subject>Artificial neural networks</subject><subject>Business competition</subject><subject>Computer vision</subject><subject>Datasets</subject><subject>Image segmentation</subject><subject>Medical imaging</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjMsKgkAUQIcgSMp_GGgtTHfytYxQWtSq9jLo1TS9YzMj0d9n0Ae0OotzOAvmgZS7INkDrJhvbSeEgCiGMJQey86qqtDwvCUM3EQtNVyNo9GqvPNaG349XHhLDhuj3FeWehh7HJCcMm9O6F7aPDZsWaveov_jmm3z7HY8BfPoOaF1RacnQ7MqIIEkjVIhY_lf9QGH-ztC</recordid><startdate>20230622</startdate><enddate>20230622</enddate><creator>Chai, Shurong</creator><creator>Jain, Rahul Kumar</creator><creator>Teng, Shiyu</creator><creator>Liu, Jiaqing</creator><creator>Li, Yinhao</creator><creator>Tateyama, Tomoko</creator><creator>Yen-wei, Chen</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230622</creationdate><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><author>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_28289690373</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Business competition</topic><topic>Computer vision</topic><topic>Datasets</topic><topic>Image segmentation</topic><topic>Medical imaging</topic><toplevel>online_resources</toplevel><creatorcontrib>Chai, Shurong</creatorcontrib><creatorcontrib>Jain, Rahul Kumar</creatorcontrib><creatorcontrib>Teng, Shiyu</creatorcontrib><creatorcontrib>Liu, Jiaqing</creatorcontrib><creatorcontrib>Li, Yinhao</creatorcontrib><creatorcontrib>Tateyama, Tomoko</creatorcontrib><creatorcontrib>Yen-wei, Chen</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chai, Shurong</au><au>Jain, Rahul Kumar</au><au>Teng, Shiyu</au><au>Liu, Jiaqing</au><au>Li, Yinhao</au><au>Tateyama, Tomoko</au><au>Yen-wei, Chen</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Ladder Fine-tuning approach for SAM integrating complementary network</atitle><jtitle>arXiv.org</jtitle><date>2023-06-22</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-06
issn 2331-8422
language eng
recordid cdi_proquest_journals_2828969037
source Free E- Journals
subjects Artificial neural networks
Business competition
Computer vision
Datasets
Image segmentation
Medical imaging
title Ladder Fine-tuning approach for SAM integrating complementary network
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T02%3A28%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Ladder%20Fine-tuning%20approach%20for%20SAM%20integrating%20complementary%20network&rft.jtitle=arXiv.org&rft.au=Chai,%20Shurong&rft.date=2023-06-22&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2828969037%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2828969037&rft_id=info:pmid/&rfr_iscdi=true