Ladder Fine-tuning approach for SAM integrating complementary network

Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these gene...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2023-06
Hauptverfasser:	Chai, Shurong, Jain, Rahul Kumar, Teng, Shiyu, Liu, Jiaqing, Li, Yinhao, Tateyama, Tomoko, Yen-wei, Chen
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Business competition Computer vision Datasets Image segmentation Medical imaging
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Chai, Shurong Jain, Rahul Kumar Teng, Shiyu Liu, Jiaqing Li, Yinhao Tateyama, Tomoko Yen-wei, Chen
description	Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2828969037</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2828969037</sourcerecordid><originalsourceid>FETCH-proquest_journals_28289690373</originalsourceid><addsrcrecordid>eNqNjMsKgkAUQIcgSMp_GGgtTHfytYxQWtSq9jLo1TS9YzMj0d9n0Ae0OotzOAvmgZS7INkDrJhvbSeEgCiGMJQey86qqtDwvCUM3EQtNVyNo9GqvPNaG349XHhLDhuj3FeWehh7HJCcMm9O6F7aPDZsWaveov_jmm3z7HY8BfPoOaF1RacnQ7MqIIEkjVIhY_lf9QGH-ztC</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2828969037</pqid></control><display><type>article</type><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><source>Free E- Journals</source><creator>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</creator><creatorcontrib>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</creatorcontrib><description>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Business competition ; Computer vision ; Datasets ; Image segmentation ; Medical imaging</subject><ispartof>arXiv.org, 2023-06</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>781,785</link.rule.ids></links><search><creatorcontrib>Chai, Shurong</creatorcontrib><creatorcontrib>Jain, Rahul Kumar</creatorcontrib><creatorcontrib>Teng, Shiyu</creatorcontrib><creatorcontrib>Liu, Jiaqing</creatorcontrib><creatorcontrib>Li, Yinhao</creatorcontrib><creatorcontrib>Tateyama, Tomoko</creatorcontrib><creatorcontrib>Yen-wei, Chen</creatorcontrib><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><title>arXiv.org</title><description>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</description><subject>Artificial neural networks</subject><subject>Business competition</subject><subject>Computer vision</subject><subject>Datasets</subject><subject>Image segmentation</subject><subject>Medical imaging</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjMsKgkAUQIcgSMp_GGgtTHfytYxQWtSq9jLo1TS9YzMj0d9n0Ae0OotzOAvmgZS7INkDrJhvbSeEgCiGMJQey86qqtDwvCUM3EQtNVyNo9GqvPNaG349XHhLDhuj3FeWehh7HJCcMm9O6F7aPDZsWaveov_jmm3z7HY8BfPoOaF1RacnQ7MqIIEkjVIhY_lf9QGH-ztC</recordid><startdate>20230622</startdate><enddate>20230622</enddate><creator>Chai, Shurong</creator><creator>Jain, Rahul Kumar</creator><creator>Teng, Shiyu</creator><creator>Liu, Jiaqing</creator><creator>Li, Yinhao</creator><creator>Tateyama, Tomoko</creator><creator>Yen-wei, Chen</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230622</creationdate><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><author>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_28289690373</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Business competition</topic><topic>Computer vision</topic><topic>Datasets</topic><topic>Image segmentation</topic><topic>Medical imaging</topic><toplevel>online_resources</toplevel><creatorcontrib>Chai, Shurong</creatorcontrib><creatorcontrib>Jain, Rahul Kumar</creatorcontrib><creatorcontrib>Teng, Shiyu</creatorcontrib><creatorcontrib>Liu, Jiaqing</creatorcontrib><creatorcontrib>Li, Yinhao</creatorcontrib><creatorcontrib>Tateyama, Tomoko</creatorcontrib><creatorcontrib>Yen-wei, Chen</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chai, Shurong</au><au>Jain, Rahul Kumar</au><au>Teng, Shiyu</au><au>Liu, Jiaqing</au><au>Li, Yinhao</au><au>Tateyama, Tomoko</au><au>Yen-wei, Chen</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Ladder Fine-tuning approach for SAM integrating complementary network</atitle><jtitle>arXiv.org</jtitle><date>2023-06-22</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2023-06
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2828969037
source	Free E- Journals
subjects	Artificial neural networks Business competition Computer vision Datasets Image segmentation Medical imaging
title	Ladder Fine-tuning approach for SAM integrating complementary network
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T02%3A28%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Ladder%20Fine-tuning%20approach%20for%20SAM%20integrating%20complementary%20network&rft.jtitle=arXiv.org&rft.au=Chai,%20Shurong&rft.date=2023-06-22&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2828969037%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2828969037&rft_id=info:pmid/&rfr_iscdi=true