Ladder Fine-tuning approach for SAM integrating complementary network
Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these gene...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2023-06 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Chai, Shurong Jain, Rahul Kumar Teng, Shiyu Liu, Jiaqing Li, Yinhao Tateyama, Tomoko Yen-wei, Chen |
description | Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2828969037</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2828969037</sourcerecordid><originalsourceid>FETCH-proquest_journals_28289690373</originalsourceid><addsrcrecordid>eNqNjMsKgkAUQIcgSMp_GGgtTHfytYxQWtSq9jLo1TS9YzMj0d9n0Ae0OotzOAvmgZS7INkDrJhvbSeEgCiGMJQey86qqtDwvCUM3EQtNVyNo9GqvPNaG349XHhLDhuj3FeWehh7HJCcMm9O6F7aPDZsWaveov_jmm3z7HY8BfPoOaF1RacnQ7MqIIEkjVIhY_lf9QGH-ztC</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2828969037</pqid></control><display><type>article</type><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><source>Free E- Journals</source><creator>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</creator><creatorcontrib>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</creatorcontrib><description>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Business competition ; Computer vision ; Datasets ; Image segmentation ; Medical imaging</subject><ispartof>arXiv.org, 2023-06</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>781,785</link.rule.ids></links><search><creatorcontrib>Chai, Shurong</creatorcontrib><creatorcontrib>Jain, Rahul Kumar</creatorcontrib><creatorcontrib>Teng, Shiyu</creatorcontrib><creatorcontrib>Liu, Jiaqing</creatorcontrib><creatorcontrib>Li, Yinhao</creatorcontrib><creatorcontrib>Tateyama, Tomoko</creatorcontrib><creatorcontrib>Yen-wei, Chen</creatorcontrib><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><title>arXiv.org</title><description>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</description><subject>Artificial neural networks</subject><subject>Business competition</subject><subject>Computer vision</subject><subject>Datasets</subject><subject>Image segmentation</subject><subject>Medical imaging</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjMsKgkAUQIcgSMp_GGgtTHfytYxQWtSq9jLo1TS9YzMj0d9n0Ae0OotzOAvmgZS7INkDrJhvbSeEgCiGMJQey86qqtDwvCUM3EQtNVyNo9GqvPNaG349XHhLDhuj3FeWehh7HJCcMm9O6F7aPDZsWaveov_jmm3z7HY8BfPoOaF1RacnQ7MqIIEkjVIhY_lf9QGH-ztC</recordid><startdate>20230622</startdate><enddate>20230622</enddate><creator>Chai, Shurong</creator><creator>Jain, Rahul Kumar</creator><creator>Teng, Shiyu</creator><creator>Liu, Jiaqing</creator><creator>Li, Yinhao</creator><creator>Tateyama, Tomoko</creator><creator>Yen-wei, Chen</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230622</creationdate><title>Ladder Fine-tuning approach for SAM integrating complementary network</title><author>Chai, Shurong ; Jain, Rahul Kumar ; Teng, Shiyu ; Liu, Jiaqing ; Li, Yinhao ; Tateyama, Tomoko ; Yen-wei, Chen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_28289690373</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Business competition</topic><topic>Computer vision</topic><topic>Datasets</topic><topic>Image segmentation</topic><topic>Medical imaging</topic><toplevel>online_resources</toplevel><creatorcontrib>Chai, Shurong</creatorcontrib><creatorcontrib>Jain, Rahul Kumar</creatorcontrib><creatorcontrib>Teng, Shiyu</creatorcontrib><creatorcontrib>Liu, Jiaqing</creatorcontrib><creatorcontrib>Li, Yinhao</creatorcontrib><creatorcontrib>Tateyama, Tomoko</creatorcontrib><creatorcontrib>Yen-wei, Chen</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chai, Shurong</au><au>Jain, Rahul Kumar</au><au>Teng, Shiyu</au><au>Liu, Jiaqing</au><au>Li, Yinhao</au><au>Tateyama, Tomoko</au><au>Yen-wei, Chen</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Ladder Fine-tuning approach for SAM integrating complementary network</atitle><jtitle>arXiv.org</jtitle><date>2023-06-22</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack of training samples due to privacy concerns and other factors presents a major challenge for applying these generalized models to medical image segmentation task. To address this issue, the effective fine tuning of these models is crucial to ensure their optimal utilization. In this study, we propose to combine a complementary Convolutional Neural Network (CNN) along with the standard SAM network for medical image segmentation. To reduce the burden of fine tuning large foundation model and implement cost-efficient trainnig scheme, we focus only on fine-tuning the additional CNN network and SAM decoder part. This strategy significantly reduces trainnig time and achieves competitive results on publicly available dataset. The code is available at https://github.com/11yxk/SAM-LST.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2023-06 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2828969037 |
source | Free E- Journals |
subjects | Artificial neural networks Business competition Computer vision Datasets Image segmentation Medical imaging |
title | Ladder Fine-tuning approach for SAM integrating complementary network |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T02%3A28%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Ladder%20Fine-tuning%20approach%20for%20SAM%20integrating%20complementary%20network&rft.jtitle=arXiv.org&rft.au=Chai,%20Shurong&rft.date=2023-06-22&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2828969037%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2828969037&rft_id=info:pmid/&rfr_iscdi=true |