SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process

In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process. A...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Wang, Mengyu, Ding, Henghui, Liew, Jun Hao, Liu, Jiajun, Zhao, Yao, Wei, Yunchao
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Wang, Mengyu Ding, Henghui Liew, Jun Hao Liu, Jiajun Zhao, Yao Wei, Yunchao
description	In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process. As a result, the refinement process can be smoothly implemented through a series of denoising diffusion steps. Specifically, SegRefiner takes coarse masks as inputs and refines them using a discrete diffusion process. By predicting the label and corresponding states-transition probabilities for each pixel, SegRefiner progressively refines the noisy masks in a conditional denoising manner. To assess the effectiveness of SegRefiner, we conduct comprehensive experiments on various segmentation tasks, including semantic segmentation, instance segmentation, and dichotomous image segmentation. The results demonstrate the superiority of our SegRefiner from multiple aspects. Firstly, it consistently improves both the segmentation metrics and boundary metrics across different types of coarse masks. Secondly, it outperforms previous model-agnostic refinement methods by a significant margin. Lastly, it exhibits a strong capability to capture extremely fine details when refining high-resolution images. The source code and trained models are available at https://github.com/MengyuWang826/SegRefiner.
doi_str_mv	10.48550/arxiv.2312.12425
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2312_12425</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312_12425</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-5c41cc4c2fa0002fd7b91d5bda026aa0262a543695ec5ca5ab322a00c23f97b43</originalsourceid><addsrcrecordid>eNotj8tOwzAQRb1hgQofwKr-gQR7bCeEXVWeUitQyYpNNPGjWGpjZBsKf0_Sspk7Vzq60iHkirNS3ijFrjH--O8SBIeSgwR1Tt7f7HZjnR9svKVtOGA0ia6DsbtisR1Cyl7TEdnbIWP2YaAneOr04PMHvfNJR5vt-Dj3lSbkNQZtU7ogZw53yV7-54y0D_ft8qlYvTw-LxerAqtaFUpLrrXU4JAxBs7UfcON6g0yqHA6gEqKqlFWK40KewEwohqEa-peihmZn2aPct1n9HuMv90k2R0lxR8L103G</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</title><source>arXiv.org</source><creator>Wang, Mengyu ; Ding, Henghui ; Liew, Jun Hao ; Liu, Jiajun ; Zhao, Yao ; Wei, Yunchao</creator><creatorcontrib>Wang, Mengyu ; Ding, Henghui ; Liew, Jun Hao ; Liu, Jiajun ; Zhao, Yao ; Wei, Yunchao</creatorcontrib><description>In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process. As a result, the refinement process can be smoothly implemented through a series of denoising diffusion steps. Specifically, SegRefiner takes coarse masks as inputs and refines them using a discrete diffusion process. By predicting the label and corresponding states-transition probabilities for each pixel, SegRefiner progressively refines the noisy masks in a conditional denoising manner. To assess the effectiveness of SegRefiner, we conduct comprehensive experiments on various segmentation tasks, including semantic segmentation, instance segmentation, and dichotomous image segmentation. The results demonstrate the superiority of our SegRefiner from multiple aspects. Firstly, it consistently improves both the segmentation metrics and boundary metrics across different types of coarse masks. Secondly, it outperforms previous model-agnostic refinement methods by a significant margin. Lastly, it exhibits a strong capability to capture extremely fine details when refining high-resolution images. The source code and trained models are available at https://github.com/MengyuWang826/SegRefiner.</description><identifier>DOI: 10.48550/arxiv.2312.12425</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-12</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,886</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2312.12425$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2312.12425$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Mengyu</creatorcontrib><creatorcontrib>Ding, Henghui</creatorcontrib><creatorcontrib>Liew, Jun Hao</creatorcontrib><creatorcontrib>Liu, Jiajun</creatorcontrib><creatorcontrib>Zhao, Yao</creatorcontrib><creatorcontrib>Wei, Yunchao</creatorcontrib><title>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</title><description>In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process. As a result, the refinement process can be smoothly implemented through a series of denoising diffusion steps. Specifically, SegRefiner takes coarse masks as inputs and refines them using a discrete diffusion process. By predicting the label and corresponding states-transition probabilities for each pixel, SegRefiner progressively refines the noisy masks in a conditional denoising manner. To assess the effectiveness of SegRefiner, we conduct comprehensive experiments on various segmentation tasks, including semantic segmentation, instance segmentation, and dichotomous image segmentation. The results demonstrate the superiority of our SegRefiner from multiple aspects. Firstly, it consistently improves both the segmentation metrics and boundary metrics across different types of coarse masks. Secondly, it outperforms previous model-agnostic refinement methods by a significant margin. Lastly, it exhibits a strong capability to capture extremely fine details when refining high-resolution images. The source code and trained models are available at https://github.com/MengyuWang826/SegRefiner.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAQRb1hgQofwKr-gQR7bCeEXVWeUitQyYpNNPGjWGpjZBsKf0_Sspk7Vzq60iHkirNS3ijFrjH--O8SBIeSgwR1Tt7f7HZjnR9svKVtOGA0ia6DsbtisR1Cyl7TEdnbIWP2YaAneOr04PMHvfNJR5vt-Dj3lSbkNQZtU7ogZw53yV7-54y0D_ft8qlYvTw-LxerAqtaFUpLrrXU4JAxBs7UfcON6g0yqHA6gEqKqlFWK40KewEwohqEa-peihmZn2aPct1n9HuMv90k2R0lxR8L103G</recordid><startdate>20231219</startdate><enddate>20231219</enddate><creator>Wang, Mengyu</creator><creator>Ding, Henghui</creator><creator>Liew, Jun Hao</creator><creator>Liu, Jiajun</creator><creator>Zhao, Yao</creator><creator>Wei, Yunchao</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231219</creationdate><title>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</title><author>Wang, Mengyu ; Ding, Henghui ; Liew, Jun Hao ; Liu, Jiajun ; Zhao, Yao ; Wei, Yunchao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-5c41cc4c2fa0002fd7b91d5bda026aa0262a543695ec5ca5ab322a00c23f97b43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang, Mengyu</creatorcontrib><creatorcontrib>Ding, Henghui</creatorcontrib><creatorcontrib>Liew, Jun Hao</creatorcontrib><creatorcontrib>Liu, Jiajun</creatorcontrib><creatorcontrib>Zhao, Yao</creatorcontrib><creatorcontrib>Wei, Yunchao</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wang, Mengyu</au><au>Ding, Henghui</au><au>Liew, Jun Hao</au><au>Liu, Jiajun</au><au>Zhao, Yao</au><au>Wei, Yunchao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</atitle><date>2023-12-19</date><risdate>2023</risdate><abstract>In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process. As a result, the refinement process can be smoothly implemented through a series of denoising diffusion steps. Specifically, SegRefiner takes coarse masks as inputs and refines them using a discrete diffusion process. By predicting the label and corresponding states-transition probabilities for each pixel, SegRefiner progressively refines the noisy masks in a conditional denoising manner. To assess the effectiveness of SegRefiner, we conduct comprehensive experiments on various segmentation tasks, including semantic segmentation, instance segmentation, and dichotomous image segmentation. The results demonstrate the superiority of our SegRefiner from multiple aspects. Firstly, it consistently improves both the segmentation metrics and boundary metrics across different types of coarse masks. Secondly, it outperforms previous model-agnostic refinement methods by a significant margin. Lastly, it exhibits a strong capability to capture extremely fine details when refining high-resolution images. The source code and trained models are available at https://github.com/MengyuWang826/SegRefiner.</abstract><doi>10.48550/arxiv.2312.12425</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2312.12425
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2312_12425
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-16T14%3A20%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SegRefiner:%20Towards%20Model-Agnostic%20Segmentation%20Refinement%20with%20Discrete%20Diffusion%20Process&rft.au=Wang,%20Mengyu&rft.date=2023-12-19&rft_id=info:doi/10.48550/arxiv.2312.12425&rft_dat=%3Carxiv_GOX%3E2312_12425%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true