SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process. A...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Wang, Mengyu Ding, Henghui Liew, Jun Hao Liu, Jiajun Zhao, Yao Wei, Yunchao |
description | In this paper, we explore a principal way to enhance the quality of object
masks produced by different segmentation models. We propose a model-agnostic
solution called SegRefiner, which offers a novel perspective on this problem by
interpreting segmentation refinement as a data generation process. As a result,
the refinement process can be smoothly implemented through a series of
denoising diffusion steps. Specifically, SegRefiner takes coarse masks as
inputs and refines them using a discrete diffusion process. By predicting the
label and corresponding states-transition probabilities for each pixel,
SegRefiner progressively refines the noisy masks in a conditional denoising
manner. To assess the effectiveness of SegRefiner, we conduct comprehensive
experiments on various segmentation tasks, including semantic segmentation,
instance segmentation, and dichotomous image segmentation. The results
demonstrate the superiority of our SegRefiner from multiple aspects. Firstly,
it consistently improves both the segmentation metrics and boundary metrics
across different types of coarse masks. Secondly, it outperforms previous
model-agnostic refinement methods by a significant margin. Lastly, it exhibits
a strong capability to capture extremely fine details when refining
high-resolution images. The source code and trained models are available at
https://github.com/MengyuWang826/SegRefiner. |
doi_str_mv | 10.48550/arxiv.2312.12425 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2312_12425</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312_12425</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-5c41cc4c2fa0002fd7b91d5bda026aa0262a543695ec5ca5ab322a00c23f97b43</originalsourceid><addsrcrecordid>eNotj8tOwzAQRb1hgQofwKr-gQR7bCeEXVWeUitQyYpNNPGjWGpjZBsKf0_Sspk7Vzq60iHkirNS3ijFrjH--O8SBIeSgwR1Tt7f7HZjnR9svKVtOGA0ia6DsbtisR1Cyl7TEdnbIWP2YaAneOr04PMHvfNJR5vt-Dj3lSbkNQZtU7ogZw53yV7-54y0D_ft8qlYvTw-LxerAqtaFUpLrrXU4JAxBs7UfcON6g0yqHA6gEqKqlFWK40KewEwohqEa-peihmZn2aPct1n9HuMv90k2R0lxR8L103G</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</title><source>arXiv.org</source><creator>Wang, Mengyu ; Ding, Henghui ; Liew, Jun Hao ; Liu, Jiajun ; Zhao, Yao ; Wei, Yunchao</creator><creatorcontrib>Wang, Mengyu ; Ding, Henghui ; Liew, Jun Hao ; Liu, Jiajun ; Zhao, Yao ; Wei, Yunchao</creatorcontrib><description>In this paper, we explore a principal way to enhance the quality of object
masks produced by different segmentation models. We propose a model-agnostic
solution called SegRefiner, which offers a novel perspective on this problem by
interpreting segmentation refinement as a data generation process. As a result,
the refinement process can be smoothly implemented through a series of
denoising diffusion steps. Specifically, SegRefiner takes coarse masks as
inputs and refines them using a discrete diffusion process. By predicting the
label and corresponding states-transition probabilities for each pixel,
SegRefiner progressively refines the noisy masks in a conditional denoising
manner. To assess the effectiveness of SegRefiner, we conduct comprehensive
experiments on various segmentation tasks, including semantic segmentation,
instance segmentation, and dichotomous image segmentation. The results
demonstrate the superiority of our SegRefiner from multiple aspects. Firstly,
it consistently improves both the segmentation metrics and boundary metrics
across different types of coarse masks. Secondly, it outperforms previous
model-agnostic refinement methods by a significant margin. Lastly, it exhibits
a strong capability to capture extremely fine details when refining
high-resolution images. The source code and trained models are available at
https://github.com/MengyuWang826/SegRefiner.</description><identifier>DOI: 10.48550/arxiv.2312.12425</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-12</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,886</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2312.12425$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2312.12425$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Mengyu</creatorcontrib><creatorcontrib>Ding, Henghui</creatorcontrib><creatorcontrib>Liew, Jun Hao</creatorcontrib><creatorcontrib>Liu, Jiajun</creatorcontrib><creatorcontrib>Zhao, Yao</creatorcontrib><creatorcontrib>Wei, Yunchao</creatorcontrib><title>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</title><description>In this paper, we explore a principal way to enhance the quality of object
masks produced by different segmentation models. We propose a model-agnostic
solution called SegRefiner, which offers a novel perspective on this problem by
interpreting segmentation refinement as a data generation process. As a result,
the refinement process can be smoothly implemented through a series of
denoising diffusion steps. Specifically, SegRefiner takes coarse masks as
inputs and refines them using a discrete diffusion process. By predicting the
label and corresponding states-transition probabilities for each pixel,
SegRefiner progressively refines the noisy masks in a conditional denoising
manner. To assess the effectiveness of SegRefiner, we conduct comprehensive
experiments on various segmentation tasks, including semantic segmentation,
instance segmentation, and dichotomous image segmentation. The results
demonstrate the superiority of our SegRefiner from multiple aspects. Firstly,
it consistently improves both the segmentation metrics and boundary metrics
across different types of coarse masks. Secondly, it outperforms previous
model-agnostic refinement methods by a significant margin. Lastly, it exhibits
a strong capability to capture extremely fine details when refining
high-resolution images. The source code and trained models are available at
https://github.com/MengyuWang826/SegRefiner.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAQRb1hgQofwKr-gQR7bCeEXVWeUitQyYpNNPGjWGpjZBsKf0_Sspk7Vzq60iHkirNS3ijFrjH--O8SBIeSgwR1Tt7f7HZjnR9svKVtOGA0ia6DsbtisR1Cyl7TEdnbIWP2YaAneOr04PMHvfNJR5vt-Dj3lSbkNQZtU7ogZw53yV7-54y0D_ft8qlYvTw-LxerAqtaFUpLrrXU4JAxBs7UfcON6g0yqHA6gEqKqlFWK40KewEwohqEa-peihmZn2aPct1n9HuMv90k2R0lxR8L103G</recordid><startdate>20231219</startdate><enddate>20231219</enddate><creator>Wang, Mengyu</creator><creator>Ding, Henghui</creator><creator>Liew, Jun Hao</creator><creator>Liu, Jiajun</creator><creator>Zhao, Yao</creator><creator>Wei, Yunchao</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231219</creationdate><title>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</title><author>Wang, Mengyu ; Ding, Henghui ; Liew, Jun Hao ; Liu, Jiajun ; Zhao, Yao ; Wei, Yunchao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-5c41cc4c2fa0002fd7b91d5bda026aa0262a543695ec5ca5ab322a00c23f97b43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang, Mengyu</creatorcontrib><creatorcontrib>Ding, Henghui</creatorcontrib><creatorcontrib>Liew, Jun Hao</creatorcontrib><creatorcontrib>Liu, Jiajun</creatorcontrib><creatorcontrib>Zhao, Yao</creatorcontrib><creatorcontrib>Wei, Yunchao</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wang, Mengyu</au><au>Ding, Henghui</au><au>Liew, Jun Hao</au><au>Liu, Jiajun</au><au>Zhao, Yao</au><au>Wei, Yunchao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process</atitle><date>2023-12-19</date><risdate>2023</risdate><abstract>In this paper, we explore a principal way to enhance the quality of object
masks produced by different segmentation models. We propose a model-agnostic
solution called SegRefiner, which offers a novel perspective on this problem by
interpreting segmentation refinement as a data generation process. As a result,
the refinement process can be smoothly implemented through a series of
denoising diffusion steps. Specifically, SegRefiner takes coarse masks as
inputs and refines them using a discrete diffusion process. By predicting the
label and corresponding states-transition probabilities for each pixel,
SegRefiner progressively refines the noisy masks in a conditional denoising
manner. To assess the effectiveness of SegRefiner, we conduct comprehensive
experiments on various segmentation tasks, including semantic segmentation,
instance segmentation, and dichotomous image segmentation. The results
demonstrate the superiority of our SegRefiner from multiple aspects. Firstly,
it consistently improves both the segmentation metrics and boundary metrics
across different types of coarse masks. Secondly, it outperforms previous
model-agnostic refinement methods by a significant margin. Lastly, it exhibits
a strong capability to capture extremely fine details when refining
high-resolution images. The source code and trained models are available at
https://github.com/MengyuWang826/SegRefiner.</abstract><doi>10.48550/arxiv.2312.12425</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2312.12425 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2312_12425 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-16T14%3A20%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SegRefiner:%20Towards%20Model-Agnostic%20Segmentation%20Refinement%20with%20Discrete%20Diffusion%20Process&rft.au=Wang,%20Mengyu&rft.date=2023-12-19&rft_id=info:doi/10.48550/arxiv.2312.12425&rft_dat=%3Carxiv_GOX%3E2312_12425%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |