Interactive Class-Agnostic Object Counting

We propose a novel framework for interactive class-agnostic object counting, where a human user can interactively provide feedback to improve the accuracy of a counter. Our framework consists of two main components: a user-friendly visualizer to gather feedback and an efficient mechanism to incorpor...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Huang, Yifeng, Ranjan, Viresh, Hoai, Minh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Huang, Yifeng
Ranjan, Viresh
Hoai, Minh
description We propose a novel framework for interactive class-agnostic object counting, where a human user can interactively provide feedback to improve the accuracy of a counter. Our framework consists of two main components: a user-friendly visualizer to gather feedback and an efficient mechanism to incorporate it. In each iteration, we produce a density map to show the current prediction result, and we segment it into non-overlapping regions with an easily verifiable number of objects. The user can provide feedback by selecting a region with obvious counting errors and specifying the range for the estimated number of objects within it. To improve the counting result, we develop a novel adaptation loss to force the visual counter to output the predicted count within the user-specified range. For effective and efficient adaptation, we propose a refinement module that can be used with any density-based visual counter, and only the parameters in the refinement module will be updated during adaptation. Our experiments on two challenging class-agnostic object counting benchmarks, FSCD-LVIS and FSC-147, show that our method can reduce the mean absolute error of multiple state-of-the-art visual counters by roughly 30% to 40% with minimal user input. Our project can be found at https://yifehuang97.github.io/ICACountProjectPage/.
doi_str_mv 10.48550/arxiv.2309.05277
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2309_05277</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2309_05277</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-d47a8a62b15f3f1795e7a11f8b15746d1d2824868feb79f056151648e75e1c573</originalsourceid><addsrcrecordid>eNotzrFuwjAUhWEvDBX0ATqRGSnB1_H1dUYUFYqExMIe3SQ2MoKAEhfB25cC05H-4egT4gtkpi2inHN_C9dM5bLIJCqiDzFbd9H13MRwdUl55GFIF_vuPMTQJNv64JqYlOffLoZuPxEjz8fBfb53LHbL7135k262q3W52KRsiNJWE1s2qgb0uQcq0BEDePsIpE0LrbJKW2O9q6nwEg0gGG0doYMGKR-L6ev2ia0ufThxf6_-0dUTnf8BPb86vw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Interactive Class-Agnostic Object Counting</title><source>arXiv.org</source><creator>Huang, Yifeng ; Ranjan, Viresh ; Hoai, Minh</creator><creatorcontrib>Huang, Yifeng ; Ranjan, Viresh ; Hoai, Minh</creatorcontrib><description>We propose a novel framework for interactive class-agnostic object counting, where a human user can interactively provide feedback to improve the accuracy of a counter. Our framework consists of two main components: a user-friendly visualizer to gather feedback and an efficient mechanism to incorporate it. In each iteration, we produce a density map to show the current prediction result, and we segment it into non-overlapping regions with an easily verifiable number of objects. The user can provide feedback by selecting a region with obvious counting errors and specifying the range for the estimated number of objects within it. To improve the counting result, we develop a novel adaptation loss to force the visual counter to output the predicted count within the user-specified range. For effective and efficient adaptation, we propose a refinement module that can be used with any density-based visual counter, and only the parameters in the refinement module will be updated during adaptation. Our experiments on two challenging class-agnostic object counting benchmarks, FSCD-LVIS and FSC-147, show that our method can reduce the mean absolute error of multiple state-of-the-art visual counters by roughly 30% to 40% with minimal user input. Our project can be found at https://yifehuang97.github.io/ICACountProjectPage/.</description><identifier>DOI: 10.48550/arxiv.2309.05277</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-09</creationdate><rights>http://creativecommons.org/publicdomain/zero/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2309.05277$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2309.05277$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Huang, Yifeng</creatorcontrib><creatorcontrib>Ranjan, Viresh</creatorcontrib><creatorcontrib>Hoai, Minh</creatorcontrib><title>Interactive Class-Agnostic Object Counting</title><description>We propose a novel framework for interactive class-agnostic object counting, where a human user can interactively provide feedback to improve the accuracy of a counter. Our framework consists of two main components: a user-friendly visualizer to gather feedback and an efficient mechanism to incorporate it. In each iteration, we produce a density map to show the current prediction result, and we segment it into non-overlapping regions with an easily verifiable number of objects. The user can provide feedback by selecting a region with obvious counting errors and specifying the range for the estimated number of objects within it. To improve the counting result, we develop a novel adaptation loss to force the visual counter to output the predicted count within the user-specified range. For effective and efficient adaptation, we propose a refinement module that can be used with any density-based visual counter, and only the parameters in the refinement module will be updated during adaptation. Our experiments on two challenging class-agnostic object counting benchmarks, FSCD-LVIS and FSC-147, show that our method can reduce the mean absolute error of multiple state-of-the-art visual counters by roughly 30% to 40% with minimal user input. Our project can be found at https://yifehuang97.github.io/ICACountProjectPage/.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrFuwjAUhWEvDBX0ATqRGSnB1_H1dUYUFYqExMIe3SQ2MoKAEhfB25cC05H-4egT4gtkpi2inHN_C9dM5bLIJCqiDzFbd9H13MRwdUl55GFIF_vuPMTQJNv64JqYlOffLoZuPxEjz8fBfb53LHbL7135k262q3W52KRsiNJWE1s2qgb0uQcq0BEDePsIpE0LrbJKW2O9q6nwEg0gGG0doYMGKR-L6ev2ia0ufThxf6_-0dUTnf8BPb86vw</recordid><startdate>20230911</startdate><enddate>20230911</enddate><creator>Huang, Yifeng</creator><creator>Ranjan, Viresh</creator><creator>Hoai, Minh</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230911</creationdate><title>Interactive Class-Agnostic Object Counting</title><author>Huang, Yifeng ; Ranjan, Viresh ; Hoai, Minh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-d47a8a62b15f3f1795e7a11f8b15746d1d2824868feb79f056151648e75e1c573</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Huang, Yifeng</creatorcontrib><creatorcontrib>Ranjan, Viresh</creatorcontrib><creatorcontrib>Hoai, Minh</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Huang, Yifeng</au><au>Ranjan, Viresh</au><au>Hoai, Minh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Interactive Class-Agnostic Object Counting</atitle><date>2023-09-11</date><risdate>2023</risdate><abstract>We propose a novel framework for interactive class-agnostic object counting, where a human user can interactively provide feedback to improve the accuracy of a counter. Our framework consists of two main components: a user-friendly visualizer to gather feedback and an efficient mechanism to incorporate it. In each iteration, we produce a density map to show the current prediction result, and we segment it into non-overlapping regions with an easily verifiable number of objects. The user can provide feedback by selecting a region with obvious counting errors and specifying the range for the estimated number of objects within it. To improve the counting result, we develop a novel adaptation loss to force the visual counter to output the predicted count within the user-specified range. For effective and efficient adaptation, we propose a refinement module that can be used with any density-based visual counter, and only the parameters in the refinement module will be updated during adaptation. Our experiments on two challenging class-agnostic object counting benchmarks, FSCD-LVIS and FSC-147, show that our method can reduce the mean absolute error of multiple state-of-the-art visual counters by roughly 30% to 40% with minimal user input. Our project can be found at https://yifehuang97.github.io/ICACountProjectPage/.</abstract><doi>10.48550/arxiv.2309.05277</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2309.05277
ispartof
issn
language eng
recordid cdi_arxiv_primary_2309_05277
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title Interactive Class-Agnostic Object Counting
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T22%3A12%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Interactive%20Class-Agnostic%20Object%20Counting&rft.au=Huang,%20Yifeng&rft.date=2023-09-11&rft_id=info:doi/10.48550/arxiv.2309.05277&rft_dat=%3Carxiv_GOX%3E2309_05277%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true