Training object class detectors with click supervision

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Papadopoulos, Dim P, Uijlings, Jasper R. R, Keller, Frank, Ferrari, Vittorio
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Papadopoulos, Dim P Uijlings, Jasper R. R Keller, Frank Ferrari, Vittorio
description	Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate these clicks into existing Multiple Instance Learning techniques for weakly supervised object localization, to jointly localize object bounding boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS COCO show that: (1) our scheme delivers high-quality detectors, performing substantially better than those produced by weakly supervised techniques, with a modest extra annotation effort; (2) these detectors in fact perform in a range close to those trained from manually drawn bounding boxes; (3) as the center-click task is very fast, our scheme reduces total annotation time by 9x to 18x.
doi_str_mv	10.48550/arxiv.1704.06189
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1704_06189</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1704_06189</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-86eabfa144f05b364e8734296436e5216c7118445f4e510d7f235e5e837dbafb3</originalsourceid><addsrcrecordid>eNotj8tOwzAURL1hgQofwKr-gaR2fP3IElXlIVVik310nVy3bktS2aHA39MHq9HM4mgOY09SlOC0FgtMP_FUSiugFEa6-p6ZJmEc4rDho99RN_HugDnznqZzGVPm33HansfY7Xn-OlI6xRzH4YHdBTxkevzPGWteVs3yrVh_vL4vn9cFGlsXzhD6gBIgCO2VAXJWQVUbUIZ0JU1npXQAOgBpKXobKqVJk1O29xi8mrH5DXs93h5T_MT0214E2quA-gMn3EAk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Training object class detectors with click supervision</title><source>arXiv.org</source><creator>Papadopoulos, Dim P ; Uijlings, Jasper R. R ; Keller, Frank ; Ferrari, Vittorio</creator><creatorcontrib>Papadopoulos, Dim P ; Uijlings, Jasper R. R ; Keller, Frank ; Ferrari, Vittorio</creatorcontrib><description>Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate these clicks into existing Multiple Instance Learning techniques for weakly supervised object localization, to jointly localize object bounding boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS COCO show that: (1) our scheme delivers high-quality detectors, performing substantially better than those produced by weakly supervised techniques, with a modest extra annotation effort; (2) these detectors in fact perform in a range close to those trained from manually drawn bounding boxes; (3) as the center-click task is very fast, our scheme reduces total annotation time by 9x to 18x.</description><identifier>DOI: 10.48550/arxiv.1704.06189</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2017-04</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,778,883</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1704.06189$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1704.06189$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Papadopoulos, Dim P</creatorcontrib><creatorcontrib>Uijlings, Jasper R. R</creatorcontrib><creatorcontrib>Keller, Frank</creatorcontrib><creatorcontrib>Ferrari, Vittorio</creatorcontrib><title>Training object class detectors with click supervision</title><description>Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate these clicks into existing Multiple Instance Learning techniques for weakly supervised object localization, to jointly localize object bounding boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS COCO show that: (1) our scheme delivers high-quality detectors, performing substantially better than those produced by weakly supervised techniques, with a modest extra annotation effort; (2) these detectors in fact perform in a range close to those trained from manually drawn bounding boxes; (3) as the center-click task is very fast, our scheme reduces total annotation time by 9x to 18x.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAURL1hgQofwKr-gaR2fP3IElXlIVVik310nVy3bktS2aHA39MHq9HM4mgOY09SlOC0FgtMP_FUSiugFEa6-p6ZJmEc4rDho99RN_HugDnznqZzGVPm33HansfY7Xn-OlI6xRzH4YHdBTxkevzPGWteVs3yrVh_vL4vn9cFGlsXzhD6gBIgCO2VAXJWQVUbUIZ0JU1npXQAOgBpKXobKqVJk1O29xi8mrH5DXs93h5T_MT0214E2quA-gMn3EAk</recordid><startdate>20170420</startdate><enddate>20170420</enddate><creator>Papadopoulos, Dim P</creator><creator>Uijlings, Jasper R. R</creator><creator>Keller, Frank</creator><creator>Ferrari, Vittorio</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20170420</creationdate><title>Training object class detectors with click supervision</title><author>Papadopoulos, Dim P ; Uijlings, Jasper R. R ; Keller, Frank ; Ferrari, Vittorio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-86eabfa144f05b364e8734296436e5216c7118445f4e510d7f235e5e837dbafb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Papadopoulos, Dim P</creatorcontrib><creatorcontrib>Uijlings, Jasper R. R</creatorcontrib><creatorcontrib>Keller, Frank</creatorcontrib><creatorcontrib>Ferrari, Vittorio</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Papadopoulos, Dim P</au><au>Uijlings, Jasper R. R</au><au>Keller, Frank</au><au>Ferrari, Vittorio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Training object class detectors with click supervision</atitle><date>2017-04-20</date><risdate>2017</risdate><abstract>Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate these clicks into existing Multiple Instance Learning techniques for weakly supervised object localization, to jointly localize object bounding boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS COCO show that: (1) our scheme delivers high-quality detectors, performing substantially better than those produced by weakly supervised techniques, with a modest extra annotation effort; (2) these detectors in fact perform in a range close to those trained from manually drawn bounding boxes; (3) as the center-click task is very fast, our scheme reduces total annotation time by 9x to 18x.</abstract><doi>10.48550/arxiv.1704.06189</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1704.06189
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1704_06189
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	Training object class detectors with click supervision
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T15%3A49%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Training%20object%20class%20detectors%20with%20click%20supervision&rft.au=Papadopoulos,%20Dim%20P&rft.date=2017-04-20&rft_id=info:doi/10.48550/arxiv.1704.06189&rft_dat=%3Carxiv_GOX%3E1704_06189%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true