Training object class detectors with click supervision
Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Papadopoulos, Dim P Uijlings, Jasper R. R Keller, Frank Ferrari, Vittorio |
description | Training object class detectors typically requires a large set of images with
objects annotated by bounding boxes. However, manually drawing bounding boxes
is very time consuming. In this paper we greatly reduce annotation time by
proposing center-click annotations: we ask annotators to click on the center of
an imaginary bounding box which tightly encloses the object instance. We then
incorporate these clicks into existing Multiple Instance Learning techniques
for weakly supervised object localization, to jointly localize object bounding
boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS
COCO show that: (1) our scheme delivers high-quality detectors, performing
substantially better than those produced by weakly supervised techniques, with
a modest extra annotation effort; (2) these detectors in fact perform in a
range close to those trained from manually drawn bounding boxes; (3) as the
center-click task is very fast, our scheme reduces total annotation time by 9x
to 18x. |
doi_str_mv | 10.48550/arxiv.1704.06189 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1704_06189</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1704_06189</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-86eabfa144f05b364e8734296436e5216c7118445f4e510d7f235e5e837dbafb3</originalsourceid><addsrcrecordid>eNotj8tOwzAURL1hgQofwKr-gaR2fP3IElXlIVVik310nVy3bktS2aHA39MHq9HM4mgOY09SlOC0FgtMP_FUSiugFEa6-p6ZJmEc4rDho99RN_HugDnznqZzGVPm33HansfY7Xn-OlI6xRzH4YHdBTxkevzPGWteVs3yrVh_vL4vn9cFGlsXzhD6gBIgCO2VAXJWQVUbUIZ0JU1npXQAOgBpKXobKqVJk1O29xi8mrH5DXs93h5T_MT0214E2quA-gMn3EAk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Training object class detectors with click supervision</title><source>arXiv.org</source><creator>Papadopoulos, Dim P ; Uijlings, Jasper R. R ; Keller, Frank ; Ferrari, Vittorio</creator><creatorcontrib>Papadopoulos, Dim P ; Uijlings, Jasper R. R ; Keller, Frank ; Ferrari, Vittorio</creatorcontrib><description>Training object class detectors typically requires a large set of images with
objects annotated by bounding boxes. However, manually drawing bounding boxes
is very time consuming. In this paper we greatly reduce annotation time by
proposing center-click annotations: we ask annotators to click on the center of
an imaginary bounding box which tightly encloses the object instance. We then
incorporate these clicks into existing Multiple Instance Learning techniques
for weakly supervised object localization, to jointly localize object bounding
boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS
COCO show that: (1) our scheme delivers high-quality detectors, performing
substantially better than those produced by weakly supervised techniques, with
a modest extra annotation effort; (2) these detectors in fact perform in a
range close to those trained from manually drawn bounding boxes; (3) as the
center-click task is very fast, our scheme reduces total annotation time by 9x
to 18x.</description><identifier>DOI: 10.48550/arxiv.1704.06189</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2017-04</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,778,883</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1704.06189$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1704.06189$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Papadopoulos, Dim P</creatorcontrib><creatorcontrib>Uijlings, Jasper R. R</creatorcontrib><creatorcontrib>Keller, Frank</creatorcontrib><creatorcontrib>Ferrari, Vittorio</creatorcontrib><title>Training object class detectors with click supervision</title><description>Training object class detectors typically requires a large set of images with
objects annotated by bounding boxes. However, manually drawing bounding boxes
is very time consuming. In this paper we greatly reduce annotation time by
proposing center-click annotations: we ask annotators to click on the center of
an imaginary bounding box which tightly encloses the object instance. We then
incorporate these clicks into existing Multiple Instance Learning techniques
for weakly supervised object localization, to jointly localize object bounding
boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS
COCO show that: (1) our scheme delivers high-quality detectors, performing
substantially better than those produced by weakly supervised techniques, with
a modest extra annotation effort; (2) these detectors in fact perform in a
range close to those trained from manually drawn bounding boxes; (3) as the
center-click task is very fast, our scheme reduces total annotation time by 9x
to 18x.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAURL1hgQofwKr-gaR2fP3IElXlIVVik310nVy3bktS2aHA39MHq9HM4mgOY09SlOC0FgtMP_FUSiugFEa6-p6ZJmEc4rDho99RN_HugDnznqZzGVPm33HansfY7Xn-OlI6xRzH4YHdBTxkevzPGWteVs3yrVh_vL4vn9cFGlsXzhD6gBIgCO2VAXJWQVUbUIZ0JU1npXQAOgBpKXobKqVJk1O29xi8mrH5DXs93h5T_MT0214E2quA-gMn3EAk</recordid><startdate>20170420</startdate><enddate>20170420</enddate><creator>Papadopoulos, Dim P</creator><creator>Uijlings, Jasper R. R</creator><creator>Keller, Frank</creator><creator>Ferrari, Vittorio</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20170420</creationdate><title>Training object class detectors with click supervision</title><author>Papadopoulos, Dim P ; Uijlings, Jasper R. R ; Keller, Frank ; Ferrari, Vittorio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-86eabfa144f05b364e8734296436e5216c7118445f4e510d7f235e5e837dbafb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Papadopoulos, Dim P</creatorcontrib><creatorcontrib>Uijlings, Jasper R. R</creatorcontrib><creatorcontrib>Keller, Frank</creatorcontrib><creatorcontrib>Ferrari, Vittorio</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Papadopoulos, Dim P</au><au>Uijlings, Jasper R. R</au><au>Keller, Frank</au><au>Ferrari, Vittorio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Training object class detectors with click supervision</atitle><date>2017-04-20</date><risdate>2017</risdate><abstract>Training object class detectors typically requires a large set of images with
objects annotated by bounding boxes. However, manually drawing bounding boxes
is very time consuming. In this paper we greatly reduce annotation time by
proposing center-click annotations: we ask annotators to click on the center of
an imaginary bounding box which tightly encloses the object instance. We then
incorporate these clicks into existing Multiple Instance Learning techniques
for weakly supervised object localization, to jointly localize object bounding
boxes over all training images. Extensive experiments on PASCAL VOC 2007 and MS
COCO show that: (1) our scheme delivers high-quality detectors, performing
substantially better than those produced by weakly supervised techniques, with
a modest extra annotation effort; (2) these detectors in fact perform in a
range close to those trained from manually drawn bounding boxes; (3) as the
center-click task is very fast, our scheme reduces total annotation time by 9x
to 18x.</abstract><doi>10.48550/arxiv.1704.06189</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.1704.06189 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_1704_06189 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | Training object class detectors with click supervision |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T15%3A49%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Training%20object%20class%20detectors%20with%20click%20supervision&rft.au=Papadopoulos,%20Dim%20P&rft.date=2017-04-20&rft_id=info:doi/10.48550/arxiv.1704.06189&rft_dat=%3Carxiv_GOX%3E1704_06189%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |