Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection

We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate e...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2020-02
Hauptverfasser:	Deepak Babu Sam, Skand Vishwanath Peri, Mukuntha Narayanan Sundararaman, Kamath, Amogh, R Venkatesh Babu
Format:	Artikel
Sprache:	eng
Schlagworte:	Annotations Architecture Counting Density Image detection
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Deepak Babu Sam Skand Vishwanath Peri Mukuntha Narayanan Sundararaman Kamath, Amogh R Venkatesh Babu
description	We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate enough for most applications other than counting. Hence, we adopt an architecture that locates every person in the crowd, sizes the spotted heads with bounding box and then counts them. Compared to normal object or face detectors, there exist certain unique challenges in designing such a detection system. Some of them are direct consequences of the huge diversity in dense crowds along with the need to predict boxes contiguously. We solve these issues and develop our LSC-CNN model, which can reliably detect heads of people across sparse to dense crowds. LSC-CNN employs a multi-column architecture with top-down feedback processing to better resolve persons and produce refined predictions at multiple resolutions. Interestingly, the proposed training regime requires only point head annotation, but can estimate approximate size information of heads. We show that LSC-CNN not only has superior localization than existing density regressors, but outperforms in counting as well. The code for our approach is available at https://github.com/val-iisc/lsc-cnn.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2243262584</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2243262584</sourcerecordid><originalsourceid>FETCH-proquest_journals_22432625843</originalsourceid><addsrcrecordid>eNqNissKgkAUQIcgSMp_uNA2we6oSbuwokWLXnsRvcXIMGPzMOrrc9EHtDpwzhmxADlfRnmCOGGhtW0cx5itME15wM5HXVeOFnAVH4JKNVBor9waNnXtzVDkGy5kteyFesCJdCcJhIItKUtQGP1qLPSiGoSj2gmtZmx8r6Sl8Mcpm-93t-IQdUY_PVlXttobNaQSMeGYYZon_L_rCxeRPlk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2243262584</pqid></control><display><type>article</type><title>Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection</title><source>Free E- Journals</source><creator>Deepak Babu Sam ; Skand Vishwanath Peri ; Mukuntha Narayanan Sundararaman ; Kamath, Amogh ; R Venkatesh Babu</creator><creatorcontrib>Deepak Babu Sam ; Skand Vishwanath Peri ; Mukuntha Narayanan Sundararaman ; Kamath, Amogh ; R Venkatesh Babu</creatorcontrib><description>We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate enough for most applications other than counting. Hence, we adopt an architecture that locates every person in the crowd, sizes the spotted heads with bounding box and then counts them. Compared to normal object or face detectors, there exist certain unique challenges in designing such a detection system. Some of them are direct consequences of the huge diversity in dense crowds along with the need to predict boxes contiguously. We solve these issues and develop our LSC-CNN model, which can reliably detect heads of people across sparse to dense crowds. LSC-CNN employs a multi-column architecture with top-down feedback processing to better resolve persons and produce refined predictions at multiple resolutions. Interestingly, the proposed training regime requires only point head annotation, but can estimate approximate size information of heads. We show that LSC-CNN not only has superior localization than existing density regressors, but outperforms in counting as well. The code for our approach is available at https://github.com/val-iisc/lsc-cnn.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Architecture ; Counting ; Density ; Image detection</subject><ispartof>arXiv.org, 2020-02</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Deepak Babu Sam</creatorcontrib><creatorcontrib>Skand Vishwanath Peri</creatorcontrib><creatorcontrib>Mukuntha Narayanan Sundararaman</creatorcontrib><creatorcontrib>Kamath, Amogh</creatorcontrib><creatorcontrib>R Venkatesh Babu</creatorcontrib><title>Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection</title><title>arXiv.org</title><description>We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate enough for most applications other than counting. Hence, we adopt an architecture that locates every person in the crowd, sizes the spotted heads with bounding box and then counts them. Compared to normal object or face detectors, there exist certain unique challenges in designing such a detection system. Some of them are direct consequences of the huge diversity in dense crowds along with the need to predict boxes contiguously. We solve these issues and develop our LSC-CNN model, which can reliably detect heads of people across sparse to dense crowds. LSC-CNN employs a multi-column architecture with top-down feedback processing to better resolve persons and produce refined predictions at multiple resolutions. Interestingly, the proposed training regime requires only point head annotation, but can estimate approximate size information of heads. We show that LSC-CNN not only has superior localization than existing density regressors, but outperforms in counting as well. The code for our approach is available at https://github.com/val-iisc/lsc-cnn.</description><subject>Annotations</subject><subject>Architecture</subject><subject>Counting</subject><subject>Density</subject><subject>Image detection</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNissKgkAUQIcgSMp_uNA2we6oSbuwokWLXnsRvcXIMGPzMOrrc9EHtDpwzhmxADlfRnmCOGGhtW0cx5itME15wM5HXVeOFnAVH4JKNVBor9waNnXtzVDkGy5kteyFesCJdCcJhIItKUtQGP1qLPSiGoSj2gmtZmx8r6Sl8Mcpm-93t-IQdUY_PVlXttobNaQSMeGYYZon_L_rCxeRPlk</recordid><startdate>20200215</startdate><enddate>20200215</enddate><creator>Deepak Babu Sam</creator><creator>Skand Vishwanath Peri</creator><creator>Mukuntha Narayanan Sundararaman</creator><creator>Kamath, Amogh</creator><creator>R Venkatesh Babu</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200215</creationdate><title>Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection</title><author>Deepak Babu Sam ; Skand Vishwanath Peri ; Mukuntha Narayanan Sundararaman ; Kamath, Amogh ; R Venkatesh Babu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_22432625843</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Annotations</topic><topic>Architecture</topic><topic>Counting</topic><topic>Density</topic><topic>Image detection</topic><toplevel>online_resources</toplevel><creatorcontrib>Deepak Babu Sam</creatorcontrib><creatorcontrib>Skand Vishwanath Peri</creatorcontrib><creatorcontrib>Mukuntha Narayanan Sundararaman</creatorcontrib><creatorcontrib>Kamath, Amogh</creatorcontrib><creatorcontrib>R Venkatesh Babu</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Deepak Babu Sam</au><au>Skand Vishwanath Peri</au><au>Mukuntha Narayanan Sundararaman</au><au>Kamath, Amogh</au><au>R Venkatesh Babu</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection</atitle><jtitle>arXiv.org</jtitle><date>2020-02-15</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurate enough for most applications other than counting. Hence, we adopt an architecture that locates every person in the crowd, sizes the spotted heads with bounding box and then counts them. Compared to normal object or face detectors, there exist certain unique challenges in designing such a detection system. Some of them are direct consequences of the huge diversity in dense crowds along with the need to predict boxes contiguously. We solve these issues and develop our LSC-CNN model, which can reliably detect heads of people across sparse to dense crowds. LSC-CNN employs a multi-column architecture with top-down feedback processing to better resolve persons and produce refined predictions at multiple resolutions. Interestingly, the proposed training regime requires only point head annotation, but can estimate approximate size information of heads. We show that LSC-CNN not only has superior localization than existing density regressors, but outperforms in counting as well. The code for our approach is available at https://github.com/val-iisc/lsc-cnn.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2020-02
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2243262584
source	Free E- Journals
subjects	Annotations Architecture Counting Density Image detection
title	Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T12%3A10%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Locate,%20Size%20and%20Count:%20Accurately%20Resolving%20People%20in%20Dense%20Crowds%20via%20Detection&rft.jtitle=arXiv.org&rft.au=Deepak%20Babu%20Sam&rft.date=2020-02-15&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2243262584%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2243262584&rft_id=info:pmid/&rfr_iscdi=true