A 3-D marked point process model for multi-view people detection

In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The obser...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Utasi, A., Benedek, C.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3392
container_issue
container_start_page 3385
container_title
container_volume
creator Utasi, A.
Benedek, C.
description In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.
doi_str_mv 10.1109/CVPR.2011.5995699
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5995699</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5995699</ieee_id><sourcerecordid>5995699</sourcerecordid><originalsourceid>FETCH-LOGICAL-i218t-975526d5e395ecede690099f243d02bfb11fb7f15630990824efeae27414d10d3</originalsourceid><addsrcrecordid>eNpFj91KxDAUhCMquK77AOJNXiA1J2mSnjuX-gsLiqi3S9ucQLTdlLYqvr0FF5ybYb6LYYaxc5AZgMTL8u3pOVMSIDOIxiIesFPIjXNSo8bD_5C7I7YAabWwCHjCVuP4LmdZW6BxC3a15lpc864aPsjzPsXdxPshNTSOvEueWh7SwLvPdoriK9I37yn1LXFPEzVTTLszdhyqdqTV3pfs9fbmpbwXm8e7h3K9EVFBMQl0xijrDWk01JAni1IiBpVrL1UdaoBQuwDG6hnLQuUUqCLlcsg9SK-X7OKvNxLRth_iPPlnu3-vfwGoOUrK</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A 3-D marked point process model for multi-view people detection</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Utasi, A. ; Benedek, C.</creator><creatorcontrib>Utasi, A. ; Benedek, C.</creatorcontrib><description>In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.</description><identifier>ISSN: 1063-6919</identifier><identifier>ISBN: 1457703947</identifier><identifier>ISBN: 9781457703942</identifier><identifier>EISBN: 1457703939</identifier><identifier>EISBN: 1457703955</identifier><identifier>EISBN: 9781457703959</identifier><identifier>EISBN: 9781457703935</identifier><identifier>DOI: 10.1109/CVPR.2011.5995699</identifier><language>eng</language><publisher>IEEE</publisher><subject>Cameras ; Feature extraction ; Image color analysis ; Monitoring ; Optimization ; Shape ; Solid modeling</subject><ispartof>CVPR 2011, 2011, p.3385-3392</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5995699$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2052,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5995699$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Utasi, A.</creatorcontrib><creatorcontrib>Benedek, C.</creatorcontrib><title>A 3-D marked point process model for multi-view people detection</title><title>CVPR 2011</title><addtitle>CVPR</addtitle><description>In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.</description><subject>Cameras</subject><subject>Feature extraction</subject><subject>Image color analysis</subject><subject>Monitoring</subject><subject>Optimization</subject><subject>Shape</subject><subject>Solid modeling</subject><issn>1063-6919</issn><isbn>1457703947</isbn><isbn>9781457703942</isbn><isbn>1457703939</isbn><isbn>1457703955</isbn><isbn>9781457703959</isbn><isbn>9781457703935</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFj91KxDAUhCMquK77AOJNXiA1J2mSnjuX-gsLiqi3S9ucQLTdlLYqvr0FF5ybYb6LYYaxc5AZgMTL8u3pOVMSIDOIxiIesFPIjXNSo8bD_5C7I7YAabWwCHjCVuP4LmdZW6BxC3a15lpc864aPsjzPsXdxPshNTSOvEueWh7SwLvPdoriK9I37yn1LXFPEzVTTLszdhyqdqTV3pfs9fbmpbwXm8e7h3K9EVFBMQl0xijrDWk01JAni1IiBpVrL1UdaoBQuwDG6hnLQuUUqCLlcsg9SK-X7OKvNxLRth_iPPlnu3-vfwGoOUrK</recordid><startdate>20110101</startdate><enddate>20110101</enddate><creator>Utasi, A.</creator><creator>Benedek, C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20110101</creationdate><title>A 3-D marked point process model for multi-view people detection</title><author>Utasi, A. ; Benedek, C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i218t-975526d5e395ecede690099f243d02bfb11fb7f15630990824efeae27414d10d3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Cameras</topic><topic>Feature extraction</topic><topic>Image color analysis</topic><topic>Monitoring</topic><topic>Optimization</topic><topic>Shape</topic><topic>Solid modeling</topic><toplevel>online_resources</toplevel><creatorcontrib>Utasi, A.</creatorcontrib><creatorcontrib>Benedek, C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Utasi, A.</au><au>Benedek, C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A 3-D marked point process model for multi-view people detection</atitle><btitle>CVPR 2011</btitle><stitle>CVPR</stitle><date>2011-01-01</date><risdate>2011</risdate><spage>3385</spage><epage>3392</epage><pages>3385-3392</pages><issn>1063-6919</issn><isbn>1457703947</isbn><isbn>9781457703942</isbn><eisbn>1457703939</eisbn><eisbn>1457703955</eisbn><eisbn>9781457703959</eisbn><eisbn>9781457703935</eisbn><abstract>In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.</abstract><pub>IEEE</pub><doi>10.1109/CVPR.2011.5995699</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1063-6919
ispartof CVPR 2011, 2011, p.3385-3392
issn 1063-6919
language eng
recordid cdi_ieee_primary_5995699
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Cameras
Feature extraction
Image color analysis
Monitoring
Optimization
Shape
Solid modeling
title A 3-D marked point process model for multi-view people detection
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T09%3A15%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%203-D%20marked%20point%20process%20model%20for%20multi-view%20people%20detection&rft.btitle=CVPR%202011&rft.au=Utasi,%20A.&rft.date=2011-01-01&rft.spage=3385&rft.epage=3392&rft.pages=3385-3392&rft.issn=1063-6919&rft.isbn=1457703947&rft.isbn_list=9781457703942&rft_id=info:doi/10.1109/CVPR.2011.5995699&rft_dat=%3Cieee_6IE%3E5995699%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1457703939&rft.eisbn_list=1457703955&rft.eisbn_list=9781457703959&rft.eisbn_list=9781457703935&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5995699&rfr_iscdi=true