A 3-D marked point process model for multi-view people detection
In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The obser...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 3392 |
---|---|
container_issue | |
container_start_page | 3385 |
container_title | |
container_volume | |
creator | Utasi, A. Benedek, C. |
description | In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated. |
doi_str_mv | 10.1109/CVPR.2011.5995699 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5995699</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5995699</ieee_id><sourcerecordid>5995699</sourcerecordid><originalsourceid>FETCH-LOGICAL-i218t-975526d5e395ecede690099f243d02bfb11fb7f15630990824efeae27414d10d3</originalsourceid><addsrcrecordid>eNpFj91KxDAUhCMquK77AOJNXiA1J2mSnjuX-gsLiqi3S9ucQLTdlLYqvr0FF5ybYb6LYYaxc5AZgMTL8u3pOVMSIDOIxiIesFPIjXNSo8bD_5C7I7YAabWwCHjCVuP4LmdZW6BxC3a15lpc864aPsjzPsXdxPshNTSOvEueWh7SwLvPdoriK9I37yn1LXFPEzVTTLszdhyqdqTV3pfs9fbmpbwXm8e7h3K9EVFBMQl0xijrDWk01JAni1IiBpVrL1UdaoBQuwDG6hnLQuUUqCLlcsg9SK-X7OKvNxLRth_iPPlnu3-vfwGoOUrK</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A 3-D marked point process model for multi-view people detection</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Utasi, A. ; Benedek, C.</creator><creatorcontrib>Utasi, A. ; Benedek, C.</creatorcontrib><description>In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.</description><identifier>ISSN: 1063-6919</identifier><identifier>ISBN: 1457703947</identifier><identifier>ISBN: 9781457703942</identifier><identifier>EISBN: 1457703939</identifier><identifier>EISBN: 1457703955</identifier><identifier>EISBN: 9781457703959</identifier><identifier>EISBN: 9781457703935</identifier><identifier>DOI: 10.1109/CVPR.2011.5995699</identifier><language>eng</language><publisher>IEEE</publisher><subject>Cameras ; Feature extraction ; Image color analysis ; Monitoring ; Optimization ; Shape ; Solid modeling</subject><ispartof>CVPR 2011, 2011, p.3385-3392</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5995699$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2052,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5995699$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Utasi, A.</creatorcontrib><creatorcontrib>Benedek, C.</creatorcontrib><title>A 3-D marked point process model for multi-view people detection</title><title>CVPR 2011</title><addtitle>CVPR</addtitle><description>In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.</description><subject>Cameras</subject><subject>Feature extraction</subject><subject>Image color analysis</subject><subject>Monitoring</subject><subject>Optimization</subject><subject>Shape</subject><subject>Solid modeling</subject><issn>1063-6919</issn><isbn>1457703947</isbn><isbn>9781457703942</isbn><isbn>1457703939</isbn><isbn>1457703955</isbn><isbn>9781457703959</isbn><isbn>9781457703935</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFj91KxDAUhCMquK77AOJNXiA1J2mSnjuX-gsLiqi3S9ucQLTdlLYqvr0FF5ybYb6LYYaxc5AZgMTL8u3pOVMSIDOIxiIesFPIjXNSo8bD_5C7I7YAabWwCHjCVuP4LmdZW6BxC3a15lpc864aPsjzPsXdxPshNTSOvEueWh7SwLvPdoriK9I37yn1LXFPEzVTTLszdhyqdqTV3pfs9fbmpbwXm8e7h3K9EVFBMQl0xijrDWk01JAni1IiBpVrL1UdaoBQuwDG6hnLQuUUqCLlcsg9SK-X7OKvNxLRth_iPPlnu3-vfwGoOUrK</recordid><startdate>20110101</startdate><enddate>20110101</enddate><creator>Utasi, A.</creator><creator>Benedek, C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20110101</creationdate><title>A 3-D marked point process model for multi-view people detection</title><author>Utasi, A. ; Benedek, C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i218t-975526d5e395ecede690099f243d02bfb11fb7f15630990824efeae27414d10d3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Cameras</topic><topic>Feature extraction</topic><topic>Image color analysis</topic><topic>Monitoring</topic><topic>Optimization</topic><topic>Shape</topic><topic>Solid modeling</topic><toplevel>online_resources</toplevel><creatorcontrib>Utasi, A.</creatorcontrib><creatorcontrib>Benedek, C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Utasi, A.</au><au>Benedek, C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A 3-D marked point process model for multi-view people detection</atitle><btitle>CVPR 2011</btitle><stitle>CVPR</stitle><date>2011-01-01</date><risdate>2011</risdate><spage>3385</spage><epage>3392</epage><pages>3385-3392</pages><issn>1063-6919</issn><isbn>1457703947</isbn><isbn>9781457703942</isbn><eisbn>1457703939</eisbn><eisbn>1457703955</eisbn><eisbn>9781457703959</eisbn><eisbn>9781457703935</eisbn><abstract>In this paper we introduce a probabilistic approach on multiple person localization using multiple calibrated camera views. People present in the scene are approximated by a population of cylinder objects in the 3-D world coordinate system, which is a realization of a Marked Point Process. The observation model is based on the projection of the pixels of the obtained motion masks in the different camera images to the ground plane and to other parallel planes with different height. The proposed pixel-level feature is based on physical properties of the 2-D image formation process and can accurately localize the leg position on the ground plane and estimate the height of the people, even if the area of interest is only a part of the scene, meanwhile silhouettes from irrelevant outside motions may significantly overlap with the monitored region in some of the camera views. We introduce an energy function, which contains a data term calculated from the extracted features and a geometrical constraint term modeling the distance between two people. The final configuration results (location and height) are obtained by an iterative stochastic energy optimization process, called the Multiple Birth and Death dynamics. The proposed approached is compared to a recent state-of-the-art technique in a publicly available dataset and its advantages are quantitatively demonstrated.</abstract><pub>IEEE</pub><doi>10.1109/CVPR.2011.5995699</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1063-6919 |
ispartof | CVPR 2011, 2011, p.3385-3392 |
issn | 1063-6919 |
language | eng |
recordid | cdi_ieee_primary_5995699 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Cameras Feature extraction Image color analysis Monitoring Optimization Shape Solid modeling |
title | A 3-D marked point process model for multi-view people detection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T09%3A15%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%203-D%20marked%20point%20process%20model%20for%20multi-view%20people%20detection&rft.btitle=CVPR%202011&rft.au=Utasi,%20A.&rft.date=2011-01-01&rft.spage=3385&rft.epage=3392&rft.pages=3385-3392&rft.issn=1063-6919&rft.isbn=1457703947&rft.isbn_list=9781457703942&rft_id=info:doi/10.1109/CVPR.2011.5995699&rft_dat=%3Cieee_6IE%3E5995699%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1457703939&rft.eisbn_list=1457703955&rft.eisbn_list=9781457703959&rft.eisbn_list=9781457703935&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5995699&rfr_iscdi=true |