Real Time Audio-Visual Person Tracking

This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers ba...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Talantzis, F., Pnevmatikakis, A., Polymenakos, L.C.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 247
container_issue
container_start_page 243
container_title
container_volume
creator Talantzis, F.
Pnevmatikakis, A.
Polymenakos, L.C.
description This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed
doi_str_mv 10.1109/MMSP.2006.285306
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4064556</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4064556</ieee_id><sourcerecordid>4064556</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-9b326ffa93a7bde4387acfaeaaee5fb990d81d475a367c1fde4bcde66e3607383</originalsourceid><addsrcrecordid>eNpVjM1Lw0AUxFdEUNrcBS85eUt8m7efx1L8gpYWjV7LS_atrPZDsvbgf29ALw4Dw28YRohLCbWU4G-Wy-d13QCYunEawZyIwlsHo9Fb3cjTfyzVuShyfodR6LX2-kJcPzFtyzbtuJwdQzpUrykfx2bNQz7sy3ag_iPt36biLNI2c_GXE_Fyd9vOH6rF6v5xPltUSVr9VfkOGxMjeSTbBVboLPWRmIhZx857CE4GZTWhsb2M46TrAxvDaMCiw4m4-v1NzLz5HNKOhu-NAqO0NvgD0edB9w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Real Time Audio-Visual Person Tracking</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Talantzis, F. ; Pnevmatikakis, A. ; Polymenakos, L.C.</creator><creatorcontrib>Talantzis, F. ; Pnevmatikakis, A. ; Polymenakos, L.C.</creatorcontrib><description>This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed</description><identifier>ISBN: 9780780397514</identifier><identifier>ISBN: 0780397517</identifier><identifier>EISBN: 9780780397521</identifier><identifier>EISBN: 0780397525</identifier><identifier>DOI: 10.1109/MMSP.2006.285306</identifier><language>eng</language><publisher>IEEE</publisher><subject>Delay estimation ; Direction of arrival estimation ; Feedback ; Information Theory ; Kalman Filtering ; Kalman filters ; Loudspeakers ; Microphones ; Multimodal sensors ; Person Tracking ; Sensor systems ; Speech ; Target tracking</subject><ispartof>2006 IEEE Workshop on Multimedia Signal Processing, 2006, p.243-247</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4064556$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2052,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4064556$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Talantzis, F.</creatorcontrib><creatorcontrib>Pnevmatikakis, A.</creatorcontrib><creatorcontrib>Polymenakos, L.C.</creatorcontrib><title>Real Time Audio-Visual Person Tracking</title><title>2006 IEEE Workshop on Multimedia Signal Processing</title><addtitle>MMSP</addtitle><description>This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed</description><subject>Delay estimation</subject><subject>Direction of arrival estimation</subject><subject>Feedback</subject><subject>Information Theory</subject><subject>Kalman Filtering</subject><subject>Kalman filters</subject><subject>Loudspeakers</subject><subject>Microphones</subject><subject>Multimodal sensors</subject><subject>Person Tracking</subject><subject>Sensor systems</subject><subject>Speech</subject><subject>Target tracking</subject><isbn>9780780397514</isbn><isbn>0780397517</isbn><isbn>9780780397521</isbn><isbn>0780397525</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2006</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpVjM1Lw0AUxFdEUNrcBS85eUt8m7efx1L8gpYWjV7LS_atrPZDsvbgf29ALw4Dw28YRohLCbWU4G-Wy-d13QCYunEawZyIwlsHo9Fb3cjTfyzVuShyfodR6LX2-kJcPzFtyzbtuJwdQzpUrykfx2bNQz7sy3ag_iPt36biLNI2c_GXE_Fyd9vOH6rF6v5xPltUSVr9VfkOGxMjeSTbBVboLPWRmIhZx857CE4GZTWhsb2M46TrAxvDaMCiw4m4-v1NzLz5HNKOhu-NAqO0NvgD0edB9w</recordid><startdate>200610</startdate><enddate>200610</enddate><creator>Talantzis, F.</creator><creator>Pnevmatikakis, A.</creator><creator>Polymenakos, L.C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200610</creationdate><title>Real Time Audio-Visual Person Tracking</title><author>Talantzis, F. ; Pnevmatikakis, A. ; Polymenakos, L.C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-9b326ffa93a7bde4387acfaeaaee5fb990d81d475a367c1fde4bcde66e3607383</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Delay estimation</topic><topic>Direction of arrival estimation</topic><topic>Feedback</topic><topic>Information Theory</topic><topic>Kalman Filtering</topic><topic>Kalman filters</topic><topic>Loudspeakers</topic><topic>Microphones</topic><topic>Multimodal sensors</topic><topic>Person Tracking</topic><topic>Sensor systems</topic><topic>Speech</topic><topic>Target tracking</topic><toplevel>online_resources</toplevel><creatorcontrib>Talantzis, F.</creatorcontrib><creatorcontrib>Pnevmatikakis, A.</creatorcontrib><creatorcontrib>Polymenakos, L.C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Talantzis, F.</au><au>Pnevmatikakis, A.</au><au>Polymenakos, L.C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Real Time Audio-Visual Person Tracking</atitle><btitle>2006 IEEE Workshop on Multimedia Signal Processing</btitle><stitle>MMSP</stitle><date>2006-10</date><risdate>2006</risdate><spage>243</spage><epage>247</epage><pages>243-247</pages><isbn>9780780397514</isbn><isbn>0780397517</isbn><eisbn>9780780397521</eisbn><eisbn>0780397525</eisbn><abstract>This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed</abstract><pub>IEEE</pub><doi>10.1109/MMSP.2006.285306</doi><tpages>5</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 9780780397514
ispartof 2006 IEEE Workshop on Multimedia Signal Processing, 2006, p.243-247
issn
language eng
recordid cdi_ieee_primary_4064556
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Delay estimation
Direction of arrival estimation
Feedback
Information Theory
Kalman Filtering
Kalman filters
Loudspeakers
Microphones
Multimodal sensors
Person Tracking
Sensor systems
Speech
Target tracking
title Real Time Audio-Visual Person Tracking
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T21%3A20%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Real%20Time%20Audio-Visual%20Person%20Tracking&rft.btitle=2006%20IEEE%20Workshop%20on%20Multimedia%20Signal%20Processing&rft.au=Talantzis,%20F.&rft.date=2006-10&rft.spage=243&rft.epage=247&rft.pages=243-247&rft.isbn=9780780397514&rft.isbn_list=0780397517&rft_id=info:doi/10.1109/MMSP.2006.285306&rft_dat=%3Cieee_6IE%3E4064556%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9780780397521&rft.eisbn_list=0780397525&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4064556&rfr_iscdi=true