Real Time Audio-Visual Person Tracking
This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers ba...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 247 |
---|---|
container_issue | |
container_start_page | 243 |
container_title | |
container_volume | |
creator | Talantzis, F. Pnevmatikakis, A. Polymenakos, L.C. |
description | This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed |
doi_str_mv | 10.1109/MMSP.2006.285306 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4064556</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4064556</ieee_id><sourcerecordid>4064556</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-9b326ffa93a7bde4387acfaeaaee5fb990d81d475a367c1fde4bcde66e3607383</originalsourceid><addsrcrecordid>eNpVjM1Lw0AUxFdEUNrcBS85eUt8m7efx1L8gpYWjV7LS_atrPZDsvbgf29ALw4Dw28YRohLCbWU4G-Wy-d13QCYunEawZyIwlsHo9Fb3cjTfyzVuShyfodR6LX2-kJcPzFtyzbtuJwdQzpUrykfx2bNQz7sy3ag_iPt36biLNI2c_GXE_Fyd9vOH6rF6v5xPltUSVr9VfkOGxMjeSTbBVboLPWRmIhZx857CE4GZTWhsb2M46TrAxvDaMCiw4m4-v1NzLz5HNKOhu-NAqO0NvgD0edB9w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Real Time Audio-Visual Person Tracking</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Talantzis, F. ; Pnevmatikakis, A. ; Polymenakos, L.C.</creator><creatorcontrib>Talantzis, F. ; Pnevmatikakis, A. ; Polymenakos, L.C.</creatorcontrib><description>This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed</description><identifier>ISBN: 9780780397514</identifier><identifier>ISBN: 0780397517</identifier><identifier>EISBN: 9780780397521</identifier><identifier>EISBN: 0780397525</identifier><identifier>DOI: 10.1109/MMSP.2006.285306</identifier><language>eng</language><publisher>IEEE</publisher><subject>Delay estimation ; Direction of arrival estimation ; Feedback ; Information Theory ; Kalman Filtering ; Kalman filters ; Loudspeakers ; Microphones ; Multimodal sensors ; Person Tracking ; Sensor systems ; Speech ; Target tracking</subject><ispartof>2006 IEEE Workshop on Multimedia Signal Processing, 2006, p.243-247</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4064556$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2052,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4064556$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Talantzis, F.</creatorcontrib><creatorcontrib>Pnevmatikakis, A.</creatorcontrib><creatorcontrib>Polymenakos, L.C.</creatorcontrib><title>Real Time Audio-Visual Person Tracking</title><title>2006 IEEE Workshop on Multimedia Signal Processing</title><addtitle>MMSP</addtitle><description>This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed</description><subject>Delay estimation</subject><subject>Direction of arrival estimation</subject><subject>Feedback</subject><subject>Information Theory</subject><subject>Kalman Filtering</subject><subject>Kalman filters</subject><subject>Loudspeakers</subject><subject>Microphones</subject><subject>Multimodal sensors</subject><subject>Person Tracking</subject><subject>Sensor systems</subject><subject>Speech</subject><subject>Target tracking</subject><isbn>9780780397514</isbn><isbn>0780397517</isbn><isbn>9780780397521</isbn><isbn>0780397525</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2006</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpVjM1Lw0AUxFdEUNrcBS85eUt8m7efx1L8gpYWjV7LS_atrPZDsvbgf29ALw4Dw28YRohLCbWU4G-Wy-d13QCYunEawZyIwlsHo9Fb3cjTfyzVuShyfodR6LX2-kJcPzFtyzbtuJwdQzpUrykfx2bNQz7sy3ag_iPt36biLNI2c_GXE_Fyd9vOH6rF6v5xPltUSVr9VfkOGxMjeSTbBVboLPWRmIhZx857CE4GZTWhsb2M46TrAxvDaMCiw4m4-v1NzLz5HNKOhu-NAqO0NvgD0edB9w</recordid><startdate>200610</startdate><enddate>200610</enddate><creator>Talantzis, F.</creator><creator>Pnevmatikakis, A.</creator><creator>Polymenakos, L.C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200610</creationdate><title>Real Time Audio-Visual Person Tracking</title><author>Talantzis, F. ; Pnevmatikakis, A. ; Polymenakos, L.C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-9b326ffa93a7bde4387acfaeaaee5fb990d81d475a367c1fde4bcde66e3607383</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Delay estimation</topic><topic>Direction of arrival estimation</topic><topic>Feedback</topic><topic>Information Theory</topic><topic>Kalman Filtering</topic><topic>Kalman filters</topic><topic>Loudspeakers</topic><topic>Microphones</topic><topic>Multimodal sensors</topic><topic>Person Tracking</topic><topic>Sensor systems</topic><topic>Speech</topic><topic>Target tracking</topic><toplevel>online_resources</toplevel><creatorcontrib>Talantzis, F.</creatorcontrib><creatorcontrib>Pnevmatikakis, A.</creatorcontrib><creatorcontrib>Polymenakos, L.C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Talantzis, F.</au><au>Pnevmatikakis, A.</au><au>Polymenakos, L.C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Real Time Audio-Visual Person Tracking</atitle><btitle>2006 IEEE Workshop on Multimedia Signal Processing</btitle><stitle>MMSP</stitle><date>2006-10</date><risdate>2006</risdate><spage>243</spage><epage>247</epage><pages>243-247</pages><isbn>9780780397514</isbn><isbn>0780397517</isbn><eisbn>9780780397521</eisbn><eisbn>0780397525</eisbn><abstract>This paper proposes a system for tracking people in three dimensions, utilizing audiovisual information from multiple acoustic and video sensors. The proposed system comprises a video and an audio subsystem combined using a Kalman filter. The video subsystem combines in 3D a number of 2D trackers based on a variation of Stauffer's adaptive background algorithm with spatio-temporal adaptation of the learning parameters and a Kalman tracker in a feedback configuration. The audio subsystem uses an information theoretic metric upon a pair of microphones to estimate the direction from which sound is arriving from. Combining measurements from a series of pairs the actual coordinate of the speaker in space is derived. Experiments show that gains are to be expected when fusion of the separate tracking systems is performed</abstract><pub>IEEE</pub><doi>10.1109/MMSP.2006.285306</doi><tpages>5</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 9780780397514 |
ispartof | 2006 IEEE Workshop on Multimedia Signal Processing, 2006, p.243-247 |
issn | |
language | eng |
recordid | cdi_ieee_primary_4064556 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Delay estimation Direction of arrival estimation Feedback Information Theory Kalman Filtering Kalman filters Loudspeakers Microphones Multimodal sensors Person Tracking Sensor systems Speech Target tracking |
title | Real Time Audio-Visual Person Tracking |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T21%3A20%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Real%20Time%20Audio-Visual%20Person%20Tracking&rft.btitle=2006%20IEEE%20Workshop%20on%20Multimedia%20Signal%20Processing&rft.au=Talantzis,%20F.&rft.date=2006-10&rft.spage=243&rft.epage=247&rft.pages=243-247&rft.isbn=9780780397514&rft.isbn_list=0780397517&rft_id=info:doi/10.1109/MMSP.2006.285306&rft_dat=%3Cieee_6IE%3E4064556%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9780780397521&rft.eisbn_list=0780397525&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4064556&rfr_iscdi=true |