Lip modeling for visual speech recognition

In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined whi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Rao, R.A., Mersereau, R.M.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Acoustic noise Automatic speech recognition Detectors Feature extraction Image edge detection Lips Mouth Nonlinear filters Shape Speech recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	590 vol.1
container_issue
container_start_page	587
container_title
container_volume	1
creator	Rao, R.A. Mersereau, R.M.
description	In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< >
doi_str_mv	10.1109/ACSSC.1994.471520
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_471520</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>471520</ieee_id><sourcerecordid>471520</sourcerecordid><originalsourceid>FETCH-LOGICAL-i174t-66c3e391caac719b842f5dfb95bda3a3206c94a9fb5376105fa1eb15157c69603</originalsourceid><addsrcrecordid>eNotj8tKw0AUQAcfYFr9AF1lLSS9d-48MssSrAqBLqrrMpnM1JE0CZkq-PcKdXV2h3MYu0coEcGs1vVuV5dojCiFRsnhgmVcalVwArpkC6iwUkqApCuWIciqUGTohi1S-gTgwCuesccmTvlx7Hwfh0Mexjn_junL9nmavHcf-ezdeBjiKY7DLbsOtk_-7p9L9r55eqtfimb7_FqvmyKiFqdCKUeeDDprnUbTVoIH2YXWyLazZImDckZYE1pJWv11BYu-RYlSO2UU0JI9nL3Re7-f5ni088_-vEi_4LtCZA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Lip modeling for visual speech recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Rao, R.A. ; Mersereau, R.M.</creator><creatorcontrib>Rao, R.A. ; Mersereau, R.M.</creatorcontrib><description>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< ></description><identifier>ISSN: 1058-6393</identifier><identifier>ISBN: 0818664053</identifier><identifier>ISBN: 9780818664052</identifier><identifier>EISSN: 2576-2303</identifier><identifier>DOI: 10.1109/ACSSC.1994.471520</identifier><language>eng</language><publisher>IEEE Comput. Soc. Press</publisher><subject>Acoustic noise ; Automatic speech recognition ; Detectors ; Feature extraction ; Image edge detection ; Lips ; Mouth ; Nonlinear filters ; Shape ; Speech recognition</subject><ispartof>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994, Vol.1, p.587-590 vol.1</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/471520$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/471520$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rao, R.A.</creatorcontrib><creatorcontrib>Mersereau, R.M.</creatorcontrib><title>Lip modeling for visual speech recognition</title><title>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers</title><addtitle>ACSSC</addtitle><description>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< ></description><subject>Acoustic noise</subject><subject>Automatic speech recognition</subject><subject>Detectors</subject><subject>Feature extraction</subject><subject>Image edge detection</subject><subject>Lips</subject><subject>Mouth</subject><subject>Nonlinear filters</subject><subject>Shape</subject><subject>Speech recognition</subject><issn>1058-6393</issn><issn>2576-2303</issn><isbn>0818664053</isbn><isbn>9780818664052</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1994</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj8tKw0AUQAcfYFr9AF1lLSS9d-48MssSrAqBLqrrMpnM1JE0CZkq-PcKdXV2h3MYu0coEcGs1vVuV5dojCiFRsnhgmVcalVwArpkC6iwUkqApCuWIciqUGTohi1S-gTgwCuesccmTvlx7Hwfh0Mexjn_junL9nmavHcf-ezdeBjiKY7DLbsOtk_-7p9L9r55eqtfimb7_FqvmyKiFqdCKUeeDDprnUbTVoIH2YXWyLazZImDckZYE1pJWv11BYu-RYlSO2UU0JI9nL3Re7-f5ni088_-vEi_4LtCZA</recordid><startdate>1994</startdate><enddate>1994</enddate><creator>Rao, R.A.</creator><creator>Mersereau, R.M.</creator><general>IEEE Comput. Soc. Press</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1994</creationdate><title>Lip modeling for visual speech recognition</title><author>Rao, R.A. ; Mersereau, R.M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i174t-66c3e391caac719b842f5dfb95bda3a3206c94a9fb5376105fa1eb15157c69603</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1994</creationdate><topic>Acoustic noise</topic><topic>Automatic speech recognition</topic><topic>Detectors</topic><topic>Feature extraction</topic><topic>Image edge detection</topic><topic>Lips</topic><topic>Mouth</topic><topic>Nonlinear filters</topic><topic>Shape</topic><topic>Speech recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Rao, R.A.</creatorcontrib><creatorcontrib>Mersereau, R.M.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rao, R.A.</au><au>Mersereau, R.M.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Lip modeling for visual speech recognition</atitle><btitle>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers</btitle><stitle>ACSSC</stitle><date>1994</date><risdate>1994</risdate><volume>1</volume><spage>587</spage><epage>590 vol.1</epage><pages>587-590 vol.1</pages><issn>1058-6393</issn><eissn>2576-2303</eissn><isbn>0818664053</isbn><isbn>9780818664052</isbn><abstract>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< ></abstract><pub>IEEE Comput. Soc. Press</pub><doi>10.1109/ACSSC.1994.471520</doi></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1058-6393
ispartof	Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994, Vol.1, p.587-590 vol.1
issn	1058-6393 2576-2303
language	eng
recordid	cdi_ieee_primary_471520
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Acoustic noise Automatic speech recognition Detectors Feature extraction Image edge detection Lips Mouth Nonlinear filters Shape Speech recognition
title	Lip modeling for visual speech recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T18%3A25%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Lip%20modeling%20for%20visual%20speech%20recognition&rft.btitle=Proceedings%20of%201994%2028th%20Asilomar%20Conference%20on%20Signals,%20Systems%20and%20Computers&rft.au=Rao,%20R.A.&rft.date=1994&rft.volume=1&rft.spage=587&rft.epage=590%20vol.1&rft.pages=587-590%20vol.1&rft.issn=1058-6393&rft.eissn=2576-2303&rft.isbn=0818664053&rft.isbn_list=9780818664052&rft_id=info:doi/10.1109/ACSSC.1994.471520&rft_dat=%3Cieee_6IE%3E471520%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=471520&rfr_iscdi=true