Lip modeling for visual speech recognition

In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined whi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Rao, R.A., Mersereau, R.M.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 590 vol.1
container_issue
container_start_page 587
container_title
container_volume 1
creator Rao, R.A.
Mersereau, R.M.
description In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< >
doi_str_mv 10.1109/ACSSC.1994.471520
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_471520</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>471520</ieee_id><sourcerecordid>471520</sourcerecordid><originalsourceid>FETCH-LOGICAL-i174t-66c3e391caac719b842f5dfb95bda3a3206c94a9fb5376105fa1eb15157c69603</originalsourceid><addsrcrecordid>eNotj8tKw0AUQAcfYFr9AF1lLSS9d-48MssSrAqBLqrrMpnM1JE0CZkq-PcKdXV2h3MYu0coEcGs1vVuV5dojCiFRsnhgmVcalVwArpkC6iwUkqApCuWIciqUGTohi1S-gTgwCuesccmTvlx7Hwfh0Mexjn_junL9nmavHcf-ezdeBjiKY7DLbsOtk_-7p9L9r55eqtfimb7_FqvmyKiFqdCKUeeDDprnUbTVoIH2YXWyLazZImDckZYE1pJWv11BYu-RYlSO2UU0JI9nL3Re7-f5ni088_-vEi_4LtCZA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Lip modeling for visual speech recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Rao, R.A. ; Mersereau, R.M.</creator><creatorcontrib>Rao, R.A. ; Mersereau, R.M.</creatorcontrib><description>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.&lt; &gt;</description><identifier>ISSN: 1058-6393</identifier><identifier>ISBN: 0818664053</identifier><identifier>ISBN: 9780818664052</identifier><identifier>EISSN: 2576-2303</identifier><identifier>DOI: 10.1109/ACSSC.1994.471520</identifier><language>eng</language><publisher>IEEE Comput. Soc. Press</publisher><subject>Acoustic noise ; Automatic speech recognition ; Detectors ; Feature extraction ; Image edge detection ; Lips ; Mouth ; Nonlinear filters ; Shape ; Speech recognition</subject><ispartof>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994, Vol.1, p.587-590 vol.1</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/471520$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/471520$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rao, R.A.</creatorcontrib><creatorcontrib>Mersereau, R.M.</creatorcontrib><title>Lip modeling for visual speech recognition</title><title>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers</title><addtitle>ACSSC</addtitle><description>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.&lt; &gt;</description><subject>Acoustic noise</subject><subject>Automatic speech recognition</subject><subject>Detectors</subject><subject>Feature extraction</subject><subject>Image edge detection</subject><subject>Lips</subject><subject>Mouth</subject><subject>Nonlinear filters</subject><subject>Shape</subject><subject>Speech recognition</subject><issn>1058-6393</issn><issn>2576-2303</issn><isbn>0818664053</isbn><isbn>9780818664052</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1994</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj8tKw0AUQAcfYFr9AF1lLSS9d-48MssSrAqBLqrrMpnM1JE0CZkq-PcKdXV2h3MYu0coEcGs1vVuV5dojCiFRsnhgmVcalVwArpkC6iwUkqApCuWIciqUGTohi1S-gTgwCuesccmTvlx7Hwfh0Mexjn_junL9nmavHcf-ezdeBjiKY7DLbsOtk_-7p9L9r55eqtfimb7_FqvmyKiFqdCKUeeDDprnUbTVoIH2YXWyLazZImDckZYE1pJWv11BYu-RYlSO2UU0JI9nL3Re7-f5ni088_-vEi_4LtCZA</recordid><startdate>1994</startdate><enddate>1994</enddate><creator>Rao, R.A.</creator><creator>Mersereau, R.M.</creator><general>IEEE Comput. Soc. Press</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1994</creationdate><title>Lip modeling for visual speech recognition</title><author>Rao, R.A. ; Mersereau, R.M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i174t-66c3e391caac719b842f5dfb95bda3a3206c94a9fb5376105fa1eb15157c69603</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1994</creationdate><topic>Acoustic noise</topic><topic>Automatic speech recognition</topic><topic>Detectors</topic><topic>Feature extraction</topic><topic>Image edge detection</topic><topic>Lips</topic><topic>Mouth</topic><topic>Nonlinear filters</topic><topic>Shape</topic><topic>Speech recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Rao, R.A.</creatorcontrib><creatorcontrib>Mersereau, R.M.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rao, R.A.</au><au>Mersereau, R.M.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Lip modeling for visual speech recognition</atitle><btitle>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers</btitle><stitle>ACSSC</stitle><date>1994</date><risdate>1994</risdate><volume>1</volume><spage>587</spage><epage>590 vol.1</epage><pages>587-590 vol.1</pages><issn>1058-6393</issn><eissn>2576-2303</eissn><isbn>0818664053</isbn><isbn>9780818664052</isbn><abstract>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.&lt; &gt;</abstract><pub>IEEE Comput. Soc. Press</pub><doi>10.1109/ACSSC.1994.471520</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1058-6393
ispartof Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994, Vol.1, p.587-590 vol.1
issn 1058-6393
2576-2303
language eng
recordid cdi_ieee_primary_471520
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Acoustic noise
Automatic speech recognition
Detectors
Feature extraction
Image edge detection
Lips
Mouth
Nonlinear filters
Shape
Speech recognition
title Lip modeling for visual speech recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T18%3A25%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Lip%20modeling%20for%20visual%20speech%20recognition&rft.btitle=Proceedings%20of%201994%2028th%20Asilomar%20Conference%20on%20Signals,%20Systems%20and%20Computers&rft.au=Rao,%20R.A.&rft.date=1994&rft.volume=1&rft.spage=587&rft.epage=590%20vol.1&rft.pages=587-590%20vol.1&rft.issn=1058-6393&rft.eissn=2576-2303&rft.isbn=0818664053&rft.isbn_list=9780818664052&rft_id=info:doi/10.1109/ACSSC.1994.471520&rft_dat=%3Cieee_6IE%3E471520%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=471520&rfr_iscdi=true