Lip modeling for visual speech recognition
In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined whi...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 590 vol.1 |
---|---|
container_issue | |
container_start_page | 587 |
container_title | |
container_volume | 1 |
creator | Rao, R.A. Mersereau, R.M. |
description | In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< > |
doi_str_mv | 10.1109/ACSSC.1994.471520 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_471520</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>471520</ieee_id><sourcerecordid>471520</sourcerecordid><originalsourceid>FETCH-LOGICAL-i174t-66c3e391caac719b842f5dfb95bda3a3206c94a9fb5376105fa1eb15157c69603</originalsourceid><addsrcrecordid>eNotj8tKw0AUQAcfYFr9AF1lLSS9d-48MssSrAqBLqrrMpnM1JE0CZkq-PcKdXV2h3MYu0coEcGs1vVuV5dojCiFRsnhgmVcalVwArpkC6iwUkqApCuWIciqUGTohi1S-gTgwCuesccmTvlx7Hwfh0Mexjn_junL9nmavHcf-ezdeBjiKY7DLbsOtk_-7p9L9r55eqtfimb7_FqvmyKiFqdCKUeeDDprnUbTVoIH2YXWyLazZImDckZYE1pJWv11BYu-RYlSO2UU0JI9nL3Re7-f5ni088_-vEi_4LtCZA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Lip modeling for visual speech recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Rao, R.A. ; Mersereau, R.M.</creator><creatorcontrib>Rao, R.A. ; Mersereau, R.M.</creatorcontrib><description>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< ></description><identifier>ISSN: 1058-6393</identifier><identifier>ISBN: 0818664053</identifier><identifier>ISBN: 9780818664052</identifier><identifier>EISSN: 2576-2303</identifier><identifier>DOI: 10.1109/ACSSC.1994.471520</identifier><language>eng</language><publisher>IEEE Comput. Soc. Press</publisher><subject>Acoustic noise ; Automatic speech recognition ; Detectors ; Feature extraction ; Image edge detection ; Lips ; Mouth ; Nonlinear filters ; Shape ; Speech recognition</subject><ispartof>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994, Vol.1, p.587-590 vol.1</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/471520$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/471520$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Rao, R.A.</creatorcontrib><creatorcontrib>Mersereau, R.M.</creatorcontrib><title>Lip modeling for visual speech recognition</title><title>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers</title><addtitle>ACSSC</addtitle><description>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< ></description><subject>Acoustic noise</subject><subject>Automatic speech recognition</subject><subject>Detectors</subject><subject>Feature extraction</subject><subject>Image edge detection</subject><subject>Lips</subject><subject>Mouth</subject><subject>Nonlinear filters</subject><subject>Shape</subject><subject>Speech recognition</subject><issn>1058-6393</issn><issn>2576-2303</issn><isbn>0818664053</isbn><isbn>9780818664052</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1994</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj8tKw0AUQAcfYFr9AF1lLSS9d-48MssSrAqBLqrrMpnM1JE0CZkq-PcKdXV2h3MYu0coEcGs1vVuV5dojCiFRsnhgmVcalVwArpkC6iwUkqApCuWIciqUGTohi1S-gTgwCuesccmTvlx7Hwfh0Mexjn_junL9nmavHcf-ezdeBjiKY7DLbsOtk_-7p9L9r55eqtfimb7_FqvmyKiFqdCKUeeDDprnUbTVoIH2YXWyLazZImDckZYE1pJWv11BYu-RYlSO2UU0JI9nL3Re7-f5ni088_-vEi_4LtCZA</recordid><startdate>1994</startdate><enddate>1994</enddate><creator>Rao, R.A.</creator><creator>Mersereau, R.M.</creator><general>IEEE Comput. Soc. Press</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1994</creationdate><title>Lip modeling for visual speech recognition</title><author>Rao, R.A. ; Mersereau, R.M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i174t-66c3e391caac719b842f5dfb95bda3a3206c94a9fb5376105fa1eb15157c69603</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1994</creationdate><topic>Acoustic noise</topic><topic>Automatic speech recognition</topic><topic>Detectors</topic><topic>Feature extraction</topic><topic>Image edge detection</topic><topic>Lips</topic><topic>Mouth</topic><topic>Nonlinear filters</topic><topic>Shape</topic><topic>Speech recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Rao, R.A.</creatorcontrib><creatorcontrib>Mersereau, R.M.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rao, R.A.</au><au>Mersereau, R.M.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Lip modeling for visual speech recognition</atitle><btitle>Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers</btitle><stitle>ACSSC</stitle><date>1994</date><risdate>1994</risdate><volume>1</volume><spage>587</spage><epage>590 vol.1</epage><pages>587-590 vol.1</pages><issn>1058-6393</issn><eissn>2576-2303</eissn><isbn>0818664053</isbn><isbn>9780818664052</isbn><abstract>In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.< ></abstract><pub>IEEE Comput. Soc. Press</pub><doi>10.1109/ACSSC.1994.471520</doi></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1058-6393 |
ispartof | Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers, 1994, Vol.1, p.587-590 vol.1 |
issn | 1058-6393 2576-2303 |
language | eng |
recordid | cdi_ieee_primary_471520 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Acoustic noise Automatic speech recognition Detectors Feature extraction Image edge detection Lips Mouth Nonlinear filters Shape Speech recognition |
title | Lip modeling for visual speech recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T18%3A25%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Lip%20modeling%20for%20visual%20speech%20recognition&rft.btitle=Proceedings%20of%201994%2028th%20Asilomar%20Conference%20on%20Signals,%20Systems%20and%20Computers&rft.au=Rao,%20R.A.&rft.date=1994&rft.volume=1&rft.spage=587&rft.epage=590%20vol.1&rft.pages=587-590%20vol.1&rft.issn=1058-6393&rft.eissn=2576-2303&rft.isbn=0818664053&rft.isbn_list=9780818664052&rft_id=info:doi/10.1109/ACSSC.1994.471520&rft_dat=%3Cieee_6IE%3E471520%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=471520&rfr_iscdi=true |