A speech recognition system using a plurality of acoustic models which share probability distributions

A speech processing method comprising: receiving a speech input comprising a sequence of observations; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a first acoustic model set with a first dictionary; determining the likelihood of a sequence o...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	MATTHEW STUTTLE, CATHERINE BRESLIN, KATE KNILL
Format:	Patent
Sprache:	eng
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	MATTHEW STUTTLE CATHERINE BRESLIN KATE KNILL
description	A speech processing method comprising: receiving a speech input comprising a sequence of observations; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a first acoustic model set with a first dictionary; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a second acoustic model set with a second dictionary; and outputting text determined from said first and second acoustic models; wherein each model uses a plurality of pre-calculated probability distributions to determine the said likelihood and wherein probability distributions are shared between the models. The probability distributions may be Gaussian probability density functions associated with an acoustic model which may be a Hidden Markov Model (HMM). The acoustic models may be a phoneme model and a grapheme model. The observations may be converted into an n-dimensional feature vector in an n dimensional space which is then further converted into a plurality of sub-vectors each within a reduced dimension subspace of said n-dimensional space and said shared probability distributions may have been pre-calculated for said sub-spaces.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_GB2465383A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>GB2465383A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_GB2465383A3</originalsourceid><addsrcrecordid>eNqFjDkOwjAQANNQIOAN7AdoMCDagDgeQB9tnE28kuO1vLZQfs8heqppZmZe9TVoJLIOElkZAmeWADppphGKchgAIfqS0HOeQHpAK0UzWxilI6_wdPyu1WEiiElabPmrdqw5cVs-Q11Wsx690urHRbW-Xh7n-4aiNKQRLQXKze203R325mhq8994AcglP5E</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>A speech recognition system using a plurality of acoustic models which share probability distributions</title><source>esp@cenet</source><creator>MATTHEW STUTTLE ; CATHERINE BRESLIN ; KATE KNILL</creator><creatorcontrib>MATTHEW STUTTLE ; CATHERINE BRESLIN ; KATE KNILL</creatorcontrib><description>A speech processing method comprising: receiving a speech input comprising a sequence of observations; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a first acoustic model set with a first dictionary; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a second acoustic model set with a second dictionary; and outputting text determined from said first and second acoustic models; wherein each model uses a plurality of pre-calculated probability distributions to determine the said likelihood and wherein probability distributions are shared between the models. The probability distributions may be Gaussian probability density functions associated with an acoustic model which may be a Hidden Markov Model (HMM). The acoustic models may be a phoneme model and a grapheme model. The observations may be converted into an n-dimensional feature vector in an n dimensional space which is then further converted into a plurality of sub-vectors each within a reduced dimension subspace of said n-dimensional space and said shared probability distributions may have been pre-calculated for said sub-spaces.</description><language>eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2010</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20100519&DB=EPODOC&CC=GB&NR=2465383A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20100519&DB=EPODOC&CC=GB&NR=2465383A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>MATTHEW STUTTLE</creatorcontrib><creatorcontrib>CATHERINE BRESLIN</creatorcontrib><creatorcontrib>KATE KNILL</creatorcontrib><title>A speech recognition system using a plurality of acoustic models which share probability distributions</title><description>A speech processing method comprising: receiving a speech input comprising a sequence of observations; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a first acoustic model set with a first dictionary; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a second acoustic model set with a second dictionary; and outputting text determined from said first and second acoustic models; wherein each model uses a plurality of pre-calculated probability distributions to determine the said likelihood and wherein probability distributions are shared between the models. The probability distributions may be Gaussian probability density functions associated with an acoustic model which may be a Hidden Markov Model (HMM). The acoustic models may be a phoneme model and a grapheme model. The observations may be converted into an n-dimensional feature vector in an n dimensional space which is then further converted into a plurality of sub-vectors each within a reduced dimension subspace of said n-dimensional space and said shared probability distributions may have been pre-calculated for said sub-spaces.</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2010</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqFjDkOwjAQANNQIOAN7AdoMCDagDgeQB9tnE28kuO1vLZQfs8heqppZmZe9TVoJLIOElkZAmeWADppphGKchgAIfqS0HOeQHpAK0UzWxilI6_wdPyu1WEiiElabPmrdqw5cVs-Q11Wsx690urHRbW-Xh7n-4aiNKQRLQXKze203R325mhq8994AcglP5E</recordid><startdate>20100519</startdate><enddate>20100519</enddate><creator>MATTHEW STUTTLE</creator><creator>CATHERINE BRESLIN</creator><creator>KATE KNILL</creator><scope>EVB</scope></search><sort><creationdate>20100519</creationdate><title>A speech recognition system using a plurality of acoustic models which share probability distributions</title><author>MATTHEW STUTTLE ; CATHERINE BRESLIN ; KATE KNILL</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_GB2465383A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2010</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>MATTHEW STUTTLE</creatorcontrib><creatorcontrib>CATHERINE BRESLIN</creatorcontrib><creatorcontrib>KATE KNILL</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>MATTHEW STUTTLE</au><au>CATHERINE BRESLIN</au><au>KATE KNILL</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>A speech recognition system using a plurality of acoustic models which share probability distributions</title><date>2010-05-19</date><risdate>2010</risdate><abstract>A speech processing method comprising: receiving a speech input comprising a sequence of observations; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a first acoustic model set with a first dictionary; determining the likelihood of a sequence of observations corresponding to a word or part thereof using a second acoustic model set with a second dictionary; and outputting text determined from said first and second acoustic models; wherein each model uses a plurality of pre-calculated probability distributions to determine the said likelihood and wherein probability distributions are shared between the models. The probability distributions may be Gaussian probability density functions associated with an acoustic model which may be a Hidden Markov Model (HMM). The acoustic models may be a phoneme model and a grapheme model. The observations may be converted into an n-dimensional feature vector in an n dimensional space which is then further converted into a plurality of sub-vectors each within a reduced dimension subspace of said n-dimensional space and said shared probability distributions may have been pre-calculated for said sub-spaces.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_GB2465383A
source	esp@cenet
subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	A speech recognition system using a plurality of acoustic models which share probability distributions
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T15%3A12%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=MATTHEW%20STUTTLE&rft.date=2010-05-19&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EGB2465383A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true