Device selection from audio data

This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device,...

Detailed description

Bibliographic details
Main authors: Torbert, Charles James, Shah, Deepak Uttam, Tennety, Vijay Shankar, Lan, Gang, Cherukuri, Venkata Snehith, Rachakonda, Ravi Kiran, Tavares, Joseph Pedro, Clawson, Mckay
Format: Patent
Language: eng
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Torbert, Charles James
Shah, Deepak Uttam
Tennety, Vijay Shankar
Lan, Gang
Cherukuri, Venkata Snehith
Rachakonda, Ravi Kiran
Tavares, Joseph Pedro
Clawson, Mckay
description This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US10685669B1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US10685669B1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US10685669B13</originalsourceid><addsrcrecordid>eNrjZFBwSS3LTE5VKE7NSU0uyczPU0grys9VSCxNycxXSEksSeRhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfGhwYYGZhamZmaWTobGxKgBAEFaJUE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Device selection from audio data</title><source>esp@cenet</source><creator>Torbert, Charles James ; Shah, Deepak Uttam ; Tennety, Vijay Shankar ; Lan, Gang ; Cherukuri, Venkata Snehith ; Rachakonda, Ravi Kiran ; Tavares, Joseph Pedro ; Clawson, Mckay</creator><creatorcontrib>Torbert, Charles James ; Shah, Deepak Uttam ; Tennety, Vijay Shankar ; Lan, Gang ; Cherukuri, Venkata Snehith ; Rachakonda, Ravi Kiran ; Tavares, Joseph Pedro ; Clawson, Mckay</creatorcontrib><description>This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. 
The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.</description><language>eng</language><subject>ACOUSTICS ; CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2020</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200616&amp;DB=EPODOC&amp;CC=US&amp;NR=10685669B1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25544,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200616&amp;DB=EPODOC&amp;CC=US&amp;NR=10685669B1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Torbert, Charles James</creatorcontrib><creatorcontrib>Shah, Deepak Uttam</creatorcontrib><creatorcontrib>Tennety, Vijay Shankar</creatorcontrib><creatorcontrib>Lan, Gang</creatorcontrib><creatorcontrib>Cherukuri, Venkata Snehith</creatorcontrib><creatorcontrib>Rachakonda, Ravi Kiran</creatorcontrib><creatorcontrib>Tavares, Joseph Pedro</creatorcontrib><creatorcontrib>Clawson, Mckay</creatorcontrib><title>Device selection from audio data</title><description>This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. 
A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.</description><subject>ACOUSTICS</subject><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2020</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZFBwSS3LTE5VKE7NSU0uyczPU0grys9VSCxNycxXSEksSeRhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfGhwYYGZhamZmaWTobGxKgBAEFaJUE</recordid><startdate>20200616</startdate><enddate>20200616</enddate><creator>Torbert, Charles James</creator><creator>Shah, Deepak Uttam</creator><creator>Tennety, Vijay Shankar</creator><creator>Lan, Gang</creator><creator>Cherukuri, Venkata Snehith</creator><creator>Rachakonda, Ravi Kiran</creator><creator>Tavares, Joseph Pedro</creator><creator>Clawson, Mckay</creator><scope>EVB</scope></search><sort><creationdate>20200616</creationdate><title>Device selection from audio data</title><author>Torbert, Charles James ; Shah, Deepak Uttam ; Tennety, Vijay Shankar ; Lan, 
Gang ; Cherukuri, Venkata Snehith ; Rachakonda, Ravi Kiran ; Tavares, Joseph Pedro ; Clawson, Mckay</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US10685669B13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2020</creationdate><topic>ACOUSTICS</topic><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>Torbert, Charles James</creatorcontrib><creatorcontrib>Shah, Deepak Uttam</creatorcontrib><creatorcontrib>Tennety, Vijay Shankar</creatorcontrib><creatorcontrib>Lan, Gang</creatorcontrib><creatorcontrib>Cherukuri, Venkata Snehith</creatorcontrib><creatorcontrib>Rachakonda, Ravi Kiran</creatorcontrib><creatorcontrib>Tavares, Joseph Pedro</creatorcontrib><creatorcontrib>Clawson, Mckay</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Torbert, Charles James</au><au>Shah, Deepak Uttam</au><au>Tennety, Vijay Shankar</au><au>Lan, Gang</au><au>Cherukuri, Venkata Snehith</au><au>Rachakonda, Ravi Kiran</au><au>Tavares, Joseph Pedro</au><au>Clawson, Mckay</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Device selection from audio data</title><date>2020-06-16</date><risdate>2020</risdate><abstract>This disclosure describes techniques for identifying a voice-enabled device from a group of voice-enabled devices to respond to a speech utterance of a user. 
A speech-processing system may receive an audio signal representing the speech utterance captured in an environment of a voice-enabled device, and identify another voice-enabled device located in the environment. The system may analyze the audio signal using a different natural-language-understanding model for each of the voice-enabled devices to identify an intent for each of the voice-enabled devices to respond to the speech utterance. The system may determine confidence scores that the intents are responsive to the speech utterance, and select the intent with the highest confidence score. The system may use the selected intent to generate a command for the corresponding voice-enabled device to respond to the user.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US10685669B1
source esp@cenet
subjects ACOUSTICS
CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title Device selection from audio data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T21%3A45%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Torbert,%20Charles%20James&rft.date=2020-06-16&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS10685669B1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
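
The selection step summarized in the description field (one natural-language-understanding model per device, a confidence score per resulting intent, and a command built from the highest-scoring intent) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `IntentHypothesis` type, the `select_device_intent` and `build_command` functions, the toy keyword models, and the assumption that the audio signal has already been transcribed to text.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class IntentHypothesis:
    """One candidate interpretation of the utterance for one device (hypothetical type)."""
    device_id: str
    intent: str
    confidence: float


# Assumption: an NLU model is any callable mapping transcribed text
# to an (intent, confidence) pair. Real systems operate on richer inputs.
NluModel = Callable[[str], Tuple[str, float]]


def select_device_intent(utterance: str, models: Dict[str, NluModel]) -> IntentHypothesis:
    """Run each device's own model and keep the highest-confidence intent."""
    if not models:
        raise ValueError("at least one device model is required")
    hypotheses = []
    for device_id, model in models.items():
        intent, confidence = model(utterance)
        hypotheses.append(IntentHypothesis(device_id, intent, confidence))
    # On a tie, max() keeps the first hypothesis in iteration order.
    return max(hypotheses, key=lambda h: h.confidence)


def build_command(hypothesis: IntentHypothesis) -> Dict[str, str]:
    """Turn the selected intent into a command for the corresponding device."""
    return {"target_device": hypothesis.device_id, "action": hypothesis.intent}


# Toy stand-ins for per-device models; the scores are made up for illustration.
def speaker_model(utterance: str) -> Tuple[str, float]:
    return ("PlayMusic", 0.9) if "play" in utterance.lower() else ("Unknown", 0.1)


def tv_model(utterance: str) -> Tuple[str, float]:
    return ("PlayVideo", 0.6) if "play" in utterance.lower() else ("Unknown", 0.1)


if __name__ == "__main__":
    best = select_device_intent(
        "Play some jazz", {"speaker": speaker_model, "tv": tv_model}
    )
    print(build_command(best))
```

The sketch keeps scoring and command generation separate so that the per-device models can be swapped without touching the selection logic; how the patented system actually computes or compares confidence scores is not specified in this record.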