CUSTOM DISPLAY POST PROCESSING IN SPEECH RECOGNITION

Solutions for custom display post processing (DPP) in speech recognition (SR) use a customized multi-stage DPP pipeline that transforms a stream of SR tokens from lexical form to display form. A first transformation stage of the DPP pipeline receives the stream of tokens, in turn, by an upstream fil...

Bibliographic details
Main authors: RAHMEL, Heiko Willy, CHANG, Shuangyu, ZHAO, Che, BEHRE, Piyush, KIBRE, Nicholas, SHAHID, Khuram, VARADHARAJAN, Padma, LIN, Edward C, LIU, Wei
Format: Patent
Language: eng
creator RAHMEL, Heiko Willy
CHANG, Shuangyu
ZHAO, Che
BEHRE, Piyush
KIBRE, Nicholas
SHAHID, Khuram
VARADHARAJAN, Padma
LIN, Edward C
LIU, Wei
description Solutions for custom display post processing (DPP) in speech recognition (SR) use a customized multi-stage DPP pipeline that transforms a stream of SR tokens from lexical form to display form. A first transformation stage of the DPP pipeline receives the stream of tokens, in turn, by an upstream filter, a base model stage, and a downstream filter, and transforms a first aspect of the stream of tokens (e.g., disfluency, inverse text normalization (ITN), capitalization, etc.) from lexical form into display form. The upstream filter and/or the downstream filter alter the stream of tokens to change the default behavior of the DPP pipeline into custom behavior. Additional transformation stages of the DPP pipeline perform further transforms, allowing for outputting final text in a display format that is customized for a specific user. This permits each user to efficiently leverage a common baseline DPP pipeline to produce a custom output.
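The stage structure described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `Stage`, the function `run_pipeline`, the toy rule-based "base models", and the `merge_product_name` filter are all hypothetical names introduced here for illustration. The sketch only assumes what the abstract states, namely that each transformation stage passes tokens through an upstream filter, a base model stage, and a downstream filter, and that custom filters change the default behavior.

```python
from dataclasses import dataclass
from typing import Callable, List

Tokens = List[str]
Step = Callable[[Tokens], Tokens]


def identity(tokens: Tokens) -> Tokens:
    """Default filter: pass the token stream through unchanged."""
    return list(tokens)


@dataclass
class Stage:
    """One transformation stage: upstream filter -> base model -> downstream filter."""
    base_model: Step
    upstream: Step = identity
    downstream: Step = identity

    def run(self, tokens: Tokens) -> Tokens:
        return self.downstream(self.base_model(self.upstream(tokens)))


def run_pipeline(stages: List[Stage], tokens: Tokens) -> Tokens:
    """Feed the token stream through every stage in order."""
    for stage in stages:
        tokens = stage.run(tokens)
    return tokens


# Toy stand-ins for the base models (assumptions, not real SR models).
def remove_disfluency(tokens: Tokens) -> Tokens:
    return [t for t in tokens if t not in {"um", "uh"}]


NUMBER_WORDS = {"one": "1", "two": "2", "three": "3"}


def inverse_text_normalize(tokens: Tokens) -> Tokens:
    return [NUMBER_WORDS.get(t, t) for t in tokens]


def capitalize_first(tokens: Tokens) -> Tokens:
    if not tokens:
        return []
    return [tokens[0][:1].upper() + tokens[0][1:]] + list(tokens[1:])


# Hypothetical user-specific upstream filter: merge "one drive" into a single
# token so the ITN base model does not rewrite "one" as "1".
def merge_product_name(tokens: Tokens) -> Tokens:
    out, i = [], 0
    while i < len(tokens):
        if tokens[i:i + 2] == ["one", "drive"]:
            out.append("OneDrive")
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out


DEFAULT_STAGES = [
    Stage(remove_disfluency),
    Stage(inverse_text_normalize),
    Stage(capitalize_first),
]

CUSTOM_STAGES = [
    Stage(remove_disfluency),
    Stage(inverse_text_normalize, upstream=merge_product_name),
    Stage(capitalize_first),
]

lexical = "um open one drive and send two reports".split()
print(" ".join(run_pipeline(DEFAULT_STAGES, lexical)))  # Open 1 drive and send 2 reports
print(" ".join(run_pipeline(CUSTOM_STAGES, lexical)))   # Open OneDrive and send 2 reports
```

Both pipelines share the same base models; only the filter attached to the ITN stage differs, which mirrors the abstract's point that each user can reuse a common baseline pipeline and still obtain custom output. A downstream filter would be attached the same way through the `downstream` argument.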
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2024403539A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2024403539A1</sourcerecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>CUSTOM DISPLAY POST PROCESSING IN SPEECH RECOGNITION</title><source>esp@cenet</source><language>eng</language><creationdate>2024</creationdate><oa>free_for_read</oa></display><links><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20241205&amp;DB=EPODOC&amp;CC=US&amp;NR=2024403539A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml></links><addata><format>patent</format><genre>patent</genre><date>2024-12-05</date><risdate>2024</risdate><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
language eng
recordid cdi_epo_espacenet_US2024403539A1
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title CUSTOM DISPLAY POST PROCESSING IN SPEECH RECOGNITION
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T16%3A18%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=RAHMEL,%20Heiko%20Willy&rft.date=2024-12-05&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2024403539A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true