Scalable and embedded codec for speech and audio signals

A system and method for processing of audio and speech signals is disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal in different portions, at least one of whic...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	WANG WEI, DUNN ROBERT B, CAMPANA DAVID A, MCAULAY ROBERT J, CHEN JUIN-HWEY, AGUILAR JOSEPH GERARD, SUN XIAOQUIN, WATKINS CRAIG, ZOPF ROBERT W
Format:	Patent
Sprache:	eng
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	WANG WEI DUNN ROBERT B CAMPANA DAVID A MCAULAY ROBERT J CHEN JUIN-HWEY AGUILAR JOSEPH GERARD SUN XIAOQUIN WATKINS CRAIG ZOPF ROBERT W
description	A system and method for processing of audio and speech signals is disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal in different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal. The analyzer also encodes separate information about other portions of the signal in an embedded manner, so that a smooth transition can be achieved from low bit-rate to high bit-rate applications. Accordingly, communication devices operating at different sampling rates and/or bit-rates can extract corresponding information from the output bit stream of the analyzer. In the present invention embedded information generally relates to separate parameters of the input signal, or to additional resolution in the transmission of original signal parameters. Non-linear techniques for enhancing the overall performance of the system are also disclosed. Also disclosed is a novel method of improving the quantization of signal parameters. In a specific embodiment the input signal is processed in two or more modes dependent on the state of the signal in a frame. When the signal is determined to be in a transition state, the encoder provides phase information about N sinusoids, which the decoder end uses to improve the quality of the output signal at low bit rates.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US9047865B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US9047865B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US9047865B23</originalsourceid><addsrcrecordid>eNrjZLAITk7MSUzKSVVIzEtRSM1NSk1JSU1RSM5PSU1WSMsvUiguSE1NzgDLJpamZOYrFGem5yXmFPMwsKYBqVReKM3NoODmGuLsoZtakB-fWlyQmJyal1oSHxpsaWBibmFm6mRkTIQSAFtdLWY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Scalable and embedded codec for speech and audio signals</title><source>esp@cenet</source><creator>WANG WEI ; DUNN ROBERT B ; CAMPANA DAVID A ; MCAULAY ROBERT J ; CHEN JUIN-HWEY ; AGUILAR JOSEPH GERARD ; SUN XIAOQUIN ; WATKINS CRAIG ; ZOPF ROBERT W</creator><creatorcontrib>WANG WEI ; DUNN ROBERT B ; CAMPANA DAVID A ; MCAULAY ROBERT J ; CHEN JUIN-HWEY ; AGUILAR JOSEPH GERARD ; SUN XIAOQUIN ; WATKINS CRAIG ; ZOPF ROBERT W</creatorcontrib><description>A system and method for processing of audio and speech signals is disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal in different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal. The analyzer also encodes separate information about other portions of the signal in an embedded manner, so that a smooth transition can be achieved from low bit-rate to high bit-rate applications. Accordingly, communication devices operating at different sampling rates and/or bit-rates can extract corresponding information from the output bit stream of the analyzer. In the present invention embedded information generally relates to separate parameters of the input signal, or to additional resolution in the transmission of original signal parameters. Non-linear techniques for enhancing the overall performance of the system are also disclosed. Also disclosed is a novel method of improving the quantization of signal parameters. In a specific embodiment the input signal is processed in two or more modes dependent on the state of the signal in a frame. When the signal is determined to be in a transition state, the encoder provides phase information about N sinusoids, which the decoder end uses to improve the quality of the output signal at low bit rates.</description><language>eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2015</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20150602&DB=EPODOC&CC=US&NR=9047865B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20150602&DB=EPODOC&CC=US&NR=9047865B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>WANG WEI</creatorcontrib><creatorcontrib>DUNN ROBERT B</creatorcontrib><creatorcontrib>CAMPANA DAVID A</creatorcontrib><creatorcontrib>MCAULAY ROBERT J</creatorcontrib><creatorcontrib>CHEN JUIN-HWEY</creatorcontrib><creatorcontrib>AGUILAR JOSEPH GERARD</creatorcontrib><creatorcontrib>SUN XIAOQUIN</creatorcontrib><creatorcontrib>WATKINS CRAIG</creatorcontrib><creatorcontrib>ZOPF ROBERT W</creatorcontrib><title>Scalable and embedded codec for speech and audio signals</title><description>A system and method for processing of audio and speech signals is disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal in different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal. The analyzer also encodes separate information about other portions of the signal in an embedded manner, so that a smooth transition can be achieved from low bit-rate to high bit-rate applications. Accordingly, communication devices operating at different sampling rates and/or bit-rates can extract corresponding information from the output bit stream of the analyzer. In the present invention embedded information generally relates to separate parameters of the input signal, or to additional resolution in the transmission of original signal parameters. Non-linear techniques for enhancing the overall performance of the system are also disclosed. Also disclosed is a novel method of improving the quantization of signal parameters. In a specific embodiment the input signal is processed in two or more modes dependent on the state of the signal in a frame. When the signal is determined to be in a transition state, the encoder provides phase information about N sinusoids, which the decoder end uses to improve the quality of the output signal at low bit rates.</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2015</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLAITk7MSUzKSVVIzEtRSM1NSk1JSU1RSM5PSU1WSMsvUiguSE1NzgDLJpamZOYrFGem5yXmFPMwsKYBqVReKM3NoODmGuLsoZtakB-fWlyQmJyal1oSHxpsaWBibmFm6mRkTIQSAFtdLWY</recordid><startdate>20150602</startdate><enddate>20150602</enddate><creator>WANG WEI</creator><creator>DUNN ROBERT B</creator><creator>CAMPANA DAVID A</creator><creator>MCAULAY ROBERT J</creator><creator>CHEN JUIN-HWEY</creator><creator>AGUILAR JOSEPH GERARD</creator><creator>SUN XIAOQUIN</creator><creator>WATKINS CRAIG</creator><creator>ZOPF ROBERT W</creator><scope>EVB</scope></search><sort><creationdate>20150602</creationdate><title>Scalable and embedded codec for speech and audio signals</title><author>WANG WEI ; DUNN ROBERT B ; CAMPANA DAVID A ; MCAULAY ROBERT J ; CHEN JUIN-HWEY ; AGUILAR JOSEPH GERARD ; SUN XIAOQUIN ; WATKINS CRAIG ; ZOPF ROBERT W</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US9047865B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2015</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>WANG WEI</creatorcontrib><creatorcontrib>DUNN ROBERT B</creatorcontrib><creatorcontrib>CAMPANA DAVID A</creatorcontrib><creatorcontrib>MCAULAY ROBERT J</creatorcontrib><creatorcontrib>CHEN JUIN-HWEY</creatorcontrib><creatorcontrib>AGUILAR JOSEPH GERARD</creatorcontrib><creatorcontrib>SUN XIAOQUIN</creatorcontrib><creatorcontrib>WATKINS CRAIG</creatorcontrib><creatorcontrib>ZOPF ROBERT W</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WANG WEI</au><au>DUNN ROBERT B</au><au>CAMPANA DAVID A</au><au>MCAULAY ROBERT J</au><au>CHEN JUIN-HWEY</au><au>AGUILAR JOSEPH GERARD</au><au>SUN XIAOQUIN</au><au>WATKINS CRAIG</au><au>ZOPF ROBERT W</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Scalable and embedded codec for speech and audio signals</title><date>2015-06-02</date><risdate>2015</risdate><abstract>A system and method for processing of audio and speech signals is disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal in different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal. The analyzer also encodes separate information about other portions of the signal in an embedded manner, so that a smooth transition can be achieved from low bit-rate to high bit-rate applications. Accordingly, communication devices operating at different sampling rates and/or bit-rates can extract corresponding information from the output bit stream of the analyzer. In the present invention embedded information generally relates to separate parameters of the input signal, or to additional resolution in the transmission of original signal parameters. Non-linear techniques for enhancing the overall performance of the system are also disclosed. Also disclosed is a novel method of improving the quantization of signal parameters. In a specific embodiment the input signal is processed in two or more modes dependent on the state of the signal in a frame. When the signal is determined to be in a transition state, the encoder provides phase information about N sinusoids, which the decoder end uses to improve the quality of the output signal at low bit rates.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_US9047865B2
source	esp@cenet
subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	Scalable and embedded codec for speech and audio signals
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T05%3A13%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=WANG%20WEI&rft.date=2015-06-02&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS9047865B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true