Boosting Gaussian mixtures in an LVCSR system

In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale spe...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zweig, G., Padmanabhan, M.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Acoustic applications Acoustic testing Boosting Error analysis Large-scale systems Neural networks Probability distribution Speech recognition System testing Voice mail
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1530 vol.3
container_issue
container_start_page	1527
container_title
container_volume	3
creator	Zweig, G. Padmanabhan, M.
description	In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.
doi_str_mv	10.1109/ICASSP.2000.861945
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_861945</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>861945</ieee_id><sourcerecordid>861945</sourcerecordid><originalsourceid>FETCH-LOGICAL-i172t-6fc75da3016f9a133b4dcfa2a8f6b3b3bb93e6c636b3fca719aea421f40851f13</originalsourceid><addsrcrecordid>eNotT9tKxDAUDF7Auu4P7FN-IDUnSXN51KKrUFCsim_LaTeRiO1K0wX37w2szMAwDzPMELICXgJwd_1Y37Ttcyk456XV4FR1QgohjWPg-McpWTpjeabUwklxRgqoBGcalLsglyl95Zw1yhaE3e52aY7jJ13jPqWIIx3i77yffKJxpNk273X7QtMhzX64IucBv5Nf_uuCvN3fvdYPrHla50kNi2DEzHToTbVFyUEHhyBlp7Z9QIE26E5mdE563WuZXejRgEOPSkBQ3FYQQC7I6tgbvfebnykOOB02x6PyD8-cRN0</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Boosting Gaussian mixtures in an LVCSR system</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Zweig, G. ; Padmanabhan, M.</creator><creatorcontrib>Zweig, G. ; Padmanabhan, M.</creatorcontrib><description>In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780362932</identifier><identifier>ISBN: 0780362934</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.2000.861945</identifier><language>eng</language><publisher>IEEE</publisher><subject>Acoustic applications ; Acoustic testing ; Boosting ; Error analysis ; Large-scale systems ; Neural networks ; Probability distribution ; Speech recognition ; System testing ; Voice mail</subject><ispartof>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.3, p.1527-1530 vol.3</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/861945$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/861945$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Zweig, G.</creatorcontrib><creatorcontrib>Padmanabhan, M.</creatorcontrib><title>Boosting Gaussian mixtures in an LVCSR system</title><title>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</title><addtitle>ICASSP</addtitle><description>In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.</description><subject>Acoustic applications</subject><subject>Acoustic testing</subject><subject>Boosting</subject><subject>Error analysis</subject><subject>Large-scale systems</subject><subject>Neural networks</subject><subject>Probability distribution</subject><subject>Speech recognition</subject><subject>System testing</subject><subject>Voice mail</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780362932</isbn><isbn>0780362934</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2000</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotT9tKxDAUDF7Auu4P7FN-IDUnSXN51KKrUFCsim_LaTeRiO1K0wX37w2szMAwDzPMELICXgJwd_1Y37Ttcyk456XV4FR1QgohjWPg-McpWTpjeabUwklxRgqoBGcalLsglyl95Zw1yhaE3e52aY7jJ13jPqWIIx3i77yffKJxpNk273X7QtMhzX64IucBv5Nf_uuCvN3fvdYPrHla50kNi2DEzHToTbVFyUEHhyBlp7Z9QIE26E5mdE563WuZXejRgEOPSkBQ3FYQQC7I6tgbvfebnykOOB02x6PyD8-cRN0</recordid><startdate>2000</startdate><enddate>2000</enddate><creator>Zweig, G.</creator><creator>Padmanabhan, M.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2000</creationdate><title>Boosting Gaussian mixtures in an LVCSR system</title><author>Zweig, G. ; Padmanabhan, M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i172t-6fc75da3016f9a133b4dcfa2a8f6b3b3bb93e6c636b3fca719aea421f40851f13</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Acoustic applications</topic><topic>Acoustic testing</topic><topic>Boosting</topic><topic>Error analysis</topic><topic>Large-scale systems</topic><topic>Neural networks</topic><topic>Probability distribution</topic><topic>Speech recognition</topic><topic>System testing</topic><topic>Voice mail</topic><toplevel>online_resources</toplevel><creatorcontrib>Zweig, G.</creatorcontrib><creatorcontrib>Padmanabhan, M.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zweig, G.</au><au>Padmanabhan, M.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Boosting Gaussian mixtures in an LVCSR system</atitle><btitle>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</btitle><stitle>ICASSP</stitle><date>2000</date><risdate>2000</risdate><volume>3</volume><spage>1527</spage><epage>1530 vol.3</epage><pages>1527-1530 vol.3</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780362932</isbn><isbn>0780362934</isbn><abstract>In this paper, we apply boosting to the problem of frame-level phone classification, and use the resulting system to perform voicemail transcription. We develop parallel, hierarchical, and restricted versions of the classic AdaBoost algorithm, which enable the technique to be used in large-scale speech recognition tasks with hundreds of thousands of Gaussians and tens of millions of training frames. We report small but consistent improvements in both frame recognition accuracy and word error rate.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2000.861945</doi></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.3, p.1527-1530 vol.3
issn	1520-6149 2379-190X
language	eng
recordid	cdi_ieee_primary_861945
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Acoustic applications Acoustic testing Boosting Error analysis Large-scale systems Neural networks Probability distribution Speech recognition System testing Voice mail
title	Boosting Gaussian mixtures in an LVCSR system
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-21T12%3A17%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Boosting%20Gaussian%20mixtures%20in%20an%20LVCSR%20system&rft.btitle=2000%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing.%20Proceedings%20(Cat.%20No.00CH37100)&rft.au=Zweig,%20G.&rft.date=2000&rft.volume=3&rft.spage=1527&rft.epage=1530%20vol.3&rft.pages=1527-1530%20vol.3&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780362932&rft.isbn_list=0780362934&rft_id=info:doi/10.1109/ICASSP.2000.861945&rft_dat=%3Cieee_6IE%3E861945%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=861945&rfr_iscdi=true