Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition

This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeli...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on audio, speech, and language processing speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351
Hauptverfasser:	Sak, Haşim, Saraclar, Murat, Gungor, Tunga
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Automatic speech recognition (ASR) Computational modeling Computer science control theory systems Connectionism. Neural networks disambiguation discriminative model Exact sciences and technology Information, signal and communications theory Miscellaneous morpholexical language model Morphology Pragmatics reranking Signal processing Speech Speech processing Telecommunications and information theory Transducers Vocabulary
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	2351
container_issue	8
container_start_page	2341
container_title	IEEE transactions on audio, speech, and language processing
container_volume	20
creator	Sak, Haşim Saraclar, Murat Gungor, Tunga
description	This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.
doi_str_mv	10.1109/TASL.2012.2201477
format	Article
fullrecord	<record><control><sourceid>pascalfrancis_RIE</sourceid><recordid>TN_cdi_ieee_primary_6205357</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6205357</ieee_id><sourcerecordid>26381172</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRS0EEqXwAYiNNyxTPPEj8bLiLaVComXBKpo6dmtIk8hOEfw9qVp1MzPS3Du6cwi5BjYBYPpuMZ0Xk5RBOkmHKrLshIxAyjzJdCpOjzOoc3IR4xdjgisBI_I5a0O3bmv76w3WFJuKPvhogt_4Bnv_Y2mBzWqLK0tnbWXrSF0b6GIbvn1c0-m2bzeDzNB5Z61Z03dr2lXje982l-TMYR3t1aGPycfT4-L-JSnenl_vp0ViuII-QczM0lVaCYNacgDMFBNSc62ZEw4RUJtqKRgqqxwDrqpcVpBXPOfSOsXHBPZ3TWhjDNaV3ZAew18JrNyxKXdsyh2b8sBm8NzuPR3G4W0XsDE-Ho2p4jlAlg66m73OW2uPa5UyyWXG_wEaXm5r</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><source>IEEE Electronic Library (IEL)</source><creator>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</creator><creatorcontrib>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</creatorcontrib><description>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</description><identifier>ISSN: 1558-7916</identifier><identifier>EISSN: 1558-7924</identifier><identifier>DOI: 10.1109/TASL.2012.2201477</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Artificial intelligence ; Automatic speech recognition (ASR) ; Computational modeling ; Computer science; control theory; systems ; Connectionism. Neural networks ; disambiguation ; discriminative model ; Exact sciences and technology ; Information, signal and communications theory ; Miscellaneous ; morpholexical language model ; Morphology ; Pragmatics ; reranking ; Signal processing ; Speech ; Speech processing ; Telecommunications and information theory ; Transducers ; Vocabulary</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</citedby><cites>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6205357$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,777,781,793,27905,27906,54739</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6205357$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=26381172$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Sak, Haşim</creatorcontrib><creatorcontrib>Saraclar, Murat</creatorcontrib><creatorcontrib>Gungor, Tunga</creatorcontrib><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Automatic speech recognition (ASR)</subject><subject>Computational modeling</subject><subject>Computer science; control theory; systems</subject><subject>Connectionism. Neural networks</subject><subject>disambiguation</subject><subject>discriminative model</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Miscellaneous</subject><subject>morpholexical language model</subject><subject>Morphology</subject><subject>Pragmatics</subject><subject>reranking</subject><subject>Signal processing</subject><subject>Speech</subject><subject>Speech processing</subject><subject>Telecommunications and information theory</subject><subject>Transducers</subject><subject>Vocabulary</subject><issn>1558-7916</issn><issn>1558-7924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRS0EEqXwAYiNNyxTPPEj8bLiLaVComXBKpo6dmtIk8hOEfw9qVp1MzPS3Du6cwi5BjYBYPpuMZ0Xk5RBOkmHKrLshIxAyjzJdCpOjzOoc3IR4xdjgisBI_I5a0O3bmv76w3WFJuKPvhogt_4Bnv_Y2mBzWqLK0tnbWXrSF0b6GIbvn1c0-m2bzeDzNB5Z61Z03dr2lXje982l-TMYR3t1aGPycfT4-L-JSnenl_vp0ViuII-QczM0lVaCYNacgDMFBNSc62ZEw4RUJtqKRgqqxwDrqpcVpBXPOfSOsXHBPZ3TWhjDNaV3ZAew18JrNyxKXdsyh2b8sBm8NzuPR3G4W0XsDE-Ho2p4jlAlg66m73OW2uPa5UyyWXG_wEaXm5r</recordid><startdate>20121001</startdate><enddate>20121001</enddate><creator>Sak, Haşim</creator><creator>Saraclar, Murat</creator><creator>Gungor, Tunga</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20121001</creationdate><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><author>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Automatic speech recognition (ASR)</topic><topic>Computational modeling</topic><topic>Computer science; control theory; systems</topic><topic>Connectionism. Neural networks</topic><topic>disambiguation</topic><topic>discriminative model</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Miscellaneous</topic><topic>morpholexical language model</topic><topic>Morphology</topic><topic>Pragmatics</topic><topic>reranking</topic><topic>Signal processing</topic><topic>Speech</topic><topic>Speech processing</topic><topic>Telecommunications and information theory</topic><topic>Transducers</topic><topic>Vocabulary</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sak, Haşim</creatorcontrib><creatorcontrib>Saraclar, Murat</creatorcontrib><creatorcontrib>Gungor, Tunga</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sak, Haşim</au><au>Saraclar, Murat</au><au>Gungor, Tunga</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2012-10-01</date><risdate>2012</risdate><volume>20</volume><issue>8</issue><spage>2341</spage><epage>2351</epage><pages>2341-2351</pages><issn>1558-7916</issn><eissn>1558-7924</eissn><coden>ITASD8</coden><abstract>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2012.2201477</doi><tpages>11</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1558-7916
ispartof	IEEE transactions on audio, speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351
issn	1558-7916 1558-7924
language	eng
recordid	cdi_ieee_primary_6205357
source	IEEE Electronic Library (IEL)
subjects	Applied sciences Artificial intelligence Automatic speech recognition (ASR) Computational modeling Computer science control theory systems Connectionism. Neural networks disambiguation discriminative model Exact sciences and technology Information, signal and communications theory Miscellaneous morpholexical language model Morphology Pragmatics reranking Signal processing Speech Speech processing Telecommunications and information theory Transducers Vocabulary
title	Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T10%3A44%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Morpholexical%20and%20Discriminative%20Language%20Models%20for%20Turkish%20Automatic%20Speech%20Recognition&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Sak,%20Ha%C5%9Fim&rft.date=2012-10-01&rft.volume=20&rft.issue=8&rft.spage=2341&rft.epage=2351&rft.pages=2341-2351&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2012.2201477&rft_dat=%3Cpascalfrancis_RIE%3E26381172%3C/pascalfrancis_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6205357&rfr_iscdi=true