Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition

This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeli...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on audio, speech, and language processing speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351
Hauptverfasser: Sak, Haşim, Saraclar, Murat, Gungor, Tunga
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2351
container_issue 8
container_start_page 2341
container_title IEEE transactions on audio, speech, and language processing
container_volume 20
creator Sak, Haşim
Saraclar, Murat
Gungor, Tunga
description This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.
doi_str_mv 10.1109/TASL.2012.2201477
format Article
fullrecord <record><control><sourceid>pascalfrancis_RIE</sourceid><recordid>TN_cdi_ieee_primary_6205357</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6205357</ieee_id><sourcerecordid>26381172</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRS0EEqXwAYiNNyxTPPEj8bLiLaVComXBKpo6dmtIk8hOEfw9qVp1MzPS3Du6cwi5BjYBYPpuMZ0Xk5RBOkmHKrLshIxAyjzJdCpOjzOoc3IR4xdjgisBI_I5a0O3bmv76w3WFJuKPvhogt_4Bnv_Y2mBzWqLK0tnbWXrSF0b6GIbvn1c0-m2bzeDzNB5Z61Z03dr2lXje982l-TMYR3t1aGPycfT4-L-JSnenl_vp0ViuII-QczM0lVaCYNacgDMFBNSc62ZEw4RUJtqKRgqqxwDrqpcVpBXPOfSOsXHBPZ3TWhjDNaV3ZAew18JrNyxKXdsyh2b8sBm8NzuPR3G4W0XsDE-Ho2p4jlAlg66m73OW2uPa5UyyWXG_wEaXm5r</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><source>IEEE Electronic Library (IEL)</source><creator>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</creator><creatorcontrib>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</creatorcontrib><description>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</description><identifier>ISSN: 1558-7916</identifier><identifier>EISSN: 1558-7924</identifier><identifier>DOI: 10.1109/TASL.2012.2201477</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Artificial intelligence ; Automatic speech recognition (ASR) ; Computational modeling ; Computer science; control theory; systems ; Connectionism. Neural networks ; disambiguation ; discriminative model ; Exact sciences and technology ; Information, signal and communications theory ; Miscellaneous ; morpholexical language model ; Morphology ; Pragmatics ; reranking ; Signal processing ; Speech ; Speech processing ; Telecommunications and information theory ; Transducers ; Vocabulary</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</citedby><cites>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6205357$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,777,781,793,27905,27906,54739</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6205357$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=26381172$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Sak, Haşim</creatorcontrib><creatorcontrib>Saraclar, Murat</creatorcontrib><creatorcontrib>Gungor, Tunga</creatorcontrib><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Automatic speech recognition (ASR)</subject><subject>Computational modeling</subject><subject>Computer science; control theory; systems</subject><subject>Connectionism. Neural networks</subject><subject>disambiguation</subject><subject>discriminative model</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Miscellaneous</subject><subject>morpholexical language model</subject><subject>Morphology</subject><subject>Pragmatics</subject><subject>reranking</subject><subject>Signal processing</subject><subject>Speech</subject><subject>Speech processing</subject><subject>Telecommunications and information theory</subject><subject>Transducers</subject><subject>Vocabulary</subject><issn>1558-7916</issn><issn>1558-7924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRS0EEqXwAYiNNyxTPPEj8bLiLaVComXBKpo6dmtIk8hOEfw9qVp1MzPS3Du6cwi5BjYBYPpuMZ0Xk5RBOkmHKrLshIxAyjzJdCpOjzOoc3IR4xdjgisBI_I5a0O3bmv76w3WFJuKPvhogt_4Bnv_Y2mBzWqLK0tnbWXrSF0b6GIbvn1c0-m2bzeDzNB5Z61Z03dr2lXje982l-TMYR3t1aGPycfT4-L-JSnenl_vp0ViuII-QczM0lVaCYNacgDMFBNSc62ZEw4RUJtqKRgqqxwDrqpcVpBXPOfSOsXHBPZ3TWhjDNaV3ZAew18JrNyxKXdsyh2b8sBm8NzuPR3G4W0XsDE-Ho2p4jlAlg66m73OW2uPa5UyyWXG_wEaXm5r</recordid><startdate>20121001</startdate><enddate>20121001</enddate><creator>Sak, Haşim</creator><creator>Saraclar, Murat</creator><creator>Gungor, Tunga</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20121001</creationdate><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><author>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Automatic speech recognition (ASR)</topic><topic>Computational modeling</topic><topic>Computer science; control theory; systems</topic><topic>Connectionism. Neural networks</topic><topic>disambiguation</topic><topic>discriminative model</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Miscellaneous</topic><topic>morpholexical language model</topic><topic>Morphology</topic><topic>Pragmatics</topic><topic>reranking</topic><topic>Signal processing</topic><topic>Speech</topic><topic>Speech processing</topic><topic>Telecommunications and information theory</topic><topic>Transducers</topic><topic>Vocabulary</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sak, Haşim</creatorcontrib><creatorcontrib>Saraclar, Murat</creatorcontrib><creatorcontrib>Gungor, Tunga</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sak, Haşim</au><au>Saraclar, Murat</au><au>Gungor, Tunga</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2012-10-01</date><risdate>2012</risdate><volume>20</volume><issue>8</issue><spage>2341</spage><epage>2351</epage><pages>2341-2351</pages><issn>1558-7916</issn><eissn>1558-7924</eissn><coden>ITASD8</coden><abstract>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2012.2201477</doi><tpages>11</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1558-7916
ispartof IEEE transactions on audio, speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351
issn 1558-7916
1558-7924
language eng
recordid cdi_ieee_primary_6205357
source IEEE Electronic Library (IEL)
subjects Applied sciences
Artificial intelligence
Automatic speech recognition (ASR)
Computational modeling
Computer science
control theory
systems
Connectionism. Neural networks
disambiguation
discriminative model
Exact sciences and technology
Information, signal and communications theory
Miscellaneous
morpholexical language model
Morphology
Pragmatics
reranking
Signal processing
Speech
Speech processing
Telecommunications and information theory
Transducers
Vocabulary
title Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T10%3A44%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Morpholexical%20and%20Discriminative%20Language%20Models%20for%20Turkish%20Automatic%20Speech%20Recognition&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Sak,%20Ha%C5%9Fim&rft.date=2012-10-01&rft.volume=20&rft.issue=8&rft.spage=2341&rft.epage=2351&rft.pages=2341-2351&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2012.2201477&rft_dat=%3Cpascalfrancis_RIE%3E26381172%3C/pascalfrancis_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6205357&rfr_iscdi=true