Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeli...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on audio, speech, and language processing speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2351 |
---|---|
container_issue | 8 |
container_start_page | 2341 |
container_title | IEEE transactions on audio, speech, and language processing |
container_volume | 20 |
creator | Sak, Haşim Saraclar, Murat Gungor, Tunga |
description | This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute. |
doi_str_mv | 10.1109/TASL.2012.2201477 |
format | Article |
fullrecord | <record><control><sourceid>pascalfrancis_RIE</sourceid><recordid>TN_cdi_ieee_primary_6205357</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6205357</ieee_id><sourcerecordid>26381172</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRS0EEqXwAYiNNyxTPPEj8bLiLaVComXBKpo6dmtIk8hOEfw9qVp1MzPS3Du6cwi5BjYBYPpuMZ0Xk5RBOkmHKrLshIxAyjzJdCpOjzOoc3IR4xdjgisBI_I5a0O3bmv76w3WFJuKPvhogt_4Bnv_Y2mBzWqLK0tnbWXrSF0b6GIbvn1c0-m2bzeDzNB5Z61Z03dr2lXje982l-TMYR3t1aGPycfT4-L-JSnenl_vp0ViuII-QczM0lVaCYNacgDMFBNSc62ZEw4RUJtqKRgqqxwDrqpcVpBXPOfSOsXHBPZ3TWhjDNaV3ZAew18JrNyxKXdsyh2b8sBm8NzuPR3G4W0XsDE-Ho2p4jlAlg66m73OW2uPa5UyyWXG_wEaXm5r</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><source>IEEE Electronic Library (IEL)</source><creator>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</creator><creatorcontrib>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</creatorcontrib><description>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</description><identifier>ISSN: 1558-7916</identifier><identifier>EISSN: 1558-7924</identifier><identifier>DOI: 10.1109/TASL.2012.2201477</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Artificial intelligence ; Automatic speech recognition (ASR) ; Computational modeling ; Computer science; control theory; systems ; Connectionism. Neural networks ; disambiguation ; discriminative model ; Exact sciences and technology ; Information, signal and communications theory ; Miscellaneous ; morpholexical language model ; Morphology ; Pragmatics ; reranking ; Signal processing ; Speech ; Speech processing ; Telecommunications and information theory ; Transducers ; Vocabulary</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</citedby><cites>FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6205357$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,777,781,793,27905,27906,54739</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6205357$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=26381172$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Sak, Haşim</creatorcontrib><creatorcontrib>Saraclar, Murat</creatorcontrib><creatorcontrib>Gungor, Tunga</creatorcontrib><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Automatic speech recognition (ASR)</subject><subject>Computational modeling</subject><subject>Computer science; control theory; systems</subject><subject>Connectionism. Neural networks</subject><subject>disambiguation</subject><subject>discriminative model</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Miscellaneous</subject><subject>morpholexical language model</subject><subject>Morphology</subject><subject>Pragmatics</subject><subject>reranking</subject><subject>Signal processing</subject><subject>Speech</subject><subject>Speech processing</subject><subject>Telecommunications and information theory</subject><subject>Transducers</subject><subject>Vocabulary</subject><issn>1558-7916</issn><issn>1558-7924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRS0EEqXwAYiNNyxTPPEj8bLiLaVComXBKpo6dmtIk8hOEfw9qVp1MzPS3Du6cwi5BjYBYPpuMZ0Xk5RBOkmHKrLshIxAyjzJdCpOjzOoc3IR4xdjgisBI_I5a0O3bmv76w3WFJuKPvhogt_4Bnv_Y2mBzWqLK0tnbWXrSF0b6GIbvn1c0-m2bzeDzNB5Z61Z03dr2lXje982l-TMYR3t1aGPycfT4-L-JSnenl_vp0ViuII-QczM0lVaCYNacgDMFBNSc62ZEw4RUJtqKRgqqxwDrqpcVpBXPOfSOsXHBPZ3TWhjDNaV3ZAew18JrNyxKXdsyh2b8sBm8NzuPR3G4W0XsDE-Ho2p4jlAlg66m73OW2uPa5UyyWXG_wEaXm5r</recordid><startdate>20121001</startdate><enddate>20121001</enddate><creator>Sak, Haşim</creator><creator>Saraclar, Murat</creator><creator>Gungor, Tunga</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20121001</creationdate><title>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</title><author>Sak, Haşim ; Saraclar, Murat ; Gungor, Tunga</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-aa7cbfd964ca95311a7604593990f4faa1a9cdb40a6e6f0136d85d18d3835ef63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Automatic speech recognition (ASR)</topic><topic>Computational modeling</topic><topic>Computer science; control theory; systems</topic><topic>Connectionism. Neural networks</topic><topic>disambiguation</topic><topic>discriminative model</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Miscellaneous</topic><topic>morpholexical language model</topic><topic>Morphology</topic><topic>Pragmatics</topic><topic>reranking</topic><topic>Signal processing</topic><topic>Speech</topic><topic>Speech processing</topic><topic>Telecommunications and information theory</topic><topic>Transducers</topic><topic>Vocabulary</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sak, Haşim</creatorcontrib><creatorcontrib>Saraclar, Murat</creatorcontrib><creatorcontrib>Gungor, Tunga</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sak, Haşim</au><au>Saraclar, Murat</au><au>Gungor, Tunga</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2012-10-01</date><risdate>2012</risdate><volume>20</volume><issue>8</issue><spage>2341</spage><epage>2351</epage><pages>2341-2351</pages><issn>1558-7916</issn><eissn>1558-7924</eissn><coden>ITASD8</coden><abstract>This paper introduces two complementary language modeling approaches for morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem and to exploit morphology as a knowledge source. The first model, morpholexical language model, is a generative n -gram model, where modeling units are lexical-grammatical morphemes instead of commonly used words or statistical sub-words. This paper also proposes a novel approach for integrating the morphology into an automatic speech recognition (ASR) system in the finite-state transducer framework as a knowledge source. We accomplish that by building a morpholexical search network obtained by the composition of lexical transducer of a computational lexicon with a morpholexical language model. The second model is a linear reranking model trained discriminatively with a variant of the perceptron algorithm using morpholexical features. This variant of the perceptron algorithm, WER-sensitive perceptron, is shown to perform better for reranking n -best candidates obtained with the generative model. We apply the proposed models in Turkish broadcast news transcription task and give experimental results. The morpholexical model leads to an elegant morphology-integrated search network with unlimited vocabulary. Thus, it is highly effective in alleviating OOV problem and improves the word error rate (WER) over word and statistical sub-word models by 1.8% and 0.4% absolute, respectively. The discriminatively trained morpholexical model further improves the WER of the system by 0.8% absolute.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2012.2201477</doi><tpages>11</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1558-7916 |
ispartof | IEEE transactions on audio, speech, and language processing, 2012-10, Vol.20 (8), p.2341-2351 |
issn | 1558-7916 1558-7924 |
language | eng |
recordid | cdi_ieee_primary_6205357 |
source | IEEE Electronic Library (IEL) |
subjects | Applied sciences Artificial intelligence Automatic speech recognition (ASR) Computational modeling Computer science control theory systems Connectionism. Neural networks disambiguation discriminative model Exact sciences and technology Information, signal and communications theory Miscellaneous morpholexical language model Morphology Pragmatics reranking Signal processing Speech Speech processing Telecommunications and information theory Transducers Vocabulary |
title | Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T10%3A44%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Morpholexical%20and%20Discriminative%20Language%20Models%20for%20Turkish%20Automatic%20Speech%20Recognition&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Sak,%20Ha%C5%9Fim&rft.date=2012-10-01&rft.volume=20&rft.issue=8&rft.spage=2341&rft.epage=2351&rft.pages=2341-2351&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2012.2201477&rft_dat=%3Cpascalfrancis_RIE%3E26381172%3C/pascalfrancis_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6205357&rfr_iscdi=true |