An exploratory research on grammar checking of Bangla sentences using statistical language models

N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems, including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with the data sparsity problem. Kneser-N...
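The approach summarized in the abstract (score a sentence with a smoothed n-gram language model, then compare the score against a threshold) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses the textbook interpolated Kneser-Ney estimate for bigrams with a fixed discount of 0.75, and the class name `KneserNeyBigram`, the function `is_grammatical`, the probability floor, the toy English corpus, and the threshold value are all hypothetical. The record does not say how the paper computes its optimum threshold, so the threshold here is simply a parameter.

```python
import math
from collections import Counter, defaultdict


class KneserNeyBigram:
    """Interpolated Kneser-Ney bigram model with one fixed discount (assumed 0.75)."""

    def __init__(self, sentences, discount=0.75):
        self.discount = discount
        self.bigrams = Counter()
        for tokens in sentences:
            padded = ["<s>"] + tokens + ["</s>"]
            self.bigrams.update(zip(padded, padded[1:]))
        self.context_count = Counter()     # c(v): how often v occurs as a context
        self.followers = defaultdict(set)  # distinct words seen after v
        self.preceders = defaultdict(set)  # distinct words seen before w
        for (v, w), count in self.bigrams.items():
            self.context_count[v] += count
            self.followers[v].add(w)
            self.preceders[w].add(v)
        self.bigram_types = len(self.bigrams)

    def continuation(self, w):
        """Share of distinct bigram types that end in w."""
        return len(self.preceders.get(w, ())) / self.bigram_types

    def prob(self, v, w):
        """P(w | v) = discounted bigram estimate + backoff weight * continuation."""
        p_cont = self.continuation(w)
        c_v = self.context_count.get(v, 0)
        if c_v == 0:
            # Unseen context: fall back to the continuation probability alone.
            return p_cont
        discounted = max(self.bigrams.get((v, w), 0) - self.discount, 0.0) / c_v
        backoff_weight = self.discount * len(self.followers[v]) / c_v
        return discounted + backoff_weight * p_cont

    def score(self, tokens, floor=1e-12):
        """Mean log-probability per bigram; the floor (an assumption) avoids log(0)."""
        padded = ["<s>"] + tokens + ["</s>"]
        logs = [math.log(max(self.prob(v, w), floor))
                for v, w in zip(padded, padded[1:])]
        return sum(logs) / len(logs)


def is_grammatical(model, tokens, threshold):
    """Accept a sentence when its score reaches the (hypothetical) threshold."""
    return model.score(tokens) >= threshold


if __name__ == "__main__":
    # Toy placeholder corpus; a real experiment would use tokenized Bangla text.
    corpus = [["the", "cat", "sleeps"], ["the", "dog", "sleeps"], ["a", "cat", "eats"]]
    model = KneserNeyBigram(corpus)
    for sentence in (["the", "cat", "eats"], ["sleeps", "the", "cat"]):
        print(sentence, model.score(sentence), is_grammatical(model, sentence, -1.5))
```

Averaging the log-probability over the number of bigrams keeps scores comparable across sentences of different lengths, which matters when a single threshold is applied to all of them. In practice the threshold would be tuned on held-out sentences labeled as correct or incorrect.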

Bibliographic details
Published in: International journal of electrical and computer engineering (Malacca, Malacca), 2020-06, Vol.10 (3), p.3244
Main authors: Rahman, M. D. Riazur; Habib, M. D. Tarek; Rahman, M. D. Sadekur; Islam, Gazi Zahirul; Khan, M. D. Abbas Ali
Format: Article
Language: eng
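Subjects: Grammar; Language; Modelling; Natural language processing; Sentences; Smoothing; Statistical methods
Online access: Full text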
container_end_page
container_issue 3
container_start_page 3244
container_title International journal of electrical and computer engineering (Malacca, Malacca)
container_volume 10
creator Rahman, M. D. Riazur
Habib, M. D. Tarek
Rahman, M. D. Sadekur
Islam, Gazi Zahirul
Khan, M. D. Abbas Ali
description N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems, including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with the data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing techniques for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking the grammatical correctness of Bangla sentences, which showed promising results, outperforming previous methods. In this work, we proposed an improved method using a Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold, which further enhanced the results. Our experimental results show that Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking the grammatical correctness of Bangla sentences.
doi_str_mv 10.11591/ijece.v10i3.pp3244-3252
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2368789787</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2368789787</sourcerecordid><originalsourceid>FETCH-LOGICAL-c147t-1977d5d555ba3dfc2617cb64429a3fb32015d730b7f07df97584e245a01488223</originalsourceid><addsrcrecordid>eNpNkMtOwzAQRS0EElXpP1hineJn7CxLxaNSJTawthxnnKakcbATRP-etGXBbGakezSjOQhhSpaUyoI-NHtwsPympOHLvudMiIwzya7QjBGtM62Ivv4336JFSnsylc5zVsgZsqsOw0_fhmiHEI84QgIb3Q6HDtfRHg42YrcD99l0NQ4eP9qubi1O0A3QOUh4TKckDXZo0tA42-J2QkZbAz6ECtp0h268bRMs_vocfTw_va9fs-3by2a92maOCjVktFCqkpWUsrS88o7lVLkyF4IVlvuSM0JlpTgplSeq8oWSWgAT0hIqtGaMz9H9ZW8fw9cIaTD7MMZuOmkYz7XShdJqovSFcjGkFMGbPjbTk0dDiTk7NWen5uzUXJyak1P-C2LvbY0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2368789787</pqid></control><display><type>article</type><title>An exploratory research on grammar checking of Bangla sentences using statistical language models</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Rahman, M. D. Riazur ; Habib, M. D. Tarek ; Rahman, M. D. Sadekur ; Islam, Gazi Zahirul ; Khan, M. D. Abbas Ali</creator><creatorcontrib>Rahman, M. D. Riazur ; Habib, M. D. Tarek ; Rahman, M. D. Sadekur ; Islam, Gazi Zahirul ; Khan, M. D. Abbas Ali</creatorcontrib><description>N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. 
In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences.</description><identifier>ISSN: 2088-8708</identifier><identifier>EISSN: 2088-8708</identifier><identifier>DOI: 10.11591/ijece.v10i3.pp3244-3252</identifier><language>eng</language><publisher>Yogyakarta: IAES Institute of Advanced Engineering and Science</publisher><subject>Grammar ; Language ; Modelling ; Natural language processing ; Sentences ; Smoothing ; Statistical methods</subject><ispartof>International journal of electrical and computer engineering (Malacca, Malacca), 2020-06, Vol.10 (3), p.3244</ispartof><rights>Copyright IAES Institute of Advanced Engineering and Science Jun 2020</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Rahman, M. D. Riazur</creatorcontrib><creatorcontrib>Habib, M. D. Tarek</creatorcontrib><creatorcontrib>Rahman, M. D. Sadekur</creatorcontrib><creatorcontrib>Islam, Gazi Zahirul</creatorcontrib><creatorcontrib>Khan, M. D. 
Abbas Ali</creatorcontrib><title>An exploratory research on grammar checking of Bangla sentences using statistical language models</title><title>International journal of electrical and computer engineering (Malacca, Malacca)</title><description>N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. 
Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences.</description><subject>Grammar</subject><subject>Language</subject><subject>Modelling</subject><subject>Natural language processing</subject><subject>Sentences</subject><subject>Smoothing</subject><subject>Statistical methods</subject><issn>2088-8708</issn><issn>2088-8708</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNpNkMtOwzAQRS0EElXpP1hineJn7CxLxaNSJTawthxnnKakcbATRP-etGXBbGakezSjOQhhSpaUyoI-NHtwsPympOHLvudMiIwzya7QjBGtM62Ivv4336JFSnsylc5zVsgZsqsOw0_fhmiHEI84QgIb3Q6HDtfRHg42YrcD99l0NQ4eP9qubi1O0A3QOUh4TKckDXZo0tA42-J2QkZbAz6ECtp0h268bRMs_vocfTw_va9fs-3by2a92maOCjVktFCqkpWUsrS88o7lVLkyF4IVlvuSM0JlpTgplSeq8oWSWgAT0hIqtGaMz9H9ZW8fw9cIaTD7MMZuOmkYz7XShdJqovSFcjGkFMGbPjbTk0dDiTk7NWen5uzUXJyak1P-C2LvbY0</recordid><startdate>20200601</startdate><enddate>20200601</enddate><creator>Rahman, M. D. Riazur</creator><creator>Habib, M. D. Tarek</creator><creator>Rahman, M. D. Sadekur</creator><creator>Islam, Gazi Zahirul</creator><creator>Khan, M. D. 
Abbas Ali</creator><general>IAES Institute of Advanced Engineering and Science</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BVBZV</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200601</creationdate><title>An exploratory research on grammar checking of Bangla sentences using statistical language models</title><author>Rahman, M. D. Riazur ; Habib, M. D. Tarek ; Rahman, M. D. Sadekur ; Islam, Gazi Zahirul ; Khan, M. D. Abbas Ali</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c147t-1977d5d555ba3dfc2617cb64429a3fb32015d730b7f07df97584e245a01488223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Grammar</topic><topic>Language</topic><topic>Modelling</topic><topic>Natural language processing</topic><topic>Sentences</topic><topic>Smoothing</topic><topic>Statistical methods</topic><toplevel>online_resources</toplevel><creatorcontrib>Rahman, M. D. Riazur</creatorcontrib><creatorcontrib>Habib, M. D. Tarek</creatorcontrib><creatorcontrib>Rahman, M. D. Sadekur</creatorcontrib><creatorcontrib>Islam, Gazi Zahirul</creatorcontrib><creatorcontrib>Khan, M. D. 
Abbas Ali</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>East &amp; South Asia Database</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><jtitle>International journal of electrical and computer engineering (Malacca, Malacca)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rahman, M. D. Riazur</au><au>Habib, M. D. Tarek</au><au>Rahman, M. D. Sadekur</au><au>Islam, Gazi Zahirul</au><au>Khan, M. D. 
Abbas Ali</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An exploratory research on grammar checking of Bangla sentences using statistical language models</atitle><jtitle>International journal of electrical and computer engineering (Malacca, Malacca)</jtitle><date>2020-06-01</date><risdate>2020</risdate><volume>10</volume><issue>3</issue><spage>3244</spage><pages>3244-</pages><issn>2088-8708</issn><eissn>2088-8708</eissn><abstract>N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences.</abstract><cop>Yogyakarta</cop><pub>IAES Institute of Advanced Engineering and Science</pub><doi>10.11591/ijece.v10i3.pp3244-3252</doi></addata></record>
fulltext fulltext
identifier ISSN: 2088-8708
ispartof International journal of electrical and computer engineering (Malacca, Malacca), 2020-06, Vol.10 (3), p.3244
issn 2088-8708
2088-8708
language eng
recordid cdi_proquest_journals_2368789787
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Grammar
Language
Modelling
Natural language processing
Sentences
Smoothing
Statistical methods
title An exploratory research on grammar checking of Bangla sentences using statistical language models
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T12%3A50%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20exploratory%20research%20on%20grammar%20checking%20of%20Bangla%20sentences%20using%20statistical%20language%20models&rft.jtitle=International%20journal%20of%20electrical%20and%20computer%20engineering%20(Malacca,%20Malacca)&rft.au=Rahman,%20M.%20D.%20Riazur&rft.date=2020-06-01&rft.volume=10&rft.issue=3&rft.spage=3244&rft.pages=3244-&rft.issn=2088-8708&rft.eissn=2088-8708&rft_id=info:doi/10.11591/ijece.v10i3.pp3244-3252&rft_dat=%3Cproquest_cross%3E2368789787%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2368789787&rft_id=info:pmid/&rfr_iscdi=true