An exploratory research on grammar checking of Bangla sentences using statistical language models
N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-N...
Gespeichert in:
Veröffentlicht in: | International journal of electrical and computer engineering (Malacca, Malacca) Malacca), 2020-06, Vol.10 (3), p.3244 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 3 |
container_start_page | 3244 |
container_title | International journal of electrical and computer engineering (Malacca, Malacca) |
container_volume | 10 |
creator | Rahman, M. D. Riazur Habib, M. D. Tarek Rahman, M. D. Sadekur Islam, Gazi Zahirul Khan, M. D. Abbas Ali |
description | N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences. |
doi_str_mv | 10.11591/ijece.v10i3.pp3244-3252 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2368789787</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2368789787</sourcerecordid><originalsourceid>FETCH-LOGICAL-c147t-1977d5d555ba3dfc2617cb64429a3fb32015d730b7f07df97584e245a01488223</originalsourceid><addsrcrecordid>eNpNkMtOwzAQRS0EElXpP1hineJn7CxLxaNSJTawthxnnKakcbATRP-etGXBbGakezSjOQhhSpaUyoI-NHtwsPympOHLvudMiIwzya7QjBGtM62Ivv4336JFSnsylc5zVsgZsqsOw0_fhmiHEI84QgIb3Q6HDtfRHg42YrcD99l0NQ4eP9qubi1O0A3QOUh4TKckDXZo0tA42-J2QkZbAz6ECtp0h268bRMs_vocfTw_va9fs-3by2a92maOCjVktFCqkpWUsrS88o7lVLkyF4IVlvuSM0JlpTgplSeq8oWSWgAT0hIqtGaMz9H9ZW8fw9cIaTD7MMZuOmkYz7XShdJqovSFcjGkFMGbPjbTk0dDiTk7NWen5uzUXJyak1P-C2LvbY0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2368789787</pqid></control><display><type>article</type><title>An exploratory research on grammar checking of Bangla sentences using statistical language models</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Rahman, M. D. Riazur ; Habib, M. D. Tarek ; Rahman, M. D. Sadekur ; Islam, Gazi Zahirul ; Khan, M. D. Abbas Ali</creator><creatorcontrib>Rahman, M. D. Riazur ; Habib, M. D. Tarek ; Rahman, M. D. Sadekur ; Islam, Gazi Zahirul ; Khan, M. D. Abbas Ali</creatorcontrib><description>N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences.</description><identifier>ISSN: 2088-8708</identifier><identifier>EISSN: 2088-8708</identifier><identifier>DOI: 10.11591/ijece.v10i3.pp3244-3252</identifier><language>eng</language><publisher>Yogyakarta: IAES Institute of Advanced Engineering and Science</publisher><subject>Grammar ; Language ; Modelling ; Natural language processing ; Sentences ; Smoothing ; Statistical methods</subject><ispartof>International journal of electrical and computer engineering (Malacca, Malacca), 2020-06, Vol.10 (3), p.3244</ispartof><rights>Copyright IAES Institute of Advanced Engineering and Science Jun 2020</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Rahman, M. D. Riazur</creatorcontrib><creatorcontrib>Habib, M. D. Tarek</creatorcontrib><creatorcontrib>Rahman, M. D. Sadekur</creatorcontrib><creatorcontrib>Islam, Gazi Zahirul</creatorcontrib><creatorcontrib>Khan, M. D. Abbas Ali</creatorcontrib><title>An exploratory research on grammar checking of Bangla sentences using statistical language models</title><title>International journal of electrical and computer engineering (Malacca, Malacca)</title><description>N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences.</description><subject>Grammar</subject><subject>Language</subject><subject>Modelling</subject><subject>Natural language processing</subject><subject>Sentences</subject><subject>Smoothing</subject><subject>Statistical methods</subject><issn>2088-8708</issn><issn>2088-8708</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNpNkMtOwzAQRS0EElXpP1hineJn7CxLxaNSJTawthxnnKakcbATRP-etGXBbGakezSjOQhhSpaUyoI-NHtwsPympOHLvudMiIwzya7QjBGtM62Ivv4336JFSnsylc5zVsgZsqsOw0_fhmiHEI84QgIb3Q6HDtfRHg42YrcD99l0NQ4eP9qubi1O0A3QOUh4TKckDXZo0tA42-J2QkZbAz6ECtp0h268bRMs_vocfTw_va9fs-3by2a92maOCjVktFCqkpWUsrS88o7lVLkyF4IVlvuSM0JlpTgplSeq8oWSWgAT0hIqtGaMz9H9ZW8fw9cIaTD7MMZuOmkYz7XShdJqovSFcjGkFMGbPjbTk0dDiTk7NWen5uzUXJyak1P-C2LvbY0</recordid><startdate>20200601</startdate><enddate>20200601</enddate><creator>Rahman, M. D. Riazur</creator><creator>Habib, M. D. Tarek</creator><creator>Rahman, M. D. Sadekur</creator><creator>Islam, Gazi Zahirul</creator><creator>Khan, M. D. Abbas Ali</creator><general>IAES Institute of Advanced Engineering and Science</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BVBZV</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200601</creationdate><title>An exploratory research on grammar checking of Bangla sentences using statistical language models</title><author>Rahman, M. D. Riazur ; Habib, M. D. Tarek ; Rahman, M. D. Sadekur ; Islam, Gazi Zahirul ; Khan, M. D. Abbas Ali</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c147t-1977d5d555ba3dfc2617cb64429a3fb32015d730b7f07df97584e245a01488223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Grammar</topic><topic>Language</topic><topic>Modelling</topic><topic>Natural language processing</topic><topic>Sentences</topic><topic>Smoothing</topic><topic>Statistical methods</topic><toplevel>online_resources</toplevel><creatorcontrib>Rahman, M. D. Riazur</creatorcontrib><creatorcontrib>Habib, M. D. Tarek</creatorcontrib><creatorcontrib>Rahman, M. D. Sadekur</creatorcontrib><creatorcontrib>Islam, Gazi Zahirul</creatorcontrib><creatorcontrib>Khan, M. D. Abbas Ali</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>East & South Asia Database</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><jtitle>International journal of electrical and computer engineering (Malacca, Malacca)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rahman, M. D. Riazur</au><au>Habib, M. D. Tarek</au><au>Rahman, M. D. Sadekur</au><au>Islam, Gazi Zahirul</au><au>Khan, M. D. Abbas Ali</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An exploratory research on grammar checking of Bangla sentences using statistical language models</atitle><jtitle>International journal of electrical and computer engineering (Malacca, Malacca)</jtitle><date>2020-06-01</date><risdate>2020</risdate><volume>10</volume><issue>3</issue><spage>3244</spage><pages>3244-</pages><issn>2088-8708</issn><eissn>2088-8708</eissn><abstract>N-gram based language models are very popular and extensively used statistical methods for solving various natural language processing problems including grammar checking. Smoothing is one of the most effective techniques used in building a language model to deal with data sparsity problem. Kneser-Ney is one of the most prominently used and successful smoothing technique for language modelling. In our previous work, we presented a Witten-Bell smoothing based language modelling technique for checking grammatical correctness of Bangla sentences which showed promising results outperforming previous methods. In this work, we proposed an improved method using Kneser-Ney smoothing based n-gram language model for grammar checking and performed a comparative performance analysis between Kneser-Ney and Witten-Bell smoothing techniques for the same purpose. We also provided an improved technique for calculating the optimum threshold which further enhanced the the results. Our experimental results show that, Kneser-Ney outperforms Witten-Bell as a smoothing technique when used with n-gram LMs for checking grammatical correctness of Bangla sentences.</abstract><cop>Yogyakarta</cop><pub>IAES Institute of Advanced Engineering and Science</pub><doi>10.11591/ijece.v10i3.pp3244-3252</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2088-8708 |
ispartof | International journal of electrical and computer engineering (Malacca, Malacca), 2020-06, Vol.10 (3), p.3244 |
issn | 2088-8708 2088-8708 |
language | eng |
recordid | cdi_proquest_journals_2368789787 |
source | Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals |
subjects | Grammar Language Modelling Natural language processing Sentences Smoothing Statistical methods |
title | An exploratory research on grammar checking of Bangla sentences using statistical language models |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T12%3A50%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20exploratory%20research%20on%20grammar%20checking%20of%20Bangla%20sentences%20using%20statistical%20language%20models&rft.jtitle=International%20journal%20of%20electrical%20and%20computer%20engineering%20(Malacca,%20Malacca)&rft.au=Rahman,%20M.%20D.%20Riazur&rft.date=2020-06-01&rft.volume=10&rft.issue=3&rft.spage=3244&rft.pages=3244-&rft.issn=2088-8708&rft.eissn=2088-8708&rft_id=info:doi/10.11591/ijece.v10i3.pp3244-3252&rft_dat=%3Cproquest_cross%3E2368789787%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2368789787&rft_id=info:pmid/&rfr_iscdi=true |