Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging

In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar PO...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of computer applications 2011-01, Vol.36 (1)
Hauptverfasser: Myint, Phyu Hninn, Htwe, Tin Myat, Thein, Ni Lar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title International journal of computer applications
container_volume 36
creator Myint, Phyu Hninn
Htwe, Tin Myat
Thein, Ni Lar
description In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.
doi_str_mv 10.5120/4454-6233
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_1010915654</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1010915654</sourcerecordid><originalsourceid>FETCH-LOGICAL-p604-345a6423c936241254fea04696967a370ebf7aeb23238c0d87d6b4af8e5f5ed43</originalsourceid><addsrcrecordid>eNpdTktLAzEYDKJgqT34D4InL9EkXx6boxRtlfoAey9ft8m6ZXdTk92D_npX9CDOHGYYhmEIORf8SgvJr5XSihkJcEQm3FnNiqKwx3_8KZnlvOcjwEnj1IQ8PMXUYlN_Yl_HjsZAHz-wazHRRcK2HdMSGzrH3lcx1T7TEBN9wdSzGNjrwfvyja6xququOiMnAZvsZ786Jeu72_V8yVbPi_v5zYodDFcMlEajJJQOjFRCahU8cmXcSItgud8Gi34rQUJR8l1hd2arMBReB-13Cqbk8mf2kOL74HO_aetc-qbBzschbwQX3Alt9Hf14l91H4fUjec2TgBY0NLAF8HrWpQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>913373526</pqid></control><display><type>article</type><title>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</title><source>EZB Electronic Journals Library</source><creator>Myint, Phyu Hninn ; Htwe, Tin Myat ; Thein, Ni Lar</creator><creatorcontrib>Myint, Phyu Hninn ; Htwe, Tin Myat ; Thein, Ni Lar</creatorcontrib><description>In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.</description><identifier>ISSN: 0975-8887</identifier><identifier>EISSN: 0975-8887</identifier><identifier>DOI: 10.5120/4454-6233</identifier><language>eng</language><publisher>New York: Foundation of Computer Science</publisher><subject>Categories ; Machine translation ; Marking ; Roots ; Sentences ; Tags ; Texts</subject><ispartof>International journal of computer applications, 2011-01, Vol.36 (1)</ispartof><rights>Copyright Foundation of Computer Science 2011</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Myint, Phyu Hninn</creatorcontrib><creatorcontrib>Htwe, Tin Myat</creatorcontrib><creatorcontrib>Thein, Ni Lar</creatorcontrib><title>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</title><title>International journal of computer applications</title><description>In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.</description><subject>Categories</subject><subject>Machine translation</subject><subject>Marking</subject><subject>Roots</subject><subject>Sentences</subject><subject>Tags</subject><subject>Texts</subject><issn>0975-8887</issn><issn>0975-8887</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><recordid>eNpdTktLAzEYDKJgqT34D4InL9EkXx6boxRtlfoAey9ft8m6ZXdTk92D_npX9CDOHGYYhmEIORf8SgvJr5XSihkJcEQm3FnNiqKwx3_8KZnlvOcjwEnj1IQ8PMXUYlN_Yl_HjsZAHz-wazHRRcK2HdMSGzrH3lcx1T7TEBN9wdSzGNjrwfvyja6xququOiMnAZvsZ786Jeu72_V8yVbPi_v5zYodDFcMlEajJJQOjFRCahU8cmXcSItgud8Gi34rQUJR8l1hd2arMBReB-13Cqbk8mf2kOL74HO_aetc-qbBzschbwQX3Alt9Hf14l91H4fUjec2TgBY0NLAF8HrWpQ</recordid><startdate>20110101</startdate><enddate>20110101</enddate><creator>Myint, Phyu Hninn</creator><creator>Htwe, Tin Myat</creator><creator>Thein, Ni Lar</creator><general>Foundation of Computer Science</general><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20110101</creationdate><title>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</title><author>Myint, Phyu Hninn ; Htwe, Tin Myat ; Thein, Ni Lar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p604-345a6423c936241254fea04696967a370ebf7aeb23238c0d87d6b4af8e5f5ed43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Categories</topic><topic>Machine translation</topic><topic>Marking</topic><topic>Roots</topic><topic>Sentences</topic><topic>Tags</topic><topic>Texts</topic><toplevel>online_resources</toplevel><creatorcontrib>Myint, Phyu Hninn</creatorcontrib><creatorcontrib>Htwe, Tin Myat</creatorcontrib><creatorcontrib>Thein, Ni Lar</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>International journal of computer applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Myint, Phyu Hninn</au><au>Htwe, Tin Myat</au><au>Thein, Ni Lar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</atitle><jtitle>International journal of computer applications</jtitle><date>2011-01-01</date><risdate>2011</risdate><volume>36</volume><issue>1</issue><issn>0975-8887</issn><eissn>0975-8887</eissn><abstract>In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.</abstract><cop>New York</cop><pub>Foundation of Computer Science</pub><doi>10.5120/4454-6233</doi></addata></record>
fulltext fulltext
identifier ISSN: 0975-8887
ispartof International journal of computer applications, 2011-01, Vol.36 (1)
issn 0975-8887
0975-8887
language eng
recordid cdi_proquest_miscellaneous_1010915654
source EZB Electronic Journals Library
subjects Categories
Machine translation
Marking
Roots
Sentences
Tags
Texts
title Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T17%3A45%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Normalization%20of%20Myanmar%20Grammatical%20Categories%20for%20Part-of-Speech%20Tagging&rft.jtitle=International%20journal%20of%20computer%20applications&rft.au=Myint,%20Phyu%20Hninn&rft.date=2011-01-01&rft.volume=36&rft.issue=1&rft.issn=0975-8887&rft.eissn=0975-8887&rft_id=info:doi/10.5120/4454-6233&rft_dat=%3Cproquest%3E1010915654%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=913373526&rft_id=info:pmid/&rfr_iscdi=true