Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging
In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar PO...
Gespeichert in:
Veröffentlicht in: | International journal of computer applications 2011-01, Vol.36 (1) |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | International journal of computer applications |
container_volume | 36 |
creator | Myint, Phyu Hninn Htwe, Tin Myat Thein, Ni Lar |
description | In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences. |
doi_str_mv | 10.5120/4454-6233 |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_1010915654</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1010915654</sourcerecordid><originalsourceid>FETCH-LOGICAL-p604-345a6423c936241254fea04696967a370ebf7aeb23238c0d87d6b4af8e5f5ed43</originalsourceid><addsrcrecordid>eNpdTktLAzEYDKJgqT34D4InL9EkXx6boxRtlfoAey9ft8m6ZXdTk92D_npX9CDOHGYYhmEIORf8SgvJr5XSihkJcEQm3FnNiqKwx3_8KZnlvOcjwEnj1IQ8PMXUYlN_Yl_HjsZAHz-wazHRRcK2HdMSGzrH3lcx1T7TEBN9wdSzGNjrwfvyja6xququOiMnAZvsZ786Jeu72_V8yVbPi_v5zYodDFcMlEajJJQOjFRCahU8cmXcSItgud8Gi34rQUJR8l1hd2arMBReB-13Cqbk8mf2kOL74HO_aetc-qbBzschbwQX3Alt9Hf14l91H4fUjec2TgBY0NLAF8HrWpQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>913373526</pqid></control><display><type>article</type><title>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</title><source>EZB Electronic Journals Library</source><creator>Myint, Phyu Hninn ; Htwe, Tin Myat ; Thein, Ni Lar</creator><creatorcontrib>Myint, Phyu Hninn ; Htwe, Tin Myat ; Thein, Ni Lar</creatorcontrib><description>In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.</description><identifier>ISSN: 0975-8887</identifier><identifier>EISSN: 0975-8887</identifier><identifier>DOI: 10.5120/4454-6233</identifier><language>eng</language><publisher>New York: Foundation of Computer Science</publisher><subject>Categories ; Machine translation ; Marking ; Roots ; Sentences ; Tags ; Texts</subject><ispartof>International journal of computer applications, 2011-01, Vol.36 (1)</ispartof><rights>Copyright Foundation of Computer Science 2011</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Myint, Phyu Hninn</creatorcontrib><creatorcontrib>Htwe, Tin Myat</creatorcontrib><creatorcontrib>Thein, Ni Lar</creatorcontrib><title>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</title><title>International journal of computer applications</title><description>In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.</description><subject>Categories</subject><subject>Machine translation</subject><subject>Marking</subject><subject>Roots</subject><subject>Sentences</subject><subject>Tags</subject><subject>Texts</subject><issn>0975-8887</issn><issn>0975-8887</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><recordid>eNpdTktLAzEYDKJgqT34D4InL9EkXx6boxRtlfoAey9ft8m6ZXdTk92D_npX9CDOHGYYhmEIORf8SgvJr5XSihkJcEQm3FnNiqKwx3_8KZnlvOcjwEnj1IQ8PMXUYlN_Yl_HjsZAHz-wazHRRcK2HdMSGzrH3lcx1T7TEBN9wdSzGNjrwfvyja6xququOiMnAZvsZ786Jeu72_V8yVbPi_v5zYodDFcMlEajJJQOjFRCahU8cmXcSItgud8Gi34rQUJR8l1hd2arMBReB-13Cqbk8mf2kOL74HO_aetc-qbBzschbwQX3Alt9Hf14l91H4fUjec2TgBY0NLAF8HrWpQ</recordid><startdate>20110101</startdate><enddate>20110101</enddate><creator>Myint, Phyu Hninn</creator><creator>Htwe, Tin Myat</creator><creator>Thein, Ni Lar</creator><general>Foundation of Computer Science</general><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20110101</creationdate><title>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</title><author>Myint, Phyu Hninn ; Htwe, Tin Myat ; Thein, Ni Lar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p604-345a6423c936241254fea04696967a370ebf7aeb23238c0d87d6b4af8e5f5ed43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Categories</topic><topic>Machine translation</topic><topic>Marking</topic><topic>Roots</topic><topic>Sentences</topic><topic>Tags</topic><topic>Texts</topic><toplevel>online_resources</toplevel><creatorcontrib>Myint, Phyu Hninn</creatorcontrib><creatorcontrib>Htwe, Tin Myat</creatorcontrib><creatorcontrib>Thein, Ni Lar</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>International journal of computer applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Myint, Phyu Hninn</au><au>Htwe, Tin Myat</au><au>Thein, Ni Lar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging</atitle><jtitle>International journal of computer applications</jtitle><date>2011-01-01</date><risdate>2011</risdate><volume>36</volume><issue>1</issue><issn>0975-8887</issn><eissn>0975-8887</eissn><abstract>In this paper, we analyze the syntactic structure of Myanmar grammatical categories to be able to use in tagging Myanmar text with standard Part-of-Speech (POS) tags. In Myanmar lexicon, all words are annotated with basic tags and these words can be called as stem words or root words. The Myanmar POS tagged corpus creation, which has been proposed in [11], used basic POS tagging for each word. Therefore, all words in this corpus have been tagged with only basic tags as in lexicon. For standard POS tagging, normalization step is needed to form more meaningful words and annotate some words with more appropriate finer POS tags and categories. The finer tags can be called as standard POS tags and these can be used to directly concatenate with English POS tags. These tags are very useful in Myanmar to English Machine Translation System. Hence, the main aim of this study is to develop the customized lexical rules in order to deduce finer or standard POS tag from basic POS tags combinations. By analyzing Myanmar grammatical categories, 27 rules are defined to normalize them. Evaluation has been made on a basic POS tagged corpus which contains 1000 basic POS tagged sentences and it yields full satisfaction for all words in these sentences.</abstract><cop>New York</cop><pub>Foundation of Computer Science</pub><doi>10.5120/4454-6233</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0975-8887 |
ispartof | International journal of computer applications, 2011-01, Vol.36 (1) |
issn | 0975-8887 0975-8887 |
language | eng |
recordid | cdi_proquest_miscellaneous_1010915654 |
source | EZB Electronic Journals Library |
subjects | Categories Machine translation Marking Roots Sentences Tags Texts |
title | Normalization of Myanmar Grammatical Categories for Part-of-Speech Tagging |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T17%3A45%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Normalization%20of%20Myanmar%20Grammatical%20Categories%20for%20Part-of-Speech%20Tagging&rft.jtitle=International%20journal%20of%20computer%20applications&rft.au=Myint,%20Phyu%20Hninn&rft.date=2011-01-01&rft.volume=36&rft.issue=1&rft.issn=0975-8887&rft.eissn=0975-8887&rft_id=info:doi/10.5120/4454-6233&rft_dat=%3Cproquest%3E1010915654%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=913373526&rft_id=info:pmid/&rfr_iscdi=true |