Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance

Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical tec...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Alnawas, Anwar, ARICI, Nursal
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Alnawas, Anwar
ARICI, Nursal
description Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical techniques of English sentiment analysis cannot be used for Arabic. Word embedding technique can be considered as one of successful methods to gaping the morphological problem of Arabic. Many works have been done for Arabic sentiment analysis based on word embedding, but there is no study focused on variable parameters. This study will discuss three parameters (Window size, Dimension of vector and Negative Sample) for Arabic sentiment analysis using DBOW and DMPV architectures. A large corpus of previous works generated to learn word representations and extract features. Four binary classifiers (Logistic Regression, Decision Tree, Support Vector Machine and Naive Bayes) are used to detect sentiment. The performance of classifiers evaluated based on; Precision, Recall and F1-score.
doi_str_mv 10.48550/arxiv.2101.02906
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2101_02906</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2101_02906</sourcerecordid><originalsourceid>FETCH-LOGICAL-a676-4ef56d1c08b9d7a72d1ea71b0c6dd1adb4cf8ba52c97bb1060304c7062bbd8c3</originalsourceid><addsrcrecordid>eNotz81KxDAUhuFsXMjoBbgyN9B60p-kXZah_sCAAzPqspyTnEigTSUt4ty9Orr53t0HjxA3CvKqqWu4w_QVPvNCgcqhaEFfipfee7arnL18m5OT_UTsXIjv8hVTQBpZ7jHhxCunRc5RdgkpWHnguIbpZ2QXcTwtYZF7Tn5OE0bLV-LC47jw9X834nDfH7eP2e754Wnb7TLURmcV-1o7ZaGh1hk0hVOMRhFY7ZxCR5X1DWFd2NYQKdBQQmUN6ILINbbciNu_1zNr-EhhwnQafnnDmVd-A6ClS6k</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance</title><source>arXiv.org</source><creator>Alnawas, Anwar ; ARICI, Nursal</creator><creatorcontrib>Alnawas, Anwar ; ARICI, Nursal</creatorcontrib><description>Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical techniques of English sentiment analysis cannot be used for Arabic. Word embedding technique can be considered as one of successful methods to gaping the morphological problem of Arabic. Many works have been done for Arabic sentiment analysis based on word embedding, but there is no study focused on variable parameters. This study will discuss three parameters (Window size, Dimension of vector and Negative Sample) for Arabic sentiment analysis using DBOW and DMPV architectures. A large corpus of previous works generated to learn word representations and extract features. Four binary classifiers (Logistic Regression, Decision Tree, Support Vector Machine and Naive Bayes) are used to detect sentiment. The performance of classifiers evaluated based on; Precision, Recall and F1-score.</description><identifier>DOI: 10.48550/arxiv.2101.02906</identifier><language>eng</language><subject>Computer Science - Computation and Language ; Computer Science - Learning</subject><creationdate>2021-01</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2101.02906$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2101.02906$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Alnawas, Anwar</creatorcontrib><creatorcontrib>ARICI, Nursal</creatorcontrib><title>Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance</title><description>Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical techniques of English sentiment analysis cannot be used for Arabic. Word embedding technique can be considered as one of successful methods to gaping the morphological problem of Arabic. Many works have been done for Arabic sentiment analysis based on word embedding, but there is no study focused on variable parameters. This study will discuss three parameters (Window size, Dimension of vector and Negative Sample) for Arabic sentiment analysis using DBOW and DMPV architectures. A large corpus of previous works generated to learn word representations and extract features. Four binary classifiers (Logistic Regression, Decision Tree, Support Vector Machine and Naive Bayes) are used to detect sentiment. The performance of classifiers evaluated based on; Precision, Recall and F1-score.</description><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz81KxDAUhuFsXMjoBbgyN9B60p-kXZah_sCAAzPqspyTnEigTSUt4ty9Orr53t0HjxA3CvKqqWu4w_QVPvNCgcqhaEFfipfee7arnL18m5OT_UTsXIjv8hVTQBpZ7jHhxCunRc5RdgkpWHnguIbpZ2QXcTwtYZF7Tn5OE0bLV-LC47jw9X834nDfH7eP2e754Wnb7TLURmcV-1o7ZaGh1hk0hVOMRhFY7ZxCR5X1DWFd2NYQKdBQQmUN6ILINbbciNu_1zNr-EhhwnQafnnDmVd-A6ClS6k</recordid><startdate>20210108</startdate><enddate>20210108</enddate><creator>Alnawas, Anwar</creator><creator>ARICI, Nursal</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210108</creationdate><title>Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance</title><author>Alnawas, Anwar ; ARICI, Nursal</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a676-4ef56d1c08b9d7a72d1ea71b0c6dd1adb4cf8ba52c97bb1060304c7062bbd8c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Alnawas, Anwar</creatorcontrib><creatorcontrib>ARICI, Nursal</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Alnawas, Anwar</au><au>ARICI, Nursal</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance</atitle><date>2021-01-08</date><risdate>2021</risdate><abstract>Social media such as Twitter, Facebook, etc. has led to a generated growing number of comments that contains users opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic language is a rich morphological language; thus, classical techniques of English sentiment analysis cannot be used for Arabic. Word embedding technique can be considered as one of successful methods to gaping the morphological problem of Arabic. Many works have been done for Arabic sentiment analysis based on word embedding, but there is no study focused on variable parameters. This study will discuss three parameters (Window size, Dimension of vector and Negative Sample) for Arabic sentiment analysis using DBOW and DMPV architectures. A large corpus of previous works generated to learn word representations and extract features. Four binary classifiers (Logistic Regression, Decision Tree, Support Vector Machine and Naive Bayes) are used to detect sentiment. The performance of classifiers evaluated based on; Precision, Recall and F1-score.</abstract><doi>10.48550/arxiv.2101.02906</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2101.02906
ispartof
issn
language eng
recordid cdi_arxiv_primary_2101_02906
source arXiv.org
subjects Computer Science - Computation and Language
Computer Science - Learning
title Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T17%3A15%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Effect%20of%20Word%20Embedding%20Variable%20Parameters%20on%20Arabic%20Sentiment%20Analysis%20Performance&rft.au=Alnawas,%20Anwar&rft.date=2021-01-08&rft_id=info:doi/10.48550/arxiv.2101.02906&rft_dat=%3Carxiv_GOX%3E2101_02906%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true