Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media

Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbli...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of advanced computer science & applications 2021, Vol.12 (4)
Hauptverfasser: Askar, Al-Khafaji Ali J, Nur, Nilam
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 4
container_start_page
container_title International journal of advanced computer science & applications
container_volume 12
creator Askar, Al-Khafaji Ali J
Nur, Nilam
description Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.
doi_str_mv 10.14569/IJACSA.2021.0120413
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2655119327</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2655119327</sourcerecordid><originalsourceid>FETCH-LOGICAL-c274t-4a9e3f041edb45431515a6d69ef4ad91a074f3c2b2a1831458f50812473617a53</originalsourceid><addsrcrecordid>eNotkE1rwzAMhs3YYKXrP9jBsHM6y5_JMWRfHR07dIXdjJvY4NLGqZ0e-u_ntdVBEuKVePUg9AhkDlzI6nnxWTerek4JhTkBSjiwGzShIGQhhCK3574sgKjfezRLaUtysIrKkk3Quu77MJrRdrgJcTgmHBz-sikMebr3pi8W0Rw8fvFmZ9sRuxDxyvaj3-eE697sTskn7Hu8Cm3W5N3Omwd058wu2dm1TtH67fWn-SiW3--Lpl4WLVV8LLipLHPZsO02XHAGAoSRnays46arwBDFHWvphhooWf62dIKUQLliEpQRbIqeLneHGA5Hm0a9DceYTSVNpRAAFaMqq_hF1caQUrROD9HvTTxpIPrMUF8Y6n-G-sqQ_QHT4GMr</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2655119327</pqid></control><display><type>article</type><title>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Askar, Al-Khafaji Ali J ; Nur, Nilam</creator><creatorcontrib>Askar, Al-Khafaji Ali J ; Nur, Nilam</creatorcontrib><description>Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.</description><identifier>ISSN: 2158-107X</identifier><identifier>EISSN: 2156-5570</identifier><identifier>DOI: 10.14569/IJACSA.2021.0120413</identifier><language>eng</language><publisher>West Yorkshire: Science and Information (SAI) Organization Limited</publisher><subject>Annotations ; Classification ; Data mining ; Digital media ; Emotions ; Sentiment analysis ; Social networks</subject><ispartof>International journal of advanced computer science &amp; applications, 2021, Vol.12 (4)</ispartof><rights>2021. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4023,27922,27923,27924</link.rule.ids></links><search><creatorcontrib>Askar, Al-Khafaji Ali J</creatorcontrib><creatorcontrib>Nur, Nilam</creatorcontrib><title>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</title><title>International journal of advanced computer science &amp; applications</title><description>Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.</description><subject>Annotations</subject><subject>Classification</subject><subject>Data mining</subject><subject>Digital media</subject><subject>Emotions</subject><subject>Sentiment analysis</subject><subject>Social networks</subject><issn>2158-107X</issn><issn>2156-5570</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNotkE1rwzAMhs3YYKXrP9jBsHM6y5_JMWRfHR07dIXdjJvY4NLGqZ0e-u_ntdVBEuKVePUg9AhkDlzI6nnxWTerek4JhTkBSjiwGzShIGQhhCK3574sgKjfezRLaUtysIrKkk3Quu77MJrRdrgJcTgmHBz-sikMebr3pi8W0Rw8fvFmZ9sRuxDxyvaj3-eE697sTskn7Hu8Cm3W5N3Omwd058wu2dm1TtH67fWn-SiW3--Lpl4WLVV8LLipLHPZsO02XHAGAoSRnays46arwBDFHWvphhooWf62dIKUQLliEpQRbIqeLneHGA5Hm0a9DceYTSVNpRAAFaMqq_hF1caQUrROD9HvTTxpIPrMUF8Y6n-G-sqQ_QHT4GMr</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Askar, Al-Khafaji Ali J</creator><creator>Nur, Nilam</creator><general>Science and Information (SAI) Organization Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>2021</creationdate><title>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</title><author>Askar, Al-Khafaji Ali J ; Nur, Nilam</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c274t-4a9e3f041edb45431515a6d69ef4ad91a074f3c2b2a1831458f50812473617a53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Annotations</topic><topic>Classification</topic><topic>Data mining</topic><topic>Digital media</topic><topic>Emotions</topic><topic>Sentiment analysis</topic><topic>Social networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Askar, Al-Khafaji Ali J</creatorcontrib><creatorcontrib>Nur, Nilam</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of advanced computer science &amp; applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Askar, Al-Khafaji Ali J</au><au>Nur, Nilam</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</atitle><jtitle>International journal of advanced computer science &amp; applications</jtitle><date>2021</date><risdate>2021</risdate><volume>12</volume><issue>4</issue><issn>2158-107X</issn><eissn>2156-5570</eissn><abstract>Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.</abstract><cop>West Yorkshire</cop><pub>Science and Information (SAI) Organization Limited</pub><doi>10.14569/IJACSA.2021.0120413</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2158-107X
ispartof International journal of advanced computer science & applications, 2021, Vol.12 (4)
issn 2158-107X
2156-5570
language eng
recordid cdi_proquest_journals_2655119327
source EZB-FREE-00999 freely available EZB journals
subjects Annotations
Classification
Data mining
Digital media
Emotions
Sentiment analysis
Social networks
title Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T02%3A52%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Annotated%20Corpus%20of%20Mesopotamian-Iraqi%20Dialect%20for%20Sentiment%20Analysis%20in%20Social%20Media&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=Askar,%20Al-Khafaji%20Ali%20J&rft.date=2021&rft.volume=12&rft.issue=4&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2021.0120413&rft_dat=%3Cproquest_cross%3E2655119327%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2655119327&rft_id=info:pmid/&rfr_iscdi=true