Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media
Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbli...
Gespeichert in:
Veröffentlicht in: | International journal of advanced computer science & applications 2021, Vol.12 (4) |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 4 |
container_start_page | |
container_title | International journal of advanced computer science & applications |
container_volume | 12 |
creator | Askar, Al-Khafaji Ali J Nur, Nilam |
description | Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65. |
doi_str_mv | 10.14569/IJACSA.2021.0120413 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2655119327</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2655119327</sourcerecordid><originalsourceid>FETCH-LOGICAL-c274t-4a9e3f041edb45431515a6d69ef4ad91a074f3c2b2a1831458f50812473617a53</originalsourceid><addsrcrecordid>eNotkE1rwzAMhs3YYKXrP9jBsHM6y5_JMWRfHR07dIXdjJvY4NLGqZ0e-u_ntdVBEuKVePUg9AhkDlzI6nnxWTerek4JhTkBSjiwGzShIGQhhCK3574sgKjfezRLaUtysIrKkk3Quu77MJrRdrgJcTgmHBz-sikMebr3pi8W0Rw8fvFmZ9sRuxDxyvaj3-eE697sTskn7Hu8Cm3W5N3Omwd058wu2dm1TtH67fWn-SiW3--Lpl4WLVV8LLipLHPZsO02XHAGAoSRnays46arwBDFHWvphhooWf62dIKUQLliEpQRbIqeLneHGA5Hm0a9DceYTSVNpRAAFaMqq_hF1caQUrROD9HvTTxpIPrMUF8Y6n-G-sqQ_QHT4GMr</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2655119327</pqid></control><display><type>article</type><title>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Askar, Al-Khafaji Ali J ; Nur, Nilam</creator><creatorcontrib>Askar, Al-Khafaji Ali J ; Nur, Nilam</creatorcontrib><description>Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.</description><identifier>ISSN: 2158-107X</identifier><identifier>EISSN: 2156-5570</identifier><identifier>DOI: 10.14569/IJACSA.2021.0120413</identifier><language>eng</language><publisher>West Yorkshire: Science and Information (SAI) Organization Limited</publisher><subject>Annotations ; Classification ; Data mining ; Digital media ; Emotions ; Sentiment analysis ; Social networks</subject><ispartof>International journal of advanced computer science & applications, 2021, Vol.12 (4)</ispartof><rights>2021. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4023,27922,27923,27924</link.rule.ids></links><search><creatorcontrib>Askar, Al-Khafaji Ali J</creatorcontrib><creatorcontrib>Nur, Nilam</creatorcontrib><title>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</title><title>International journal of advanced computer science & applications</title><description>Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.</description><subject>Annotations</subject><subject>Classification</subject><subject>Data mining</subject><subject>Digital media</subject><subject>Emotions</subject><subject>Sentiment analysis</subject><subject>Social networks</subject><issn>2158-107X</issn><issn>2156-5570</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNotkE1rwzAMhs3YYKXrP9jBsHM6y5_JMWRfHR07dIXdjJvY4NLGqZ0e-u_ntdVBEuKVePUg9AhkDlzI6nnxWTerek4JhTkBSjiwGzShIGQhhCK3574sgKjfezRLaUtysIrKkk3Quu77MJrRdrgJcTgmHBz-sikMebr3pi8W0Rw8fvFmZ9sRuxDxyvaj3-eE697sTskn7Hu8Cm3W5N3Omwd058wu2dm1TtH67fWn-SiW3--Lpl4WLVV8LLipLHPZsO02XHAGAoSRnays46arwBDFHWvphhooWf62dIKUQLliEpQRbIqeLneHGA5Hm0a9DceYTSVNpRAAFaMqq_hF1caQUrROD9HvTTxpIPrMUF8Y6n-G-sqQ_QHT4GMr</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Askar, Al-Khafaji Ali J</creator><creator>Nur, Nilam</creator><general>Science and Information (SAI) Organization Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>2021</creationdate><title>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</title><author>Askar, Al-Khafaji Ali J ; Nur, Nilam</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c274t-4a9e3f041edb45431515a6d69ef4ad91a074f3c2b2a1831458f50812473617a53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Annotations</topic><topic>Classification</topic><topic>Data mining</topic><topic>Digital media</topic><topic>Emotions</topic><topic>Sentiment analysis</topic><topic>Social networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Askar, Al-Khafaji Ali J</creatorcontrib><creatorcontrib>Nur, Nilam</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of advanced computer science & applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Askar, Al-Khafaji Ali J</au><au>Nur, Nilam</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media</atitle><jtitle>International journal of advanced computer science & applications</jtitle><date>2021</date><risdate>2021</risdate><volume>12</volume><issue>4</issue><issn>2158-107X</issn><eissn>2156-5570</eissn><abstract>Research on Sentiment Analysis in social media by using Mesopotamian-Iraqi Dialect (MID) of Arabic language was rarely found, there is no reliable dataset developed in MID neither an annotated corpus for the sentiment analysis of social media in this dialect. Therefore, this gap was the main stumbling block for researchers of sentiment analysis in MID, for this reason, this paper introduced the development of an annotated corpus of Mesopotamian-Iraqi Dialect for sentiment analysis in social media and named it as (ACMID) stands for (the annotated corpus of Mesopotamian-Iraqi Dialect) to help researchers in future for using this corpus for their studies, to the best of our knowledge this is the first annotated corpus that both classify polarity as well as emotion classification in MID. Likewise, Facebook as the most popular social platform among Iraqis was used to extract the data from its popular Iraqi pages. 5000 comments were extracted from these pages classified by its polarity (Positive, Negative, Neutral, Spam) by two Iraqi annotators, these annotators were simultaneously classifying the same comments according to Ekman seven universal emotions (Anger, Fear, Disgust, Happiness, Sadness, Surprise, Contempt) or no emotion. Cohen's kappa coefficient was then used to compare the two annotators’ results to find the reliability of these results. The data shows a comparable value among the two annotators for the polarity classification as high as 0.82, while for the emotion classification the result was 0.65.</abstract><cop>West Yorkshire</cop><pub>Science and Information (SAI) Organization Limited</pub><doi>10.14569/IJACSA.2021.0120413</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2158-107X |
ispartof | International journal of advanced computer science & applications, 2021, Vol.12 (4) |
issn | 2158-107X 2156-5570 |
language | eng |
recordid | cdi_proquest_journals_2655119327 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | Annotations Classification Data mining Digital media Emotions Sentiment analysis Social networks |
title | Annotated Corpus of Mesopotamian-Iraqi Dialect for Sentiment Analysis in Social Media |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T02%3A52%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Annotated%20Corpus%20of%20Mesopotamian-Iraqi%20Dialect%20for%20Sentiment%20Analysis%20in%20Social%20Media&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=Askar,%20Al-Khafaji%20Ali%20J&rft.date=2021&rft.volume=12&rft.issue=4&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2021.0120413&rft_dat=%3Cproquest_cross%3E2655119327%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2655119327&rft_id=info:pmid/&rfr_iscdi=true |