Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification

Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s. Multiple datasets were developed, and yearly shared tasks have been running since 2018. However, ADI systems are reported to fail in distinguishing between the micro-dialects...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Keleg, Amr, Magdy, Walid
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computation and Language
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Keleg, Amr Magdy, Walid
description	Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s. Multiple datasets were developed, and yearly shared tasks have been running since 2018. However, ADI systems are reported to fail in distinguishing between the micro-dialects of Arabic. We argue that the currently adopted framing of the ADI task as a single-label classification problem is one of the main reasons for that. We highlight the limitation of the incompleteness of the Dialect labels and demonstrate how it impacts the evaluation of ADI systems. A manual error analysis for the predictions of an ADI, performed by 7 native speakers of different Arabic dialects, revealed that $\approx$ 66% of the validated errors are not true errors. Consequently, we propose framing ADI as a multi-label classification task and give recommendations for designing new ADI datasets.
doi_str_mv	10.48550/arxiv.2310.13661
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2310_13661</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2310_13661</sourcerecordid><originalsourceid>FETCH-LOGICAL-a671-b1c44f7d5494af49cace09a11259adf6d672aff40f418a3407026b06aa6c1a7d3</originalsourceid><addsrcrecordid>eNo9j8tOwzAURL1hgQofwAr_QIod39gNuyq8KkVi0S7YRTd-oCu5LnJcRP8eCKirkWZ0RjqM3UixhFXTiDvMX_S5rNVPIZXW8pK9rTOOZPkDYfS28I3zqVAgi4UOiR-T85lvbT4WSqd73tOeyjxN_BD4ltJ79FXE0UfeRZymM3rFLgLGyV__54Ltnh533UvVvz5vunVfoTayGqUFCMY10AIGaC1aL1qUsm5adEE7bWoMAUQAuUIFwohaj0IjaivROLVgt3-3s9rwkWmP-TT8Kg6zovoGScFNEA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification</title><source>arXiv.org</source><creator>Keleg, Amr ; Magdy, Walid</creator><creatorcontrib>Keleg, Amr ; Magdy, Walid</creatorcontrib><description>Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s. Multiple datasets were developed, and yearly shared tasks have been running since 2018. However, ADI systems are reported to fail in distinguishing between the micro-dialects of Arabic. We argue that the currently adopted framing of the ADI task as a single-label classification problem is one of the main reasons for that. We highlight the limitation of the incompleteness of the Dialect labels and demonstrate how it impacts the evaluation of ADI systems. A manual error analysis for the predictions of an ADI, performed by 7 native speakers of different Arabic dialects, revealed that $\approx$ 66% of the validated errors are not true errors. Consequently, we propose framing ADI as a multi-label classification task and give recommendations for designing new ADI datasets.</description><identifier>DOI: 10.48550/arxiv.2310.13661</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2023-10</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2310.13661$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2310.13661$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Keleg, Amr</creatorcontrib><creatorcontrib>Magdy, Walid</creatorcontrib><title>Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification</title><description>Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s. Multiple datasets were developed, and yearly shared tasks have been running since 2018. However, ADI systems are reported to fail in distinguishing between the micro-dialects of Arabic. We argue that the currently adopted framing of the ADI task as a single-label classification problem is one of the main reasons for that. We highlight the limitation of the incompleteness of the Dialect labels and demonstrate how it impacts the evaluation of ADI systems. A manual error analysis for the predictions of an ADI, performed by 7 native speakers of different Arabic dialects, revealed that $\approx$ 66% of the validated errors are not true errors. Consequently, we propose framing ADI as a multi-label classification task and give recommendations for designing new ADI datasets.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNo9j8tOwzAURL1hgQofwAr_QIod39gNuyq8KkVi0S7YRTd-oCu5LnJcRP8eCKirkWZ0RjqM3UixhFXTiDvMX_S5rNVPIZXW8pK9rTOOZPkDYfS28I3zqVAgi4UOiR-T85lvbT4WSqd73tOeyjxN_BD4ltJ79FXE0UfeRZymM3rFLgLGyV__54Ltnh533UvVvz5vunVfoTayGqUFCMY10AIGaC1aL1qUsm5adEE7bWoMAUQAuUIFwohaj0IjaivROLVgt3-3s9rwkWmP-TT8Kg6zovoGScFNEA</recordid><startdate>20231020</startdate><enddate>20231020</enddate><creator>Keleg, Amr</creator><creator>Magdy, Walid</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231020</creationdate><title>Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification</title><author>Keleg, Amr ; Magdy, Walid</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a671-b1c44f7d5494af49cace09a11259adf6d672aff40f418a3407026b06aa6c1a7d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Keleg, Amr</creatorcontrib><creatorcontrib>Magdy, Walid</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Keleg, Amr</au><au>Magdy, Walid</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification</atitle><date>2023-10-20</date><risdate>2023</risdate><abstract>Automatic Arabic Dialect Identification (ADI) of text has gained great popularity since it was introduced in the early 2010s. Multiple datasets were developed, and yearly shared tasks have been running since 2018. However, ADI systems are reported to fail in distinguishing between the micro-dialects of Arabic. We argue that the currently adopted framing of the ADI task as a single-label classification problem is one of the main reasons for that. We highlight the limitation of the incompleteness of the Dialect labels and demonstrate how it impacts the evaluation of ADI systems. A manual error analysis for the predictions of an ADI, performed by 7 native speakers of different Arabic dialects, revealed that $\approx$ 66% of the validated errors are not true errors. Consequently, we propose framing ADI as a multi-label classification task and give recommendations for designing new ADI datasets.</abstract><doi>10.48550/arxiv.2310.13661</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2310.13661
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2310_13661
source	arXiv.org
subjects	Computer Science - Computation and Language
title	Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T07%3A23%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Arabic%20Dialect%20Identification%20under%20Scrutiny:%20Limitations%20of%20Single-label%20Classification&rft.au=Keleg,%20Amr&rft.date=2023-10-20&rft_id=info:doi/10.48550/arxiv.2310.13661&rft_dat=%3Carxiv_GOX%3E2310_13661%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true