CEHA: A Dataset of Conflict Events in the Horn of Africa
Natural Language Processing (NLP) of news articles can play an important role in understanding the dynamics and causes of violent conflict. Despite the availability of datasets categorizing various conflict events, the existing labels often do not cover all of the fine-grained violent conflict event...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Bai, Rui Lu, Di Ran, Shihao Olson, Elizabeth Lamba, Hemank Cahill, Aoife Tetreault, Joel Jaimes, Alex |
description | Natural Language Processing (NLP) of news articles can play an important role
in understanding the dynamics and causes of violent conflict. Despite the
availability of datasets categorizing various conflict events, the existing
labels often do not cover all of the fine-grained violent conflict event types
relevant to areas like the Horn of Africa. In this paper, we introduce a new
benchmark dataset Conflict Events in the Horn of Africa region (CEHA) and
propose a new task for identifying violent conflict events using online
resources with this dataset. The dataset consists of 500 English event
descriptions regarding conflict events in the Horn of Africa region with
fine-grained event-type definitions that emphasize the cause of the conflict.
This dataset categorizes the key types of conflict risk according to specific
areas required by stakeholders in the Humanitarian-Peace-Development Nexus.
Additionally, we conduct extensive experiments on two tasks supported by this
dataset: Event-relevance Classification and Event-type Classification. Our
baseline models demonstrate the challenging nature of these tasks and the
usefulness of our dataset for model evaluations in low-resource settings with
limited number of training data. |
doi_str_mv | 10.48550/arxiv.2412.13511 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2412_13511</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2412_13511</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2412_135113</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE00jM0NjU05GSwcHb1cLRScFRwSSxJLE4tUchPU3DOz0vLyUwuUXAtS80rKVbIzFMoyUhV8MgvygNJO6YVZSYn8jCwpiXmFKfyQmluBnk31xBnD12wHfEFRZm5iUWV8SC74sF2GRNWAQAKQTGA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>CEHA: A Dataset of Conflict Events in the Horn of Africa</title><source>arXiv.org</source><creator>Bai, Rui ; Lu, Di ; Ran, Shihao ; Olson, Elizabeth ; Lamba, Hemank ; Cahill, Aoife ; Tetreault, Joel ; Jaimes, Alex</creator><creatorcontrib>Bai, Rui ; Lu, Di ; Ran, Shihao ; Olson, Elizabeth ; Lamba, Hemank ; Cahill, Aoife ; Tetreault, Joel ; Jaimes, Alex</creatorcontrib><description>Natural Language Processing (NLP) of news articles can play an important role
in understanding the dynamics and causes of violent conflict. Despite the
availability of datasets categorizing various conflict events, the existing
labels often do not cover all of the fine-grained violent conflict event types
relevant to areas like the Horn of Africa. In this paper, we introduce a new
benchmark dataset Conflict Events in the Horn of Africa region (CEHA) and
propose a new task for identifying violent conflict events using online
resources with this dataset. The dataset consists of 500 English event
descriptions regarding conflict events in the Horn of Africa region with
fine-grained event-type definitions that emphasize the cause of the conflict.
This dataset categorizes the key types of conflict risk according to specific
areas required by stakeholders in the Humanitarian-Peace-Development Nexus.
Additionally, we conduct extensive experiments on two tasks supported by this
dataset: Event-relevance Classification and Event-type Classification. Our
baseline models demonstrate the challenging nature of these tasks and the
usefulness of our dataset for model evaluations in low-resource settings with
limited number of training data.</description><identifier>DOI: 10.48550/arxiv.2412.13511</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2024-12</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2412.13511$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2412.13511$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Bai, Rui</creatorcontrib><creatorcontrib>Lu, Di</creatorcontrib><creatorcontrib>Ran, Shihao</creatorcontrib><creatorcontrib>Olson, Elizabeth</creatorcontrib><creatorcontrib>Lamba, Hemank</creatorcontrib><creatorcontrib>Cahill, Aoife</creatorcontrib><creatorcontrib>Tetreault, Joel</creatorcontrib><creatorcontrib>Jaimes, Alex</creatorcontrib><title>CEHA: A Dataset of Conflict Events in the Horn of Africa</title><description>Natural Language Processing (NLP) of news articles can play an important role
in understanding the dynamics and causes of violent conflict. Despite the
availability of datasets categorizing various conflict events, the existing
labels often do not cover all of the fine-grained violent conflict event types
relevant to areas like the Horn of Africa. In this paper, we introduce a new
benchmark dataset Conflict Events in the Horn of Africa region (CEHA) and
propose a new task for identifying violent conflict events using online
resources with this dataset. The dataset consists of 500 English event
descriptions regarding conflict events in the Horn of Africa region with
fine-grained event-type definitions that emphasize the cause of the conflict.
This dataset categorizes the key types of conflict risk according to specific
areas required by stakeholders in the Humanitarian-Peace-Development Nexus.
Additionally, we conduct extensive experiments on two tasks supported by this
dataset: Event-relevance Classification and Event-type Classification. Our
baseline models demonstrate the challenging nature of these tasks and the
usefulness of our dataset for model evaluations in low-resource settings with
limited number of training data.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE00jM0NjU05GSwcHb1cLRScFRwSSxJLE4tUchPU3DOz0vLyUwuUXAtS80rKVbIzFMoyUhV8MgvygNJO6YVZSYn8jCwpiXmFKfyQmluBnk31xBnD12wHfEFRZm5iUWV8SC74sF2GRNWAQAKQTGA</recordid><startdate>20241218</startdate><enddate>20241218</enddate><creator>Bai, Rui</creator><creator>Lu, Di</creator><creator>Ran, Shihao</creator><creator>Olson, Elizabeth</creator><creator>Lamba, Hemank</creator><creator>Cahill, Aoife</creator><creator>Tetreault, Joel</creator><creator>Jaimes, Alex</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241218</creationdate><title>CEHA: A Dataset of Conflict Events in the Horn of Africa</title><author>Bai, Rui ; Lu, Di ; Ran, Shihao ; Olson, Elizabeth ; Lamba, Hemank ; Cahill, Aoife ; Tetreault, Joel ; Jaimes, Alex</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2412_135113</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Bai, Rui</creatorcontrib><creatorcontrib>Lu, Di</creatorcontrib><creatorcontrib>Ran, Shihao</creatorcontrib><creatorcontrib>Olson, Elizabeth</creatorcontrib><creatorcontrib>Lamba, Hemank</creatorcontrib><creatorcontrib>Cahill, Aoife</creatorcontrib><creatorcontrib>Tetreault, Joel</creatorcontrib><creatorcontrib>Jaimes, Alex</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bai, Rui</au><au>Lu, Di</au><au>Ran, Shihao</au><au>Olson, Elizabeth</au><au>Lamba, Hemank</au><au>Cahill, Aoife</au><au>Tetreault, Joel</au><au>Jaimes, Alex</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>CEHA: A Dataset of Conflict Events in the Horn of Africa</atitle><date>2024-12-18</date><risdate>2024</risdate><abstract>Natural Language Processing (NLP) of news articles can play an important role
in understanding the dynamics and causes of violent conflict. Despite the
availability of datasets categorizing various conflict events, the existing
labels often do not cover all of the fine-grained violent conflict event types
relevant to areas like the Horn of Africa. In this paper, we introduce a new
benchmark dataset Conflict Events in the Horn of Africa region (CEHA) and
propose a new task for identifying violent conflict events using online
resources with this dataset. The dataset consists of 500 English event
descriptions regarding conflict events in the Horn of Africa region with
fine-grained event-type definitions that emphasize the cause of the conflict.
This dataset categorizes the key types of conflict risk according to specific
areas required by stakeholders in the Humanitarian-Peace-Development Nexus.
Additionally, we conduct extensive experiments on two tasks supported by this
dataset: Event-relevance Classification and Event-type Classification. Our
baseline models demonstrate the challenging nature of these tasks and the
usefulness of our dataset for model evaluations in low-resource settings with
limited number of training data.</abstract><doi>10.48550/arxiv.2412.13511</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2412.13511 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2412_13511 |
source | arXiv.org |
subjects | Computer Science - Computation and Language |
title | CEHA: A Dataset of Conflict Events in the Horn of Africa |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T18%3A03%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=CEHA:%20A%20Dataset%20of%20Conflict%20Events%20in%20the%20Horn%20of%20Africa&rft.au=Bai,%20Rui&rft.date=2024-12-18&rft_id=info:doi/10.48550/arxiv.2412.13511&rft_dat=%3Carxiv_GOX%3E2412_13511%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |