ICPR 2024 Competition on Multilingual Claim-Span Identification

A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals...


Bibliographic Details
Main authors: Poddar, Soham; Paul, Biswajit; Basu, Moumita; Ghosh, Saptarshi
Format: Article
Language: eng
Subjects:
creator Poddar, Soham; Paul, Biswajit; Basu, Moumita; Ghosh, Saptarshi
description A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.
doi_str_mv 10.48550/arxiv.2411.19579
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_19579</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_19579</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_195793</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DO0NDW35GSw93QOCFIwMjAyUXDOzy1ILcksyczPUwAi39KcksyczLz00sQcBeecxMxc3eCCxDwFz5TUvJLMtMzkRJBKHgbWtMSc4lReKM3NIO_mGuLsoQu2Kr6gKDM3sagyHmRlPNhKY8IqABG0NUw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ICPR 2024 Competition on Multilingual Claim-Span Identification</title><source>arXiv.org</source><creator>Poddar, Soham ; Paul, Biswajit ; Basu, Moumita ; Ghosh, Saptarshi</creator><creatorcontrib>Poddar, Soham ; Paul, Biswajit ; Basu, Moumita ; Ghosh, Saptarshi</creatorcontrib><description>A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. 
This paper gives an overview of the competition, and the solutions developed by the participating teams.</description><identifier>DOI: 10.48550/arxiv.2411.19579</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2024-11</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.19579$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.19579$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Poddar, Soham</creatorcontrib><creatorcontrib>Paul, Biswajit</creatorcontrib><creatorcontrib>Basu, Moumita</creatorcontrib><creatorcontrib>Ghosh, Saptarshi</creatorcontrib><title>ICPR 2024 Competition on Multilingual Claim-Span Identification</title><description>A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. 
For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DO0NDW35GSw93QOCFIwMjAyUXDOzy1ILcksyczPUwAi39KcksyczLz00sQcBeecxMxc3eCCxDwFz5TUvJLMtMzkRJBKHgbWtMSc4lReKM3NIO_mGuLsoQu2Kr6gKDM3sagyHmRlPNhKY8IqABG0NUw</recordid><startdate>20241129</startdate><enddate>20241129</enddate><creator>Poddar, Soham</creator><creator>Paul, Biswajit</creator><creator>Basu, Moumita</creator><creator>Ghosh, Saptarshi</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241129</creationdate><title>ICPR 2024 Competition on Multilingual Claim-Span Identification</title><author>Poddar, Soham ; Paul, Biswajit ; Basu, Moumita ; Ghosh, Saptarshi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_195793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Poddar, Soham</creatorcontrib><creatorcontrib>Paul, Biswajit</creatorcontrib><creatorcontrib>Basu, Moumita</creatorcontrib><creatorcontrib>Ghosh, Saptarshi</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Poddar, Soham</au><au>Paul, Biswajit</au><au>Basu, Moumita</au><au>Ghosh, 
Saptarshi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ICPR 2024 Competition on Multilingual Claim-Span Identification</atitle><date>2024-11-29</date><risdate>2024</risdate><abstract>A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.</abstract><doi>10.48550/arxiv.2411.19579</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2411.19579
language eng
recordid cdi_arxiv_primary_2411_19579
source arXiv.org
subjects Computer Science - Computation and Language
title ICPR 2024 Competition on Multilingual Claim-Span Identification
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A05%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ICPR%202024%20Competition%20on%20Multilingual%20Claim-Span%20Identification&rft.au=Poddar,%20Soham&rft.date=2024-11-29&rft_id=info:doi/10.48550/arxiv.2411.19579&rft_dat=%3Carxiv_GOX%3E2411_19579%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true