ICPR 2024 Competition on Multilingual Claim-Span Identification

A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Poddar, Soham, Paul, Biswajit, Basu, Moumita, Ghosh, Saptarshi
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computation and Language
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Poddar, Soham Paul, Biswajit Basu, Moumita Ghosh, Saptarshi
description	A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.
doi_str_mv	10.48550/arxiv.2411.19579
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_19579</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_19579</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_195793</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DO0NDW35GSw93QOCFIwMjAyUXDOzy1ILcksyczPUwAi39KcksyczLz00sQcBeecxMxc3eCCxDwFz5TUvJLMtMzkRJBKHgbWtMSc4lReKM3NIO_mGuLsoQu2Kr6gKDM3sagyHmRlPNhKY8IqABG0NUw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ICPR 2024 Competition on Multilingual Claim-Span Identification</title><source>arXiv.org</source><creator>Poddar, Soham ; Paul, Biswajit ; Basu, Moumita ; Ghosh, Saptarshi</creator><creatorcontrib>Poddar, Soham ; Paul, Biswajit ; Basu, Moumita ; Ghosh, Saptarshi</creatorcontrib><description>A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.</description><identifier>DOI: 10.48550/arxiv.2411.19579</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2024-11</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.19579$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.19579$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Poddar, Soham</creatorcontrib><creatorcontrib>Paul, Biswajit</creatorcontrib><creatorcontrib>Basu, Moumita</creatorcontrib><creatorcontrib>Ghosh, Saptarshi</creatorcontrib><title>ICPR 2024 Competition on Multilingual Claim-Span Identification</title><description>A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DO0NDW35GSw93QOCFIwMjAyUXDOzy1ILcksyczPUwAi39KcksyczLz00sQcBeecxMxc3eCCxDwFz5TUvJLMtMzkRJBKHgbWtMSc4lReKM3NIO_mGuLsoQu2Kr6gKDM3sagyHmRlPNhKY8IqABG0NUw</recordid><startdate>20241129</startdate><enddate>20241129</enddate><creator>Poddar, Soham</creator><creator>Paul, Biswajit</creator><creator>Basu, Moumita</creator><creator>Ghosh, Saptarshi</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241129</creationdate><title>ICPR 2024 Competition on Multilingual Claim-Span Identification</title><author>Poddar, Soham ; Paul, Biswajit ; Basu, Moumita ; Ghosh, Saptarshi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_195793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Poddar, Soham</creatorcontrib><creatorcontrib>Paul, Biswajit</creatorcontrib><creatorcontrib>Basu, Moumita</creatorcontrib><creatorcontrib>Ghosh, Saptarshi</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Poddar, Soham</au><au>Paul, Biswajit</au><au>Basu, Moumita</au><au>Ghosh, Saptarshi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ICPR 2024 Competition on Multilingual Claim-Span Identification</atitle><date>2024-11-29</date><risdate>2024</risdate><abstract>A lot of claims are made in social media posts, which may contain misinformation or fake news. Hence, it is crucial to identify claims as a first step towards claim verification. Given the huge number of social media posts, the task of identifying claims needs to be automated. This competition deals with the task of 'Claim Span Identification' in which, given a text, parts / spans that correspond to claims are to be identified. This task is more challenging than the traditional binary classification of text into claim or not-claim, and requires state-of-the-art methods in Pattern Recognition, Natural Language Processing and Machine Learning. For this competition, we used a newly developed dataset called HECSI containing about 8K posts in English and about 8K posts in Hindi with claim-spans marked by human annotators. This paper gives an overview of the competition, and the solutions developed by the participating teams.</abstract><doi>10.48550/arxiv.2411.19579</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2411.19579
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2411_19579
source	arXiv.org
subjects	Computer Science - Computation and Language
title	ICPR 2024 Competition on Multilingual Claim-Span Identification
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A05%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ICPR%202024%20Competition%20on%20Multilingual%20Claim-Span%20Identification&rft.au=Poddar,%20Soham&rft.date=2024-11-29&rft_id=info:doi/10.48550/arxiv.2411.19579&rft_dat=%3Carxiv_GOX%3E2411_19579%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true