HaSpeeDe 2 Dataset

The HaSpeeDe2 dataset collects 8,012 tweets and 500 news headlines annotated for the presence of hate speech, stereotypes and nominal utterance. The dataset has been used in the context of the HaSpeeDe task (http://www.di.unito.it/~tutreeb/haspeede-evalita20/index.html), organized as part of the EVA...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Frenda, Simona, Russo, Irene, Stranisci, Marco, Comandini, Gloria, Caselli, Tommaso, Di Nuovo, Elisa, Patti, Viviana, Bosco, Cristina, Sanguinetti, Manuela
Format:	Dataset
Sprache:	eng
Schlagworte:	facebook comments hate speech Italian language news headlines nominal utterance sentiment analysis social media language stereotypes tweets
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Frenda, Simona Russo, Irene Stranisci, Marco Comandini, Gloria Caselli, Tommaso Di Nuovo, Elisa Patti, Viviana Bosco, Cristina Sanguinetti, Manuela
description	The HaSpeeDe2 dataset collects 8,012 tweets and 500 news headlines annotated for the presence of hate speech, stereotypes and nominal utterance. The dataset has been used in the context of the HaSpeeDe task (http://www.di.unito.it/~tutreeb/haspeede-evalita20/index.html), organized as part of the EVALITA 2020 evaluation campaign (http://www.evalita.it/2020). In order to meet the GDPR requirements, texts have been pseudonymized replacing all original IDs in both datasets with newly-generated ones. Mentions, emails, person names (excluded public person names), and phone numbers have been masked with, respectively, the labels MENTION, EMAIL, PERSON, PHONE, followed by a number to distinguish between different entities of the same kind within the same text.
doi_str_mv	10.57771/2ycx-wy17
format	Dataset
fullrecord	<record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_57771_2ycx_wy17</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_57771_2ycx_wy17</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_57771_2ycx_wy173</originalsourceid><addsrcrecordid>eNpjYBAyNNAzNTc3N9Q3qkyu0C2vNDTnZBDySAwuSE11SVUwUnBJLEksTi3hYWBNS8wpTuWF0twMWm6uIc4euilA-eTMktT4gqLM3MSiynhDg3iwgfEgA-NBBhqTpBgABessIA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>HaSpeeDe 2 Dataset</title><source>DataCite</source><creator>Frenda, Simona ; Russo, Irene ; Stranisci, Marco ; Comandini, Gloria ; Caselli, Tommaso ; Di Nuovo, Elisa ; Patti, Viviana ; Bosco, Cristina ; Sanguinetti, Manuela</creator><creatorcontrib>Frenda, Simona ; Russo, Irene ; Stranisci, Marco ; Comandini, Gloria ; Caselli, Tommaso ; Di Nuovo, Elisa ; Patti, Viviana ; Bosco, Cristina ; Sanguinetti, Manuela</creatorcontrib><description>The HaSpeeDe2 dataset collects 8,012 tweets and 500 news headlines annotated for the presence of hate speech, stereotypes and nominal utterance. The dataset has been used in the context of the HaSpeeDe task (http://www.di.unito.it/~tutreeb/haspeede-evalita20/index.html), organized as part of the EVALITA 2020 evaluation campaign (http://www.evalita.it/2020). In order to meet the GDPR requirements, texts have been pseudonymized replacing all original IDs in both datasets with newly-generated ones. Mentions, emails, person names (excluded public person names), and phone numbers have been masked with, respectively, the labels MENTION, EMAIL, PERSON, PHONE, followed by a number to distinguish between different entities of the same kind within the same text.</description><identifier>DOI: 10.57771/2ycx-wy17</identifier><language>eng</language><publisher>ELG</publisher><subject>facebook comments ; hate speech ; Italian language ; news headlines ; nominal utterance ; sentiment analysis ; social media language ; stereotypes ; tweets</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,1888</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.57771/2ycx-wy17$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Frenda, Simona</creatorcontrib><creatorcontrib>Russo, Irene</creatorcontrib><creatorcontrib>Stranisci, Marco</creatorcontrib><creatorcontrib>Comandini, Gloria</creatorcontrib><creatorcontrib>Caselli, Tommaso</creatorcontrib><creatorcontrib>Di Nuovo, Elisa</creatorcontrib><creatorcontrib>Patti, Viviana</creatorcontrib><creatorcontrib>Bosco, Cristina</creatorcontrib><creatorcontrib>Sanguinetti, Manuela</creatorcontrib><title>HaSpeeDe 2 Dataset</title><description>The HaSpeeDe2 dataset collects 8,012 tweets and 500 news headlines annotated for the presence of hate speech, stereotypes and nominal utterance. The dataset has been used in the context of the HaSpeeDe task (http://www.di.unito.it/~tutreeb/haspeede-evalita20/index.html), organized as part of the EVALITA 2020 evaluation campaign (http://www.evalita.it/2020). In order to meet the GDPR requirements, texts have been pseudonymized replacing all original IDs in both datasets with newly-generated ones. Mentions, emails, person names (excluded public person names), and phone numbers have been masked with, respectively, the labels MENTION, EMAIL, PERSON, PHONE, followed by a number to distinguish between different entities of the same kind within the same text.</description><subject>facebook comments</subject><subject>hate speech</subject><subject>Italian language</subject><subject>news headlines</subject><subject>nominal utterance</subject><subject>sentiment analysis</subject><subject>social media language</subject><subject>stereotypes</subject><subject>tweets</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2021</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNpjYBAyNNAzNTc3N9Q3qkyu0C2vNDTnZBDySAwuSE11SVUwUnBJLEksTi3hYWBNS8wpTuWF0twMWm6uIc4euilA-eTMktT4gqLM3MSiynhDg3iwgfEgA-NBBhqTpBgABessIA</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Frenda, Simona</creator><creator>Russo, Irene</creator><creator>Stranisci, Marco</creator><creator>Comandini, Gloria</creator><creator>Caselli, Tommaso</creator><creator>Di Nuovo, Elisa</creator><creator>Patti, Viviana</creator><creator>Bosco, Cristina</creator><creator>Sanguinetti, Manuela</creator><general>ELG</general><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>2021</creationdate><title>HaSpeeDe 2 Dataset</title><author>Frenda, Simona ; Russo, Irene ; Stranisci, Marco ; Comandini, Gloria ; Caselli, Tommaso ; Di Nuovo, Elisa ; Patti, Viviana ; Bosco, Cristina ; Sanguinetti, Manuela</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_57771_2ycx_wy173</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2021</creationdate><topic>facebook comments</topic><topic>hate speech</topic><topic>Italian language</topic><topic>news headlines</topic><topic>nominal utterance</topic><topic>sentiment analysis</topic><topic>social media language</topic><topic>stereotypes</topic><topic>tweets</topic><toplevel>online_resources</toplevel><creatorcontrib>Frenda, Simona</creatorcontrib><creatorcontrib>Russo, Irene</creatorcontrib><creatorcontrib>Stranisci, Marco</creatorcontrib><creatorcontrib>Comandini, Gloria</creatorcontrib><creatorcontrib>Caselli, Tommaso</creatorcontrib><creatorcontrib>Di Nuovo, Elisa</creatorcontrib><creatorcontrib>Patti, Viviana</creatorcontrib><creatorcontrib>Bosco, Cristina</creatorcontrib><creatorcontrib>Sanguinetti, Manuela</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Frenda, Simona</au><au>Russo, Irene</au><au>Stranisci, Marco</au><au>Comandini, Gloria</au><au>Caselli, Tommaso</au><au>Di Nuovo, Elisa</au><au>Patti, Viviana</au><au>Bosco, Cristina</au><au>Sanguinetti, Manuela</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>HaSpeeDe 2 Dataset</title><date>2021</date><risdate>2021</risdate><abstract>The HaSpeeDe2 dataset collects 8,012 tweets and 500 news headlines annotated for the presence of hate speech, stereotypes and nominal utterance. The dataset has been used in the context of the HaSpeeDe task (http://www.di.unito.it/~tutreeb/haspeede-evalita20/index.html), organized as part of the EVALITA 2020 evaluation campaign (http://www.evalita.it/2020). In order to meet the GDPR requirements, texts have been pseudonymized replacing all original IDs in both datasets with newly-generated ones. Mentions, emails, person names (excluded public person names), and phone numbers have been masked with, respectively, the labels MENTION, EMAIL, PERSON, PHONE, followed by a number to distinguish between different entities of the same kind within the same text.</abstract><pub>ELG</pub><doi>10.57771/2ycx-wy17</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.57771/2ycx-wy17
ispartof
issn
language	eng
recordid	cdi_datacite_primary_10_57771_2ycx_wy17
source	DataCite
subjects	facebook comments hate speech Italian language news headlines nominal utterance sentiment analysis social media language stereotypes tweets
title	HaSpeeDe 2 Dataset
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T04%3A23%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Frenda,%20Simona&rft.date=2021&rft_id=info:doi/10.57771/2ycx-wy17&rft_dat=%3Cdatacite_PQ8%3E10_57771_2ycx_wy17%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true