COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements

Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly s...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zhou, Xuhui, Zhu, Hao, Yerukola, Akhila, Davidson, Thomas, Hwang, Jena D, Swayamdipta, Swabha, Sap, Maarten
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Zhou, Xuhui
Zhu, Hao
Yerukola, Akhila
Davidson, Thomas
Hwang, Jena D
Swayamdipta, Swabha
Sap, Maarten
description Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly signal an insult when uttered by a white man to a non-white colleague, but uttered by an ESL teacher to their student would be interpreted as a genuine compliment. Such contextual factors have been largely ignored by previous approaches to toxic language detection. We introduce COBRA frames, the first context-aware formalism for explaining the intents, reactions, and harms of offensive or biased statements grounded in their social and situational context. We create COBRACORPUS, a dataset of 33k potentially offensive statements paired with machine-generated contexts and free-text explanations of offensiveness, implied biases, speaker intents, and listener reactions. To study the contextual dynamics of offensiveness, we train models to generate COBRA explanations, with and without access to the context. We find that explanations by context-agnostic models are significantly worse than by context-aware ones, especially in situations where the context inverts the statement's offensiveness (29% accuracy drop). Our work highlights the importance and feasibility of contextualized NLP by modeling social factors.
doi_str_mv 10.48550/arxiv.2306.01985
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2306_01985</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2306_01985</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-c97f192548cbc6459ef8b19ddd99d8400ef9a97171a2b16a63a334b2536f18b93</originalsourceid><addsrcrecordid>eNotz09LwzAcxvFcPMj0BXjy9wZak-ZPE2-zbE4YFOru5ZcmkcKaSpON-e7V6emB7-GBDyEPjJZCS0mfcLmM57LiVJWUGS1vSde0L90atgtOPj1DM8fsL_mER-g8pjmO8QPQzqcMmxD8kBNgdLDDZUowB2h_Ykzj2cN7xuwnH3O6IzcBj8nf_--KHLabQ7Mr9u3rW7PeF6hqWQymDsxUUujBDkpI44O2zDjnjHFaUOqDQVOzmmFlmULFkXNhK8lVYNoaviKPf7dXVP-5jBMuX_0vrr_i-DdLnUlR</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</title><source>arXiv.org</source><creator>Zhou, Xuhui ; Zhu, Hao ; Yerukola, Akhila ; Davidson, Thomas ; Hwang, Jena D ; Swayamdipta, Swabha ; Sap, Maarten</creator><creatorcontrib>Zhou, Xuhui ; Zhu, Hao ; Yerukola, Akhila ; Davidson, Thomas ; Hwang, Jena D ; Swayamdipta, Swabha ; Sap, Maarten</creatorcontrib><description>Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly signal an insult when uttered by a white man to a non-white colleague, but uttered by an ESL teacher to their student would be interpreted as a genuine compliment. Such contextual factors have been largely ignored by previous approaches to toxic language detection. We introduce COBRA frames, the first context-aware formalism for explaining the intents, reactions, and harms of offensive or biased statements grounded in their social and situational context. We create COBRACORPUS, a dataset of 33k potentially offensive statements paired with machine-generated contexts and free-text explanations of offensiveness, implied biases, speaker intents, and listener reactions. To study the contextual dynamics of offensiveness, we train models to generate COBRA explanations, with and without access to the context. We find that explanations by context-agnostic models are significantly worse than by context-aware ones, especially in situations where the context inverts the statement's offensiveness (29% accuracy drop). Our work highlights the importance and feasibility of contextualized NLP by modeling social factors.</description><identifier>DOI: 10.48550/arxiv.2306.01985</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2023-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2306.01985$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2306.01985$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhou, Xuhui</creatorcontrib><creatorcontrib>Zhu, Hao</creatorcontrib><creatorcontrib>Yerukola, Akhila</creatorcontrib><creatorcontrib>Davidson, Thomas</creatorcontrib><creatorcontrib>Hwang, Jena D</creatorcontrib><creatorcontrib>Swayamdipta, Swabha</creatorcontrib><creatorcontrib>Sap, Maarten</creatorcontrib><title>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</title><description>Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly signal an insult when uttered by a white man to a non-white colleague, but uttered by an ESL teacher to their student would be interpreted as a genuine compliment. Such contextual factors have been largely ignored by previous approaches to toxic language detection. We introduce COBRA frames, the first context-aware formalism for explaining the intents, reactions, and harms of offensive or biased statements grounded in their social and situational context. We create COBRACORPUS, a dataset of 33k potentially offensive statements paired with machine-generated contexts and free-text explanations of offensiveness, implied biases, speaker intents, and listener reactions. To study the contextual dynamics of offensiveness, we train models to generate COBRA explanations, with and without access to the context. We find that explanations by context-agnostic models are significantly worse than by context-aware ones, especially in situations where the context inverts the statement's offensiveness (29% accuracy drop). Our work highlights the importance and feasibility of contextualized NLP by modeling social factors.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz09LwzAcxvFcPMj0BXjy9wZak-ZPE2-zbE4YFOru5ZcmkcKaSpON-e7V6emB7-GBDyEPjJZCS0mfcLmM57LiVJWUGS1vSde0L90atgtOPj1DM8fsL_mER-g8pjmO8QPQzqcMmxD8kBNgdLDDZUowB2h_Ykzj2cN7xuwnH3O6IzcBj8nf_--KHLabQ7Mr9u3rW7PeF6hqWQymDsxUUujBDkpI44O2zDjnjHFaUOqDQVOzmmFlmULFkXNhK8lVYNoaviKPf7dXVP-5jBMuX_0vrr_i-DdLnUlR</recordid><startdate>20230602</startdate><enddate>20230602</enddate><creator>Zhou, Xuhui</creator><creator>Zhu, Hao</creator><creator>Yerukola, Akhila</creator><creator>Davidson, Thomas</creator><creator>Hwang, Jena D</creator><creator>Swayamdipta, Swabha</creator><creator>Sap, Maarten</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230602</creationdate><title>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</title><author>Zhou, Xuhui ; Zhu, Hao ; Yerukola, Akhila ; Davidson, Thomas ; Hwang, Jena D ; Swayamdipta, Swabha ; Sap, Maarten</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-c97f192548cbc6459ef8b19ddd99d8400ef9a97171a2b16a63a334b2536f18b93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Xuhui</creatorcontrib><creatorcontrib>Zhu, Hao</creatorcontrib><creatorcontrib>Yerukola, Akhila</creatorcontrib><creatorcontrib>Davidson, Thomas</creatorcontrib><creatorcontrib>Hwang, Jena D</creatorcontrib><creatorcontrib>Swayamdipta, Swabha</creatorcontrib><creatorcontrib>Sap, Maarten</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhou, Xuhui</au><au>Zhu, Hao</au><au>Yerukola, Akhila</au><au>Davidson, Thomas</au><au>Hwang, Jena D</au><au>Swayamdipta, Swabha</au><au>Sap, Maarten</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</atitle><date>2023-06-02</date><risdate>2023</risdate><abstract>Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly signal an insult when uttered by a white man to a non-white colleague, but uttered by an ESL teacher to their student would be interpreted as a genuine compliment. Such contextual factors have been largely ignored by previous approaches to toxic language detection. We introduce COBRA frames, the first context-aware formalism for explaining the intents, reactions, and harms of offensive or biased statements grounded in their social and situational context. We create COBRACORPUS, a dataset of 33k potentially offensive statements paired with machine-generated contexts and free-text explanations of offensiveness, implied biases, speaker intents, and listener reactions. To study the contextual dynamics of offensiveness, we train models to generate COBRA explanations, with and without access to the context. We find that explanations by context-agnostic models are significantly worse than by context-aware ones, especially in situations where the context inverts the statement's offensiveness (29% accuracy drop). Our work highlights the importance and feasibility of contextualized NLP by modeling social factors.</abstract><doi>10.48550/arxiv.2306.01985</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2306.01985
ispartof
issn
language eng
recordid cdi_arxiv_primary_2306_01985
source arXiv.org
subjects Computer Science - Computation and Language
title COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T07%3A54%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=COBRA%20Frames:%20Contextual%20Reasoning%20about%20Effects%20and%20Harms%20of%20Offensive%20Statements&rft.au=Zhou,%20Xuhui&rft.date=2023-06-02&rft_id=info:doi/10.48550/arxiv.2306.01985&rft_dat=%3Carxiv_GOX%3E2306_01985%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true