COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly s...
Gespeichert in:
Hauptverfasser: | , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Zhou, Xuhui Zhu, Hao Yerukola, Akhila Davidson, Thomas Hwang, Jena D Swayamdipta, Swabha Sap, Maarten |
description | Warning: This paper contains content that may be offensive or upsetting.
Understanding the harms and offensiveness of statements requires reasoning
about the social and situational context in which statements are made. For
example, the utterance "your English is very good" may implicitly signal an
insult when uttered by a white man to a non-white colleague, but uttered by an
ESL teacher to their student would be interpreted as a genuine compliment. Such
contextual factors have been largely ignored by previous approaches to toxic
language detection. We introduce COBRA frames, the first context-aware
formalism for explaining the intents, reactions, and harms of offensive or
biased statements grounded in their social and situational context. We create
COBRACORPUS, a dataset of 33k potentially offensive statements paired with
machine-generated contexts and free-text explanations of offensiveness, implied
biases, speaker intents, and listener reactions. To study the contextual
dynamics of offensiveness, we train models to generate COBRA explanations, with
and without access to the context. We find that explanations by
context-agnostic models are significantly worse than by context-aware ones,
especially in situations where the context inverts the statement's
offensiveness (29% accuracy drop). Our work highlights the importance and
feasibility of contextualized NLP by modeling social factors. |
doi_str_mv | 10.48550/arxiv.2306.01985 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2306_01985</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2306_01985</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-c97f192548cbc6459ef8b19ddd99d8400ef9a97171a2b16a63a334b2536f18b93</originalsourceid><addsrcrecordid>eNotz09LwzAcxvFcPMj0BXjy9wZak-ZPE2-zbE4YFOru5ZcmkcKaSpON-e7V6emB7-GBDyEPjJZCS0mfcLmM57LiVJWUGS1vSde0L90atgtOPj1DM8fsL_mER-g8pjmO8QPQzqcMmxD8kBNgdLDDZUowB2h_Ykzj2cN7xuwnH3O6IzcBj8nf_--KHLabQ7Mr9u3rW7PeF6hqWQymDsxUUujBDkpI44O2zDjnjHFaUOqDQVOzmmFlmULFkXNhK8lVYNoaviKPf7dXVP-5jBMuX_0vrr_i-DdLnUlR</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</title><source>arXiv.org</source><creator>Zhou, Xuhui ; Zhu, Hao ; Yerukola, Akhila ; Davidson, Thomas ; Hwang, Jena D ; Swayamdipta, Swabha ; Sap, Maarten</creator><creatorcontrib>Zhou, Xuhui ; Zhu, Hao ; Yerukola, Akhila ; Davidson, Thomas ; Hwang, Jena D ; Swayamdipta, Swabha ; Sap, Maarten</creatorcontrib><description>Warning: This paper contains content that may be offensive or upsetting.
Understanding the harms and offensiveness of statements requires reasoning
about the social and situational context in which statements are made. For
example, the utterance "your English is very good" may implicitly signal an
insult when uttered by a white man to a non-white colleague, but uttered by an
ESL teacher to their student would be interpreted as a genuine compliment. Such
contextual factors have been largely ignored by previous approaches to toxic
language detection. We introduce COBRA frames, the first context-aware
formalism for explaining the intents, reactions, and harms of offensive or
biased statements grounded in their social and situational context. We create
COBRACORPUS, a dataset of 33k potentially offensive statements paired with
machine-generated contexts and free-text explanations of offensiveness, implied
biases, speaker intents, and listener reactions. To study the contextual
dynamics of offensiveness, we train models to generate COBRA explanations, with
and without access to the context. We find that explanations by
context-agnostic models are significantly worse than by context-aware ones,
especially in situations where the context inverts the statement's
offensiveness (29% accuracy drop). Our work highlights the importance and
feasibility of contextualized NLP by modeling social factors.</description><identifier>DOI: 10.48550/arxiv.2306.01985</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2023-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2306.01985$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2306.01985$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhou, Xuhui</creatorcontrib><creatorcontrib>Zhu, Hao</creatorcontrib><creatorcontrib>Yerukola, Akhila</creatorcontrib><creatorcontrib>Davidson, Thomas</creatorcontrib><creatorcontrib>Hwang, Jena D</creatorcontrib><creatorcontrib>Swayamdipta, Swabha</creatorcontrib><creatorcontrib>Sap, Maarten</creatorcontrib><title>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</title><description>Warning: This paper contains content that may be offensive or upsetting.
Understanding the harms and offensiveness of statements requires reasoning
about the social and situational context in which statements are made. For
example, the utterance "your English is very good" may implicitly signal an
insult when uttered by a white man to a non-white colleague, but uttered by an
ESL teacher to their student would be interpreted as a genuine compliment. Such
contextual factors have been largely ignored by previous approaches to toxic
language detection. We introduce COBRA frames, the first context-aware
formalism for explaining the intents, reactions, and harms of offensive or
biased statements grounded in their social and situational context. We create
COBRACORPUS, a dataset of 33k potentially offensive statements paired with
machine-generated contexts and free-text explanations of offensiveness, implied
biases, speaker intents, and listener reactions. To study the contextual
dynamics of offensiveness, we train models to generate COBRA explanations, with
and without access to the context. We find that explanations by
context-agnostic models are significantly worse than by context-aware ones,
especially in situations where the context inverts the statement's
offensiveness (29% accuracy drop). Our work highlights the importance and
feasibility of contextualized NLP by modeling social factors.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz09LwzAcxvFcPMj0BXjy9wZak-ZPE2-zbE4YFOru5ZcmkcKaSpON-e7V6emB7-GBDyEPjJZCS0mfcLmM57LiVJWUGS1vSde0L90atgtOPj1DM8fsL_mER-g8pjmO8QPQzqcMmxD8kBNgdLDDZUowB2h_Ykzj2cN7xuwnH3O6IzcBj8nf_--KHLabQ7Mr9u3rW7PeF6hqWQymDsxUUujBDkpI44O2zDjnjHFaUOqDQVOzmmFlmULFkXNhK8lVYNoaviKPf7dXVP-5jBMuX_0vrr_i-DdLnUlR</recordid><startdate>20230602</startdate><enddate>20230602</enddate><creator>Zhou, Xuhui</creator><creator>Zhu, Hao</creator><creator>Yerukola, Akhila</creator><creator>Davidson, Thomas</creator><creator>Hwang, Jena D</creator><creator>Swayamdipta, Swabha</creator><creator>Sap, Maarten</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230602</creationdate><title>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</title><author>Zhou, Xuhui ; Zhu, Hao ; Yerukola, Akhila ; Davidson, Thomas ; Hwang, Jena D ; Swayamdipta, Swabha ; Sap, Maarten</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-c97f192548cbc6459ef8b19ddd99d8400ef9a97171a2b16a63a334b2536f18b93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Zhou, Xuhui</creatorcontrib><creatorcontrib>Zhu, Hao</creatorcontrib><creatorcontrib>Yerukola, Akhila</creatorcontrib><creatorcontrib>Davidson, Thomas</creatorcontrib><creatorcontrib>Hwang, Jena D</creatorcontrib><creatorcontrib>Swayamdipta, Swabha</creatorcontrib><creatorcontrib>Sap, Maarten</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhou, Xuhui</au><au>Zhu, Hao</au><au>Yerukola, Akhila</au><au>Davidson, Thomas</au><au>Hwang, Jena D</au><au>Swayamdipta, Swabha</au><au>Sap, Maarten</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements</atitle><date>2023-06-02</date><risdate>2023</risdate><abstract>Warning: This paper contains content that may be offensive or upsetting.
Understanding the harms and offensiveness of statements requires reasoning
about the social and situational context in which statements are made. For
example, the utterance "your English is very good" may implicitly signal an
insult when uttered by a white man to a non-white colleague, but uttered by an
ESL teacher to their student would be interpreted as a genuine compliment. Such
contextual factors have been largely ignored by previous approaches to toxic
language detection. We introduce COBRA frames, the first context-aware
formalism for explaining the intents, reactions, and harms of offensive or
biased statements grounded in their social and situational context. We create
COBRACORPUS, a dataset of 33k potentially offensive statements paired with
machine-generated contexts and free-text explanations of offensiveness, implied
biases, speaker intents, and listener reactions. To study the contextual
dynamics of offensiveness, we train models to generate COBRA explanations, with
and without access to the context. We find that explanations by
context-agnostic models are significantly worse than by context-aware ones,
especially in situations where the context inverts the statement's
offensiveness (29% accuracy drop). Our work highlights the importance and
feasibility of contextualized NLP by modeling social factors.</abstract><doi>10.48550/arxiv.2306.01985</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2306.01985 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2306_01985 |
source | arXiv.org |
subjects | Computer Science - Computation and Language |
title | COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T07%3A54%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=COBRA%20Frames:%20Contextual%20Reasoning%20about%20Effects%20and%20Harms%20of%20Offensive%20Statements&rft.au=Zhou,%20Xuhui&rft.date=2023-06-02&rft_id=info:doi/10.48550/arxiv.2306.01985&rft_dat=%3Carxiv_GOX%3E2306_01985%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |