The Moral Foundations Weibo Corpus
Moral sentiments expressed in natural language significantly influence both online and offline environments, shaping behavioral styles and interaction patterns, including social media selfpresentation, cyberbullying, adherence to social norms, and ethical decision-making. To effectively measure mora...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Cao, Renjie Hu, Miaoyan Wei, Jiahan Ihnaini, Baha |
description | Moral sentiments expressed in natural language significantly influence both
online and offline environments, shaping behavioral styles and interaction
patterns, including social media selfpresentation, cyberbullying, adherence to
social norms, and ethical decision-making. To effectively measure moral
sentiments in natural language processing texts, it is crucial to utilize
large, annotated datasets that provide nuanced understanding for accurate
analysis and modeltraining. However, existing corpora, while valuable, often
face linguistic limitations. To address this gap in the Chinese language
domain,we introduce the Moral Foundation Weibo Corpus. This corpus consists of
25,671 Chinese comments on Weibo, encompassing six diverse topic areas. Each
comment is manually annotated by at least three systematically trained
annotators based on ten moral categories derived from a grounded theory of
morality. To assess annotator reliability, we present the kappa testresults, a
gold standard for measuring consistency. Additionally, we apply several the
latest large language models to supplement the manual annotations, conducting
analytical experiments to compare their performance and report baseline results
for moral sentiment classification. |
doi_str_mv | 10.48550/arxiv.2411.09612 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_09612</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_09612</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_096123</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DOwNDM04mRQCslIVfDNL0rMUXDLL81LSSzJzM8rVghPzUzKV3DOLyooLeZhYE1LzClO5YXS3Azybq4hzh66YNPiC4oycxOLKuNBpsaDTTUmrAIAPB4rPA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>The Moral Foundations Weibo Corpus</title><source>arXiv.org</source><creator>Cao, Renjie ; Hu, Miaoyan ; Wei, Jiahan ; Ihnaini, Baha</creator><creatorcontrib>Cao, Renjie ; Hu, Miaoyan ; Wei, Jiahan ; Ihnaini, Baha</creatorcontrib><description>Moral sentiments expressed in natural language significantly influence both
online and offline environments, shaping behavioral styles and interaction
patterns, including social media selfpresentation, cyberbullying, adherence to
social norms, and ethical decision-making. To effectively measure moral
sentiments in natural language processing texts, it is crucial to utilize
large, annotated datasets that provide nuanced understanding for accurate
analysis and modeltraining. However, existing corpora, while valuable, often
face linguistic limitations. To address this gap in the Chinese language
domain,we introduce the Moral Foundation Weibo Corpus. This corpus consists of
25,671 Chinese comments on Weibo, encompassing six diverse topic areas. Each
comment is manually annotated by at least three systematically trained
annotators based on ten moral categories derived from a grounded theory of
morality. To assess annotator reliability, we present the kappa testresults, a
gold standard for measuring consistency. Additionally, we apply several the
latest large language models to supplement the manual annotations, conducting
analytical experiments to compare their performance and report baseline results
for moral sentiment classification.</description><identifier>DOI: 10.48550/arxiv.2411.09612</identifier><language>eng</language><subject>Computer Science - Computation and Language ; Computer Science - Learning</subject><creationdate>2024-11</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.09612$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.09612$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Cao, Renjie</creatorcontrib><creatorcontrib>Hu, Miaoyan</creatorcontrib><creatorcontrib>Wei, Jiahan</creatorcontrib><creatorcontrib>Ihnaini, Baha</creatorcontrib><title>The Moral Foundations Weibo Corpus</title><description>Moral sentiments expressed in natural language significantly influence both
online and offline environments, shaping behavioral styles and interaction
patterns, including social media selfpresentation, cyberbullying, adherence to
social norms, and ethical decision-making. To effectively measure moral
sentiments in natural language processing texts, it is crucial to utilize
large, annotated datasets that provide nuanced understanding for accurate
analysis and modeltraining. However, existing corpora, while valuable, often
face linguistic limitations. To address this gap in the Chinese language
domain,we introduce the Moral Foundation Weibo Corpus. This corpus consists of
25,671 Chinese comments on Weibo, encompassing six diverse topic areas. Each
comment is manually annotated by at least three systematically trained
annotators based on ten moral categories derived from a grounded theory of
morality. To assess annotator reliability, we present the kappa testresults, a
gold standard for measuring consistency. Additionally, we apply several the
latest large language models to supplement the manual annotations, conducting
analytical experiments to compare their performance and report baseline results
for moral sentiment classification.</description><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DOwNDM04mRQCslIVfDNL0rMUXDLL81LSSzJzM8rVghPzUzKV3DOLyooLeZhYE1LzClO5YXS3Azybq4hzh66YNPiC4oycxOLKuNBpsaDTTUmrAIAPB4rPA</recordid><startdate>20241114</startdate><enddate>20241114</enddate><creator>Cao, Renjie</creator><creator>Hu, Miaoyan</creator><creator>Wei, Jiahan</creator><creator>Ihnaini, Baha</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241114</creationdate><title>The Moral Foundations Weibo Corpus</title><author>Cao, Renjie ; Hu, Miaoyan ; Wei, Jiahan ; Ihnaini, Baha</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_096123</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Cao, Renjie</creatorcontrib><creatorcontrib>Hu, Miaoyan</creatorcontrib><creatorcontrib>Wei, Jiahan</creatorcontrib><creatorcontrib>Ihnaini, Baha</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Cao, Renjie</au><au>Hu, Miaoyan</au><au>Wei, Jiahan</au><au>Ihnaini, Baha</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Moral Foundations Weibo Corpus</atitle><date>2024-11-14</date><risdate>2024</risdate><abstract>Moral sentiments expressed in natural language significantly influence both
online and offline environments, shaping behavioral styles and interaction
patterns, including social media selfpresentation, cyberbullying, adherence to
social norms, and ethical decision-making. To effectively measure moral
sentiments in natural language processing texts, it is crucial to utilize
large, annotated datasets that provide nuanced understanding for accurate
analysis and modeltraining. However, existing corpora, while valuable, often
face linguistic limitations. To address this gap in the Chinese language
domain,we introduce the Moral Foundation Weibo Corpus. This corpus consists of
25,671 Chinese comments on Weibo, encompassing six diverse topic areas. Each
comment is manually annotated by at least three systematically trained
annotators based on ten moral categories derived from a grounded theory of
morality. To assess annotator reliability, we present the kappa testresults, a
gold standard for measuring consistency. Additionally, we apply several the
latest large language models to supplement the manual annotations, conducting
analytical experiments to compare their performance and report baseline results
for moral sentiment classification.</abstract><doi>10.48550/arxiv.2411.09612</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2411.09612 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2411_09612 |
source | arXiv.org |
subjects | Computer Science - Computation and Language Computer Science - Learning |
title | The Moral Foundations Weibo Corpus |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T17%3A55%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Moral%20Foundations%20Weibo%20Corpus&rft.au=Cao,%20Renjie&rft.date=2024-11-14&rft_id=info:doi/10.48550/arxiv.2411.09612&rft_dat=%3Carxiv_GOX%3E2411_09612%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |