NLP and machine learning to measure peace from news media

“Hate speech” can mobilize violence and destruction.  What are the characteristics of “peace speech” that reflect and support the social processes that maintain peace?  In this study we used a data driven, machine learning approach to identify the words most associated with lower-peace versus higher...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Liebovitch, Larry, Powers, William, Shi, Lin, Chen-Carrel, Allegra, Loustaunau, Philippe, Coleman, Peter
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Liebovitch, Larry
Powers, William
Shi, Lin
Chen-Carrel, Allegra
Loustaunau, Philippe
Coleman, Peter
description “Hate speech” can mobilize violence and destruction.  What are the characteristics of “peace speech” that reflect and support the social processes that maintain peace?  In this study we used a data driven, machine learning approach to identify the words most associated with lower-peace versus higher-peace countries. Logistic regression and random forest classifiers were trained using five respected, traditional peace indices: Global Peace Index, Positive Peace Index, World Happiness Index, Fragile States Index, and Human Development Index. The feature inputs into the machine learning model were the word frequencies from the news media in each country and the output classifications were the level of peace in that country.  The machine learning model was successful in properly classifying the level of peace from the news media in a country (both accuracy and F1: 96% - 100%). We also used that trained machine model to create a machine learning peace index that measured the level of peace in countries, including countries not in the training set, which correlated with the average of those five traditional peace indices (r-squared = 0.8349). Using the random forest feature importance method we found that the words in news media in lower-peace countries were characterized by words related to government, order, control and fear (such as government, state, law, security and court), while higher-peace countries were characterized by an increased prevalence of words related to optimism for the future and fun (such as time, like, home, believe and game).
doi_str_mv 10.5061/dryad.2v6wwpzv6
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5061_dryad_2v6wwpzv6</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5061_dryad_2v6wwpzv6</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_5061_dryad_2v6wwpzv63</originalsourceid><addsrcrecordid>eNpjYBA3NNAzNTAz1E8pqkxM0TMqMysvL6gqM-NksPTzCVBIzEtRyE1MzsjMS1XISU0sysvMS1coyVfITU0sLi1KVShITUxOVUgrys9VyEstLwaKp2Qm8jCwpiXmFKfyQmluBn031xBnD92UxJLE5MyS1PiCoszcxKLKeEODeJDl8WDL4-GWG5OuAwBWKUHx</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>NLP and machine learning to measure peace from news media</title><source>DataCite</source><creator>Liebovitch, Larry ; Powers, William ; Shi, Lin ; Chen-Carrel, Allegra ; Loustaunau, Philippe ; Coleman, Peter</creator><creatorcontrib>Liebovitch, Larry ; Powers, William ; Shi, Lin ; Chen-Carrel, Allegra ; Loustaunau, Philippe ; Coleman, Peter</creatorcontrib><description>“Hate speech” can mobilize violence and destruction.  What are the characteristics of “peace speech” that reflect and support the social processes that maintain peace?  In this study we used a data driven, machine learning approach to identify the words most associated with lower-peace versus higher-peace countries. Logistic regression and random forest classifiers were trained using five respected, traditional peace indices: Global Peace Index, Positive Peace Index, World Happiness Index, Fragile States Index, and Human Development Index. The feature inputs into the machine learning model were the word frequencies from the news media in each country and the output classifications were the level of peace in that country.  The machine learning model was successful in properly classifying the level of peace from the news media in a country (both accuracy and F1: 96% - 100%). We also used that trained machine model to create a machine learning peace index that measured the level of peace in countries, including countries not in the training set, which correlated with the average of those five traditional peace indices (r-squared = 0.8349). Using the random forest feature importance method we found that the words in news media in lower-peace countries were characterized by words related to government, order, control and fear (such as government, state, law, security and court), while higher-peace countries were characterized by an increased prevalence of words related to optimism for the future and fun (such as time, like, home, believe and game).</description><identifier>DOI: 10.5061/dryad.2v6wwpzv6</identifier><language>eng</language><publisher>Dryad</publisher><subject>FOS: Social sciences ; Machine learning ; measuring peace ; Natural language processing</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-8842-163X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,1894</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5061/dryad.2v6wwpzv6$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Liebovitch, Larry</creatorcontrib><creatorcontrib>Powers, William</creatorcontrib><creatorcontrib>Shi, Lin</creatorcontrib><creatorcontrib>Chen-Carrel, Allegra</creatorcontrib><creatorcontrib>Loustaunau, Philippe</creatorcontrib><creatorcontrib>Coleman, Peter</creatorcontrib><title>NLP and machine learning to measure peace from news media</title><description>“Hate speech” can mobilize violence and destruction.  What are the characteristics of “peace speech” that reflect and support the social processes that maintain peace?  In this study we used a data driven, machine learning approach to identify the words most associated with lower-peace versus higher-peace countries. Logistic regression and random forest classifiers were trained using five respected, traditional peace indices: Global Peace Index, Positive Peace Index, World Happiness Index, Fragile States Index, and Human Development Index. The feature inputs into the machine learning model were the word frequencies from the news media in each country and the output classifications were the level of peace in that country.  The machine learning model was successful in properly classifying the level of peace from the news media in a country (both accuracy and F1: 96% - 100%). We also used that trained machine model to create a machine learning peace index that measured the level of peace in countries, including countries not in the training set, which correlated with the average of those five traditional peace indices (r-squared = 0.8349). Using the random forest feature importance method we found that the words in news media in lower-peace countries were characterized by words related to government, order, control and fear (such as government, state, law, security and court), while higher-peace countries were characterized by an increased prevalence of words related to optimism for the future and fun (such as time, like, home, believe and game).</description><subject>FOS: Social sciences</subject><subject>Machine learning</subject><subject>measuring peace</subject><subject>Natural language processing</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2023</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNpjYBA3NNAzNTAz1E8pqkxM0TMqMysvL6gqM-NksPTzCVBIzEtRyE1MzsjMS1XISU0sysvMS1coyVfITU0sLi1KVShITUxOVUgrys9VyEstLwaKp2Qm8jCwpiXmFKfyQmluBn031xBnD92UxJLE5MyS1PiCoszcxKLKeEODeJDl8WDL4-GWG5OuAwBWKUHx</recordid><startdate>20231115</startdate><enddate>20231115</enddate><creator>Liebovitch, Larry</creator><creator>Powers, William</creator><creator>Shi, Lin</creator><creator>Chen-Carrel, Allegra</creator><creator>Loustaunau, Philippe</creator><creator>Coleman, Peter</creator><general>Dryad</general><scope>DYCCY</scope><scope>PQ8</scope><orcidid>https://orcid.org/0000-0002-8842-163X</orcidid></search><sort><creationdate>20231115</creationdate><title>NLP and machine learning to measure peace from news media</title><author>Liebovitch, Larry ; Powers, William ; Shi, Lin ; Chen-Carrel, Allegra ; Loustaunau, Philippe ; Coleman, Peter</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_5061_dryad_2v6wwpzv63</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2023</creationdate><topic>FOS: Social sciences</topic><topic>Machine learning</topic><topic>measuring peace</topic><topic>Natural language processing</topic><toplevel>online_resources</toplevel><creatorcontrib>Liebovitch, Larry</creatorcontrib><creatorcontrib>Powers, William</creatorcontrib><creatorcontrib>Shi, Lin</creatorcontrib><creatorcontrib>Chen-Carrel, Allegra</creatorcontrib><creatorcontrib>Loustaunau, Philippe</creatorcontrib><creatorcontrib>Coleman, Peter</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Liebovitch, Larry</au><au>Powers, William</au><au>Shi, Lin</au><au>Chen-Carrel, Allegra</au><au>Loustaunau, Philippe</au><au>Coleman, Peter</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>NLP and machine learning to measure peace from news media</title><date>2023-11-15</date><risdate>2023</risdate><abstract>“Hate speech” can mobilize violence and destruction.  What are the characteristics of “peace speech” that reflect and support the social processes that maintain peace?  In this study we used a data driven, machine learning approach to identify the words most associated with lower-peace versus higher-peace countries. Logistic regression and random forest classifiers were trained using five respected, traditional peace indices: Global Peace Index, Positive Peace Index, World Happiness Index, Fragile States Index, and Human Development Index. The feature inputs into the machine learning model were the word frequencies from the news media in each country and the output classifications were the level of peace in that country.  The machine learning model was successful in properly classifying the level of peace from the news media in a country (both accuracy and F1: 96% - 100%). We also used that trained machine model to create a machine learning peace index that measured the level of peace in countries, including countries not in the training set, which correlated with the average of those five traditional peace indices (r-squared = 0.8349). Using the random forest feature importance method we found that the words in news media in lower-peace countries were characterized by words related to government, order, control and fear (such as government, state, law, security and court), while higher-peace countries were characterized by an increased prevalence of words related to optimism for the future and fun (such as time, like, home, believe and game).</abstract><pub>Dryad</pub><doi>10.5061/dryad.2v6wwpzv6</doi><orcidid>https://orcid.org/0000-0002-8842-163X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5061/dryad.2v6wwpzv6
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5061_dryad_2v6wwpzv6
source DataCite
subjects FOS: Social sciences
Machine learning
measuring peace
Natural language processing
title NLP and machine learning to measure peace from news media
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T12%3A14%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Liebovitch,%20Larry&rft.date=2023-11-15&rft_id=info:doi/10.5061/dryad.2v6wwpzv6&rft_dat=%3Cdatacite_PQ8%3E10_5061_dryad_2v6wwpzv6%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true