The media framing dataset: Analyzing news narratives in Mexico and Colombia

This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, pu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Data in brief 2025-02, Vol.58, p.111284, Article 111284
Hauptverfasser: Cuadrado, Juan, Martinez, Elizabeth, Martinez-Santos, Juan Carlos, Puertas, Edwin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 111284
container_title Data in brief
container_volume 58
creator Cuadrado, Juan
Martinez, Elizabeth
Martinez-Santos, Juan Carlos
Puertas, Edwin
description This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more. To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth. “The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.
doi_str_mv 10.1016/j.dib.2025.111284
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11787578</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2352340925000162</els_id><sourcerecordid>3162849601</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-dd90ff00fb1959f32884f2ea427a3b5d0a220b957fc402ad6c1e0e206c3e38b43</originalsourceid><addsrcrecordid>eNp9kctOHDEQRa0IFBDhA7KJvGQzEz_a_YAFQqM8UEBsYG1V22XwqNsGu2fI5Ovj0QAim6yqVL51XbqHkM-czTnj9dfl3Pp-LphQc865aKsP5FBIJWayYt3eu_6AHOe8ZIxxVZWh-kgOZNd2qq7FIfl1-4B0ROuBugSjD_fUwgQZp1N6EWDY_NmOAj5nGiAlmPwaM_WBXuNvbyKFYOkiDnHsPXwi-w6GjMcv9Yjcff92u_g5u7r5cbm4uJoZKatpZm3HnGPM9bxTnZOibSsnECrRgOyVZSAE6zvVOFMxAbY2HBkKVhuJsu0reUTOd76Pq76cbjBMCQb9mPwIaaMjeP3vS_AP-j6uNedN26imLQ4nLw4pPq0wT3r02eAwQMC4ylryugTa1YwXKd9JTYo5J3Rv_3CmtyD0UhcQegtC70CUnS_vD3zbeI29CM52AiwxrT0mnY3HYAqHhGbSNvr_2P8FxreY5g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3162849601</pqid></control><display><type>article</type><title>The media framing dataset: Analyzing news narratives in Mexico and Colombia</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Cuadrado, Juan ; Martinez, Elizabeth ; Martinez-Santos, Juan Carlos ; Puertas, Edwin</creator><creatorcontrib>Cuadrado, Juan ; Martinez, Elizabeth ; Martinez-Santos, Juan Carlos ; Puertas, Edwin</creatorcontrib><description>This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more. To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth. “The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</description><identifier>ISSN: 2352-3409</identifier><identifier>EISSN: 2352-3409</identifier><identifier>DOI: 10.1016/j.dib.2025.111284</identifier><identifier>PMID: 39895662</identifier><language>eng</language><publisher>Netherlands: Elsevier Inc</publisher><subject>Computational linguistics ; Content annotation ; Cross-cultural studies ; Data ; Media analysis ; News content ; NLP resources ; Sentiment analysis</subject><ispartof>Data in brief, 2025-02, Vol.58, p.111284, Article 111284</ispartof><rights>2025</rights><rights>2025 The Authors. Published by Elsevier Inc.</rights><rights>2025 The Authors. Published by Elsevier Inc. 2025</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c334t-dd90ff00fb1959f32884f2ea427a3b5d0a220b957fc402ad6c1e0e206c3e38b43</cites><orcidid>0000-0002-8226-1372 ; 0000-0002-0758-1851</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11787578/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11787578/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39895662$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Cuadrado, Juan</creatorcontrib><creatorcontrib>Martinez, Elizabeth</creatorcontrib><creatorcontrib>Martinez-Santos, Juan Carlos</creatorcontrib><creatorcontrib>Puertas, Edwin</creatorcontrib><title>The media framing dataset: Analyzing news narratives in Mexico and Colombia</title><title>Data in brief</title><addtitle>Data Brief</addtitle><description>This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more. To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth. “The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</description><subject>Computational linguistics</subject><subject>Content annotation</subject><subject>Cross-cultural studies</subject><subject>Data</subject><subject>Media analysis</subject><subject>News content</subject><subject>NLP resources</subject><subject>Sentiment analysis</subject><issn>2352-3409</issn><issn>2352-3409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kctOHDEQRa0IFBDhA7KJvGQzEz_a_YAFQqM8UEBsYG1V22XwqNsGu2fI5Ovj0QAim6yqVL51XbqHkM-czTnj9dfl3Pp-LphQc865aKsP5FBIJWayYt3eu_6AHOe8ZIxxVZWh-kgOZNd2qq7FIfl1-4B0ROuBugSjD_fUwgQZp1N6EWDY_NmOAj5nGiAlmPwaM_WBXuNvbyKFYOkiDnHsPXwi-w6GjMcv9Yjcff92u_g5u7r5cbm4uJoZKatpZm3HnGPM9bxTnZOibSsnECrRgOyVZSAE6zvVOFMxAbY2HBkKVhuJsu0reUTOd76Pq76cbjBMCQb9mPwIaaMjeP3vS_AP-j6uNedN26imLQ4nLw4pPq0wT3r02eAwQMC4ylryugTa1YwXKd9JTYo5J3Rv_3CmtyD0UhcQegtC70CUnS_vD3zbeI29CM52AiwxrT0mnY3HYAqHhGbSNvr_2P8FxreY5g</recordid><startdate>20250201</startdate><enddate>20250201</enddate><creator>Cuadrado, Juan</creator><creator>Martinez, Elizabeth</creator><creator>Martinez-Santos, Juan Carlos</creator><creator>Puertas, Edwin</creator><general>Elsevier Inc</general><general>Elsevier</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-8226-1372</orcidid><orcidid>https://orcid.org/0000-0002-0758-1851</orcidid></search><sort><creationdate>20250201</creationdate><title>The media framing dataset: Analyzing news narratives in Mexico and Colombia</title><author>Cuadrado, Juan ; Martinez, Elizabeth ; Martinez-Santos, Juan Carlos ; Puertas, Edwin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-dd90ff00fb1959f32884f2ea427a3b5d0a220b957fc402ad6c1e0e206c3e38b43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computational linguistics</topic><topic>Content annotation</topic><topic>Cross-cultural studies</topic><topic>Data</topic><topic>Media analysis</topic><topic>News content</topic><topic>NLP resources</topic><topic>Sentiment analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cuadrado, Juan</creatorcontrib><creatorcontrib>Martinez, Elizabeth</creatorcontrib><creatorcontrib>Martinez-Santos, Juan Carlos</creatorcontrib><creatorcontrib>Puertas, Edwin</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Data in brief</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cuadrado, Juan</au><au>Martinez, Elizabeth</au><au>Martinez-Santos, Juan Carlos</au><au>Puertas, Edwin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The media framing dataset: Analyzing news narratives in Mexico and Colombia</atitle><jtitle>Data in brief</jtitle><addtitle>Data Brief</addtitle><date>2025-02-01</date><risdate>2025</risdate><volume>58</volume><spage>111284</spage><pages>111284-</pages><artnum>111284</artnum><issn>2352-3409</issn><eissn>2352-3409</eissn><abstract>This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more. To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth. “The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</abstract><cop>Netherlands</cop><pub>Elsevier Inc</pub><pmid>39895662</pmid><doi>10.1016/j.dib.2025.111284</doi><orcidid>https://orcid.org/0000-0002-8226-1372</orcidid><orcidid>https://orcid.org/0000-0002-0758-1851</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2352-3409
ispartof Data in brief, 2025-02, Vol.58, p.111284, Article 111284
issn 2352-3409
2352-3409
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11787578
source DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection
subjects Computational linguistics
Content annotation
Cross-cultural studies
Data
Media analysis
News content
NLP resources
Sentiment analysis
title The media framing dataset: Analyzing news narratives in Mexico and Colombia
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T06%3A00%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20media%20framing%20dataset:%20Analyzing%20news%20narratives%20in%20Mexico%20and%20Colombia&rft.jtitle=Data%20in%20brief&rft.au=Cuadrado,%20Juan&rft.date=2025-02-01&rft.volume=58&rft.spage=111284&rft.pages=111284-&rft.artnum=111284&rft.issn=2352-3409&rft.eissn=2352-3409&rft_id=info:doi/10.1016/j.dib.2025.111284&rft_dat=%3Cproquest_pubme%3E3162849601%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3162849601&rft_id=info:pmid/39895662&rft_els_id=S2352340925000162&rfr_iscdi=true