The media framing dataset: Analyzing news narratives in Mexico and Colombia
This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, pu...
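Published in: | Data in brief 2025-02, Vol.58, p.111284, Article 111284 |
---|---|
Main authors: | Cuadrado, Juan; Martinez, Elizabeth; Martinez-Santos, Juan Carlos; Puertas, Edwin |
Format: | Article |
Language: | eng |
Subjects: | Computational linguistics; Content annotation; Cross-cultural studies; Data; Media analysis; News content; NLP resources; Sentiment analysis |
Online access: | Full text |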
container_end_page | |
---|---|
container_issue | |
container_start_page | 111284 |
container_title | Data in brief |
container_volume | 58 |
creator | Cuadrado, Juan; Martinez, Elizabeth; Martinez-Santos, Juan Carlos; Puertas, Edwin |
description | This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more.
To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth.
“The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis. |
doi_str_mv | 10.1016/j.dib.2025.111284 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11787578</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2352340925000162</els_id><sourcerecordid>3162849601</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-dd90ff00fb1959f32884f2ea427a3b5d0a220b957fc402ad6c1e0e206c3e38b43</originalsourceid><addsrcrecordid>eNp9kctOHDEQRa0IFBDhA7KJvGQzEz_a_YAFQqM8UEBsYG1V22XwqNsGu2fI5Ovj0QAim6yqVL51XbqHkM-czTnj9dfl3Pp-LphQc865aKsP5FBIJWayYt3eu_6AHOe8ZIxxVZWh-kgOZNd2qq7FIfl1-4B0ROuBugSjD_fUwgQZp1N6EWDY_NmOAj5nGiAlmPwaM_WBXuNvbyKFYOkiDnHsPXwi-w6GjMcv9Yjcff92u_g5u7r5cbm4uJoZKatpZm3HnGPM9bxTnZOibSsnECrRgOyVZSAE6zvVOFMxAbY2HBkKVhuJsu0reUTOd76Pq76cbjBMCQb9mPwIaaMjeP3vS_AP-j6uNedN26imLQ4nLw4pPq0wT3r02eAwQMC4ylryugTa1YwXKd9JTYo5J3Rv_3CmtyD0UhcQegtC70CUnS_vD3zbeI29CM52AiwxrT0mnY3HYAqHhGbSNvr_2P8FxreY5g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3162849601</pqid></control><display><type>article</type><title>The media framing dataset: Analyzing news narratives in Mexico and Colombia</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Cuadrado, Juan ; Martinez, Elizabeth ; Martinez-Santos, Juan Carlos ; Puertas, Edwin</creator><creatorcontrib>Cuadrado, Juan ; Martinez, Elizabeth ; Martinez-Santos, Juan Carlos ; Puertas, Edwin</creatorcontrib><description>This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. 
The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more.
To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth.
“The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</description><identifier>ISSN: 2352-3409</identifier><identifier>EISSN: 2352-3409</identifier><identifier>DOI: 10.1016/j.dib.2025.111284</identifier><identifier>PMID: 39895662</identifier><language>eng</language><publisher>Netherlands: Elsevier Inc</publisher><subject>Computational linguistics ; Content annotation ; Cross-cultural studies ; Data ; Media analysis ; News content ; NLP resources ; Sentiment analysis</subject><ispartof>Data in brief, 2025-02, Vol.58, p.111284, Article 111284</ispartof><rights>2025</rights><rights>2025 The Authors. Published by Elsevier Inc.</rights><rights>2025 The Authors. Published by Elsevier Inc. 
2025</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c334t-dd90ff00fb1959f32884f2ea427a3b5d0a220b957fc402ad6c1e0e206c3e38b43</cites><orcidid>0000-0002-8226-1372 ; 0000-0002-0758-1851</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11787578/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11787578/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39895662$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Cuadrado, Juan</creatorcontrib><creatorcontrib>Martinez, Elizabeth</creatorcontrib><creatorcontrib>Martinez-Santos, Juan Carlos</creatorcontrib><creatorcontrib>Puertas, Edwin</creatorcontrib><title>The media framing dataset: Analyzing news narratives in Mexico and Colombia</title><title>Data in brief</title><addtitle>Data Brief</addtitle><description>This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more.
To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth.
“The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</description><subject>Computational linguistics</subject><subject>Content annotation</subject><subject>Cross-cultural studies</subject><subject>Data</subject><subject>Media analysis</subject><subject>News content</subject><subject>NLP resources</subject><subject>Sentiment analysis</subject><issn>2352-3409</issn><issn>2352-3409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kctOHDEQRa0IFBDhA7KJvGQzEz_a_YAFQqM8UEBsYG1V22XwqNsGu2fI5Ovj0QAim6yqVL51XbqHkM-czTnj9dfl3Pp-LphQc865aKsP5FBIJWayYt3eu_6AHOe8ZIxxVZWh-kgOZNd2qq7FIfl1-4B0ROuBugSjD_fUwgQZp1N6EWDY_NmOAj5nGiAlmPwaM_WBXuNvbyKFYOkiDnHsPXwi-w6GjMcv9Yjcff92u_g5u7r5cbm4uJoZKatpZm3HnGPM9bxTnZOibSsnECrRgOyVZSAE6zvVOFMxAbY2HBkKVhuJsu0reUTOd76Pq76cbjBMCQb9mPwIaaMjeP3vS_AP-j6uNedN26imLQ4nLw4pPq0wT3r02eAwQMC4ylryugTa1YwXKd9JTYo5J3Rv_3CmtyD0UhcQegtC70CUnS_vD3zbeI29CM52AiwxrT0mnY3HYAqHhGbSNvr_2P8FxreY5g</recordid><startdate>20250201</startdate><enddate>20250201</enddate><creator>Cuadrado, Juan</creator><creator>Martinez, Elizabeth</creator><creator>Martinez-Santos, Juan Carlos</creator><creator>Puertas, Edwin</creator><general>Elsevier 
Inc</general><general>Elsevier</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-8226-1372</orcidid><orcidid>https://orcid.org/0000-0002-0758-1851</orcidid></search><sort><creationdate>20250201</creationdate><title>The media framing dataset: Analyzing news narratives in Mexico and Colombia</title><author>Cuadrado, Juan ; Martinez, Elizabeth ; Martinez-Santos, Juan Carlos ; Puertas, Edwin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-dd90ff00fb1959f32884f2ea427a3b5d0a220b957fc402ad6c1e0e206c3e38b43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computational linguistics</topic><topic>Content annotation</topic><topic>Cross-cultural studies</topic><topic>Data</topic><topic>Media analysis</topic><topic>News content</topic><topic>NLP resources</topic><topic>Sentiment analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cuadrado, Juan</creatorcontrib><creatorcontrib>Martinez, Elizabeth</creatorcontrib><creatorcontrib>Martinez-Santos, Juan Carlos</creatorcontrib><creatorcontrib>Puertas, Edwin</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Data in brief</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cuadrado, Juan</au><au>Martinez, Elizabeth</au><au>Martinez-Santos, Juan Carlos</au><au>Puertas, Edwin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The media framing dataset: Analyzing news narratives 
in Mexico and Colombia</atitle><jtitle>Data in brief</jtitle><addtitle>Data Brief</addtitle><date>2025-02-01</date><risdate>2025</risdate><volume>58</volume><spage>111284</spage><pages>111284-</pages><artnum>111284</artnum><issn>2352-3409</issn><eissn>2352-3409</eissn><abstract>This paper introduces “The Media Framing Dataset,” a dataset developed through an in-depth examination of news articles from 140 local newspapers in Mexico and Colombia, covering events from May 2022 to August 2023. Our dataset captures a broad spectrum of topics, including politics, immigration, public opinion, and crime. The data collection involved a meticulous keyword-based search strategy designed to identify articles that illustrate various news-framing dimensions, such as Economics, Policy, Morality, and more.
To construct this dataset, we employed a combination of manual and automated annotation techniques. Articles were categorized based on specific framing dimensions using a structured framework, developed in collaboration with experts in computational linguistics. The annotation process, conducted by trained annotators from Mexicoʼs Delfin program, guarantees both precision and depth.
“The Media Framing Dataset” serves as a valuable resource for NLP research with high potential for reuse. It is particularly suitable for analyzing cultural and linguistic nuances in media framing, assessing the impact of framing on public perception, and supporting the development of models that automatically detect framing techniques. Additionally, it provides a foundation for linguistic analysis and machine learning projects, enabling researchers and practitioners to explore media framing dynamics and develop innovative tools for media analysis.</abstract><cop>Netherlands</cop><pub>Elsevier Inc</pub><pmid>39895662</pmid><doi>10.1016/j.dib.2025.111284</doi><orcidid>https://orcid.org/0000-0002-8226-1372</orcidid><orcidid>https://orcid.org/0000-0002-0758-1851</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2352-3409 |
ispartof | Data in brief, 2025-02, Vol.58, p.111284, Article 111284 |
issn | 2352-3409 2352-3409 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11787578 |
source | DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection |
subjects | Computational linguistics; Content annotation; Cross-cultural studies; Data; Media analysis; News content; NLP resources; Sentiment analysis |
title | The media framing dataset: Analyzing news narratives in Mexico and Colombia |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T06%3A00%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20media%20framing%20dataset:%20Analyzing%20news%20narratives%20in%20Mexico%20and%20Colombia&rft.jtitle=Data%20in%20brief&rft.au=Cuadrado,%20Juan&rft.date=2025-02-01&rft.volume=58&rft.spage=111284&rft.pages=111284-&rft.artnum=111284&rft.issn=2352-3409&rft.eissn=2352-3409&rft_id=info:doi/10.1016/j.dib.2025.111284&rft_dat=%3Cproquest_pubme%3E3162849601%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3162849601&rft_id=info:pmid/39895662&rft_els_id=S2352340925000162&rfr_iscdi=true |
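
The `url` field above is an OpenURL whose query string carries the citation as key/value pairs (`rft.jtitle`, `rft.volume`, repeated `rft_id` entries, and so on). As a minimal sketch, the citation fields can be read back out with Python's standard `urllib.parse`. The shortened `OPENURL` constant and the helper names `parse_openurl` and `identifiers` are illustrative assumptions, not part of the catalog record or of any resolver API.

```python
from urllib.parse import urlsplit, parse_qs

# Shortened copy of the record's OpenURL (assumption: only a few keys kept
# for readability; the values themselves are taken from the record).
OPENURL = (
    "https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004"
    "&rft.genre=article"
    "&rft.jtitle=Data%20in%20brief"
    "&rft.au=Cuadrado,%20Juan"
    "&rft.date=2025-02-01"
    "&rft.volume=58"
    "&rft.spage=111284"
    "&rft.issn=2352-3409"
    "&rft_id=info:doi/10.1016/j.dib.2025.111284"
    "&rft_id=info:oai/"
    "&rft_id=info:pmid/39895662"
)


def parse_openurl(url):
    """Return {key: [values]} for the query string of an OpenURL.

    Values are percent-decoded; repeated keys such as rft_id keep
    every occurrence in the list.
    """
    return parse_qs(urlsplit(url).query)


def identifiers(fields):
    """Map scheme -> value for each 'info:<scheme>/<value>' rft_id entry.

    Entries with an empty value (e.g. 'info:oai/') are skipped.
    """
    ids = {}
    for rft_id in fields.get("rft_id", []):
        if rft_id.startswith("info:") and "/" in rft_id:
            scheme, value = rft_id[len("info:"):].split("/", 1)
            if value:
                ids[scheme] = value
    return ids


fields = parse_openurl(OPENURL)
ids = identifiers(fields)
print(fields["rft.jtitle"][0], fields["rft.volume"][0], ids)
```

Splitting each identifier on the first `/` only keeps the DOI intact, since the DOI itself contains a slash.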