EconPapers    
Economics at your fingertips  
 

Web Scraping Chilean News Media: A Dataset for Analyzing Social Unrest Coverage (2019–2023)

Ignacio Molina, José Morales and Brian Keith ()
Additional contact information
Ignacio Molina: Department of Systems and Computing Engineering, Universidad Católica del Norte, Antofagasta 1270398, Chile
José Morales: School of Journalism, Universidad Católica del Norte, Antofagasta 1270398, Chile
Brian Keith: Department of Systems and Computing Engineering, Universidad Católica del Norte, Antofagasta 1270398, Chile

Data, 2025, vol. 10, issue 11, 1-19

Abstract: This paper presents a dataset of Chilean news media coverage during the social unrest and constitutional processes from 2019 to 2023. Using Python-based web scraping with BeautifulSoup and Selenium, we collected articles from 15 Chilean news outlets between 15 November 2019 and 17 December 2023. The initial collection of 1254 articles was filtered to 931 usable data points after removing non-relevant content, duplicates, and articles unrelated to the Chilean social outburst. Each news outlet required specific extraction approaches due to varying HTML structures, with some outlets inaccessible due to paywalls or anti-scraping mechanisms. The dataset is structured in JSON format with standardized fields including title, content, date, author, and source metadata. This resource supports research on media coverage during political events and provides data for Spanish-language processing tasks. The dataset and extraction code are publicly available on GitHub.

Keywords: web scraping; Chilean social outburst; news media dataset; data collection; estallido social (search for similar items in EconPapers)
JEL-codes: C8 C80 C81 C82 C83 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2306-5729/10/11/174/pdf (application/pdf)
https://www.mdpi.com/2306-5729/10/11/174/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:10:y:2025:i:11:p:174-:d:1784891

Access Statistics for this article

Data is currently edited by Ms. Becky Zhang

More articles in Data from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-11-01
Handle: RePEc:gam:jdataj:v:10:y:2025:i:11:p:174-:d:1784891