corona100d: German-language Twitter dataset of the first 100 days after Chancellor Merkel addressed the coronavirus outbreak on TV

Rieger, Jonas; von Nordheim, Gerret

corona100d: German-language Twitter dataset of the first 100 days after Chancellor Merkel addressed the coronavirus outbreak on TV

Jonas Rieger and Gerret von Nordheim

No 4, DoCMA Working Papers from TU Dortmund University, Dortmund Center for Data-based Media Analysis (DoCMA)

Abstract: In this paper, we present a German-language Twitter dataset related to the Covid-19 pandemic. We show how the R (R Core Team 2020) package rtweet (Kearney 2019) and a combination of keywords can be used to create the dataset and provide a way to rehydrate most of the tweets. The dataset consists of 3 699 623 tweets from 2020/03/19 to 2020/06/26 and was constructed from hourly API requests of 50 000 tweets. In a brief analysis, we give first insights into the dataset and provide approaches that can be refined in further research.

Keywords: Covid-19; SARS-CoV-2; scraper; data; text; developer; dev; Twitter (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.econstor.eu/bitstream/10419/231349/1/1748390163.pdf (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:zbw:docmaw:4

DOI: 10.17877/DE290R-21911

Access Statistics for this paper

More papers in DoCMA Working Papers from TU Dortmund University, Dortmund Center for Data-based Media Analysis (DoCMA)
Bibliographic data for series maintained by ZBW - Leibniz Information Centre for Economics ().