TubeStats and TokStats: Research Tools for Random Samples of YouTube and TikTok
Kevin Zheng,
Reagan Keeney,
Ryan McGrady,
Vikramaditya Jaisingh and
Ethan Zuckerman
Additional contact information
Kevin Zheng: School of Information, University of Michigan, USA
Reagan Keeney: Manning College of Information and Computer Sciences, University of Massachusetts Amherst, USA
Ryan McGrady: Manning College of Information and Computer Sciences, University of Massachusetts Amherst, USA
Vikramaditya Jaisingh: Manning College of Information and Computer Sciences, University of Massachusetts Amherst, USA
Ethan Zuckerman: Manning College of Information and Computer Sciences, University of Massachusetts Amherst, USA
Media and Communication, 2026, vol. 14
Abstract:
YouTube and TikTok are two of the most popular digital communications platforms in the world, playing a disproportionately large role in global communications infrastructure in general and the consumption and dissemination of information in particular. As neither platform provides adequate mechanisms to produce representative samples of the content they host, researchers largely depend on opportunistic samples of popular, recommended, or otherwise known content. In this article, we present two dashboard-based tools, TubeStats and TokStats, built upon our recent research into random sampling techniques for each platform. These tools provide platform-wide statistics such as the number of hosted videos, view count distributions, linguistic distributions, and growth over time, which researchers can use to quantify and contextualize their research. We explain the architecture and sampling pipeline of each tool as well as the unique technical and methodological affordances and constraints involved with each. We document how these related techniques and tools have been applied by our lab, other scholars, and journalists to contextualize non-representative samples, compare platform use across languages and regions, and examine quotidian uses of the platforms that attention-optimized samples may obscure, as well as the broader range of methodological possibilities that representative sampling opens for platform research. Not to be taken for granted, we also explain the many challenges we face in developing and maintaining such tools, with implications for the practical development of open research infrastructures.
Keywords: open research infrastructure; platform studies; random sampling; social science tools; TikTok; YouTube (search for similar items in EconPapers)
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.cogitatiopress.com/mediaandcommunication/article/view/12085 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cog:meanco:v14:y:2026:a:12085
DOI: 10.17645/mac.12085
Access Statistics for this article
Media and Communication is currently edited by Raquel Silva
More articles in Media and Communication from Cogitatio Press
Bibliographic data for series maintained by António Vieira () and IT Department ().