A Dataset on Linguistic Connectivity Across and Within Countries
Tamara Gurevich,
Peter Herman,
Farid Toubal and
Yoto Yotov
Post-Print from HAL
Abstract:
We construct a new global dataset on common language. The data cover 242 countries and territories and are based on information about the speakers of 6,675 languages. Using data from Ethnologue, we provide 11 bilateral measures reflecting different dimensions of linguistic connections within and between countries, including common official languages, common native and acquired languages, and linguistic proximity across different languages. A key novelty of the dataset is that it includes consistently defined information on linguistic relationships not only between different countries but within the administrative borders of countries as well.
Keywords: Society; Economics; Communication (search for similar items in EconPapers)
Date: 2025-03-31
References: Add references at CitEc
Citations:
Published in Scientific Data , 2025, 12 (1), pp.542. ⟨10.1038/s41597-025-04692-8⟩
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-05492389
DOI: 10.1038/s41597-025-04692-8
Access Statistics for this paper
More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().