EconPapers    
Economics at your fingertips  
 

We Just Ran Twenty-Three Million Queries of the World Bank's Website - Working Paper 362

Sarah Dykstra
Authors registered in the RePEc Author Service: Justin Sandefur

No 362, Working Papers from Center for Global Development

Abstract: Much of the data underlying global poverty and inequality estimates is not in the public domain, but can be accessed in small pieces using the World Bank’s PovcalNet online tool. To overcome these limitations and reproduce this database in a format more useful to researchers, we ran approximately 23 million queries of the World Bank’s web site, accessing only information that was already in the public domain. This web scraping exercise produced 10,000 points on the cumulative distribution of income or consumption from each of 942 surveys spanning 127 countries over the period 1977 to 2012. This short note describes our methodology, briefly discusses some of the relevant intellectual property issues, and illustrates the kind of calculations that are facilitated by this data set, including growth incidence curves and poverty rates using alternative PPP indices. The full data can be downloaded at www.cgdev.org/povcalnet.

Keywords: poverty; inequality; consumption; income distribution; open data (search for similar items in EconPapers)
JEL-codes: D31 I32 O57 (search for similar items in EconPapers)
Pages: 20 pages
Date: 2014-04
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (11)

Downloads: (external link)
http://www.cgdev.org/sites/default/files/dykstra-s ... net-world-bank_1.pdf

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:cgd:wpaper:362

Access Statistics for this paper

More papers in Working Papers from Center for Global Development Contact information at EDIRC.
Bibliographic data for series maintained by Publications Manager ().

 
Page updated 2025-06-17
Handle: RePEc:cgd:wpaper:362