Human Biographical Record (HBR)
Arash Nekoei and
Fabian Sinn
No 15825, CEPR Discussion Papers from Centre for Economic Policy Research
Abstract:
We construct a new dataset of more than seven million notable individuals across recorded human history, the Human Biographical Record (HBR). With Wikidata as the backbone, HBR adds further information from various digital sources, including Wikipedia in all 292 languages. Machine learning and text analysis combine the sources and extract information on date and place of birth and death, gender, occupation, education, and family background. This paper discusses HBR's construction and its completeness, coverage, accuracy, and also its strength and weakness relative to prior datasets. HBR is the first part of a larger project, the human record project that we briefly introduce.
Keywords: Bid data; Machine learning; Economic history (search for similar items in EconPapers)
Date: 2021-02
New Economics Papers: this item is included in nep-big and nep-cmp
References: Add references at CitEc
Citations:
Downloads: (external link)
https://cepr.org/publications/DP15825 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cpr:ceprdp:15825
Ordering information: This working paper can be ordered from
https://cepr.org/publications/DP15825
Access Statistics for this paper
More papers in CEPR Discussion Papers from Centre for Economic Policy Research 33 Great Sutton Street, London EC1V 0DX, UK.
Bibliographic data for series maintained by CEPR ().