Human Biographical Record (HBR)
Arash Nekoei and Fabian Sinn
No 15825, CEPR Discussion Papers from C.E.P.R. Discussion Papers
Abstract:
We construct a new dataset of more than seven million notable individuals across recorded human history, the Human Biographical Record (HBR). With Wikidata as the backbone, HBR adds further information from various digital sources, including Wikipedia in all 292 languages. Machine learning and text analysis combine the sources and extract information on date and place of birth and death, gender, occupation, education, and family background. This paper discusses HBR's construction, completeness, coverage, and accuracy, as well as its strengths and weaknesses relative to prior datasets. HBR is the first part of a larger project, the human record project, which we briefly introduce.
Keywords: Big data; Machine learning; Economic history
Date: 2021-02
New Economics Papers: this item is included in nep-big and nep-cmp
Downloads: (external link)
https://cepr.org/publications/DP15825 (application/pdf)
CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
Persistent link: https://EconPapers.repec.org/RePEc:cpr:ceprdp:15825
Ordering information: This working paper can be ordered from
https://cepr.org/publications/DP15825
More papers in CEPR Discussion Papers from C.E.P.R. Discussion Papers Centre for Economic Policy Research, 33 Great Sutton Street, London EC1V 0DX.