DECOVID: A UK Two-Center Harmonized Database of Acute Care Electronic Health Records for COVID-19 Research
Consortium Decovid,
Louis J. M. Aslett,
Andreea Avramescu,
Nicholas Bakewell,
Isabel Birds,
Louise Bowler,
Michael P. J. Camilleri,
Sheng-Chia Chung,
David A. Clifton,
Samuel N. Cohen,
Nathan Constantine-Cooke,
Eric G. Daub,
Shaun Davidson,
Spiros Denaxas,
Karla Diaz-Ordaz,
Richard Feltbower,
Suzy Gallier,
Stephen Gardiner,
Francesca Gasperoni,
Robert J. B. Goudie,
Rebecca E. Green,
Marlous Hall,
Chris Holmes,
John R. Hurst,
Mark M. Iles,
Joao Jorge,
Emma Karoune,
Ruth Keogh,
Ruairidh King,
Ruth King,
Paul D. W. Kirk,
Roman Klapaukh,
Samaneh Kouchaki,
Alvina G. Lai,
Nathan Lea,
Clemence Leyrat,
Kezhi Li,
Watjana Lilaonitkul,
Huiqi Y. Lu,
Terry Lyons,
Ann Marie Mallon,
Andrew Manderson,
Nicolò Margaritella,
Joshua Matteson,
Sam Morley,
Hannah Nicholls,
Martin O’Reilly,
Christina Pagel,
Edward Palmer,
Jack Roberts,
Timothy J. Roberts,
David S. Robertson,
James Robinson,
Patrick Rockenschaub,
Roy Ruddle,
Elizabeth Sapey,
Luis Santos,
Andrew A. S. Soltan,
Fang Gao Smith,
Colin Starr,
Oliver Strickson,
Li Su,
Mia S. Tackney,
Johan H. Thygesen,
Ana Torralbo,
Alice Turner,
Catalina A. Vallejos,
Chenyang Wang,
Kirstie Whitaker,
Tony Whitehouse,
David R. Westhead,
Wai Keong Wong,
Yue Wu,
Lingyi Yang and
Xiaoxu Zou
Additional contact information
Louis J. M. Aslett: Department of Mathematical Sciences, Durham University, Durham DH1 3LE, UK
Andreea Avramescu: The Alan Turing Institute, London NW1 2DB, UK
Nicholas Bakewell: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Isabel Birds: School of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, UK
Louise Bowler: The Alan Turing Institute, London NW1 2DB, UK
Michael P. J. Camilleri: School of Informatics, University of Edinburgh, Edinburgh EH4 2XU, UK
Sheng-Chia Chung: Institute of Cardiovascular Science, University College London, London NW1 2DA, UK
David A. Clifton: Department of Engineering Science, University of Oxford, Oxford OX3 7DQ, UK
Samuel N. Cohen: Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK
Nathan Constantine-Cooke: Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
Eric G. Daub: The Alan Turing Institute, London NW1 2DB, UK
Shaun Davidson: Institute of Biomedical Engineering, University of Oxford, Oxford OX3 7DQ, UK
Spiros Denaxas: Institute of Health Informatics, University College London, London NW1 2DA, UK
Karla Diaz-Ordaz: Department of Statistical Science, University College London, London WC1E 6BT, UK
Richard Feltbower: Child Health Outcomes Research at Leeds (CHORAL), School of Medicine, University of Leeds, Leeds LS2 9LU, UK
Suzy Gallier: University Hospitals Birmingham NHS Foundation Trust, Birmingham B15 2GW, UK
Stephen Gardiner: Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7LF, UK
Francesca Gasperoni: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Robert J. B. Goudie: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Rebecca E. Green: The Alan Turing Institute, London NW1 2DB, UK
Marlous Hall: Leeds Institute for Data Analytics, University of Leeds, Leeds LS2 9NL, UK
Chris Holmes: Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
John R. Hurst: UCL Respiratory, University College London, London WC1E 6JF, UK
Mark M. Iles: Leeds Institute for Data Analytics, University of Leeds, Leeds LS2 9NL, UK
Joao Jorge: Institute of Biomedical Engineering, University of Oxford, Oxford OX3 7DQ, UK
Emma Karoune: The Alan Turing Institute, London NW1 2DB, UK
Ruth Keogh: Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
Ruairidh King: The Alan Turing Institute, London NW1 2DB, UK
Ruth King: School of Mathematics and Maxwell Institute for Mathematical Sciences, University of Edinburgh, Edinburgh EH9 3FD, UK
Paul D. W. Kirk: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Roman Klapaukh: Research Software Development Group, University College London, London WC1E 6BT, UK
Samaneh Kouchaki: Institute of Biomedical Engineering, University of Oxford, Oxford OX3 7DQ, UK
Alvina G. Lai: Institute of Health Informatics, University College London, London NW1 2DA, UK
Nathan Lea: Institute of Health Informatics, University College London, London NW1 2DA, UK
Clemence Leyrat: Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
Kezhi Li: Institute of Health Informatics, University College London, London NW1 2DA, UK
Watjana Lilaonitkul: Global Business School for Health, University College London, London E20 2AE, UK
Huiqi Y. Lu: Department of Engineering Science, University of Oxford, Oxford OX3 7DQ, UK
Terry Lyons: Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK
Ann Marie Mallon: The Alan Turing Institute, London NW1 2DB, UK
Andrew Manderson: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Nicolò Margaritella: School of Mathematics and Statistics, University of St Andrews, St Andrews KY16 9SS, UK
Joshua Matteson: Institute of Health Informatics, University College London, London NW1 2DA, UK
Sam Morley: Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK
Hannah Nicholls: The Alan Turing Institute, London NW1 2DB, UK
Martin O’Reilly: The Alan Turing Institute, London NW1 2DB, UK
Christina Pagel: Clinical Operational Research Unit, University College London, London WC1H 0BT, UK
Edward Palmer: University College London Hospital, London NW1 2BU, UK
Jack Roberts: The Alan Turing Institute, London NW1 2DB, UK
Timothy J. Roberts: Institute of Health Informatics, University College London, London NW1 2DA, UK
David S. Robertson: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
James Robinson: The Alan Turing Institute, London NW1 2DB, UK
Patrick Rockenschaub: Institute of Health Informatics, University College London, London NW1 2DA, UK
Roy Ruddle: Leeds Institute for Data Analytics, University of Leeds, Leeds LS2 9NL, UK
Elizabeth Sapey: University Hospitals Birmingham NHS Foundation Trust, Birmingham B15 2GW, UK
Luis Santos: The Alan Turing Institute, London NW1 2DB, UK
Andrew A. S. Soltan: Department of Oncology, University of Oxford, Oxford OX3 7LE, UK
Fang Gao Smith: University Hospitals Birmingham NHS Foundation Trust, Birmingham B15 2GW, UK
Colin Starr: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Oliver Strickson: The Alan Turing Institute, London NW1 2DB, UK
Li Su: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Mia S. Tackney: MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK
Johan H. Thygesen: Institute of Health Informatics, University College London, London NW1 2DA, UK
Ana Torralbo: Institute of Health Informatics, University College London, London NW1 2DA, UK
Alice Turner: University Hospitals Birmingham NHS Foundation Trust, Birmingham B15 2GW, UK
Catalina A. Vallejos: Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
Chenyang Wang: Department of Engineering Science, University of Oxford, Oxford OX3 7DQ, UK
Kirstie Whitaker: The Alan Turing Institute, London NW1 2DB, UK
Tony Whitehouse: University Hospitals Birmingham NHS Foundation Trust, Birmingham B15 2GW, UK
David R. Westhead: School of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, UK
Wai Keong Wong: Cambridge University Hospitals, Cambridge CB2 0QQ, UK
Yue Wu: Department of Mathematics and Statistics, University of Strathclyde, Glasgow G1 1XH, UK
Lingyi Yang: Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK
Xiaoxu Zou: University Hospitals Birmingham NHS Foundation Trust, Birmingham B15 2GW, UK
Data, 2025, vol. 10, issue 12, 1-27
Abstract:
The DECOVID database contains harmonized pseudonymized electronic health record (EHR) data on all adult (≥18 years old) patients presenting to two large, digitally mature centers in the United Kingdom between 1 January 2020 and 28 February 2021, with follow-up until at least 28 March 2021. The database was originally developed to support the COVID-19 response but is now available via the PIONEER data hub for researchers to explore a wide range of research questions, including exploratory analyses, risk factor assessment, prediction modeling, and comparative effectiveness studies. Raw data were extracted from local EHRs and transformed into a standardized form (Observational Health Data Sciences and Informatics-Common Data Model version 5.3.1). The database includes 165,420 patients across 256,804 hospital presentations. For these patients, highly granular data are available, including patient demographics, longitudinal vital signs, physiology, treatments, laboratory findings, clinical diagnoses, and outcomes. There are 10,030 patients with COVID-19, of whom 1472 died in hospital.
Keywords: hospital data; electronic health record; COVID-19
JEL-codes: C8 C80 C81 C82 C83
Date: 2025
Downloads:
https://www.mdpi.com/2306-5729/10/12/195/pdf (application/pdf)
https://www.mdpi.com/2306-5729/10/12/195/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:10:y:2025:i:12:p:195-:d:1801747
Data is currently edited by Ms. Becky Zhang
Bibliographic data for series maintained by MDPI Indexing Manager.