Citation: Clinical Practice Research Datalink. (2021). CPRD COVID-19 symptoms and risk factors synthetic dataset April 2021 (Version 2021.04.001) [Data set]. Clinical Practice Research Datalink. https://doi.org/10.48329/fbjh-es87
This synthetic dataset is based on anonymised real primary care patient data extracted from the CPRD Aurum database. The dataset focuses on patients presenting to primary care with symptoms indicative of COVID-19 (confirmed/suspected COVID-19) and control patients with negative COVID-19 test results. The dataset includes data on sociodemographic and clinical risk factors.
The development of this dataset was funded by NHSX using the synthetic data generation and evaluation framework developed under a grant from the Regulators’ Pioneer Fund launched by The Department for Business, Energy and Industrial Strategy (BEIS) and managed by Innovate UK.
The dataset includes 779,546 patients.
Further information is available on the Synthetic data web page.