CCU037: Improving methods to minimise bias in ethnicity data for more representative and generalisable models, using CVD in COVID-19 as an example

Project lead:
Sara Khalid, University of Oxford

This research project is awarded through a funding call by Health Data Research UK and the Alan Turing Institute as part of the wider Data and Connectivity National Core Study.

Further details on this project are available here.


Digital ethnicity data in population-wide electronic health records in England: a description of completeness, coverage, and granularity of diversity

  • Paper submitted to a journal (decision pending)
  • medRxiv preprint 11/11/22 can be viewed here
  • Code and phenotypes used in this study are available in GitHub here