Phenotyping - British Heart Foundation

Phenotyping algorithms

All phenotyping algorithms created or used in research we support are shared via the HDR UK Phenotype Library and on each project's GitHub repository, in accordance with our Publication and Dissemination Policy and the CVD-COVID-UK Ways of Working.

You can access all BHF Data Science Centre phenotypes in the Phenotype Library here.

We encourage all developers and users of phenotype definitions to follow the FAIR (findable, accessible, interoperable and reusable) Guiding Principles by: 

- Sharing their phenotype definitions publicly and freely, ideally via an open access repository (e.g., the HDR UK Phenotype Library)
- Annotating phenotype definitions with rich data and metadata to support reuse (see our recommended list here)
- Citing all phenotype definitions in publications using the relevant accession identifier (ID) or digital object identifier (DOI) (e.g., listing the phenotype name, repository and accession ID/DOI within a table or as supplementary information)
- Where it is not possible to submit phenotype definitions to a repository (e.g., because the coding terminology is not supported), sharing the full code list and rich data/metadata as detailed above in a publicly accessible location such as GitHub (e.g., using the YAML and CSV file formats of the Phenotype Library)

The recommendations above are a requirement for all research supported by the BHF Data Science Centre (see our Publication and Dissemination Policy).

You can read our full recommendations here.

Submitting to the Phenotype Library

The Phenotype Library is an openly accessible, searchable repository of electronic health record phenotyping algorithms. It contains structured data and metadata describing each phenotyping algorithm so that researchers can identify, interpret and reuse algorithms.

Instructions and materials to support submission to the Phenotype Library are available here.

Additional support

Documentation on the Phenotype Library can be found both here and here.

Any questions or problems that occur during the submission process should be directed to the Phenotype Library, via their contact page.

Sharing via GitHub

We recommend that researchers include information describing all phenotyping algorithms used in their research in any GitHub or similar code repository for their project. At a minimum, this should include:

A file citing the accession IDs (including version) or DOIs of all phenotyping algorithms that are already included in the Phenotype Library or another repository.
For any phenotyping algorithms not included in a repository, this should include:
- A file containing the list of medical ontology terms/codes (code list) for each phenotyping algorithm.
- Metadata describing each phenotyping algorithm.

At a minimum, the metadata should include the information represented in the Phenotype Library, as set out in the submission materials referenced above.
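As an illustration only, the minimum repository contents listed above could be generated with a short Python script along the following lines. This is a sketch under stated assumptions, not the Phenotype Library's submission template: the folder name, file names, column names, metadata fields and all example values are hypothetical placeholders, and the fields actually required should be taken from the Phenotype Library submission materials.

```python
import csv
import json
import tempfile
from pathlib import Path

# All names and values below are hypothetical placeholders, not real
# accession IDs, clinical codes, or Phenotype Library field names.
CITED = [
    {"name": "Example cited phenotype", "repository": "HDR UK Phenotype Library",
     "accession_id": "EXAMPLE-ID", "version": "EXAMPLE-VERSION"},
]
CODE_LISTS = {
    "example_phenotype": [
        {"code": "EXAMPLE-CODE-1", "description": "First example term",
         "coding_system": "EXAMPLE-TERMINOLOGY"},
        {"code": "EXAMPLE-CODE-2", "description": "Second example term",
         "coding_system": "EXAMPLE-TERMINOLOGY"},
    ],
}
METADATA = {
    "example_phenotype": {
        "name": "Example phenotype",
        "author": "Example author",
        "data_sources": "Example data source",
    },
}


def write_csv(path, fieldnames, rows):
    """Write a list of dicts to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)


def write_flat_yaml(path, metadata):
    """Write a flat mapping as YAML, one 'key: "value"' pair per line.

    json.dumps yields double-quoted strings, which are also valid YAML
    scalars, so no third-party YAML package is needed for this sketch.
    """
    lines = [f"{key}: {json.dumps(str(value))}" for key, value in metadata.items()]
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf-8")


def share_phenotypes(repo_dir, cited, code_lists, metadata):
    """Create the phenotype files in repo_dir/phenotypes and return their names."""
    out = Path(repo_dir) / "phenotypes"
    out.mkdir(parents=True, exist_ok=True)
    # 1. Citations for algorithms already held in a repository.
    write_csv(out / "cited_phenotypes.csv",
              ["name", "repository", "accession_id", "version"], cited)
    # 2. Code list and metadata for each algorithm not held in a repository.
    for name, rows in code_lists.items():
        write_csv(out / f"{name}_codelist.csv",
                  ["code", "description", "coding_system"], rows)
        write_flat_yaml(out / f"{name}_metadata.yaml", metadata[name])
    return sorted(p.name for p in out.iterdir())


if __name__ == "__main__":
    # Write into a temporary folder; in practice this would be the project repository.
    with tempfile.TemporaryDirectory() as tmp:
        for file_name in share_phenotypes(tmp, CITED, CODE_LISTS, METADATA):
            print(file_name)
```

Keeping the cited phenotypes, the code lists and the metadata in separate plain-text files means each can be reviewed and versioned alongside the analysis code.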