Key Datasets

This selection of datasets has been identified and reviewed as helpful for use by researchers in cardiovascular research and clinical trials. The datasets below are obtained from secondary (hospital) care, and we plan to add further information to include primary care data at a later date.

Please note: these datasets are not identical to the data accessible via the CVD-COVID-UK/COVID-IMPACT consortium. Please visit our CVD-COVID-UK/COVID-IMPACT pages to find out more about the consortium.

Data for England

NHS England – HES Admitted Patient Care (HES APC)

Key Area: Admitted Care

Dataset Description: The HES APC data set includes details of episodes of care where the patient is admitted into hospital, which includes regular day or night attending patients.

Completion Requirements: Mandatory completion by NHS Trusts

Data Standards: Diagnoses and operations coded to ICD-10 and OPCS-4 respectively (see Information Standards)

Data Dictionary: Available here

Data Quality Information: Included within the Data Quality Maturity Index (DQMI), a data quality report produced monthly by NHS England.

Each healthcare provider has a score for the APC data based on % of valid and complete data. Linkage can be completed with the Master Person Service (MPS), automatically matching 99% of patients.
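As a rough illustration of what a score "based on % of valid and complete data" could look like, the sketch below counts the share of field values that are both present and pass a validity rule. It is not the DQMI methodology, which is not described here; the function name, field names and validity rules are all hypothetical.

```python
import re
from typing import Callable, Dict, Iterable, Optional

Record = Dict[str, Optional[str]]


def percent_valid_and_complete(
    records: Iterable[Record],
    validators: Dict[str, Callable[[str], bool]],
) -> float:
    """Percentage of checked field values that are present and valid."""
    checked = 0
    passed = 0
    for record in records:
        for field, is_valid in validators.items():
            checked += 1
            value = record.get(field)
            # A value counts only if it is complete (non-empty) and valid.
            if value not in (None, "") and is_valid(value):
                passed += 1
    return 100.0 * passed / checked if checked else 0.0


# Hypothetical field names and validity rules, for illustration only.
EXAMPLE_VALIDATORS = {
    "sex": lambda v: v in {"1", "2"},
    "primary_diagnosis": lambda v: re.match(r"^[A-Z]\d{2}", v) is not None,
}

# Made-up records: one clean, one with a missing value, one with invalid values.
EXAMPLE_RECORDS = [
    {"sex": "1", "primary_diagnosis": "I21"},
    {"sex": "", "primary_diagnosis": "I50"},
    {"sex": "7", "primary_diagnosis": "xx"},
]

score = percent_valid_and_complete(EXAMPLE_RECORDS, EXAMPLE_VALIDATORS)
```

Here three of the six checked values are complete and valid, so the illustrative score is 50.0.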

Data Provided by: NHS England

Apply for Data: Data can be applied for via NHS England – Data Access Request Service (DARS) or NHS DigiTrials

This dataset is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

NHS England – Emergency Care Dataset (ECDS)

Key Area: Emergency care

Dataset Description: This dataset replaces the HES A&E dataset (A&E data continued to be mapped to HES A&E dataset until March 2020). Data includes details of all A&E attendances at NHS hospitals in England, including minor injury units and NHS walk-in centres, as well as 24 hour and consultant-led emergency departments.

Completion Requirements: Mandatory completion by NHS Trusts

Data Standards: Includes diagnosis mapped to SNOMED, arrival date/time and initial chief complaint (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Included within the Data Quality Maturity Index (DQMI), a data quality report produced monthly by NHS England. Each healthcare provider has a score for the ECDS data based on % of valid and complete data.

Linkage can be completed with the Master Person Service (MPS), automatically matching 99% of patients.

Data Provided by: NHS England

Apply for Data: Data Access Guidance

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

NHS England – HES Outpatient

Key Area: Outpatient Care

Dataset Description: HES Outpatient data are a collection of individual records of outpatient appointments occurring in England only. They include information on the type of outpatient consultation, appointment dates, the main specialty and treatment specialty under which the patient was treated, referral source and waiting times.

Completion Requirements: Mandatory completion by NHS Trusts

Data Standards: Includes diagnosis mapped to ICD-10, and the number and type of outpatient procedures mapped to OPCS-4 (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Included within the Data Quality Maturity Index (DQMI), a data quality report produced monthly by NHS England. Each healthcare provider has a score for the Outpatient data based on % of valid and complete data.

Linkage can be completed with the Master Person Service (MPS), automatically matching 99% of patients.

Data Provided by: NHS England

Apply for Data: Extract data can be applied for via NHS England – Data Access Request Service (DARS) or NHS DigiTrials

This dataset is also available within NHS England’s Secure Data Environment, accessed via the CVD-COVID-UK Consortium looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

NHS England – Diagnostic Imaging Dataset

Key Area: Imaging

Dataset Description: The Diagnostic Imaging Dataset (DID) is a central collection of detailed information about diagnostic imaging tests carried out on NHS patients, extracted from local Radiology Information Systems (RISs) and submitted monthly. The dataset is collected at patient level and includes patient identifiers to enable linkage to other datasets, most notably cancer registration data.

Completion Requirements: Mandatory completion for NHS Imaging Services (note this may exclude some partner services, obstetric ultrasound and screening exams)

Data Standards: Imaging codes – SNOMED-CT or the National Interim Clinical Imaging Procedure (NICIP) Code Set (see Information Standards)

Data Dictionary: See DID extract dictionary, under Data collected section. Find out more here

Data Quality Information: NHS England provide a quality report every month. This includes coverage, completeness, NHS Number and DOB quality, and also highlights any issues with provider site codes for uploads.

Data Provided by: NHS England

Apply for Data: Extract data can be applied for via NHS England – Data Access Request Service (DARS)

Data for England and Wales

NHS England – HES Linked to Civil Registration of Deaths (CRD)

Key Areas: Mortality/Cancer registration, patient tracking

Dataset Description: This dataset links HES data to mortality data from the Office for National Statistics. Following a review in 2020, enhancements have been made to death reporting. The available data fields are generated from 12 mortality datasets.
Note: The Civil Registration of Deaths is available as a dataset without HES linkage. HES linked to Civil Registration of Deaths is listed here due to the data integrity work that has been completed on this dataset and the ability to make use of the HES linked Master Person Service.

Completion Requirements: Statutory requirement for death registration

Data Standards: Cause of death coded to ICD-10 code that identifies the medical condition judged to be the underlying cause of death. Underlying cause of death can be defined as: a) the disease or injury which initiated the train of morbid events leading directly to death; or b) the circumstances of the accident or violence which produced the fatal injury. The underlying cause may be a long-standing, chronic disease or disorder that predisposed the patient to later fatal complications.
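For cardiovascular research, a common first step is to flag deaths whose underlying cause falls in ICD-10 Chapter IX, diseases of the circulatory system (codes I00 to I99). The sketch below shows one way to do this. The function name is hypothetical, and it assumes the underlying cause is supplied as a plain ICD-10 string, with or without a dot; check the data dictionary for the actual field name and format.

```python
import re
from typing import Optional

# ICD-10 Chapter IX (diseases of the circulatory system) covers I00 to I99.
_CIRCULATORY = re.compile(r"^I\d{2}")


def is_circulatory_cause(underlying_cause: Optional[str]) -> bool:
    """Return True if an ICD-10 code lies in the range I00 to I99."""
    if not underlying_cause:
        return False
    # Normalise case and strip whitespace and dots, so "i21.9" becomes "I219".
    normalised = underlying_cause.strip().upper().replace(".", "")
    return _CIRCULATORY.match(normalised) is not None
```

For example, `is_circulatory_cause("I21.9")` is true (acute myocardial infarction), while `is_circulatory_cause("C34")` is false (a neoplasm code).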

Data Dictionary: Find out more here

Data Quality Information: Linkage can be completed with the Master Person Service (MPS), automatically matching 99% of patients. NHS trusts and other healthcare organisations submit activity data every month, which is then run through a series of data quality checks and cleaning procedures before it is published.

Data Provided by: Office for National Statistics

Apply for Data: Extract data can be applied for via NHS England – Data Access Request Service (DARS) or NHS DigiTrials

The Civil Registration of Death dataset is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

NHS England and Diabetes UK – National Diabetes Audit (NDA)

Key Area: Diabetes

Dataset Description: One of the largest annual clinical audits in the world. It measures the effectiveness of diabetes healthcare against NICE Clinical Guidelines and NICE Quality Standards in England and Wales, for both primary and secondary care (specialist diabetes services).

For patients with a diagnosis of diabetes and no opt-out, the following data is collected during the audit period: demographics/clinical observations, HbA1c readings, blood pressure, BMI and cholesterol. As of the 2020/21 report, over 6,500 practices were participating.

The NDA is commissioned and managed under contract by the Healthcare Quality Improvement Partnership (HQIP) on behalf of NHS England and the Welsh Government. The NDA is delivered by NHS England in collaboration with Diabetes UK.

Completion Requirements: Mandatory completion (Direction from NHS England under section 254)

Data Standards: Data in SNOMED codes from 2018 onwards (see Information Standards)

Data Dictionary: Find out more here

(2019-20 SNOMED Code Set) (see Information Standards)

Data Quality Information: Once the data is collected from GP system suppliers (GPSS), validation on the file structure and contents is carried out before files are accepted by NHS England’s GP Data Collector system. All data is reviewed by NHS England for errors and queries raised. The NHS England Data Flow Diagram – NDA Core Aug 19 shows the process.

Data Provided by: NHS England and Wales

Apply for Data: Extract data can be applied for via NHS England – Data Access Request Service (DARS) or NHS DigiTrials

National Institute for Cardiovascular Outcomes Research (NICOR) – Multiple Cardiovascular Audits

Key Area: Adult Cardiac Surgery/Angioplasty/Cardiac Rhythm Management/Congenital Heart Disease/Myocardial Ischaemia (Heart Attack)

Dataset Description: NICOR collects data and produces analysis to enable hospitals and healthcare improvement bodies to monitor and improve the quality of care and outcomes of cardiovascular patients. The National Cardiac Audit Programme has six sub audits. An application can be made for data from one or multiple audits.

  • National Adult Cardiac Surgery Audit (NACSA): collects data on all major heart operations carried out in NHS hospitals and a selection of private hospitals throughout the UK.
  • Adult Percutaneous Coronary Interventions (angioplasty audit): collects information about percutaneous coronary interventions (PCI) that have taken place.
  • National Audit of Cardiac Rhythm Management (NACRM): collects information about all implanted cardiac devices and all patients receiving interventional procedures for management of cardiac rhythm disorders in the UK.
  • National Congenital Heart Disease Audit (NCHDA): focuses on monitoring activity levels and outcomes following cardiovascular procedures, with the aim of improving the quality of specialist congenital cardiovascular care. It provides reliable, risk-adjusted and independently validated data, including individual procedural counts, activity levels, access to high-quality fetal cardiology screening for antenatal diagnosis and, most importantly, patient outcomes with respect to mortality and post-procedural complications.
  • National Heart Failure Audit (NHFA): covers all patients with an unscheduled admission to hospital, who are discharged with a primary diagnosis of heart failure.
  • Myocardial Ischaemia National Audit Project (MINAP): a domain within NCAP that contains information about the care provided to patients who are admitted to hospital with acute coronary syndromes (heart attack).

Completion Requirements: Mandatory completion. Audit submissions close annually at the end of March, so the level of completeness during the year can vary depending on how individual trusts choose to submit data.

Data Standards: Coding is specific to these datasets (see Information Standards)

Data Dictionary: Find out more here. The list of fields also shows which fields are mandatory for completion.

Data Quality Information: NICOR complete data linkage with NHS England for all datasets for survival status prior to data being available for research. NICOR data can be linked with HES data from NHS England. Each participating hospital has access to NICOR data tools where they can access data completeness reports and comparisons against national averages.

Data Provided by: NHS England & Wales. Pre-2020/1 data includes NHS Scotland data.

Apply for Data: Extract data can be applied for via NHS England – Data Access Request Service (DARS). However, you should contact NICOR’s data access team to discuss prior to submission. Details of the process can be found on the NICOR ‘For Researchers’ page.

The PCI, MINAP, NHFA, NACSA, NACRM, NCHDA (and TAVI) datasets are also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Data for Scotland

National Health Service Central Register (NHSCR)/NRS Deaths

Key Area: Mortality/cancer registration, patient tracking

Dataset Description: Data includes date and cause of death, cancer registration information, notification of name changes/name of health area in the UK where a patient is registered.

Completion Requirements: Statutory requirement for death registration

Data Standards: Cause of death coded to ICD-10, cancer registration information, notification of name changes/name of health area in the UK where a patient is registered. Cancer data is coded to ICD-10 and ICD-O-3 (see Information Standards)

Data Dictionary: Extract layout is available on request from dg.nhscr-scotland-medicalresearch@nhs.scot 

Data Quality Information: Quality checks are completed on all death data, and all ICD-10 codes are reviewed by an experienced medical coder. Documentation on the checks is available on the NRS website.

Data Provided by: National Records of Scotland

Apply for Data: Apply to NHSCR

The NRS death data is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

General Acute Inpatient and Day Case – Scottish Morbidity Record (SMR01)

Key Area: Admitted Care

Dataset Description: Data includes where the episode took place and the admission type (including patient injury classifications such as self-inflicted or home accident). PHS statistics for angina, CHD, myocardial infarction and heart failure utilise SMR01. Data is also available for stroke; however, please also see the Scottish Stroke Care Audit below.

Completion Requirements: Mandatory completion

Data Standards: Data includes where the episode took place and the admission type (including patient injury classifications such as self-inflicted or home accident), with diagnosis classified as ICD-10 (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Data undergoes validation including checks for missing/invalid data and consistency. Record completeness reports are also available quarterly here.

Data Provided by: NHS Scotland

Apply for Data: Data can be applied for via Public Health Scotland.

The General Acute Inpatient and Day Case dataset is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Scotland Accident & Emergency Datamart (A&E2)

Key Area: Emergency care

Dataset Description: Scottish NHS Boards are required to submit data on all attendances for emergency care, including A&E Department, Emergency Department, Minor Injuries Units and the trolleyed area of an Assessment Unit (which may be further described as Medical, Surgical, Combined, or Paediatric).

Completion Requirements: Mandatory

Data Standards: Data mirroring ECDS for NHS England, with presenting complaint, arrival date & time and diagnosis at discharge coded to ICD-10 (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: In addition to local data quality checks, the data submitted to the A&E DataMart is subject to a validation procedure. As well as ensuring that data for the correct site and time period is submitted, the validation ensures that the data is in the correct format and only contains accepted codes.

Data Provided by: NHS Scotland

Apply for Data: Data can be applied for via Public Health Scotland

The Scotland Accident & Emergency Datamart is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Outpatient Appointments and Attendances – Scottish Morbidity Record (SMR00)

Key Area: Outpatient Care

Dataset Description: The Outpatient Appointments and Attendances (SMR00) dataset collects episode-level data on patients' new and follow-up appointments at outpatient clinics in all specialities (except A&E and Genito-Urinary Medicine).

Completion Requirements: Mandatory

Data Standards: Data includes the speciality, the type of facility (Outpatients, Day Centre etc.), mode, referral source etc. Data coded to ICD-10 (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Data undergoes validation including checks for missing/invalid data and consistency. Record completeness reports are also available quarterly here.

Data Provided by: NHS Scotland

Apply for Data: Data can be applied for via Public Health Scotland.

The Scotland Outpatient Attendance dataset is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Scottish Stroke Care Audit (SSCA)

Key Area: Stroke

Dataset Description: The Scottish Stroke Care Audit (SSCA) includes all hospitals managing acute stroke in Scotland and collects data to review against the Scottish Stroke Care Standards.

Completion Requirements: Completion of SSCA audit forms is mandatory for all acute hospitals admitting patients with stroke. However, if the patient is boarding in a general medical ward or outlying ward, these may be captured by SMR01. The SSCA contains more detail regarding stroke outcomes. Both SMR01 and SSCA only capture inpatient details; if a patient suffers a Transient Ischaemic Attack and is not admitted, this will not be represented.

Data Standards: ICD-10 categories utilised for final diagnosis (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Audit data are subject to validation at data entry and quality assurance procedures. For the SSCA annual report, data on patients who have a final diagnosis of stroke is compared to SMR01 data to ensure that no patients have been missed.
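The comparison against SMR01 described above amounts to a set difference on patient identifiers. The sketch below is a minimal illustration only, not the SSCA procedure itself: the function name is hypothetical, and it assumes both sources have already been reduced to lists of comparable identifiers for patients with a final diagnosis of stroke.

```python
from typing import Iterable, List


def missing_from_audit(
    admission_ids: Iterable[str],
    audit_ids: Iterable[str],
) -> List[str]:
    """Identifiers seen in the admissions data but absent from the audit."""
    # Sets remove duplicates; sorting gives a stable order for review.
    return sorted(set(admission_ids) - set(audit_ids))


# Made-up identifiers, for illustration only.
smr01_stroke_ids = ["P3", "P1", "P2", "P1"]
ssca_ids = ["P2"]

to_review = missing_from_audit(smr01_stroke_ids, ssca_ids)
```

With these made-up identifiers, `to_review` is `["P1", "P3"]`: the patients who appear in the admissions data but have no audit record.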

Data Provided by: NHS Scotland

Apply for Data: Data can be applied for via Public Health Scotland.

The Scottish Stroke Care Audit is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

SCI-Diabetes (Scottish Care Information – Diabetes Collaboration)

Key Area: Diabetes

Dataset Description: SCI Diabetes is a live clinical medical record system for diabetes patients that is used in Scotland and is available for research purposes. Because all general practices in Scotland contribute data, SCI Diabetes is thought to cover >99% of the population diagnosed with diabetes. Some patients with a diagnosis of pre-diabetes are included. Primary care data arrives via automated extract, while selected secondary care data is added manually by hospital staff. Lab data is also included, again via automated extract.

Completion Requirements: Clinical data – not all data fields are mandatory for completion

Data Standards: READ Codes/ICD-10 and OPCS-4 codes (see Information Standards)

Data Dictionary: Data dictionary available on request from National Education Scotland.

Data Quality Information: This is not a curated dataset – a dashboard has been created within the system to allow health boards to check their levels of completeness.

Data Provided by: NHS Scotland

Apply for Data: Single health board data is available with Scottish regional safehaven/TRE. Apply via Health Board Caldicott Guardian. The Scottish Diabetes Research Network (SDRN) receive a regular data feed and will review research applications from Scottish based researchers for collaborative projects. SDRN can be contacted here. Extracts on a national level should be applied for via Public Health Scotland.

Data for Wales

Emergency Department Data Set (EDDS)

Key Area: Emergency Care

Dataset Description: The Emergency Department Data Set (EDDS) captures attendances at Accident & Emergency departments and Minor Injury Units (MIUs) in Welsh hospitals.

Completion Requirements: Mandatory

Data Standards: Diagnoses and operations coded to ICD-10 and OPCS-4 respectively (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Data consistency and validity are monitored on a monthly basis here.

Data Provided by: NHS Wales

Apply for Data: Apply for bespoke data requests via Digital Health and Care Wales.

This dataset is also available within the SAIL Databank for Wales via the CVD-COVID-UK/COVID-IMPACT Consortium.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Outpatient Dataset for Wales (OPDW)

Key Area: Outpatient Care

Dataset Description: The Outpatient Data Set for Wales (OPDW) captures data for all consultant-led or nurse-led clinic activity that meets the criteria outlined in Terms Clinic Appointment – Consultant Led (Outpatients) or Terms Clinic Appointment – Nurse Led (Outpatients). This includes face-to-face appointments, virtual clinic appointments by phone or video (patient contact), and virtual clinical reviews (non-patient contact). Any activity which is not consultant-led or nurse-led should be excluded, as should community nurse-led activity.

Completion Requirements: Mandatory

Data Standards: ICD-10 (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Information about data quality monitoring can be found here

Data Provided by: NHS Wales

Apply for Data: Apply for bespoke data requests via Digital Health and Care Wales.

This dataset is also available within the SAIL Databank for Wales via the CVD-COVID-UK/COVID-IMPACT Consortium.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Patient Episode Database for Wales (PEDW)

Key Area: Admitted Care

Dataset Description: The PEDW dataset captures data for all consultant-led admitted patient activity, regardless of the patient’s area of residence. Any activity which is not undertaken by a consultant (or a member of their firm) should be excluded. All NHS activity regardless of the care provider is included. Private patients seen at an NHS organisation are also included.

Completion Requirements: Mandatory

Data Standards: Diagnoses and operations coded to ICD-10 and OPCS-4 respectively (see Information Standards)

Data Dictionary: Find out more here

Data Quality Information: Data consistency and validity are monitored on a monthly basis here.

Data Provided by: NHS Wales

Apply for Data: Apply for bespoke data requests via Digital Health and Care Wales.

This dataset is also available within the SAIL Databank for Wales via the CVD-COVID-UK/COVID-IMPACT Consortium.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Data for England, Wales and Northern Ireland

Sentinel Stroke National Audit Programme (SSNAP)

Dataset Description: The clinical audit collects a minimum dataset for stroke patients in England, Wales and Northern Ireland in every acute hospital, and follows the pathway through recovery, rehabilitation, and outcomes at the point of 6 month assessment. It is the only national stroke register in the world to collect longitudinal data on the processes and outcomes of stroke care up to six months post stroke. Every year data from approximately 85,000 patients are submitted to the audit webtool for analysis, representing over 90% of all stroke hospital admissions in the NHS.

Completion Requirements: The core clinical audit is mandatory. There is also an acute organisational audit, collected biennially.

Data Standards: Coding is specific to this dataset

Data Dictionary: Find out more here

Data Quality Information: Information available under SSNAP methodology here.

Data Provided by: King’s College London

Apply for Data: Extract data can be applied for via SSNAP directly, via NHS England – Data Access Request Service (DARS) or NHS DigiTrials.

This dataset is also available within NHS England’s Secure Data Environment, accessed via the NIHR-BHF Cardiovascular Partnership’s CVD-COVID-UK flagship project looking at the relationship between COVID-19 and cardiovascular disease.

This dataset may also be available via other organisations. You can search via the Health Data Research Innovation Gateway.

Key Data Access Guidance

How to access data, and other considerations.

Read more