Skip to content

Identifiers in the CIP-API

The aim of this section is to describe some of the different identifiers present in the data returned by the CIP-API

Definitions of any additional identifiers not listed below can be found in the GeL Report Models Documentation

Attention

The CIP-API does not hold any "strong" personal identifiable data e.g. first names, surnames, dates of birth, NHS numbers addresses. Third Party Integrators (TPIs) that require this data need to use the CIP-API in combination with the re-identification service, as described here

Genomic Medicine Service Identifiers

Identifier Example Description
Referral ID r12345678921 The NGIS referral ID is a system assigned identifier for each referral.
The NGIS referral ID is in a human readable format and has the following characteristics:
- Twelve characters
- rYYNNNNNNNNC
- r is the literal "r" character
- Y is the two-digit year (21 for 2021) when the referral was created
- N is a 8 character string of digits (0-9)
- C is a calculated check digit.
Patient ID p98765432190 The NGIS patient ID is a system assigned identifier for each patient.
The NGIS patient ID is in a human readable format and has the following characteristics:
- Twelve characters
- pNNNNNNNNNNC
- p is the literal "p" character
- N is a ten character string of digits (0-9)
- C is a calculated check digit.
Generally, a patient will have a single Sample ID.
Cohort ID r12345678921_10000 This identifier is the "Family ID / Group ID" with a "_" suffix version number. It represents the "version" of the data used in analysis in our internal catalog database
Sample ID LP1000999-DNA_A01 Also known as the "LP number" or "Plate Key", this is the unique identifier for each WGS sample. It is composed of the 96 well plate ID (e.g. LP1000999) and the co-ordinates of the well on that plate "A01" that the sample was in when the it was sequenced by Illumina
Interpretation Request ID SAP-1234-1
ILMN-321-1
This identifier is an auto incremented ID from the CIP-API database that represents the results of the WGS pipeline interpretation that has happened for that family. It is made up of a DSS prefix (e.g. SAP-), a request id (e.g. 1234), and a version (e.g. -1)
Lab Sample ID 1234567890 This identifier comes from the laboratory that submitted the sample. It is the ID that is on the tube of DNA that the sample aliquot was taken from when the referral was first ordered.

100K Genomes Project Identifiers

Identifier Example Description
Family ID / Group ID 123456789 or FM50009999 This is similar to the GMS referral ID, it represents a family (made up one or more related 100K Project participants) or two samples (tumour-normal) from a 100K Project Cancer Participant
Participant ID 987654321 This identifier represents a single 100K project participant. Generally, a participant will have a single Sample ID. In some instances we have 100K participants with only phenotypic data and no WGS data - in these instances the participant ID is prefixed with NR_ and the samples array in the Pedigree Member is empty
Proband ID 987654321 This identifier represents the "index" case in a 100K Project rare disease family
Cohort ID 123456789_10000 This identifier is the "Family ID / Group ID" with a "_" suffix version number. It represents the "version" of the data used in analysis in our internal catalog database
Sample ID LP1000999-DNA_A01 Also known as the "LP number" or "Plate Key", this is the unique identifier for each WGS sample. It is composed of the 96 well plate ID (e.g. LP1000999) and the co-ordinates of the well on that plate "A01" that the sample was in when the it was sequenced by Illumina
Interpretation Request ID SAP-1234-1
ILMN-321-1
This identifier is an auto incremented ID from the CIP-API database that represents the results of the WGS pipeline interpretation that has happened for that family. It is made up of a DSS prefix (e.g. SAP-), a request id (e.g. 1234), and a version (e.g. -1)

Last update: 2022-03-25