Question

Forum:Bioinformatics in a clinical diagnostic setting versus research (academia)

4

Entering edit mode

8.5 years ago

mbyvcm ▴ 480

This question is primarily directed at next-generation sequencing pipelines/workflows, specifically aimed at variant discovery. I was wondering if there are any additional practices, quality control steps, that are implemented in a clinical diagnostic setting which are not routinely used by bioinformaticians woking on academic research projects?

genome SNP next-gen-sequencing • 3.7k views

ADD COMMENT • link updated 2.3 years ago by Ram 45k • written 8.5 years ago by mbyvcm ▴ 480

1

Entering edit mode

It at least used to be the case that the private companies had much larger datasets to compare against (e.g., for filtering out non-causative variants). Since Exac and similar consortia have been running I'm not sure how much this is the case anymore, though.

ADD REPLY • link 8.5 years ago by Devon Ryan 105k

score 7 · Answer 1 · 2017-01-07

In a clinical setting, variant calls are being used for diagnostic purposes. Effectively they have to be correct, there is a lot less margin for error, and the pathological significance of variants needs to be relatively well supported. There are regulatory guidelines for clinicial bioinformatics as well governed under CAP and CLIA in the US. Typically, this includes documentation of pipeline development, and validation of the pipeline against deeply sequenced in house samples and "gold standard" datasets (e.g., Genome in a Bottle), so as to assess pipeline sensitive and specificity for known variants.

Couple useful links:

https://software.broadinstitute.org/gatk/best-practices/

http://www.bioplanet.com/GCAT

score 5 · Answer 2 · 2017-01-09

5

Entering edit mode

8.5 years ago

Robert Sicko ▴ 640

mforde84 has summed it up well, CLIA and CAP guidelines need to be followed for clinical NGS. Some additional resources for you:

College of American Pathologists' guidelines

American College of Medical Genetics and Genomics guidelines

Specifically focused on bioinformatics for clinical NGS

ADD COMMENT • link 8.5 years ago by Robert Sicko ▴ 640

0

Entering edit mode

+1 for ACMG guidelines. Although, I don't approve of their recommendation of "use IVS if you prefer that to c. notations"

ADD REPLY • link 8.5 years ago by Ram 45k

score 2 · Answer 3 · 2017-01-09

In clinical diagnostic labs, one important QC distinction is that controls with established known positive variants must be included on every sequencing run that includes one or more patient samples. If sufficient coverage and variant frequency for these variants can't be confirmed, the run must be failed. The thresholds for these values are established by limit of detection analysis. Well characterized wildtype/negative controls must also be present to confirm against contamination with false positives.

This is discussed in the ACMG and CAP guidelines linked earlier, but I think it bears calling out specifically.

Of course academic labs and genomics cores sometimes include positive controls, but is not a strict requirement, and they may not have to throw out all of the results if QC criteria aren't met.

score 1 · Answer 4 · 2017-01-09

Pretty much as mford84 - for the UK labs there are some guidelines available from the ACGS website

http://www.acgs.uk.com/committees/quality-committee/best-practice-guidelines/

http://www.acgs.uk.com/media/983872/bpg_for_targeted_next_generation_sequencing_-_approved_dec_2015.pdf

http://www.acgs.uk.com/media/1025075/ngs_bioinformatics_bpg_final_version_2016.pdf

Inspections are usually via UKAS https://www.ukas.com/ (including for NGS tests and associated bioinformatics pipelines).

There's been a fair amount of additional work on how to validate against gold standards such as Genome in a Bottle, and a growing body of control samples to test against (mainly an Indel specific dataset atm). Most labs also have large in-house control datasets derived from Sanger sequencing and Microarray data - although this of course only tests for variants that have been found by these technologies previously.

Specific points for the NGS pipelines are audits, version control via something like Github, revalidation on software updates etc.

Edit: Should also add that at the moment many UK labs will also Sanger confirm any likely pathogenic variants (ddPCR or MicroArray for CNVs), found by the NGS pipelines.