Question: Concordance Of Ngs Data To Wg Snp Array Data
4
gravatar for jvijai
8.2 years ago by
jvijai1.1k
United States
jvijai1.1k wrote:

Hi, We want to check the concordance of the NGS data obtained from targeted SOLiD sequencing with the Affymetrix 6.0 data generated on the same sample. Both are genomic germline DNA, not tumor.
Is there a simple tool that will take the SAM file or variant file and look for concordance with the individuals genotype file (say if extracted from PLINK)?

Also hoping to hear how to do this otherwise, such as looking for the positions and alleles and then doing comparison, tabulating the number of concordant vs discordants etc.,

Thanks

ADD COMMENTlink written 8.2 years ago by jvijai1.1k
3
gravatar for Jorge Amigo
8.0 years ago by
Jorge Amigo11k
Santiago de Compostela, Spain
Jorge Amigo11k wrote:

the affy6.0 chip has a translation file that has all the typed genome positions tagged, with reference and alternative allele. considering that NGS will only detect how your sample differs from the current reference, taking affy6.0 alleles and positions there are 2 obvious checks you may perform:

  1. check if all the SNPs that affy6.0 has typed as alternative allele WERE FOUND by NGS (hence considered variants).
  2. check if all the SNPs that affy6.0 has typed as reference allele WERE NOT FOUND by NGS (hence considered reference).

keep in mind that genotyping is "limited" to the fixed positions stored in the chip, so you won't be able to do anything in reverse direction, i.e. things that arise from NGS that are not described on the affy6.0 chip. but still this check will be powerful, since the typing quality of affy6.0 is quite high, so at least you'll be able to have ~1M straight checks (SNP positions) and another ~1M not-so-straight checks (CNVs), depending on your NGS experiment type.

ADD COMMENTlink written 8.0 years ago by Jorge Amigo11k
3
gravatar for Adam
8.0 years ago by
Adam990
United States
Adam990 wrote:

If in VCF format, you can use the vcftools --diff options to do this. Alternatively, you could convert the format to PLINK and use the diff mode of that program.

ADD COMMENTlink written 8.0 years ago by Adam990
3
gravatar for Adao
7.5 years ago by
Adao30
Adao30 wrote:

Believe it or not, my PI want me to do this by excel.

ADD COMMENTlink written 7.5 years ago by Adao30

Just have to say wow...

ADD REPLYlink written 7.3 years ago by Alexbmp0

Wow... that's a tough job.

ADD REPLYlink written 7.3 years ago by Alexbmp0

excel really? there's excel templates which work as analysis tools for bioinformatics from the microsoft website. NOT sure if they might have something like this. but conceptually, I think a linux 'join' command is what u need to work fast

ADD REPLYlink written 7.2 years ago by Kevin610
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1792 users visited in the last hour