Question: Finding SNPs between two inbred genetic lines
Asked 2.3 years ago by colin.kern (United States):

I have DNA sequencing from two highly inbred genetic lines of chicken, so I expect them to be almost entirely homozygous. I'm interested in getting a VCF file of the SNPs that exist between these two lines. It seems like there are two ways I can do this using GATK:

1) Run GATK twice, once on the data from each line, then look for SNPs relative to the reference genome that appear in one line but not the other, or SNPs that appear in both but with different alternate bases.

2) Run GATK once with both sets of data, then filter the results for SNPs with an allele frequency of 0.5, or with a frequency of 1.0 and two alternate alleles.

Are these two methods equivalent, or would one produce more accurate results than the other? Is there a more direct way to call SNPs between two genetic lines, where it doesn't call SNPs relative to the reference genome?
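To make option 2 concrete, here is a minimal sketch in plain Python of the filter it describes: keep SNP sites where both lines are homozygous but for different alleles. It assumes a jointly called VCF with exactly two sample columns; the sample names lineA and lineB, the function names, and the example records are all hypothetical, and real data would likely also need quality and depth filters.

```python
BASES = {"A", "C", "G", "T"}

def homozygous_allele(sample_field):
    """Return the allele index (as a string) if the genotype is homozygous, else None."""
    # GT is the first FORMAT key when present; treat phased and unphased calls alike.
    alleles = sample_field.split(":")[0].replace("|", "/").split("/")
    if "." in alleles or len(set(alleles)) != 1:
        return None
    return alleles[0]

def sites_differing_between_lines(vcf_lines):
    """Yield (chrom, pos, ref, alt, gt_a, gt_b) for SNPs fixed for different alleles."""
    for line in vcf_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header and blank lines
        fields = line.rstrip("\n").split("\t")
        chrom, pos, ref, alt = fields[0], int(fields[1]), fields[3], fields[4]
        # Keep single-base substitutions only (no indels or symbolic alleles).
        if ref not in BASES or any(a not in BASES for a in alt.split(",")):
            continue
        gt_a = homozygous_allele(fields[9])   # first sample column (assumed: line A)
        gt_b = homozygous_allele(fields[10])  # second sample column (assumed: line B)
        if gt_a is not None and gt_b is not None and gt_a != gt_b:
            yield chrom, pos, ref, alt, gt_a, gt_b

# Hypothetical records for illustration only.
example = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tlineA\tlineB",
    "chr1\t100\t.\tA\tG\t50\tPASS\t.\tGT\t0/0\t1/1",    # ref in A, alt in B
    "chr1\t200\t.\tC\tT\t50\tPASS\t.\tGT\t1/1\t1/1",    # same alt in both lines
    "chr1\t300\t.\tG\tA,C\t50\tPASS\t.\tGT\t1/1\t2/2",  # two different alt alleles
    "chr1\t400\t.\tT\tC\t50\tPASS\t.\tGT\t0/1\t1/1",    # heterozygous in A
]
differing = list(sites_differing_between_lines(example))
for site in differing:
    print(*site, sep="\t")
```

Comparing genotypes directly, rather than filtering on allele frequency, also covers the case where the two lines carry different alternate alleles at the same site.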


If you already have VCF files, vcf-compare may be worth a try. It reports the matches and mismatches per individual. You can then filter SNPs based on how they overlap with the reference and with the other call sets, and you can use vcftools to filter SNPs from the VCF files.

Reply written 2.3 years ago by Bioinformatics_NewComer
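The per-line comparison (option 1 in the question, and the overlap-based filtering suggested in the reply) might be sketched in plain Python as follows. This is an illustration only, not what vcf-compare does internally: it assumes one single-sample VCF per line, considers only homozygous alternate SNP calls, and uses hypothetical function names and records. One caveat: a site missing from a variant-only VCF may be homozygous reference or simply uncovered, and this sketch cannot tell the two apart.

```python
BASES = {"A", "C", "G", "T"}

def load_hom_alt_snps(vcf_lines):
    """Map (chrom, pos) -> alt base for homozygous alt SNP calls in a single-sample VCF."""
    calls = {}
    for line in vcf_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header and blank lines
        f = line.rstrip("\n").split("\t")
        chrom, pos, ref, alts = f[0], int(f[1]), f[3], f[4].split(",")
        gt = f[9].split(":")[0].replace("|", "/").split("/")
        # Skip heterozygous, missing, and homozygous reference genotypes.
        if len(set(gt)) != 1 or gt[0] in (".", "0"):
            continue
        allele = alts[int(gt[0]) - 1]  # allele index 1 is the first ALT entry
        if ref in BASES and allele in BASES:
            calls[(chrom, pos)] = allele
    return calls

def compare_lines(calls_a, calls_b):
    """Return sites private to A, private to B, and shared sites with different alt bases."""
    only_a = {k: v for k, v in calls_a.items() if k not in calls_b}
    only_b = {k: v for k, v in calls_b.items() if k not in calls_a}
    different_alt = {
        k: (calls_a[k], calls_b[k])
        for k in calls_a.keys() & calls_b.keys()
        if calls_a[k] != calls_b[k]
    }
    return only_a, only_b, different_alt

# Hypothetical records for illustration only.
header = "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tsample"
vcf_a = [header,
         "chr1\t100\t.\tA\tG\t50\tPASS\t.\tGT\t1/1",
         "chr1\t300\t.\tG\tA\t50\tPASS\t.\tGT\t1/1",
         "chr1\t500\t.\tT\tC\t50\tPASS\t.\tGT\t1/1"]
vcf_b = [header,
         "chr1\t300\t.\tG\tC\t50\tPASS\t.\tGT\t1/1",
         "chr1\t500\t.\tT\tC\t50\tPASS\t.\tGT\t1/1",
         "chr1\t700\t.\tC\tT\t50\tPASS\t.\tGT\t1/1"]

only_a, only_b, different_alt = compare_lines(
    load_hom_alt_snps(vcf_a), load_hom_alt_snps(vcf_b)
)
print("private to A:", only_a)
print("private to B:", only_b)
print("different alt:", different_alt)
```

Sites shared by both lines with the same alternate base (chr1:500 here) differ from the reference but not between the lines, so they are left out of all three result sets.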