Question: What happens to heterozygous sites when you go from reference sequence to sequence modified by variants?
gravatar for jxiang15
5.2 years ago by
United States
jxiang1510 wrote:

In a previous post, New Fasta Sequence From Reference Fasta And Variant Calls File?, it was recommended to use either vcftools or FastaAlternateReferenceMaker (  if you have a reference sequence and a variant file, and you to get a new FASTA file. 

However, with the 1000 genomes data, the data is phased.  So at heterozygous sites, should the ALT allele be substituted or should the REF allele be left in the sequence? 

Thanks in advance

snp sequence • 1.6k views
ADD COMMENTlink modified 5.2 years ago by geek_y10k • written 5.2 years ago by jxiang1510
gravatar for geek_y
5.2 years ago by
geek_y10k wrote:

FastaAlternateReferenceMaker's --useIUPAC option will place the IUPAC code wherever there is heterozygous state.

ADD COMMENTlink written 5.2 years ago by geek_y10k

What happens if you don't put that option, what is the default behaviour?

ADD REPLYlink written 5.2 years ago by jxiang1510
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1241 users visited in the last hour