I am new to the annotation of variants using annoVar.
I try to understand the values listed in the column "Otherinfo" (Columns : K & M) by reading the documentation of annoVar.
However I am still not clear about how these values were generated.
The column K looks like SIFT score. This information come from the database you add in you command line for the annotation.
Thanks for answering. I will try to look at it.
Titus, I am not sure about that. Other information does not come from the databases specified in the command line. It comes from the extra columns that were in your input file:
--otherinfo print out otherinfo (infomration after fifth column in queryfile)
kathrine.tan, go back to your input file (input file to ANNOVAR), and there you will see to what the other information relates.
and also please post function/command options used to generate annovar input (probably, from VCF). kathrine.tan
Thanks for answering.
Here is a the command I used to generate the output. The input was a vcf file.
table_annovar.pl -protocol gene -operation g -buildver ct18 -vcfinput PASS.vcf ct18db_180915/
NOTICE: Running with system command <convert2annovar.pl -includeinfo="" -allsample="" -withfreq="" -format="" vcf4="" pass.vcf="" >="" pass.vcf.avinput="">
NOTICE: Finished reading 1888 lines from VCF file
NOTICE: A total of 1863 locus in VCF file passed QC threshold, representing 1863 SNPs (1612 transitions and 251 transversions) and 0 indels/substitutions
NOTICE: Finished writing allele frequencies based on 180711 SNP genotypes (156364 transitions and 24347 transversions) and 0 indels/substitutions for 97 samples
Could the column K be the allele frequencies?
Here are some screenshots of the vcf file:
I see, you originally tried to annotate a VCF, but ANNOVAR then detected this and automatically converted it to ANNOVAR format via
convert2annovar.pl -includeinfo="" -allsample="" -withfreq="" -format="" vcf4="" pass.vcf="" >="" pass.vcf.avinput=""
Specifically, it uses these command-line parameters:
--includeinfo include supporting information in output
--withfreq for --allsample, print frequency information instead (for vcf4 format)
So, your column K does appear to be allele frequencies (of your sample cohort). You should check a few examples just to be sure.
Thank you so much.
I will check it again.
Login before adding your answer.
Use of this site constitutes acceptance of our User Agreement and Privacy