Question: VCF Output file column explanation
gravatar for kathrine.tan
18 months ago by
kathrine.tan10 wrote:

enter image description herescreenshot of my annoVar output

I am new to the annotation of variants using annoVar. I try to understand the values listed in the column "Otherinfo" (Columns : K & M) by reading the documentation of annoVar. However I am still not clear about how these values were generated.

annovar snp gene genome • 778 views
ADD COMMENTlink modified 18 months ago by Kevin Blighe56k • written 18 months ago by kathrine.tan10

The column K looks like SIFT score. This information come from the database you add in you command line for the annotation.

ADD REPLYlink written 18 months ago by Titus910

Dear Titus,

Thanks for answering. I will try to look at it.


ADD REPLYlink written 18 months ago by kathrine.tan10
gravatar for Kevin Blighe
18 months ago by
Kevin Blighe56k
Kevin Blighe56k wrote:

Titus, I am not sure about that. Other information does not come from the databases specified in the command line. It comes from the extra columns that were in your input file:

--otherinfo                 print out otherinfo (infomration after fifth column in queryfile)

kathrine.tan, go back to your input file (input file to ANNOVAR), and there you will see to what the other information relates.


ADD COMMENTlink written 18 months ago by Kevin Blighe56k

and also please post function/command options used to generate annovar input (probably, from VCF). kathrine.tan

ADD REPLYlink written 18 months ago by cpad011212k

Dear Kevin, Thanks for answering. Here is a the command I used to generate the output. The input was a vcf file. -protocol gene -operation g -buildver ct18 -vcfinput PASS.vcf ct18db_180915/

NOTICE: Running with system command < -includeinfo="" -allsample="" -withfreq="" -format="" vcf4="" pass.vcf="" &gt;="" pass.vcf.avinput=""> NOTICE: Finished reading 1888 lines from VCF file NOTICE: A total of 1863 locus in VCF file passed QC threshold, representing 1863 SNPs (1612 transitions and 251 transversions) and 0 indels/substitutions NOTICE: Finished writing allele frequencies based on 180711 SNP genotypes (156364 transitions and 24347 transversions) and 0 indels/substitutions for 97 samples

Could the column K be the allele frequencies?

Here are some screenshots of the vcf file: enter image description here

enter image description here


ADD REPLYlink modified 18 months ago • written 18 months ago by kathrine.tan10

I see, you originally tried to annotate a VCF, but ANNOVAR then detected this and automatically converted it to ANNOVAR format via -includeinfo="" -allsample="" -withfreq="" -format="" vcf4="" pass.vcf="" &gt;="" pass.vcf.avinput=""

Specifically, it uses these command-line parameters:

--includeinfo               include supporting information in output
--withfreq                  for --allsample, print frequency information instead (for vcf4 format)

So, your column K does appear to be allele frequencies (of your sample cohort). You should check a few examples just to be sure.

ADD REPLYlink written 18 months ago by Kevin Blighe56k

Dear Kevin,

Thank you so much. I will check it again.


ADD REPLYlink written 18 months ago by kathrine.tan10
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1772 users visited in the last hour