Question: How can I get the informaiton about base quality(BQ) in VCF file?
1
gravatar for lanyuhao1991
5.6 years ago by
lanyuhao199110
China
lanyuhao199110 wrote:

Hi everyone,

I use samtools(version 0.18) to do snp calling.But the final VCF(version 4.1) file do not have the BQ(base quality).

How can I get the informaiton about BQ ? Because I should filter the VCF files based on BQ > 15.

The command :

  • samtools mpileup -uf ref.fa aln.bam | bcftools view -vcg - > var.raw.vcf

The final VCF :


 

##fileformat=VCFv4.1
##samtoolsVersion=0.1.18 (r982:295)
##INFO=<ID=DP,Number=1,Type=Integer,Description="Raw read depth">
##INFO=<ID=DP4,Number=4,Type=Integer,Description="# high-quality ref-forward bases, ref-reverse, alt-forward and alt-reverse bases">
##INFO=<ID=MQ,Number=1,Type=Integer,Description="Root-mean-square mapping quality of covering reads">
##INFO=<ID=FQ,Number=1,Type=Float,Description="Phred probability of all samples being the same">
##INFO=<ID=AF1,Number=1,Type=Float,Description="Max-likelihood estimate of the first ALT allele frequency (assuming HWE)">
##INFO=<ID=AC1,Number=1,Type=Float,Description="Max-likelihood estimate of the first ALT allele count (no HWE assumption)">
##INFO=<ID=G3,Number=3,Type=Float,Description="ML estimate of genotype frequencies">
##INFO=<ID=HWE,Number=1,Type=Float,Description="Chi^2 based HWE test P-value based on G3">
##INFO=<ID=CLR,Number=1,Type=Integer,Description="Log ratio of genotype likelihoods with and without the constraint">
##INFO=<ID=UGT,Number=1,Type=String,Description="The most probable unconstrained genotype configuration in the trio">
##INFO=<ID=CGT,Number=1,Type=String,Description="The most probable constrained genotype configuration in the trio">
##INFO=<ID=PV4,Number=4,Type=Float,Description="P-values for strand bias, baseQ bias, mapQ bias and tail distance bias">
##INFO=<ID=INDEL,Number=0,Type=Flag,Description="Indicates that the variant is an INDEL.">
##INFO=<ID=PC2,Number=2,Type=Integer,Description="Phred probability of the nonRef allele frequency in group1 samples being larger (,smaller) than in group2.">
##INFO=<ID=PCHI2,Number=1,Type=Float,Description="Posterior weighted chi^2 P-value for testing the association between group1 and group2 samples.">
##INFO=<ID=QCHI2,Number=1,Type=Integer,Description="Phred scaled PCHI2.">
##INFO=<ID=PR,Number=1,Type=Integer,Description="# permutations yielding a smaller PCHI2.">
##INFO=<ID=VDB,Number=1,Type=Float,Description="Variant Distance Bias">
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">
##FORMAT=<ID=GQ,Number=1,Type=Integer,Description="Genotype Quality">
##FORMAT=<ID=GL,Number=3,Type=Float,Description="Likelihoods for RR,RA,AA genotypes (R=ref,A=alt)">
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="# high-quality bases">
##FORMAT=<ID=SP,Number=1,Type=Integer,Description="Phred-scaled strand bias P-value">
##FORMAT=<ID=PL,Number=G,Type=Integer,Description="List of Phred-scaled genotype likelihoods">
#CHROM    POS    ID    REF    ALT    QUAL    FILTER    INFO    FORMAT    unknown
gi|261414414|ref|NC_013410.1|    148    .    A    G    10.4    .    DP=8;VDB=0.0429;AF1=0.5001;AC1=1;DP4=3,0,4,1;MQ=18;FQ=5.73;PV4=1,0.0017,0.024,0.19    GT:PL:GQ    0/1:40,0,32:35

 

Thanks a lot!

 

snp bq vcf • 2.9k views
ADD COMMENTlink modified 5.6 years ago by Dan D7.0k • written 5.6 years ago by lanyuhao199110
2
gravatar for Dan D
5.6 years ago by
Dan D7.0k
Tennessee
Dan D7.0k wrote:

The base quality filtering is already done by samtools mpileup using the -Q parameter. If you used samtools mpileup to create your VCF, then it is already filtered on base quality (default is 13).

ADD COMMENTlink modified 8 weeks ago by RamRS25k • written 5.6 years ago by Dan D7.0k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2104 users visited in the last hour