How to output all variant alleles at a given site, regardless of quality
1
0
Entering edit mode
7.3 years ago

How can one use either samtools mpileup or any other tool to, for a given position, output all alternate alleles regardless of quality? Just "samtools mpileup" (and the parameters I've tried so far) gives <*> for a position for which I observe an alternate base in IGV. (The purpose of doing this is for quality control)

snp variant mpileup • 1.6k views
ADD COMMENT
2
Entering edit mode
7.3 years ago

You can do this with BBMap's CallVariants tool:

callvariants.sh in=mapped.bam ref=ref.fasta ploidy=2 vcf=all.vcf clearfilters

"clearfilters" sets all filters to zero, so no variant will fail. Your vcf file might be pretty big, though.

ADD COMMENT
0
Entering edit mode

Works! Although in this case I was only interested in looking up specific regions, but subsetting the BAM file using samtools first solves the problem. However, I got some Java errors when running callvariants.sh: "Invalid maximum heap size: -Xmx101593m. The specified size exceeds the maximum representable size.". Lowering the memory parameters to "-Xmx2048m -Xms2048m" still gave errors about "java.lang.OutOfMemoryError: Java heap space", but the program finished anyway and the output VCF seemed to look fine.

ADD REPLY
0
Entering edit mode

Oh... I'm guessing that you are using an older 32-bit version of Java? I suggest you upgrade to Java 8 64-bit version.

ADD REPLY
0
Entering edit mode

You're right, that was the issue!

ADD REPLY

Login before adding your answer.

Traffic: 2276 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6