Variant Calling For Ultra High Coverage Region Of Whole Exome Sequencing Data
2
0
Entering edit mode
11.4 years ago
lyz10302012 ▴ 450

Hi,

How does everyone solve the variant calling (SNVs, Indels) problem in ultra high coverage region of whole exome sequencing data? For example, if the ref allele count is 278 and the alt allele count is 83, how can we determine if this site is a variant, or not?

Thanks.

variant calling • 2.3k views
ADD COMMENT
1
Entering edit mode
11.4 years ago
Vivek ★ 2.7k

If want to make the calls by your self, you could likely calculate the alternate allele fraction for known sites intersecting with dbSNP, 1K genomes, ESP project etc and create a cut-off based on that.

ADD COMMENT
0
Entering edit mode
11.4 years ago
brentp 24k

I don't know if < 400 counts as ultra-high any more, but most variant calling software will take other things into account such as the quality distributions of the reference bases compared to those of the alternate, strand bias, etc.

That coverage is low enough that if you get a smallish number of candidates, you can load the bam(s) up in IGV and have a look. That usually helps to determine if there are alignment issues in that region or if things look off.

ADD COMMENT

Login before adding your answer.

Traffic: 1606 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6