Filtering microbial variants
1
0
Entering edit mode
3.8 years ago
Domu • 0

I am analyzing microbial genomes where I need to find out what variants are present in my test samples and not in my control. I called variants in them both using the bcftools mpileup + call pipeline. Now I need to do the following:

1) Remove variants present in control from my test samples.

2) Filter variants

Can someone tell me how to do these? What parameters should I use for filtering? Are there good references out there for this?

Thanks in advance.

snp • 554 views
ADD COMMENT
0
Entering edit mode
3.8 years ago

You should be able to find a lot of information about this already on this site. What research have you done ?

Also, I prefer multisample SNV calling, so you get a VCF with columns for each sample. This makes filtering a lot easier.

Finally, I prefer to filter in Galaxy using eg SnpSift Filter and others and give entire histories to my collaborators containing all the filter steps, because you can easily over-filter and then face complaints about "missing SNVs".

ADD COMMENT
0
Entering edit mode

Thanks for your answer. I tried searching on google with the term 'parameters for filtering microbial variants' with the aim of finding filtering values based on the INFO field in the VCF file, but was usually directed to papers or tutorials either talking about sources of error and different filtering software. I figured this is a common enough problem and maybe I am not searching with the right keywords so I decided to ask the more experienced users here.

I will definitely look into SnpSift, it seems like a cool tool. Regarding the variants, it turns out that using the GATK prescribed values works well even for microbial genomes (a colleague has used it intensively with good results) so I will now be using that.

ADD REPLY

Login before adding your answer.

Traffic: 2685 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6