Different assemblers and differences in genomic variants. Is it possible?
3.5 years ago

Hi,

I assembled a viral genome using two different assemblers, MEGAHIT and metaSPAdes. Both assemblers use de Bruijn graphs but might use different k-mer sizes, because both take a multi-k-mer approach.

I aligned the contigs from each assembly to the reference and called variants using BCFtools. I am getting 18 SNPs that are found in only one of the two assemblies. I'm surprised to see this degree of disagreement in variant calls between the two assembly methods. Is it expected, and what would be a possible explanation for it?
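
To pin down exactly which calls disagree, the two call sets can be compared site by site. Below is a minimal sketch in Python, assuming each BCFtools run was written as an uncompressed, tab-separated VCF. The function names `snp_set` and `compare_snps` and the file names in the usage comment are hypothetical, chosen only for illustration.

```python
def snp_set(vcf_lines):
    """Collect (chrom, pos, ref, alt) for every SNP record in VCF text lines."""
    snps = set()
    for line in vcf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header lines
        fields = line.rstrip("\n").split("\t")
        chrom, pos, ref = fields[0], int(fields[1]), fields[3].upper()
        for alt in fields[4].upper().split(","):
            # keep single-base substitutions only (no indels, no "." or "*")
            if len(ref) == 1 and len(alt) == 1 and alt in "ACGT":
                snps.add((chrom, pos, ref, alt))
    return snps


def compare_snps(lines_a, lines_b):
    """Split two SNP call sets into shared calls and calls private to each."""
    a, b = snp_set(lines_a), snp_set(lines_b)
    return {"shared": a & b, "only_a": a - b, "only_b": b - a}


# Usage (hypothetical file names, one VCF per assembly):
# with open("megahit.vcf") as fa, open("metaspades.vcf") as fb:
#     result = compare_snps(fa, fb)
# for chrom, pos, ref, alt in sorted(result["only_a"]):
#     print(chrom, pos, ref, alt)
```

The positions private to one assembly are the ones worth inspecting further.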

Assembly • 764 views

How likely is it that your sample is not clonal?


It's an ssRNA virus, but I'm not sure about clonality. It's almost impossible to have two different strains in the sample.


As far as I know, metaSPAdes performs a read error-correction step before starting the assembly step.


Yes, that's a great point. Thanks!
