Choosing K-mer size in Kmergenie for denovo assembly
2
2
Entering edit mode
9.7 years ago

How should we select the correct kmer size for kmergenie analysis? I have read in wiki page that 'largest k-mer size to consider'. In that case I am looking for valuable comments from you people.

next-gen Assembly genome • 4.7k views
ADD COMMENT
0
Entering edit mode

Thank you 5heikki

But can you explain a little bit more. Are you saying to run with different k values less than max read length and take an average of them ?

ADD REPLY
0
Entering edit mode

I'm saying you should make assemblies with many different k-mer values and then in the end you merge all the assemblies into one. Also, you obviously can't have larger k-mer values than your read length, I mean, how would you e.g. get a 120-mer from a 90 bp read?

ADD REPLY
0
Entering edit mode

Ok 5heikki. I will try the way you mentioned. Open for more comments if any. thanks a lot

ADD REPLY
3
Entering edit mode
9.7 years ago
Rayan Chikhi ★ 1.5k

There may be a misunderstanding here.

"Selecting the correct kmer size for kmergenie analysis" implies that kmergenie would perhaps require user assistance?

This is precisely the opposite, it is designed to choose k for you. Just run kmergenie without any parameter, and it will explicitly print what it thinks is a good kmer size. Most of the time, the value is accurate.

Additionally, for unusual cases, you can take a look at the generated HTML report to make a more informed decision.

ADD COMMENT
1
Entering edit mode

Your explanation seems to be exactly answering my doubt, Rayan Chikhi. I will try this way as well. Thanks a ton.

ADD REPLY
2
Entering edit mode
9.7 years ago
5heikki 11k

Smaller than the length of your reads, larger than 20. Attempt many different values and merge the assemblies in the end.

ADD COMMENT

Login before adding your answer.

Traffic: 1345 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6