How to determine the K-mer size?
2.1 years ago

My reads are 151 bp long. I heard that the convention is to set the k-mer size as 1/2 to 2/3 the read length. In this case, that would mean a k-mer size of roughly 75-100 bp. However, isn't a k-mer that long too likely to contain sequencing errors? I want to correct my reads using Musket, and it recommends a default k-mer size of 21.

k-mer • 915 views

You can use something like Jellyfish to count the k-mers that actually occur in your reads at a given k: https://github.com/gmarcais/Jellyfish
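
To make the idea concrete, here is a minimal Python sketch of what a k-mer counter computes. It is not Jellyfish's implementation (Jellyfish is a fast, memory-efficient tool meant for real data sets), and the function names `canonical`, `count_kmers` and `abundance_histogram` are made up for this illustration. It assumes the reads are already loaded as plain strings and that a k-mer and its reverse complement should be counted together.

```python
from collections import Counter

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def count_kmers(reads, k):
    """Count canonical k-mers over an iterable of read strings (hypothetical helper)."""
    counts = Counter()
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Skip k-mers containing N or other ambiguous bases.
            if set(kmer) <= set("ACGT"):
                counts[canonical(kmer)] += 1
    return counts


def abundance_histogram(counts):
    """Map each abundance to the number of distinct k-mers seen that many times."""
    return Counter(counts.values())
```

In an abundance histogram from real reads, k-mers seen only once or twice are often sequencing errors, which is the signal that k-mer based error correctors rely on.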

"I heard that the convention is to set the k-mer size as 1/2 to 2/3 the read length"

That convention is probably meant for assembly, which is not what you are trying to do.

2.1 years ago
Mensur Dlakic ★ 27k

I think you are mixing up k-mer sizes for assembly with k-mer sizes for error correction. There is no universal k-mer size; the right value depends on the application. The rule of thumb you mention applies to assembly, and even there it is best to let the assembler work out which k-mer sizes to use. For example, SPAdes picks its k-mer sizes automatically when you do not set them, and it assembles across several values of k rather than relying on a single one.

If Musket recommends k=21, that should be a safe choice. I have used k=21 with bfc and it works well.
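
A rough back-of-the-envelope calculation shows why a short k suits error correction. The sketch below is only illustrative: it assumes a uniform, independent per-base error rate of 1% (a made-up round number, not a measured value for your data), and the helper names are hypothetical. A read of length L contains L - k + 1 k-mers, and under that assumption a given k-mer is error-free with probability (1 - e)^k.

```python
def kmers_per_read(read_length, k):
    """Number of k-mers in one read: L - k + 1 (zero if the read is shorter than k)."""
    return max(read_length - k + 1, 0)


def error_free_fraction(k, per_base_error):
    """Probability that a k-mer has no errors, assuming independent per-base errors."""
    return (1.0 - per_base_error) ** k


READ_LENGTH = 151          # read length from the question
ASSUMED_ERROR_RATE = 0.01  # assumption for illustration only

for k in (21, 75, 100):
    n = kmers_per_read(READ_LENGTH, k)
    p = error_free_fraction(k, ASSUMED_ERROR_RATE)
    print(f"k={k}: {n} k-mers per read, error-free fraction {p:.2f}")
```

Under these assumptions, about 81% of 21-mers are error-free, compared with roughly 47% of 75-mers and 37% of 100-mers, and each read also yields far more 21-mers (131) than 100-mers (52). Correct k-mers are therefore sampled much more often at k=21, which makes them easier to tell apart from rare, erroneous ones.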
