How does Kraken2 Work?
Entering edit mode
2.0 years ago
bk2070 • 0

I've read the manual, but I'm still confused. Does Kraken2 break up the reads you give it into 31-32 KMERS and looks for matches against its database? If that's the case, does this mean it gives a couple nucleotide mismatches? Also, how does this work with its confidence scoring parameter? If I increase this, will it make it more stringent?

Last but not least, if I were to assemble my metagenome shotgun dataset into contigs, would it even be worth using Kraken2 if its still going to break the contigs into 31-32KMERS.


kraken2 • 2.7k views
Entering edit mode
2.0 years ago
Asaf 8.6k
  1. Yes, it breaks the reads into all the possible kmers. A perfect hit will have all the kmers mapped, if you have one mismatch it will impact all kmers that contain this nucleotide.

  2. The confidence score is basically the number of matching kmers divided by the total number of kmers.

  3. You can use kraken2 for contigs but you can use other methods which will give more precise results potentially.

Entering edit mode

Asaf, to your third point, what other methods for contigs would you recommend? I use direct mapping of contigs as an alternative, but that doesn't give me LCA classifications when there's no species hit, which is what I like about Kraken.


Login before adding your answer.

Traffic: 2471 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6