Question: Jellyfish: every other kmer count is zero
gravatar for Lina F
2.0 years ago by
Lina F160
Boston, MA
Lina F160 wrote:

Hi all,

I found a tutorial suggesting how to use Jellyfish to estimate genome size:

However, after running jellyfish count and jellyfish histo the output shows that every other kmer count is zero.

Below is my code, trying several values of k.

I feel like I'm missing something simple -- why are the odd k-mer counts zero?

Thanks for any advice!


for K in 21 23 25 27 29 31;
  jellyfish count -t 20 -C -m $K -s 5G -o output_${K}.jf --min-quality=20 --quality-start=33 all.fastq
  jellyfish histo -f output_${K}.jf > histogram_${K}.txt
  jellyfish stats -v -o stats_${K}.txt output_${K}.jf

head histogram_31.txt
0 0
1 0
2 14028836
3 0
4 2053267
5 0
6 966831
7 0
8 554663
9 0

cat stats_31.txt
Unique:    0
Distinct:  37557758
Total:     2901177252
Max_count: 2419076

Edited to add the contents of the stats file.

ADD COMMENTlink modified 2.0 years ago • written 2.0 years ago by Lina F160

You also have no k-mers with frequency of 1, which is extremely unlikely. Did you somehow doubled up your input fastq? Did you copy the original fastq at some point and concatenated the copy to the original?

ADD REPLYlink written 2.0 years ago by Damian Kao15k

I double checked and I did not double up my input fastq files. However, I am using both fwd and rev read files. In total I have 29.5 million read pairs. Should I downsample this?

EDITED to add: I just ran the code with only the FWD read files and now I get 1mers and odd kmers in general.

I realized my input data was wrong (my R1 and my R2 files were indeed the same, they were just given to me with different names)

Thanks for the helpful advice!

ADD REPLYlink modified 2.0 years ago • written 2.0 years ago by Lina F160
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 943 users visited in the last hour