Insert size different from company with BBMap
1
0
Entering edit mode
6.5 years ago
Sharon ▴ 610

Hi All

Insert size in BBMap is different than what the company mentioned? Which is correct? We got ~150, company says its 200 ! How can we have reads of 150 each and inner mate -135 and insert size is 200. Insert size - 2 * read length = inner mate then if inner mate is -135 and insert is 200, then, we have double read = 335 , then read is longer than 150, while we have 15o bases in the fastq for each file.

Thanks

RNA-Seq • 2.3k views
ADD COMMENT
1
Entering edit mode
6.5 years ago
GenoMax 142k

BBMap. Since it is based on actual merging of reads and/or alignment data. Not sure which way you did the analysis ( C: Target fragment size versus final insert size ).

ADD COMMENT
0
Entering edit mode

Sorry, I made update before your comment. So we can't have inner mate with -135 and insert = 200 and read length is 150 in fastq file , right? I used this:

bbmerge.sh in1=r1.fq in2=r2.fq ihist=ihist.txt
ADD REPLY
1
Entering edit mode

Can you restate your question referring to this image (A: What is the different between Read and Fragment in RNA-seq? )? If the reads overlap then then is no inner distance (ref fig).

ADD REPLY
0
Entering edit mode

I think if the reads overlap, we will have negative inner mate, right? And if the insert size is shorter than the sum of the 2 reads then the reads will overlap, right? Like if we have 150 bases in each fastq file, and we have insert size as claimed by the company 200, and the inner mate as claimed by the company is -135, then the numbers don't sum.

insert size - 2 *read length = inner mate

200 - ~300 != -135

but if the insert size is 158 as by bbmap, then we can have :

158 - ~300 = ~-142 right?

ADD REPLY
1
Entering edit mode

Is the company counting the adapters in the fragment length they are quoting?

If BBMap estimates the fragment size to be 158 then it means that you are sequencing 150 bp from end #1 leaving a 8 bp piece on the right. When R2 comes along, it is going to cover this 8 bp piece first but will leave a corresponding 8 bp fragment on the left side of the fragment.

|-------------------------------->8 bp|    R1 - 150 bp
|-------------------------------------|    Fragment 158 bp
|8 bp<--------------------------------|    R2 - 150 bp
ADD REPLY
0
Entering edit mode

I will check what the company mean. But does BBmap estimate fragment size or insert size? And can I trust BBMap and argue with company based on BBMap estimation?

ADD REPLY
1
Entering edit mode

Insert size. Fragment size is insert size + length of 2 adapters. If you have a reference available then you should be able to see this clearly if you use the mapping method. Map only a million or so reads.

ADD REPLY
0
Entering edit mode

Thanks genomax so much. I think it is clear now, BBmap estimates makes more sense !

ADD REPLY

Login before adding your answer.

Traffic: 1208 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6