Visualize Insert Size For Mate-Pairs And Calculate Statistics
2
9
Entering edit mode
10.2 years ago

enter image description here

I want to visualise insert sizes in my mate-pair library and calculate statistics like mean, standard deviation..

I have found this tool http://blog.malde.org/posts/bamstats.html that makes beautiful graphs like the one above (the plot shows mate pair library - green - contaminated by paired end reads in red).

However, based on output of this program it seems that there are plenty of paired-end reads in my library and only few mate-pairs (orientation outwards, outies on the plot). I want to verify this with another tool. Picard's CollectInsertSizeMetrics works only for paired-end reads according to manual. What are other possibilities?

Moreover, I got pretty wild statistics from bamstats package, probably because of outliers in my data:

#Alignment             count       prop       mean      stdev       skew       kurt
innies                11853    33.53%    861169.9    8776007.0       12.2      161.7
outies                 4560    12.90%    2644111.4    15288468.3        6.7       46.7
lefties                4681    13.24%    2769622.5    16057397.2        6.4       42.8
righties               4585    12.97%    3128289.8    17007034.4        6.2       39.5

Any suggestions are welcome. I need three things: estimated mean (after removal of outliers), standard deviation (again after removal of outliers) and visualisation. Both, histograms and boxplots work for me.

Thanks a lot!

statistics • 5.0k views
ADD COMMENT
0
Entering edit mode
9.9 years ago
Andre Elias ▴ 110

Hi, it looks interesting, but I'm a bit confused. I did some googling and apparently it's a haskell program, so after digging around and installing haskell, it complains my samtools is not >0.2.1. I checked on the samtools official website and the most current version is 0.1.9. I looked online again and apparently there is a samtools 0.2.2 as a 'cabal' package, but it requires a few other dependencies…

Am I missing something? Is there any way that's a little more straightforward to install/compile Bamstats?

ADD COMMENT
0
Entering edit mode

Did you figure out how to install this? Please let me know if you get this thing working.

Thanks,

ADD REPLY
0
Entering edit mode
9.8 years ago
madk00k ▴ 360

Try Qualimap. It computes mean, median and standard deviation of the insert size. Additionally it computes the histogram of insert sizes and plots insert size across reference.

ADD COMMENT

Login before adding your answer.

Traffic: 2174 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6