Dear all,
I have SAM files after mapping small RNA reads to genome. I want to know how many reads of small RNAs mapped to individual genes and what are the lengths of mapped reads to each genes. If it's too complicated to get all different length of reads, can I know for individual genes how many 24nt sRNAs mapped?
Or first get the mapped 24 nt reads, then see how many of each gene has?
Thanks!!
Hi Medhat,
Thanks for your solution! When I use the commands below, it was killed.
Here is my bed file for genes:
I used htseq-count to count numbers of mapped reads of each gene. However, I do not know which option I can use to get the lengths of each mapped reads. For example, there are a group of small RNAs mapped to gene AT1G01010. Among those sRNAs, I want to know how many of them are 21 nt sRNAs, 22 nt sRNAs, 23 nt sRNAs ... etc.
htseq-count I do not know option that will give you length.
Killed it means that there is no enough memory; how much memory you have?
a work around could be find here https://github.com/arq5x/bedtools2/issues/232
It's only 8 GB RAM, so seems it can not be done....
did you tried this work around in the link ?
I tried bedtools intersect -a foo.sorted.bed -b foo.sorted.bed -sorted -wa -wb
Here is error again
Bedtools Intersect Error in this link, says 'The end site is before the start site.' seems getting more complicated, I'm not sure if this is what I want now...