Remove duplicates in featurecounts
2
0
Entering edit mode
7.4 years ago
lkmklsmn ▴ 950

Hi Biostars,

I used the featureCounts function from the Rsubread package in order to count reads. The first time I used my 'regular' bam files. The second time I used Picard Tools to mark duplicates in each bam file and used this data as substrate. I am getting the same number of counts in both cases using featureCounts. I was wondering if it is possible to tell featureCounts to exclude duplicates from the counting. Is there an option doing it within Rsubread or do I have to actually remove (as opposed to mark) the duplicate reads?

Thanks

featureCounts Picard RNA-Seq Rsubread • 3.2k views
ADD COMMENT
2
Entering edit mode
7.4 years ago
dbpzdbpz ▴ 150

The new version of subread (1.4.5) will provide a function to ignore reads or fragments that have the 0x400 flag (the duplicate read flag). This version should be released in days.

ADD COMMENT
0
Entering edit mode

Sometimes in reserach "in days" can turn into weeks, months etc.

Do you know of any official release date for 1.4.5?

ADD REPLY
0
Entering edit mode

Sorry that it was delayed for weeks. The 1.4.5 version of subread (inc. featureCounts) was released yesterday on sf.net.

http://sourceforge.net/projects/subread/

ADD REPLY
0
Entering edit mode
7.4 years ago
Chris Fields ★ 2.2k

It doesn't look as if this is supported, though technically it shouldn't be hard to add (they already parse read bit flags and attributes). Maybe ask the authors? Mailing list is here:

https://groups.google.com/forum/#!forum/subread

ADD COMMENT

Login before adding your answer.

Traffic: 2445 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6