Question: FastX Collapser and duplicate removal
0
gravatar for seta
2.0 years ago by
seta1.1k
Sweden
seta1.1k wrote:

Hi all,

I don't work with FastX collapse, but I read that it combines identical reads to a single read and keeps count of the reads. However, I didn't understand its meaning from "keeps count of the read". Could you please a bit explain to me? if it meant the program show the count of reads in, for example, in the header of the single read or keeps in the memory or what else?

Thanks

ADD COMMENTlink written 2.0 years ago by seta1.1k

I would suggest the you switch to clumpify.sh from BBMap suite. You can dedupe the data and it will keep a count of duplicate reads in the header of the deduplicated file (count=).

ADD REPLYlink written 2.0 years ago by genomax67k

Thanks, genomax2 for your suggestion. I don't want to use FastX collapser now, just to understand what it does. Please kindly tell me if it also shows a count of reads in the header of the single (deduplicated) reads?

ADD REPLYlink modified 2.0 years ago • written 2.0 years ago by seta1.1k

Should be easy enough to test. Sorry I have not used fastx-toolkit for a few years now.

ADD REPLYlink written 2.0 years ago by genomax67k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1198 users visited in the last hour