Question: About clustering genes isoforms in RNA-Seq
1
gravatar for Lucas Peres
2.3 years ago by
Lucas Peres60
Brazil, Belém
Lucas Peres60 wrote:

Hello everyone,

I am studying RNA-Seq pipelines and would like some advice on how to deal with genes isoforms. Since we deal with sequencing of eucaryotes at the lab, such occurrence is very common. We use Trinity for de novo transcriptome assembly and we intend to use CD-HIT (cd-hit-est), at first, to cluster the isoforms. The questions are: the best time to perform this step is before or after the transcriptome assembly? Can CD-HIT be used as a preprocessing tool to remove duplicates/redundant reads after sequencing? Or is it best used to improve the assembly afterwards (as we are intending)?

If you know other approaches/tools to handle this situation, please let me know.

Thank you in advance! :)

rna-seq • 1.0k views
ADD COMMENTlink written 2.3 years ago by Lucas Peres60
1

Hi Lucas, Check out Corset. They also discuss CD-HIT in that paper.

ADD REPLYlink written 2.3 years ago by Jake Warner800

Thank you Jacob. I'll take a look.

ADD REPLYlink written 2.3 years ago by Lucas Peres60
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1401 users visited in the last hour