Question: combine or split, that's a question for transcriptome assembly
0
gravatar for dukecomeback
23 months ago by
dukecomeback40
China
dukecomeback40 wrote:

I combined 10 libraries before doing tophat+cufflinks and trinity+PASA assembling, end up with each process gave me hundred thousands of predicted transcripts. I believe there must be a lot of false positive. But would it be better if I run with each library respectively, then use some tools like cuffmerge to merge the result assembly? Does anyone has experience comparing these? I would be really grateful for your sharing.

    Sincerely,
         Kang
rna-seq assembly • 718 views
ADD COMMENTlink modified 23 months ago by Rob3.3k • written 23 months ago by dukecomeback40

Since you are doing de novo assembly using trinity I assume you don't have a (decent) reference genome available? But still, you are using tophat? (Which is, as said by Rob, deprecated.). Please be as complete as possible when asking questions, information such as the organism you are working on is important.

ADD REPLYlink written 23 months ago by WouterDeCoster38k

I'm actually trying to build a genome annotation pipeline here. I hope to extract the overlap part from the two process to get some high quality genes.

ADD REPLYlink written 23 months ago by dukecomeback40
1
gravatar for Rob
23 months ago by
Rob3.3k
United States
Rob3.3k wrote:

By combining the samples prior to assembly, you increase the likelihood of generating (computationally) chimeric transcripts. You might try to assembly separately and then combine the assemblies using e.g. TACO. Also, TopHat has been deprecated by the developers. For reference-based assembly, you might try HISAT + StringTe instead of TopHat + Cufflinks.

ADD COMMENTlink written 23 months ago by Rob3.3k

Thank you so much for your sharing, I really didn't know the whole TopHat being deprecated thing.

ADD REPLYlink written 23 months ago by dukecomeback40

See this paper: Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. (If you can't get access to that publication, let me know and I'll -cough- help you.) There are also other alternatives, including alignment with STAR and bbmap, or pseudo-alignment using kallisto or salmon.

ADD REPLYlink written 23 months ago by WouterDeCoster38k

thanks for that link, WouterDeCoster, it can be accessed. :)

ADD REPLYlink written 23 months ago by dukecomeback40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1673 users visited in the last hour