Question: If I make a mistake in the library type parameter of a Trinity assembly, do I need to redo the assembly?
4 months ago, biologo wrote:

Dear friends,

I was running a Trinity assembly, but I made a small mistake: for the library type parameter (--SS_lib_type in Trinity) I used FR, when it should have been RF. I only noticed when I ran BLAST and found that known genes matched the reverse strand. Because Trinity takes a lot of time and CPU, can I just reverse-complement the Trinity.fasta file instead of redoing the assembly? Would that work?

Thank you so much for your help.
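
The strand observation described in the question can be checked across all hits rather than gene by gene. This is a minimal sketch, assuming BLASTN tabular output (`-outfmt 6`, default 12 columns), where a minus-strand hit is reported with the subject start greater than the subject end. The function name and the sample rows are hypothetical.

```python
from collections import Counter

def count_hit_strands(blast_lines):
    """Count plus- and minus-strand hits in BLASTN tabular (-outfmt 6) rows."""
    counts = Counter()
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Columns 9 and 10 (0-based 8 and 9) are sstart and send.
        sstart, send = int(fields[8]), int(fields[9])
        counts["minus" if sstart > send else "plus"] += 1
    return counts

# Hypothetical rows: two hits on the minus strand, one on the plus strand.
example_rows = [
    "geneA\tTRINITY_DN1_c0_g1_i1\t99.0\t500\t5\t0\t1\t500\t700\t201\t0.0\t900",
    "geneB\tTRINITY_DN2_c0_g1_i1\t98.5\t400\t6\t0\t1\t400\t450\t51\t0.0\t700",
    "geneC\tTRINITY_DN3_c0_g1_i1\t97.0\t300\t9\t0\t1\t300\t10\t309\t0.0\t500",
]
print(count_hit_strands(example_rows))
```

If nearly all known genes land on the minus strand, that points to a wrong library orientation rather than to a few antisense transcripts.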

rna-seq assembly

No, you'll need to redo it, but you should get a much better assembly.

Colin

Reply written 4 months ago by colindaven
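
A rerun with the corrected orientation might be set up as sketched below. The options `--seqType`, `--left`, `--right`, `--SS_lib_type`, `--CPU`, `--max_memory` and `--output` are Trinity options. The read file names, resource values, output directory and the helper function name are placeholders, not taken from this thread.

```python
import shlex

def build_trinity_command(left, right, lib_type="RF", cpu=8,
                          max_memory="50G", output="trinity_out_RF"):
    """Build a Trinity command line for paired-end, strand-specific reads."""
    # FR and RF are the paired-end orientations accepted by --SS_lib_type.
    if lib_type not in {"FR", "RF"}:
        raise ValueError(f"unsupported paired-end library type: {lib_type}")
    return [
        "Trinity",
        "--seqType", "fq",
        "--left", left,
        "--right", right,
        "--SS_lib_type", lib_type,
        "--CPU", str(cpu),
        "--max_memory", max_memory,
        "--output", output,
    ]

# Placeholder file names; substitute the trimmed read files actually used.
cmd = build_trinity_command("reads_1.trimmed.fq.gz", "reads_2.trimmed.fq.gz")
print(shlex.join(cmd))
# To start the assembly: import subprocess; subprocess.run(cmd, check=True)
```

Writing to a new output directory keeps the earlier FR run available for comparison.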

I got it, thank you. Your extra advice reminds me that the quality is actually not good: even after isolating the longest unigenes, I still got over 430,000 transcripts.

Reply written 4 months ago by biologo
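
Isolating the longest unigene could look like the following sketch. It assumes Trinity's default header style, in which the transcript ID ends in an isoform suffix such as `_i2` and the part before it names the gene (for example `TRINITY_DN1_c0_g1_i2`). The function names and the sample records are hypothetical.

```python
import re

def parse_fasta(lines):
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line and header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)

def longest_isoform_per_gene(records):
    """Keep the longest isoform for each Trinity gene ID."""
    best = {}
    for header, seq in records:
        transcript_id = header.split()[0]
        # Assumption: stripping the trailing _i<N> gives the gene ID.
        gene_id = re.sub(r"_i\d+$", "", transcript_id)
        if gene_id not in best or len(seq) > len(best[gene_id][1]):
            best[gene_id] = (header, seq)
    return list(best.values())

# Hypothetical records: one gene with two isoforms, one gene with a single isoform.
fasta_lines = """>TRINITY_DN1_c0_g1_i1 len=8
ACGTACGT
>TRINITY_DN1_c0_g1_i2 len=12
ACGTACGTACGT
>TRINITY_DN2_c0_g1_i1 len=4
ACGT
""".splitlines()

kept = longest_isoform_per_gene(parse_fasta(fasta_lines))
for header, seq in kept:
    print(header.split()[0], len(seq))
```

A count that stays in the hundreds of thousands after this step suggests many short single-isoform fragments rather than isoform redundancy.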

That's a lot of transcripts, and they are probably highly fragmented. I assume you did read trimming beforehand, right? Check your data with FastQC before and after trimming too.

Reply written 4 months ago by colindaven (modified 4 months ago)
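
Fragmentation can be put into numbers so that assemblies are compared on more than the transcript count. This is a minimal sketch that computes the count, total bases, median length and N50 from a list of transcript lengths. The function name and the example lengths are hypothetical.

```python
from statistics import median

def assembly_stats(lengths):
    """Return count, total bases, median length and N50 for transcript lengths."""
    if not lengths:
        raise ValueError("no sequences given")
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    n50 = ordered[-1]
    for length in ordered:
        running += length
        # N50: length of the transcript at which half of all bases are covered.
        if running >= total / 2:
            n50 = length
            break
    return {
        "count": len(ordered),
        "total_bases": total,
        "median": median(ordered),
        "n50": n50,
    }

# Hypothetical lengths; real values would come from the assembled FASTA file.
print(assembly_stats([100, 200, 300, 400, 500]))
```

A very large count combined with a low median and N50 would be consistent with a highly fragmented assembly.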

Yes, I did. I trimmed the adapters and filtered out short or low-quality reads, but the Trinity result did not improve. Thank you for your patience and your kind reply.

Reply written 4 months ago by biologo

You might try out Bridger.

Reply written 4 months ago by Vijay Lakhujani

Yes, I also considered that, but my colleague suggested I use inGAP-CDG. Have you ever used it? He told me it works really well. I also ran a test, and it left only 7,000,000 transcripts, which seems good, but I still have doubts about the result.

Reply written 4 months ago by biologo
4 months ago, colindaven wrote:

It might be low-quality data, but please tell us step by step exactly what was done. You can also run it through a Trinity workflow, for example on Galaxy Main.

How many reads do you have?

Another assembler might help too: CLC is easy and quite good, and SOAPdenovo-Trans has a decent reputation.

Also, just map your transcripts to the genome (I like GMAP), then visualize with JBrowse, IGV or similar. Are multiple redundant transcript fragments present?


Accepted, but the thing is that there is no reference genome for this species. For several reasons the genome assembly is really hard (very high AT content and many repeat regions), so we just want to focus on the transcriptome assembly. Thank you for your kind reply.

Reply written 4 months ago by biologo