Question: Having made a paired-end and stranded-forward sequencing (NGS) implies that I will only get positive reading frames?
0
gravatar for sofiablancoglez
4 months ago by
University of Vigo
sofiablancoglez0 wrote:

Hello!

I'm doing a de novo assembly with reads obtained with Stranded type illumina sequencing. I guess it's Stranded Forward since the Read 1 map with the cDNA antisense and the Read 2 with the cDNA sense (and therefore they are in the opposite direction, Read 1 sense, Read 2 antisense). However, after doing the assembly de novo, I translate the transcripts obtained and the proteins present a negative frame. Wouldn't it be expected that all of them would have a positive frame since it is a stranded-forward protocol, where the reads sense would be taken as reference? Or am I getting confused and is it really a reverse protocol?

Greetings

ADD COMMENTlink modified 4 months ago • written 4 months ago by sofiablancoglez0

For de novo assembly, if not already completed, try Trinity.

Then you can use the tool transdecoder on your contigs to predict ORFs and get protein seqs etc.

ADD REPLYlink written 4 months ago by colindaven2.6k
0
gravatar for Istvan Albert
4 months ago by
Istvan Albert ♦♦ 86k
University Park, USA
Istvan Albert ♦♦ 86k wrote:

There are different protocols and the naming is unfortunately often confusing. What you describe is the most common protocol where the second in pair is in sense orientation.

Of course the assembler needs to be aware of the strandedness as well (and should be used with the appropriate parameters) and the actual meaning of the parameters is quite confusing, for example for Stringtie should I use -rf or -fr with the Illumina Trueseq, I always need to look it up and even then I am not sure that I got it right.

Long story short this sounds it was used the wrong way.

ADD COMMENTlink modified 4 months ago • written 4 months ago by Istvan Albert ♦♦ 86k
0
gravatar for sofiablancoglez
4 months ago by
University of Vigo
sofiablancoglez0 wrote:

Thank you very much for your answer! I detected the error. The sequencing protocol was Forward type and implies that the reads (the readings of the mould strings) taken as a reference for the assembly go in a 5'--3' direction, but these are complementary to the 'real' or mould strings and therefore the assembly results in 3'--5' transcripts (real direction of the sequences that were sequenced obtaining reads in opposite directions). Therefore when translated into proteins all the frames are negative.

The truth is that this is an unfortunate tongue twister...

ADD COMMENTlink written 4 months ago by sofiablancoglez0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2026 users visited in the last hour
_