Question: Direct cDNA nanopore strandness discover - 2D approach
0
gravatar for ruisergioluis
12 months ago by
ruisergioluis30 wrote:

I have recently started working with RNA nonopore MinIon long reads. However, at the moment I have more doubts than answers.
From my (weak) understanding, there are 2/3 main approaches regardless to RNA nanopore miniIon:

  1. direct RNA
  2. direct cDNA (PCR-free)
  3. cDNA (with a PCR step)

The data that I have is from the direct cDNA (PCR-free protocol). And my doubts start here. From the nanopore website, it looks like this protocol generates 1D reads (so, reads with TTTTTTT at the start OR AAAAAAA at the ending). Can you confirm it? However, looking to my fastq-trimmed file, I have 2D reads, which means I have reads that start with TTTTTTT AND end with AAAAAAA. Does it be possible for direct cDNA protocol. If so, Can I conclude the strandness from these 2D data?

Thank you all in advance,

Best,

Rui Luís

ADD COMMENTlink written 12 months ago by ruisergioluis30

looking to my fastq-trimmed file, I have 2D reads

How did you conclude this?

I have reads that start with TTTTTTT AND end with AAAAAAA

That's biologically unlikely, although you could have rare chimeric molecules.

ADD REPLYlink written 12 months ago by WouterDeCoster43k

Thank you for your comment! Correct me please if I'm wrong, but a 2D read is one that has cDNA double strand with a hairpin making the ligation between them in one of the sides. So, if the majority of my reads start with TTTTT and end with AAAAA, and the reads look a mirror (the sequence until the middle is the reverse complement of the sequence from the middle to the end), I concluded that I had 2D data. Is it a wrong way to think?

ADD REPLYlink written 12 months ago by ruisergioluis30

a 2D read is one that has cDNA double strand with a hairpin making the ligation between them in one of the sides. So, if the majority of my reads start with TTTTT and end with AAAAA, and the reads look a mirror (the sequence until the middle is the reverse complement of the sequence from the middle to the end),

Please talk to the one who generated the data to figure out which protocol exactly was used. If basecalling was performed correctly (and it looks like it wasn't) you should NOT get such mirrored reads, but rather the consensus of the template and complement read.

That said, 2D sequencing is dead and deprecated, so your data must be rather old or non-standard.

ADD REPLYlink written 12 months ago by WouterDeCoster43k

Thank you so much for your answer. It helped a lot to direct my attentions.

The data was produced using the most recent protocol. So, it is probably my completely fault. Right now I am working with the basecalled files sent by the facility. But, having into account what you said, I think I should re-make the basecalling step. What is the software that you advise?

ADD REPLYlink written 12 months ago by ruisergioluis30

The data was produced using the most recent protocol.

Could it be 1D^2 data?

I think I should re-make the basecalling step. What is the software that you advise?

You should use the Guppy basecaller.

ADD REPLYlink written 12 months ago by WouterDeCoster43k

Could it be 1D^2 data?

Is this the latest cDNA PCR-free nanopore protocol?

For the 1D², is expected to find out the "mirror reads"? (even for a wrongly performed basecalling step)

ADD REPLYlink written 12 months ago by ruisergioluis30

Is this the latest cDNA PCR-free nanopore protocol?

Are you asking me which protocol was used to generate your data?

For the 1D², is expected to find out the "mirror reads"? (even for a wrongly performed basecalling step)

No, that would surprise me.

ADD REPLYlink written 12 months ago by WouterDeCoster43k

Thank you for your advice!

ADD REPLYlink written 12 months ago by ruisergioluis30
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 957 users visited in the last hour