Question

Direct cDNA nanopore strandedness discovery - 2D approach

4.9 years ago

ruisergioluis ▴ 40

I have recently started working with RNA nanopore MinION long reads. However, at the moment I have more doubts than answers.
From my (limited) understanding, there are 2/3 main approaches for RNA sequencing on the nanopore MinION:

direct RNA
direct cDNA (PCR-free)
cDNA (with a PCR step)

The data that I have is from the direct cDNA (PCR-free) protocol, and this is where my doubts start. From the nanopore website, it looks like this protocol generates 1D reads (so, reads with TTTTTTT at the start OR AAAAAAA at the end). Can you confirm this? However, looking at my trimmed FASTQ file, I have 2D reads, meaning reads that start with TTTTTTT AND end with AAAAAAA. Is this possible with the direct cDNA protocol? If so, can I infer the strandedness from these 2D data?

Thank you all in advance,

Best,

Rui Luís

nanopore long reads direct cDNA strand • 2.0k views

4.9 years ago by ruisergioluis ▴ 40

looking at my trimmed FASTQ file, I have 2D reads

How did you conclude this?

I have reads that start with TTTTTTT AND end with AAAAAAA

That's biologically unlikely, although you could have rare chimeric molecules.
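One way to see how common such reads actually are is to tally where the homopolymer runs sit in each read. The following is only a minimal sketch, assuming a standard four-line-per-record FASTQ; the names `classify_read` and `tally_reads` and the `min_run` and `window` thresholds are illustrative choices, not part of any nanopore tool.

```python
from collections import Counter


def classify_read(seq, min_run=7, window=50):
    """Label a read by where poly-T / poly-A runs appear.

    min_run and window are arbitrary illustrative thresholds.
    Reads shorter than `window` are effectively scanned whole.
    """
    seq = seq.upper()
    has_t_start = "T" * min_run in seq[:window]
    has_a_end = "A" * min_run in seq[-window:]
    if has_t_start and has_a_end:
        return "both"
    if has_t_start:
        return "polyT_start"
    if has_a_end:
        return "polyA_end"
    return "neither"


def tally_reads(lines):
    """Count read classes from an iterable of FASTQ lines.

    Assumes exactly four lines per record, so the sequence is
    every line whose index is 1 modulo 4.
    """
    counts = Counter()
    for i, line in enumerate(lines):
        if i % 4 == 1:
            counts[classify_read(line.strip())] += 1
    return counts
```

Passing an open file handle (for example `with open("reads.fastq") as fh: print(tally_reads(fh))`, where `reads.fastq` is a placeholder name) gives the proportion of reads in each class. A large "both" fraction would support the observation in the question; a small one would point to rare chimeras.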

4.9 years ago by WouterDeCoster 47k

Thank you for your comment! Please correct me if I'm wrong, but a 2D read comes from a double-stranded cDNA with a hairpin ligating the two strands at one end. So, since the majority of my reads start with TTTTT and end with AAAAA, and the reads look mirrored (the sequence up to the middle is the reverse complement of the sequence from the middle to the end), I concluded that I had 2D data. Is this the wrong way to think about it?

4.9 years ago by ruisergioluis ▴ 40

a 2D read comes from a double-stranded cDNA with a hairpin ligating the two strands at one end. So, since the majority of my reads start with TTTTT and end with AAAAA, and the reads look mirrored (the sequence up to the middle is the reverse complement of the sequence from the middle to the end),

Please talk to whoever generated the data to figure out exactly which protocol was used. If basecalling was performed correctly (and it looks like it wasn't), you should NOT get such mirrored reads, but rather the consensus of the template and complement reads.

That said, 2D sequencing is dead and deprecated, so your data must be rather old or non-standard.
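To put a number on how "mirrored" a read is, one could compare its start with the reverse complement of its end. This is a rough sketch using only the Python standard library; `mirror_score`, `window`, and `skip` are hypothetical names and defaults chosen for illustration, and the similarity ratio from `difflib` is a crude stand-in for a proper alignment.

```python
from difflib import SequenceMatcher

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def mirror_score(seq, window=200, skip=0):
    """Similarity (0..1) between the read start and the reverse
    complement of the read end.

    skip trims that many bases from both ends first. This matters
    because a poly-T start and a poly-A end are reverse complements
    of each other and would look "mirrored" on their own.
    """
    seq = seq.upper()
    if skip:
        seq = seq[skip:-skip]
    n = min(window, len(seq) // 2)
    if n == 0:
        return 0.0
    head = seq[:n]
    tail_rc = reverse_complement(seq[-n:])
    return SequenceMatcher(None, head, tail_rc, autojunk=False).ratio()
```

Scores close to 1 across many reads would suggest template and complement strands joined in a single read, while ordinary single-strand reads should score low once the homopolymer tails are skipped. Nanopore error rates mean even truly mirrored reads will not score exactly 1.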

4.9 years ago by WouterDeCoster 47k

Thank you so much for your answer. It helped a lot to direct my attention.

The data was produced using the most recent protocol, so it is probably entirely my fault. Right now I am working with the basecalled files sent by the facility. But, taking into account what you said, I think I should redo the basecalling step. Which software do you advise?

4.9 years ago by ruisergioluis ▴ 40

The data was produced using the most recent protocol.

Could it be 1D^2 data?

I think I should redo the basecalling step. Which software do you advise?

You should use the Guppy basecaller.

4.9 years ago by WouterDeCoster 47k

Could it be 1D^2 data?

Is this the latest cDNA PCR-free nanopore protocol?

For 1D², is it expected to find "mirror reads" (even with a wrongly performed basecalling step)?

4.9 years ago by ruisergioluis ▴ 40

Is this the latest cDNA PCR-free nanopore protocol?

Are you asking me which protocol was used to generate your data?

For 1D², is it expected to find "mirror reads" (even with a wrongly performed basecalling step)?

No, that would surprise me.

4.9 years ago by WouterDeCoster 47k

Thank you for your advice!

4.9 years ago by ruisergioluis ▴ 40