Question: Poly-G in head of read NovaSeq
0
gravatar for godth13teen
4 weeks ago by
godth13teen40
godth13teen40 wrote:

Hi, I recently got some problems with the output of NovaSeq 6000. I ran sample in paired mode, then found out that a portion of read 2 has poly-G (around 50bp) at the beginning. I understand that NovaSeq is a 2-color system, so the poly-G is likely signal lost, but I don't understand why it only appears at the beginning of the second read. I have considered:

  • DNA strand break from sample: if this happen, then the read 1 won't have signal of the break at all
  • DNA strand break from after fragmentation: if this happen, then the polyG will occur even at the tail of read 1.
  • Reverse strand break at head: then how can the tail still have base? I think it will falls out of the adapter, am I right?
  • Reverse strand break at tail: then polyG will occur at the tail, like many other reported case.

This phenomenon is really confusing and I haven't found any answer/explanation for it yet. I have tried it will new library prep kit but this still happens. I am using TruSeq DNA PCR-Free from Illumina

Any help/advice is warmly welcome!

Thank you

sequencing • 116 views
ADD COMMENTlink modified 4 weeks ago • written 4 weeks ago by godth13teen40

I agree, that's weird. I don't have experience with NovaSeq but with NextSeq it's usually all G's and due to problems in the DNA synthesis for the 2nd read (we assumed). I would ask an Illumina representative.

ADD REPLYlink written 4 weeks ago by Asaf8.3k

I ask Illumina representative but they haven't give me a clear answer yet, unfortunately, they suggest trim the 25G from the read to pass the fastqc, but I disagree with that method, it didn't fix the problem.

ADD REPLYlink written 4 weeks ago by godth13teen40

You have not told us what kind of libraries are these. If you are doing something non-standard then you need to consider non-standard solutions for downstream data processing. If you got this result with 2 different runs (even different libraries?) then it is a reproducible one. You may also want to consult the kit vendor to see what may be going on. May just be a bad lib prep kit.

ADD REPLYlink modified 4 weeks ago • written 4 weeks ago by genomax87k

I am using TruSeq DNA PCR-Free from Illumina, when I reported the problem, they give me 2 new kits for testing but the problem occurred again

ADD REPLYlink written 4 weeks ago by godth13teen40

I would throw any read that starts with a long G stretch

ADD REPLYlink written 4 weeks ago by Asaf8.3k

Yes, I also considered throwing away both the bad reads and their pair. But as I said, this method is just for data processing, not the real cause of the problem

ADD REPLYlink written 4 weeks ago by godth13teen40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 723 users visited in the last hour