Question

Why do I have sequential 100 G (GGGGGGGGGG...) in R2 FastQC report?

1

Entering edit mode

2.9 years ago

Giulia.cosenza ▴ 110

We did a shotgun paired-ended sequencing (200 cycles, 100 forward and 100 reverse) on a sample and than we took off the adapters and checked the fastQC report. R1 is quite good and doesn't show consistent problems, but R2 seems to still have adapters and there are a lot of overrepresented sequence of sequential 100 G (GGGGGGGGGG...), since its length the repetition is not just at the 3' but it characterized the entire fragment. How is this possible if besides R1 doesn't show sequential G stretches ? Any idea?

enter image description here

FastQC • 3.1k views

ADD COMMENT • link updated 2.9 years ago by Arup Ghosh 3.3k • written 2.9 years ago by Giulia.cosenza ▴ 110

0

Entering edit mode

Please include details related to the sequencing instrument/chemistry and how the adapter was trimmed.

ADD REPLY • link 2.9 years ago by Arup Ghosh 3.3k

score 4 · Answer 1 · 2022-08-12

4

Entering edit mode

2.9 years ago

rpolicastro 13k

G is absense of color in illumina sequencing, so it's likely the machine lost track of the clusters/spots when switching from R1 to R2 read sequencing.

ADD COMMENT • link 2.9 years ago by rpolicastro 13k

1

Entering edit mode

In addition to what rpolicastro said, if every read is poly-G in read 2 part of your run then there is a hadrware/software/reagent problem with the run. Your sequencing provider should have investigated this with Illumina instead of releasing the data.

ADD REPLY • link 2.9 years ago by GenoMax 152k

0

Entering edit mode

No, not every R2, you can see the % in the picture I posted

ADD REPLY • link 2.9 years ago by Giulia.cosenza ▴ 110

0

Entering edit mode

That's still a large percentage, so it may be worth talking to the sequencing provider.

ADD REPLY • link 2.9 years ago by rpolicastro 13k