I'm working on an RNA-seq project, and FastQC keeps flagging overrepresented sequences consisting of poly(C) followed by poly(T). I see a range from
CCCCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT to CCCCCCCCCCCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
I know the poly(T) is probably from how the RNA was enriched, but where is the poly(C) coming from? Has anybody else seen this before?