What makes up the sequence that follows an adaptor?
1
1
Entering edit mode
8.5 years ago
knowah ▴ 10

I've been mapping some bisulfite-seq data from SRA (SRR577617) and noticed that in the raw reads there are some sequences which have the adaptor at or near the 5' end, implying that no genomic DNA was sequenced (unless I'm mistaken, both reported adaptors should be trimmed on the 3' side, leaving an empty sequence after trimming). In that case, what makes up the several dozen base calls that follow the 3' end of the reported adaptor? Is this just noise caused by the sequencer (HiSeq) trying to call bases from a flowcell spot where no more synthesis is occurring? I would assume so but the sequences tend to be highly enriched for Ts and depleted in Cs, implying bisulfite converted DNA.

Edit

After looking more closely I realize that my assumption about the adaptors was wrong. The second adaptor listed in the SRA entry (which is similar to the RC of the other adaptor listed) was present on the 5' end, meaning the sequences I was seeing 3' of that adaptor were indeed biological.

next-gen-sequencing • 1.6k views
ADD COMMENT
1
Entering edit mode
8.5 years ago

I believe that Illumina systems will start producing all AAAAAs

ADD COMMENT
0
Entering edit mode

I think NextSeqs produce Gs in those cases, but they're different from everything else Illumina makes.

ADD REPLY
0
Entering edit mode

Thank you for your response -- it seems that is the case. However I was seeing different sequences 3' of the adaptor, which I now realize was due to the fact that that adaptor was present on the 5' end (see edit)

ADD REPLY

Login before adding your answer.

Traffic: 2934 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6