Question: How Can I Tell Which Adapter Was Used In A Sequence Read Archive (SRA) Sample?
11
10.0 years ago by
Doctoroots790
Doctoroots790 wrote:

Hi all, I want to perform some analysis on small RNA sequencing samples retrieved from the Sequence Read Archive (SRA). Since adapter contamination is common in small RNA sequencing results, it's important to clip the adapter from the retrieved FASTQ file before downstream analysis. To clip the adapter, I need to know its sequence.

So my question is: how do I find out which adapter was used in the sample preparation?

An example experiment: SRAsmallRNA_experiment

sra adaptor • 18k views
ADD COMMENTlink modified 9.9 years ago by Alastair Kerr5.3k • written 10.0 years ago by Doctoroots790
1

Found via Google ("illumina adapter sequence"): http://seqanswers.com/forums/showthread.php?t=198

ADD REPLYlink written 10.0 years ago by Michael Dondrup48k
1

Hi Michael, I know this sequence list, but it seems that sometimes a different adapter is used, or a similar adapter that differs by a few nucleotides at the beginning or end. I wanted to know if there is a way to find the exact adapter used. Thanks.

ADD REPLYlink written 10.0 years ago by Doctoroots790
14
10.0 years ago by
Alastair Kerr5.3k
Manchester/UK/Cancer Biomarker Centre at CRUK-MI
Alastair Kerr5.3k wrote:

Run the sequences through FastQC. It will also detect loads of other weirdness, including both adaptor and primer contamination, quality problems, etc. All the contaminant sequences are listed in the config file called "contaminant_list.txt".

ADD COMMENTlink modified 10.0 years ago • written 10.0 years ago by Alastair Kerr5.3k
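As a rough complement to the FastQC report suggested above, the adapter can often be spotted directly in the reads: in small RNA data the inserts differ from read to read, but the adapter that follows them is always the same, so its k-mers are shared by almost every read. The sketch below is not FastQC's algorithm; it is a minimal illustration that assumes an uncompressed four-line-per-record FASTQ file, and the function names and the choice of k=12 are arbitrary.

```python
from collections import Counter
from typing import Iterable, Iterator, List, Tuple


def fastq_sequences(lines: Iterable[str]) -> Iterator[str]:
    """Yield the sequence line of each 4-line FASTQ record."""
    for i, line in enumerate(lines):
        if i % 4 == 1:
            yield line.strip().upper()


def top_kmers(seqs: Iterable[str], k: int = 12, n: int = 5) -> List[Tuple[str, int]]:
    """Return the n k-mers that occur in the largest number of reads.

    Each k-mer is counted at most once per read, so the count is the
    number of reads containing it.
    """
    counts: Counter = Counter()
    for seq in seqs:
        counts.update({seq[i:i + k] for i in range(len(seq) - k + 1)})
    return counts.most_common(n)


def top_kmers_from_file(path: str, k: int = 12, n: int = 5,
                        max_reads: int = 100_000) -> List[Tuple[str, int]]:
    """Convenience wrapper: inspect only the first max_reads reads of a file."""
    with open(path) as handle:
        seqs = (s for i, s in enumerate(fastq_sequences(handle)) if i < max_reads)
        return top_kmers(seqs, k=k, n=n)
```

A k-mer that appears in nearly all reads is a good candidate for part of the adapter; it can then be compared against the published adapter lists to find the closest match, including variants that differ by a few nucleotides.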

Cool tool! Wish I had known about this earlier

ADD REPLYlink written 10.0 years ago by Michael Dondrup48k

I thought contaminant_list.txt is an input file, isn't it? There is an option on the fastqc command to specify a non-default file containing the list of contaminants to screen overrepresented sequences against.

ADD REPLYlink written 7.2 years ago by sckinta600
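To make the role of that file concrete, here is a minimal sketch of reading a contaminant list and screening a read against it. It assumes the layout described in the FastQC help text (one entry per line as name, tab, sequence, with `#` lines ignored). The entry names below are hypothetical placeholders, and the matching rule (the first 12 nt of the contaminant must occur in the read) is a simplification, not FastQC's own matching logic.

```python
from typing import Dict, List


def parse_contaminants(text: str) -> Dict[str, str]:
    """Parse 'name<TAB>sequence' lines; skip blank lines and '#' comments."""
    contaminants: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split at the last tab so that several tabs between the
        # name and the sequence are tolerated.
        name, _, seq = line.rpartition("\t")
        name = name.strip()
        if name and seq:
            contaminants[name] = seq.strip().upper()
    return contaminants


def find_hits(read: str, contaminants: Dict[str, str], min_len: int = 12) -> List[str]:
    """Return names of contaminants whose first min_len bases occur in the read."""
    read = read.upper()
    hits = []
    for name, seq in contaminants.items():
        probe = seq[:min_len]
        if len(probe) == min_len and probe in read:
            hits.append(name)
    return hits


# Hypothetical custom list; the names are placeholders for illustration only.
EXAMPLE_LIST = (
    "# name<TAB>sequence\n"
    "Example adapter A\t\tAGATCGGAAGAGCACACGTC\n"
    "Example adapter B\tTGGAATTCTCGGGTGCCAAGG\n"
)

if __name__ == "__main__":
    table = parse_contaminants(EXAMPLE_LIST)
    print(find_hits("TAGCTTATCAGACTGATGTTGATGGAATTCTCGGGT", table))
```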

Dear Alastair,

I use FastQC to find adapters. It reports "Illumina Universal Adapter" for some RNA-seq and small RNA-seq data. Do you know what its sequence is?

ADD REPLYlink modified 4.4 years ago • written 4.4 years ago by Edalat30

The BBMap suite includes the sequences of all popular adapters in a file called "adapters.fa", located in the resources directory inside the BBMap folder.

ADD REPLYlink written 4.4 years ago by GenoMax94k

Thank you for your attention. This file has the "TruSeq_Universal_Adapter" sequence, but I need the sequence of the "Illumina Universal Adapter".

ADD REPLYlink written 4.4 years ago by Edalat30

TruSeq is the name of the Illumina sequencing protocol, so they are the same. Recent HiSeq and MiSeq machines should be covered by the TruSeq version 3 adapter sequences; at least that is what works with Trimmomatic.

ADD REPLYlink written 4.4 years ago by Michael Dondrup48k
1

Dear Michael,

I sent an e-mail to the maintainer of FastQC and asked him about this adapter. He said: "The universal adapter isn't actually an adapter sequence as such, it's the common sequence which exists at the start of all of the Illumina adapter and primer sequences before they diverge into the different sub-variants. This is the first sequence you encounter when you read off the end of an insert on an Illumina run, so you can use it for detecting and trimming adapters without having to know exactly which one was used."

Is this just additional information, or does it mean I should remove the 3' and 5' terminal nucleotides to trim the universal adapter?

ADD REPLYlink modified 16 months ago by _r_am32k • written 4.4 years ago by Edalat30
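The "common sequence at the start" idea from the quoted e-mail can be illustrated by taking the longest common prefix of two adapter variants. The two sequences below are not from this thread; they are the TruSeq read 1 and read 2 adapter sequences as I understand them from Illumina's published adapter documentation, so verify them against that document before relying on them.

```python
from typing import Sequence


def common_prefix(seqs: Sequence[str]) -> str:
    """Return the longest prefix shared by every sequence in seqs."""
    if not seqs:
        return ""
    prefix = []
    # zip stops at the shortest sequence; compare column by column.
    for column in zip(*seqs):
        if len(set(column)) != 1:
            break
        prefix.append(column[0])
    return "".join(prefix)


# Assumed adapter sequences (check against Illumina's adapter documentation).
TRUSEQ_READ1 = "AGATCGGAAGAGCACACGTCTGAACTCCAGTCA"
TRUSEQ_READ2 = "AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT"

if __name__ == "__main__":
    print(common_prefix([TRUSEQ_READ1, TRUSEQ_READ2]))
```

Because every variant begins with the shared stretch, searching for that stretch alone is enough to locate the start of the adapter in a read, whichever variant was actually ligated.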

Just use the sequence of the universal adapter. Trimming programs will remove all sequence 3' of the point where they find the universal adapter. If you use bbduk.sh you can point it to the entire adapters.fa file and it will do the rest.

ADD REPLYlink written 4.4 years ago by GenoMax94k
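The behaviour described in the reply above (cut the read at the point where the adapter starts and discard everything 3' of it) can be sketched in a few lines. This is a simplified, exact-match illustration, not how bbduk.sh or any other trimmer is actually implemented; real tools also tolerate mismatches and sequencing errors. The function name and the `min_overlap` parameter are assumptions made for this example.

```python
def trim_adapter(read: str, adapter: str, min_overlap: int = 3) -> str:
    """Remove the adapter and everything 3' of it from a read.

    A full-length exact match is tried first. If none is found, the 3' end
    of the read is checked for a partial adapter (a prefix of the adapter
    that is at least min_overlap bases long).
    """
    pos = read.find(adapter)
    if pos != -1:
        return read[:pos]

    # Partial adapter at the 3' end: try the longest possible overlap first.
    longest = min(len(adapter), len(read)) - 1
    for k in range(longest, max(min_overlap, 1) - 1, -1):
        if read.endswith(adapter[:k]):
            return read[:len(read) - k]

    # No adapter detected; return the read unchanged.
    return read


if __name__ == "__main__":
    universal = "AGATCGGAAGAGC"  # assumed shared adapter start, for illustration
    print(trim_adapter("TTTGGGCCCAAA" + universal + "ACAC", universal))
    print(trim_adapter("TTTGGGCCCAAA" + "AGATC", universal))
```

Only the 3' side is touched: the insert at the 5' end of the read is kept as it is, which is why knowing the shared start of the adapter is sufficient for trimming.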

Then I should use the TruSeq universal adapter sequence when FastQC reports "Illumina Universal Adapter".

I only use Windows software; more precisely, I use only CLC Genomics.

ADD REPLYlink modified 4.4 years ago • written 4.4 years ago by Edalat30

You can use that sequence with CLC according to these directions.

ADD REPLYlink modified 4.4 years ago • written 4.4 years ago by GenoMax94k

Yes, I wanted to tell you that I cannot use BBMap and other non-Windows software.

I really appreciate your help.

ADD REPLYlink modified 4.4 years ago • written 4.4 years ago by Edalat30

BBMap is pure Java and will happily run on Windows as long as you have Java installed.

ADD REPLYlink written 4.4 years ago by GenoMax94k

Really?! Then I should try it. However, I think BBMap operates like CLC for adapter trimming.

ADD REPLYlink written 4.4 years ago by Edalat30