Question: How to remove adapters from DNA-seq illumina reads?
0
gravatar for hed.robin
2.4 years ago by
hed.robin30
hed.robin30 wrote:

Hi all,

I have some metagenomics reads, I trim them with trimmomatic and remove adapters using some illumina adapters library I found on the internet.

Then I look at them in Fastqc and it says that "adapter content" is very bad and "kmer content" is very bad as well (which must mean that there are indeed lots of adapters), the question is -- how do I remove these adapters?

How does Fastqc know they are there? Is it using some adapter database? If so, I would like to use that database to remove them! Where is the Fullest Possible Illumina Adapters Library (because the one I'm using right now isn't full enough)? Or Fastqc is mistaking?

sequencing trimming • 1.6k views
ADD COMMENTlink modified 2.4 years ago by Macspider2.8k • written 2.4 years ago by hed.robin30
1

there is a nice tool called BBmap from which you can use bbduck.sh that contains what you need

and it would be nice if you can support us with the tool, command and parameters that would make the answer more clear

ADD REPLYlink modified 2.4 years ago • written 2.4 years ago by Medhat8.2k
1

Program is actually called bbduk.sh. BBMap suite also includes a adapters.fa file (in "resources" directory in source) that includes common commercial adapter sequences so you don't need to search for dodgy sources on the net or type the sequences in by hand.

ADD REPLYlink modified 2.4 years ago • written 2.4 years ago by genomax66k

according to the author documentation he wrote

I'd like to introduce another member of the BBTools package, BBDuk.

which will take you to the BBmap web (maybe I am mistaken about it)

ADD REPLYlink written 2.4 years ago by Medhat8.2k

duk stands for decontamination using k-mers. You wrote bbduck.sh in the post above (which is a mistake I sometimes do). I was just correcting the program name. All BBTools are included in a single download.

ADD REPLYlink modified 2.4 years ago • written 2.4 years ago by genomax66k

you are totally right I did not see this C (my bad :) )

ADD REPLYlink written 2.4 years ago by Medhat8.2k

@hed.robin Yes. FastQC uses a database kept inside Configuration directory. Look for files: adapter_list.txt and contaminant_list.txt

ADD REPLYlink written 2.4 years ago by Persistent LABS740
0
gravatar for Macspider
2.4 years ago by
Macspider2.8k
Vienna - BOKU
Macspider2.8k wrote:

using some illumina adapters library I found on the internet

And did you verify that those are the same ones that are used in the sequencing reaction that was performed to realize your data? I've been using trimmomatic a lot and never had problems, I guess that you might have specified the wrong adapters!

Check your sequencing information: which Illumina technology was used?

Then, search for it here: http://support.illumina.com/content/dam/illumina-support/documents/documentation/chemistry_documentation/experiment-design/illumina-adapter-sequences_1000000002694-01.pdf

You'll find the right sequences. ;)

ADD COMMENTlink written 2.4 years ago by Macspider2.8k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1778 users visited in the last hour