Question: How to remove adaptors if you don't know their names/sequences? (eg from public data)
gravatar for rioualen
3.6 years ago by
rioualen460 wrote:


I would like to use cutadapt and/or trimmomatic to trim my sequences. I'm usually analyzing data from GEO or Array Express, and it's not always possible to get information about adaptors. Also, it's not always possible to trust information provided there...

I can see that fastQC detects adaptor sequences, including those with a slightly different sequence. So I'm thinking there should be a way to trim adaptors automatically, without knowing their sequence or name?

Thanks for tips

Edit: just found out FastQC uses a file called "contaminant_list.txt". I guess I could write some script to look for these adapters in my data, but I'm not sure it's the best way to go though.

ADD COMMENTlink modified 3.6 years ago by ivivek_ngs5.0k • written 3.6 years ago by rioualen460
gravatar for ivivek_ngs
3.6 years ago by
Seattle,WA, USA
ivivek_ngs5.0k wrote:

use cutadapt or trimmotic or BBmap (BBmap includes popular adapter.fa file which can be used) Ideally if its Illumina sequences then you can have the custom file for the sequences and use them for removal. In other way is to check the over-represented sequences which can also be checked and removed.

Check the links below

Link 1


ADD COMMENTlink written 3.6 years ago by ivivek_ngs5.0k

Thanks for the links, I'll look into it.

I need to integrate that in my snakemake pipelines and I'd like it to be compatible with multiQC. Trim_galore should do the trick!

ADD REPLYlink written 3.6 years ago by rioualen460
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1109 users visited in the last hour