How to remove poly T in RNA-sequencing data
4
3
Entering edit mode
8.9 years ago
seta ★ 1.9k

Hi all,

I'm using Trim_galore tool for quality and adaptor trimming, but I found no option to remove poly T in the sequencing data resulted from poly A enrichment libraries sequencing. Could anybody please let me know how should remove poly T in data? Thanks

RNA-Seq alignment Assembly sequencing • 12k views
ADD COMMENT
1
Entering edit mode

Have you tried mapping the RNA-seq data yet? It may not be necessary to remove polyA stretches because they won't map to a unique location and you may already have enough reads to not worry about it. Another poster asked a similar question: Statistics About Poly-A Tails In Rnaseq Reads .

However there are other posts that may help you such as: Trim Poly-A Tail And Trailing Nucleotides

ADD REPLY
1
Entering edit mode
8.9 years ago
h.mon 35k

It is not clear-cut trimming poly-A tails is beneficial, are you certain you need to trim the poly-A tail? As Jason pointed, many short read mappers won't be affected by poly-A tails. For assembly, it may help discern splice variants (see comment on MIRA's manual).

ADD COMMENT
1
Entering edit mode
8.8 years ago
chen ★ 2.5k

There is a tool available on Github for removing PolyA, PolyT, PolyC, PolyG

https://github.com/haploxer/after

  • Automatic Filtering, Trimming, and Error Removing for fastq data
  • Currently it supports Illumina 1.8 or newer format
  • AFTER can simply go through all fastq files in a folder and then output a good folder and a bad folder, which contains good reads and bad reads of each fastq file

Besides remove PolyX, it also can do:

  • Trim reads at front and tail according to bad per base sequence content
  • Detect and eliminate bubble artifact caused by sequencer due to fluid dynamics issue
  • Filter low-quality reads
ADD COMMENT
0
Entering edit mode
8.9 years ago
michael.ante ★ 3.8k

You can use bbduk from the bbmap suite. Just create a polyA.fa (e.g. >polyA\nAAAAAAAAAAAAA) and zip it into the bbmap resources folder; run then bbduk with ref=resources/polyA.fa.gz. Maybe it is necessary to add a polyT sequence into your fasta.

I'm quite sure you can add the sequences to the adapter-file of trim_galore likewise.

Cheers,
Michael

ADD COMMENT
0
Entering edit mode

Thanks for all help.

ADD REPLY
0
Entering edit mode
7.9 years ago

Hello,

You can also try UrQt.

It performs poly-N trimming as well as quality trimming, searching for the best larger fragment in the whole read. It has a high percentage of base conservation. It is pretty effective.

ADD COMMENT

Login before adding your answer.

Traffic: 1623 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6