Question: Trimming of RNA-seq reads
1
4 days ago by fatimarasool (10)
fatimarasool wrote:

I am working on the wheat genome and want to do a comparative genome analysis of 3 wheat varieties. I have sequence files in FASTQ format (Illumina 1.9), and I checked the read quality with FastQC. The GC content (47-49%) is flagged as outside the normal range. What is the normal %GC for RNA-seq reads? The k-mer content is also flagged. How can I correct it? Trimming requires an adapter sequence file, but I don't have one. Is it possible to fix these two errors, and if so, how? Or can I skip the trimming step and go straight to mapping?

In this file, every FastQC module passes except k-mer content and GC content. Does it need trimming? If so, how should I do it?

file:///home/comsats-ra/fatimamphilldata/G1_cleaned_R1_fastqc.html#M11

sequencing rna-seq alignment • 160 views
modified 4 days ago by chen • written 4 days ago by fatimarasool
2
4 days ago by Devon Ryan (73k)
Freiburg, Germany
Devon Ryan wrote:

Many FastQC modules are expected to fail on RNA-seq datasets. GC content should be similar across samples, but otherwise you can ignore a "Fail" in FastQC there. Similarly, enriched k-mers are expected. You should not attempt to correct this; it's already correct.
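
As a rough way to check that GC content is similar across samples outside of FastQC, here is a minimal Python sketch. It assumes standard 4-line FASTQ records (plain or gzipped); the function names are made up for illustration and are not part of any tool mentioned here.

```python
import gzip
import sys


def open_fastq(path):
    """Open a plain or gzip-compressed FASTQ file in text mode."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "rt")


def gc_percent(handle):
    """Return the overall %GC over all reads in a FASTQ stream.

    Assumes 4-line records, so the sequence is the 2nd line of each record.
    Uncalled bases (N) are left out of the denominator.
    """
    gc = total = 0
    for i, line in enumerate(handle):
        if i % 4 == 1:
            seq = line.strip().upper()
            gc += seq.count("G") + seq.count("C")
            total += len(seq) - seq.count("N")
    return 100.0 * gc / total if total else 0.0


if __name__ == "__main__":
    # Pass one FASTQ file per sample on the command line.
    for path in sys.argv[1:]:
        with open_fastq(path) as handle:
            print(f"{path}\t{gc_percent(handle):.1f}")
```

If the three varieties give values close to each other, the FastQC GC warning is probably nothing to act on.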

You can trim reads with Trim Galore!, which has the default adapters built in. Having said that, it's quicker to just use STAR for alignment, in which case you don't need to bother trimming adapters, since by default STAR aligns locally and soft-clips read ends that do not match the reference.
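
If you want a quick feel for how much adapter is in the reads before deciding, a small Python sketch like the one below can count reads that contain a common adapter prefix. The three prefixes are, to my knowledge, the ones commonly used for adapter auto-detection (Illumina universal, Nextera, Illumina small RNA); the function name and the 100,000-read cap are arbitrary choices for illustration, and 4-line FASTQ records are assumed.

```python
# Commonly used adapter prefixes (assumption: your library uses one of these).
ADAPTER_PREFIXES = {
    "Illumina universal": "AGATCGGAAGAGC",
    "Nextera": "CTGTCTCTTATA",
    "Illumina small RNA": "TGGAATTCTCGG",
}


def adapter_fractions(handle, max_reads=100000):
    """Return the fraction of reads containing each adapter prefix.

    Only the first max_reads reads of the FASTQ stream are examined.
    """
    counts = dict.fromkeys(ADAPTER_PREFIXES, 0)
    n = 0
    for i, line in enumerate(handle):
        if i % 4 != 1:  # sequence is the 2nd line of each 4-line record
            continue
        if n >= max_reads:
            break
        n += 1
        seq = line.strip().upper()
        for name, prefix in ADAPTER_PREFIXES.items():
            if prefix in seq:
                counts[name] += 1
    return {name: (c / n if n else 0.0) for name, c in counts.items()}
```

Call it with an open text handle, for example `adapter_fractions(open("reads.fastq"))`, where `reads.fastq` is a placeholder file name. Very low fractions suggest trimming will change little.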

2
4 days ago by Gjain (5.1k)
Göttingen, Germany
Gjain wrote:

You can also look into DNApi: de novo adapter prediction algorithm for small RNA sequencing data

link: https://github.com/jnktsj/DNApi

DNApi is a de novo (iterative) adapter prediction algorithm for small RNA sequencing data. It requires Python (2 or 3) under a Linux/Unix environment and accepts (un)compressed FASTQ files or redirected standard input (stdin) as input. You can simply run:

$ python dnapi.py <fastq>

or

$ <process-generates-fastq> | python dnapi.py -

To see the detailed usage, type:

$ python dnapi.py [-h | --help]

DNApi can predict most 3′ adapters correctly with the default parameters. However, if you want to tweak the parameters or run other prediction modes, see [prediction modes and parameters](https://github.com/jnktsj/DNApi#prediction-modes-and-parameters) for more detail.

2
4 days ago by chen (1.2k)
OpenGene
chen wrote:

You can use fastp to trim adapters from Illumina sequencing data without needing to know the adapter sequences.

Just download fastp and run:

fastp -i in.fq -o out.fq

That's all: out.fq contains the adapter-trimmed reads.

For paired-end data, the command is:

fastp -i in1.fq -o out1.fq -I in2.fq -O out2.fq

Gzip is supported for both input and output.
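
With several samples, the same two commands can be scripted. Below is a minimal Python sketch that only uses the `-i`, `-o`, `-I` and `-O` options shown above; the helper names are made up for illustration, and it assumes `fastp` is installed and on your PATH.

```python
import shutil
import subprocess


def fastp_command(in1, out1, in2=None, out2=None):
    """Build the fastp argument list for single-end or paired-end input."""
    cmd = ["fastp", "-i", in1, "-o", out1]
    if in2 is not None:
        if out2 is None:
            raise ValueError("paired-end input needs an output file for read 2")
        cmd += ["-I", in2, "-O", out2]
    return cmd


def run_fastp(samples):
    """Run fastp once per (in1, out1, in2, out2) tuple in samples."""
    if shutil.which("fastp") is None:
        raise RuntimeError("fastp not found on PATH")
    for sample in samples:
        # check=True stops at the first sample where fastp fails.
        subprocess.run(fastp_command(*sample), check=True)
```

For example, `run_fastp([("s1_R1.fq.gz", "s1_R1.trim.fq.gz", "s1_R2.fq.gz", "s1_R2.trim.fq.gz")])` would process one paired-end sample; the file names here are placeholders.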
