Question

RNA Seq analysis

0

Entering edit mode

2.3 years ago

Micro_bioinfo • 0

Dear all

I have tried to map raw RNA Seq (paired fastq) sequence with the reference fungal genome using STAR. My Log.final.out file looks like:

I want to know why only 26.01% of read Uniquely mapped with the reference genome? Is it due to the adapter sequence contamination (not provided by the service provider). If so how to identify and remove them.

I am a beginner in RNA Seq analysis and any suggestion in this regard will be highly useful in my work.

Thanks

reads percentage mapped Uniquely Low of • 979 views

ADD COMMENT • link 2.3 years ago by Micro_bioinfo • 0

0

Entering edit mode

Run fastqc to know whether adapters are present.

ADD REPLY • link 2.3 years ago by ATpoint 81k

0

Entering edit mode

Dear Sir

I have checked the fastqc results. It shows

Per Base Sequence Quality >35 Sequence Length Distribution 100 – 102 bp Adapter Content : Solid small RNA adapter very low or not available

Surprisingly multiqc_report showing the presence of adapter

enter image description here

Do I needs to perform any preprocessing of these fastq sequences before performing the mapping? How to find the adapters ?

or do I needs to make any changes in the used parameters.

STAR --genomeDir index/ --runThreadN 16 --readFilesIn 1_1.fastq 1_2.fastq --outFileNamePrefix results/ --outSAMtype BAM SortedByCoordinate --outSAMattributes Standard

thanks

ADD REPLY • link 2.3 years ago by Micro_bioinfo • 0

0

Entering edit mode

always assess the quality of your sequencing data first. You say you don't know what the adapter sequence is but it very well may be one of the standard Illumina sequences for which FastQC will recognize. Else, if the company deems this information proprietary you can certainly make an argument that they should at least trim adapters from your reads for you.

It might also help for interpreting your results to know what parameters you used when running STAR as these can impact how many of your reads end up mapped to the reference.

ADD REPLY • link 2.3 years ago by jv ★ 1.8k

0

Entering edit mode

I found only 0.53% adapter sequence in my fastq data. Will it cause errors during mapping?

ADD REPLY • link 2.3 years ago by Micro_bioinfo • 0