Question

rna-seq reads not passing fastqc

0

Entering edit mode

9 months ago

Luna • 0

Hello everyone,

I am analyzing RNA-Seq data and have encountered the following issues that I need clarification on: per base sequence content image

Per Base Sequence Content:

 The per-base sequence content varies throughout the read, not just in the first 15 positions. Is this indicative of a problem with the sequencing or the data itself?

Overrepresented Sequences:

 There are multiple overrepresented sequences in the data. After performing BLASTn, I found that they match the genome of the organism I am studying. Is it common to have such overrepresented sequences that align with the organism's genome, or does this point to a potential bias in library preparation?

Adapter Content:

 Despite not running Trimmomatic yet, there appears to be no adapter content in the raw data. However, the paper from which I obtained the dataset mentions using the Illumina TruSeq RNA kit for library preparation, which typically adds identifiable adapters. How is it possible that no adapter sequences are detected in the data?

Any insights into these issues would be greatly appreciated!

Thank you!

rna-seq fastqc • 950 views

ADD COMMENT • link 9 months ago by Luna • 0

1

Entering edit mode

rna-seq reads not passing fastqc

The pass/fail metric definitions are editable limits in a config file. A failure in one of the FastQC metric does not immediately indicate that the data is bad, nor is that an indication that analysis needs to stop before all green check marks are obtained.

Always keep the context of the experiment in mind when interpreting FastQC results. Default limits/metrics defined in FastQC config file are for "plain" genome sequencing, so a number of other types of experiments (ChIPseq, ATACseq, RNAseq) can result in some test in FastQC "failing".

ADD REPLY • link 9 months ago by GenoMax 154k

0

Entering edit mode

Thank you for the reply!

ADD REPLY • link 9 months ago by Luna • 0

0

Entering edit mode

what is the phred score distribution?

ADD REPLY • link 9 months ago by 1769mkc ★ 1.3k

0

Entering edit mode

above 30 throughout

ADD REPLY • link 9 months ago by Luna • 0

score 4 · Answer 1 · 2025-01-14

Don't overinterpret these fastqc things. Almost certainly the data are prefectly fine. I see nothing unusual here. You do not see adapters if the insert size of your fragments is larger than the sequencing length which is typically the case. It's fine. Do the alignment and downstream analysis and only go back if something is odd.