rna-seq reads not passing fastqc
1
0
Entering edit mode
9 months ago
Luna • 0

Hello everyone,

I am analyzing RNA-Seq data and have encountered the following issues that I need clarification on: per base sequence content image

  • Per Base Sequence Content:
     The per-base sequence content varies throughout the read, not just in the first 15 positions. Is this indicative of a problem with the sequencing or the data itself?
    
  • Overrepresented Sequences:
     There are multiple overrepresented sequences in the data. After performing BLASTn, I found that they match the genome of the organism I am studying. Is it common to have such overrepresented sequences that align with the organism's genome, or does this point to a potential bias in library preparation?
    
  • Adapter Content:
     Despite not running Trimmomatic yet, there appears to be no adapter content in the raw data. However, the paper from which I obtained the dataset mentions using the Illumina TruSeq RNA kit for library preparation, which typically adds identifiable adapters. How is it possible that no adapter sequences are detected in the data?
    

Any insights into these issues would be greatly appreciated!

Thank you!

rna-seq fastqc • 941 views
ADD COMMENT
1
Entering edit mode

rna-seq reads not passing fastqc

The pass/fail metric definitions are editable limits in a config file. A failure in one of the FastQC metric does not immediately indicate that the data is bad, nor is that an indication that analysis needs to stop before all green check marks are obtained.

Always keep the context of the experiment in mind when interpreting FastQC results. Default limits/metrics defined in FastQC config file are for "plain" genome sequencing, so a number of other types of experiments (ChIPseq, ATACseq, RNAseq) can result in some test in FastQC "failing".

ADD REPLY
0
Entering edit mode

Thank you for the reply!

ADD REPLY
0
Entering edit mode

what is the phred score distribution?

ADD REPLY
0
Entering edit mode

above 30 throughout

ADD REPLY
4
Entering edit mode
9 months ago
ATpoint 89k

Don't overinterpret these fastqc things. Almost certainly the data are prefectly fine. I see nothing unusual here. You do not see adapters if the insert size of your fragments is larger than the sequencing length which is typically the case. It's fine. Do the alignment and downstream analysis and only go back if something is odd.

ADD COMMENT

Login before adding your answer.

Traffic: 5377 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6