Error with per tile sequence quality (by FastQC)
1
0
Entering edit mode
2.0 years ago
Tong ▴ 10

NEW beginner

I have run some samples on Miseq. After a quality check by FastQC, some of my samples had error with per tile sequence quality at a specific position. If I am not wrong, I think it was caused by the preparation of flowcell becase the samples with this kind of error were shown at the same specific position.

My questions are...

  1. Please give me some explanations; what is the cause of this error?
  2. How to solve this problem or how to improve the quality of my sequences?
  3. Can I ignore this error?
  4. I have heard that FilterByTile might can help me on this, but I do not know how to use it. Please give me some suggestions

Thank you in advance.

Error with per tile sequence quality

NGS DNA-seq fastqc • 2.4k views
ADD COMMENT
0
Entering edit mode

I would not worry too much in this case since you had poor quality only in one tile, while the rest of the flowcell looks good. This should not affect downstream analysis. However, it would be good to have a look also at the plot "per base sequence quality". Can you upload it?

ADD REPLY
0
Entering edit mode

Sure! Here it is.

Per bas sequence quality

ADD REPLY
0
Entering edit mode

Marco Pannone Could you please tell me how to interprete thes two plots? Thank you.

ADD REPLY
0
Entering edit mode

You attached the plot under the section "per sequence quality scores", while I asked for the plot in "per base sequence quality".

ADD REPLY
0
Entering edit mode

I'm very sorry. You can now find it below. enter image description here.

ADD REPLY
1
Entering edit mode

You have not very good base quality towards the 3'-end of your reads, which is very common in sequencing, especially if you have quite long reads (like in your case). I would suggest you perform some trimming using a Phred score cutoff (<30 should be appropriate) in order to eliminate all low-quality reads. You can use any popular trimming tool, such as trimgalore (https://github.com/FelixKrueger/TrimGalore/blob/master/Docs/Trim_Galore_User_Guide.md#step-1-quality-trimming) or cutadapt (https://cutadapt.readthedocs.io/en/stable/guide.html). Look through the documentation and you should be able to make the appropriate command line by yourself. Does the FastQC reports issues in other fields too (overrepresented sequences, adapter content, etc.)?

ADD REPLY
0
Entering edit mode

Thank you for your suggestion.

Actually, these above plots are from the trimmed read with Q20 (99%). After trimming, there are only two issues remaining. Another one is sequence length distribution (you can see it below). enter image description here

In addition, there were many issues reported by FastQC before trimming. enter image description here enter image description here

However, this is the report with Q30 as you recommended. The same problems still exist. enter image description here

ADD REPLY
0
Entering edit mode
2.0 years ago
Prash ▴ 270

As Tong replied, the per tile sequence quality is just because of poor quality reads towards the end which could be further trimmed or ignored. If you are using WES, you should however be careful to check the variant caller pipeline is void of finding such mutations in those regions If you are using RNA-Seq, pl check the transcript read counts

In either case, you could use paired end read ( Read 2) and crosscheck

Hope this helps, Prash

ADD COMMENT
0
Entering edit mode

Prash Thank you for your comment. I think I am using DNA-Seq, and my next step for NGS analysis will be de novo assembly.

Is anything should be checked or concerned?

ADD REPLY
0
Entering edit mode

Nothing much except that, if you could use trimgalore carefully to trim additional bp from the 3' end of all reads to avoid problems

ADD REPLY

Login before adding your answer.

Traffic: 2496 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6