Quality check of MeDIP-seq data
7.4 years ago

Dear Sir/Madam,

Hi

I have a question about quality checking with FastQC. I ran FastQC on Illumina MeDIP-seq data, and the following modules did not pass (images attached):

  1. Per base sequence content
  2. Per sequence GC content
  3. Sequence Length Distribution
  4. Sequence Duplication Levels
  5. Overrepresented sequences
  6. Kmer Content

https://postimg.org/gallery/1eoe8a78g/

The remaining modules passed. Is it fine to go ahead with alignment, or should I trim the reads first?

Thank you

Fastqc qualityanalysis MedipSeq • 2.3k views

You didn't find any overrepresented sequences?


Yes, one of the FASTQ files had this overrepresented sequence: GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG

The other FASTQ file looks good.

I trimmed the data with Trimmomatic and Trim Galore (default settings) in Galaxy and reran FastQC, but the QC plots still look the same.

Please help.
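
To get a feel for how widespread the all-G reads are, something like the following could be run on each FASTQ file. It is only a rough sketch: the function names and the 0.9 cutoff are made up for illustration, and it assumes plain, uncompressed 4-line FASTQ records.

```python
import io
from typing import Iterable, Iterator, TextIO


def read_fastq_sequences(handle: TextIO) -> Iterator[str]:
    """Yield the sequence line of each 4-line FASTQ record."""
    for i, line in enumerate(handle):
        if i % 4 == 1:  # second line of every record holds the bases
            yield line.strip()


def polyg_read_fraction(seqs: Iterable[str], min_g_fraction: float = 0.9) -> float:
    """Fraction of non-empty reads whose bases are at least min_g_fraction G.

    The 0.9 default is an arbitrary illustrative cutoff, not a standard value.
    """
    total = 0
    poly_g = 0
    for seq in seqs:
        if not seq:
            continue
        total += 1
        if seq.count("G") / len(seq) >= min_g_fraction:
            poly_g += 1
    return poly_g / total if total else 0.0


# Tiny in-memory example; a real file would be opened with open("reads.fastq")
example = "@r1\nGGGGGGGGGG\n+\nIIIIIIIIII\n@r2\nACGTACGTAC\n+\nIIIIIIIIII\n"
fraction = polyg_read_fraction(read_fastq_sequences(io.StringIO(example)))
print(f"polyG reads: {fraction:.1%}")
```

If only a small share of reads is affected, the FastQC warning may matter less than it looks, but that is a judgment call for the data at hand.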


Is this NextSeq or MiniSeq data?


Yes, it's from a NextSeq.


That explains the polyG stretches: they come from the two-colour chemistry, where a G is called when there is no signal. It is probably best to trim the polyG tails. For more information, see this post on qcfail.
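
As a rough illustration of what trimming polyG tails means, here is a minimal Python sketch. The function name and the minimum run length of 10 are assumptions chosen for the example; dedicated trimming tools handle this more carefully (for instance by taking base qualities into account), so treat this as a way to see the idea rather than a replacement for them.

```python
import re
from typing import Tuple


def trim_polyg_tail(seq: str, qual: str, min_run: int = 10) -> Tuple[str, str]:
    """Remove a trailing run of G bases (length >= min_run) from a read.

    The quality string is cut at the same position so both stay in sync.
    min_run=10 is an illustrative default, not a recommended setting.
    """
    match = re.search(r"G{%d,}$" % min_run, seq)
    if match is None:
        return seq, qual  # no long G tail, leave the read untouched
    cut = match.start()
    return seq[:cut], qual[:cut]


# A read ending in 12 G's is cut back to its first four bases
print(trim_polyg_tail("ACGT" + "G" * 12, "I" * 16))
```

Reads that consist entirely of G end up empty after this step and would normally be dropped by a minimum-length filter.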


What about the Kmer content?

7.4 years ago
mks002 ▴ 220

A simple check is the per base sequence quality result from FastQC. If quality drops along the read, trim on quality and then rerun FastQC on the trimmed data.
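
For anyone wondering what quality-based trimming does under the hood, here is a small sketch loosely modelled on the sliding-window idea. It assumes Phred+33 quality encoding; the function names, window size of 4, and mean-quality threshold of 20 are illustrative choices, not the exact behaviour of any particular tool.

```python
from typing import List, Tuple


def phred33(qual: str) -> List[int]:
    """Decode a Phred+33 quality string into integer scores."""
    return [ord(c) - 33 for c in qual]


def sliding_window_trim(seq: str, qual: str, window: int = 4,
                        min_mean: float = 20.0) -> Tuple[str, str]:
    """Cut the read at the first window whose mean quality is below min_mean.

    Reads shorter than the window are returned unchanged.
    """
    scores = phred33(qual)
    for start in range(max(len(scores) - window + 1, 0)):
        win = scores[start:start + window]
        if sum(win) / window < min_mean:
            return seq[:start], qual[:start]
    return seq, qual


# 'I' decodes to Q40 and '#' to Q2, so quality collapses near the 3' end
print(sliding_window_trim("ACGTACGTAC", "IIIIII####"))
```

Note that the cut happens at the start of the first failing window, so a good base or two just before the drop can be lost as well.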
