I have tried to optimize the Trimmomatic command by varying all the available parameters, but to no avail. Can anyone please suggest a possible solution to this problem? Your suggestions will be much appreciated.
You left out a critical piece of information. What kind of data is this? Is there a fixed sequence tag at the beginning of all reads? Are these amplicons?
No one can comment intelligently on this without knowing what it is. No one can understand what these graphs mean without the context of knowing what the samples are.
I would totally expect a sample of Plasmodium falciparum, or of the tuberculosis bacterium, to "fail" the GC content test. But that doesn't mean anything is wrong! It means the assumptions of the test are wrong for samples like that, so the automated flagging of "bad" results is nonsense and should be ignored.
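To make the GC point concrete, here is a minimal Python sketch that computes the mean GC fraction of a set of reads, so you can compare it with what you expect for your organism (P. falciparum is very AT-rich, M. tuberculosis is GC-rich). The function names and the toy reads are made up for illustration; this is not how FastQC itself does the calculation.

```python
def gc_fraction(seq):
    """Return the fraction of G/C among the unambiguous (A/C/G/T) bases."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    # Reads made only of N (or empty reads) get 0.0 instead of dividing by zero.
    return gc / total if total else 0.0


def mean_gc(reads):
    """Average the per-read GC fraction over a collection of sequences."""
    values = [gc_fraction(read) for read in reads]
    return sum(values) / len(values) if values else 0.0


# Toy reads, invented for illustration only.
at_rich_reads = ["ATATTTAAATGA", "TTTAAATATACA"]
gc_rich_reads = ["GCCGGCGCATCG", "CGGCCGGGTACC"]

print(f"AT-rich sample, mean GC: {mean_gc(at_rich_reads):.2f}")
print(f"GC-rich sample, mean GC: {mean_gc(gc_rich_reads):.2f}")
```

If the mean GC of your reads matches what is known for your organism, a "failed" GC module is most likely telling you about the organism and not about a problem with the run.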
RNA-seq can also "fail" the duplicate sequence test, because some transcripts may be present in rather high abundance.
You look like you are sequencing amplicons, so you probably expect most of the reads to look alike, so why are you dismayed to see that most of the reads look alike?
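If you want to see how much of the duplication comes from a handful of sequences, something like the following rough Python sketch would do. It simply counts exact duplicate sequences; it is not FastQC's own duplication estimate, and the function name and toy reads are hypothetical.

```python
from collections import Counter


def duplication_summary(reads, top_n=3):
    """Return (fraction of reads that repeat an already seen sequence,
    the top_n most common sequences with their counts)."""
    counts = Counter(reads)
    total = sum(counts.values())
    distinct = len(counts)
    duplicated_fraction = 1 - distinct / total if total else 0.0
    return duplicated_fraction, counts.most_common(top_n)


# Toy amplicon-like data, invented for illustration: one dominant sequence.
reads = ["ACGTACGT"] * 8 + ["ACGTACGA", "TTGCACGT"]
fraction, top = duplication_summary(reads)

print(f"Duplicated fraction: {fraction:.2f}")
for seq, n in top:
    print(seq, n)
```

For an amplicon experiment, a very high duplicated fraction dominated by the expected amplicon sequence is the normal result, not a warning sign.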
FastQC is a nice tool for getting a peek at the overall sanity of your data, but one should apply critical judgement to the failing reports. For example, a Per base sequence content failure can be caused by adapter sequences. These flags are warnings to keep in mind rather than "no go" errors.
I would align those reads to the reference genome/transcriptome and check the percentage of aligned reads. If you think you are losing too many reads, look at the unaligned reads and investigate why they are not aligning. In any case, whatever should not align to your reference will not align (for example, the potential contamination suggested by the GC content plot), duplicated reads will be flagged as such in the alignment file, etc.
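As a rough illustration of that check, here is a small Python sketch that reads SAM-format records and reports the percentage of mapped reads and how many are flagged as duplicates, using the standard SAM flag bits (0x4 unmapped, 0x100 secondary, 0x400 duplicate, 0x800 supplementary). The function name and the toy records are made up; in practice you would feed it the lines of your own uncompressed SAM file, or use your aligner's or samtools' own statistics instead.

```python
def alignment_summary(sam_lines):
    """Count primary records, mapped records and duplicate-flagged records."""
    total = mapped = duplicates = 0
    for line in sam_lines:
        # Skip blank lines and header lines (headers start with '@').
        if not line.strip() or line.startswith("@"):
            continue
        flag = int(line.split("\t")[1])
        # Skip secondary (0x100) and supplementary (0x800) alignments
        # so that each read is counted only once.
        if flag & 0x100 or flag & 0x800:
            continue
        total += 1
        if not flag & 0x4:  # 0x4 set means the read is unmapped
            mapped += 1
            if flag & 0x400:  # 0x400 set means marked as a duplicate
                duplicates += 1
    percent_mapped = 100.0 * mapped / total if total else 0.0
    return {"total": total, "mapped": mapped,
            "duplicates": duplicates, "percent_mapped": percent_mapped}


# Toy SAM records, invented for illustration only.
toy_sam = [
    "@HD\tVN:1.6\tSO:coordinate",
    "r1\t0\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",
    "r3\t1024\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\tIIII",
    "r3\t256\tchr2\t500\t0\t4M\t*\t0\t0\tACGT\tIIII",
]

summary = alignment_summary(toy_sam)
print(summary)
```

Note that the duplicate flag is only set if a duplicate-marking step has been run on the alignments; many aligners do not set it on their own.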
Dear all,
Thank you so much for your valuable and critical input.
BR
Deepak