Per Base Sequence Content
Any experienced Whole Exome Sequencing analyst/experimentalist:
the Per Base Sequence Content of the initial couple of bases from the 3' end of my WES reads have some type of compositional bias. Is this some "harmless" procedural or technical bias occurring during sequencing or is it something to be worried about for the downstream analysis?
Here is the graphic of the Per Base Sequence Content (FastQC)
If these were created using Nextera type (tagmentation) libraries then yes. See this paper (Figure 1).
They were created on an Illumina HiSeq 2000 using an Agilent SureSelect Human All Exon v.2 kit. Not sure, but that might be a bit different.
What was used for creating the actual libraries for sequencing?
I am sorry, that is the only information I have. I have no direct contact to the laboratory. I could try to find out though. Anything particular to ask about or consider?