GC content in metagenomic data
0
0
Entering edit mode
6.2 years ago
bioinfo ▴ 840

I have sequenced a few metagenomes on Illumina (DNA extracted from fish tissue). The average number of reads I got is around 40 million/sample. I have done the preliminary QC on the datasets, and I can say that the quality of the dataset is quite high (based on the FastQC report) but I see a clear GC bias (only 35-37% GC contents). Can anyone guide me whether it is due to species composition of the samples or PCR amplification during library prep or later in the cluster generation step. Is there any other reason for this GC bias in metagenomic samples?

metagenomics Assembly • 1.6k views
ADD COMMENT
1
Entering edit mode

Do you know what the expected GC of your likely organisms are? GC content is rarely if ever 50:50, so that doesn't sound super out of the ordinary so far...

ADD REPLY
0
Entering edit mode

What sort of metagenomic samples are you extracting from tissue? Metagenomic samples are a more or less complex and diverse collection of organisms, some of which might be low GC.

ADD REPLY

Login before adding your answer.

Traffic: 917 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6