Question: GC content in metagenomic data
gravatar for bioinfo
2.2 years ago by
bioinfo790 wrote:

I have sequenced a few metagenomes on Illumina (DNA extracted from fish tissue). The average number of reads I got is around 40 million/sample. I have done the preliminary QC on the datasets, and I can say that the quality of the dataset is quite high (based on the FastQC report) but I see a clear GC bias (only 35-37% GC contents). Can anyone guide me whether it is due to species composition of the samples or PCR amplification during library prep or later in the cluster generation step. Is there any other reason for this GC bias in metagenomic samples?

metagenomics assembly • 716 views
ADD COMMENTlink modified 2.2 years ago • written 2.2 years ago by bioinfo790

Do you know what the expected GC of your likely organisms are? GC content is rarely if ever 50:50, so that doesn't sound super out of the ordinary so far...

ADD REPLYlink written 2.2 years ago by Joe18k

What sort of metagenomic samples are you extracting from tissue? Metagenomic samples are a more or less complex and diverse collection of organisms, some of which might be low GC.

ADD REPLYlink written 2.2 years ago by Carambakaracho2.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1354 users visited in the last hour