GC content in metagenomic data


6.9 years ago

bioinfo ▴ 840

I have sequenced a few metagenomes on Illumina (DNA extracted from fish tissue), with around 40 million reads per sample on average. Preliminary QC with FastQC shows that read quality is high, but the GC content is low (only 35-37%), which looks like a clear GC bias to me. Is this more likely due to the species composition of the samples, to PCR amplification during library prep, or to the later cluster generation step? Is there any other reason for this kind of GC bias in metagenomic samples?

metagenomics Assembly • 1.9k views



Do you know what the expected GC content of your likely organisms is? GC content is rarely, if ever, exactly 50%, so 35-37% doesn't sound super out of the ordinary so far...
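To compare the observed GC content with what is expected for the likely organisms, it can help to look at the per-read GC distribution rather than a single average. Below is a minimal sketch in plain Python, assuming uncompressed, 4-line-per-record FASTQ input; the function names (`gc_fraction`, `read_fastq_sequences`, `gc_histogram`) and the 5% bin width are illustrative choices, not part of any existing tool.

```python
from collections import Counter


def gc_fraction(seq):
    """Return the GC fraction of a read, ignoring N and other ambiguous bases.

    Returns None if the read contains no A/C/G/T at all.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    return gc / total if total else None


def read_fastq_sequences(lines):
    """Yield the sequence line of each record from 4-line-per-record FASTQ input.

    `lines` can be an open text file handle or any iterable of lines.
    """
    for i, line in enumerate(lines):
        if i % 4 == 1:  # second line of every record holds the bases
            yield line.strip()


def gc_histogram(seqs, bin_width=5):
    """Count reads per GC-percentage bin; each key is the bin's lower edge in percent."""
    hist = Counter()
    for seq in seqs:
        frac = gc_fraction(seq)
        if frac is None:
            continue  # skip reads made up entirely of ambiguous bases
        pct = frac * 100
        # Put reads with exactly 100% GC into the top bin instead of a bin of their own.
        lower_edge = min(int(pct // bin_width) * bin_width, 100 - bin_width)
        hist[lower_edge] += 1
    return hist
```

For a real dataset, the lines could come from something like `gzip.open("sample_R1.fastq.gz", "rt")` (a hypothetical file name). A single peak near the host's expected GC content would point towards species composition, while a distribution that is shifted or skewed relative to that expectation would make library prep bias worth a closer look.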

6.9 years ago by Joe 22k


What sort of metagenomic samples are you extracting from tissue? Metagenomic samples are a more or less complex and diverse collection of organisms, some of which might be low GC.

6.9 years ago by Carambakaracho ★ 3.3k
