Tutorial: Get GC Content from UCSC with perl script (mysql)
Written 22 months ago by Shicheng Guo:

Hi All,

I have a BED file containing hundreds of human genomic regions. I want to get some basic characteristics of these regions, such as GC content. Is there a Perl script that can do this without downloading the FASTA files for these regions?

I know that if you download the FASTA files for these regions, you can use the following script to calculate GC content:
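
For illustration only, and not the poster's own script: a minimal Python sketch of a per-record GC calculation, assuming the regions are already in a multi-record FASTA file. The function name `gc_content_by_record` is hypothetical, and excluding N and other ambiguity codes from the denominator is one common convention rather than the only one.

```python
from typing import Dict, Iterable, List


def gc_content_by_record(lines: Iterable[str]) -> Dict[str, float]:
    """Return the GC fraction (0.0 to 1.0) of every record in FASTA text.

    Only A, C, G and T are counted, case-insensitively; N and other
    ambiguity codes are left out of the denominator (an assumption).
    """
    counts: Dict[str, List[int]] = {}  # record name -> [gc_count, acgt_count]
    name = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the name.
            parts = line[1:].split()
            name = parts[0] if parts else ""
            counts[name] = [0, 0]
        elif name is not None:
            seq = line.upper()
            gc = seq.count("G") + seq.count("C")
            at = seq.count("A") + seq.count("T")
            counts[name][0] += gc
            counts[name][1] += gc + at
    # Records with no A/C/G/T at all are reported as 0.0 to avoid dividing by zero.
    return {n: (gc / total if total else 0.0) for n, (gc, total) in counts.items()}


# Small in-memory example; a real run would pass an open file handle instead.
example = """>region1
ACGTACGT
>region2
GGGCCCAT
"""
for region, frac in gc_content_by_record(example.splitlines()).items():
    print(f"{region}\t{frac:.3f}")
# region1	0.500
# region2	0.750
```

Passing an open file object works the same way, since the function only iterates over lines.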


or like this:

mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A -e "select chrom, size from  hg19.chromInfo" > hg19.chrom.sizes
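
Note that this query returns chromosome names and lengths rather than GC content. As a hedged sketch of how its output could still be put to use, the Python below loads `hg19.chrom.sizes` and flags BED regions that fall outside their chromosome. It assumes the file is tab-separated and may start with a `chrom`/`size` header row (the mysql client typically prints column names in batch mode unless `-N` is given). The function names are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def read_chrom_sizes(lines: Iterable[str]) -> Dict[str, int]:
    """Parse tab-separated 'chrom<TAB>size' lines into a dict."""
    sizes: Dict[str, int] = {}
    for raw in lines:
        fields = raw.rstrip("\n").split("\t")
        # Skips blank lines and the 'chrom<TAB>size' header, whose size is not numeric.
        if len(fields) < 2 or not fields[1].isdigit():
            continue
        sizes[fields[0]] = int(fields[1])
    return sizes


def out_of_range_regions(
    bed_lines: Iterable[str], sizes: Dict[str, int]
) -> List[Tuple[str, int, int]]:
    """Return BED regions on unknown chromosomes or outside chromosome bounds."""
    bad: List[Tuple[str, int, int]] = []
    for raw in bed_lines:
        line = raw.strip()
        # Ignore blank lines, comments, and track/browser header lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split()
        if len(fields) < 3:
            continue
        chrom, start, end = fields[0], int(fields[1]), int(fields[2])
        # BED coordinates are 0-based, half-open: valid when 0 <= start < end <= size.
        if chrom not in sizes or start < 0 or start >= end or end > sizes[chrom]:
            bad.append((chrom, start, end))
    return bad


# Example rows in the shape the query is expected to produce.
sizes = read_chrom_sizes(["chrom\tsize\n", "chr1\t249250621\n", "chrM\t16571\n"])
print(out_of_range_regions(["chr1\t100\t200", "chrM\t16000\t17000"], sizes))
# [('chrM', 16000, 17000)]
```

Checking coordinates first is optional, but it catches regions from a different assembly before any sequence lookup is attempted.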


ucsc tutorial myql gc content perl • 1.2k views
Modified 17 months ago by Michael Dondrup • written 22 months ago by Shicheng Guo

The post in the current version seems to be missing a valid question or is incomplete. Can you take a look and amend as needed?

Written 22 months ago by genomax

I am trying to understand your question. Is it as follows:

How can I find the nucleotide composition (GC content and so on) of genomic regions from a BED file using online tools, without downloading the reference FASTA file to my server or computer, and preferably using the UCSC server for the computation?

I believe a public Galaxy server may be the best way to handle such projects. You can get data into Galaxy (such as BED files for particular regions) directly from UCSC, with no need to download anything to your server or computer.

Written 22 months ago by Petr Ponomarenko