Official gene symbol list to BED file
6.2 years ago
rbronste ▴ 420

I'm trying to generate an annotated BED file for a specific assembly, starting only from a list of official gene symbols (in my case derived from DESeq2 results). Is there a straightforward way to do this?

bed gene symbol DEseq2 • 2.9k views

On a Mac, awk needs a space between -v and the OFS assignment: -v OFS=

6.2 years ago

You could get HGNC symbols via RefGene (e.g., hg38):

$ wget -qO- http://hgdownload.cse.ucsc.edu/goldenpath/hg38/database/refGene.txt.gz \
  | gunzip -c - \
  | awk -v OFS="\t" '{ if (!match($13, /.*-[0-9]+/)) { print $3, $5, $6, $13, ".", $4; } }' - \
  | sort-bed - \
  > refGene.hg38.sorted.bed

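The sort-bed tool is part of BEDOPS. The awk step writes six BED columns (chromosome, transcript start, transcript end, gene symbol, a placeholder score, and strand) and skips symbols that contain a hyphen followed by digits. refGene has one record per transcript, so a gene with several transcripts appears on several lines.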

Once you have this file, you could run bedops set operations against your differentially-labeled genomic regions, or grep it for your gene names.
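For the gene-name route, here is a minimal sketch of pulling out only the records whose symbol is in your list. The file names (symbols.txt, refGene.toy.bed, selected.bed) and the toy records are made up for illustration; in practice you would point the awk command at refGene.hg38.sorted.bed and at a file with one gene symbol per line. It uses awk instead of grep so that the match is against the whole fourth column rather than any substring of the line.

```shell
# Toy stand-ins for the real inputs (hypothetical symbols and coordinates).
printf 'chr1\t100\t200\tGENEA\t.\t+\nchr1\t300\t400\tGENEB\t.\t-\nchr2\t50\t150\tGENEC\t.\t+\n' > refGene.toy.bed
printf 'GENEA\nGENEC\n' > symbols.txt

# While reading the first file (NR == FNR), store each symbol as an array key.
# While reading the second file, print rows whose 4th column is a stored symbol.
# Assumes symbols.txt is non-empty and has Unix line endings.
awk -F'\t' 'NR == FNR { keep[$1]; next } ($4 in keep)' symbols.txt refGene.toy.bed > selected.bed
```

Because the BED input is already sorted and awk preserves line order, selected.bed stays sorted and can be passed straight to bedops.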
