Mapping genome region to gene symbols in CNV files from TCGA
2
2
Entering edit mode
9.0 years ago
Na Sed ▴ 310

I have just downloaded CNV level 3 files from TCGA database.

As you know, in these files there are three columns: chromosome, start, and end which presents the coordinates of genes.

Now, I would like to map them to gene symbols, but I don't know.

I appreciate any help.

Thanks

R CNV Gene symbols genome region • 5.3k views
ADD COMMENT
8
Entering edit mode
9.0 years ago
raunakms ★ 1.1k
  1. Get the RefSeq Gene co-ordinate from UCSC Table Browser (output in bed format)
  2. Convert your input coordinates (from TCGA CNV Level-3) into bed format
  3. Use BEDTools, command: intersectbed to match two bed files. This will give you a list of cnv segments and the matched RefSeq Genes.

For Example: (refer to the documentations of BEDTools for the usage of -wa -wb)

intersectBed -a file_cnv_seg.bed -b file_ucsc_refseq.bed -wa -wb > file_output.bed
ADD COMMENT
4
Entering edit mode
ADD COMMENT

Login before adding your answer.

Traffic: 1478 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6