How to convert a whole genome gene position reference number (ID) list to the actual text gene names?
8.1 years ago

I have NextGene output with a POS ID column (.vcf, .csv). I need to get the corresponding gene symbols for the tuberculosis genome.

vcf wgs • 2.0k views
8.1 years ago

Make sure your file is sorted and that its first 3 columns are: chromosome name | start position | end position. For a single-base variant taken from a VCF POS value, the BED-style start is POS - 1 and the end is POS.

You would then need to download an annotation file for your genome of interest from Ensembl BioMart whose first columns are: chromosome name | start position | end position | gene name, followed by any other fields you want. This file must be sorted too, and its chromosome names must match the ones in your file.

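Then use bedtools to join the two files with the intersect subcommand (bedtools intersect, with -a for your file and -b for the annotation file). Adding -wa -wb reports each of your positions together with the gene record it overlaps.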
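
If bedtools is not at hand, the same overlap lookup can be sketched in plain Python. This is only a minimal illustration of what the intersect step does, not a replacement for it: the gene table, its coordinates, the chromosome name and the function names (build_index, genes_at, annotate) are all made up for the example, and coordinates are assumed to be 1-based and inclusive.

```python
from bisect import bisect_right

# Toy annotation with HYPOTHETICAL names and coordinates (1-based, inclusive).
# In practice these rows would come from the downloaded annotation file.
GENES = [
    ("Chromosome", 1, 1500, "geneA"),
    ("Chromosome", 2000, 3200, "geneB"),
    ("Chromosome", 3300, 4400, "geneC"),
]


def build_index(genes):
    """Group genes by chromosome, sorted by start position."""
    index = {}
    for chrom, start, end, name in sorted(genes):
        index.setdefault(chrom, []).append((start, end, name))
    return index


def genes_at(index, chrom, pos):
    """Return the names of all genes whose interval contains pos."""
    intervals = index.get(chrom, [])
    starts = [start for start, _, _ in intervals]
    # Only genes starting at or before pos can contain it.
    upper = bisect_right(starts, pos)
    return [name for start, end, name in intervals[:upper] if start <= pos <= end]


def annotate(index, variants):
    """Attach a comma-joined gene list (or 'intergenic') to each (chrom, pos)."""
    result = []
    for chrom, pos in variants:
        names = genes_at(index, chrom, pos)
        result.append((chrom, pos, ",".join(names) if names else "intergenic"))
    return result


if __name__ == "__main__":
    idx = build_index(GENES)
    for row in annotate(idx, [("Chromosome", 2500), ("Chromosome", 1800)]):
        print(*row, sep="\t")
```

Positions that fall between genes are labelled "intergenic" here, which is a choice made for the sketch; bedtools intersect simply omits records that overlap nothing unless asked otherwise.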
