how to get promoter coordinates?
1
2
Entering edit mode
9.6 years ago
epigene ▴ 590

I'm curious how you get all the promoter coordinates? Specifically, how you deal with one genes having multiple isofroms and multiple promoters? If there are 30k genes, do you ensure 30k promoters, i.e. one promoter per gene? What's the best practice? There is no standard way of doing it from what I can tell. So how do you do it?

genome promoter • 3.1k views
ADD COMMENT
2
Entering edit mode
9.6 years ago

Personally I get the promoter region for each transcript and then possibly merge them within a gene if they overlap (assuming that what I'm doing next is compatible with merging them). BTW, this is really easy to do in R with GenomicFeatures and a TranscriptDb.

ADD COMMENT
0
Entering edit mode

Thanks for the quick reply. That's actually what I was doing as well! But if I define 2k upstream of TSS to be a promoter, by doing the merging, you could end up with very long promoters (>10k), which is a bit of concern to me. What do you think about that?

ADD REPLY
0
Entering edit mode

Whether that makes sense or not is dependent on what you want to do downstream. Obviously if you need to do any plotting of enrichment as a function of proximity to the TSS then merging wouldn't be a good idea.

ADD REPLY

Login before adding your answer.

Traffic: 1994 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6