Question: Conversion tool from ENST(ensembl) transcript to NM transcript(NCBI)
gravatar for gmora
23 months ago by
gmora10 wrote:

So I have hundreds ensemble gene transcripts which needs to be converted to NCBI transcripts. I am worried about their being position changes in known mutations. I think one of reasons for this change is to have consecutive exons numbering (1,2,3 etc) instead of (1b,2,3,5) that is found in ensembl.

Gene Transcript Location Position Type Nuc Change AA Change Nuc Name AA Name
A ENST0001 exon 1 69aa c G-->C A-->P(150) c.448G>C p.Ala150Pro

This is roughly what the file looks like before conversion. I think between the formats the Position (AA or nucleotide) will change as will the Nuc Name. * denotes likely area's of change.

Any help would be appreciated

ADD COMMENTlink modified 4 months ago by Biostar ♦♦ 20 • written 23 months ago by gmora10
gravatar for Devon Ryan
23 months ago by
Devon Ryan78k
Freiburg, Germany
Devon Ryan78k wrote:

Use biomart. I've never seen a 1b exon in an Ensembl annotation. Not all Ensembl transcripts will have a RefSeq counterpart, but those that do should be the same.

As an aside, there's apparently a small bug in Ensembl biomart at the moment where occasionally nuccore yeast gene ID's are returned rather than refseq IDs. I'll report that so it gets fixed, though it's trivial to filter thankfully (they don't start with NM_).

ADD COMMENTlink written 23 months ago by Devon Ryan78k

Please report this to our helpdesk, so that it can be fixed.

ADD REPLYlink modified 23 months ago • written 23 months ago by Denise - Open Targets4.4k
gravatar for Denise - Open Targets
23 months ago by
UK, Hinxton, EMBL-EBI
Denise - Open Targets4.4k wrote:

You can use BioMart to convert ENST into NMs. Check our tutorial page. Under 'filters', you input your list of ENSTs and under 'attributes' you select NM from the 'External' field. You will then get the correspondence between them. In Ensembl, we assign stable IDs to exons such as ENSE00001184784, and their numbers are consecutive, rather than things such as 1b, 2, 3, 5. See this example. 'Ala150Pro' notations are from HGVS. See this post. If you have your variants in HGVS notation such as ENST00000415952.1:c.-149-34206G>T, 7:g.24926827C>A, you can use the Ensembl VEP to find out where they map to in both RefSeq and Ensembl genes.

ADD COMMENTlink modified 23 months ago • written 23 months ago by Denise - Open Targets4.4k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 629 users visited in the last hour