I am using Biomart to get transcripts start and end from a list of 87 NM_transcripts I have like this
NM_000090.3
NM_000138.4
NM_000169.2
The steps I am following are: CHOOSE DATABASE ---> Ensemble Genes 105
CHOOSE DATASET ---> Human genes
Go to Filters, go to GENE, click on Input external references ID list [Max 500 advised.
Select RefSeq mRNA id(s) [e.g. NM_000014]
Copy and paste my transcripts
Click on count (to see if input was accepted)
Click on Attributes
Click on GENE
Select Gene start (bp) and Gene end (bp) and also Gene name (to check if they match because I also have this info.
Click Results
And then I got this
Gene start (bp) Gene end (bp) Gene name
577 647 MT-TF
648 1601 MT-RNR1
1602 1670 MT-TV
1671 3229 MT-RNR2
3230 3304 MT-TL1
3307 4262 MT-ND1
4263 4331 MT-TI
4329 4400 MT-TQ
4402 4469 MT-TM
4470 5511 MT-ND2
What is significantly less genes that the ones I introduce
What I am doing wrong?
Looks like these are genes in Mitochondria? There are only 37 of those in Ensembl.