Question: Getting FASTA sequences for a list of Ensembl gene IDs
0
3.8 years ago by
A3.6k
A3.6k wrote:

Hi,

I retrieved the Ensembl gene IDs of the yeast ribosomal RNA genes from Ensembl BioMart:

Ensembl Gene ID
RDN25-1
RDN18-2
RDN5-4
15S_rRNA
RDN37-2
RDN5-6
RDN5-3
RDN58-1
RDN18-1
RDN37-1

I need to download a FASTA file for these IDs, but I could not find such an option in Ensembl. Do you have any suggestions, please?

Thank you.

sequence biomart ensembl gene • 1.7k views
ADD COMMENTlink modified 3.8 years ago by Emily_Ensembl19k • written 3.8 years ago by A3.6k
4
3.8 years ago by
genomax74k
United States
genomax74k wrote:

You can use BioMart. Follow the sequence below.

Ensembl --> BioMart --> Choose Database --> "Ensembl Genes" --> Select Yeast Genome from list --> Filters (in the left pane) --> Gene --> Input external references ID list --> Paste IDs in --> Attributes (left pane) --> Sequence --> Select as needed --> Results Button at top of the page --> Export to "File" as "FASTA".
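If you ever want to script this instead of clicking through BioMart, here is a minimal sketch that uses the Ensembl REST API (`GET /sequence/id/:id`) rather than BioMart itself. It is not the procedure described above, just one alternative. Assumptions: the IDs are accepted as stable IDs by the REST server, `species="saccharomyces_cerevisiae"` is the right species alias, and the helper names `sequence_url` and `fetch_fasta` are made up for this example.

```python
import urllib.parse
import urllib.request

# Public Ensembl REST server.
REST_SERVER = "https://rest.ensembl.org"


def sequence_url(gene_id, species=None, seq_type="genomic"):
    """Build the GET URL that asks for one stable ID in FASTA format."""
    params = {"type": seq_type, "content-type": "text/x-fasta"}
    if species:
        # Assumption: a species alias helps when an ID is not unique.
        params["species"] = species
    path = "/sequence/id/" + urllib.parse.quote(gene_id, safe="")
    return REST_SERVER + path + "?" + urllib.parse.urlencode(params)


def fetch_fasta(gene_ids, species=None):
    """Download one FASTA record per ID and join them into one string.

    Needs network access; an unknown ID raises urllib.error.HTTPError.
    """
    records = []
    for gene_id in gene_ids:
        with urllib.request.urlopen(sequence_url(gene_id, species)) as resp:
            records.append(resp.read().decode("utf-8").strip())
    return "\n".join(records) + "\n"


# Example usage (commented out because it contacts the server):
# ids = ["RDN25-1", "RDN18-2", "RDN5-4", "15S_rRNA"]
# with open("yeast_rrna.fa", "w") as handle:
#     handle.write(fetch_fasta(ids, species="saccharomyces_cerevisiae"))
```

One request is sent per ID, which is fine for a list of ten genes; for long lists you would want to pause between requests.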

ADD COMMENTlink modified 3.8 years ago • written 3.8 years ago by genomax74k

thank you so much.

Sorry, I also searched for Arabidopsis rRNA in the plant BioMart. I found the IDs, but the sequences were unavailable. I then tried the IDs in NCBI Nucleotide, but found nothing there either.

ADD REPLYlink modified 3.8 years ago • written 3.8 years ago by A3.6k
1

Are your IDs from yeast or Arabidopsis? The ones in your original post are from yeast.

ADD REPLYlink written 3.8 years ago by genomax74k
1

For Arabidopsis rRNA use the following path:

Ensembl Plants --> BioMart --> Choose Database --> Plant Mart --> Select Arabidopsis Genome from list --> Filters (in the left pane) --> Gene --> Gene type (4th option) --> Select "rRNA" --> Attributes (left pane) --> Sequence --> Select features as needed (Unspliced gene may be one option) --> Results Button at top of the page --> Export to "File" as "FASTA".
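The same clicks can in principle be expressed as a BioMart XML query and sent to the `martservice` endpoint. The sketch below only builds the query and the URL. Every BioMart name in the commented example is an assumption you should check against the XML button on the BioMart results page: the schema `plants_mart`, the dataset `athaliana_eg_gene`, the filter `biotype`, and the attributes `ensembl_gene_id` and `gene_exon_intron` (unspliced gene). The function names are made up for this example.

```python
import urllib.parse
import xml.etree.ElementTree as ET


def build_biomart_query(dataset, filters, attributes, schema="default"):
    """Return a BioMart XML query string that requests FASTA output."""
    query = ET.Element("Query", {
        "virtualSchemaName": schema,
        "formatter": "FASTA",
        "header": "0",
        "uniqueRows": "0",
        "datasetConfigVersion": "0.6",
    })
    ds = ET.SubElement(query, "Dataset",
                       {"name": dataset, "interface": "default"})
    # One <Filter> per restriction, e.g. gene type = rRNA.
    for name, value in filters.items():
        ET.SubElement(ds, "Filter", {"name": name, "value": value})
    # One <Attribute> per output column or sequence type.
    for name in attributes:
        ET.SubElement(ds, "Attribute", {"name": name})
    return ET.tostring(query, encoding="unicode")


def martservice_url(base_url, query_xml):
    """Attach the URL-encoded XML query to a martservice base URL."""
    return base_url + "?" + urllib.parse.urlencode({"query": query_xml})


# Example (all BioMart names below are assumptions, see above):
# xml_query = build_biomart_query(
#     "athaliana_eg_gene",
#     {"biotype": "rRNA"},
#     ["ensembl_gene_id", "gene_exon_intron"],
#     schema="plants_mart",
# )
# print(martservice_url("https://plants.ensembl.org/biomart/martservice",
#                       xml_query))
```

Opening the printed URL in a browser, or fetching it with any HTTP client, should return the FASTA text if the names are right.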

ADD REPLYlink modified 3.8 years ago • written 3.8 years ago by genomax74k

Thank you so much, genomax2.

My IDs were from yeast, and you resolved that problem. I then asked about Arabidopsis, and your tip worked well for Arabidopsis too.

ADD REPLYlink modified 3.8 years ago • written 3.8 years ago by A3.6k
1

I'm sorry, I'm very confused. The IDs in the list are bread yeast IDs. You can easily query them and get the sequences, for example. What does Arabidopsis have to do with it?

ADD REPLYlink written 3.8 years ago by Emily_Ensembl19k
1
3.8 years ago by
Jenez520
Sweden
Jenez520 wrote:

I could not find a decent way of automating the process through eutils (I'm sure plenty of people here can show you how), but if you don't mind doing these few manually, you can find the sequences quite easily through either the

NCBI Gene database:

http://www.ncbi.nlm.nih.gov/gene/9164935

or SGD's database:

http://www.yeastgenome.org/locus/S000006484/sequence

ADD COMMENTlink written 3.8 years ago by Jenez520

Thank you, Jenez. You are right, but doing it manually is error-prone and somewhat time-consuming.

ADD REPLYlink written 3.8 years ago by A3.6k
1

Funny, I would argue that automating the workflow is more error-prone, seeing how data is rarely standardized across the whole data set you are looking at.
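Whichever route you take, one cheap safeguard is to check that every ID you asked for actually shows up in a FASTA header of the downloaded file. A minimal sketch follows, assuming the ID appears in the header as a separate token delimited by whitespace or `|`; the function name `missing_ids` is made up for this example.

```python
import re


def missing_ids(requested_ids, fasta_text):
    """Return the requested IDs that appear in no FASTA header line."""
    found = set()
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            # Split the header on whitespace and '|' and keep every token,
            # since different tools put the ID in different positions.
            found.update(tok for tok in re.split(r"[|\s]+", line[1:]) if tok)
    return [gene_id for gene_id in requested_ids if gene_id not in found]


# Example usage:
# with open("yeast_rrna.fa") as handle:
#     print(missing_ids(["RDN25-1", "RDN18-2"], handle.read()))
```

An empty list means every requested ID was found; anything else tells you which ones to look up by hand.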

ADD REPLYlink written 3.8 years ago by Jenez520

No, Jenez, I mean that the routes you suggested (below) are time-consuming. As for automating, I don't have the programming skills to do so.

NCBI Gene database:

http://www.ncbi.nlm.nih.gov/gene/9164935

or SGD's database:

http://www.yeastgenome.org/locus/S000006484/sequence

ADD REPLYlink written 3.8 years ago by A3.6k