biomaRt getSequence results of gene_exon in wrong order
5 months ago
Hematocite • 0

Hi everyone,

I have been using the biomaRt package to retrieve the exon sequences of certain genes. In particular, I am only interested in the first exon of each gene, which also includes the 5' UTR.

For this I have used the following code:

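library(biomaRt)
ensembl <- useEnsembl(biomart = "genes", dataset = "hsapiens_gene_ensembl")  # assumed setup: human Ensembl genes mart
ID <- "ENST00000502732.6"
sequence <- getSequence(id = ID, seqType = "gene_exon", type = "ensembl_transcript_id_version", mart = ensembl)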

The resulting data frame contains all the exon sequences, but the exons are in the wrong order: the sequence of exon 1 is not necessarily in row 1, and the rows are not labeled. Since I am particularly interested in exon 1, this is quite annoying, as I would need to check the sequences manually again.

Am I missing something here?

Any help is appreciated. Also if there is another way to retrieve sequences of one particular exon it would be great to know.

Thanks in advance!

R Biomart getsequence order exons • 194 views
5 months ago
swbarnes2 9.8k

I don't think you are missing anything; things come out of biomart unordered. You'll have to sort and filter yourself.


Thanks for your answer. Do I have to do it manually, or can you think of a different way? I have a list of 100 genes, and doing it all manually would be quite labor-intensive.


Re-query, and this time, ask for exon rank, not just sequence.
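
A minimal sketch of what that re-query might look like, using getBM() instead of getSequence() so that the exon rank comes back in the same table as the sequence. It assumes that `rank` and `ensembl_exon_id` can be requested together with `gene_exon` in your Ensembl release (listAttributes(ensembl, page = "sequences") should show whether they can), and that `mart` is a Mart object such as the `ensembl` object in the question. The helper names fetch_exons(), sort_exons() and first_exons() are hypothetical, and the toy data frame only imitates the expected shape of the result so the filtering step can be tried offline.

```r
# Sketch: fetch exon sequences together with their rank, then keep exon 1.
# Assumes the biomaRt package is installed; nothing is queried until
# fetch_exons() is actually called.

# Query exon ID, rank and sequence for a vector of versioned transcript IDs.
# `mart` is assumed to be a Mart object for the relevant Ensembl dataset.
fetch_exons <- function(ids, mart) {
  biomaRt::getBM(
    attributes = c("ensembl_transcript_id_version", "ensembl_exon_id",
                   "rank", "gene_exon"),
    filters = "ensembl_transcript_id_version",
    values = ids,
    mart = mart
  )
}

# Order exons by transcript, then by rank within each transcript.
sort_exons <- function(exons) {
  exons[order(exons$ensembl_transcript_id_version, exons$rank), , drop = FALSE]
}

# Keep only the first exon (rank 1) of each transcript.
first_exons <- function(exons) {
  exons[exons$rank == 1, , drop = FALSE]
}

# Offline toy data in the shape the query is expected to return.
toy <- data.frame(
  ensembl_transcript_id_version = c("TX1.1", "TX1.1", "TX2.1", "TX2.1"),
  ensembl_exon_id = c("E2", "E1", "E4", "E3"),
  rank = c(2L, 1L, 1L, 2L),
  gene_exon = c("CCCC", "AAAA", "GGGG", "TTTT"),
  stringsAsFactors = FALSE
)
sorted_toy <- sort_exons(toy)
exon1_toy <- first_exons(toy)
```

For the list of 100 genes, something like first_exons(fetch_exons(ids, ensembl)) with `ids` holding all the transcript IDs should return one row per transcript in a single query, so nothing has to be checked by hand.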
