Tophat/Bowtie Not Using Gene Symbols From Gtf File
1
1
Entering edit mode
10.9 years ago
jobinv ★ 1.1k

I have an issue with my RNA-seq analysis, and was wondering whether any of you know how to solve it? Basically, I want to be able to annotate my analysis with gene symbols. What I have done is to supply Tophat/Bowtie with a gtf file from Ensembl, which contains these gene symbols. However, instead of making use of these gene symbols (the column "gene_name" in the gtf file, for example "DDX11L1"), it seems to use the Ensembl gene ID's instead (the column "gene_id" in the gtf file, for example "ENSG00000223972").

How do I get Tophat/Bowtie to use the "gene_name" column instead? Is this possible?

rna-seq bowtie annotation ensembl • 2.6k views
ADD COMMENT
2
Entering edit mode
10.9 years ago
jobinv ★ 1.1k

It seems I had been misunderstanding the setup of this for a while now. Precisely because each individual gene_name links to multiple gene_id's (because of for example multiple transcripts of the same gene), this cannot be done directly. It's better to rather annotate the transcripts to gene symbols at a later point during the downstream analysis, I've learnt.

ADD COMMENT

Login before adding your answer.

Traffic: 2662 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6