gravatar for uki_al
3.0 years ago by
uki_al50 wrote:

Hi I have a question about the differences between the FASTA files that can be downloaded from the ensembl ftp ( and the ncbi ftp (

As far as I could get tell, both are GRCH37 versions, so I was curious are the references identical or not? If they are, could I use the FASTA file downloaded from the ensembl ftp together with the gene-annotation file downloaded from ncbi ftp?

I know UCSC differs by chromosome naming, and I know there are tools that can convert from one to another, that's why I opt to download UCSC FASTA and GTF and use them together. I was also using up until now the ensembl FASTA and GTF together. But I was just curious, if I want to use ncbi GTF, do I need to download the FASTA from the ncbi ftp, or will the ensembl one do the job? From what I understood, they should be identical, I just couldn't confirm this...

ADD COMMENTlink modified 2.8 years ago by Biostar ♦♦ 20 • written 3.0 years ago by uki_al50
gravatar for Jean-Karim Heriche
3.0 years ago by
EMBL Heidelberg, Germany
Jean-Karim Heriche22k wrote:

The assembly may be the same although they could differ due to the differential application of patches. Regardless, the annotations would definitely be different between the different resources as they each annotate the genome in their own way. Switching between or mixing references during a project is asking for trouble.

ADD COMMENTlink written 3.0 years ago by Jean-Karim Heriche22k

For reference: chromosome coordinates remain unchanged by patches.

ADD REPLYlink modified 3.0 years ago • written 3.0 years ago by genomax83k

Thanks, I wasn't sure about that.

ADD REPLYlink written 3.0 years ago by Jean-Karim Heriche22k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2208 users visited in the last hour