What is the difference between a reference genome and an annotation?
10.0 years ago
Y Tb ▴ 230

I am very new to bioinformatics and I want to know the difference between a reference genome file and an annotation file. Also, what is the best website to download these files from for human?

RNA-Seq next-gen • 8.2k views
10.0 years ago
Xingyu Yang ▴ 280

A reference genome file holds the genome sequence itself. An annotation file describes where genetic elements (such as introns and exons) are located in that genome, given as start and end coordinates. Reference genome files are mostly in FASTA format (.fasta), and annotation files are mostly in GFF (.gff) or BED (.bed) format. Another format, GenBank, sometimes contains both the reference sequence and the annotation. Search for each format name for details.
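To make the split between sequence and coordinates concrete, here is a minimal Python sketch. The FASTA and GFF text below is toy data invented for illustration, and the helper names (read_fasta, read_gff, feature_sequence) are hypothetical, not part of any library. It assumes GFF-style coordinates, which are 1-based with an inclusive end.

```python
import io

# Toy data invented for illustration; real files are far larger.
FASTA_TEXT = ">chr1\nACGTACGTAC\nGGTTAACCGG\n"
GFF_TEXT = (
    "##gff-version 3\n"
    "chr1\texample\texon\t3\t8\t.\t+\t.\tID=exon1\n"
    "chr1\texample\texon\t13\t18\t.\t+\t.\tID=exon2\n"
)

# Translation table used to complement bases on the minus strand.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def read_fasta(handle):
    """Return {sequence name: sequence} from a FASTA file handle."""
    sequences = {}
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The name is the first word after '>'.
            name = line[1:].split()[0]
            sequences[name] = []
        elif name is not None:
            sequences[name].append(line)
    return {key: "".join(parts) for key, parts in sequences.items()}


def read_gff(handle):
    """Yield (seqid, type, start, end, strand) for each feature line."""
    for line in handle:
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments, and blank lines
        cols = line.rstrip("\n").split("\t")
        yield cols[0], cols[2], int(cols[3]), int(cols[4]), cols[6]


def feature_sequence(genome, feature):
    """Cut the feature out of the genome (1-based, inclusive coordinates)."""
    seqid, _type, start, end, strand = feature
    seq = genome[seqid][start - 1:end]
    if strand == "-":
        # Minus-strand features are reported as the reverse complement.
        seq = seq.translate(COMPLEMENT)[::-1]
    return seq


genome = read_fasta(io.StringIO(FASTA_TEXT))
features = list(read_gff(io.StringIO(GFF_TEXT)))
exon_sequences = [feature_sequence(genome, f) for f in features]
print(exon_sequences)
```

The annotation on its own holds only coordinates; the bases come from the reference file. That is why the two files have to come from the same genome build.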

For human, the best place to download these files is http://genome.ucsc.edu/. You can also download them from NCBI.


Thanks, Xingyu Yang. So what is the difference between the GTF and GFF annotation formats?


They are pretty similar. GTF is based on version 2 of GFF; the most recent version of GFF is GFF3.
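In practice, the difference you notice most is in the ninth (attributes) column: GTF writes key "value"; pairs, while GFF3 writes key=value pairs separated by semicolons. Here is a rough Python sketch of parsing both styles. The function names and the IDs G1 and T1 are made up for illustration, and the parsing is deliberately naive: it assumes values never contain a semicolon and does not decode URL-escaped characters in GFF3.

```python
def parse_gtf_attributes(field):
    """Parse a GTF-style attribute column, e.g. gene_id "G1"; transcript_id "T1";"""
    attrs = {}
    for part in field.strip().split(";"):
        part = part.strip()
        if not part:
            continue  # skip the empty piece after the trailing semicolon
        # Key and value are separated by the first space; values are quoted.
        key, _, value = part.partition(" ")
        attrs[key] = value.strip().strip('"')
    return attrs


def parse_gff3_attributes(field):
    """Parse a GFF3-style attribute column, e.g. ID=T1;Parent=G1"""
    attrs = {}
    for part in field.strip().split(";"):
        part = part.strip()
        if not part:
            continue
        # Key and value are separated by the first '='.
        key, _, value = part.partition("=")
        attrs[key] = value
    return attrs


gtf_attrs = parse_gtf_attributes('gene_id "G1"; transcript_id "T1";')
gff3_attrs = parse_gff3_attributes("ID=T1;Parent=G1")
print(gtf_attrs)
print(gff3_attrs)
```

The first eight columns (sequence name, source, feature type, start, end, score, strand, phase/frame) are laid out the same way in both formats, so a tool that only needs coordinates can usually read either.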


Thanks again, Xingyu Yang. Could you please send me a direct link to download the human annotation file? And what about the annotation file from the Ensembl website?


If you want a direct link, I would recommend downloading it here: http://cufflinks.cbcb.umd.edu/igenomes.html


I followed the link and found this entry:

Ensembl    GRCh37    17297 MB    May 14 17:23

GRCh37 link: ftp://igenome:G3nom3s4u@ussd-ftp.illumina.com/Homo_sapiens/Ensembl/GRCh37/Homo_sapiens_Ensembl_GRCh37.tar.gz

So is the human annotation file really about 17.3 GB?


That archive includes everything: annotation in different formats, ncRNA annotation, the reference sequence, and indexed reference sequences.

If you just want the annotation file, you can find it on the NCBI FTP site: ftp://ftp.ncbi.nih.gov/genomes/Homo_sapiens. The annotation files are in the GFF folder, and they include ncRNA.


I visited this link, but it confused me because there are many files there. Which of them is the annotation file for human?
