Entering edit mode
5.7 years ago
endrebak
▴
980
Here is some sample data from zebrafish, release 95:
#!genome-build GRCz11
#!genome-version GRCz11
#!genome-date 2017-05
#!genome-build-accession NCBI:GCA_000002035.4
#!genebuild-last-updated 2018-04
4 ensembl gene 17308 18211 . - . gene_id "ENSDARG00000102141"; gene_version "2"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding";
4 ensembl transcript 17308 18211 . - . gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding";
4 ensembl exon 18134 18211 . - . gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "1"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; exon_id "ENSDARE00001173708"; exon_version "2";
4 ensembl CDS 18134 18211 . - 0 gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "1"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; protein_id "ENSDARP00000130978"; protein_version "1";
4 ensembl exon 17948 18046 . - . gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "2"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; exon_id "ENSDARE00001162488"; exon_version "1";
4 ensembl CDS 17948 18046 . - 0 gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "2"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; protein_id "ENSDARP00000130978"; protein_version "1";
4 ensembl exon 17681 17772 . - . gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "3"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; exon_id "ENSDARE00001173438"; exon_version "1";
4 ensembl CDS 17681 17772 . - 0 gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "3"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; protein_id "ENSDARP00000130978"; protein_version "1";
4 ensembl exon 17308 17548 . - . gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "4"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; exon_id "ENSDARE00001189332"; exon_version "2";
4 ensembl CDS 17308 17548 . - 1 gene_id "ENSDARG00000102141"; gene_version "2"; transcript_id "ENSDART00000171737"; transcript_version "2"; exon_number "4"; gene_name "ptpn12"; gene_source "ensembl"; gene_biotype "protein_coding"; transcript_name "ptpn12-201"; transcript_source "ensembl"; transcript_biotype "protein_coding"; protein_id "ENSDARP00000130978"; protein_version "1";
Is there an easy way to get the same data through mysql or is ftp the way to go? I do not care about the header.
Thanks. Mysql is safer protocol than ftp is the reason. But I guess ftp is the way to go :)
Note also that the perl API makes SQL queries under the hood so you could write a perl script to get the same data via database queries without having to bother writing the SQL queries yourself.