Exponentially Increasing Genomes Slide
6
14
14.2 years ago
Lee Katz ★ 3.2k

I always see a slide in talks that shows the increasing number of genomes available in GenBank or another database. Where is this slide from? I have seen an outdated one from Genomes Online but nothing recent.

How can I find this graph and cite it for my own talk?

genome graph • 5.8k views
0

I guess the Genomes Online one is the best answer. Thank you for the boost on my question, giovanni.

Maybe a better question would be, where are these data so that we can generate our own pretty graphs? But then again, I realize that the data are out there--you just have to find them and bring them together yourself!

Still, everyone gave really great answers, and I learned a lot from going through your links and what you said. Thank you all!

11
14.2 years ago
brentp 24k

check here

0

what a pity the graph is so damn ugly!

0

I had to use Internet Explorer to get the numbers, but they suggest that the relative growth rate is decreasing, and that 2000 was an outlier year (and obviously 1983).
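
For anyone who wants to check the relative-growth observation on their own numbers, here is a minimal Python sketch. It assumes you already have cumulative yearly totals (for example, copied by hand from a database's statistics page). The function name `relative_growth` and the counts in the usage line are made-up placeholders, not real GenBank figures.

```python
def relative_growth(cumulative_by_year):
    """Return {year: fractional growth over the previous listed year}.

    `cumulative_by_year` maps year -> cumulative total (genomes or bases).
    Years whose predecessor total is zero are skipped, because the
    ratio is undefined there.
    """
    years = sorted(cumulative_by_year)
    growth = {}
    for prev, curr in zip(years, years[1:]):
        before = cumulative_by_year[prev]
        if before == 0:
            continue
        growth[curr] = (cumulative_by_year[curr] - before) / before
    return growth


# Placeholder numbers for illustration only, NOT real database statistics.
example = {2000: 100, 2001: 200, 2002: 300}
print(relative_growth(example))
```

Under strictly exponential growth these values stay flat from year to year; a falling series means the relative growth rate is decreasing even though the absolute totals keep rising.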

10
14.2 years ago
Mary 11k

There was another really good graphic that Lincoln Stein used in his talk at Beyond The Genome last week. It is available from this paper:

The case for cloud computing in genome informatics

It is Figure 2 in there. It shows the slope of sequence data growth pre-NGS and the recent change. It also shows where production has crossed storage: we have now passed the point where we can afford to store it all:

"The cost of genome sequencing is now decreasing several times faster than the cost of storage, promising that at some time in the not too distant future it will cost less to sequence a base of DNA than to store it on a hard disk....The various members of the genome informatics ecosystem are now facing a potential tsunami of genome data that will swamp our storage systems and crush our compute clusters."

Also at this meeting people were trying to change the meme from big scary data (deluge, tsunami, etc) to "data bonanza". People were attempting to use that--but they still seemed scared :)

0

I like the information added within this response very much.

0

lol, I wasn't there, but you can count me with the scared ones. I do have ideas, and a plan, for dealing with a certain amount of data growth. But if this keeps going indefinitely, where will we end up? That's what I'm afraid of. Is Pac Bio going to save me from short reads? Or are they just going to multiply the data volume? Or both at the same time, plus a continuing flood of 2nd-gen data?

Or more generally - what is the new equilibrium going to look like, and when are we going to get there? The fact that I don't know is what makes me nervous.

0

I'm denied access to the article from a University of California :(

8
14.2 years ago
User 59 13k

You might want to have a look at the statistics from GOLD, the 'Genomes OnLine Database', here, as it has statistics at the genome level rather than the base-pair level.

0

I just realized that they actually have the data in an Excel spreadsheet at the top of the page, which is what I wanted: http://genomesonline.org/Gold_Stats.xls

6
14.2 years ago

See Genome Project Statistic.

update ... and the (rather incomplete) category in Wikipedia, Sequenced genomes.

4
13.7 years ago
Yannick Wurm ★ 2.5k

This one is helpful too

http://www.genome.gov/sequencingcosts/


1
12.7 years ago
Bjoernsen ▴ 40

I recommend using diArk for the latest genome files. The statistics can be found at http://www.diark.org/diark/statistics

0

That's a bunch of neat plots, thanks for sharing this.
