Question: Genbank: Gi Numbers Vs Accession Numbers?
2
9.1 years ago, Bio_X2Y (3.8k, Ireland) wrote:

Some documentation I found suggests that a GenBank GI number will change each time the sequence changes - even if only one base is affected. The Accession number, on the other hand, remains the same.

However, the accession number is usually qualified with a version number suffix, e.g. "GL000191.1". As far as I know, this version number also increments each time the sequence changes.
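To make the Accession.Version format concrete, here is a minimal Python sketch that splits an identifier such as "GL000191.1" into its two parts. The names `VersionedAccession` and `parse_accession_version` are made up for illustration and are not part of any NCBI tool; the sketch assumes the version is whatever integer follows the last dot.

```python
from typing import NamedTuple


class VersionedAccession(NamedTuple):
    """Hypothetical container for the two halves of an Accession.Version."""
    accession: str
    version: int


def parse_accession_version(identifier: str) -> VersionedAccession:
    """Split 'ACCESSION.VERSION' at the last dot (illustrative helper)."""
    accession, sep, version = identifier.strip().rpartition(".")
    # Reject identifiers with no dot, no accession part, or a non-numeric version.
    if not sep or not accession or not version.isdigit():
        raise ValueError(f"not an Accession.Version identifier: {identifier!r}")
    return VersionedAccession(accession, int(version))


parsed = parse_accession_version("GL000191.1")
print(parsed.accession, parsed.version)
```

A bare accession without a version suffix is rejected here on purpose, since on its own it does not pin down one specific sequence.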

Does this mean that the relationship between GI number and the Accession.Version pair is one-to-one, and so either would be equally suitable as a unique identifier for a sequence?

Thanks.

genbank identifiers • 11k views
3

Yes, that's right. Accession.Version is probably preferred by humans, and GI number by machines.

written 9.1 years ago by Brad Chapman (9.5k)
2

Sorry, I can't resist :-P http://lmgtfy.com/?q=NCBI+gi

modified 5 months ago by RamRS (25k) • written 9.1 years ago by Pierre Lindenbaum (126k)

@Pierre, hmmmm, while I can see the funny side, I had read that page, but I still wasn't 100% clear whether the relationship was always one-to-one... :) For example, I once thought that the version number would increment if the sequence changed OR the meta-information of the sequence changed, e.g. the gene symbol. I now know that isn't the case, but I thought maybe the GI would increment on a meta change....

written 9.1 years ago by Bio_X2Y (3.8k)
1
9.1 years ago, Pierre Lindenbaum (126k, France/Nantes/Institut du Thorax - INSERM UMR1087) wrote:

Yes, you're right: see http://www.ncbi.nlm.nih.gov/Sitemap/sequenceIDs.html

The two systems of identifiers run in parallel to each other. That is, when any change is made to a sequence, it receives a new GI number AND an increase to its version number.

1
9.1 years ago, Jarretinha (3.3k, São Paulo, Brazil) wrote:

Your observation is true. They run in parallel, but the GI system is older. So there are a lot of sequences tagged with version 1 that have many GIs in their history. For example, check the history of L42023 and its subsequences at the NCBI Sequence Revision History. You'll see many GI changes without an associated version number.

Besides that, GIs and versions change only when the sequence itself changes, not the annotations. Changes in annotations can be traced only by modification date. So, for tracking annotation changes, GIs and versions are mostly useless.
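The behaviour described in this thread can be pictured with a toy model. This is only a sketch under stated assumptions: `ToyRecord`, its methods, and the GI counter starting at 1000 are all invented for illustration and do not reflect how GenBank stores records. It also ignores the older records mentioned above, where the GI changed several times under version 1.

```python
from dataclasses import dataclass, field
from datetime import date
from itertools import count

# Hypothetical source of fresh GI numbers (real GIs are assigned by NCBI).
_next_gi = count(1000)


@dataclass
class ToyRecord:
    accession: str
    sequence: str
    annotation: str
    modified: date
    version: int = 1
    gi: int = field(default_factory=lambda: next(_next_gi))

    def update_sequence(self, new_sequence: str, when: date) -> None:
        """A sequence change gets a new GI AND a version increment."""
        if new_sequence != self.sequence:
            self.sequence = new_sequence
            self.version += 1
            self.gi = next(_next_gi)
            self.modified = when

    def update_annotation(self, new_annotation: str, when: date) -> None:
        """An annotation change only moves the modification date."""
        self.annotation = new_annotation
        self.modified = when


rec = ToyRecord("GL000191", "ACGT", "gene=abc", date(2010, 1, 1))
before = (rec.gi, rec.version)

rec.update_annotation("gene=xyz", date(2010, 2, 1))
print((rec.gi, rec.version) == before)  # identifiers unchanged by annotation edit

rec.update_sequence("ACGA", date(2010, 3, 1))
print(f"{rec.accession}.{rec.version}", rec.gi != before[0])
```

In this model each (accession, version) pair maps to exactly one GI, which is the one-to-one relationship asked about, while annotation edits are invisible to both identifiers.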
