How to query the nr database using a list of GI numbers to get XML files?
8.3 years ago
grayapply2009 ▴ 280

I have a list of GI or accession numbers. I want to query the nr database to get XML files for annotation purposes. How do I do this?

xml annotation blast gi • 2.6k views

Search this site for: NCBI efetch
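
To make the efetch pointer a bit more concrete, here is a minimal Python sketch that builds an NCBI E-utilities efetch request for a list of identifiers. The function names and the example identifiers are made up for illustration, and the rettype=gp / retmode=xml combination is my assumption for getting GBSeq XML from the protein database, so please check it against the E-utilities documentation. Also note that this returns sequence records in XML, which is not the same thing as BLAST XML output.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Base address of the NCBI E-utilities efetch service.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(ids, db="protein", rettype="gp", retmode="xml"):
    """Return an efetch URL for a list of GI or accession numbers.

    The default rettype/retmode values are assumptions; adjust as needed.
    """
    # Drop empty entries and surrounding whitespace (e.g. from a text file).
    cleaned = [str(i).strip() for i in ids if str(i).strip()]
    if not cleaned:
        raise ValueError("no identifiers given")
    params = {
        "db": db,
        "id": ",".join(cleaned),
        "rettype": rettype,
        "retmode": retmode,
    }
    return EFETCH_URL + "?" + urlencode(params)


def fetch_xml(ids, out_path):
    """Download the records and write them to out_path (needs network access)."""
    with urlopen(build_efetch_url(ids)) as response, open(out_path, "wb") as handle:
        handle.write(response.read())
```

Something like fetch_xml(["P12345", "Q9Y6K9"], "records.xml") would then write the records to a file. For long lists it is probably safer to send the identifiers in smaller batches and to pause between requests.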

8.3 years ago

The nr database is a collection of protein sequences. Are you interested in extracting the protein sequences associated with your list of GIs? If so, you can use Batch Entrez (http://www.ncbi.nlm.nih.gov/sites/batchentrez). From the drop-down list, select Protein, upload your list, and click Retrieve.


I'm trying to use Blast2GO to annotate my sequences. Blast2GO accepts XML files from a local BLAST search against nr. I'm just wondering whether it is possible to get those XML files with just GI numbers.


Hi @grayapply2009

I am still not very clear on what you want to do. Did you run BLAST locally? What are the GI or accession numbers you refer to in the first post?

If you are running BLAST on the command line, you can generate the output as XML using the -outfmt 5 parameter. You can then feed that output to Blast2GO.

Also, nr is not a good source of GO annotations. Perhaps start with a well-annotated database, such as Swiss-Prot or TrEMBL.
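
As a rough illustration of the -outfmt 5 suggestion, here is a small Python sketch that assembles a BLAST+ command line producing XML output. It assumes protein queries (hence blastp) and a locally formatted database; the function names, file names, database name, and the E-value cutoff are all placeholders I picked for the example, not anything from the original post.

```python
import subprocess


def build_blastp_command(query_fasta, db_name, out_xml, evalue=1e-5):
    """Return the argument list for a blastp run that writes XML (-outfmt 5)."""
    return [
        "blastp",
        "-query", query_fasta,   # FASTA file with the sequences to annotate
        "-db", db_name,          # name of a local BLAST database
        "-out", out_xml,         # XML file to hand to Blast2GO afterwards
        "-outfmt", "5",          # 5 = BLAST XML output
        "-evalue", str(evalue),  # hypothetical cutoff, tune to your data
    ]


def run_blastp(query_fasta, db_name, out_xml):
    """Run blastp; requires BLAST+ to be installed and on the PATH."""
    subprocess.run(build_blastp_command(query_fasta, db_name, out_xml), check=True)
```

For example, run_blastp("my_proteins.fasta", "swissprot", "results.xml") should leave an XML file that can be loaded into Blast2GO, provided the database has been set up locally.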


Thanks, my friend. I'll try that.

8.3 years ago

If you are annotating transcriptomics data, see the very helpful Trinotate documentation: https://trinotate.github.io/


That's a good one, thanks.
