Question: Getting a gene sequence from NCBI with BioJava
5.8 years ago by
Bioaln340 wrote:

Hello. I've been working on sequence parsing lately and I can't seem to download a gene sequence from NCBI. My existing code gives me the gene name (for example TGFB1). What I am trying to accomplish is to use Java code to fetch the sequence for that gene. I've been trying the geneRICH class in BioJava, but it doesn't seem to have that option; it only accepts an accession number or a GenBank ID.


Thanks for any help.

identifiers biojava • 2.3k views
5.8 years ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum129k wrote:

You need to find the sequences associated with this gene (e.g. RefSeq sequences) using the NCBI E-utilities. See, for example: Get Fasta File With Protein Sequences Given Entrez Gene Ids
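A minimal sketch of that lookup in plain Java (11 or later), assuming the gene symbol should be searched directly in the nuccore database and restricted to RefSeq records. The class name `GeneToFasta`, the search term, the `retmax` value and the sample IDs are illustrative assumptions, not something taken from this thread. ESearch returns a list of record IDs, and EFetch turns those IDs into FASTA.

```java
import java.io.ByteArrayInputStream;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.NodeList;

/** Sketch: gene symbol -> nuccore IDs (ESearch) -> FASTA (EFetch). */
final class GeneToFasta {
    static final String EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/";

    // Made-up ESearch response (placeholder IDs), used when no argument is given.
    static final String SAMPLE =
            "<eSearchResult><Count>2</Count><IdList><Id>111</Id><Id>222</Id></IdList></eSearchResult>";

    /** Builds the ESearch URL. The query terms are an assumption; adjust as needed. */
    static String esearchUrl(String geneSymbol, String organism) {
        String term = geneSymbol + "[Gene Name] AND " + organism
                + "[Organism] AND srcdb_refseq[PROP]";
        return EUTILS + "esearch.fcgi?db=nuccore&retmax=20&term="
                + URLEncoder.encode(term, StandardCharsets.UTF_8);
    }

    /** Extracts the text of every <Id> element from an ESearch XML response. */
    static List<String> parseIds(String esearchXml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            // Xerces feature understood by the JDK's built-in parser:
            // do not download the DTD named in the response's DOCTYPE.
            factory.setFeature(
                    "http://apache.org/xml/features/nonvalidating/load-external-dtd", false);
            NodeList nodes = factory.newDocumentBuilder()
                    .parse(new ByteArrayInputStream(esearchXml.getBytes(StandardCharsets.UTF_8)))
                    .getElementsByTagName("Id");
            List<String> ids = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                ids.add(nodes.item(i).getTextContent().trim());
            }
            return ids;
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse ESearch response", e);
        }
    }

    /** Builds the EFetch URL that returns the given records as FASTA text. */
    static String efetchFastaUrl(List<String> ids) {
        return EUTILS + "efetch.fcgi?db=nuccore&rettype=fasta&retmode=text&id="
                + String.join(",", ids);
    }

    /** Plain HTTP GET that returns the response body as a string. */
    static String httpGet(String url) {
        try {
            HttpRequest request = HttpRequest.newBuilder(URI.create(url)).build();
            return HttpClient.newHttpClient()
                    .send(request, HttpResponse.BodyHandlers.ofString()).body();
        } catch (Exception e) {
            throw new IllegalStateException("Request failed: " + url, e);
        }
    }

    public static void main(String[] args) {
        // With a gene symbol as argument the sketch goes online; without one it
        // only parses SAMPLE and prints the EFetch URL it would have requested.
        boolean online = args.length > 0;
        String xml = online ? httpGet(esearchUrl(args[0], "Homo sapiens")) : SAMPLE;
        String fastaUrl = efetchFastaUrl(parseIds(xml));
        System.out.println(online ? httpGet(fastaUrl) : fastaUrl);
    }
}
```

Running it with `TGFB1` as the only argument would query NCBI. A search may return no IDs or several unrelated records, so the ID list probably needs filtering before it is passed to EFetch.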


Furthermore, BioJava is not really needed to fetch the sequence. You can use xjc to generate the classes from the DTD:

xjc -dtd ""
parsing a schema...
compiling a schema...


and use those classes to parse the XML returned by an NCBI E-utilities efetch URL.

see (old!)
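As a rough illustration of parsing an efetch response, the sketch below does not use the xjc-generated classes (their names depend on the DTD). It swaps in the StAX parser that ships with the JDK instead. It assumes the response was requested with `rettype=fasta&retmode=xml`, which should return NCBI's TinySeq XML with `TSeq_accver` and `TSeq_sequence` elements. The class name `TinySeqParser` and the sample record are made up for the example.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

/** Sketch: read accession and sequence pairs out of TinySeq XML with StAX. */
final class TinySeqParser {
    // Made-up record (placeholder accession and sequence), for offline use only.
    static final String SAMPLE =
            "<TSeqSet><TSeq>"
            + "<TSeq_accver>XX_000001.1</TSeq_accver>"
            + "<TSeq_defline>hypothetical example record</TSeq_defline>"
            + "<TSeq_sequence>ACGT</TSeq_sequence>"
            + "</TSeq></TSeqSet>";

    /** Returns accession.version -> sequence, in document order. */
    static Map<String, String> parse(String xml) {
        Map<String, String> sequences = new LinkedHashMap<>();
        try {
            XMLInputFactory factory = XMLInputFactory.newInstance();
            // Skip DTD processing so the DOCTYPE in a real response is not fetched.
            factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
            XMLStreamReader reader = factory.createXMLStreamReader(new StringReader(xml));
            String accession = null;
            while (reader.hasNext()) {
                if (reader.next() != XMLStreamConstants.START_ELEMENT) {
                    continue;
                }
                String name = reader.getLocalName();
                if (name.equals("TSeq_accver")) {
                    accession = reader.getElementText();
                } else if (name.equals("TSeq_sequence")) {
                    // Assumes TSeq_accver precedes TSeq_sequence within each record.
                    sequences.put(accession, reader.getElementText());
                    accession = null;
                }
            }
            reader.close();
        } catch (XMLStreamException e) {
            throw new IllegalStateException("Could not parse TinySeq XML", e);
        }
        return sequences;
    }

    public static void main(String[] args) {
        parse(SAMPLE).forEach((acc, seq) -> System.out.println(">" + acc + "\n" + seq));
    }
}
```

A streaming parser keeps memory use flat for long sequences. The generated JAXB classes would instead give typed access to every field of the record, at the cost of an extra build step.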




Wow, thanks for the thorough answer. I will look into those possibilities.

written 5.8 years ago by Bioaln340