Question: ORFs from blastx results
0
gravatar for giulia.zancolli
2.5 years ago by
giulia.zancolli0 wrote:

Hello, I would need to get the ORFs sequences both nucleotides and translated (aa) from blastx hits. I know transDecoder generate .cds and .pep files corresponding to nucleotide and aminoacids fasta files, but for some transcripts the ORFs found by TransDecoder are not correct whereas blastx correctly identified the ORFs. Thus, I would rather use the ORFs found by blastx than TransDecoder. However, I cannot find a way to get the ORFs sequences, both nucleotide and amino acids, from the blastx results! Is there a way to do that? Thank you! Giulia

ADD COMMENTlink modified 2.5 years ago by abascalfederico1.1k • written 2.5 years ago by giulia.zancolli0
2

based on your keyword tags and assuming intronless coding data, you can enable the qseq in output format and then translate. here are previously asked similar questions, might be helpful 1, 2

ADD REPLYlink written 2.5 years ago by Prasad1.5k

Hi Prasad, Thank you for the help, I didn't notice the qseq option in the blastx huge list!

Giulia

Dr Giulia Zancolli Molecular Ecology and Fisheries Genetics Lab School of Biological Sciences Bangor University Environment Centre Wales Bangor LL57 2UW UK

Web: http://mefgl.bangor.ac.uk/staff/zancolli.php

ADD REPLYlink written 2.5 years ago by giulia.zancolli0
0
gravatar for abascalfederico
2.5 years ago by
abascalfederico1.1k
Spain
abascalfederico1.1k wrote:

I don't think there is a way to obtain translations from BlastX. Blastx translates a nucleotide query sequence in the six possible frames and compares them to protein sequences. The query sequence can be a fragment of a genome, a gene (with introns) or a CDS. Hence, Blastx does not know how to translate your sequences, it can only tell you which virtual translations give significant hits with proteins.

What kind of sequences are you searching for? If you are searching CDS (no introns) you can identify the correct frame in BlastX output and search for the initiating Met. ORFinder (https://www.ncbi.nlm.nih.gov/orffinder/) can be of help here.

Another thing you can do is to download the Blastx output and write a script to extract the translations that hit a protein sequence.

Hope this helps, Federico

ADD COMMENTlink written 2.5 years ago by abascalfederico1.1k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1146 users visited in the last hour