Getting the genomic positions for a set of protein sequences
1
0
Entering edit mode
8.7 years ago
ahmad • 0

Hi,

I have a fasta file of protein sequences belonging to a known species. My goal is to retrieve the genomic coordinates for each protein sequence. I'm using blastp where I have my protein sequences as query and searching against protein database of the known species. Blastp was successful in mapping those query sequences to known regions, however, it doesn't give any information about the genomic positions of those proteins.. Is it possible to do this using blastp? If not, could you recommend any tool to do that?

sequencing sequence protein • 1.5k views
ADD COMMENT
0
Entering edit mode

tblastn ? Or UCSC BLAT ?

ADD REPLY
0
Entering edit mode
8.7 years ago
glihm ▴ 660

Hi,

First question, are you working with command line or online requests?

Using tblastn from NCBI server, if there is a match well characterized you have ALL the genomic information clicking on the sequence ID (Which chromosome, region, position etc...). Otherwise, BioPython parsers allow you to recover this information in output files if the targeted sequences are described.

ADD COMMENT

Login before adding your answer.

Traffic: 3001 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6