Use a BioProject ID from an Excel file to align the corresponding sequence against 5 other sequences separately, extract the important information, and repeat this for 5000 sequences
8.2 years ago
chups519 ▴ 10

Hey

I'm working on a project where I have to take a BioProject ID from an Excel file and compare the corresponding sequence (a protein sequence, written in amino acids rather than nucleotides) with 5 other sequences (a multiple alignment, which you can do manually with this database: http://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome). From the results I have to pick out useful information such as 'description', 'e-value' and 'accession number' and put it in another Excel file.
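To make the extraction step concrete, here is a minimal sketch of what I mean, not a working pipeline. It assumes the BLAST result for one query has already been saved in NCBI's XML output format, and it pulls the three fields out of the `Hit_def`, `Hit_accession` and `Hsp_evalue` elements. The function names `top_hits` and `rows_to_csv` are made up for illustration, keeping 5 hits per query is my assumption, and CSV is used as a stand-in for the second Excel file because Excel can open it directly.

```python
import csv
import io
import xml.etree.ElementTree as ET


def top_hits(blast_xml, max_hits=5):
    """Return description, accession and e-value for the first hits in a BLAST XML report.

    max_hits=5 is an assumption based on comparing against 5 sequences.
    """
    root = ET.fromstring(blast_xml)
    rows = []
    for hit in root.iter("Hit"):
        # A hit can contain several HSPs; keep the smallest (best) e-value.
        evalues = [float(e.text) for e in hit.iter("Hsp_evalue")]
        rows.append({
            "description": hit.findtext("Hit_def"),
            "accession": hit.findtext("Hit_accession"),
            "e_value": min(evalues) if evalues else None,
        })
        if len(rows) == max_hits:
            break
    return rows


def rows_to_csv(query_id, rows):
    """Format the hits as CSV text, one line per hit, with the query ID first."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["query_id", "description", "accession", "e_value"])
    for r in rows:
        writer.writerow([query_id, r["description"], r["accession"], r["e_value"]])
    return out.getvalue()
```

Reading the IDs from the Excel file and actually running the search are left out here, since those are the parts I don't know how to do yet.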

I have only just started learning to program, because I never had to before this project, and I'm a bit lost. Can any of you help me with this, please? Note that I have to write a script that can do this for 5000 different BioProject IDs, and I can't do any of these steps by hand.
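For the 5000-ID part, this is roughly the kind of loop I imagine, as a sketch under assumptions rather than something I know to be right. `run_batch` and the `search` argument are hypothetical names: `search` stands for whatever function ends up fetching the sequence for one ID and returning its hits as (description, accession, e-value) tuples. The loop appends to a CSV file as it goes, skips IDs that are already done, and records failures instead of stopping, so a long run could be restarted.

```python
import csv


def run_batch(ids, search, out_path, done=()):
    """Call search(id) for each ID and append the returned hits to a CSV file.

    `search` is a hypothetical function supplied by the caller; it must return
    an iterable of (description, accession, e_value) tuples for one ID.
    Returns a list of (id, error message) pairs for the IDs that failed.
    """
    done = set(done)
    failed = []
    with open(out_path, "a", newline="") as fh:
        writer = csv.writer(fh)
        for bioproject_id in ids:
            if bioproject_id in done:
                continue  # already processed in an earlier run
            try:
                hits = search(bioproject_id)
            except Exception as err:
                # Record the failure and carry on with the next ID.
                failed.append((bioproject_id, str(err)))
                continue
            for description, accession, e_value in hits:
                writer.writerow([bioproject_id, description, accession, e_value])
            fh.flush()  # keep partial results on disk if the run is interrupted
    return failed
```

The failed IDs returned at the end could then be retried in a second pass.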

Even if you can't help me, thank you in advance for your attention.

sequencing gene blast genome • 1.7k views