5.5 years ago
biobio
I'm a Python beginner, and I'd like to find a really short protein sequence in UniProt data.
I have opened the file like this:
fastafile = open('/Users/desktop/uniprot_sprot.fasta','r')
read1 = fastafile.readlines()
protdict = {}
for i in range(0,len(read1),2):
    protdict[read1[i]] = read1[i+1]
I want to find out whether the data contains a sequence that matches my short sequence and, if it does, the name of that sequence. Please help!! I would really appreciate it.
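One thing to watch for: the loop above assumes every record is exactly two lines (header, then sequence), but FASTA files usually wrap each sequence over several lines, so the pairs can drift out of step. Here is a minimal sketch of one way to handle that, assuming the goal is to list the names of all records whose sequence contains a given short query. The helper names `read_fasta` and `find_matches` and the example records are made up for illustration; they are not real UniProt entries.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines.

    Sequence lines belonging to one record are joined together, so
    wrapped (multi-line) sequences are handled.
    """
    header = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    # Emit the final record.
    if header is not None:
        yield header, "".join(chunks)


def find_matches(lines, query):
    """Return the headers of all records whose sequence contains query."""
    return [header for header, seq in read_fasta(lines) if query in seq]


# Hypothetical example records, wrapped over two lines on purpose.
example = [
    ">sp|X00001|DEMO1_TEST hypothetical record one\n",
    "MKTAYIAKQR\n",
    "QISFVKSHFS\n",
    ">sp|X00002|DEMO2_TEST hypothetical record two\n",
    "GGGGSGGGGS\n",
]

hits = find_matches(example, "KQRQIS")  # query spans the line break
print(hits)
```

With the real file, the same function can be fed an open file handle, e.g. `with open(path) as fh: hits = find_matches(fh, query)`, which also avoids loading everything with `readlines()`. If an exact match is wanted rather than a substring match, `query == seq` can replace `query in seq`. Biopython's `SeqIO.parse(handle, "fasta")` is another option if installing a library is acceptable.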
readlines
Without Python: linearize the FASTA file (one record per line), then sort on sequence length.