Swiss-Prot Id check
2
0
Entering edit mode
4.0 years ago
trumbia ▴ 10

Hello,

I have huge file of protein IDs and I need to check them if they are swiss-prot ids? Do you have any suggestion for me?

Thank you.

Protein Swiss-Prot • 1.7k views
ADD COMMENT
1
Entering edit mode

Please list a few of the IDs that you have. Thank you.

ADD REPLY
0
Entering edit mode

Yes, well, they appear to be UniProt; however, the species are mixed, as I see human, mouse, and rat IDs. What do you need to do with these?; from where did you obtain them? Thank you!

ADD REPLY
0
Entering edit mode

Ah! Mensur answered! See below.

ADD REPLY
4
Entering edit mode
4.0 years ago
GenoMax 141k

You could use UniProt ID converter. Paste your ID's and map.

ADD COMMENT
2
Entering edit mode
4.0 years ago
Mensur Dlakic ★ 27k

This is a crude way, but it should work.

Download and unpack Swiss-Prot:

wget ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_sprot.fasta.gz

gunzip uniprot_sprot.fasta.gz

Save your IDs into a file, say ID.txt. I did so for the 5 ID numbers you provided. Then simply (f)grep your ID file against the database, and either count the hits or save them in a file.

Counting:

fgrep -f ID.txt uniprot_sprot.fasta | wc -l

It counts 3 matches.

Printing the output:

fgrep -f ID.txt uniprot_sprot.fasta

>sp|Q8C050|KS6A5_MOUSE Ribosomal protein S6 kinase alpha-5 OS=Mus musculus OX=10090 GN=Rps6ka5 PE=1 SV=2
>sp|P47811|MK14_MOUSE Mitogen-activated protein kinase 14 OS=Mus musculus OX=10090 GN=Mapk14 PE=1 SV=3
>sp|Q15672|TWST1_HUMAN Twist-related protein 1 OS=Homo sapiens OX=9606 GN=TWIST1 PE=1 SV=1
ADD COMMENT
0
Entering edit mode

Thank you for your help.

ADD REPLY

Login before adding your answer.

Traffic: 2905 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6