I need to download a fasta file from uniprot, containing a list of the protein sequences of every secretory protein described to date with 200 or less amino acids.
Can somebody help me to filter these proteins and generate a list with IDs and sequences?
I'm trying to filter the proteins on the website, but I'm not finding how to do it. Every known secretory protein, from all species.
How about: https://www.uniprot.org/uniprotkb?dir=descend&facets=length:[1%20TO%20200]&query=secretory&sort=length
If you are happy with this result, click on "Download" and make sure to "choose all" to get the fasta file.
Subset from Bacteria can be obtained using this --> LINK
Subtract the result from above to get non-bacterial sequences.