Get cleaved SARS-CoV-2 protein sequences (all major variants)?
Entering edit mode
3 days ago
ngarber • 0

I have an algorithm for assessing the presence of a degenerate motif in lists of protein sequences, and I would like to run it on SARS-CoV-2 proteins.

However, some SARS-CoV-2 proteins are cleaved post-translation to make multiple functional proteins. Is there a way to get the post-cleavage list of protein sequences?

Ideally I'd like this to be for all proteins from all major variants, but not have a gargantuan amount of data. There are almost a million sequenced SARS-CoV-2 genomes now, and that's more than my poor old laptop can handle! So if there was a way to, say, get reference consensus sequences (like RefSeq, except RefSeq doesn't have variants) for the major variants (alpha, delta, omicron, omicron subvariants BA.4 and BA.5, etc.), that would be really helpful.

I'm working in Python, so I can either download through Python or use a website, either is fine. Thanks!

motif variant Python SARS-CoV-2 cleavage • 164 views
Entering edit mode

Not sure if this is what you are looking for but you can get protein sequences from EBI Covid Portal:

There is also NCBI's SARS protein portal section:,%20taxid:2697049


Login before adding your answer.

Traffic: 1118 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6