Calculating classical bioinformatics features from protein sequences
2
0
Entering edit mode
21 months ago

Hi all,

I hope this finds you well

I was wondering if you are familiar of any integrated tool that I could use to calculate bioinformatics features from protein sequences e.g. physicochemical properties (e.g. disorder and secondary structure features), amino acid composition ... There are multiple tools to calculate features individually however I am looking for a tool that I can use to calculate many features at once rather than relying on each tool separately.

I am asking this in the context of an assessment work studying how language models perform at representing proteins in comparison with representing proteins using classical features.

Any suggestions/comments would be much appreciated.

Thanks,
Sergio

proteins • 751 views
ADD COMMENT
1
Entering edit mode
21 months ago
Jiyao Wang ▴ 370

For SwissProt sequences, you can use iCn3D Node.js script to output sequence-based annotations (such as Conserved Domains, SNP, ClinVar, etc) and structure-based annotations (such as 3D domains). The example script "annotation.js" is listed at https://github.com/ncbi/icn3d/tree/master/icn3dnode. This script retrieves all annotations from iCn3D UI: https://structure.ncbi.nlm.nih.gov/icn3d/share.html?LRpYHQU55gi5XUwb7 enter image description here

ADD COMMENT
0
Entering edit mode
21 months ago
Jeremy ▴ 910

Check out Expasy ProtParam. Given an amino acid sequence, it returns 14 parameters, including molecular weight, isoelectric point, and amino acid composition.

Expasy ProtParam

ADD COMMENT

Login before adding your answer.

Traffic: 1738 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6