Protein Sequence Analyses
2
0
Entering edit mode
4.1 years ago
l.souza ▴ 70

Hello everybody,

I have ~2000 sequences of a capsid protein from different serotypes of a virus. I aligned them to check the identity, and now I wanna study the protein properties, such aa composition, isoelectric point, molecular weight, GRAVY, instability index, etc. Should I analyze each of the sequences I have, or I can use a consensus one? (If using the consensus, what is the best software to get it?)

I really appreciate any help!

protein sequence consensus • 1.5k views
2
Entering edit mode
4.1 years ago

Have you tried the ProtParam tool, http://web.expasy.org/protparam ?

0
Entering edit mode

Yes! I usually use this when analyzing physical properties of proteins...

1
Entering edit mode
4.1 years ago
Joe 19k

I don't think there's a simple answer to this, and I'd encourage you to think about what you actually want to know about the protein. It's all well and good to say that the pI of your capsid protein is x, but is that useful? Do you care about minute fluctuations in pI which you would almost certainly lose by taking a consensus?

A consensus is useful in certain situations but not others.

#### pI:

Propka can give you pI for your sequences, and if you have that many, you might want to run just a consensus through - I'd say this only makes sense if your 2000 sequences are already pretty similar however.

#### GRAVY etc:

I believe CodonW will output a lot of parameters, including GRAVY and AA composition, but I'm not super familiar with the program.

#### Consensus sequences:

Consensus sequences can be gotten out of BioPython (I've actually been writing this myself recently. It's what they call a 'dumb' consensus though, it doesn't have many configurable parameters. See http://biopython.org/DIST/docs/tutorial/Tutorial.html#htoc301

from Bio.Align import AlignInfo
from Bio import AlignIO

alignment = AlignIO.read('~/path/to/alignment.aln', 'format') # where format is a supported type (see BioPython docs)
summary_align = AlignInfo.SummaryInfo(alignment)
consensus = summary_align.dumb_consensus()