Question: Entropy From Msa
0
10.0 years ago by
User 0063220
User 0063220 wrote:

Dear All

I'd like to write an algorithm to calculate Shannon entropy for an MSA. How should I handle gaps and fully conserved columns?


Hi! In order for the members of the forum to help you, you should define your problem better. Here are a few tips for writing better questions. 1) Describe what you are doing. For people to help you, they must have a good understanding of what you are trying to accomplish. 2) Describe what you have tried so far, so that you don't get answers telling you to do things you have already done. 3) Ask a specific question; avoid asking everything at the same time. 4) To get informative answers, write informative and well-formatted questions. Cheers!

2
10.0 years ago by
Alex1.5k
Theodosius Dobzhansky Center for Genome Bioinformatics
Alex1.5k wrote:

I'm not sure I understand your question.

By definition:

``````Shannon_column_entropy = - sum(px*log(px)), where px is the frequency of nucleotide x in the column.
MSA_entropy = sum(Shannon_column_entropies)
``````

With gaps you have two alternatives:

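1. Treat the gap as a fifth "nucleotide", then entropy = `-(pa*log(pa) + pc*log(pc) + pg*log(pg) + pt*log(pt) + p_*log(p_))`
2. Or skip gaps and compute the frequencies over the non-gap characters only, then entropy = `-(pa*log(pa) + pc*log(pc) + pg*log(pg) + pt*log(pt))`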

If Shannon_column_entropy = 0, the column is fully conserved: a single symbol has frequency 1, and 1*log(1) = 0.
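Here is a minimal Python sketch of both gap treatments. It is only an illustration: the function names, the `gaps_as_symbol` flag, the `-` gap character, the base-2 logarithm and the choice to return 0 for an all-gap column are my assumptions, not something fixed by the definition above.

```python
import math
from collections import Counter


def column_entropy(column, gaps_as_symbol=False, gap_char="-"):
    """Shannon entropy of one alignment column, in bits (log base 2 is an assumption)."""
    # Alternative 1 keeps gaps as an extra symbol; alternative 2 drops them
    # so that frequencies are computed over the remaining characters only.
    symbols = list(column) if gaps_as_symbol else [c for c in column if c != gap_char]
    total = len(symbols)
    if total == 0:
        # All-gap column with gaps skipped: returning 0 is an arbitrary choice here.
        return 0.0
    counts = Counter(symbols)
    # Symbols that never occur are absent from `counts`, so log(0) is never evaluated.
    return sum(-(n / total) * math.log2(n / total) for n in counts.values())


def msa_entropies(sequences, **kwargs):
    """Per-column entropies for a list of equal-length aligned sequences."""
    return [column_entropy(col, **kwargs) for col in zip(*sequences)]


# Toy alignment (made up for illustration).
msa = ["ACGT-", "ACGA-", "ACTAC", "AC-AC"]

skip_gaps = msa_entropies(msa)                       # alternative 2
with_gaps = msa_entropies(msa, gaps_as_symbol=True)  # alternative 1

# Columns with entropy 0 are the fully conserved ones.
conserved = [i for i, h in enumerate(skip_gaps) if h == 0]
msa_entropy = sum(skip_gaps)
```

In this toy alignment the first two columns come out as conserved under either treatment. The last column (`--CC`) is conserved only when gaps are skipped, which shows how much the gap convention can change the result.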

Also check the related question "Entropy From A Multiple Sequence Alignment With Gaps".

1
10.0 years ago by