Question: What is the '?' in the result of R package 'msa' function 'msa'?
Asked 9 months ago by YUCHEN CHANG (Melbourne):

Hi everyone,

I have recently been using the R package 'msa', and a few things about its output confuse me.

  1. As I understand it, the last line of the MSA output (labelled 'con') is the consensus sequence. According to the msa manual, a residue is reported as the consensus when its frequency is over 80% (by default). So I am confused by the question marks that appear in this line. Do they indicate positions where several candidate residues all occur at high frequency?

  2. Since the conservation scores (computed by the function 'msaConservationScore') differ somewhat from the consensus sequence, how can I tell which parts of the alignment actually match? My guess is that these are the regions where the consensus sequence shows a symbol rather than a hyphen (-). Is that right?

Thanks everyone!

Image-1


Conservation symbols in MSA output (Clustal-style) are generally assigned using the following scheme:

An * (asterisk) indicates positions that have a single, fully conserved residue.
A : (colon) indicates conservation between groups of strongly similar properties, roughly equivalent to scoring > 0.5 in the Gonnet PAM 250 matrix.
A . (period) indicates conservation between groups of weakly similar properties, roughly equivalent to scoring ≤ 0.5 and > 0 in the Gonnet PAM 250 matrix.
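
To make the scheme above concrete, here is a minimal Python sketch that assigns a symbol to each alignment column. It is not code from the msa package. The strong and weak residue groups are the ones I recall Clustal using to approximate the Gonnet PAM 250 cut-offs, so treat them as an assumption. The function names and the toy alignment are made up for illustration.

```python
# Residue groups assumed to match Clustal's "strong" and "weak" sets
# (recalled from the Clustal conventions; verify before relying on them).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def conservation_symbol(column):
    """Return '*', ':', '.' or ' ' for one alignment column (a string)."""
    residues = set(column.upper())
    if "-" in residues:
        return " "  # any gap in the column: no symbol
    if len(residues) == 1:
        return "*"  # single, fully conserved residue
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall inside one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall inside one weak group
    return " "      # no conservation by this scheme


def conservation_line(aligned):
    """Build the symbol line for a list of equal-length aligned sequences."""
    return "".join(conservation_symbol("".join(col)) for col in zip(*aligned))


# Toy alignment (hypothetical data, three sequences of length six).
toy = ["MKSCA-",
       "MRTSAC",
       "MKASWC"]
print(repr(conservation_line(toy)))
```

Note that none of these symbols is a question mark, which is why the ? in the 'con' line needs a separate explanation.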
  

The ? in this case appears to indicate that there is no discernible similarity among the residues at that position.
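
One way such a ? can arise is a threshold-based consensus: a letter is printed only if it is frequent enough in the column, and a placeholder is printed otherwise. As far as I know, Biostrings' consensusString() has an ambiguityMap argument that plays this role, but I have not checked exactly how msa computes its 'con' line. The Python sketch below only illustrates the idea. The function name, the threshold values, and the toy alignment are assumptions, not the msa implementation.

```python
from collections import Counter


def threshold_consensus(aligned, threshold=0.5, ambiguity="?"):
    """Per column, emit the most common symbol if its frequency reaches
    `threshold`; otherwise emit `ambiguity`. Gaps ('-') are counted like
    any other symbol, so a mostly-gap column yields '-'."""
    consensus = []
    for column in zip(*aligned):
        symbol, count = Counter(column).most_common(1)[0]
        if count / len(column) >= threshold:
            consensus.append(symbol)
        else:
            consensus.append(ambiguity)
    return "".join(consensus)


# Toy alignment (hypothetical data, four sequences of length four).
toy = ["MKTA",
       "MKSA",
       "MRGA",
       "M-CA"]
print(threshold_consensus(toy))                 # threshold of 50%
print(threshold_consensus(toy, threshold=0.8))  # stricter threshold of 80%
```

Under this reading, a ? does not mean that several residues are highly frequent. It means that no single symbol reached the threshold at that position, while a hyphen in the consensus means that the gap itself was the dominant symbol.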

Written 9 months ago by genomax