Question: Why do PFAM alignments contain 'X'?
18 days ago by
traviata0 wrote:

I'm working with the PFAM family RNA_pol_Rpb2_1 (PF04563). I've discovered that some of the sequences in the full alignment I downloaded, including >G1N5G7_MELGA/1-331, contain the character X even though X is not a valid protein character.

Why is this? Is it related to some sequences being low-quality or are some sequences using an entirely different alphabet?

modified 18 days ago by mbens100 • written 18 days ago by traviata0
18 days ago by
mbens100 wrote:

The IUPAC nomenclature for amino acids includes 'X' for an unknown or unspecified amino acid, so X is a valid character in a protein sequence. According to UniProt, the second amino acid of G1N5G7_MELGA is unknown.
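
If you want to see which sequences in the alignment contain X and where, something like the following could work. It is only a minimal sketch: it assumes the alignment is in FASTA format (as the `>G1N5G7_MELGA/1-331` header suggests) with gaps written as `-` or `.`, and the function name `find_x_residues` and the toy sequences are made up for illustration, not taken from PF04563.

```python
def find_x_residues(fasta_text):
    """Map each sequence ID to the 1-based positions of 'X' residues.

    Positions count residues only (gap characters '-' and '.' are skipped),
    so they are relative to the start of the aligned fragment.
    """
    hits = {}
    seq_id = None
    residue_index = 0
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # New record: take the first token of the header as the ID.
            parts = line[1:].split()
            seq_id = parts[0] if parts else ""
            residue_index = 0
            continue
        if seq_id is None:
            continue  # ignore any text before the first header
        for char in line:
            if char in "-.":
                continue  # alignment gap, not a residue
            residue_index += 1
            if char.upper() == "X":
                hits.setdefault(seq_id, []).append(residue_index)
    return hits


# Toy alignment (hypothetical data, not real Pfam sequences).
example = """>seqA/1-5
MX-KLV
>seqB/1-5
MA.KLV
"""

print(find_x_residues(example))
```

Sequences without any X are simply absent from the returned dictionary. Because the positions are counted from the start of the fragment, add the start coordinate from the header minus one to get the position in the full protein.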

modified 17 days ago • written 18 days ago by mbens100