Question: Why do PFAM alignments contain 'X'?
0
gravatar for traviata
9 months ago by
traviata0
traviata0 wrote:

I'm working with the PFAM family RNA_pol_Rpb2_1 (PF04563). I've discovered that some of the sequences in the full alignment I downloaded, including >G1N5G7_MELGA/1-331, contain the character X even though X is not a valid protein character.

Why is this? Is it related to some sequences being low-quality or are some sequences using an entirely different alphabet?

pfam • 305 views
ADD COMMENTlink modified 9 months ago by mbens100 • written 9 months ago by traviata0
2
gravatar for mbens
9 months ago by
mbens100
Germany
mbens100 wrote:

The nomenclature for amino acids includes 'X' for 'unknown amino acid' (IUPAC). According to Uniprot, second amino acid of G1N5G7_MELGA is unknown.

ADD COMMENTlink modified 9 months ago • written 9 months ago by mbens100
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1770 users visited in the last hour