predicted protein and sequence
4.3 years ago
mdfardin374 ▴ 10

Hello all, I have a protein database for an organism downloaded from NCBI, and almost all the proteins have the prefix XP_. Does this mean that all proteins with the XP_ prefix are predicted and have no experimental evidence?

sequence • 841 views
4.3 years ago
GenoMax 141k

From this FAQ entry at NCBI:

Accession numbers that begin with the prefix XM_ (mRNA), XR_ (non-coding RNA), and XP_ (protein) are model RefSeqs produced either by NCBI’s genome annotation pipeline or copied from computationally annotated submissions to the INSDC. These RefSeq records are derived from the genome sequence and have varying levels of transcript or protein homology support. They represent the predicted transcripts and proteins annotated on the NCBI RefSeq contigs and may differ from INSDC mRNA submissions or from the subsequently curated RefSeq records (with NM_, NR_, or NP_ accession prefixes). These differences may reflect real sequence variation (polymorphism), or errors or gaps in the available genome sequence. The support for model RefSeq records should be further evaluated by comparing them to other sequence information available in Gene, Related Sequences, and BLAST reports.
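
To see how many records in a downloaded protein FASTA are model (XP_) versus curated (NP_), something like the sketch below could be used. It relies only on the prefix meanings given in the FAQ text above. The function names, the assumption that the accession is the first token after `>`, and the example headers are all hypothetical, so adjust them to your file. Any prefix not listed in the FAQ excerpt is reported as "other".

```python
from collections import Counter

# Prefix meanings taken from the NCBI FAQ text quoted above.
REFSEQ_PREFIXES = {
    "XM_": "model mRNA",
    "XR_": "model non-coding RNA",
    "XP_": "model protein",
    "NM_": "curated mRNA",
    "NR_": "curated non-coding RNA",
    "NP_": "curated protein",
}


def classify_accession(accession):
    """Return the RefSeq category for an accession, or 'other' if the prefix is not listed."""
    return REFSEQ_PREFIXES.get(accession[:3], "other")


def count_categories(fasta_lines):
    """Tally RefSeq categories over FASTA header lines (lines starting with '>')."""
    counts = Counter()
    for line in fasta_lines:
        if line.startswith(">"):
            # Assumption: the accession is the first whitespace-delimited token after '>'.
            tokens = line[1:].split()
            if tokens:
                counts[classify_accession(tokens[0])] += 1
    return counts


# Hypothetical headers for illustration only; these are not real records.
example = [
    ">XP_000000001.1 example protein A",
    "MKT",
    ">XP_000000002.1 example protein B",
    "MAA",
    ">NP_000000003.1 example protein C",
    "MGG",
]
print(count_categories(example))
```

For a real file, pass the open file handle instead of the list, e.g. `count_categories(open("proteins.faa"))`, where `proteins.faa` is a placeholder name. A high XP_ count only tells you the records are model RefSeqs; as the FAQ notes, the level of homology support varies and should be checked per record.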
