Amino Acid - Unknown Symbol
13.0 years ago
Allen Alger ▴ 20

The following amino acid sequence was published in a research paper I'm studying. It contains a ")" about three-quarters of the way through. What does this ")" mean?

This is what it looks like; the ")" is in the next-to-last line.

1 SUMO3-11Arg PTD – SODMLS - MatureTFAM
MGHHHHHHGGMSEEKPKEGVKTENDHINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQGLS
MRQIRFRFDGQPINETDTPAQLEMEDEDTIDVFQQQTGGRRRRRRRRRRRGEGDIMG
EWGNEIFGAI AGFLGGE MLSRAVCGTSR QLPPVLGYLGSRQ SSVLASCPKKPVSSYLR
FSKEQLPIFK AQNPDAKTTELIRRIAQRWR ELPDSKKKIYQDAYRAEWQVYKEEISRFKE
QLTPSQIM SLEKEIMD KHLKRKAM TKKKELTLLGKPKRPRSAYN VYVAERFQEA
KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN EMKSWEEQ MIEVGRKD
LLRRTIKKQR KYGAEEC) KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN
EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC

Allen

amino-acids sequence protein • 2.8k views

Please contact the authors, ask them what it is about, and let us know the answer here!

13.0 years ago
Neilfws 49k

Unless the authors have defined the symbol elsewhere in the paper, it means that the journal editors are not very good at proof-reading. The ")" has no standard meaning in this context and may be a typographical error.

EDIT: Having looked at the paper (this one I assume?), I see the authors have used all kinds of odd notation in describing the sequence. However, the ")" symbol still seems oddly out of place.

13.0 years ago
Woa ★ 2.9k

The stretch 'KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC' occurs both before and after the ")" symbol, so the ")" could just be a printing error. BLAST the protein (omitting the parenthesis) against UniProt and you'll see that the first hit contains no such repeat.
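
To submit the printed sequence to BLAST without the stray symbol, the spaces and the ")" have to be stripped first. The following is only a minimal sketch: the function name `clean_sequence` and the FASTA header are made up for illustration, and it assumes that only the 20 standard one-letter codes count as residues (so ambiguity codes such as X, B or Z would also be flagged).

```python
# Tail of the published sequence as printed in the question (spaces and ")" kept).
published_tail = (
    "KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN EMKSWEEQ MIEVGRKD "
    "LLRRTIKKQR KYGAEEC) KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN "
    "EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC"
)

# Assumption: only the 20 standard one-letter amino acid codes are valid residues.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def clean_sequence(raw):
    """Return (residues only, list of (index, char) for anything unexpected).

    Whitespace is dropped silently; any other non-residue character is
    dropped too, but reported so it can be checked against the paper.
    """
    residues = []
    unexpected = []
    for index, char in enumerate(raw):
        if char.isspace():
            continue
        if char.upper() in STANDARD_RESIDUES:
            residues.append(char.upper())
        else:
            unexpected.append((index, char))
    return "".join(residues), unexpected


cleaned, unexpected = clean_sequence(published_tail)
print(">query_without_stray_symbols")  # hypothetical FASTA header
print(cleaned)
print("Flagged characters:", unexpected)
```

Reporting the flagged characters, rather than deleting them silently, keeps a record of what was removed before the search.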


After reading the literature, it looks like this is a chimeric protein sequence made of fused domains and linkers. So the ')' seems to mark a domain boundary.

13.0 years ago
Joseph Hughes ★ 3.0k

A quick protein BLAST of the sequence shows that beyond the bracket, the BLAST score is below 200. This is probably the result of some sort of typo, as the last 146 aa are in fact a 73-residue stretch repeated twice:

KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC)

KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC

Looking at the full-length coding sequence of the top BLAST match, there is no such repeat:

MAFLRSMWGVLTALGRSGAELCTGCGSRLRSPFSFAYLPRWFSSVLASCPKKPVSSYLRFSKEQLPIFKA QNPDAKTTELIRRIAQRWRELPDSKKKIYQDAYRAEWQVYKEEISRFKEQLTPSQIMSLEKEIMDKHLKR KAMTKKKELTLLGKPKRPRSAYNVYVAERFQEA KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETR YHN EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC
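
The repeat can also be checked without BLAST by counting how often the stretch occurs in each sequence. This is a rough sketch using only the C-terminal ends quoted in this thread; the variable names and the `strip_layout` helper are illustrative, and `str.count` counts non-overlapping exact matches only, so it would miss a repeat containing even a single substitution.

```python
def strip_layout(text):
    """Remove whitespace and the stray ')' so only residue letters remain."""
    return "".join(text.replace(")", "").split())


# The stretch that appears on both sides of the ")" in the paper.
repeat_unit = strip_layout(
    "KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN "
    "EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC"
)

# C-terminal end of the published construct, as printed in the question.
published_end = strip_layout(
    "VYVAERFQEA KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN EMKSWEEQ MIEVGRKD "
    "LLRRTIKKQR KYGAEEC) KGDSPQEKLK TVKENWKNLS DSEKELYIQH AKEDETRYHN "
    "EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC"
)

# C-terminal end of the top BLAST match, as quoted above.
top_hit_end = strip_layout(
    "KAMTKKKELTLLGKPKRPRSAYNVYVAERFQEA KGDSPQEKLK TVKENWKNLS DSEKELYIQH "
    "AKEDETR YHN EMKSWEEQ MIEVGRKD LLRRTIKKQR KYGAEEC"
)

print("Repeat unit length:", len(repeat_unit))
print("Copies in published construct:", published_end.count(repeat_unit))
print("Copies in top BLAST match:", top_hit_end.count(repeat_unit))
```

Two copies in the published construct against a single copy in the database match is consistent with the duplication being a typesetting error rather than a feature of the protein.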
