Question

Is there a way to translate amino acid sequences into atomic sequences?

0

8.0 years ago

Naresh ▴ 60

Hi, is there any software that will translate an amino acid sequence into an atomic sequence?

Please help me.

Kind regards Naresh

sequence • 3.5k views

ADD COMMENT • link updated 8.0 years ago by Ibrahim Tanyalcin ★ 1.2k • written 8.0 years ago by Naresh ▴ 60

3

What are "Atomic sequences"?

ADD REPLY • link 8.0 years ago by GenoMax 141k

0

I mean at the atomic level, i.e. elements such as C, N, O, S, H.

ADD REPLY • link 8.0 years ago by Naresh ▴ 60

0

So you want to convert MCNSSC (a peptide) into something like C230 N30 H200 O23 S34 (the numbers in this example are nonsense, but the idea is to add up all the C's, N's, S's, O's, etc.)?
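A minimal sketch of that "adding up" idea, under stated assumptions: the lookup table `FREE_AA` and the function `peptide_composition` are hypothetical names, the table only covers the four residue types in MCNSSC, and the peptide is treated as linear, neutral, and free of disulfide bridges. Each peptide bond is assumed to release one water molecule (2 H, 1 O).

```python
from collections import Counter

# Hypothetical lookup table: element counts of the free, neutral amino acids
# (only the residues that occur in the example peptide MCNSSC).
FREE_AA = {
    "M": {"C": 5, "H": 11, "N": 1, "O": 2, "S": 1},  # methionine
    "C": {"C": 3, "H": 7, "N": 1, "O": 2, "S": 1},   # cysteine
    "N": {"C": 4, "H": 8, "N": 2, "O": 3},           # asparagine
    "S": {"C": 3, "H": 7, "N": 1, "O": 3},           # serine
}

def peptide_composition(peptide):
    """Sum element counts over a linear peptide, removing one H2O per peptide bond."""
    total = Counter()
    for aa in peptide:
        total.update(FREE_AA[aa])  # raises KeyError for residues not in the table
    bonds = max(len(peptide) - 1, 0)
    total["H"] -= 2 * bonds
    total["O"] -= bonds
    return dict(total)

print(peptide_composition("MCNSSC"))
```

Under these assumptions MCNSSC comes out as 21 C, 37 H, 7 N, 10 O and 3 S.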

ADD REPLY • link 8.0 years ago by GenoMax 141k

0

https://www.google.co.kr/webhp?sourceid=chrome-instant&ion=1&espv=2&ie=UTF-8#q=CARd%3A+Carbon+distribution+analysis+program+for+protein+sequences

Please see this paper ("CARd: Carbon distribution analysis program for protein sequences"). The author has developed an algorithm for converting a protein sequence to the atomic level.

ADD REPLY • link 8.0 years ago by Naresh ▴ 60

score 2 · Answer 1 · 2016-04-25

2

8.0 years ago

wdiwdi ▴ 380

Yes. Newer (3.454 and later) versions of the Cactvs Cheminformatics toolkit (academic downloads at www.xemistry.com/academic) support the decoding of 1- and 3-letter amino acid codes into atomic connection tables (with stereochemistry).

Example command line, Python version (you'd probably script it instead of typing it), which writes two sample mini peptides as SD files:

cspy
pycactvs>e=Ens('aa:AlaGluAsn')
pycactvs>Molfile.Write('minipeptide1.sdf',e)
[ens0]
pycactvs>e=Ens('aa:AEN')
pycactvs>Molfile.Write('minipeptide2.sdf',e)
[ens1]
pycactvs>

ADD COMMENT • link 8.0 years ago by wdiwdi ▴ 380

0

Input (protein sequence): MCNSSCMGGMNRR

Desired output (atomic sequence): CCCCCSNOHHHHHHHHHCCCSNOHHHHHHHHCCCCNNOOHHHHHHCCCNOOOHHHHHHHCCCC

ADD REPLY • link 8.0 years ago by Naresh ▴ 60

1

Why not write a simple script? Pretty much all you need is an amino-acid-to-atoms translation table.
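A minimal sketch of such a script, under stated assumptions: `AA_ATOMS` and `atomic_sequence` are hypothetical names, the table holds the molecular formulas of the 20 standard amino acids as free, neutral molecules (not as residues inside a chain, which would each have one H2O less), and the element order within each amino acid is a free parameter.

```python
# Hypothetical translation table: (C, H, N, O, S) counts of the free,
# neutral standard amino acids, keyed by one-letter code.
AA_ATOMS = {
    "A": (3, 7, 1, 2, 0),   "R": (6, 14, 4, 2, 0),  "N": (4, 8, 2, 3, 0),
    "D": (4, 7, 1, 4, 0),   "C": (3, 7, 1, 2, 1),   "E": (5, 9, 1, 4, 0),
    "Q": (5, 10, 2, 3, 0),  "G": (2, 5, 1, 2, 0),   "H": (6, 9, 3, 2, 0),
    "I": (6, 13, 1, 2, 0),  "L": (6, 13, 1, 2, 0),  "K": (6, 14, 2, 2, 0),
    "M": (5, 11, 1, 2, 1),  "F": (9, 11, 1, 2, 0),  "P": (5, 9, 1, 2, 0),
    "S": (3, 7, 1, 3, 0),   "T": (4, 9, 1, 3, 0),   "W": (11, 12, 2, 2, 0),
    "Y": (9, 11, 1, 3, 0),  "V": (5, 11, 1, 2, 0),
}
ELEMENTS = "CHNOS"  # column order of the tuples above

def atomic_sequence(peptide, order="CHNOS"):
    """Spell out each amino acid as a run of element symbols, in the given order."""
    parts = []
    for aa in peptide.upper():
        counts = dict(zip(ELEMENTS, AA_ATOMS[aa]))  # KeyError on unknown codes
        parts.append("".join(el * counts[el] for el in order))
    return "".join(parts)

print(atomic_sequence("MCNSSCMGGMNRR"))
```

Passing a different `order`, for example `order="HCNOS"`, changes how the atoms of each amino acid are arranged without changing the counts.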

ADD REPLY • link 8.0 years ago by 5heikki 11k

0

This is confusing. What is the meaning of this?

ADD REPLY • link 8.0 years ago by wdiwdi ▴ 380

0

Take the protein sequence: M stands for methionine. At the atomic level, methionine is made of atoms such as C, N, O, S, and H. So what I need is software that, given a protein sequence, converts it to the atomic level (C, N, O, S, H).

I hope my question is clear now.

ADD REPLY • link 8.0 years ago by Naresh ▴ 60

1

I think we understand what your question is; I don't think anybody understands why.

ADD REPLY • link 8.0 years ago by Emily 23k

0

Here is an answer for the "why": C: Is there a way to translate amino acid sequences into atomic sequences?

ADD REPLY • link 8.0 years ago by GenoMax 141k

0

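eseq = ''
t = Table.Ref('elementdata')  # element data table, used to look up symbols
for a in 'MCNSSCMGGMNRR':  # one-letter amino acid codes
    with Ens('aa:' + a) as e:  # decode a single amino acid
        e.hadd()  # add hydrogens
        for i, count in enumerate(e.E_ELEMENT_COUNT):
            eseq += t.cellget(i, 'symbol') * count  # repeat the symbol 'count' times
print(eseq)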

Output: HHHHHHHHHHHCCCCCNOOSHHHHHHHCCCNOOSHHHHHHHHCCCCNNOOOHHHHHHHCCCNOOOHHHHHHHCCCNOOOHHHHHHHCCCNOOSHHHHHHHHHHHCCCCCNOOSHHHHHCCNOOHHHHHCCNOOHHHHHHHHHHHCCCCCNOOSHHHHHHHHCCCCNNOOOHHHHHHHHHHHHHHCCCCCCNNNNOOHHHHHHHHHHHHHHCCCCCCNNNNOO

If you need a particular element order, the sorting is left as an exercise to the reader (judging from your desired sample output, you are not using the standard Hill system).
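For reference, a minimal sketch of Hill-system ordering in plain Python, independent of the toolkit. The names `hill_order` and `hill_formula` are hypothetical. The rule assumed here is the usual one: if carbon is present, C comes first, then H, then all other elements alphabetically; without carbon, all elements (including H) are sorted alphabetically.

```python
def hill_order(counts):
    """Return the element symbols of a {symbol: count} mapping in Hill order."""
    symbols = [s for s, n in counts.items() if n > 0]
    if "C" in symbols:
        # Carbon first, hydrogen second, everything else alphabetically.
        rest = sorted(s for s in symbols if s not in ("C", "H"))
        return ["C"] + (["H"] if "H" in symbols else []) + rest
    # No carbon: purely alphabetical, hydrogen included.
    return sorted(symbols)

def hill_formula(counts):
    """Format a {symbol: count} mapping as a Hill-ordered formula string."""
    return "".join(
        f"{s}{counts[s] if counts[s] > 1 else ''}" for s in hill_order(counts)
    )

# Methionine as a free amino acid.
print(hill_formula({"S": 1, "O": 2, "N": 1, "H": 11, "C": 5}))
```

The same `hill_order` list can be used to arrange the per-residue runs of symbols in an atomic sequence instead of building a formula string.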

ADD REPLY • link 8.0 years ago by wdiwdi ▴ 380

score 1 · Answer 2 · 2016-04-26

1

8.0 years ago

Ibrahim Tanyalcin ★ 1.2k

Dear,

Try opening a pdb file with a text editor :).

Regards,

ADD COMMENT • link 8.0 years ago by Ibrahim Tanyalcin ★ 1.2k

score 0 · Answer 3 · 2016-04-25

Do you really want just an atomic sequence? Or are you asking for the molecular structure in 2D or 3D?

In either case, you should keep in mind that back-translation (amino acid sequence to nucleotide sequence) will be ambiguous given that multiple codons code for each amino acid. You can use a tool like Backtranseq to convert from amino acid sequence to nucleotide sequence and it will use the most common codons for a given organism: http://www.ebi.ac.uk/Tools/st/emboss_backtranseq/

If you do want the molecular structure, you can then give the nucleotide sequence to a free, open-source tool like 3DNA: http://x3dna.org/

There is also a web-based version if you just want to test it out: http://w3dna.rutgers.edu/rebuild. Just select a DNA type and enter the sequence.

Both the offline and online versions will generate a PDB that can be looked at either in a text editor, or visualized with software like VMD or PyMOL. The w3dna online version also offers an in-browser viewer called Jmol for visualization.