Secondary Structure Alignment Method
2
2
Entering edit mode
6.7 years ago
virpatel3 ▴ 20

I have many genes for many bacteriophage with precomputed secondary structures such as:

HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCHHHHHHHH

I want to align each gene's secondary structure with the SSE (secondary structure elements) of the other bacteriophages. So for instance, I may also have

HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCHHHHHHH

for another gene. I would like to be able to align these genes so that I can find the evolutionary relationships between the bacteriophages.

Does anyone have a suggestion for the algorithms I should use for this project?

secondary structure alignment • 2.5k views
1
Entering edit mode

I never faced the problem of aligning secondary structure so I'm probably overlooking a lot of complications, but by the way the question is asked the first and easiest thing that comes to my mind is to align one set of genes vs the other with BLAST.

1
Entering edit mode

Blast is good but him ask for the alignment of the secondary structures and i think that blast is not good in this case. yes is true that the blast is a good on line tools for alignment but the problem is that blast compute the alignment on the basis of a database searching the possible releated sequences.

in your case virpatel3 i'm suggest to create an algorithm of alignment.

sure i never heard about the alignment of SS elements.

0
Entering edit mode

You could use Biopython's alignment module pairwise2 with a custom scoring function. Documentation

1
Entering edit mode
6.7 years ago
a.polo88 ▴ 110

ok i tried on the website clustalW

it seems to work

http://www.ebi.ac.uk/Tools/services/web/toolresult.ebi?jobId=clustalw2-I20150222-140308-0817-61234194-pg

try it!

if you will use it remember that fro each of sequences the format is

> name of protein or what do you want

sequence

example :

>  name

HCCCHHHC

>name1

HCCCHHHHHHHC

http://www.ebi.ac.uk/Tools/services/web/toolresult.ebi?jobId=clustalw2-I20150222-140838-0639-99240772-oy

0
Entering edit mode
2.2 years ago
The ▴ 160

If I correctly remember , there was a paper in PNAS by George D Rose in late 90s which dealt with this. The goal was to find out structurally similar sequences based on predicted secondary structure. So far I remember they employed a simple scoring system (match/mismatch binary type):

Edit: Here's the paper: "Seeking an ancient enzyme in Methanococcus jannaschii using orf, a program based on predicted secondary structure comparisons"

https://www.pnas.org/content/95/6/2818