Question: Automated sliding window analysis of % identity on a Clustal Omega DNA alignment
3.7 years ago, c.merrick wrote:

I would like to align ~15 DNA sequences of 1-2 kb, then assess the degree of homology across the alignment using a sliding window. It would be very laborious to do this manually, even using quite large jumps between windows. Is there a facility to do such analysis automatically with the Clustal Omega tool (or another tool)? Many thanks, Catherine Merrick


Sorry, just a pet peeve, but "degree of homology" makes no sense. Sequences are either homologous or they are not, i.e. either they share a common ancestor or they do not! Check here for a way to get you started.

Reply written 3.7 years ago by Whetting

Thank you. (I take the point re semantics, but it doesn't stop biologists in common parlance saying things like 'highly homologous'!) I think I now have various ways of getting the measures I need (entropy scores), including this and this ... so I need to bin them appropriately across the alignment.
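
The binning step can be sketched roughly as follows. This is only a minimal sketch, assuming the alignment is already available as a list of equal-length strings, that Shannon entropy per column (in bits, gaps ignored) is the measure wanted, and that bins are consecutive and non-overlapping. The names column_entropy, binned_entropy and bin_size are made up for illustration and do not come from any of the tools mentioned.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (bits) of one alignment column, ignoring gap characters."""
    residues = [c for c in column.upper() if c != "-"]
    if not residues:
        return float("nan")  # all-gap column: entropy is undefined
    total = len(residues)
    counts = Counter(residues)
    # sum of p * log2(1/p) over the observed residues
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def binned_entropy(seqs, bin_size):
    """Mean column entropy for consecutive, non-overlapping bins of bin_size columns."""
    length = len(seqs[0])
    per_column = [column_entropy("".join(s[i] for s in seqs)) for i in range(length)]
    bins = []
    for start in range(0, length, bin_size):
        # skip all-gap columns when averaging
        values = [v for v in per_column[start:start + bin_size] if not math.isnan(v)]
        mean = sum(values) / len(values) if values else float("nan")
        bins.append((start, mean))
    return bins


# Toy alignment, purely illustrative
toy = ["ACGTACGT", "ACGTACGA", "ACGAAC-T"]
for start, h in binned_entropy(toy, bin_size=4):
    print(start, round(h, 3))
```

Lower values mean more conserved bins; a bin of identical columns scores 0.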

Reply written 3.7 years ago by c.merrick
3.7 years ago, Whetting (Bethesda, MD) wrote:

Too many biologists do indeed say % homology, but that does not mean you should. It is wrong, not just a semantics issue...

Not tested and written on my tiny phone screen, but assuming you can get Biopython running, something like this should generate the input files for your favorite program.
It should write out a series of sub-alignments, each window columns long, with a new one starting every step columns (so consecutive windows overlap whenever step is smaller than window).

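from Bio import AlignIO

# Example values only; choose a window and step that suit your alignment
window = 100
step = 50

# Read the whole alignment (FASTA format, e.g. exported from Clustal Omega)
alignment = AlignIO.read("alignment_in_fasta_format.fas", "fasta")
for r in range(0, alignment.get_alignment_length(), step):
    # Write columns r to r+window of every sequence to a separate file
    with open("window_%d.fas" % r, "w") as out:
        AlignIO.write(alignment[:, r:r + window], out, "fasta")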
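
If all you need is a % identity value per window rather than a separate file per window, the same windowing idea can be applied directly to the aligned sequences. The following is a rough sketch, not a tested pipeline: it assumes the alignment is a list of equal-length strings, defines identity as the mean over all sequence pairs of matching columns divided by columns where neither sequence has a gap, and uses made-up names (pairwise_identity, sliding_identity). Other definitions of % identity treat gaps differently, so check which one you want.

```python
from itertools import combinations


def pairwise_identity(a, b):
    """Percent identity of two aligned strings, skipping columns with a gap in either."""
    compared = matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else float("nan")


def sliding_identity(seqs, window, step):
    """Return (start, mean pairwise % identity) for each full-length window."""
    length = len(seqs[0])
    results = []
    for start in range(0, length - window + 1, step):
        chunks = [s[start:start + window] for s in seqs]
        scores = [pairwise_identity(a, b) for a, b in combinations(chunks, 2)]
        scores = [s for s in scores if s == s]  # drop NaN (pairs with no comparable columns)
        mean = sum(scores) / len(scores) if scores else float("nan")
        results.append((start, mean))
    return results


# Toy alignment, purely illustrative
toy = ["ACGTACGT", "ACGTACGA", "ACGAAC-T"]
for start, pct in sliding_identity(toy, window=4, step=2):
    print(start, round(pct, 1))
```

With a Biopython alignment object, the input list could be built as [str(rec.seq) for rec in alignment].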