Question: Cutting Off The First Domain From All The Proteins In An Alignment
gravatar for khorms
5.7 years ago by
khorms30 wrote:

I have a multiple alignment of some proteins from the same family. All of them have 2 functional domains. Both of them are presented in Pfam database. There are no overlaps of these two domains in my alignment. I want cut the first domain off from all the proteins in the alignment. But I don't know how conservative is the boundary of first domain is. I don't want to watch each sequence individually and cut off the first domain by hand. Does there exist some way to do it automatically?

alignment • 1.5k views
ADD COMMENTlink modified 5.7 years ago by Niek De Klein2.5k • written 5.7 years ago by khorms30

Few lines of python code will do the job.

ADD REPLYlink written 5.7 years ago by Pappu1.9k
gravatar for Niek De Klein
5.7 years ago by
Niek De Klein2.5k
Niek De Klein2.5k wrote:

Not an automatic way, but faster than doing them all individually:

If you use program Clustal X (and probably also the newer version Clustal Omega, but when I worked with sequence alignments this wasn't out yet so don't know if the GUI is the same), after aligning the sequences you can select the columns that contain the alignment of the first domain, and you can delete these columns (either right-click delete/remove or something similar, or there is an option in the upper menu). If you know that the alignment of the first domain is correct, you can remove the first domain from all sequences with one click.

This is probably also possible in other alignment programs, but I only know about Clustal X for sure.

ADD COMMENTlink written 5.7 years ago by Niek De Klein2.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1314 users visited in the last hour