Is always correct to include an outgroup in phylogenetic trees?
0
0
Entering edit mode
9.2 years ago
dago ★ 2.8k

I am currently working with several hypothetical bacterial proteins.

One approach I applied to characterize them is comparing them with similar sequences. I retrieved similar proteins based on blastp search and on domain architecture comparison.

My aim is to infer a putative function for my unknown proteins based on the similarity with known ones. I thought to align all sequences, defined conserved region and perform a phylogenetic reconstruction.

In case my unknown sequences are clustering together with known ones I can speculate that they might have similar function.

Now my question is, would it make sense to include an outgroup in the tree?

Including an outgroup would mean decrese the quality of the alignment, since the outgroup seqs will create many gaps that will decrease the number of conserved regions to keep for building the tree.

What you opinion about that?

protein alignment phylogeny • 3.2k views
ADD COMMENT
0
Entering edit mode

IMO you could just midpoint root but what's the point of making the trees to begin with? Why not just cluster with some clustering algorithm and be done with it?

ADD REPLY
0
Entering edit mode

You mean grouping the protein using something like blastclust? Wouldn't less robust than a tree? the clustering is based on alignment similarity and I think you might introduce artefact by choosing the similarity cutoff. Am I missing something?

ADD REPLY
0
Entering edit mode

Well, first of all, since it's bacterial proteins, you should probably go for global alignment instead of local alignment for clustering (so e.g. cd-hit). Second, you could do the clustering at e.g. 80-99% similarity and see how the clusters react. Just ideas..

ADD REPLY
0
Entering edit mode

I could indeed give it a try. I cannot find the standalone version of CD-HIT, you know a working link? GOT IT. Sorry

ADD REPLY

Login before adding your answer.

Traffic: 2302 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6