Question: Core genome Phylogeny
gravatar for memshez
6.3 years ago by
memshez0 wrote:

Hi All,

I am trying to do a core genome phylogeny of 8 different species of a bacteria. I performed blastclust to make the clusters of homologous proteins. one of the clusters contains two copies of same protein from one species. I want to ask if I should keep one copy in the cluster and make allignment or should I discard that cluster completely from the allignment.

And I would like to ask one more thing, is there some length threshold for the protein sequences to make allignment or we can take the shorter sequences like 50 residues also.

Please help!!

sequence alignment • 2.2k views
ADD COMMENTlink modified 6.3 years ago • written 6.3 years ago by memshez0

Thanks 5heikki, both copies are exactly similar so then I think I can use any one of them. 

ADD REPLYlink written 6.3 years ago by memshez0
gravatar for 5heikki
6.3 years ago by
5heikki9.0k wrote:

In my experience, doing ~50 ribosomal protein bacterial phylogenies doesn't change much (at all really), no matter what copy you end up using. I personally go by default to the copies that give the best match to a given ribosomal protein hmm, since they're the least likely to be pseudo genes.

ADD COMMENTlink written 6.3 years ago by 5heikki9.0k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1726 users visited in the last hour