Question: Removal Of Sequences With >33% Gap From The Alignment Before Tree Construction
6.1 years ago, Pappu (1.9k) wrote:

I read about removing sequences with >33% gaps from the alignment after alignment trimming, before phylogenetic tree construction. I am wondering if this makes sense, or whether one should realign the sequences instead. Is there a paper that discusses the effect of gaps in the alignment, or of the choice of alignment program, on maximum likelihood phylogenetic tree construction?

Modified 6.1 years ago by DG (7.1k) • written 6.1 years ago by Pappu (1.9k)
6.1 years ago, DG (7.1k) wrote:

It really depends on what you are doing. Is this a single-gene or a multi-gene/phylogenomic analysis? Those incomplete or heavily gapped sequences may be of interest, in which case you don't want to remove them. Are these amino acid or nucleotide alignments? What model of evolution? What phylogenetic program? Some programs handle gaps differently than others, for instance. People may have started using a 33% rule of thumb, but only because it means the sequence is "missing" or lacking information at 1/3 of the positions in your alignment. If that is because it is a partial sequence, that is one thing, but in large datasets such sequences often represent partial pseudogenes or paralogous sequences, which is why they are often removed.

There are no hard and fast cutoffs in phylogenetics really, because a lot of factors go into setting up your experiment.
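
To make the rule of thumb concrete: the 33% figure is just the fraction of alignment columns at which a given sequence has a gap. Below is a minimal sketch of that calculation, not a recommended pipeline. The function names (`gap_fraction`, `flag_gappy`), the toy sequences, and the choice of `-` as the only gap character are all assumptions made for illustration; the threshold is a parameter precisely because no cutoff is universal.

```python
def gap_fraction(aligned_seq, gap_chars="-"):
    """Fraction of alignment columns at which this sequence has a gap.

    gap_chars is an assumption: some alignments also use '.' or '?'.
    """
    if not aligned_seq:
        return 0.0
    return sum(ch in gap_chars for ch in aligned_seq) / len(aligned_seq)


def flag_gappy(alignment, threshold=0.33):
    """Return names of sequences whose gap fraction exceeds the threshold."""
    return [name for name, seq in alignment.items()
            if gap_fraction(seq) > threshold]


# Toy aligned sequences (hypothetical data, 12 columns each).
toy_alignment = {
    "seqA": "ATGCATGCATGC",  # no gaps
    "seqB": "ATG---GCATGC",  # 3 of 12 columns are gaps
    "seqC": "ATG------TGC",  # 6 of 12 columns are gaps
}

flagged = flag_gappy(toy_alignment)
print(flagged)
```

Flagged sequences are candidates for inspection (partial sequence, pseudogene, paralog?) rather than automatic removal; whether to drop them depends on the factors listed above.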
