Removal Of Sequences With >33% Gap From The Alignment Before Tree Construction
1
0
Entering edit mode
10.2 years ago
Pappu ★ 2.1k

I read about removing sequences with >33% gap from the alignment after alignment trimming, before phylogenetic tree construction. I am wondering it this makes sense or one should realign the sequences. Is there is paper which discusses the effect of gaps in the alignment/alignment programs for maximum likelihood phylogentic tree construction?

• 3.4k views
ADD COMMENT
1
Entering edit mode
10.2 years ago
DG 7.3k

It really depends a lot on what you are doing. Is this single gene or multi-gene/phylogenomic analyses? Those incomplete or heavyily gapped sequences may be of interest so you don't want to remove them. Amino acid or nucleotide alignments? What model of evolution? What phylogenetic program? Some programs handle gaps differently than others for instance. People may have started using a 33% rule of thumb, but only because that means the sequence is "missing" or lacking information at 1/3 of positions in your alignment. If it is because it is a partial sequence that is one thing, but in large datasets they often represent partial pseudogene or paralagous sequences which is why they are often removed.

There are no hard and fast cutoffs in phylogenetics really, because there are a lot of factors that go in to setting up your experiment.

ADD COMMENT

Login before adding your answer.

Traffic: 1611 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6