Entering edit mode
5.7 years ago
Elizabeth
▴
30
Hello everyone. I recently ran Alienness to scan for HGT candidates in our eukaryotic protein sequence set (around 11500 proteins). The results section has a file titled 'possible contamination' and there are about 5000 proteins in it. This doesn't make sense. We ran BUSCO on our genome and predicted protein set and CEGMA on the genome set and the numbers look pretty good. I am not sure what to make of this result. Please advise.
Alienness, HGT, BUSCO, CEGMA .... Unless I'm wrong, these are not 'common' acronyms. Please edit your post to add the revelant links...
I apologize.
HGT : Horizontal Gene Transfer
Alienness http://alienness.sophia.inra.fr/cgi/index.cgi CEGMA http://korflab.ucdavis.edu/datasets/cegma/ BUSCO https://busco.ezlab.org/