I'm trying to analize undescribed genes by their functional categories (eggnog) and it would be very helpful to receive suggestions regarding how to deal with these "chimeric" codes such as 'IQ', 'ET', and 'EGIPQ'. I've read that researchers often keep just one ID per gene, but I aim to keep as much info as possible.
Thanks in advance.
Thanks for answering so fast, it was very helpful. I shouldn't pay much attention to if COGs are separated by a comma or not then. As long as they are assigned to a gene, for basic enrichment analyses it's irrelevant if they were detected in the same or different domains. Have a great day!