5.6 years ago
isu2017
Hi everyone,
I recently ran the BRAKER pipeline on a previously unannotated organism. It has just finished and produced the final GTF file from AUGUSTUS. However, the total number of genes in the file is much larger than we expected (close to 100,000 were predicted). Additionally, each gene is labeled "gene1", "gene2", etc. So, my questions are:
1.) How can I reduce the redundancy in the set of predicted genes?
2.) How can I annotate the genes and retrieve gene names?
Thanks!