Entering edit mode
                    3.6 years ago
        O.rka
        
    
        ▴
    
    750
    I'm trying to understand the logic of converting the complex MetaEuk identifiers to the more simple identifiers adopted by BUSCO so I can reproduce a manual step for gene calls prior to BUSCO. However, what I thought was the case isn't actually correct. I thought you just get the high and low values.
>Transcript_22646|m.42971|NODE_14273_length_2360_cov_2.257701|+|428|5.129e-119|1|98|742|98[98]:742[742]:645[645] 
VPVFIMPLFNKYEPLDDGTLKTRIYELAASLKYPLTKLFVMDGSKR...
>Transcript_22646|m.42972|NODE_14273_length_2360_cov_2.257701|-|290|1.787e-77|2|98|742|742[742]:356[356]:387[387]|283[283]:98[98]:186[186]
HQGRMTVMIKRIILVRVHGAQVFQMNLAQTTLQLFGPTQINRKAIG...
Both of these (with my incorrect logic) would return the following: 
NODE_14273_length_2360_cov_2.257701_98:742