I was trying to find true orthologs for a set of sequences using OrthoMCL program. I made it upto step 8- orthoMCLBlastParser. I provided my blast output in -m8 format. When I ran the orthoMCLBlastParser it asks for the taxonID of the subject sequences. I modified my blast output file by providing an id to subject sequences like 'xxx|YYYYYY'. But still getting the same error.
Can someone help me for this.
I am just copying a few lines from my blast output file and error given by the orthoMCLBlastParser.
Fields: query id, subject id, % identity, alignment length, mismatches, gap opens, q. start, q. end, s. start, s. end, evalue, bit score
100 hits found
ppp|scf8123 xxx|Pd3R61150.1|PACid:197844 56.00 800 239 15 9 804 13 703 0.0 824
ppp|scf8123 xxx|Pd3R61150.1|PACid:197366 95.88 826 32 2 1 824 1 826 0.0 1557
acquiring genes from ppp.fasta couldn't find taxon for gene 'xxx|Pd3R61150.1|PACid:197844' at /Downloads/orthomclSoftware-v2.0.2/bin/orthomclBlastParser line 103, line 1.
Please note that I removed the first 5 lines of the output file otherwise it gives me the error: couldn't find taxon for gene 'BLASTP' at /Downloads/orthomclSoftware-v2.0.2/bin/orthomclBlastParser line 103, line 1.