First and foremost is there a premade tool capiable of doing this? I have searched but not been able to find anything.
We would like to combine this with using the same protein sequences from Uniprot and create a DIAMOND database so use this tool for the same purpose of annotation nitrogen fixing genes. We would then cross reference the two tools and take hits that appear in both.
If this methodology valid and is uniprot a good place to go in order to scrape the protein sequences for a database?