I have 4,54,871 sequences and I want to blast them against protein database. I need top 5 results of each blast and wanted to store those results in a single file.
I have downloaded the BLAST+ and trying to do with standalone blast.
I wonder if this is the right way or some other methods are there?
Does anyone have the same experience? Any suggestions?
It would be nice if someone can share the R script for this. I am absolutely new in this field and have to complete this assignment.
Thanks in advance
Try to see if you can cluster the sequences - if lots of them are similar, it may be helpful