I have a peptide file that looks like this:
cluster_9694 MEGVNARNVVLNKGPQQMSSSLKDRLKRCGRYYSSPTTPKPSIGFLPTPDSSAASSQSLNLDCPTPTCDNGPVSETPPSTILTPSSSDKHGDSTFDDSSSLSCSTPVLKVRDSQGSHPRVKRLRLSSQACFKVQFPPESNSGRGENHPKQQVLNSSLLEDSISEISTCDDKTEADVNSPEKVQPTITPGMCSQSSDHVRNHESQEEHCTDVVPESMSCEELVAKKLQLQKRLKENRECLRKLKMVKLYRSKNDLSSLDDLILKWRGVSQQVLLDLYQNLPQPKPNLSGFLQHLNLDMEFLKFNLEEESFN*
And the output of blastp that looks like this:
cluster_9694 SFR1_HUMAN 41.176 102 58 1 210 309 141 242 5.64e-14 73.6
cluster_14420 SYMC_XENLA 50.676 961 408 13 4 956 2 904 0.0 935
Whats the best way to merge the annotation result with the fasta file so I have it looking like this
cluster_9694 SFR1_HUMAN MEGVNARNVVLNKGPQQMSSSLKDRLKRCGRYYSSPTTPKPSIGFLPTPDSSAASSQSLNLDCPTPTCDNGPVSETPPSTILTPSSSDKHGDSTFDDSSSLSCSTPVLKVRDSQGSHPRVKRLRLSSQACFKVQFPPESNSGRGENHPKQQVLNSSLLEDSISEISTCDDKTEADVNSPEKVQPTITPGMCSQSSDHVRNHESQEEHCTDVVPESMSCEELVAKKLQLQKRLKENRECLRKLKMVKLYRSKNDLSSLDDLILKWRGVSQQVLLDLYQNLPQPKPNLSGFLQHLNLDMEFLKFNLEEESFN*
Do I sort first and then paste? If I sort will the IDs and sequences in fasta file get shuffled? Do I use awk? Please advice.