my question is very simple. I have reads from a human RNASeq and I would like to check for contamination on a specific virus family. So my pipeline looks like
1) Align on the human genome (tophat2) 2) Keep only unaligned reads 3) Align on the virus genomes 4) Keep only aligned reads 5) BLAST
However, I would like to know the species name for each of my BLAST hits. So, now I'm launching it like this: blastx -query $query -db nr -out blast.txt -num_threads 4 -outfmt 6 -evalue 10e-3 -word_size 2
But the output contains only the GI name. Is there a way to get the species name from a simple BLAST run? Or an alternative way to launch BLAST to obtain this information in a text format?
Thanks a lot :-)