my question is very simple. I have reads from a human RNASeq and I would like to check for contamination on a specific virus family. So my pipeline looks like
1) Align on the human genome (tophat2) 2) Keep only unaligned reads 3) Align on the virus genomes 4) Keep only aligned reads 5) BLAST
However, I would like to know the species name for each of my BLAST hits. So, now I'm launching it like this: blastx -query $query -db nr -out blast.txt -num_threads 4 -outfmt 6 -evalue 10e-3 -word_size 2
But the output contains only the GI name. Is there a way to get the species name from a simple BLAST run? Or an alternative way to launch BLAST to obtain this information in a text format?
Thanks a lot :-)
see How to make a BLAST database with taxonids from NCBI query. ; Taxonomy of blast hits
I would say this is a duplicate of Scientific names in blast output and databases. Please take a look, I asked and responded myself this same question.