Dear BioStars, Hi.
I have run blastx against NcBI nr using Diamond with tabular output and sensitive option
the result is as bellow:
TRINITY_DN212758_c0_g1_i1.......XP_002531646.1.....81.3 107 20 0 3 323 199 305 2.9e-41 176.4 TRINITY_DN212728_c0_g1_i1.....XP_014502021.1....89.2 37 4 0 3 113 403 439 8.6e-10 71.2 TRINITY_DN212793_c0_g1_i1.....XP_015200040.1.....91.8 61 5 0 665 483 238 298 9.7e-23 115.9
But I need something like this in the second row :
Q: (1) Is there any converter tools for this task or if not, (2) which option I must add to my Diamond script ?
NOTE1: maybe it is the SwissProt IDs ?
NOTE2: my Diamond script:
diamond blastx -d nr -q Trinity_FM.fasta -o blastX-all-sesnitive.outfmt6 -f 6 -p 22 --evalue 0.000001 -k 1 --sensitive
NOTE3: these are the Diamond software tabular options
Value 6 may be followed by a space-separated list of these keywords:
qseqid means Query Seq - id qlen means Query sequence length sseqid means Subject Seq - id sallseqid means All subject Seq - id(s), separated by a ';' slen means Subject sequence length qstart means Start of alignment in query qend means End of alignment in query sstart means Start of alignment in subject send means End of alignment in subject qseq means Aligned part of query sequence sseq means Aligned part of subject sequence evalue means Expect value bitscore means Bit score score means Raw score length means Alignment length pident means Percentage of identical matches nident means Number of identical matches mismatch means Number of mismatches positive means Number of positive - scoring matches gapopen means Number of gap openings gaps means Total number of gaps ppos means Percentage of positive - scoring matches qframe means Query frame stitle means Subject Title salltitles means All Subject Title(s), separated by a '<>' qcovhsp means Query Coverage Per HSP