Hi,
I'm trying to search an NCBI conserved domain against a large genome.
I downloaded NCBI's CDD PSSM files and indexed the genome both as a nucl dbtype and as a prot dbtype.
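For reference, the indexing step can be sketched as below. This only builds the `makeblastdb` command lines and prints them rather than running anything. The FASTA file name `genome.fa` and the database names `genome_nucl` and `genome_prot` are placeholders, not the actual names used here.

```python
import shlex


def makeblastdb_cmd(fasta, dbtype, out):
    """Build the argument list for indexing a FASTA file with makeblastdb."""
    if dbtype not in ("nucl", "prot"):
        raise ValueError("dbtype must be 'nucl' or 'prot'")
    return ["makeblastdb", "-in", fasta, "-dbtype", dbtype, "-out", out]


if __name__ == "__main__":
    # Placeholder names: substitute the real FASTA file and database names.
    for dbtype in ("nucl", "prot"):
        cmd = makeblastdb_cmd("genome.fa", dbtype, f"genome_{dbtype}")
        print(shlex.join(cmd))
```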
Now I'm trying to run psi-blast from the command line with one of the PSSM files (CHL00001.smp) against my indexed genome and I'm getting these warnings:
FastaReader: Hyphens are invalid and will be ignored around line 16147
FASTA-Reader: Ignoring invalid residues at position(s): On line 16147: 1, 3-18, 20-22, 25-26, 28-29
FASTA-Reader: Ignoring invalid residues at position(s): On line 16148: 1, 3-4, 6-8, 10, 12-13
The same thing happens if I use deltablast, blastp, or tblastn.
I'm assuming the PSSM file is not in a format that BLAST accepts (though that seems odd, since this PSSM file comes from NCBI).
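One way to test that assumption is to look at how the file starts. The sketch below is a rough heuristic, not a validator: it assumes that FASTA input begins with a `>` header line and that an ASN.1 text file begins with a line containing `::=`. The function names are made up for illustration.

```python
def classify_first_line(line):
    """Guess a file format from its first non-blank line (heuristic only)."""
    if line.startswith(">"):
        return "fasta"
    if "::=" in line:
        return "asn1-text"
    return "unknown"


def sniff_format(path):
    """Classify a file by its first non-blank line."""
    with open(path, "r", errors="replace") as handle:
        for line in handle:
            stripped = line.strip()
            if stripped:
                return classify_first_line(stripped)
    return "unknown"  # empty file or only blank lines
```

Calling `sniff_format("CHL00001.smp")` would show whether the file looks like FASTA at all. If it does not, a FASTA reader would be expected to complain about its contents.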
Any ideas?