Hello! I downloaded 3 reference fasta files(IGHV, IGHD, IGHJ) on IMGT.
COMMAND like: makeblastdb –in IGHV.fasta –dbtype nucl –out IGHVdb –parse_seqids
It runs successfully when i use IGHD.fasta.
IGHD.fasta:
>J00256|IGHJ4*01|Homo sapiens|F|J-REGION|1912..1959|48 nt|3| | | | |48+0=48| | |
actactttgactactggggccaaggaaccctggtcaccgtctcctcag
>X86355|IGHJ4*02|Homo sapiens|F|J-REGION|1480..1527|48 nt|3| | | | |48+0=48| | |
actactttgactactggggccagggaaccctggtcaccgtctcctcag
>M25625|IGHJ4*03|Homo sapiens|F|J-REGION|446..493|48 nt|3| | | | |48+0=48| | |
gctactttgactactggggccaagggaccctggtcaccgtctcctcag
But when i use IGHJ.fasta, it shows: BLAST options error: IGHJ.fasta does not match input format type, default input type is FASTA.
IGHJ.fasta:
>X97051|IGHD1-1*01|Homo sapiens|F|D-REGION|33714..33730|17 nt|1| | | | |17+0=17| | |
ggtacaactggaacgac
>X13972|IGHD1-14*01|Homo sapiens|ORF|D-REGION|14518..14534|17 nt|1| | | | |17+0=17| | |
ggtataaccggaaccac
>X97051|IGHD1-20*01|Homo sapiens|F|D-REGION|62015..62031|17 nt|1| | | | |17+0=17| | |
ggtataactggaacgac
I compared IGHJ.fasta with IGHD.fasta. I didn't find obvious format difference.
I searched the fasta format online and had a try. I kept the sequence but changed the name.
So IGHJ.fasta became:
>seq1
ggtacaactggaacgac
>seq2
ggtataaccggaaccac
>seq3
ggtataactggaacgac
And it works!
So i think there must be some mistake in the sequence name, i don't know how to figure it.
Thanks for your patience!
Using only the first sequence entry seems already enough to reproduce the problem.