Hello all,
I have a gene sequence(approximately 1300 bp), the presence of which I want to check in a large number of genomes. (strains of the same bacteria, in most of which I'm sure it has to be present). What is the best way to do it?
Should I try doing multiple alignment?
I tried using seqkit: seqkit grep -c -s -f gene.fasta multiple_genes.fasta -C
, but I'm not sure this is the way, since even with possible mismatches = 50 I got only a quarter of genomes, which can hardly be true.
thanks, I will try doing it!