Entering edit mode
9 hours ago
saamhasan55
▴
30
I generated a genome assembly and wanted to check for contaminant sequences using blobtools. I carried out a genome alignment between the assembly and fastq reads using bwa-mem, used to diamond blastx to search my sequences against the blast database and then used blobtools. But blobtools finds no taxonomic hits, I only get undefined or no-hits. The diamond output has taxonomic IDs and when I search some sample predicted proteins from my assembly they map. Why is blobtools not finding any taxonomic hits?
Why do you think you have contaminant sequences in your assembly? Generally when libraries are made from an organismm there should be no extraneous sequence present.