Question: BLAST pipeline operations
gravatar for skbrimer
3.8 years ago by
United States
skbrimer620 wrote:

So, I have been reading a lot about host removal with viral data. More than one group is just using BLAST to their specific host to remove possible contamination.

How does that work? I know how to download a genome from NCBI. I have ncbi tools on my machines and can create a custom db using only the my host(s) of choice. However I'm fuzzy on how that actually removes them from the read pool.

Can BLAST take all of your reads and only output the reads that have no match or is it more of piping the results to a file and removing everything that has a good match via a script?

I understand BLAST is a slower way of doing this so what would be the advantage of this say over BBsplit (from the BBMap) that can map to multiple references at once? Or just concatenating all the host/viral dna/rna into one file and mapping to it?

qc assembly • 877 views
ADD COMMENTlink written 3.8 years ago by skbrimer620
gravatar for Prasad
3.8 years ago by
Prasad1.6k wrote:

if you are looking for contaminated read removal from the data, try DeconSeq

ADD COMMENTlink written 3.8 years ago by Prasad1.6k

Thanks, that looks interesting :)

ADD REPLYlink written 3.8 years ago by skbrimer620
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 732 users visited in the last hour