Question: BLAST pipeline operations
10 months ago by
United States
skbrimer350 wrote:

So, I have been reading a lot about host removal with viral data. More than one group simply uses BLAST against their specific host genome to remove possible contamination.

How does that work? I know how to download a genome from NCBI. I have the NCBI tools on my machines and can create a custom database using only my host(s) of choice. However, I'm fuzzy on how that actually removes the host reads from the read pool.

Can BLAST take all of your reads and output only the reads that have no match, or is it more a matter of piping the results to a file and removing everything that has a good match via a script?

I understand BLAST is a slower way of doing this, so what would be its advantage over, say, BBSplit (from the BBMap package), which can map to multiple references at once? Or over just concatenating all the host/viral DNA/RNA into one file and mapping to it?

qc assembly • 270 views
10 months ago by
Prasad1.5k wrote:

If you are looking to remove contaminating reads from the data, try DeconSeq.
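Separately from DeconSeq, the script-based route described in the question (write the BLAST results to a file, then drop every read with a good host match) could look roughly like the sketch below. It assumes the reads are in FASTA format and that BLAST was run with tabular output (`-outfmt 6`, default 12 columns) against a host-only database. The function names and the identity, length, and e-value cutoffs are illustrative assumptions, not recommended values.

```python
# Assumed upstream commands (file and database names are hypothetical):
#   makeblastdb -in host.fa -dbtype nucl -out host_db
#   blastn -query reads.fasta -db host_db -outfmt 6 -out hits.tsv
from typing import Iterable, Iterator, Set, Tuple


def host_read_ids(blast_lines: Iterable[str],
                  min_identity: float = 90.0,   # assumed cutoff (% identity)
                  min_length: int = 50,         # assumed cutoff (alignment length)
                  max_evalue: float = 1e-5) -> Set[str]:
    """Collect IDs of reads with at least one 'good' hit to the host."""
    ids: Set[str] = set()
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 12:
            continue  # not a default outfmt 6 row
        # Columns: qseqid sseqid pident length ... evalue bitscore
        pident = float(fields[2])
        length = int(fields[3])
        evalue = float(fields[10])
        if pident >= min_identity and length >= min_length and evalue <= max_evalue:
            ids.add(fields[0])
    return ids


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    seq = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(seq)
            header, seq = line[1:], []
        elif header is not None:
            seq.append(line.strip())
    if header is not None:
        yield header, "".join(seq)


def remove_host_reads(fasta_lines: Iterable[str],
                      host_ids: Set[str]) -> Iterator[Tuple[str, str]]:
    """Yield only the reads whose ID is not in the host hit set."""
    for header, seq in parse_fasta(fasta_lines):
        # BLAST reports the query ID up to the first whitespace in the header.
        read_id = header.split(None, 1)[0] if header.strip() else ""
        if read_id not in host_ids:
            yield header, seq
```

In practice one would open `hits.tsv` and `reads.fasta`, pass the file handles to these functions, and write the surviving records to a new FASTA file. BLAST itself only reports hits, so the filtering step has to happen in a script like this or in a dedicated tool.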


Thanks, that looks interesting :)

written 10 months ago by skbrimer