Question: BLAST pipeline operations
gravatar for skbrimer
6 months ago by
United States
skbrimer330 wrote:

So, I have been reading a lot about host removal with viral data. More than one group is just using BLAST to their specific host to remove possible contamination.

How does that work? I know how to download a genome from NCBI. I have ncbi tools on my machines and can create a custom db using only the my host(s) of choice. However I'm fuzzy on how that actually removes them from the read pool.

Can BLAST take all of your reads and only output the reads that have no match or is it more of piping the results to a file and removing everything that has a good match via a script?

I understand BLAST is a slower way of doing this so what would be the advantage of this say over BBsplit (from the BBMap) that can map to multiple references at once? Or just concatenating all the host/viral dna/rna into one file and mapping to it?

qc assembly • 200 views
ADD COMMENTlink written 6 months ago by skbrimer330
gravatar for Prasad
6 months ago by
Prasad1.4k wrote:

if you are looking for contaminated read removal from the data, try DeconSeq

ADD COMMENTlink written 6 months ago by Prasad1.4k

Thanks, that looks interesting :)

ADD REPLYlink written 6 months ago by skbrimer330
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1587 users visited in the last hour