Hi
I have paired-end reads from human DNA samples, where I am trying to determine the metagenomic viral profile for each sample. I also have negative controls which were run through the same protocol as the human DNA samples (processed, library prep, sequenced, trimmed for adapters/barcodes). The next step would be to remove any reads found in my negative controls from my human samples. Does anyone know what the best approach/tool for this would be?
Thanks! Any help would be greatly appreciated!
Hi, I'm at this step too. There are mixed views about the utility of removing negative control reads. So I'd like to do both, i.e. perform my analysis once without removing negative control reads and once with removing negative control reads, and then compare and discuss results.
Logically, in mothur, one should be able to generate an accnos file with all the sequences to be removed and then use the remove.seqs command to accomplish just that. However, apparently this doesn't work after preclustering... the exact reason why was never given on the mothur forum. A solution was suggested using the sed command in bash, which is what I'm trying to learn how to do now.
Does anybody out there have a sed command solution for this problem? Any pointers or example code would be greatly appreciated.