Question: How to Align to multiple reference genomes -> Discard multiply mapped reads?
gravatar for nattzy94
12 months ago by
nattzy9420 wrote:


I have reads containing E. coli, K. pneumoniae and GAPDH spike-in. I would like to align these reads to the 3 genomes and then discard reads that map to more than one of the references.

So far, I have concatenated all 3 fasta files into 1 composite genome. I have then used

bwa mem -c 1 <composite_genome.fasta> <Sample_x_R1> <Sample_x_R2>.

Can I check if this would be the correct way to do this or have I gone wrong somewhere?

cmd bwa alignment • 388 views
ADD COMMENTlink modified 12 months ago by h.mon29k • written 12 months ago by nattzy9420
gravatar for h.mon
12 months ago by
h.mon29k wrote:

BBSplit is a tool specifically designed with your goal in mind. See the tool announcement and the online documentation.

ADD COMMENTlink written 12 months ago by h.mon29k

Hey h.mon,

Thanks for the suggestion. I have used BBsplit to generate the fastq files that mapped to the corresponding genomes. Just to check, in order to get the number of mapped reads, do I just convert the .fastq files to .bam files and use samtools?

ADD REPLYlink written 12 months ago by nattzy9420
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 860 users visited in the last hour