Mapping reads to nuclear, chloroplast, and mitochondria genome separately but all resulting bam files are similar sizes
1
0
Entering edit mode
17 months ago

I mapped reads onto a nuclear, chloroplast, and mitochondrial genome separately using bwa mem and then sorted using samtools. Entire genome: 179 Mb, chloroplast: 169Kb, mitochondria: 43Kb. The resulting bam files are all similar in sizes (23G, 27G, and 27G). Is their an error? Why would this be the case?

bwamem chloroplast genome mitochondrial genome • 358 views
ADD COMMENT
3
Entering edit mode
17 months ago
ATpoint 49k

Because a BAM contains all reads (mapped and unmapped) that were in the original fastq file, so this is normal and expected.

ADD COMMENT
0
Entering edit mode

so for further process just remove unmapped reads from those bam files?

ADD REPLY
0
Entering edit mode

Technically you can do that to reduce file size. If you want that is on you.

ADD REPLY
0
Entering edit mode

This would be dependent on aligner options. Some aligners will allow you to drop unmapped reads.

ADD REPLY

Login before adding your answer.

Traffic: 1492 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6