Bwa Combined Reference Vs Sequential Alignment
1
1
Entering edit mode
12.3 years ago

Hi,

I am currently developing a pipeline for novel viral detection and I am using BWA for filtering the mapped sequnces.

I recognized during my tests that I have a different number of unmapped reads after I align my sample against the whole genome once, in contrast to if I align each chromosome seperately and pipe the output.

Is this causing BWA heuristically concept, that I get a different number of mapped sequences by using the same references? Is this normal?

What is better, to align chromosome by chromosome or to align the whole genome at once?

Would be pelased if anyone could help.

bwa • 2.8k views
ADD COMMENT
1
Entering edit mode

Answered here on SeqAnswers if there's no takers: http://seqanswers.com/forums/showthread.php?p=62059

ADD REPLY
2
Entering edit mode
12.3 years ago

A related question on Biostar would be Exome sequencing: masking the non-genic sequences?

The answer to your question is that in general you should always align against the full reference genome that your samples could come from.

For example as mentioned above even though the process attempts to capture the exome a lot of reads could come from other regions and if you'd mask these regions during alignment you could be mistakenly assign the reads to exons.

ADD COMMENT

Login before adding your answer.

Traffic: 1999 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6