Is it ok to do Indel realignment after removing subsets of reads?
1
0
Entering edit mode
9 months ago

Hello community,

I have now filtered out all reads that aligned to mtDNA and to unclassified contigs (not being part of defined chromosomes). I realized however that I also have to do the indel re-alignment, and I think that I should have done this before filtering. I am not sure though, would it be too different doing it before or after? I imagine, that in principle it should not make any difference.

I would be thankful for some explanation.

Thank you!

Good summer!

Alignment bam Indel • 490 views
ADD COMMENT
2
Entering edit mode
9 months ago
Zhenyu Zhang ★ 1.2k

It shouldn't make any difference in principle, as you said. But I don't like the general practice people removing reads from BAM files. Why don't you just keep all the reads there? Basically I see a lot of cons, and not much pros doing that (except save say a little bit storage).

ADD COMMENT
0
Entering edit mode

Hi,

Thank you for your answer, in fact I have the before and after bam files (even though it does occupy quite a bit of space). But what cons do you mean? I am removing these reads so whatever variants I find downstream do not belong to the mtDNA or the unclassified contigs. You know of ways to do this later in the pipeline? Do you convert to some other format?

Thank you,

Edit: Ok, I have found this: Should I Remove The Unmapped Reads From My Bam ?. I understand your point

ADD REPLY
1
Entering edit mode

If you kept all the reads, you only need to keep one copy. My general bioinformatics practice is to keep everything and doing the filtering as later as possible. In that case, when you want to change some steps and redo any analysis, you don't have to redo everything again.

ADD REPLY

Login before adding your answer.

Traffic: 1495 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6