Unaligning BAM files
2
0
Entering edit mode
7.2 years ago
M. Khan • 0

Can anyone help me unalign BAM files that have been aligned using TMAP? I want to run my raw reads through GATK best practices but the BAM files we received from our core facility are already aligned using TMAP. Thanks!

TMAP BAM IonTorrent • 2.3k views
ADD COMMENT
1
Entering edit mode

GATK accepts sorted bam files. So, why you want to unalign to raw reads?

ADD REPLY
0
Entering edit mode

I want to be able to use the same methodology in a previous study on a different type of cancer.

ADD REPLY
0
Entering edit mode

Can you please elaborate in brief?

ADD REPLY
0
Entering edit mode

Basically I want to

  1. Mark and remove PCR duplicates with the Picard package.
  2. Realign Indels with known sites and base quality score recalibration were performed with GATK (Genome Analysis Toolkit), in line with current best practices in the next-generation sequencing field for variant detection, to produce variant-caller ready reads in BAM format.
  3. Call variants with MuTect
  4. Use Invex to find passenger vs driver mutations
ADD REPLY
0
Entering edit mode

For these analysis, you don't need raw read files. you can use bam files that you have got. Sort them, if it is not. used this sorted files for MarkDuplicates and further analysis.

So you don't need to obtain reads from your bam files. If you still want to obtain raw reads use bedtools bamtofastq conversion tool.

You may not get unaligned reads in fastq files that you obtain from bam files.

ADD REPLY
0
Entering edit mode

You want to run your reads through "GATK best practices" to do what?

In the meantime, I have the following program: https://github.com/grenaud/libbam/blob/master/removeTagsMapping.cpp

ADD REPLY
0
Entering edit mode

I am trying to find mutations that are present in diseased tissue versus the patients blood.

ADD REPLY
0
Entering edit mode

so you want to run quality score recall and indel realignment and stuff ? You need mapped reads for this.

ADD REPLY
0
Entering edit mode

I do, but I'm trying to use the Galaxy Project and that won't work with TMAP aligned reads

ADD REPLY
0
Entering edit mode

The best thing to do is to ask the sequencing core to provide you with the original fastq files. Also, beware of the the homopolymer repeat error in the Ion torrent data.

ADD REPLY
0
Entering edit mode
7.2 years ago
donfreed ★ 1.6k
samtools sort -n my_alignment.bam my_alignment_qsorted
bedtools bamtofastq -i my_alignment_qsorted.bam -fq /dev/stdout -fq2 /dev/stdout | bwa mem -p /mnt/data/reference/hs37d5.fa - | samtools view -Sb > realigned.bam

If you want to run the GATK, I recommend:

bedtools bamtofastq -i my_alignment_qsorted.bam -fq /dev/stdout -fq2 /dev/stdout | bwa mem -p -R "@RG\tID:foo\tSM:my_sample" /mnt/data/reference/hs37d5.fa - | samblaster | samtools view -Sb - > realigned.bam
ADD COMMENT
0
Entering edit mode
7.2 years ago

Is bamtofastq from bedtools what you need?

ADD COMMENT

Login before adding your answer.

Traffic: 866 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6