nanopore unaligned bam files
1
1
Entering edit mode
13 days ago
sarahmanderni ▴ 130

Hi,

I have received nanopore data with primary analysis from a sequencing center. It includes pod5 file, fastq files and bam files (both pass and fail folders for each). I checked the bam files and looks like all reads are currently unmapped, despite the methylation information (MM/ML tags) are avaialable. So, looks like the basecalling was performed using a modified-base-aware model but the alignment itself has been skipped or wrong reference genome or something like this. So, I have 2 questions: 1) is this normal to receive the bam files in this shape and 2) is there a way to do the alignment using the fastq files or extracted sequences from bam files (using example minimap2) and merge that with the methyl info available at bam files? So, I hoping not to require to go back to pod5 files and both base-calling and alignment (example with dorado) as the pod5 files are extremely large. Thanks in advance!

nanopore • 618 views
ADD COMMENT
2
Entering edit mode
13 days ago
GenoMax 152k

is this normal to receive the bam files in this shape and

Yes. If you asked for methylation calls then this is a way to capture that information. Programs like modkit use this information: https://github.com/nanoporetech/modkit

is there a way to do the alignment using the fastq files or extracted sequences from bam files (using example minimap2) and merge that with the methyl info available at bam files?

Yes you can directly use the fastq files for alignments with minimap2, if you have no interest in methylation.

ADD COMMENT
1
Entering edit mode

I think OP wants to align AND keep the methylation info, which is indeed the tricky part ...

Not entirely sure but I think the dorado aligner subcommand takes bam as input (and I would thus assume/hope it will take over the extra bam info such as for instance the methylation info)

ADD REPLY
3
Entering edit mode

OP wants to align AND keep the methylation info

Then use something like

minimap2 -t use_N_cores -Y -y -ax map-ont your_minimap_index_path <(samtools fastq -@
 use_N_cores -T MM,ML your_unaligned_methyl_call.bam) | samtools sort --write-index -o methyl_
aligned.bam -

colindaven has an answer to start with fastq files (if they have calls) --> Nanopore long-read sequencing doubts and problems

ADD REPLY
0
Entering edit mode

yes keeping both mapping info and methylation info at the same time is the intention. Thanks so much for the reosponse! I will try it

ADD REPLY
1
Entering edit mode

When you have a bam make sure you check the methyl tags MM and ML are present. Something like

samtools view -h x.bam | grep MM
samtools view -h x.bam | grep ML
ADD REPLY

Login before adding your answer.

Traffic: 2033 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6