BAM file's reference sequence IDs doesnt match contigs ID's
1
0
Entering edit mode
5 weeks ago

Hello I have been experienceing some issues while using binning tools, on the coverage estimation modules of the binning tools I have got 0% of coverage and the reason seems to be that BAM file's contigs IDs are not the same that the contigs IDs of my contigs.fasta file.

here are the contigs IDs on my bam files:

K00171:838:H25L5BBXX:6:1202:15179:21957 163     **k119_248961**
K00171:838:H25L5BBXX:6:2102:14174:7767  163     **k119_248961**


....

and here are the contigs.fasta file's contigs IDs:

>c_000000000001

>c_000000000002

>c_000000000003


....

how can I deal with this? is there a option when using samtools that i can use to cope with this (or any tool option) ?

files metagenomics alignment bam binning • 370 views
1
Entering edit mode
5 weeks ago
GenoMax 100k

If you aligned your data to right set of reference sequences then those fasta headers would be in your BAM. You should re-map your data to correct reference.

0
Entering edit mode

yes I aligned the reads to the correct reference sequences, getting 60-90% of alignment rate with bowtie2, I ran again the alignment and got th same results ): ( diffrent sequences IDs, the same that I described above).

what can be happening?, I made the assembly using megahit if that helps

0
Entering edit mode

If you need id's shown in contigs.fasta example above in your BAM file then you need to create an index with contigs.fasta file and then align your data to that index.

0
Entering edit mode

the problem was regarding to the alignment process using bowtie2, I reindexed my contigs and now I got the proper contigs names on my bam files :)