Question

Very low mapping rate of a bam file, reasons?

2

Entering edit mode

8.0 years ago

nidoo.beg ▴ 20

I have got following results for my bam file.

186895644 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
1627664 + 0 mapped (0.87%:-nan%)
0 + 0 paired in sequencing
0 + 0 read1
0 + 0 read2
0 + 0 properly paired (-nan%:-nan%)
0 + 0 with itself and mate mapped
0 + 0 singletons (-nan%:-nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)

How to interpret these values? And is this a good alignment for bam file to proceed further?

Thanks in advance!

next-gen alignment bam • 3.8k views

ADD COMMENT • link updated 13 months ago by Ram 43k • written 8.0 years ago by nidoo.beg ▴ 20

2

Entering edit mode

It's hard to give advice without knowing what your experiment is, but the flagstat shows that you tried to align 186895644 reads, but only 1627664 reads aligned (0.87%). This is an extremely low mapping rate, but without knowing what you are assaying here, you need to decide if this is appropriate.

ADD REPLY • link 8.0 years ago by James Ashmore ★ 3.4k

0

Entering edit mode

iam aligning loxl1 gene obtained for exfoliation glaucoma patients(retina cells) on chromosome 15 to find SNP,s (single nucleotide polymorphism).Loxl1 gene is present on chromosome 15 so iam interested in aligning it on chr15..

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

0

Entering edit mode

iam aligning loxl1 gene obtained for exfoliation glaucoma patients(retina cells) on chromosome 15 to find SNP,s (single nucleotide polymorphism).Loxl1 gene is present on chromosome 15 so iam interested in aligning it on chr15..

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

0

Entering edit mode

iam aligning loxl1 gene obtained for exfoliation glaucoma patients(retina cells) on chromosome 15 to find SNP,s (single nucleotide polymorphism).Loxl1 gene is present on chromosome 15 so iam interested in aligning it on chr15..

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

2

Entering edit mode

what sequencing technology did you use ?

ADD REPLY • link 8.0 years ago by Antonio R. Franco ★ 5.1k

0

Entering edit mode

these reads are of loxl1 gene obtained from illumina.

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

0

Entering edit mode

Illumina is fine. You have mapped to Chr15 only I will try to map to the whole genome, to see what happens

ADD REPLY • link 8.0 years ago by Antonio R. Franco ★ 5.1k

1

Entering edit mode

Reference you used might be wrong, also the mapped ones could be the most common genic regions between various species, just rRNA regions too

ADD REPLY • link 8.0 years ago by Rohit ★ 1.5k

0

Entering edit mode

iam using chromosome 15 as a reference as i have NGS data for loxl1 gene responsible for exfoliation glaucoma and i want to identify SNP,s..

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

1

Entering edit mode

When you ask questions on Biostars this kind of information is critical and should have been provided in the original question.
Still 0.87% alignment seems low. Is your data from targeted/capture sequencing?

ADD REPLY • link 8.0 years ago by GenoMax 141k

0

Entering edit mode

its targeted genome sequencing, i,ll repeat steps again and will see the results. thanks for responding..

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

2

Entering edit mode

You need to specify the size of the targeted region (was it just that one gene)? It kind of seems an overkill to use NGS for one gene. What does rest of 99% of the file represent?

ADD REPLY • link 8.0 years ago by GenoMax 141k

0

Entering edit mode

yes it is just one gene obtained from 50 cases(patients)..

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

0

Entering edit mode

Then what is the easiest way to detect SNP,s in just one gene without over-killing NGS??

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

0

Entering edit mode

Sanger sequencing might be cheaper.

ADD REPLY • link 8.0 years ago by Devon Ryan 104k

score 4 · Answer 1 · 2016-04-25

4

Entering edit mode

8.0 years ago

Devon Ryan 104k

Less than 1% of your reads had alignments. That would normally indicate that the results are useless and that something went wrong.

ADD COMMENT • link 8.0 years ago by Devon Ryan 104k

score 4 · Answer 2 · 2016-04-25

4

Entering edit mode

8.0 years ago

Sukhi Singh 11k

Very low mapping rate , (1627664/186895644)*100 = 0.87%

Was the correct genome assembly used? Run the file via fastQScreen to rule out cross-contamination.

ADD COMMENT • link 8.0 years ago by Sukhi Singh 11k

0

Entering edit mode

yes i had done this step. thanks alot

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

score 2 · Answer 3 · 2016-04-25

2

Entering edit mode

8.0 years ago

agata88 ▴ 870

You have total QC (quality control) passed reads : 186895644 but mapped only 1627664 (0.87%). Do you have PE reads? Because none of the reads were property paired. In my opinion something went wrong, did you choose correct reference genome?

Best,

Agata

ADD COMMENT • link 8.0 years ago by agata88 ▴ 870

0

Entering edit mode

i think there is some issue with reference, i,ll do the steps again thanks

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

0

Entering edit mode

no . i have single end reads

ADD REPLY • link 8.0 years ago by nidoo.beg ▴ 20

score 0 · Answer 4 · 2016-04-25

0

Entering edit mode

8.0 years ago

Antonio R. Franco ★ 5.1k

What program did you use for mapping? What were its setting? What did you sequenced and what did you use as reference?

ADD COMMENT • link 8.0 years ago by Antonio R. Franco ★ 5.1k