I have got following results for my bam file.
186895644 + 0 in total (QC-passed reads + QC-failed reads)
0 + 0 duplicates
1627664 + 0 mapped (0.87%:-nan%)
0 + 0 paired in sequencing
0 + 0 read1
0 + 0 read2
0 + 0 properly paired (-nan%:-nan%)
0 + 0 with itself and mate mapped
0 + 0 singletons (-nan%:-nan%)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)
How to interpret these values? And is this a good alignment for bam file to proceed further?
Thanks in advance!
It's hard to give advice without knowing what your experiment is, but the flagstat shows that you tried to align 186895644 reads, but only 1627664 reads aligned (0.87%). This is an extremely low mapping rate, but without knowing what you are assaying here, you need to decide if this is appropriate.
iam aligning loxl1 gene obtained for exfoliation glaucoma patients(retina cells) on chromosome 15 to find SNP,s (single nucleotide polymorphism).Loxl1 gene is present on chromosome 15 so iam interested in aligning it on chr15..
iam aligning loxl1 gene obtained for exfoliation glaucoma patients(retina cells) on chromosome 15 to find SNP,s (single nucleotide polymorphism).Loxl1 gene is present on chromosome 15 so iam interested in aligning it on chr15..
iam aligning loxl1 gene obtained for exfoliation glaucoma patients(retina cells) on chromosome 15 to find SNP,s (single nucleotide polymorphism).Loxl1 gene is present on chromosome 15 so iam interested in aligning it on chr15..
what sequencing technology did you use ?
these reads are of loxl1 gene obtained from illumina.
Illumina is fine. You have mapped to Chr15 only I will try to map to the whole genome, to see what happens
Reference you used might be wrong, also the mapped ones could be the most common genic regions between various species, just rRNA regions too
iam using chromosome 15 as a reference as i have NGS data for loxl1 gene responsible for exfoliation glaucoma and i want to identify SNP,s..
When you ask questions on Biostars this kind of information is critical and should have been provided in the original question.
Still 0.87% alignment seems low. Is your data from targeted/capture sequencing?
its targeted genome sequencing, i,ll repeat steps again and will see the results. thanks for responding..
You need to specify the size of the targeted region (was it just that one gene)? It kind of seems an overkill to use NGS for one gene. What does rest of 99% of the file represent?
yes it is just one gene obtained from 50 cases(patients)..
Then what is the easiest way to detect SNP,s in just one gene without over-killing NGS??
Sanger sequencing might be cheaper.