unmapped reads with pysam
1
1
Entering edit mode
2.5 years ago

I have a bam file produces by BWA-MEM. It has some unmapped reads, e.g.

D00733:389:CD1T7ANXX:3:1101:1572:2235   77  *   0   0   *   *   0   0   CAGTTTCACTGTATAAATTGCTTATACTTAGACATGCATGGCTTAATCTT  AAB=AFGDGCFFGGGGGGGGGCGGGGGGGGGGGGGGGGGGGGGGGGGGGF  AS:i:0  XS:i:0
D00733:389:CD1T7ANXX:3:1101:1572:2235   141 *   0   0   *   *   0   0   GTATCTTCTAGAGAGAGGGAATGGGCGAGAGAAAAAGAGATTTCGGTTTC  BBB@BGGGGGGGGFGGGGGGGGEGGGGGGDGFGGGGGGGGEGGGGGFGGG  AS:i:0  XS:i:0
D00733:389:CD1T7ANXX:3:1101:6797:2243   77  *   0   0   *   *   0   0   TGTCTGGACCTGGTGAGTTTCCCCGTGTTGAGTCAAATTAAGCCGCAGGC  3A<0BDGGGGGGGGGGGGGFGGGGGFGGGGGGGGGGGGGGGGGGGGGGGG  AS:i:0  XS:i:0

I use pysam to count some stats on the bam file, but for some reason pysam does not find these unmapped reads.

bam=pysam.AlignmentFile(file,"rb")
for line in bam.fetch():
    line=line.tostring(bam)
    line=line.split("\t")
    if  line[2]=="*":
        print(line)

The code does not return anything

Any ideas how to fix this?

Thanks

pysam • 1.2k views
ADD COMMENT
2
Entering edit mode
2.5 years ago
Asaf 8.6k

you should add until_eof=True in the fetch()

ADD COMMENT
0
Entering edit mode

Accepted :) Thank you!

ADD REPLY

Login before adding your answer.

Traffic: 1600 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6