what does CIGAR '22M2007N79M-61p82M19S' means?
2
0
Entering edit mode
7.8 years ago
juncheng ▴ 200

Hi,

This cigar is from paired end 101bp RNA-seq data.

I kind of know the others, but especially what does -61p means? The '-' here means minus?

Best

ChIP-Seq • 1.9k views
ADD COMMENT
1
Entering edit mode

That shouldn't exist, can you post the whole alignment (just that one line)?

ADD REPLY
0
Entering edit mode

BTW, this actually just looks like a corrupt line.

ADD REPLY
0
Entering edit mode

Yes, you are right. I found it only appears in the Chimeric.out.junction file of STAR. The STAR develop group might can explain.

ADD REPLY
0
Entering edit mode
7.8 years ago
juncheng ▴ 200

Note, that unlike standard SAM, both mates are recorded in one line here. The gap of length L between the mates is marked by the "Lp" in the CIGAR string.

If the mates overlap, L<0.

That's the explanation. Thanks @Devon Ryan

ADD COMMENT
1
Entering edit mode

Guess we replied at the same time. Yes, STAR doesn't use the same CIGAR format in that file that's used in SAM/BAM files.

ADD REPLY
0
Entering edit mode
7.8 years ago

In the future, you might want to mention at the beginning that this CIGAR string is coming from STAR's chimeric.out.junction file. The format of that file is described here and that string only makes sense it that context (i.e., it's not a regular CIGAR string). So the string describes both mates and the -61p is the distance between the mates, where a negative value means the mates overlap.

ADD COMMENT

Login before adding your answer.

Traffic: 1910 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6