Question: Standard options for mapping with STAR in context of RNA-seq data analysis regarding splicing
gravatar for caggtaagtat
3.0 years ago by
caggtaagtat1.4k wrote:


During my analysis of RNA-seq data regarding alternative splicing and splicing pattern, I came across some standard options of the STAR algorithm, which I could not quiet follow.

--alignSJoverhangMin 8
(minimum overhang for unannotated junctions)
--alignSJDBoverhangMin 1
(minimum overhang for annotated junctions)

I have concerns using this standard options, which regulate the minimal overlap of a read over the exon junction. I don't understand why one nucleotide overlap is enough to map a read to an annotated exon junction. And why would you not generally use the same minimum overlaps for a junction to begin with?

So my question is, what would you use as an minimal overlap? Would you use the standard settings or for example 6nt for both junctions?

rna-seq options star splicing • 1.6k views
ADD COMMENTlink modified 3.0 years ago by Devon Ryan98k • written 3.0 years ago by caggtaagtat1.4k
gravatar for Devon Ryan
3.0 years ago by
Devon Ryan98k
Freiburg, Germany
Devon Ryan98k wrote:

1 base is enough for an annotated junction because the it's known a priori to be possible. The threshold is higher for novel junctions, simply because they're novel and smaller values will tend to lead to more spurious findings.

As a general rule, always start with the default settings (except for the number of threads).

ADD COMMENTlink written 3.0 years ago by Devon Ryan98k

Ok I will continue using the default settings than.

However, I still can't wrap my head around why lowering the treshold for annotated junctions does not lead to missmapped reads at this position? How is this possible? Or does STAR prevent missmapping at positions of annotated junctions at another level?

ADD REPLYlink written 3.0 years ago by caggtaagtat1.4k

I'm not sure why you'd think such mappings would be wrong, they're incredibly likely to be correct for the simple reason that the junction is annotated.

ADD REPLYlink written 3.0 years ago by Devon Ryan98k

Would'nt the possibilty for wrong mapping be 3/4, when a read is mapped with just one nucleotide overlap?

EDIT: Thanks for editing btw, this forum is great! :)

ADD REPLYlink modified 3.0 years ago • written 3.0 years ago by caggtaagtat1.4k

You a priori expect reads spanning that junction, so no the probability would be reasonably low.

ADD REPLYlink written 3.0 years ago by Devon Ryan98k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1934 users visited in the last hour