Question: Standard options for mapping with STAR in context of RNA-seq data analysis regarding splicing
caggtaagtat wrote, 18 months ago:

Hi,

During my analysis of RNA-seq data regarding alternative splicing and splicing patterns, I came across some standard options of the STAR algorithm that I could not quite follow:

--alignSJoverhangMin 8
(minimum overhang for unannotated junctions)
--alignSJDBoverhangMin 1
(minimum overhang for annotated junctions)

I have concerns about using these standard options, which control the minimum overhang of a read across an exon junction. I don't understand why a one-nucleotide overhang is enough to map a read to an annotated exon junction. And why would you not use the same minimum overhang for both kinds of junction to begin with?

So my question is: what would you use as a minimum overhang? Would you use the standard settings or, for example, 6 nt for both kinds of junction?

Devon Ryan (Freiburg, Germany) wrote, 18 months ago:

1 base is enough for an annotated junction because it's known a priori to be possible. The threshold is higher for novel junctions simply because they're novel, and smaller values will tend to lead to more spurious findings.
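To make the two thresholds concrete, here is a minimal Python sketch of how such a filter could behave. It is a simplified model and not STAR's actual code: the `SplicedAlignment` class and `passes_overhang_filter` function are hypothetical names, and the values 8 and 1 are simply the ones quoted in the question.

```python
from dataclasses import dataclass


@dataclass
class SplicedAlignment:
    """Hypothetical, simplified view of one read spanning one junction."""
    left_overhang: int   # aligned bases upstream of the junction
    right_overhang: int  # aligned bases downstream of the junction
    annotated: bool      # True if the junction is in the supplied annotation


def passes_overhang_filter(aln, sj_overhang_min=8, sjdb_overhang_min=1):
    """Keep the alignment only if its shorter side reaches the threshold.

    Annotated junctions use the lenient threshold (sjdb_overhang_min),
    unannotated ones the strict threshold (sj_overhang_min).
    """
    threshold = sjdb_overhang_min if aln.annotated else sj_overhang_min
    return min(aln.left_overhang, aln.right_overhang) >= threshold


# A 100 nt read with a single base across the junction:
print(passes_overhang_filter(SplicedAlignment(99, 1, annotated=True)))
print(passes_overhang_filter(SplicedAlignment(99, 1, annotated=False)))
```

In this model the same read is kept when the junction is annotated and rejected when it is novel, which is the asymmetry described above.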

As a general rule, always start with the default settings (except for the number of threads).


OK, I will continue using the default settings then.

However, I still can't wrap my head around why lowering the threshold for annotated junctions does not lead to mismapped reads at these positions. How is this possible? Or does STAR prevent mismapping at annotated junctions at another level?

(comment by caggtaagtat, 18 months ago)

I'm not sure why you'd think such mappings would be wrong; they're incredibly likely to be correct for the simple reason that the junction is annotated.

(comment by Devon Ryan, 18 months ago)

Wouldn't the probability of a wrong mapping be 3/4 when a read is mapped with just one nucleotide of overlap?

EDIT: Thanks for editing btw, this forum is great! :)

(comment by caggtaagtat, 18 months ago, edited)

You a priori expect reads spanning that junction, so no, the probability would be reasonably low.

(comment by Devon Ryan, 18 months ago)
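The "a priori" argument in this exchange can be illustrated with a small Bayes calculation. This is only a rough sketch under loud assumptions: the prior probabilities are invented for illustration, bases are treated as uniform and independent (a 1/4 chance match per base), sequencing errors are ignored, and `posterior_spliced` is a hypothetical helper. It is not how STAR scores alignments.

```python
def posterior_spliced(prior_spliced, overhang, p_chance_match=0.25):
    """P(read truly spans the junction | its overhang bases match the next exon).

    prior_spliced: assumed prior that a read ending here crosses the junction
                   (hypothetical value, higher for annotated junctions).
    overhang:      number of read bases aligned past the junction.
    """
    likelihood_spliced = 1.0                        # a true spanning read always matches
    likelihood_chance = p_chance_match ** overhang  # unrelated sequence matches by luck
    numerator = prior_spliced * likelihood_spliced
    return numerator / (numerator + (1.0 - prior_spliced) * likelihood_chance)


# Hypothetical priors: high for an annotated junction, very low for a novel one.
annotated_1nt = posterior_spliced(prior_spliced=0.9, overhang=1)
novel_1nt = posterior_spliced(prior_spliced=0.001, overhang=1)
novel_8nt = posterior_spliced(prior_spliced=0.001, overhang=8)

print(f"annotated, 1 nt overhang: {annotated_1nt:.3f}")
print(f"novel,     1 nt overhang: {novel_1nt:.3f}")
print(f"novel,     8 nt overhang: {novel_8nt:.3f}")
```

Under these assumed numbers, a single matching base leaves an annotated junction very likely to be correct, while a novel junction needs several matching bases before it becomes believable.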