Question: Detection of splicing defects in RNA-Seq data of Arabidopsis thaliana
0
gravatar for julius
8 months ago by
julius0
Germany/Frankfurt
julius0 wrote:

Hello everyone,

I'm studying the effects of chemicals on plants with a focus on potential disturbances in the process of splicing. Therefore, I treated my plants with said chemicals and extracted RNA after several time-points.

After the wet-lab procedures I sequenced the mRNA using Illumina NextSeq with 150 bp, 20 x 10⁶ reads per sample in paried-end mode ("only" 20 x 10⁶ reads as this first trial is somewhat a proof of concept).

After trimming the reads with Trim Galore 0.6.4 and aligning them to the genome of Arabidopsis using STAR aligner 2.7.1a, I'm now searching for a way of detecting reads on genes indicating miss-spliced mRNA.

So far I've tried to do this with rMATS v3.1.0, but couldn't find real defects in splicing. The results I'm getting with this approach are mainly shifts in the presence of several iso-forms of mRNA but nothing which is not already annotated in the corresponding gtf-File.

Nevertheless, there are such events, as I've included a known inhibitor of the spliceosome, which also yields miss-spliced reads ( if checking the BAM-files in the IGV-viewer). But again, those events are not detected by rMATS.

Does anyone know a tool for my purposes or can give me advice on how to detect those events with rMATS?

I've attached a representative picture from IGV-viewer, showing the control (untreated// mock-treated) as well as two samples which were treated with inhibitors. I've also run cufflinks on the BAM files to generate gtf files for the present transcripst, showing there are intron containing transcripts.

Test design: 3 (biological) replicates per sample 3 sample types: mock, positive, compound of interest

Code used:

STAR --runThreadN 4 --runMode genomeGenerate --genomeDir /Atha_genome/ --genomeFastaFiles  GCF_000001735.4_TAIR10.1_genomic.fna --sjdbGTFfile  GCF_000001735.4_TAIR10.1_genomic.gtf --sjdbOverhang 149

STAR --runThreadN 4 --genomeDir /Atha_genome --readFilesIn /treated/T1_R1_1.fq /treated/T1_R1_2.fq --outFileNamePrefix T1_R1 --outSAMtype BAM SortedByCoordinate --alignIntronMin 10 --alignIntronMax 1000 --alignEndsType EndToEnd --twopassMode Basic --twopass1readsN -1

python rmats.py --b1 untreated.txt --b2 treated.txt --od /rMATS_out/ -t paired -readLength 150 --libType fr-secondstrand --gtf  GCF_000001735.4_TAIR10.1_genomic.gtf --nthread 4 --tstat 4 --cstat 0.0001

Thanks in advance,

Cheers,

Julius

enter image description here

rna-seq plants splicing rmats • 256 views
ADD COMMENTlink modified 8 months ago by ATpoint41k • written 8 months ago by julius0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1299 users visited in the last hour