Detection of splicing defects in RNA-Seq data of Arabidopsis thaliana
0
0
Entering edit mode
4.1 years ago
julius • 0

Hello everyone,

I'm studying the effects of chemicals on plants with a focus on potential disturbances in the process of splicing. Therefore, I treated my plants with said chemicals and extracted RNA after several time-points.

After the wet-lab procedures I sequenced the mRNA using Illumina NextSeq with 150 bp, 20 x 10⁶ reads per sample in paried-end mode ("only" 20 x 10⁶ reads as this first trial is somewhat a proof of concept).

After trimming the reads with Trim Galore 0.6.4 and aligning them to the genome of Arabidopsis using STAR aligner 2.7.1a, I'm now searching for a way of detecting reads on genes indicating miss-spliced mRNA.

So far I've tried to do this with rMATS v3.1.0, but couldn't find real defects in splicing. The results I'm getting with this approach are mainly shifts in the presence of several iso-forms of mRNA but nothing which is not already annotated in the corresponding gtf-File.

Nevertheless, there are such events, as I've included a known inhibitor of the spliceosome, which also yields miss-spliced reads ( if checking the BAM-files in the IGV-viewer). But again, those events are not detected by rMATS.

Does anyone know a tool for my purposes or can give me advice on how to detect those events with rMATS?

I've attached a representative picture from IGV-viewer, showing the control (untreated// mock-treated) as well as two samples which were treated with inhibitors. I've also run cufflinks on the BAM files to generate gtf files for the present transcripst, showing there are intron containing transcripts.

Test design: 3 (biological) replicates per sample 3 sample types: mock, positive, compound of interest

Code used:

STAR --runThreadN 4 --runMode genomeGenerate --genomeDir /Atha_genome/ --genomeFastaFiles  GCF_000001735.4_TAIR10.1_genomic.fna --sjdbGTFfile  GCF_000001735.4_TAIR10.1_genomic.gtf --sjdbOverhang 149

STAR --runThreadN 4 --genomeDir /Atha_genome --readFilesIn /treated/T1_R1_1.fq /treated/T1_R1_2.fq --outFileNamePrefix T1_R1 --outSAMtype BAM SortedByCoordinate --alignIntronMin 10 --alignIntronMax 1000 --alignEndsType EndToEnd --twopassMode Basic --twopass1readsN -1

python rmats.py --b1 untreated.txt --b2 treated.txt --od /rMATS_out/ -t paired -readLength 150 --libType fr-secondstrand --gtf  GCF_000001735.4_TAIR10.1_genomic.gtf --nthread 4 --tstat 4 --cstat 0.0001

Thanks in advance,

Cheers,

Julius

enter image description here

RNA-Seq Splicing plants rmats • 830 views
ADD COMMENT

Login before adding your answer.

Traffic: 2842 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6