Question

rsem calculate expression using STAR. Reference genome format?

2

6.0 years ago

dtatarak ▴ 30

I am using rsem to calculate expression levels in a dataset of paired end reads. I want to use STAR as the aligner. I have written the command like this:

rsem-calculate-expression --star -p 16 --paired-end DT_01_R1.fastq DT_01_R2.fastq /path/to/reference/genome/ rsem_output/DT_01

I'm pretty sure this will work. What I don't know is this: do I need to generate an annotated reference genome for STAR before running this? I'm using the zebrafish genome, and I have downloaded both the .fasta file and the .gtf file from Ensembl. What do I need to do with these before I run the alignment and expression calculation I've written above? Or do I just point it to the directory containing the .fasta and .gtf files? Thanks very much!

rsem STAR • 15k views

ADD COMMENT • link updated 2.8 years ago by Ram 43k • written 6.0 years ago by dtatarak ▴ 30

0

Ok great thanks! I thought that was the case, but the manual didn't make it clear to me.

ADD REPLY • link 6.0 years ago by dtatarak ▴ 30

h.mon · Accepted Answer · 2018-05-04

11

6.0 years ago

h.mon 35k

You have to prepare the transcriptome reference first, see rsem-prepare-reference. See a tutorial here.
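
As a rough sketch of what that preparation could look like for the files in the question: the genome and annotation file names, the `rsem_ref/zebrafish` prefix, and the `run` dry-run helper below are all assumptions for illustration, not anything from the RSEM tutorial. The script only prints the commands unless `DRY_RUN=0` is set.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: print each command by default instead of executing it.
# Set DRY_RUN=0 to really run (requires RSEM and STAR on your PATH).
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Hypothetical file names; replace with your Ensembl downloads.
GENOME_FA="zebrafish_genome.fa"
ANNOTATION_GTF="zebrafish_annotation.gtf"
# RSEM expects a reference *prefix* (directory plus base name), not a bare directory.
REF_PREFIX="rsem_ref/zebrafish"
THREADS=16

run mkdir -p rsem_ref rsem_output

# Step 1: extract transcripts from the genome using the GTF and,
# because of --star, also build the STAR index next to the RSEM reference.
run rsem-prepare-reference --gtf "$ANNOTATION_GTF" --star -p "$THREADS" \
  "$GENOME_FA" "$REF_PREFIX"

# Step 2: align with STAR and quantify, pointing at the prefix from step 1.
run rsem-calculate-expression --star -p "$THREADS" --paired-end \
  DT_01_R1.fastq DT_01_R2.fastq "$REF_PREFIX" rsem_output/DT_01
```

The point of the sketch is that the reference argument to rsem-calculate-expression is the prefix created by rsem-prepare-reference, not the directory holding the raw .fasta and .gtf files.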

Alternatively, you can map with STAR to the genome (you will have to prepare the index yourself) using STAR --quantMode TranscriptomeSAM, and then use STAR's Aligned.toTranscriptome.out.bam output as input to rsem-calculate-expression.
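
Preparing the STAR index yourself might look roughly like the following. This is a sketch under assumptions: the file names, the `star_index` directory, the read length of 101, and the `run` dry-run helper are all hypothetical. The overhang is set to read length minus one, which is the value the STAR manual recommends.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: print each command by default instead of executing it.
# Set DRY_RUN=0 to really run (requires STAR on your PATH).
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Hypothetical inputs; replace with your own files and read length.
GENOME_FA="zebrafish_genome.fa"
ANNOTATION_GTF="zebrafish_annotation.gtf"
STAR_INDEX_DIR="star_index"
THREADS=16
READ_LENGTH=101
# Splice-junction overhang: read (mate) length minus 1.
SJDB_OVERHANG=$((READ_LENGTH - 1))

run mkdir -p "$STAR_INDEX_DIR"

# Build the genome index with the annotation's splice junctions included.
run STAR --runMode genomeGenerate \
  --runThreadN "$THREADS" \
  --genomeDir "$STAR_INDEX_DIR" \
  --genomeFastaFiles "$GENOME_FA" \
  --sjdbGTFfile "$ANNOTATION_GTF" \
  --sjdbOverhang "$SJDB_OVERHANG"
```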

ADD COMMENT • link 6.0 years ago by h.mon 35k

0

Hi h.mon, how can we do quantification with RSEM using a .bam file that was aligned separately with STAR? Can I get the code?

ADD REPLY • link 5.8 years ago by k.kathirvel93 ▴ 300

0

What is the command you use to align with STAR?

ADD REPLY • link 5.8 years ago by h.mon 35k

0

STAR --genomeDir /home/STAR/Indexed_Reference/ --runThreadN 16 \
    --sjdbGTFfile /home/ensemble_GRCh38.77/Homo_sapiens.GRCh38.77.gtf \
    --readFilesIn /home/_1.fastq  /home/2.fastq --sjdbOverhang 101 \
    --outSAMtype BAM SortedByCoordinate \
    --outFileNamePrefix /home/STAR_Output/Mapping/output

Should I mention --quantMode also?

ADD REPLY • link updated 5.8 years ago by h.mon 35k • written 5.8 years ago by k.kathirvel93 ▴ 300

2

Yes, you should have used --quantMode TranscriptomeSAM --quantTranscriptomeBan IndelSoftclipSingleend.

However, RSEM is finicky about the annotation. I would suggest you follow its tutorial, because you may otherwise end up with lots of BAM files that won't work with RSEM unless the annotation strictly follows what RSEM expects.
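
To make the two-step route concrete, here is a hedged sketch of aligning with STAR and then handing the transcriptome BAM to RSEM. It assumes a STAR index in `star_index` and an RSEM reference with prefix `rsem_ref/sample_ref`, both built beforehand from the same FASTA and GTF; those names, the read file names, the `DT_01_` output prefix, and the `run` dry-run helper are hypothetical. The option that restricts indels and soft clipping in the transcriptome BAM is left at its default here, which to my understanding is the RSEM-compatible setting, but check the manual for your STAR version.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: print each command by default instead of executing it.
# Set DRY_RUN=0 to really run (requires STAR and RSEM on your PATH).
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Hypothetical paths; both references are assumed to come from the same annotation.
STAR_INDEX_DIR="star_index"
RSEM_REF_PREFIX="rsem_ref/sample_ref"
STAR_PREFIX="star_out/DT_01_"
THREADS=16

# STAR writes the transcriptome alignments to <prefix>Aligned.toTranscriptome.out.bam.
TRANSCRIPTOME_BAM="${STAR_PREFIX}Aligned.toTranscriptome.out.bam"

run mkdir -p star_out rsem_output

# Step 1: align to the genome and also emit alignments in transcript coordinates.
run STAR --genomeDir "$STAR_INDEX_DIR" \
  --runThreadN "$THREADS" \
  --readFilesIn DT_01_R1.fastq DT_01_R2.fastq \
  --outSAMtype BAM SortedByCoordinate \
  --quantMode TranscriptomeSAM \
  --outFileNamePrefix "$STAR_PREFIX"

# Step 2: quantify from the existing alignments instead of letting RSEM align.
run rsem-calculate-expression --alignments --paired-end -p "$THREADS" \
  "$TRANSCRIPTOME_BAM" "$RSEM_REF_PREFIX" rsem_output/DT_01
```

Note that RSEM reads the Aligned.toTranscriptome.out.bam file, not the coordinate-sorted genome BAM.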

ADD REPLY • link updated 2.8 years ago by Ram 43k • written 5.8 years ago by h.mon 35k