Question: RNA-seq biological replicates
2.1 years ago by
amy16 wrote:

I have paired-end (PE) Illumina HiSeq RNA-seq data. I trimmed the adapters and then aligned the reads to the reference genome. Before proceeding to transcript assembly and quantification, I would like to know how to decide which of the three biological replicates I should take forward for further analysis.

rna-seq • 782 views
2.1 years ago by
Kevin Blighe wrote:


The question is somewhat strange because it implies that you decided to use replicates expecting that one or more of them would fail(?). I'm not sure that I would spend a couple of hundred pounds sterling (GBP), or ~10,000 rupees, if I was later going to decide to ditch one or more of the samples.

If you've used a HiSeq and the laboratory personnel are experienced, then I imagine that you can use any of the replicates.

Procedures that most people do with replicates:

  • process them as separate samples and then, after normalization, check how they line up on PC1 vs. PC2 via principal components analysis
  • average counts over the replicates post normalisation (this was more common in cDNA microarray analysis)
  • concatenate the raw data FASTQ files together (zcat piped into gzip) and then process them as a single sample
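
As a rough illustration of the first option in the list above, here is a minimal Python sketch of a PCA check on replicates. It assumes NumPy is installed and that you already have a genes x samples matrix of raw counts. The function name `replicate_pca`, the toy counts, and the simple log2 counts-per-million normalization are all assumptions made for illustration; dedicated RNA-seq packages use their own normalization methods.

```python
import numpy as np


def replicate_pca(counts, n_components=2):
    """Return per-sample PCA scores and the fraction of variance per component.

    counts: genes x samples array of raw read counts (hypothetical input).
    """
    counts = np.asarray(counts, dtype=float)
    # Simple library-size normalization: counts per million, then log2(x + 1).
    cpm = counts / counts.sum(axis=0, keepdims=True) * 1e6
    logged = np.log2(cpm + 1.0)
    # Put samples in rows and center each gene across samples.
    x = logged.T
    x = x - x.mean(axis=0, keepdims=True)
    # PCA via singular value decomposition of the centered matrix.
    u, s, _ = np.linalg.svd(x, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)
    return scores, explained[:n_components]


# Toy data: 4 genes, 3 replicates of condition A followed by 3 of condition B.
toy_counts = [
    [100, 110, 90, 400, 420, 380],
    [200, 190, 210, 50, 55, 45],
    [300, 310, 290, 300, 305, 295],
    [50, 45, 55, 50, 52, 48],
]
scores, explained = replicate_pca(toy_counts)
for name, (pc1, pc2) in zip(["A1", "A2", "A3", "B1", "B2", "B3"], scores):
    print(f"{name}\tPC1={pc1:.3f}\tPC2={pc2:.3f}")
```

Replicates of the same condition should sit close together on PC1 vs. PC2; a replicate that lands far from its group is the one to look at more closely.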

You mention assembly, so I would concatenate your samples together and then do the de novo transcriptome assembly on the concatenated sample. That said, all transcriptome assemblers that I've used allow you to specify multiple samples at the command line, and they then merge them together anyway.
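
The concatenation step can be sketched in Python using only the standard library. This relies on the fact that gzip files can be appended to one another byte for byte and still form a valid gzip file, so no decompression is needed. The function name `concatenate_fastq_gz` and the file names in the usage note are hypothetical.

```python
import shutil


def concatenate_fastq_gz(inputs, output):
    """Append gzipped FASTQ files, in order, into one multi-member gzip file."""
    with open(output, "wb") as out:
        for path in inputs:
            with open(path, "rb") as src:
                # Copy the compressed bytes as they are; no re-compression.
                shutil.copyfileobj(src, out)
```

For paired-end data, call it once per mate and keep the replicate order identical in both calls, e.g. `concatenate_fastq_gz(["rep1_R1.fastq.gz", "rep2_R1.fastq.gz"], "merged_R1.fastq.gz")` and then the same for the `R2` files, so that read pairs stay in sync.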

If any of the samples 'failed', I doubt that you'd have the data in hand right now. You should be able to confirm the basic quality of the samples by contacting the lab that did the sequencing, or just check the reports that they sent.

Good luck, Kevin


Please accept my colleague's answer too. We give the same answer.

Reply written 2.1 years ago by Kevin Blighe
2.1 years ago by
United States
genomax wrote:

All three. If you have enough reads in all three then you could assemble them independently and then merge the data to create a more comprehensive representation of the transcriptome.


Got there while I was writing the answer. You're the master at comments!

We more or less give the same advice.

Reply written 2.1 years ago by Kevin Blighe