Merging RNA-seq sample with several replicate in which step?
1
0
Entering edit mode
6.1 years ago
star ▴ 350

I have some samples with 3 technical replicates, I have aligned each one and also got count table using HTSeq for each one. Now, I would like to know, it is better to merge 3 replicates together before getting count table (after alignment and then get count table using HTSeq) or after getting count table for each one using HTSeq?

I think merging is not proper, and I think one of below options could work:

  • select better sequence from different replicates of same sample based on number of ambiguous counts or Qc of alignment?

  • Merging and getting average?

Thanks a lot in advance!

RNA-Seq R genome alignment • 5.5k views
ADD COMMENT
0
Entering edit mode

some samples with 3 technical replicates

Technical replicates have no value for HT sequencing data. Your count tables for three must be mostly similar (unless you have large variations in read numbers per sample).

Do you rather have three biological replicates, which would be very valuable?

ADD REPLY
0
Entering edit mode

my count tables for technical replicates are different. Also I have 3 biological replicate that I should merge them (they also should be merged after count table or after alignment?)

ADD REPLY
1
Entering edit mode

Also I have 3 biological replicate that I should merge them

No, you should not merge biological replicates.

ADD REPLY
0
Entering edit mode

Thank you. but how about technical replicate, what is your opinion?

ADD REPLY
2
Entering edit mode
6.1 years ago

It turns out that at least for Illumina samples, technical replicates aren't really needed. The Illumina prep doesn't add much in the way of technical variance.

DESeq2 has a method for collapsing technical replicates, in which case, you would do the counts on each one separately, then let DESeq merge them. But I think you'd be okay merging first, then counting. No need to average, just add all the counts together.

Never merge biological replicates. Those must remain separate to know the biological variance.

ADD COMMENT
1
Entering edit mode

It sounds like OP did technical replicates for some samples and not others. For starters, this situation should be avoided.

Now that he has technical replicates, I would process then separately, check if they are really consistent with each other (PCA plot should be fine for that), and merge the counts only after checking no batch effects are found.

ADD REPLY
0
Entering edit mode

@ swbarnes2 :Thanks a lot. So you mean is that merge technical replicate together after alignment and then get count table?

ADD REPLY

Login before adding your answer.

Traffic: 2724 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6