Question: combining reads from multiple lanes for DE analysis
7 months ago by
Pappu1.9k wrote:

I am trying to do differential expression (DE) analysis between different conditions. The same samples were sequenced on multiple lanes to improve coverage. I am wondering whether it would be better to combine the reads from the different lanes before quantification, or to quantify each lane separately before DE analysis. Having more samples for DE analysis would improve the statistical power, I suppose.

rna-seq • 220 views
modified 7 months ago by ATpoint22k • written 7 months ago by Pappu1.9k

If the same sample is on multiple lanes of the same flowcell, you can safely combine those files prior to analysis. If the same sample was run on multiple lanes across flowcells, you can still combine the data, but you may want to use read groups (RG) to keep track of the flowcell (FC) in case there is a batch effect. There generally should not be any, but this helps if you want to be extra careful.
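As a rough illustration of combining lane files, here is a minimal Python sketch. It assumes the lanes are gzipped FASTQ files and relies on the fact that concatenated gzip members form a valid gzip stream, so no decompression is needed. The function name `merge_lanes` and the file names are hypothetical, not taken from the thread.

```python
import shutil

def merge_lanes(lane_files, merged_path):
    """Concatenate per-lane FASTQ files (gzipped or plain) byte for byte.

    For paired-end data, call this once for R1 and once for R2 with the
    lanes listed in the same order, so that mates stay in sync.
    """
    with open(merged_path, "wb") as out:
        for lane in lane_files:
            with open(lane, "rb") as src:
                shutil.copyfileobj(src, out)

# Hypothetical usage:
# merge_lanes(["s1_L001_R1.fastq.gz", "s1_L002_R1.fastq.gz"], "s1_R1.fastq.gz")
```

If the flowcell needs to be tracked, the lanes would instead be aligned separately with distinct read groups and the alignments merged afterwards.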

modified 7 months ago • written 7 months ago by genomax71k
7 months ago by
ATpoint22k wrote:

There is no gold standard for this. You can quantify and quality-control all runs separately, to make sure no technical problems occurred during the sequencing run (which is very rare in my experience), and then sum up the counts from the technical replicates before feeding them into the DE analysis. Still, do not treat these lane replicates as independent replicates in the DE analysis, as they are simply sequencing replicates, not biological replicates. Technically this is of course possible, but it would underestimate the sample dispersion and create significance where there is none.
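Summing the counts of technical replicates could look roughly like the sketch below. It assumes each lane was quantified separately into a gene-to-count mapping; the function name `collapse_lanes`, the run IDs, and the gene names are made up for illustration. In R, DESeq2's `collapseReplicates` serves the same purpose.

```python
from collections import defaultdict

def collapse_lanes(counts, run_to_sample):
    """Sum raw counts over all runs (lanes) that belong to the same sample.

    counts:        {run_id: {gene: raw_count}}
    run_to_sample: {run_id: sample_id}
    """
    collapsed = defaultdict(lambda: defaultdict(int))
    for run_id, gene_counts in counts.items():
        sample = run_to_sample[run_id]
        for gene, n in gene_counts.items():
            collapsed[sample][gene] += n
    return {sample: dict(genes) for sample, genes in collapsed.items()}

# Hypothetical example: ctrl1 was sequenced on two lanes, trt1 on one.
lane_counts = {
    "ctrl1_L001": {"geneA": 10, "geneB": 0},
    "ctrl1_L002": {"geneA": 12, "geneB": 1},
    "trt1_L001": {"geneA": 30, "geneB": 5},
}
lane_to_sample = {"ctrl1_L001": "ctrl1", "ctrl1_L002": "ctrl1", "trt1_L001": "trt1"}
sample_counts = collapse_lanes(lane_counts, lane_to_sample)
```

The DE analysis then sees one column per biological sample, so the lanes do not inflate the number of replicates.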

modified 7 months ago • written 7 months ago by ATpoint22k