Question: How to Merge 2 Technical Replicates From the same cDNA library (but run on different sequencers)?
0
gravatar for scideas
4 months ago by
scideas0
scideas0 wrote:

Could anyone inform me about the best approach for merging 2 technical replicates from the same 10x cDNA library, but run on different sequencers? Since the same cell barcodes are detected across both runs for many reads, I would like to merge them together while still accounting for batch effects.

Current approaches with Seurat, etc. assign new names to duplicated cell barcodes while merging counts or normalized data, treating the sample as though it was derived from two distinct cDNA libraries. This seems less accurate and reduces the actual read depth per cell for experiment. If the samples were run on the same instrument, I would otherwise just merge the two matrices and sum counts from identical barcodes (assuming that the technical variation was minimal).

Thanks for the help!

ADD COMMENTlink modified 4 months ago by swbarnes29.2k • written 4 months ago by scideas0

but run on different sequencers

It would be useful to add if the runs were done on same chemistry/sequencer type or different. e.g. both runs were on NovaSeq or were they run on two different types of sequencers e.g. NextSeq and HiSeq.

ADD REPLYlink written 4 months ago by GenoMax92k

Thanks for the note. To clarify, both of these were run on a NovaSeq. However, they were run at different universities across the US.

ADD REPLYlink written 4 months ago by scideas0
0
gravatar for swbarnes2
4 months ago by
swbarnes29.2k
United States
swbarnes29.2k wrote:

If you were calling variants, I might worry about slightly differing basecalling biases between different instruments, but as far as assigning reads to genes, I don't think that would be impacted much. I'd run the 10x pipeline with both fastqs together.

But I guess if you wnat to be sure, run them separately, and see how they look in Loupe when you aggregate them.

ADD COMMENTlink written 4 months ago by swbarnes29.2k

Thanks for the suggestion @swbarnes2. After checking out the cellranger aggr webpage, I found the following which was very informative:

cellranger aggr is not designed for combining multiple sequencing runs of the same GEM Well. For that, you should pass a list of FASTQ files from multiple sequencing runs of the same GEM well to the --fastqs argument of cellranger count.

So I'll give that a go and we'll see what happens.

ADD REPLYlink written 4 months ago by scideas0

If you want to see how different the two runs are, it will work for that. But once you are satisfied that they are identical, rerun cellranger count with both fastqs.

ADD REPLYlink written 4 months ago by swbarnes29.2k

Sounds good- thank you again

ADD REPLYlink written 4 months ago by scideas0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1920 users visited in the last hour