How is it possible to detect if samples from different humans are mixed while demiltiplexing? we have 4 samples per lane and 8 lanes in total. After demultiplexing, it turned out that some samples have double the size of other samples. The average size per sample is 10GB, but for our last run, what we got is samples with the following sizes:
- 10GB (which is normal)
- 5 GB
- 15 GB
Which appears as if some reads from certain samples when demultiplexing where linked to the wrong sample.
I already have the fastq files, BAM files and VCF files.
How can I verify computationally that read mixing happened?
Edit: each sample is sequenced twice on two different lanes. So there is a sample-collection-across-different-lanes step after demultiplexing.