I recently started playing with Hi-C data. The goal is to use them in order to scaffold a genome assembly. So I downloaded a single pair of files (_R1 and _R2) which contained >680 million reads! Does this sound familiar to anyone who has used Hi-C before? Is it usual to get so many reads when doing Hi-C sequencing?
I'm only asking because I'm more used to getting 50-70 million reads per (Illumina) run. Whenever I got more than that, it was because something went wrong during sequencing.