I am trying to obtain the overall size of the exome, i.e. every single position that has coverage. As a control I need to find the theoretical size from the reference genome. I have a few questions:
- Is there an official size for the exome from the GRCh38 genome?
- if not, which is there efficient way to calculate it? A script to merge/coalesce all the exon from the reference genome?