I am trying to apply the GATK best practices for calling variants but from mice WGS data. According to GATK, one of the reasons for using an interval list in processing WGS data is "to exclude regions that have bad or uninformative data where a tool is getting stuck”. GATK bundle includes such an interval list fo the human genome however I have not been able to find any interval list of “good” genomic regions for the mouse genome (GRCm38).
Also, since GATK does not provide information on the provenance of this file it is hard to replicate it for the mouse genome. I am looking for an interval list for GRCm38 or information on the source of the human interval file. Any help is highly appreciated.
Were you able to find this interval list?
ADD COMMENTand not the answer field for comments.