I am working on a simple algorithm that lets an aligner handle structural variations (SVs). As a first step, I need to know the distribution of SV lengths. For this I need a database (or a reliable file)* with structural variation information from which I can calculate the distribution. Does anyone know of a database that provides this, or of another way to approach the task?
Ideally I would like the estimated distribution itself, but even a few summary statistics would help, for example "90% of SVs are less than 200 bases long" or a similar figure.
A length distribution for human genome SVs would be enough, but broader information would be even better.

*Update: thanks, everyone.
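
For reference, here is a minimal sketch of the kind of calculation I have in mind once I have a file. It assumes (my assumption, not something I have confirmed for any particular database) that the SVs come as VCF-style tab-separated lines whose INFO column carries either `SVLEN` or `END`. The function names `sv_lengths`, `fraction_shorter_than`, and `percentile` are just illustrative helpers, not part of any library.

```python
import math
from typing import Iterable, List


def sv_lengths(vcf_lines: Iterable[str]) -> List[int]:
    """Collect absolute SV lengths from VCF-style lines.

    Assumption: INFO (column 8) holds SVLEN=<int> or END=<int>.
    Records with neither field, or with a non-numeric value, are skipped.
    """
    lengths = []
    for line in vcf_lines:
        # Skip header lines and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 8:
            continue
        pos = int(fields[1])
        # Turn "A=1;B=2;FLAG" into {"A": "1", "B": "2", "FLAG": ""}.
        info = dict(
            kv.split("=", 1) if "=" in kv else (kv, "")
            for kv in fields[7].split(";")
        )
        try:
            if "SVLEN" in info:
                # SVLEN is often negative for deletions and may be a
                # comma-separated list; use the first value's magnitude.
                lengths.append(abs(int(info["SVLEN"].split(",")[0])))
            elif "END" in info:
                # Fall back to the span between POS and END.
                lengths.append(abs(int(info["END"]) - pos))
        except ValueError:
            continue
    return lengths


def fraction_shorter_than(lengths: List[int], threshold: int) -> float:
    """Fraction of SVs strictly shorter than `threshold` bases."""
    if not lengths:
        raise ValueError("no SV lengths given")
    return sum(1 for n in lengths if n < threshold) / len(lengths)


def percentile(lengths: List[int], q: float) -> int:
    """Nearest-rank percentile (0 < q <= 100) of the SV lengths."""
    if not lengths:
        raise ValueError("no SV lengths given")
    ordered = sorted(lengths)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]
```

With something like this, a statement such as "90% of SVs are less than 200 bases long" could be checked by calling `fraction_shorter_than(lengths, 200)`, or read off directly with `percentile(lengths, 90)`, where `lengths` comes from passing an open file handle to `sv_lengths`.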