The rmsk primary table has more than 5 million records for repetitive elements. Of these, 989434 entries belong to the "LINE" repeat class. When I checked the lengths of these LINEs, the majority (973906) were shorter than 5 kb. I calculated the lengths from "genoStart" and "genoEnd". Aren't these supposed to be long interspersed nuclear elements?
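For reference, the length check described above can be sketched roughly as follows. This is only a minimal Python sketch, assuming a tab-separated dump of the rmsk table with the usual UCSC column order. The helper names `line_intervals` and `summarize` are hypothetical, and the three sample rows are made up for illustration, not real mm10 records. Because UCSC tables store a 0-based start and an exclusive end, the length is taken as genoEnd - genoStart.

```python
import csv
import io

# Assumed column order of a UCSC rmsk table dump (tab-separated, no header).
RMSK_COLUMNS = [
    "bin", "swScore", "milliDiv", "milliDel", "milliIns",
    "genoName", "genoStart", "genoEnd", "genoLeft", "strand",
    "repName", "repClass", "repFamily", "repStart", "repEnd",
    "repLeft", "id",
]


def line_intervals(handle, rep_class="LINE"):
    """Yield (chrom, start, end, repName, length, strand) for one repeat class."""
    for fields in csv.reader(handle, delimiter="\t"):
        row = dict(zip(RMSK_COLUMNS, fields))
        if row["repClass"] != rep_class:
            continue
        start, end = int(row["genoStart"]), int(row["genoEnd"])
        # 0-based start, exclusive end: length is a plain difference.
        yield row["genoName"], start, end, row["repName"], end - start, row["strand"]


def summarize(intervals, cutoff=5000):
    """Return (total entries, entries shorter than cutoff bp)."""
    total = short = 0
    for interval in intervals:
        total += 1
        if interval[4] < cutoff:
            short += 1
    return total, short


# Made-up rows for illustration only (not real mm10 annotations).
_rows = [
    ["585", "1000", "100", "0", "0", "chr1", "3000000", "3000300",
     "-192471671", "+", "L1_Mus3", "LINE", "L1", "1", "300", "0", "1"],
    ["585", "9000", "50", "0", "0", "chr1", "3010000", "3016000",
     "-192455971", "-", "L1Md_T", "LINE", "L1", "0", "6000", "1", "2"],
    ["585", "700", "120", "0", "0", "chr1", "3020000", "3020150",
     "-192451821", "+", "B1_Mus1", "SINE", "Alu", "1", "150", "0", "3"],
]
SAMPLE = "\n".join("\t".join(r) for r in _rows) + "\n"

intervals = list(line_intervals(io.StringIO(SAMPLE)))
total, short = summarize(intervals)
print(total, short)
```

With a real dump, the `io.StringIO(SAMPLE)` handle would be replaced by an opened file, and the yielded tuples can be written out as BED-like coordinate lines.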
How many LINEs are present in the mouse genome? What is their length distribution? How can I get the genome coordinates (mm10) of LINEs?
Please help. Thanks.