I'm currently involved in an annotation project focused on identifying as many repeats as possible in a recently assembled (1st draft) genome. Without a library of repeats/TEs to start from, I'm using a number of denovo detection pipelines (RepeatModeler, REPET, etc) to create a library of family consensus sequences.
I'd also like to take advantage of tools that search for individual TE copies based on structure, rather than by all-by-all alignment. For example, LTRharvest or LTR_STRUC (there are many more). A lot of these tools produce results detailing the individual copies of repeats in the genome, rather than families or consensus sequences.
So: What is the best way to get families/consensus sequences from these individual copies? What tools could I use to cluster the sequences and extract common groups?