I want to parse and index genomes in embl format compressed with gz/xz which I downloaded from EBI. The problem is that I can only work with the uncompressed files to list some features. But the uncompressed files are too big.
As far as I understood, if I compress them using bgzip from tabix, I can use them in Biopython  Can I directly index the xz file compressed with LZMA2 which gives much smaller file? It should be possible in principle  I am wondering if anyone has done it.