I've been exploring gffutils lately. Everything works well with small GFF files (less than a MB in size). However, I'm now trying to build a database from the Mus_musculus.GRCm38.81 GFF/GTF files, and it takes a very long time. The GTF file is in the GB range. I left it running, and it was still going after a day (about 25 hours), so I killed it with Ctrl+C. Right now I'm trying to build the database from the GFF file instead, and it has been running for a few hours so far. The GFF is around 300 MB in size.
Here is the command I'm running:
db = gffutils.create_db(myGFF, dbfn='Mus_musculus_GFF.db', force=True, keep_order=True, merge_strategy='merge', sort_attribute_values=True)
Am I doing something wrong, or can it really take days to create a database file?
By the way, the PC specs are: i7-4600U CPU @ 2.10GHz with 16 GB of RAM.
Also, can I multithread this, and would it help?
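In case it helps anyone reproduce this, here is a minimal sketch of how the same create_db call could be timed on just the first N feature lines of the file, to see how the runtime grows with input size. The helper names write_subset and time_create_db and the subset file names are made up for illustration; only gffutils.create_db and its arguments come from the command above.

```python
import time


def write_subset(src_path, dst_path, n_features):
    """Copy comment lines seen before the cutoff plus the first n_features feature lines.

    Returns the number of feature lines actually written.
    """
    kept = 0
    with open(src_path) as src, open(dst_path, "w") as dst:
        for line in src:
            if line.startswith("#"):
                # Header/directive lines such as "##gff-version 3"
                dst.write(line)
                continue
            if not line.strip():
                # Skip blank lines
                continue
            if kept >= n_features:
                break
            dst.write(line)
            kept += 1
    return kept


def time_create_db(gff_path, db_path):
    """Run the same create_db call as in the post and return elapsed seconds."""
    import gffutils  # imported here so write_subset works without gffutils installed

    start = time.perf_counter()
    gffutils.create_db(gff_path, dbfn=db_path, force=True, keep_order=True,
                       merge_strategy='merge', sort_attribute_values=True)
    return time.perf_counter() - start


# Hypothetical usage (myGFF is the path from the command above):
# for n in (1000, 10000, 100000):
#     write_subset(myGFF, 'subset_%d.gff' % n, n)
#     print(n, time_create_db('subset_%d.gff' % n, 'subset_%d.db' % n))
```

If the time grows much faster than the number of lines, that would suggest the slowdown comes from how features are being merged or looked up rather than from raw file size. Note that cutting the file at an arbitrary line can separate child features from their parents, so this is only a rough scaling check.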