I have some RNA sequencing reads to align to the human reference genome. I found the genome FASTA files on both GENCODE and Ensembl:
But after unzipping them, I found that they are 3.1G and 60G respectively. Why is there such a large difference, and which one should I use? (The purpose of the project is to detect gene fusions from the sequencing reads.)
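
In case it helps with diagnosing the difference, here is a minimal sketch for listing what each FASTA actually contains (sequence names and their lengths). The names `fasta_summary` and `open_text`, and the file name in the usage comment, are hypothetical placeholders of mine, not the actual files or any library API.

```python
import gzip
from typing import Iterable, List, Tuple


def fasta_summary(lines: Iterable[str]) -> List[Tuple[str, int]]:
    """Return (sequence name, length in bases) for each FASTA record."""
    records = []
    name = None
    length = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                records.append((name, length))
            # Keep only the first word of the header as the sequence name.
            parts = line[1:].split()
            name = parts[0] if parts else ""
            length = 0
        elif name is not None:
            length += len(line)
    if name is not None:
        records.append((name, length))
    return records


def open_text(path: str):
    """Open a plain or gzip-compressed text file for reading."""
    if path.endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


# Usage (hypothetical file name):
# with open_text("genome.fa") as handle:
#     for seq_name, seq_len in fasta_summary(handle):
#         print(seq_name, seq_len)
```

Comparing the number of records and the total number of bases between the two files should show whether the larger one simply contains more sequences than the smaller one.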