Do anyone know any tool or script to eddit the header and ad a "chr" prefix to the chromosome name header directly in a .vcf.gz file?? ... i don't want even to uncompress it, since i have the experience that a huge .vcf can't be compressed by bgzip, at least in my computer.
I am working with some genomic files in the .bam and .vcf format, i tried to retrieve some genic regions, and i already sorted that out, beggining with a large one human chromosome .bam file.
When working with that whole chromosome file, i realized that the header had the name of the chromosome without the preffix "chr", only the number, and that gave me a hard time when trying to run mpileup just in the middle of the workflow.
Now, following my working path, i got a big one chromosome .vcf.gz file, which i indexed with tabix in order to retrieve the desired region with ease, but i get the same problem as before, the name of the chromosome is lacking the "chr" prefix, which happens to be not compatible with the .fasta reference file it needs to run the command, just now beggining with a .vcf.gz file.
I thought about going back from .vcf.gz to bam, on which i already know the syntax to eddit the headers, but that means doing around four file transformations. That will spend lots of time.
Thanks in advance for any orientation.