Hello, Can somebody tell me how to uncompress 1000 Genome vcf.gz files? I am performing an RNA-editing analysis and would like to substract annotated SNPs/INDELs. I have already done so using dbSNP data with bedtools instersect, but am still stuck with the 1000 Genome Project *.vcf.gz files. I downloaded these for each chromosome and then concatenated them. These files are in a format that gunzip/gzip -d wont recognize. I tried using this file unzipped in bedtools intersect but it wasn't reconized. Many thanks,
I'm not sure if I am understanding your description correctly, but if you concatenated the .gz files before trying to unzip them, that would probably be the problem. So if this is what you did, try unzipping them first individually and then concatenating them.