In VCF files, what character encodings are used in practice?
What character encodings are expected for the format?
The examples I've seen seem to include only single-byte characters. It's not clear whether the intended encoding is 7-bit US-ASCII or 8-bit Latin-1 (ISO-8859-1).
The 4.2 format specification doesn't mention encodings for VCF files. (It does specify that Unicode characters are not supported in characters and strings in BCF files.)
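One way to see what encodings show up in practice is to inspect the raw bytes of the files at hand. The following is a minimal sketch, not anything taken from the VCF specification: the function name `classify_bytes` and the sample header lines are made up for illustration. It only tells pure 7-bit ASCII apart from valid UTF-8 and from other 8-bit data. It cannot prove that a file is Latin-1, since any byte sequence decodes as Latin-1.

```python
def classify_bytes(data: bytes) -> str:
    """Return 'ascii', 'utf-8', or 'non-utf-8' for a chunk of raw bytes."""
    try:
        data.decode("ascii")   # succeeds only if every byte is < 0x80
        return "ascii"
    except UnicodeDecodeError:
        pass
    try:
        data.decode("utf-8")   # succeeds only for well-formed UTF-8
        return "utf-8"
    except UnicodeDecodeError:
        # Some other 8-bit encoding (for example Latin-1) or binary data
        return "non-utf-8"


# Hypothetical header lines, encoded three different ways
plain = b"##fileformat=VCFv4.2\n"
utf8 = "##source=Pierr\u00e9\n".encode("utf-8")
latin1 = "##source=Pierr\u00e9\n".encode("latin-1")

print(classify_bytes(plain))
print(classify_bytes(utf8))
print(classify_bytes(latin1))
```

For a real file, the bytes would come from something like `open(path, "rb").read()`, or `gzip.open(path, "rb").read()` for a gzip-compressed VCF, where `path` is a placeholder for your own file.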
7 minutes with a direct quote. Is Google spelled P-I-E-R-R-E in France?
@Ram: no surprise: I've been recently involved in some conversations about encoding things (xml...) in VCF https://github.com/samtools/hts-specs/issues/75
Nice discussion. I'm pro-JSON though, lightweight and easier to parse, no?