annotating MAF files
6.8 years ago
newDNASeqer ▴ 710

I've read quite a few posts on this forum on MAF files:

Working With Maf Files (Mutation Annotation Format) From The Tcga (The Cancer Genome Atlas) and Converting Vcf File To Maf

But one thing I do not understand is why people would want to convert VCF to MAF format. My understanding of VCF files is that they can be annotated by tools such as Annovar to identify which amino acid residues are mutated. That information seems more convenient than the chromosomal coordinates in MAF files.

I've also noticed that almost all TCGA data are in MAF format. If I want to study a cancer type of interest, what's the appropriate way to annotate MAF files from TCGA?


MAF annotation • 3.6k views

You've answered the first part of your question. The only reason for MAF is "that's what TCGA uses".

6.7 years ago

MAFs exist only to provide a human-readable list of mutations that folks can load into a spreadsheet for manual review. VCFs are preferred in bioinformatics pipelines because they can hold a superset of the information that a MAF can contain, and they discourage the use of spreadsheets! TCGA generates VCFs for some cancer types, but you need TCGA credentials to access those, because they contain some germline calls. Given a TCGA MAF, the maf2vcf script will convert it into a generic VCF format, which you can then annotate with Ensembl's VEP, snpEff, Annovar, etc.
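To make the MAF-to-VCF idea concrete, here is a minimal Python sketch of the column mapping for single-nucleotide variants only. It is not the maf2vcf script and does not reproduce its behavior; the function name `maf_snvs_to_vcf` is hypothetical, and it assumes a tab-delimited MAF with the usual `Chromosome`, `Start_Position`, `Reference_Allele` and `Tumor_Seq_Allele2` columns. Indels are skipped, because MAF writes them with `-` while VCF needs an anchoring reference base that can only come from a reference FASTA.

```python
import csv


def maf_snvs_to_vcf(maf_text):
    """Convert the SNV rows of a tab-delimited MAF into minimal VCF text.

    Illustrative sketch only: sample/genotype columns and INFO fields are
    left empty, and indel rows are dropped rather than converted.
    """
    # Drop blank lines and comment lines such as "#version 2.4", so that
    # the first remaining line is the MAF column header.
    rows = [ln for ln in maf_text.splitlines() if ln and not ln.startswith("#")]
    reader = csv.DictReader(rows, delimiter="\t")

    out = [
        "##fileformat=VCFv4.2",
        "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    ]
    for rec in reader:
        ref = rec["Reference_Allele"]
        alt = rec["Tumor_Seq_Allele2"]
        # Keep only simple one-base substitutions; "-" marks an indel in MAF.
        if len(ref) != 1 or len(alt) != 1 or "-" in (ref, alt):
            continue
        out.append("\t".join(
            [rec["Chromosome"], rec["Start_Position"], ".", ref, alt, ".", ".", "."]
        ))
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    # Made-up rows for illustration, not taken from any TCGA file.
    example = (
        "#version 2.4\n"
        "Hugo_Symbol\tChromosome\tStart_Position\tEnd_Position\t"
        "Reference_Allele\tTumor_Seq_Allele1\tTumor_Seq_Allele2\n"
        "GENE_A\t17\t1000\t1000\tC\tC\tT\n"
        "GENE_B\t12\t2000\t2001\t-\t-\tA\n"
    )
    print(maf_snvs_to_vcf(example), end="")
```

For real TCGA data, the maf2vcf script mentioned above is the safer route, since it handles indels against the reference genome; the resulting VCF can then go to VEP, snpEff or Annovar.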

