Question: Multiple alignment format (MAF) to VCF
0
gravatar for ksw
3.7 years ago by
ksw40
United States
ksw40 wrote:

Hi, I am wondering the best way to convert a Multiple Alignment Format file (MAF) generated with Mugsy (http://mugsy.sourceforge.net/) into a VCF format for downstream analysis?

I have looked through past posts and haven't found an answer. It seems that there was an earlier branch of Biopython which could do this but I don't think it is being updated.

If anyone has suggestions, that would be incredible! Thank you!

ADD COMMENTlink modified 3.5 years ago by Biostar ♦♦ 20 • written 3.7 years ago by ksw40

Did the past posts include this one: converting maf to vcf There is a script linked in that post that is still available to try.

ADD REPLYlink written 3.7 years ago by genomax75k

Thanks for your reply. Yes, I looked at the script you posted--it converts files in Mutation Annotation Format (another MAF file, specific to cancer genotyping, I think) to VCF format. The MAF format I am struggling with is Multiple Alignment Format.

ADD REPLYlink written 3.7 years ago by ksw40

A different MAF. What does you file look like (can you show a few lines)? Perhaps this may help: Getting A Vcf File From A Fasta Alignment

ADD REPLYlink written 3.7 years ago by genomax75k

Thanks, I'll look into the script you posted. The first lines of the MAF:

##maf version=1 scoring=mugsy a score=640492 label=1 mult=3 s B31.Chromosome 4603 638466 + 910724 AAAATCTCAGGAGAGGGATCAAATGGGGGATGCATAATTCATCCTTCAAGGGTAAGAGACCCAATTACTACTTTACTTAGTATCGTAAAATTATTAAAAATGAAAGAACTTTACCAAATATGGTGTAAATTGTCTAAAAACCACTATAAAGAAAAGTACGACTTAAAAGATATTTTAAATACAACAAATTTTTATAGTAATGTAATAGTATCATCCAAAAAAGCCAATTTAACAAATCTAAAAATAGAAAATCAAGAAATTTTAAAAAGCAATTATGAAAATCTATTGATTAAAGAGATAAAAAGTAATAAGTTATTTCAAGAATTATCAGTAGTTGATTATGAAATCATTAACTATGAAGGCAAAAGACAATCTAAGATTAGAACAGGAGATTCTTCGGGAGGATTAAAGGTATTGT

ADD REPLYlink modified 3.7 years ago • written 3.7 years ago by ksw40
1

To update this thread: my issue was that I have called variants using short read data (i.e. generating BAM, then VCF files with respect to a reference genome). I am now interested in including outgroup (FASTA) sequences and need to call variants between the outgroups and the FASTA reference genome. The following tool takes a multiple FASTA alignment (for me, my reference genome and outgroup genomes) and can generate a VCF file:

[https://github.com/sanger-pathogens/snp-sites]

Thanks for your help!

ADD REPLYlink written 3.7 years ago by ksw40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1245 users visited in the last hour