Tool:Mutect: Accurate Identification Of Somatic Point Mutations
2
2
Entering edit mode
12.5 years ago
zx8754 12k

MuTect is a method developed at the Broad Institute for the reliable and accurate identification of somatic point mutations in next generation sequencing data of cancer genomes.

MuTect link

mutect • 5.1k views
ADD COMMENT
1
Entering edit mode

I would like to ask if there is any possibility to retrieve Mutect output in vcf format? I am trying to do that but to no avail. I am using Mutect for the first time for my exome data normal /tumor pair and now I have retrieved around 1800 novel variations for my pair, I want to know if any additional downstream filtering can be applied or not then I would like to apply them to get more HC ones and then annotate them. I am confident of annotating them with ANNOVAR but I feel ANNOVAR is not complete and that is the reason I want to annotate with snpEFF wihch sadly does not let you annotate in .txt format anymore. So can you please tell me how to get the output in VCF format from mutect?

Thanks

ADD REPLY
2
Entering edit mode
10.2 years ago
ivivek_ngs ★ 5.2k

@zx8754

Yes please find the below code, This should work. Let me know if you face any problems, It works for the mutect file which are filtered first for just KEEP and COVERED so that you get a out.txt of mutect with only high confidence variants and then convert it to the vcf format.

#!/usr/bin/python

import sys
text=open(sys.argv[1]).readlines()
text=filter(lambda x:x.split('\t')[31].strip()=='KEEP',text)
text=map(lambda x:x.split('\t')[0]+'\t'+x.split('\t')[1]+'\t.\t'+x.split('\t')[2]+'\t'+x.split('\t')[3]+'\t.\tPASS\t.\n',text)
file=open(sys.argv[1].replace('.txt','.vcf'),'w')

file.write('##fileformat=VCFv4.0\n')
file.write('##source=dbSNP\n')
file.write('##dbSNP_BUILD_ID=137\n')
file.write('##reference=hg19\n')
file.write('#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n')

for i in text:
        file.write(i)
file.close()
ADD COMMENT
0
Entering edit mode
10.2 years ago
mmterpstra • 0

You might also look at: https://github.com/broadinstitute/mutect

Last I checked (~6months ago) the github page hosted the most recent version of mutect. Also with adding the vcf format output

ADD COMMENT
0
Entering edit mode

Thanks for the input. I managed to write a script to convert my text to vcf format.

ADD REPLY
0
Entering edit mode

Do you mind sharing?

ADD REPLY

Login before adding your answer.

Traffic: 1697 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6