'map' error sequenza from Strelka vcf.somatic.snvs in python 3.8 and tier 1 or 2?

0

Entering edit mode

4.3 years ago

heetak.lee • 0

I have tried to implement sequenza-utils on conda env (python 3.8)

Command is "sequenza-utils snp2seqz --vcf strelka_somatic_snvs.vcf -gc hg19_gc50.wig.gz --preset strelka2_som --output GA060TP95.som.seqz.gz"

1.Then, I encountered "map is not subscriptable" in sequenza/vcf.py parse_strelka2_som function (In this part, we can check .somatic.snvs.vcf from Strelka2 is used in sequenza). so, I modified map(int, ~~~) to list(map(int,~~~))

2.Different contig names in between .vcf files and wig files cause empty seqz.gz file. based on what is the difference between GRCh37 and hg19 ?, I modified vcf.py to fit contig names to hg19 (e.g. "1" to "chr1" for 1-22, X, and Y chromosomes)

Finally, fixed. I hope this reports will be helpful to estimate tumor purity from Strelka output on python3.8

p.s. when I read source code, vcf.py "sequenza-utils" uses Tier 2 allele depth from Strelka 2 pipeline. Is it reasonable?

software error • 796 views

ADD COMMENT • link 4.3 years ago by heetak.lee • 0

Login before adding your answer.