Question: Finding distribution of indels from vcf file
gravatar for ApoorvaB
2.9 years ago by
United States
ApoorvaB230 wrote:

I have a vcf file from a single cell experiment. I got the basic statistics using vcf-stat utility in vcf-tools. I am interested in the distribution of indels. How can i get a barplot of genome-wide indel distribution (number of indels per chromosome) as well as a plot of number of insertions/deletions versus size of insertion/deletion? Is there a tool that I can use ?

barplots samtools vcf • 1.4k views
ADD COMMENTlink modified 2.9 years ago by Brice Sarver2.8k • written 2.9 years ago by ApoorvaB230
gravatar for Brice Sarver
2.9 years ago by
Brice Sarver2.8k
United States
Brice Sarver2.8k wrote:

The VCF will have all of this information. One way to summarize all of your data would be to use grep to thin your VCF down to just indels, then read the (tab delimited) result into R and plot whatever you want. Note that, per record, you'll have a chromosome, start position, and stop position, which should give you everything you'll need. Other tools might be able to handle this, but it's just basic text parsing to get what you want.

ADD COMMENTlink written 2.9 years ago by Brice Sarver2.8k

That helped. thanks !

ADD REPLYlink written 2.9 years ago by ApoorvaB230
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1519 users visited in the last hour