Cnv Visualisation Tools
3
7
Entering edit mode
10.6 years ago
Vikas Bansal ★ 2.4k

Dear all,

I am working on CNV detection (ReadDepth approach) and got my final results in this format ->

CHROM    START    END          GC%       READ_DEPTH

chr1      0        1000    0.551000    1345.162598
chr2      0        1000    0.478000    697.162659
chr2      1000    2000    0.431000    685.829346

and I did analysis with some different reference genome (fasta format, only chromosome names and lengths are different) not with hg18 or hg19.

I am looking for a visualization tool like this.

I do not want so complicated, just the yellow lines ( chromosome positions on x axis and read depth on y axis) So that I can easily see where readdepth is low (deletion) and where it is high (duplication).

cnv visualization • 12k views
7
Entering edit mode
10.6 years ago

Hi Vikas,

I just uploaded a tool for plotting segments to the readDepth Google Code repo:

It should take either the segs.dat or alts.dat output from readDepth and make pretty pictures with it.

You can do whole genome views:

Or zoom into specific chromosomes:

It's also got lots of other features for labeling or highlighting the plots.

I've been meaning to post that for a while now - let me know if you have questions or run into any problems.

1
Entering edit mode

Look at the "low-res" parameter, which gives options for visually expanding small regions so that they appear larger. Also consider doing one chromosome at a time using the chr parameter.

0
Entering edit mode

Hi Chris. Thanks a lot for your reply. As you said -> It should take either the segs.dat or alts.dat output from readDepth, But I used some other approach to get my output file. Also I have file, whose format is chr, start, stop and copy number. I can add 1 more column as size of bin (stop - start). And in your *.dat files are -> The columns are chr, start, stop, number of bins, absolute copy number call, do you think it will be good for my data with this approach?

0
Entering edit mode

Also my reference genome is completely different but is in fasta format (basically only the chromosome names and number is different), so is it fine?

0
Entering edit mode

It will work fine as long as you have an entrypoints file listing all of your chromosomes (chr, length) and your data is in 5-col format like (chr, st, sp, bins, score). The script doesn't use the fourth column at all, so really, all you need is position and score.

0
Entering edit mode

hi, i want to complete the figures just like above,but my data is ont the sequence data but the microarray.what should i do ? please.

0
Entering edit mode

Hi Chris. I tried your code and on your given test-data (output- expectedAlts.dat), its working fine. I think because you have window size very large there (stop - start i:e col3 - col2 from expectedAlts.dat). But I have window of only 100 and in my graph I cant see anything. Can you please tell me, how to fix this?

2
Entering edit mode
10.6 years ago

If I may, I'll link to the tool we have developed. CNAnorm (I recommend the linked developer version 1.1.8). The input data is very similar, it does require control reads, but, if you want to use ReadDepth approach, you can set READ_DEPTH as "test" and a constant value for "control" (the average of READ_DEPTH?).

In this vignette you can see the output of a segmented genome, but it is also possible to visualize a "smoothed" signal and zoom to a single chromosome or to a fraction of it.

In the latest developer version 1.1.8 we added a "colorful" option to better visualize gain and loss, three extra normalisation methods ('median', 'mode' and 'closest' ) which is best for germline CNV.

further information and supproting files on our website or on the associated paper

0
Entering edit mode

Thanks for your reply. I am not able to install this package in R

-> R>     biocLite("CNAnorm")
Using R version 2.12.1, biocinstall version 2.7.7.
Installing Bioconductor version 2.7 packages:
[1] "CNAnorm"
In getDependencies(pkgs, dependencies, available, lib) :
package 'CNAnorm' is not available


May be because the name of the package has changed?

0
Entering edit mode

Thanks for your reply. I am not able to install this package in R -> R> biocLite("CNAnorm") Using R version 2.12.1, biocinstall version 2.7.7. Installing Bioconductor version 2.7 packages: [1] "CNAnorm" Please wait... In getDependencies(pkgs, dependencies, available, lib) : package 'CNAnorm' is not available . May be because the name of the package has changed?

0
Entering edit mode

And also, it says that it is for GC correction and normalization, but I have already done everything. I just want to see my output file in a viewer, so that I can easily see where readdepth is low (deletion) and where it is high (duplication) as compare to average.

0
Entering edit mode

HI. Yes, the main function is GC corerction and normalisation, but you can quickly go through, skip the GC correction, normalise on median and then use the plotting functions. I am not sure why you can't install it. First try the "release" version using biocLite http://www.bioconductor.org/packages/release/bioc/html/CNAnorm.html If it works, try the developer version compiling it (you will need make and a fortran compiler)

0
Entering edit mode

Thanks. I tried it but that is not what I was expecting. I am looking for some tool which makes graph or its like a viewer (like IGV) so that I can have chromosome positions on x axis and read depth on y axis and a average line. So the read depth below average line will indicate deletion and above the average line will indicate duplication.

2
Entering edit mode
10.6 years ago

You may also try generic genomic data visualization tools like ggbio (bioC/R package) or use glyphs from Bio::Graphics library for your task. See the extensive discussion on genomic visualization tools here.