Ideogram Cnv With Bioconductor Does That Look Good ?
2
1
Entering edit mode
10.8 years ago

I have generated an ideogram of Chromosome 1 of our first sample, the annotation are CNVs identified found in this sample with Breakdancer. Here is the output figure :

http://tinypic.com/r/iwjdar/6

http://i46.tinypic.com/iwjdar.jpg

Does that look good to you ? It looks a bit weird to me, I think I need to group results with Gviz, they do it in the manual with denseAnnTrack :

 plotTracks(denseAnnTrack, showOverplotting = TRUE)


I thought of the following :

• removing overlap when bigger than 1bp
• draw a heat map instead of plotTrack
• group results like in the manual
• maybe data are wrong from the beginning?
• select CNV from Breakdancer that are supported with number of reads of 10 and higher

What do you think ?

r bioconductor cnv • 2.9k views
1
Entering edit mode
10.8 years ago

Nice you got the plot running! Of course it looks awesome graphically (there is some room for fine-tuning the colors and collapse levels though), but that's possibly not what meant.

The data displayed indeed look disturbing to me, and that would make me double or triple check each step. I was expecting short variants that do not overlap (too much). But your variant calls are all over the place and seem to cover large parts of the cromosome, and also they overlap. How can that possibly be true?

So I would check:

1. Are your data really depicted correctly, or is there an error in the plotting? Try to make a preliminary plot with another tool.
2. Compare with the variant call files, do they really contain that many huge variants?
3. Check if your variant call pipeline is sane.

Hope this helps

0
Entering edit mode

indeed I checked the file and data have large variants called. I think Breakdancer didn't do the job properly so I will check and see if a second run of Breakdancer gives the same output. The R script works well thou :)

0
Entering edit mode

Yes, this looks like a large over-prediction, maybe the default parameters are not very strict such that wrong read pairs or misalignments cause this. I would set a minimum number of read pairs, and also QC the reads, removing all incorrectly paired reads, and low quality alignments before feeding the SAM files into Breakdancer. Also I would remove all pairs that are much more distant from each other than expected. I don't know, is there is a maximum possible insert size for the pairs? If so, that should be the cutoff for a CNV length, shouldn't it?