Question

How To Get The Read Depth?

7

Entering edit mode

12.0 years ago

madkitty ▴ 690

My boss said he wanted to get the read depth graph for each CNV we found. The only data I have is a file with the number of reads aligned to each position (let's say from position 1 to 500 on the genome) Let's say we have a CNV deletion identified from position 100 to 150 .. How am I suppose to output the read depth for that CNV ?

genome sequencing read depth-of-coverage • 17k views

ADD COMMENT • link updated 12.0 years ago by Chris Miller 22k • written 12.0 years ago by madkitty ▴ 690

0

Entering edit mode

Thanks a lot for your answers guys! All of them are really useful :)

ADD REPLY • link 12.0 years ago by madkitty ▴ 690

score 8 · Answer 1 · 2012-08-15

8

Entering edit mode

12.0 years ago

Chris Miller 22k

You'll probably want to plot the read depth in small bins, so that you can visualize the changes more clearly. I'd recommend grabbing the CNV region, plus some flanks with samtools, then writing a little script that parses the output and returns the average depth of coverage in 100bp bins. Plot these values for the tumor and the normal across the region, and if all goes well, you'll see something like this:

enter image description here

Normal is on top, tumor is on bottom, and you can clearly see the deletion in the tumor.

ADD COMMENT • link 12.0 years ago by Chris Miller 22k

0

Entering edit mode

that's really nice! Exactly what I need, but for the script can I just use a package that already exists in R ? Are you aware of any ?

ADD REPLY • link 12.0 years ago by madkitty ▴ 690

2

Entering edit mode

For a simple implementation in python, try ngCGH. In R, see the GenomicRanges and Shortread packages and the coverage() method.

ADD REPLY • link 12.0 years ago by Sean Davis 27k

score 4 · Answer 2 · 2012-08-15

4

Entering edit mode

12.0 years ago

Stefano Berri 4.4k

If you have a sorted and indexed bam file you could do

samtools view yourFile.bam chr1:1000-5000 | wc

this will give you the number of reads mapping over chromosome 1 from position 1000 to 5000. For comparison you should also do the same with the normal and, at least, normalise for total number of reads.

However, how did you figure out that you have a CNV in that region? What program did you use? Can't the program you used tell you?

also consider this is very crude, as the number of reads have noise.

ADD COMMENT • link 12.0 years ago by Stefano Berri 4.4k

0

Entering edit mode

Indeed, as your CNV calling algorithm probably used read-depth as part of its protocol, you should really be sure you have a good handle on what it is doing as well as what you expect to plot. That read depth is usually on a per nucleotide basis but you can also use the number of reads that overlap with your called interval as a proxy.

ADD REPLY • link 12.0 years ago by DG 7.3k

0

Entering edit mode

Thanks for your answer. I used Breakdancer to find CNVs, Breakdancer already gives me the number of reads aligned to the CNV found, but my boss doesn't believe breakdancer and wants to see a figure similar to the one posted by Chris Miller below...

ADD REPLY • link 12.0 years ago by madkitty ▴ 690

score 2 · Answer 3 · 2012-08-15

2

Entering edit mode

12.0 years ago

Nikolay Vyahhi ★ 1.3k

AFAIK read depth for CNV is coverage for this region in alignment.

E.g. if you identified that there is no gene/region (deletion), then it's depth = 0.

E.g. if you identified 2 copies of the same region, then depth will be about twice as usual depth (given uniformity of coverage).

ADD COMMENT • link 12.0 years ago by Nikolay Vyahhi ★ 1.3k

score 1 · Answer 4 · 2012-08-15

1

Entering edit mode

12.0 years ago

Sean Davis 27k

You might try:

samtools depth -r chr1:100-150 bamfile.bam

I agree with Stefano, though, that you probably need to understand what your CNV caller did before proceeding.

ADD COMMENT • link 12.0 years ago by Sean Davis 27k

score 1 · Answer 5 · 2012-08-15

1

Entering edit mode

12.0 years ago

Ian 6.0k

If you can get a BED file (chr start end) of CNV regions you can use bedtools to get coverage of reads for each CNV region.

bedtools coverage -abam reads.bam -b cnv_regions.bed

ADD COMMENT • link 12.0 years ago by Ian 6.0k