Graphing GC Bias in Sequence Capture
2
0
Entering edit mode
5.0 years ago
Jon17 ▴ 20

Anyone have recommendations for making a chart like this for sequence capture?

Picard CollectGcBiasMetrics only works for whole genome analysis.

Picard hsMetrics tends do do funny things with window sizes.

I've had some success with a bedtools & custom scripting but can't reproduce other people's material. I'm close, but I don't know if they are right or if I am right.

Also, anyone got any favorite illumina read simulators that creates reads given a bed file and produces a known gc bias? I'm trying to make control data.

enter image description here

sequencing gcbias bed picard • 1.6k views
ADD COMMENT
1
Entering edit mode
4.9 years ago
trausch ★ 1.9k

Alfred calculates the reference GC, target region GC and sample GC content. Here is an example, just select "DNA-Seq, Exome" from the reference data set collection and then jump to "GC Content". Command should be:

alfred qc -b targets.bed -r genome.fa -o out.stats.tsv.gz -j out.stats.json.gz exome.bam
ADD COMMENT
0
Entering edit mode

Thank you!!!!!!!!!!!!!!!!!

ADD REPLY
0
Entering edit mode
4.4 years ago
Chao Tang ▴ 90

May be the function letterFrequency in package Biostrings is a good choice for GC content calculating.

gs_seq <- BSgenome::getSeq(Hsapiens, gr) letterFrequency(gs_seq, "GC", as.prob=TRUE)

ADD COMMENT

Login before adding your answer.

Traffic: 2773 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6