Question: Varscan, Using The Copycaller
1
gravatar for Mdeng
7.8 years ago by
Mdeng520
Germany
Mdeng520 wrote:

Hi Folks,

After reading this question, I downloaded the VarScan program, but have some questions about how to use it.

I have 10 Samples, 2 from each patient, one tumor, one normal The data is from a Solid 4 sequencer, and is single end reads, 50BP in length.

So, it seems to be possible to get CNVs using the varscan copyCaller. But how to use it? "java -jar VarScan.v2.2.5.jar copyCaller -h" prints:

USAGE: VarScan copyCaller [output.copynumber] OPTIONS
    OPTIONS:
    --regions-file    A list of regions (e.g. exons) to use for segmentation
    --output-file    Output file to contain the calls
    --min-coverage    Minimum read depth at a position to make a call [10]
    --min-region-size    Minimum size (in bases) for a region to be counted [10]

I am really not sure about the syntax and the input.

  • What is the input? Bam files? pileup files? Another format? If so, how does it look?
  • How do i use it? e.g. varscan copycaller normal.bam tumor.bam output.bam? (or same with pileup?)
  • Or is there any other "workflow", maybe pre and post analysis/steps?

With best,

Mario

varscan cnv • 5.3k views
ADD COMMENTlink modified 5.7 years ago by Biostar ♦♦ 20 • written 7.8 years ago by Mdeng520
1

Not a direct answer to your question, but I wrote this little python script: http://pypi.python.org/pypi/ngCGH for performing this analysis starting from BAM files. There is a script to use R and the DNAcopy package to segment the results.

ADD REPLYlink written 7.8 years ago by Sean Davis25k
3
gravatar for Dan Koboldt
7.8 years ago by
Dan Koboldt60
Dan Koboldt60 wrote:

Mario,

You should actually use the "VarScan copynumber" command, not "copyCaller", as the latter expects an old format. The input should be 1.) pileup or mpileup for normal sample 2.) pileup or mpileup for tumor sample

The command usage is like this: java -jar VarScan.jar copynumber [normal.pileup] [tumor.pileup] output-basename

The output of the above command will have the format: chromosome chr_start chr_stop normal_depth tumor_depth log2_ratio

You can feed this into a copy number segmentation program; I recommend the DNAcopy library of the BioConductor project.

Please e-mail me or post to the VarScan sourceforge forum if you have further questions.

Sincerely,

Dan Koboldt dkoboldt [at] genome [dot] wustl [dot] edu

ADD COMMENTlink written 7.8 years ago by Dan Koboldt60

Great,

I am new at using varscan and did not know about the copynumber command. On your website its just listet in the java doc, isn't it?

Thanks a lot,

Mario

ADD REPLYlink written 7.8 years ago by Mdeng520

Hey,

I am perfectly fine now. Everything worked. Now I am looking for a way to visualise this data. I think the R plots from the DNAcopy package don't look good with massive data. Is there maybe a way to load this data to IGV or use another plot?

With best,

Mario

ADD REPLYlink written 7.8 years ago by Mdeng520

Hi Mdeng,

Can you please share how did you end up visualizing your results from massive data.

cheers

ADD REPLYlink written 3.8 years ago by Chirag Nepal2.2k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 789 users visited in the last hour