Question: ICGC Variant Calling Pipelines
2
gravatar for morovatunc
3.4 years ago by
morovatunc400
Turkey
morovatunc400 wrote:

Hi,

I would like to ask if any of you know the callers of the pipelines that are stated on ICGC such as broad, embl and sanger? I checked the metadata however the website which directed me required id/password accession. These are pipelines that were used for calling mutations which produced VCF files. 

Thank you

Best,

Tunc

mutation calling icgc • 2.1k views
ADD COMMENTlink modified 3.4 years ago by ivivek_ngs4.8k • written 3.4 years ago by morovatunc400
6
gravatar for ivivek_ngs
3.4 years ago by
ivivek_ngs4.8k
Seattle,WA, USA
ivivek_ngs4.8k wrote:

Can you put the link you have been looking into? Usually the broad mutation caller is GATK toolkit and Mutect. Often VarScan2 is also used widely. But am assuming that this one you are looking into are the ICGC -TCGA project which is run by multiple sites in the world. I not sure if the publication is out yet. So it shows only the status of the calls made at all different contributing sites that are a part of this challenge so sequence WGS for large PAN-Cancer panel.

You can take a look at this link which has a somatic workflow for SNVs . This is of Sanger

I am not sure for the embl but Delly is one which is widely used but that is for SVs and not SNVs. I guess you have to do a bit more digging.

As I said you can find the below links for the different workflows that are being used by 3 major workflow managers , and I believe this is being circulated across different centers where samples are being analyzed.

I think this link has most of the necessary information you are looking for.

ADD COMMENTlink modified 10 months ago by RamRS22k • written 3.4 years ago by ivivek_ngs4.8k

I am trying to get information about all the prostate cancer patients mutation calls. Therefore, I dont aim a specific file. Filter of the VCF have 3 options which you also stated; EMBL/DKFZ, Broad and Sanger. Thank you very much tho.

Also one more question, this kind of irrelevant to this topic but, When I download VCF which belongs to Sanger workflow. I couldn't open them with text edit. Shouldn't vcf be in  text file format?

Thank you very much for your contribution. I almost got my hope about this fact.

Tunc

ADD REPLYlink written 3.4 years ago by morovatunc400

First of all I would be a bit greedy , if you find my answer helpful please accept it as answer and upvote the same and also if any comments and reply are helpful then you can upvote them so that anyone else having similar questions can refer to it as well.

Coming to your second question you can always use some packages dedicated to VCF in R or bedtools to see the VCF format file. I am supposing the files you download are compressed so you cannot see them in text edit. You can download them and open in terminal with nano or vim or if you want to do it locally you can open them in excel. Or you can just awk the vcf files to get the columns you are interested in for your downstream use.

VCFTools

BEDTools

I believe this would suffice.

ADD REPLYlink modified 10 months ago by RamRS22k • written 3.4 years ago by ivivek_ngs4.8k

I am trying to get information about all the prostate cancer patients mutation calls. Therefore, I dont aim a specific file. Filter of the VCF have 3 options which you also stated; EMBL/DKFZ, Broad and Sanger. Thank you very much tho.

Also one more question, this kind of irrelevant to this topic but, When I download VCF which belongs to Sanger workflow. I couldn't open them with text edit. Shouldn't vcf be in text file format?

Thank you very much for your contribution. I almost got my hope about this fact.

Tunc

ADD REPLYlink modified 10 months ago by RamRS22k • written 3.4 years ago by morovatunc400
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1471 users visited in the last hour