Question: Tumor purity estimation by allele frequency of COSMIC identified somatic mutations
1
gravatar for ejoffe
12 days ago by
ejoffe10
ejoffe10 wrote:

Hi,

This is very possibly a layman question …..

I have a MAF file with sequencing data for lymphoma specimens. I have no data regarding the tumor purity of the samples. There are no matched normal samples. Germline mutations have been filtered out based on mapping to the 1000 genome. Each mutation is mapped to COSMIC and tagged as pathogenic, likely pathogenic or unknown.

I have read about the various tools for estimating sample purity from the sequencing data (e.g., CNVkit, THetA2, FACETS etc.).

However, I was wondering if there is an approach that uses the fact that the COSMIC mapped somatic mutations are supposed to be unique to the tumor cells in order to estimate the tumor purity and to normalize the values of the allele frequencies.

Thanks, E

sequencing next-gen R • 179 views
ADD COMMENTlink modified 12 days ago • written 12 days ago by ejoffe10
1

Germline mutations have been filtered out based on mapping to the 1000 genome

This only excludes COMMON variants that are present in 1KG. Still, having a matched normal, you would identify thousands of germline mutations that are not covered by 1KG in a WGS sample. Without matched normal, there is no way to discriminate somatic from germline variants.

ADD REPLYlink modified 12 days ago • written 12 days ago by ATPoint1.1k

Correct. See the recent ISOWN paper, where they tried really hard to distinguish germline from somatic in tumor-only samples and still, lots of germline events slipped through.

ADD REPLYlink written 12 days ago by Chris Miller18k

Partly because they don't adjust for purity and copy number, as they state in the Discussion. With normal contamination, there are ways to discriminate somatic from germline. At least there are ways to calculate those probabilities accurately.

ADD REPLYlink written 12 days ago by markus.riester210
1

COSMIC mapped somatic mutations are supposed to be unique to the tumor cells

There are actually many germline variants in COSMIC, since a lot of them have never been validated. COSMIC mutations have a "confirmed somatic" field to distinguish truly somatic from questionable.

ADD REPLYlink modified 11 days ago • written 11 days ago by igor4.5k
0
gravatar for Chris Miller
12 days ago by
Chris Miller18k
Washington University in St. Louis, MO
Chris Miller18k wrote:

And how will you know if those mutations are in the founding clone or a subclonal population? Or if they are copy-number altered, skewing their VAFs up or down, depending on which allele is lost? Or perhaps both CN-altered and Subclonal?

Purity, ploidy, and copy number inference are all inextricably linked in tumor samples, which is why those more complex methods exist.

ADD COMMENTlink written 12 days ago by Chris Miller18k
0
gravatar for ejoffe
12 days ago by
ejoffe10
ejoffe10 wrote:

Thank you all for your answers !!!

ADD COMMENTlink written 12 days ago by ejoffe10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1394 users visited in the last hour