Question: how can I understand the barcode in TCGA
gravatar for Learner
7 days ago by
Learner 150
Learner 150 wrote:

I am a bit confused about finding control and tumor samples in TCGA.

I have read this and many other posts but does not help much to know.

Lets have an example


TCGA means by itself 41 is TSS? if so, then what should I understand from it? 4097 is the participant number , right? 01 is sample ? or portion? A is vial or analyte?

I am trying to find Normal and Control in data

genomics • 86 views
ADD COMMENTlink modified 7 days ago by Kevin Blighe39k • written 7 days ago by Learner 150
gravatar for Kevin Blighe
7 days ago by
Kevin Blighe39k
Republic of Ireland
Kevin Blighe39k wrote:

TSS code of 41 indicates that it derives from Christiana Healthcare (source site) and is a glioblastoma multiforme sample.

The following 01 indicates that it is a tumor sample.

Take a look at my previous answer and my GitHub page:

The Code Tables will provide additional information.

Good luck, Kevin

ADD COMMENTlink modified 7 days ago • written 7 days ago by Kevin Blighe39k

@Kevin Blighe thanks Kevin, so 01 is the sample !!! how one can understand that they used sample and not portion ?

do you also know how to figure out the mutation and wild type of a specific gene in sample TCGA?

ADD REPLYlink written 7 days ago by Learner 150

The portion is indicated by the number after the sample number: barcode

If you obtain the Level 3 TCGA data, i.e., the Mutation Annotation Format (MAF) files, then all listed variants are somatic. There will be a column in the MAF files for 'panel of normals', I believe, which indicates that the somatic variant was additionally observed in an unrelated panel of normals (screened by Broad Institute, if my memory serves me correctly).

Only the Level 1 [protected] data has the original calls in the normal samples.

ADD REPLYlink written 7 days ago by Kevin Blighe39k

@ Kevin Blighe so we would not be able to get normal samples?

ADD REPLYlink written 7 days ago by Learner 150

Normal samples will have number 10 - 19 in the Sample field. These are available for copy number, RNA-seq, and other data-types (as open access Level 3), but not DNA-seq.

ADD REPLYlink written 7 days ago by Kevin Blighe39k

@Kevin Blighe Kevin, can you please check one of the sample for example with me? Lets look at BRCA, As an example, we pick up one of the sample TCGA-BH-A1FB-11A. So based on what you said I should see this as a normal sample, right? if I go to clinic, I can see that it is stage iib of breast cancer. Can you please comment?

ecab16bc-28d5-4c3a-8735-3c785bd047c4 TCGA-BH-A1FB TCGA-BRCA female 1940 white not hispanic or latino 2010 not reported not reported Infiltrating duct carcinoma, NOS stage iib

ADD REPLYlink written 18 hours ago by Learner 150

Hey, yes, that is a normal tissue sample from the cancer patient. The record from the clinical data, however, just relates generally to the patient, who had 'NOS stage iib'.

So, it is normal tissue from a NOS stage iib cancer patient.

ADD REPLYlink written 12 hours ago by Kevin Blighe39k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 885 users visited in the last hour