TCGA Barcodes Missing from Bulk Data
2
1
Entering edit mode
7.2 years ago
dirigible2012 ▴ 320

I have downloaded the Level_3 data for CNV (SNP Array) for Breast Cancer from the Bulk Download section of TCGA.

The file names look like this:

MOHEL_p_TCGA_271_274_275_N_GenomeWideSNP_6_A01_1320320.hg18.seg.txt

The data in the "Sample" column within the file is the same.

I was expecting a barcode like "TCGA-A2-A0D0-01".

How do I match the data to the patient?

Thanks for helping,

Stephanie

TCGA barcodes • 2.7k views
ADD COMMENT
4
Entering edit mode
7.2 years ago
komal.rathi ★ 3.9k

I think you should download the data using the Data Matrix. When you download the data, you automatically download files like FILE_SAMPLE_MAP.txt & file_manifest.txt. You will get the barcode along with the filename in these files. 

ADD COMMENT
0
Entering edit mode

Found it! Thanks.

ADD REPLY
0
Entering edit mode

dirigible2012 if my answer helped you, then please accept it by clicking on the tick mark

ADD REPLY
0
Entering edit mode
7.2 years ago
dirigible2012 ▴ 320

Found the answer -

The Bulk Download appears to be mainly for downloading historic versions of the archives, and doesn't seem to provide file to barcode mapping.

The File Search tool is the answer for downloading the LATEST version of the archive. Each download will contain a "file_manifest.txt" file, which contains the filenames and barcodes for all files in the archive.  

ADD COMMENT

Login before adding your answer.

Traffic: 1945 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6