I have a list of about 40 barcodes corresponding to TCGA whole exome samples. The CG Hub browser allows one to search by sample type, platform, assembly, library, etc. but not by barcode. If I restrict by the existing filters, I have about 650 samples that I still need to go through by hand to find the 40 or so that I need. Surely there must be a way to search and download specific TCGA barcodes from a list in CG Hub. Does anybody know how to do this?
The barcodes are metadata. You can use the little box on the top-right to search for specific barcodes. Or you can build HTTP queries like this:
If you want to do this more automatically with some scripts that query each of your 40 barcodes, then I'd recommend using the
cgquery command-line tool available here
Update (Nov 2016): CGHub has shut down, and a lot of TCGA's original data portal infrastructure. All sequence data has been harmonized to
GRCh38 and is hosted out of NCI's GDC Portal. If you want the original legacy data based in
GRCh37, use this link.