Question: (Closed) TCGA biolinks GDCdownload file sizes mismatch
gravatar for Nopstoc
3.4 years ago by
INBIOMED, Buenos Aires, Argentina
Nopstoc0 wrote:


I'm new to bioinformatics, and wanted to get some array data from TCGA, using TCGA biolinks.

I ran a query for 12 files, 2 groups of 6 samples with the same type of cancer, and when I ran GDC download I get this confusing information:

GDCdownload(query) GDCdownload will download 12 files. A total of 119.416574 MB Downloading as: Fri_Nov_25_09_39_55_2016.tar.gz Downloading: 31 MB [1] 1

the download stops at 31MB always. different chunks get me the same final size of 31 MB (for example in chunks of 2 files, I get the message that each chunk is 20 mb but they get completed at 5.1MB. setting chunks of 1 file each makes it so that I get only one downloaded and then an error message.

I tried working with the data as it downloaded but keep getting errors, I'm guessing due to missing information

Here's the query and graph for reference:

query2 <- GDCquery(project = "TCGA-BRCA",
                  data.category = "DNA Methylation", 
                  platform = "Illumina Human Methylation 27", barcode = casos) # casos is a character vector with the barcodes of the cases i wanted

and the error message

Group1:solid tissue normal Group2:primary solid tumor Error in TCGAanalyze_DMR(resu1, groupCol = "definition", group1 = "solid tissue normal", : Sorry, but solid tissue normal has no samples In addition: Warning message: In any(rowSums(! : coercing argument of type 'double' to logical

if I run


I get

[1] "Primary solid Tumor" "Primary solid Tumor" "Primary solid Tumor" "Primary solid Tumor" [5] "Primary solid Tumor" "Primary solid Tumor" "Solid Tissue Normal" "Solid Tissue Normal" [9] "Solid Tissue Normal" "Solid Tissue Normal" "Solid Tissue Normal" "Solid Tissue Normal"

Showing that groupCol is actually getting a column that can be splitted in 2.

Hoping to get some help, thank you in advance


R tcgabiolinks • 1.3k views
ADD COMMENTlink modified 6 months ago by Kevin Blighe56k • written 3.4 years ago by Nopstoc0

Post on the GitHub issues page:

It's not our role to follow up on all of these issues for developers who don't participate here on Biostars.


ADD REPLYlink written 6 months ago by Kevin Blighe56k
Please log in to add an answer.
The thread is closed. No new answers may be added.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1592 users visited in the last hour