Question: Downloding only RNA-Seq Raw counts data from ICGC
0
gravatar for David_emir
4 days ago by
David_emir310
India
David_emir310 wrote:

Hi,

I wanted to download RNA-Seq raw counts data from ICGC with samples as columns and gene_symbol as row names. I tried using dplyr but I am not able to get in the proper format. It would be of great help to me if you guys can let me know how to download this.

Sincerely,

Dave

P.S: any package in R would be helpful

raw counts icgc • 80 views
ADD COMMENTlink modified 3 days ago • written 4 days ago by David_emir310

I tried using dplyr but I am not able to get in the proper format.

Share you code here

ADD REPLYlink written 4 days ago by Vijay Lakhujani3.5k

Thanks Vijay, I tried to cut and paste sample_id rows as a column but it gave me a really messed up results.

ADD REPLYlink written 4 days ago by David_emir310

Share you code here

ADD REPLYlink modified 4 days ago • written 4 days ago by ATpoint12k

Hope this will help others, code is here.

mlr --tsv cut -o -f id4,id1,count2 then reshape -s id1,count2 input.tsv

Or

datamash --header-in --whitespace crosstab id4,id1 unique count2 < file

Thanks for your help

ADD REPLYlink modified 3 days ago by ATpoint12k • written 3 days ago by David_emir310
1
gravatar for David_emir
3 days ago by
David_emir310
India
David_emir310 wrote:

Thanks a lot, Guys, I got a solution for this for cross-tabulation (or pivot table) using GNU Datamash and with Miller, using reshape!

Dave

ADD COMMENTlink written 3 days ago by David_emir310
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1223 users visited in the last hour