Question: combine RNA-seq data count and len
0
gravatar for mikysyc2016
21 months ago by
mikysyc201670
mikysyc201670 wrote:

I have two txt file. One include transcription id and gene length, another one have transcription id and each sample reads, i want to combine them as one transcription id and reads count and length, how I can do it. I know vlookup can, but it is not good for big data. Thanks!

rna-seq • 494 views
ADD COMMENTlink modified 21 months ago by c.chakraborty160 • written 21 months ago by mikysyc201670

Give an example of both files and the intended output please.

ADD REPLYlink written 21 months ago by ATpoint31k

It looks like: one is :

Transcript KO1 KO2 KO3 WT1 WT2 WT3 78 79 81 66 68 70 27 28 29 NM_001011874 3 0 0 2 3 0 1 0 0 1 0 0 1 3 2 NM_001195662 0 0 0 2 1 0 0 0 0 0 0 0 0 0 0 NM_011283 0 0 0 2 1 0 0 0 0 0 0 0 0 0 0 NM_011441 769 153 314 871 158 399 289 224 888 275 270 1031 285 1360 821

.... another one is :

Transcript length NR_040439 1687 NM_013715 1239 NM_026493 4354 NM_001164233 2195 NM_027584 2328 NM_001102430 7042 NM_172851 5120

ADD REPLYlink modified 21 months ago • written 21 months ago by mikysyc201670
0
gravatar for c.chakraborty
21 months ago by
c.chakraborty160
c.chakraborty160 wrote:

You can upload both the files in R, and then create a data frame with transcription ID, gene.length, and read.counts together. Use reshape2 and plyr for merging the files. Check this link Help with 2 list in R, comparing gene ID to get refined information from both

ADD COMMENTlink written 21 months ago by c.chakraborty160

thank you for your reply. My case is a little bit different. The order of transcrip id for the two file is different. and one have around ~20000 id, another has ~30000id.

ADD REPLYlink written 21 months ago by mikysyc201670
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1353 users visited in the last hour