Question: Arrayexpress processed data
gravatar for alessandro.palma
2.6 years ago by
alessandro.palma30 wrote:

Hi, I have a question on how to identify transcript names in processed data from Array Express. I saved the processed data from this study: (E-GEOD-24395) and I got some files each one with three columns (reporter identifier, expression value and p-value).

Is there any way to find the transcript name (something similar to gene symbols) corresponding to the reporter identifiers? I searched for some of these reporter identifiers into all files provided within the study (simply opening each file and then doing ctrl+F) but I couldn't fine them anywhere. So, the problem is that I don't really know what these "Reporter identifiers" are...

I also noticed that the file named "A-MEXP-930.adf.txt" (where I can get the hugo IDs) is exactly the same length as the processed files (48,701 rows after removing the header), so I guess I could combine in a data frame the hugo IDs extracted from this file with the expression values for each sample from the other files, but I am not sure about the correspondence between hugo IDs and the reporter identifiers (the listed hugo IDs could have been sorted or manipulated someway, and there are also some blank values when I read them as a table in R).

Any help? Thanks

ADD COMMENTlink modified 2.6 years ago • written 2.6 years ago by alessandro.palma30

Thank you! Actually I verified my previous hypothesis (the A-MEXP-930.adf.txt file was exactly the same as the table provided by GEO). But I think downloading the table you suggested is better and simpler to do.

ADD REPLYlink written 2.6 years ago by alessandro.palma30

Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.

ADD REPLYlink written 2.6 years ago by genomax92k
gravatar for Rasoul Godini
2.6 years ago by
Rasoul Godini0 wrote:

Hi you can go to the platform which the job is done with, in your case it is GPL6106 (do this on NCBI by searching one of the source name for instance GSM601376 and find the platform). Download a table containing all information about probes, open it in an excel file and easily use V-lookup formula to place all corresponding gene symbols (or other information) for each reporter identifier in front of that. Actually, in here reporter identifier is the ID. Good Luck

ADD COMMENTlink written 2.6 years ago by Rasoul Godini0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1323 users visited in the last hour