Question: ArrayExpress processed data
11 months ago, alessandro.palma wrote:

Hi, I have a question about how to identify transcript names in processed data from ArrayExpress. I downloaded the processed data for this study (E-GEOD-24395) and got a set of files, each with three columns: reporter identifier, expression value and p-value.

Is there any way to find the transcript name (something similar to a gene symbol) corresponding to each reporter identifier? I searched for some of these reporter identifiers in all the files provided with the study (simply opening each file and using Ctrl+F), but I couldn't find them anywhere. So the problem is that I don't really know what these "Reporter identifiers" are...

I also noticed that the file named "A-MEXP-930.adf.txt" (where I can get the HUGO IDs) has exactly the same number of rows as the processed files (48,701 after removing the header). I guess I could build a data frame combining the HUGO IDs extracted from this file with the expression values for each sample from the other files, but I am not sure that the rows correspond: the listed HUGO IDs could have been sorted or manipulated in some way, and there are also some blank values when I read them as a table in R.
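Whether two files of equal length really list the same identifiers in the same order can be checked rather than assumed. The snippet below is only a minimal sketch in Python: the function name and the example identifiers are made up for illustration, and it assumes the identifier column has already been read from each file into a list.

```python
def compare_identifiers(annotation_ids, processed_ids):
    """Report whether two identifier columns agree as sets and row by row."""
    annotation_set = set(annotation_ids)
    processed_set = set(processed_ids)
    return {
        # Equal length alone does not prove the rows correspond.
        "same_length": len(annotation_ids) == len(processed_ids),
        # Same identifiers, regardless of order.
        "same_set": annotation_set == processed_set,
        # Same identifiers in exactly the same row order.
        "same_order": list(annotation_ids) == list(processed_ids),
        # Identifiers in the processed file with no annotation row.
        "missing_from_annotation": sorted(processed_set - annotation_set),
    }


# Made-up identifiers, purely for illustration.
adf_ids = ["probe_A", "probe_B", "probe_C"]
sample_ids = ["probe_C", "probe_A", "probe_B"]
report = compare_identifiers(adf_ids, sample_ids)
```

If "same_set" is true but "same_order" is false, pasting the columns side by side would mislabel rows, and the files should be joined on the identifier instead.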

Any help? Thanks


Thank you! I actually verified my earlier hypothesis: the A-MEXP-930.adf.txt file is exactly the same as the table provided by GEO. But I think downloading the table you suggested is the better and simpler option.

written 11 months ago by alessandro.palma

Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.

written 11 months ago by genomax

11 months ago, Rasoul Godini wrote:

Hi, you can go to the platform the experiment was run on; in your case it is GPL6106 (to find it, search NCBI for one of the samples, for instance GSM601376, and look up its platform). Download the table containing all the information about the probes, open it in Excel, and use the VLOOKUP formula to place the corresponding gene symbol (or other information) next to each reporter identifier. Here, the reporter identifier is the ID in that table. Good luck.
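The same lookup can also be scripted instead of done in a spreadsheet. The following is a rough Python sketch of the idea, not a tested recipe for this dataset: the column names ("ID", "Symbol"), the helper names and the probe identifiers are assumptions made up for the example, and both files are assumed to be tab-delimited and to start with their header row (any comment lines at the top of a downloaded platform table would have to be skipped first).

```python
import csv
import io


def load_symbol_lookup(annotation_file, id_column, symbol_column):
    """Build a dict mapping probe ID to gene symbol from a platform table."""
    reader = csv.DictReader(annotation_file, delimiter="\t")
    return {row[id_column]: row[symbol_column] for row in reader}


def annotate(processed_file, lookup, missing="NA"):
    """Yield processed rows with the matching symbol appended.

    The reporter identifier is assumed to be the first column.
    """
    reader = csv.reader(processed_file, delimiter="\t")
    header = next(reader)
    yield header + ["Symbol"]
    for row in reader:
        # Match on the identifier, not on row position.
        yield row + [lookup.get(row[0], missing)]


# Made-up stand-ins for the platform table and one processed file.
annotation_text = "ID\tSymbol\nprobe_A\tGENE1\nprobe_B\t\nprobe_C\tGENE3\n"
processed_text = (
    "Reporter Identifier\tVALUE\tP-VALUE\n"
    "probe_C\t7.2\t0.01\n"
    "probe_A\t5.1\t0.20\n"
    "probe_X\t6.0\t0.05\n"
)

lookup = load_symbol_lookup(io.StringIO(annotation_text), "ID", "Symbol")
annotated = list(annotate(io.StringIO(processed_text), lookup))
```

Because the match is made on the identifier, it does not matter whether the two files are sorted the same way. Probes with a blank symbol in the platform table stay blank, while identifiers absent from the table are marked "NA", so the two cases remain distinguishable.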
