How do you clean GEO metadata before downstream analysis?
9 weeks ago
Alba • 0

Hey everyone,

I’ve been working a lot with GEO datasets lately, and I always find the metadata messy and inconsistent (organism names vary, values are missing, etc.). This becomes a real bottleneck when you want to combine multiple datasets or run a meta-analysis.

I’m curious:

  • How do you normally clean the metadata?
  • Do you use your own scripts?
  • Do you trust the platform annotations?

I recently stumbled upon a small browser-based tool that lets you input a GEO ID or upload a .soft file and it gives back cleaned metadata (normalizes organism names, highlights missing values, etc.).

Here’s the tool in case you want to take a look:
metagenclean[dot]streamlit[dot]app (Works best in Chrome; Safari has some JS issues)
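
For comparison, the two steps mentioned above (normalizing organism names and flagging missing values) can be sketched in plain Python. This is only a minimal illustration, not how the linked tool works: the synonym table, the list of "missing" tokens, the field names, and the sample IDs are all assumptions made up for the example.

```python
# Hypothetical synonym table; extend it for the organisms in your own datasets.
ORGANISM_SYNONYMS = {
    "homo sapiens": "Homo sapiens",
    "human": "Homo sapiens",
    "h. sapiens": "Homo sapiens",
    "mus musculus": "Mus musculus",
    "mouse": "Mus musculus",
}

# Strings treated as missing (an assumption; submitters use many variants).
MISSING_TOKENS = {"", "na", "n/a", "none", "unknown", "not available", "--"}


def is_missing(value):
    """Return True if the value is None or looks like a placeholder."""
    return value is None or str(value).strip().lower() in MISSING_TOKENS


def normalize_organism(name):
    """Map a free-text organism name to a canonical form, if one is known."""
    if is_missing(name):
        return None
    collapsed = " ".join(str(name).split())  # trim and collapse whitespace
    return ORGANISM_SYNONYMS.get(collapsed.lower(), collapsed)


def clean_samples(samples):
    """Clean a list of per-sample metadata dicts and list their missing fields."""
    cleaned = []
    for sample in samples:
        row = {k: (None if is_missing(v) else str(v).strip())
               for k, v in sample.items()}
        row["organism"] = normalize_organism(sample.get("organism"))
        row["missing_fields"] = sorted(k for k, v in row.items() if v is None)
        cleaned.append(row)
    return cleaned


# Made-up records for illustration; real ones would come from a parsed .soft file.
samples = [
    {"gsm": "GSM1", "organism": "  HUMAN ", "tissue": "N/A"},
    {"gsm": "GSM2", "organism": "Mus  musculus", "tissue": "liver"},
]
result = clean_samples(samples)
```

Unknown organism names are passed through with only whitespace cleanup, so nothing is silently rewritten to a wrong species.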

I’d love to hear your thoughts:

  • Would this be useful in your workflow?
  • Anything missing that you’d want it to do?
  • Any red flags?

Really appreciate any feedback!

geo • 2.0k views

Again... "I recently stumbled upon a small browser-based tool"

On the site: "Created by alba0gf"

That's the second time you've linked your own tool under the pretence of "this is just something I found online".


Agreed, just make a "Tool" post if you want to spread the word about it.

People tend to take those much better than these weak attempts at nesting a recommendation in a pointless question.

edit: Oh, I just saw, you did make a tool post as well. In that case, this question should be removed.


They did make a Tool post originally, but still phrased it as something they just came across rather than "I made thing". I dunno, it's odd.
