Force Not to Ignore Repeats in UCSC Table Browser
1
0
Entering edit mode
5.6 years ago
mgru ▴ 20

I have a list of official gene symbols, each gene symbol repeats several times. Ex: AATK AATK AATK AATK

I want to output some things from UCSC's table browser, but I would like the same things repeated for every gene symbol. So for every row of AATK, to give me the same information again and again (saving me time and coding to repeat).

UCSC automatically ignores repeats and gives me 1 row of info per unique gene symbol.

Any way to turn that feature off?

Thanks!

ucsc • 890 views
ADD COMMENT
0
Entering edit mode
5.6 years ago

you don't want to do this. You want to get the regular output of the UCSC and the use something like join with your data.

ADD COMMENT
0
Entering edit mode

Will join repeat lines? Ie, UCSC gives multiple transcripts per gene. So I assume join would read the first, join correctly but ignore the second? Would you be able to give an example? Below would be an example of my info (secondary info) and UCSC results - 2 matrices.

Ex: UCSC Results

Col1-Transcript Col2-GeneName
Transcript1 AATK
Transcript2 AATK

Table to be joined

Col1-GeneName Col2-SecondaryInfo
AATK   A
AATK   B
AATK   C
AATK   D

Hopeful output after Join (or something else?)

  Col1-GeneName Col2-GeneName Col3-Transcript Col4-SecondaryInfo
AATK   AATK   Transcript1   A
AATK   AATK   Transcript2   A
AATK   AATK   Transcript1   B
AATK   AATK   Transcript2   B
AATK   AATK   Transcript1   C
AATK   AATK   Transcript2   C
AATK   AATK   Transcript1   D
AATK   AATK   Transcript2   D
ADD REPLY
0
Entering edit mode

Will join repeat lines?

just try, show us the command line, we'll help you if you get some problems. Hint: both files must be sorted on the common key.

ADD REPLY
1
Entering edit mode

Thanks! Had a couple of errors to be honest (parentheses, dash, upper case) despite sort on common key, but it did work!

ADD REPLY

Login before adding your answer.

Traffic: 2660 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6