How To Get The The Ncrna Loci From Ucsc Genome Browser Downloads
2
2
Entering edit mode
11.8 years ago
ashis.csedu ▴ 110

Hi, I am looking for ncRNA loci after reading a paper where the authors stated that they collected the ncRNA loci from the UCSC genome Browser. I've been looking for it in the download section and also in the Table browser. Can you help me to find out that (for example a sample specis: Mouse (mm9)? Thanks.

mouse • 7.8k views
ADD COMMENT
2
Entering edit mode
11.8 years ago
Vikas Bansal ★ 2.4k

Hi. From NCBI Refseq key -

NR_123456    RNA    Mixed    Non-coding transcripts including structural RNAs, transcribed pseudogenes, and others.

It describes the ID's starting with NR.

A UCSC answer is here

The information you are requesting can be retrieved using the Table
> Browser. Click on "Tables" on the blue bar on the top of the main page
> and make the following selections:
>
> clade: Mammal
> genome: Human
> assembly: hg19
> group: Genes and Gene Prediction Tracks
> track: RefSeqGenes
> table: refGene
> region: genome
> filter: click 'create' then in the 'name DOES match' field type: NR_*
> output format: select fields from primary and related tables
> output file: if you would rather have the results saved to a file
> instead of displaying in the browser window, enter the name you would
> like the output file to have, otherwise, leave blank
> file type returned: plain text
>
> Click "get output". From here, select the following: name, chrom,
> strand. Click "get output". This will provide a list of non-coding RNA
> names, positions, and strands. To retrieve sequences, go back to the
> table browser and set the same settings, except set 'output format' to
> 'sequence'.

You can just change the genome and assembly according to your preference.

ADD COMMENT
2
Entering edit mode

I didn't know about the NR_ prefix. Thanks. That makes it easy to use SQL:

mysql --user genome --host genome-mysql.cse.ucsc.edu  -AD mm9 -e \
     "select chrom, txStart, txEnd, name, strand from refGene WHERE name like 'NR_%'
ADD REPLY
0
Entering edit mode

Much elaborate workflow to get ncRNAs! Thanks Vikas Bansal.

ADD REPLY
0
Entering edit mode

Hi, Vikas Bansal. "NR" represent non-coding gene in refseq gene annotation. How about ucsc known gene database? Which id represent non-coding RNA for ucsc known gene database?

ADD REPLY
1
Entering edit mode
11.8 years ago
Wjeck ▴ 490

Not sure if this will work, but you could try to use the CDS start and CDS stop fields. I believe they are both equal to the Transcription start field for noncoding RNAs. For coding genes CDS start != CDS stop

ADD COMMENT
0
Entering edit mode

Thanks Wjeck. I got that by looking at the CDS positions. Really helps.

ADD REPLY

Login before adding your answer.

Traffic: 2592 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6