Question

Finding uORFs in non-model organisms

0

Entering edit mode

8.4 years ago

srgonzalesvanhorn • 0

Hello,

I would like to identify upstream ORFs in the 5' UTRs of genes within the genomes of several bat species (Pteropus alecto is one such genome, for example). These organisms seem to be somewhat annotated in NCBI, but 99% of the genes are 'predicted'. These genomes are not on Ensembl or within the UCSC genome browser. The lack of annotation is apparent when I view the genome files for download and see that they are listed in the directory CHR_UN, and not placed in directories labelled with specific chromosomes.

Based on this information, is it possible to still perform the analysis using listed gene locations, even though they are only predictions? My idea was to make a list of 5' UTRs for each predicted gene, then search for start codons/kozak sequence that also contain a stop codon. If this sounds feasible, how do I create a list of 5' UTRs from this type of data set?

Thanks for your suggestions.

Sarah

non-model-organism 5-prime-UTR annotation uORF • 2.3k views

ADD COMMENT • link updated 21 months ago by Ram 43k • written 8.4 years ago by srgonzalesvanhorn • 0

Ram · Answer 1 · 2016-01-05

0

Entering edit mode

8.3 years ago

srgonzalesvanhorn • 0

Update:

After digging into what I'm actually working with, I have a scaffold assemblies. Is it possible to perform uORF analysis when a complete genome isn't available?

ADD COMMENT • link updated 4.4 years ago by Ram 43k • written 8.3 years ago by srgonzalesvanhorn • 0

score 0 · Answer 2 · 2016-01-05

0

Entering edit mode

8.3 years ago

Antonio R. Franco ★ 5.1k

Look for the Genmark program. The bat genome ha to be close to the mouse one, and I am sure it will work nicely

ADD COMMENT • link 8.3 years ago by Antonio R. Franco ★ 5.1k

Ram · Answer 3 · 2016-01-06

0

Entering edit mode

8.3 years ago

Chirag Nepal ★ 2.4k

Transdecoder might help.

ADD COMMENT • link updated 4.4 years ago by Ram 43k • written 8.3 years ago by Chirag Nepal ★ 2.4k