Oligo Design From Ests
2
1
Entering edit mode
11.1 years ago
Ketil 4.1k

When designing oligos for a microarray from ESTs, it seems to be crucial to choose the correct direction (strand) for the oligos, but I can't seem to find anything in the literature on this, or how to do this. (I've written a small tool to conditionally reverse-complement ESTs using a dynamic programming algorithm that takes into account BlastX hits and poly-A etc, but I'm unsure how important this is for the results.)

Any pointers or opinions?

microarray est • 3.0k views
ADD COMMENT
2
Entering edit mode
11.1 years ago
Neilfws 49k

Correct strand is certainly crucial - wrong strand = no hybridization!

I guess there are 3 approaches:

  • if a researcher generates the ESTs in their lab, presumably they know whether they are 5' or 3' clones
  • if ESTs are obtained from dbEST, sequences are annotated as 5' or 3'
  • otherwise as you say, a software tool to match ESTs to chromosomal sequence is required

This looks useful: "ESTPiper – a web-based analysis pipeline for expressed sequence tags". It includes a tool for microarray oligonucleotide probe design and the paper discusses other EST analysis tools, some of which also do probe design.

Google search for EST strand probe design also throws up plenty of useful-looking results.

ADD COMMENT
0
Entering edit mode

I tried to rely on strand annotation from dbEST, but soon realized it is often wrong. I think this is due to clones being inserted the wrong way into the vector, I guess the rate this happens depends on the kit being used. This also means that although the researcher "knows" the orientation, she will often be wrong.

ADD REPLY
1
Entering edit mode
11.1 years ago
Darked89 4.2k

1) There is SeqClean utility which may help you get rid of artifacts: http://compbio.dfci.harvard.edu/tgi/software/

You may also try to construct rRNA library and for using it with SeqClean.

2) check your ESTs for common repeats. While some transposons are expressed, I am not sure if you want to have some pre-mRNA intronic sequences on the chip.

3) did you tried assembling all your ESTs?

4) you may also tblastx your ESTs against these from related species. There may not be any proteins @NCBI covering the less conserved protein parts from your species of interest.

ADD COMMENT
1
Entering edit mode

Thanks for bringing this up. I have had really poor results from the various EST cleaning tools, some old notes at http://blog.malde.org/index.php/2008/05/08/cleaning-up-sequences/. Do you have any independent resources demonstrating the effectiveness of these tools?

ADD REPLY
0
Entering edit mode

If the question is: did I create an artificial "EST" set with vectors/ribosomal sequences thrown in, then point mutated/ mutated with indels/flipped (all this can be seen in real EST data), then looked at results, then answer is: not yet. I am mapping 3+ millions of various species ESTs to a novel genome, some part of EST sets I mapped without seqclean pre-processing, so I hope to see the difference. GMAP, which I use for ESTs mapping is often overeager to call a match (on a protein level) with UTRs, and possibly with other sequences, so less dubious ends should give less nonsense matches.

ADD REPLY

Login before adding your answer.

Traffic: 2129 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6