I have EST as well as EST cluster data for the prediction of miRNAs in a particular genome. The EST sequences roughly come upto 170 MB while the EST cluster file is just 6 MB. I would normally use the cluster file but this big a difference in the size of sequences honestly is surprising to me. I have got a methodology planned out so could you please suggest as to which file I should use for the miRNA analysis.
Thanks in advance.