I am working on Unigene Database But I am having many doubts about the structure of Unigene database.Can anyone please help me by solving them :)
Question 1: How is a Unigene cluster build - I know its through Transcript based sequence similarity and genome based sequence similarity, however a much in-depth explanation about how exactly it is done or the algorithms involved is what I am looking for.
Question 2: When there are two clusters representing a single gene, then how are the two clusters different from each other and why was the cluster split into two clusters (for example RFXANK gene is represented by Hs.522868 and Hs.70936 clusters).