Question

Criteria for identifying tandem and segmental duplicates

1

Entering edit mode

9.1 years ago

kozhaki.seq ▴ 60

There are previous reports available on the estimated tandem and segmental duplicated genes (TDs and SDs) in rice (http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0063551). But I am not sure about the reliability of the data. Hence,I would like to analyze TDs and SD genes (not blocks) in rice.

Here, I am unable to make a criteria for TD and SD estimation (a condition which gives better reliable data). I noticed different groups used their-own criteria. For ex, the previous work uses "identity was > = 30% and the alignment covered > = 70% of the protein length" for TD. In other work, > 94% identity is used for SD (http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179966/)

Can anyone give suggestion before my analysis? I mean a cut off for TDs and SDs identification separately.
If anyone know a software suit which deals such duplicated genes estimation, it would be helpful. (FYI: I found some published suits, but the webpage is not working).

Thank you.

alignment blast software duplication genome • 2.4k views

ADD COMMENT • link updated 23 months ago by Ram 43k • written 9.1 years ago by kozhaki.seq ▴ 60

score 0 · Answer 1 · 2015-07-29

0

Entering edit mode

8.7 years ago

Hranjeev ★ 1.5k

The typical criteria that defines a segmental duplication is that the sequence is at least 1kb and 90% identical to the duplicate.

ADD COMMENT • link 8.7 years ago by Hranjeev ★ 1.5k