Question

How To Find Paralogues?

5

Entering edit mode

13.9 years ago

nikulina ▴ 300

Good day! This morning I faced a problem. I decided to find paralogues of some human genes. Searching the web I've found the GeneDecks resource. I considered it to be not very convinient. Are their any alternative tools to do this job? Is it possible to gain such information using UCSC (unfortunatly I have not found the propper button there %))? Thanks.

ucsc • 5.8k views

ADD COMMENT • link updated 12.1 years ago by Larry_Parnell 16k • written 13.9 years ago by nikulina ▴ 300

Ram · Answer 1 · 2010-06-13

Would the genomicSuperDups track be fulfill your needs?

mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A -D hg18 -e 'desc genomicSuperDups'
+----------------+------------------+------+-----+---------+-------+
| Field          | Type             | Null | Key | Default | Extra |
+----------------+------------------+------+-----+---------+-------+
| bin            | smallint(6)      | NO   |     | 0       |       |
| chrom          | varchar(255)     | NO   | MUL |         |       |
| chromStart     | int(10) unsigned | NO   |     | 0       |       |
| chromEnd       | int(10) unsigned | NO   |     | 0       |       |
| name           | varchar(255)     | NO   | MUL |         |       |
| score          | int(10) unsigned | NO   |     | 0       |       |
| strand         | char(1)          | NO   |     |         |       |
| otherChrom     | varchar(255)     | NO   |     |         |       |
| otherStart     | int(10) unsigned | NO   |     | 0       |       |
| otherEnd       | int(10) unsigned | NO   |     | 0       |       |
| otherSize      | int(10) unsigned | NO   |     | 0       |       |
|(...)
+----------------+------------------+------+-----+---------+-------+

This track shows regions detected as putative genomic duplications within the golden path. The following display conventions are used to distinguish levels of similarity:

Light to dark gray: 90 - 98% similarity

Light to dark yellow: 98 - 99% similarity

Light to dark orange: greater than 99% similarity

Red: duplications of greater than 98% similarity that lack sufficient Segmental Duplication Database evidence (most likely missed overlaps)

For a region to be included in the track, at least 1 Kb of the total sequence (containing at least 500 bp of non-RepeatMasked sequence) had to align and a sequence identity of at least 90% was required.

Ram · Answer 2 · 2010-06-13

6

Entering edit mode

13.9 years ago

Casey Bergman 18k

There are a number of other gene family databases that you can try:

ADD COMMENT • link updated 5.6 years ago by Ram 43k • written 13.9 years ago by Casey Bergman 18k

score 3 · Answer 3 · 2010-06-14

3

Entering edit mode

13.9 years ago

Fred Fleche 4.3k

[?]

http://epgd.biosino.org/EPGD/

ADD COMMENT • link 13.9 years ago by Fred Fleche 4.3k

score 2 · Answer 4 · 2011-09-01

The assumption is orthologous genes have identical or highly related functions and this sharing is greater than for paralogs. But Nehrt, Hahn et al challenge this by offering that "the most important factor in the evolution of function is not amino acid sequence, but rather the cellular context in which proteins act."

They combined experimentally derived function with gene expression data on nearly 9000 proteins.

This is certainly a controversial statement, but in a thought-provoking manner. After all, it is the integration of diverse data that are driving a lot of genomics. One example is GWAS (genome-wide association studies) + gene expression = better identification of likely causal variant. The same might be applied to the ortholog/paralog definition.