Question: Genome-wide homology human versus mouse
gravatar for Selenocysteine
5.7 years ago by
Dublin, Ireland
Selenocysteine600 wrote:

Hello everyone,

I am trying to get some information about the degree of homology/similarity between the human and mouse genome (whole genome, not sequence by sequence). In particular, I would like to get this information about:

- homology at the nucleotide level (whole genome and exome)

- homology at the coded aminoacid level (exome)

Are there any websites or previous publications available? All the information I am finding is kinda confused and in the end I am not able to sum it up in a coherent way.


Thanks a lot!

alignment genome • 6.0k views
ADD COMMENTlink modified 5.7 years ago by pld4.9k • written 5.7 years ago by Selenocysteine600
gravatar for Kamil
5.7 years ago by
Kamil2.0k wrote:

For data on homology between mouse and human, I recommend The Jackson Laboratory data:


ADD COMMENTlink written 5.7 years ago by Kamil2.0k
gravatar for pld
5.7 years ago by
United States
pld4.9k wrote:

I'm not sure what you really mean by "homolgy at the coded amino acid level (exome)" and how this fits with your first point of "whole genome, not sequence by sequence". Did you mean the exome homology, transcriptome homology, CDS homology or protein homology? By default, these will all be "sequence by sequence".

On the first point, what is the genomic homology, it really depends on how you are calculating it. You have to remember that the genome isn't a single contiguous piece of DNA. It is divided up into multiple chromosomes and mice and humans have different numbers of chromosomes. Mice have two fewer chromosomes than humans, which translates to a ~15% smaller genome compared to humans.

You also have to remember that the chromosomes between species are structured differently, so even though a gene might be perfectly conserved between both species, it may be located on different chromosomes between species. You have to also account for differences in the areas immediately surrounding that gene, the promoters and other features, as well as neighboring genes.

You also have instances where genes have been duplicated in one species but not the other, and whole families of genes can be expanded in one species compared to another.

This is probably why you can't reasonably sum things up into a single 'simple' value.


This breaks out some values:

It looks like 40% of the human genome will align at the nucleotide level, although I'm not sure this translates directly into 40% homology.

If you look at syntenic regions, about 90% of the mouse genome resides in syntenic blocks conserved between the human and mouse, but I'm not sure what the nucleotide homology between these regions are.

ADD COMMENTlink written 5.7 years ago by pld4.9k

Hi, by "not sequence by sequence" I meant that I don't want to manually assess the homology with sequence by sequence tools, like BLAST etc. I just want comprehensive homology data including every protein coding gene; of course this will be assessed sequence by sequence.

I know that it is not easy to determine the degree of homology between two different organisms (also, considering that it may vary among different individuals...), however I hope there are some comprehensive studies that at least try to approximate it.

ADD REPLYlink written 5.7 years ago by Selenocysteine600
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1239 users visited in the last hour