Table 1

Ensembl genomes used in this study

Genome identifier  Organism                            No. of genes  No. of proteins
ENSP               Human (Homo sapiens)                21971         60953
ENSPTR             Chimpanzee (Pan troglodytes)        19829         39256
ENSPPY             Orangutan (Pongo pygmaeus)          20068         29256
ENSMMU             Macaque (Macaca mulatta)            21905         42370
ENSECA             Horse (Equus caballus)              20322         28128
ENSCAF             Dog (Canis familiaris)              19305         29804
ENSBTA             Cow (Bos taurus)                    21036         29517
ENSMUS             Mouse (Mus musculus)                23873         43630
ENSRNO             Rat (Rattus norvegicus)             22503         37672
ENSMOD             Opossum (Monodelphis domestica)     19471         34132
ENSGAL             Chicken (Gallus gallus)             16736         22945
ENSORL             Medaka (Oryzias latipes)            19686         25174
ENSTNI             Tetraodon (Tetraodon nigroviridis)  19602         23909
ENSDAR             Zebrafish (Danio rerio)             21322         35967

Protein sequences were obtained from the Ensembl database version 51.

Prosdocimi et al. BMC Genomics 2012 13:5   doi:10.1186/1471-2164-13-5
