Table 2

Significant differences between the control set and disease set of genes. The features listed differ significantly between Ensembl genes found in OMIM (the disease set) and those not in OMIM (the control set). Significance was calculated using the Mann-Whitney U test unless otherwise noted.

Feature                               Median in control set   Median in disease set   Significance
Gene length                           19 k                    27 k                    P < 0.001
cDNA length                           2,126 bp                2,442 bp                P < 0.001
Protein length                        383 aa                  494 aa                  P < 0.001
3' UTR length                         446 bp                  488 bp                  P < 0.01
Exon number                           8                       10                      P < 0.001
Distance to neighbouring gene         46 kb                   52 kb                   P < 0.01
Protein identity with BRH in mouse    80%                     87%                     P < 0.001
Gene encodes signal peptide           17%                     35%                     P < 0.0001 (calculated using the chi squared test)
5' CpG islands                        12%                     16%                     P < 0.028 (calculated using the chi squared test)
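
The two tests named in the caption can be illustrated with a minimal Python sketch. It is not the authors' code. The function names mann_whitney_u and chi_squared_2x2 are hypothetical, the Mann-Whitney P value uses the tie-corrected normal approximation without continuity correction, the chi squared test assumes a 2x2 table without Yates' correction, and any numbers passed in below are invented toy values rather than data from the study.

```python
import math


def mann_whitney_u(x, y):
    """Two-sided Mann-Whitney U test (normal approximation, tie-corrected).

    Returns (U statistic for sample x, approximate two-sided P value).
    """
    n1, n2 = len(x), len(y)
    # Pool both samples, remembering which group each value came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    ranks = [0.0] * len(pooled)
    tie_term = 0.0
    i = 0
    while i < len(pooled):
        # Find the run of equal values starting at position i.
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        # Tied values share the average of ranks i+1 .. j.
        avg_rank = (i + 1 + j) / 2.0
        for k in range(i, j):
            ranks[k] = avg_rank
        t = j - i
        tie_term += t ** 3 - t
        i = j
    rank_sum_x = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    u = rank_sum_x - n1 * (n1 + 1) / 2.0
    n = n1 + n2
    mean_u = n1 * n2 / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u, 1.0
    z = (u - mean_u) / math.sqrt(var_u)
    # Two-sided tail probability of a standard normal variable.
    return u, math.erfc(abs(z) / math.sqrt(2))


def chi_squared_2x2(a, b, c, d):
    """Pearson chi squared test on the 2x2 table [[a, b], [c, d]].

    Returns (chi squared statistic, P value with 1 degree of freedom).
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 degree of freedom the upper tail is erfc(sqrt(chi2 / 2)).
    return chi2, math.erfc(math.sqrt(chi2 / 2))


if __name__ == "__main__":
    # Toy values only; these are NOT measurements from the paper.
    print(mann_whitney_u([1, 2, 3], [4, 5, 6]))
    print(chi_squared_2x2(20, 80, 40, 60))
```

For continuous features such as gene length, the two samples would be the per-gene values in the control and disease sets. For presence/absence features such as signal peptides, the 2x2 table would hold the counts of genes with and without the feature in each set.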

Adie et al. BMC Bioinformatics 2005 6:55   doi:10.1186/1471-2105-6-55