Table 2

Summary of gene datasets
Dataset Description Updated Genes Reference
hg18rpa Protein-coding genes on autosomal chromosomes (based on human genome hg18 and RefSeq genes) 2011 18,166 [35]
HI Recombination hotspots intersected genes 2010 1,156 -
MI Recombination middle spots intersected genes 2010 1,481 -
CI Recombination cold spots intersected genes 2010 1,594 -
HK Housekeeping genes 2011 1,974 [36]
SPD Genes encoding secreted proteins 2010 1,667 [37]
TiGER Tissue-specific genes 2008 4,710 [38]
DGD Duplicated genes 2012 4,393 [16]
AGE Genes classified by different evolutionary age 2011 16,418 [17]
OMIM Mendelian disorder associated genes in human 2012 2,624 [39]
MD Mendelian Disease Genes (at least one mutation in the particular gene is causative of the disease) 2011 1,629 [21]
CGC Genes with mutations have been causally implicated in cancer 2011 424 [23]
TICdb Reciprocal translocation associated genes in human tumours 2008 240 [40]
dbCRID Chromosomal rearrangement associated genes in human diseases 2010 401 [22]
InteCR Integrated chromosomal rearrangement associated genes in human diseases (Combined of dbCRID, TICdb and CGC genes) 2012 614 -

Summary of gene datasets used in this study. All datasets are mapped to hg18rpa. Gene numbers are counted according to non-redundant official gene symbols. Note that HI, MI and CI genes are defined in this study as mentioned above. Detailed gene lists are given in Additional file 3.

Zhou et al.

Zhou et al. BMC Genomics 2013 14:67   doi:10.1186/1471-2164-14-67

Open Data