Table 1

Summary of the five gene expression datasets
Dataset               # Genes    Class g    # Samples h
NCI-60 Dataset 1 a    1092 f     c1 / c2    60 (41/19)
NCI-60 Dataset 2 b    2266 f     c1 / c2    60 (41/19)
NCI-60 Dataset 3 c    12625      c1 / c2    59 (40/19)
TCGA Dataset d        11861      c1 / c2    136 (47/89)
CCLE Dataset e        18988      c1 / c2    1036 (541/495)

a Affymetrix HUM6000 array data from Millennium Pharmaceuticals.

b cDNA array data from the Weinstein (NCI) and Brown & Botstein (Stanford) groups.

c Affymetrix U95A data from Novartis.

d Gene expression data (RNA-Seq) for glioblastoma multiforme (GBM).

e mRNA expression data for cancer cell lines (Affymetrix U133+2 arrays).

f Number of genes remaining after excluding genes with a missing expression value in at least one sample.

g c1: functional p53 mutation; c2: non-functional p53 mutation or p53 wild-type.

h The sample size of each class (c1/c2) is given in parentheses.

Wang and Simon BMC Medical Genomics 2013 6:30   doi:10.1186/1755-8794-6-30
