Table 4

Gene expression microarray datasets used in this study.

Task & dataset

Number of classes

Number of genes

Number of samples

Prediction task


Dx-Alizadeh

3

4026

62

Diffuse large B-cell lymphoma, follicular lymphoma, chronic lymphocytic leukemia

Dx-Alon

2

2000

62

Colon tumors and normal tissues

Dx-Armstrong

3

11225

72

AML, ALL and mixed-lineage leukemia (MLL)

Dx-Bhattacharjee

5

12600

203

4 lung cancer types and normal tissues

Dx-Golub

3

5327

72

Acute myelogenous leukemia (AML), acute lymphoblastic leukemia (ALL) B-cell and ALL T-cell

Dx-Khan

4

2308

83

Small, round blue cell tumors of childhood

Dx-Nutt

4

10367

50

4 malignant glioma types

Dx-Pomeroy

5

5920

90

5 human brain tumor types

Dx-Ramaswamy

26

15009

308

14 various human tumor types and 12 normal tissue types

Dx-Ramaswamy2

2

13247

76

Metastatic and primary tumors

Dx-Shipp

2

5469

77

Diffuse large B-cell lymphomas and follicular lymphomas

Dx-Singh

2

10509

102

Prostate tumor and normal tissues

Dx-Staunton

9

5726

60

9 various human tumor types

Dx-Su

11

12533

174

11 various human tumor types

Px-Beer

2

7129

86

Lung adenocarcinoma survival

Px-Bhattacharjee

2

12600

62

Lung adenocarcinoma 4-year survival

Px-Iizuka

2

7070

60

Hepatocellular carcinoma 1-year recurrence-free survival

Px-Pomeroy

2

7129

60

Medulloblastoma survival

Px-Rosenwald

2

7399

240

Non-Hodgkin lymphoma survival

Px-Veer

2

24188

97

Breast cancer 5-year metastasis-free survival

Px-Veer2

3

24188

115

Breast cancer 5-year metastasis-free survival, metastasis within 5 years, germline BRCA1 mutation

Px-Yeoh

2

12240

233

Acute lymphocytic leukemia relapse-free survival


The reference paper for each dataset is provided in the Additional File 3.

Statnikov et al. BMC Bioinformatics 2008 9:319   doi:10.1186/1471-2105-9-319

Open Data