BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access Methodology article

Microarray-based gene set analysis: a comparison of current methods

Sarah Song1 and Michael A Black2*

Author Affiliations

1 Department of Statistics, University of Auckland, Auckland, New Zealand

2 Department of Biochemistry, University of Otago, Dunedin, New Zealand

For all author emails, please log on.

BMC Bioinformatics 2008, 9:502 doi:10.1186/1471-2105-9-502

Published: 27 November 2008

Additional files

Additional file 1:

Detection rates for the six gene set analysis methods on simple simulated data. In contrast to Table 2, all altered gene sets were simulated so as to exhibit changes in the same direction. This resulted in a major performance improvement for the sigPathway approach. 100 data sets (each containing 20 genes in 20 sets) were analyzed by each method, with 10,000 permutations used to generate p-values to which FDR controlling adjustments [21] were made. An adjusted p-value of 0.05 was required for significance. The value in each cell relates to the proportion of each type of gene set activity correctly identified by each method. Standard errors are shown in parentheses.

Format: PDF Size: 23KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 2:

The distribution of difference and pairwise correlations in diabetes data[3]. (a) difference for each gene, positive value indicates up-regulation in DM2 samples (b) average difference for each gene set (c) pairwise correlations for all gene pairs using DM2 samples (d) average pairwise correlations for each gene set using DM2 samples (e) pairwise correlations for all pairs of genes using NGT samples (f) average pairwise correlations for each gene set using NGT samples.

Format: JPEG Size: 204KB Download file

Open Data

Additional file 3:

The distribution of difference and pairwise correlations in the leukemia data[22]. (a) difference for each gene, positive value indicates up-regulation in AML samples (b) average difference for each gene set (c) pairwise correlations for all gene pairs using AML samples (d) average pairwise correlations for each gene set using AML samples (e) pairwise correlations for all pairs of genes using ALL samples (f) average pairwise correlations for each gene set using ALL samples.

Format: JPEG Size: 197KB Download file

Open Data

Additional file 4:

Application of gene set analysis methods to diabetes data[3](all gene sets). Ranked (by p-value) gene sets produced by each of the six analysis methods. NP indicates the nominal p-values and AP indicates the FDR adjusted p-values.

Format: PDF Size: 74KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional file 5:

Application of gene set analysis methods to leukemia data[22](all gene sets). Ranked (by p-value) gene sets produced by each of the six analysis methods. NP indicates the nominal p-values and AP indicates the FDR adjusted p-values.

Format: PDF Size: 65KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data