BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access Research article

Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms

Yu Guo1, Armin Graber2, Robert N McBurney3 and Raji Balasubramanian4*

Author Affiliations

1 BG Medicine, Inc., 610 Lincoln St., Waltham, MA 02451, USA

2 Institute for Bioinformatics and Translational Research, UMIT, Eduard Wallnoefer Zentrum 1, 6060 Hall in Tyrol, Austria

3 Optimal Medicine Ltd., Warwick Enterprise Park, Wellesbourne, Warwick CV35 9EF, UK

4 Division of Biostatistics and Epidemiology, University of Massachusetts - Amherst, 715 North Pleasant Street, Amherst, MA 01003, USA

For all author emails, please log on.

BMC Bioinformatics 2010, 11:447 doi:10.1186/1471-2105-11-447

Published: 3 September 2010

Additional files

Additional file 1:

Simulation Results - Supplement. This file includes additional simulation results for the following settings: (i) Comparison of statistical power and average classification accuracy for classifiers KNN, PAM, RF and SVM, when n = 50; (ii) Estimates of average classification accuracy for classifiers KNN, PAM, RF and SVM, based on independent test sets.

Format: DOC Size: 877KB Download file

This file can be viewed with: Microsoft Word Viewer

Open Data