Open Access Research article

Predicting disease risks from highly imbalanced data using random forest

Mohammed Khalilia¹, Sounak Chakraborty² and Mihail Popescu³*

¹ Department of Computer Science, University of Missouri, Columbia, Missouri, USA

² Department of Statistics, University of Missouri, Columbia, Missouri, USA

³ Department of Health Management and Informatics, University of Missouri, Columbia, Missouri, USA

BMC Medical Informatics and Decision Making 2011, 11:51 doi:10.1186/1472-6947-11-51

Published: 29 July 2011

Additional files

Additional file 1:

Appendix. This file contains two algorithms: Algorithm 1 describes repeated random sub-sampling, and Algorithm 2 briefly explains the general random forest for classification.

Format: DOC. Size: 39 KB.