Research article
Predicting disease risks from highly imbalanced data using random forest
1 Department of Computer Science, University of Missouri, Columbia, Missouri, USA
2 Department of Statistics, University of Missouri, Columbia, Missouri, USA
3 Department of Health Management and Informatics, University of Missouri, Columbia, Missouri, USA
BMC Medical Informatics and Decision Making 2011, 11:51 doi:10.1186/1472-6947-11-51
Published: 29 July 2011
Additional files
Additional file 1:
Appendix. This file contains two algorithms: Algorithm 1 describes the repeated random sub-sampling procedure, and Algorithm 2 briefly explains the general random forest for classification.
Format: DOC. Size: 39 KB.


