BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access Methodology article

Gene selection and classification of microarray data using random forest

Ramón Díaz-Uriarte1* and Sara Alvarez de Andrés2

Author Affiliations

1 Bioinformatics Unit, Biotechnology Programme, Spanish National Cancer Centre (CNIO), Melchor Fernandez Almagro 3, Madrid, 28029, Spain

2 Cytogenetics Unit, Biotechnology Programme, Spanish National Cancer Centre (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain

For all author emails, please log on.

BMC Bioinformatics 2006, 7:3 doi:10.1186/1471-2105-7-3

Published: 6 January 2006

Additional files

Additional File 1:

A PDF file with additional results, showing error rates and stability for simulated data under various parameters, as well as error rates and stabilities for the real microarray data with other parameters, and further details on the data sets, simulations, and alternative methods.

Format: PDF Size: 343KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional File 2:

A PDF file with additional plots of OOB error rate vs. mtry for both simulated data and real data under other parameters.

Format: PDF Size: 3.1MB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data

Additional File 3:

Source code for the R package varSelRF. This is a compressed (tar.gz) file ready to be installed with the usual R installation procedure under Linux/UNIX. Additional formats are available from CRAN [68], the Comprehensive R Archive Network.

Format: GZ Size: 27KB Download file

Open Data