BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access

Gene selection and classification of microarray data using random forest

Ramón Díaz-Uriarte* and Sara Alvarez de Andrés

BMC Bioinformatics 2006, 7:3 doi:10.1186/1471-2105-7-3

Accesses  

  • Last 30 days: 564 accesses
  • Last 365 days: 5222 accesses
  • All time: 22580 accesses

Cited by

BioMed Central: 33 citations

Research article   Open Access Highly Accessed

Is EC class predictable from reaction mechanism?

Neetika Nath, John BO Mitchell BMC Bioinformatics 2012, 13:60 (24 April 2012)

Research article   Open Access

Gene selection and classification for cancer microarray data based on machine learning and similarity measures

Qingzhong Liu, Andrew H Sung, Zhongxue Chen, Jianzhong Liu, Lei Chen, Mengyu Qiao, Zhaohui Wang, Xudong Huang, Youping Deng BMC Genomics 2011, 12(Suppl 5):S1 (23 December 2011)

Methodology article   Open Access

Morphometic analysis of TCGA glioblastoma multiforme

Hang Chang, Gerald V Fontenay, Ju Han, Ge Cong, Frederick L Baehner, Joe W Gray, Paul T Spellman, Bahram Parvin BMC Bioinformatics 2011, 12:484 (20 December 2011)

Proceedings   Open Access

Large-scale risk prediction applied to Genetic Analysis Workshop 17 mini-exome sequence data

Gengxin Li, John Ferguson, Wei Zheng, Joon Lee, Xianghua Zhang, Lun Li, Jia Kang, Xiting Yan, Hongyu Zhao BMC Proceedings 2011, 5(Suppl 9):S46 (29 November 2011)

Research   Open Access

Random forests for verbal autopsy analysis: multisite validation study using clinical diagnostic gold standards

Abraham D Flaxman, Alireza Vahdatpour, Sean Green, Spencer L James, Christopher JL Murray, Population Health Metrics 2011, 9:29 (4 August 2011)

This article is part of a collection on Verbal autopsy:...

Research   Open Access

Predicting siRNA potency with random forests and support vector machines

Liangjiang Wang, Caiyan Huang, Jack Y Yang BMC Genomics 2010, 11(Suppl 3):S2 (1 December 2010)

Research article   Open Access Highly Accessed

The behaviour of random forest permutation-based variable importance measures under predictor correlation

Kristin K Nicodemus, James D Malley, Carolin Strobl, Andreas Ziegler BMC Bioinformatics 2010, 11:110 (27 February 2010)

Research article   Open Access

Optimal classifier selection and negative bias in error rate estimation: an empirical study on high-dimensional prediction

Anne-Laure Boulesteix, Carolin Strobl BMC Medical Research Methodology 2009, 9:85 (21 December 2009)

Proceedings   Open Access

Selection of important variables by statistical learning in genome-wide association analysis

Wei Yang, C Charles Gu BMC Proceedings 2009, 3(Suppl 7):S70 (15 December 2009)

Proceedings   Open Access

Detecting significant single-nucleotide polymorphisms in a rheumatoid arthritis study using random forests

Minghui Wang, Xiang Chen, Meizhuo Zhang, Wensheng Zhu, Kelly Cho, Heping Zhang BMC Proceedings 2009, 3(Suppl 7):S69 (15 December 2009)

Proceedings   Open Access

Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects

Yoonhee Kim, Robert Wojciechowski, Heejong Sung, Rasika A Mathias, Li Wang, Alison P Klein, Rhoshel K Lenroot, James Malley, Joan E Bailey-Wilson BMC Proceedings 2009, 3(Suppl 7):S64 (15 December 2009)

Proceedings   Open Access

Graph ranking for exploratory gene data analysis

Cuilan Gao, Xin Dang, Yixin Chen, Dawn Wilkins BMC Bioinformatics 2009, 10(Suppl 11):S19 (8 October 2009)

Research article   Open Access Highly Accessed

Effects of sample size on robustness and prediction accuracy of a prognostic gene signature

Seon-Young Kim BMC Bioinformatics 2009, 10:147 (16 May 2009)

Methodology article   Open Access Highly Accessed

Performance of random forest when SNPs are in linkage disequilibrium

Yan A Meng, Yi Yu, L Adrienne Cupples, Lindsay A Farrer, Kathryn L Lunetta BMC Bioinformatics 2009, 10:78 (5 March 2009)

Research   Open Access

A random forest approach to the detection of epistatic interactions in case-control studies

Rui Jiang, Wanwan Tang, Xuebing Wu, Wenhui Fu BMC Bioinformatics 2009, 10(Suppl 1):S65 (30 January 2009)

Research   Open Access

Using random forest for reliable classification and cost-sensitive learning for medical diagnosis

Fan Yang, Hua-zhen Wang, Hong Mi, Cheng-de Lin, Wei-wen Cai BMC Bioinformatics 2009, 10(Suppl 1):S22 (30 January 2009)

Software   Open Access Highly Accessed

DFP: a Bioconductor package for fuzzy profile identification and gene reduction of microarray data

Daniel Glez-Peña, Rodrigo Álvarez, Fernando Díaz, Florentino Fdez-Riverola BMC Bioinformatics 2009, 10:37 (29 January 2009)

Research article   Open Access Highly Accessed

Prediction of glycosylation sites using random forests

Stephen E Hamby, Jonathan D Hirst BMC Bioinformatics 2008, 9:500 (27 November 2008)

Research article   Open Access

Application of two machine learning algorithms to genetic association studies in the presence of covariates

Bareng AS Nonyane, Andrea S Foulkes BMC Genetics 2008, 9:71 (14 November 2008)

Proceedings   Open Access

Very Important Pool (VIP) genes – an application for microarray-based molecular signatures

Zhenqiang Su, Huixiao Hong, Hong Fang, Leming Shi, Roger Perkins, Weida Tong BMC Bioinformatics 2008, 9(Suppl 9):S9 (12 August 2008)

Research article   Open Access Highly Accessed

A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification

Alexander Statnikov, Lily Wang, Constantin F Aliferis BMC Bioinformatics 2008, 9:319 (22 July 2008)

Methodology article   Open Access Highly Accessed

Conditional variable importance for random forests

Carolin Strobl, Anne-Laure Boulesteix, Thomas Kneib, Thomas Augustin, Achim Zeileis BMC Bioinformatics 2008, 9:307 (11 July 2008)

Research article   Open Access

Genes related to apoptosis predict necrosis of the liver as a phenotype observed in rats exposed to a compendium of hepatotoxicants

Lingkang Huang, Alexandra N Heinloth, Zhao-Bang Zeng, Richard S Paules, Pierre R Bushel BMC Genomics 2008, 9:288 (16 June 2008)

Research article   Open Access

Impaired barrier function by dietary fructo-oligosaccharides (FOS) in rats is accompanied by increased colonic mitochondrial gene expression

Wendy Rodenburg, Jaap Keijer, Evelien Kramer, Carolien Vink, Roelof van der Meer, Ingeborg MJ Bovee-Oudenhoven BMC Genomics 2008, 9:144 (27 March 2008)

Research   Open Access

A comparative study of different machine learning methods on microarray gene expression data

Mehdi Pirooznia, Jack Y Yang, Mary Qu Yang, Youping Deng BMC Genomics 2008, 9(Suppl 1):S13 (20 March 2008)

Proceedings   Open Access

Picking single-nucleotide polymorphisms in forests

Daniel F Schwarz, Silke Szymczak, Andreas Ziegler, Inke R König BMC Proceedings 2007, 1(Suppl 1):S59 (18 December 2007)

Proceedings   Open Access

Two-stage approach for identifying single-nucleotide polymorphisms associated with rheumatoid arthritis using random forests and Bayesian networks

Yan Meng, Qiong Yang, Karen T Cuenco, L Adrienne Cupples, Anita L DeStefano, Kathryn L Lunetta BMC Proceedings 2007, 1(Suppl 1):S56 (18 December 2007)

Proceedings   Open Access

Genetic association studies for gene expressions: permutation-based mutual information in a comparison with standard ANOVA and as a novel approach for feature selection

Silke Szymczak, Angelo Nuzzo, Christian Fuchsberger, Daniel F Schwarz, Andreas Ziegler, Riccardo Bellazzi, Bernd-Wolfgang Igl BMC Proceedings 2007, 1(Suppl 1):S9 (18 December 2007)

Methodology article   Open Access

Gene selection for classification of microarray data based on the Bayes error

Ji-Gang Zhang, Hong-Wen Deng BMC Bioinformatics 2007, 8:370 (3 October 2007)

Software   Open Access

GeneSrF and varSelRF: a web-based tool and R package for gene selection and classification using random forest

Ramón Diaz-Uriarte BMC Bioinformatics 2007, 8:328 (3 September 2007)

Research article   Open Access

Selecting dissimilar genes for multi-class classification, an application in cancer subtyping

Zhipeng Cai, Randy Goebel, Mohammad R Salavatipour, Guohui Lin BMC Bioinformatics 2007, 8:206 (16 June 2007)

Methodology article   Open Access Highly Accessed

Bias in random forest variable importance measures: Illustrations, sources and a solution

Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis, Torsten Hothorn BMC Bioinformatics 2007, 8:25 (25 January 2007)

Random forest algorithms classify biological data, and can be biased when variables of different types are used such as sequence data and folding energy, but if the weighting of the importance of the variables is calculated differently this bias is corrected.

Research article   Open Access

Individualized markers optimize class prediction of microarray data

Pavlos Pavlidis, Panayiota Poirazi BMC Bioinformatics 2006, 7:345 (14 July 2006)