Schematic representation of the biomarker development pipeline for genomic microarray data. The analysis starts at the top of the funnel with a pre-filtering step applied to the full pre-processed data set (54,613 probe sets from the Affymetrix Human Genome U133 Plus 2.0 GeneChip). Univariate and multivariate ranking and filtering steps follow, and the output is a biomarker panel. The numbers on the right indicate the number of features (probe sets) remaining at each step. The biomarker development pipeline for proteomic data is similar, except that the data sets are typically smaller and proteomics-specific pre-processing steps need to be applied.
Günther et al. BMC Bioinformatics 2012 13:326 doi:10.1186/1471-2105-13-326
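The funnel described in the caption can be illustrated with a minimal sketch in Python. Everything specific here is an assumption made for illustration and is not taken from the paper: the function name `funnel`, the variance-based pre-filter, the Welch t-statistic for the univariate step, the ridge-regression coefficients for the multivariate step, and the cut-off sizes. The data are synthetic.

```python
import numpy as np


def funnel(X, y, n_prefilter=1000, n_univariate=100, n_panel=10, ridge=1.0):
    """Hypothetical three-step feature funnel.

    X: samples x features matrix, y: binary labels (0/1).
    Returns a dict mapping step name -> indices of retained features.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)

    # Step 1 (assumed criterion): pre-filter, keep the most variable features.
    variances = X.var(axis=0)
    keep = np.argsort(variances)[::-1][:n_prefilter]

    # Step 2 (assumed criterion): univariate ranking by absolute Welch t-statistic.
    a, b = X[y == 0][:, keep], X[y == 1][:, keep]
    se = np.sqrt(a.var(axis=0, ddof=1) / len(a) + b.var(axis=0, ddof=1) / len(b))
    t = (a.mean(axis=0) - b.mean(axis=0)) / np.maximum(se, 1e-12)
    uni = keep[np.argsort(np.abs(t))[::-1][:n_univariate]]

    # Step 3 (assumed criterion): multivariate ranking by absolute ridge
    # regression coefficient, fitted jointly on standardized features.
    Z = X[:, uni]
    Z = (Z - Z.mean(axis=0)) / np.maximum(Z.std(axis=0), 1e-12)
    yc = y - y.mean()
    coef = np.linalg.solve(Z.T @ Z + ridge * np.eye(Z.shape[1]), Z.T @ yc)
    panel = uni[np.argsort(np.abs(coef))[::-1][:n_panel]]

    return {"prefiltered": keep, "univariate": uni, "panel": panel}


# Synthetic example: 40 samples, 500 features, 5 of them shifted in class 1.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 500))
y = np.repeat([0, 1], 20)
X[y == 1, :5] += 3.0

steps = funnel(X, y, n_prefilter=200, n_univariate=50, n_panel=5)
for name, idx in steps.items():
    print(name, len(idx))  # number of features remaining at each step
```

Each step selects only from the features that survived the previous one, so the feature count narrows monotonically, as in the funnel. A real analysis would choose the criteria and cut-offs to suit the data and would validate the resulting panel on independent samples.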