Log on / register
Feedback | Support | My details
Open AccessResearch article

The ordering of expression among a few genes can provide simple cancer biomarkers and signal BRCA1 mutations

Xue Lin1 email, Bahman Afsari2 email, Luigi Marchionni3 email, Leslie Cope3 email, Giovanni Parmigiani3,4 email, Daniel Naiman1 email and Donald Geman1,5 email

1Department of Applied Mathematics and Statistics, The Johns Hopkins University, Baltimore, Maryland, USA

2Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, Maryland, USA

3Department of Oncology, The Johns Hopkins Kimmel Cancer Center, Baltimore, Maryland, USA

4Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA

5Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland, USA

author email corresponding author email

BMC Bioinformatics 2009, 10:256doi:10.1186/1471-2105-10-256

Published: 20 August 2009

Abstract

Background

A major challenge in computational biology is to extract knowledge about the genetic nature of disease from high-throughput data. However, an important obstacle to both biological understanding and clinical applications is the "black box" nature of the decision rules provided by most machine learning approaches, which usually involve many genes combined in a highly complex fashion. Achieving biologically relevant results argues for a different strategy. A promising alternative is to base prediction entirely upon the relative expression ordering of a small number of genes.

Results

We present a three-gene version of "relative expression analysis" (RXA), a rigorous and systematic comparison with earlier approaches in a variety of cancer studies, a clinically relevant application to predicting germline BRCA1 mutations in breast cancer and a cross-study validation for predicting ER status. In the BRCA1 study, RXA yields high accuracy with a simple decision rule: in tumors carrying mutations, the expression of a "reference gene" falls between the expression of two differentially expressed genes, PPP1CB and RNF14. An analysis of the protein-protein interactions among the triplet of genes and BRCA1 suggests that the classifier has a biological foundation.

Conclusion

RXA has the potential to identify genomic "marker interactions" with plausible biological interpretation and direct clinical applicability. It provides a general framework for understanding the roles of the genes involved in decision rules, as illustrated for the difficult and clinically relevant problem of identifying BRCA1 mutation carriers.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.