BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access Research article

Profiling alternatively spliced mRNA isoforms for prostate cancer classification

Chaolin Zhang1,2, Hai-Ri Li3, Jian-Bing Fan4, Jessica Wang-Rodriguez5,7, Tracy Downs6,7, Xiang-Dong Fu3 and Michael Q Zhang1*

Author Affiliations

1 Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA

2 Department of Biomedical Engineering, State University of New York at Stony Brook, NY 11794, USA

3 Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093, USA

4 Illumina, Inc. San Diego, CA 92121, USA

5 Department of Pathology, University of California, San Diego, La Jolla, CA 92093, USA

6 Department of Surgery, University of California, San Diego, La Jolla, CA 92093, USA

7 VA San Diego Healthcare System, San Diego, CA 92161, USA

For all author emails, please log on.

BMC Bioinformatics 2006, 7:202 doi:10.1186/1471-2105-7-202

Published: 11 April 2006

Abstract

Background

Prostate cancer is one of the leading causes of cancer illness and death among men in the United States and world wide. There is an urgent need to discover good biomarkers for early clinical diagnosis and treatment. Previously, we developed an exon-junction microarray-based assay and profiled 1532 mRNA splice isoforms from 364 potential prostate cancer related genes in 38 prostate tissues. Here, we investigate the advantage of using splice isoforms, which couple transcriptional and splicing regulation, for cancer classification.

Results

As many as 464 splice isoforms from more than 200 genes are differentially regulated in tumors at a false discovery rate (FDR) of 0.05. Remarkably, about 30% of genes have isoforms that are called significant but do not exhibit differential expression at the overall mRNA level. A support vector machine (SVM) classifier trained on 128 signature isoforms can correctly predict 92% of the cases, which outperforms the classifier using overall mRNA abundance by about 5%. It is also observed that the classification performance can be improved using multivariate variable selection methods, which take correlation among variables into account.

Conclusion

These results demonstrate that profiling of splice isoforms is able to provide unique and important information which cannot be detected by conventional microarrays.