Do serum biomarkers really measure breast cancer?

Jesneck, Jonathan L; Mukherjee, Sayan; Yurkovetsky, Zoya; Clyde, Merlise; Marks, Jeffrey R; Lokshin, Anna E; Lo, Joseph Y

doi:10.1186/1471-2407-9-164

Research article
Open access
Published: 28 May 2009

Do serum biomarkers really measure breast cancer?

Jonathan L Jesneck^1,2,
Sayan Mukherjee^3,4,5,
Zoya Yurkovetsky^6,7,
Merlise Clyde³,
Jeffrey R Marks^4,8,
Anna E Lokshin^6,7 &
…
Joseph Y Lo^9,10,11

BMC Cancer volume 9, Article number: 164 (2009) Cite this article

10k Accesses
39 Citations
Metrics details

Abstract

Background

Because screening mammography for breast cancer is less effective for premenopausal women, we investigated the feasibility of a diagnostic blood test using serum proteins.

Methods

This study used a set of 98 serum proteins and chose diagnostically relevant subsets via various feature-selection techniques. Because of significant noise in the data set, we applied iterated Bayesian model averaging to account for model selection uncertainty and to improve generalization performance. We assessed generalization performance using leave-one-out cross-validation (LOOCV) and receiver operating characteristic (ROC) curve analysis.

Results

The classifiers were able to distinguish normal tissue from breast cancer with a classification performance of AUC = 0.82 ± 0.04 with the proteins MIF, MMP-9, and MPO. The classifiers distinguished normal tissue from benign lesions similarly at AUC = 0.80 ± 0.05. However, the serum proteins of benign and malignant lesions were indistinguishable (AUC = 0.55 ± 0.06). The classification tasks of normal vs. cancer and normal vs. benign selected the same top feature: MIF, which suggests that the biomarkers indicated inflammatory response rather than cancer.

Conclusion

Overall, the selected serum proteins showed moderate ability for detecting lesions. However, they are probably more indicative of secondary effects such as inflammation rather than specific for malignancy.

Peer Review reports

Background

The high prevalence of breast cancer motivates the development of better screening and diagnostic technologies. To complement mammography screening, which has moderate sensitivity [1] and specificity [2–4] and low positive-predictive value in younger women [1], researchers have investigated the efficacy of detecting breast cancer using proteins. Proteins offer detailed information about tissue health conditions, allowing the identification of cancer type and risk, and thereby prompting potentially better-targeted and more effective treatment. Some studies have correlated breast cancer prognosis with proteins in the tumor [5], such as hormone receptors, HER-2, urokinase plasminogen activator, and plasminogen activator inhibitor 1 [6, 7], and caPCNA [8]. However, accessing these local proteins requires biopsies, which is not practical for a screening regimen. We therefore consider serum proteins. Serum and plasma protein-based screening tests have already been developed for many diseases, such as Alzheimer's disease [9], cardiovascular disease [10], prostate cancer [11], and ovarian cancer [12].

For breast cancer, however, there are currently very few serum markers used clinically. Some studies have identified as possible breast cancer markers the proteins CA 15.3 [13–15], BR 27.29, tissue polypeptide antigen (TPA), tissue polypeptide specific antigen (TPS), shed HER-2 [15], and BC1, BC2, and BC3 [16, 17]. However, other studies found a lack of sufficient diagnostic ability in serum proteins, including CA 15.3 [13, 17–20], CA 125 [20], CA 19.9 [20], CA 125 [20], BR 27.29 [13, 18], and carcinoembryonic antigen (CEA) [13]. The European Group of Tumor Markers identified the MUC-1 mucin glycoproteins CA 15.3 and BR 27.29 as the best serum markers for breast cancer, but they could not recommend these proteins for diagnosis due to low sensitivity [18].

Cancer biomarkers are valued according to their specificity and sensitivity, suiting them for different clinical roles. For example, general population screening requires high sensitivity but not necessarily high specificity, if a low-cost secondary screen is available. Conversely, disease recurrence monitoring requires high specificity but not necessarily high sensitivity, if a more sensitive secondary test is available. Therefore to optimize clinical utility, it is important to measure the sensitivity and specificity of any proposed biomarker. Because solid tumors cause many changes in the surrounding tissue, it is likely that some potential biomarkers measure the body's response to cancer rather than the cancer itself. Cancer marker levels may be increased due to various secondary factors, such as therapy-mediated response [21–23] and benign diseases. For example, CA 15-3 levels increase in chronic active hepatitis, liver cirrhosis, sarcoidosis [24], hypothyroidism [25], megablastic anemia [26], and beta-thalassaemia [27]. To help measure biomarker specificity in our study, we included normal, benign, and malignant samples.

In general, breast cancer biomarker studies have found individual circulating proteins, but it is important to consider covariation of multiple protein levels. It is useful to know which combinations of proteins may yield high diagnostic performance, even though each protein individually might yield low performance [28, 29]. Some studies have detected discriminatory breast cancer biomarkers using mass spectrometry. [16, 30–36] Because of mass spectrometry's peak interpretation and reproducibility challenges [37, 38], scientists have searched for breast cancer biomarkers from predefined collections of known candidate markers using newer multiplex technologies, such as reverse-phase protein microarray [39]. To our knowledge, ours is the first study to assess a large set of serum proteins collectively by a sensitive and specific multiplex assay in order to identify the most promising proteins for breast cancer detection.

Although studies have shown correlations between serum proteins and breast cancer, it is often unclear how these correlations translate into clinical applicability for diagnosis. To quantify the diagnostic performance of a set of proposed biomarkers, we have implemented many statistical and machine-learning models. We have focused on Bayesian models, which provide full posterior predictive distributions. This detailed information would help a physician to judge how much emphasis to place on the classifier's diagnostic prediction.

The goal of this study was to quantify the ability of serum proteins to detect breast cancer. For measuring predictive performance on this noisy data set, we used many statistical classification models. To better understand the cancer-specificity of the screening test, we ran the classifiers on proteins from malignant lesions, benign lesions, and normal breast tissue. Our data indicated that some serum proteins can detect moderately well the presence of a breast lesion, but they could not distinguish benign from malignant lesions.

Methods

Enrolling subjects

This study enrolled 97 subjects undergoing diagnostic biopsy for breast cancer and 68 normal controls at Duke University Medical Center from June 1999 to October 2005. Women donating blood to this study were either undergoing image guided biopsy to diagnose a primary breast lesion (benign and cancer) or were undergoing routine screening mammography and had no evidence of breast abnormalities (normal). All subjects were enrolled only after obtaining written informed consent for this Duke IRB approved study (P.I., JRM, current registry number, 9204-07-11E1ER, Early Detection of Breast Cancer Using Circulating Markers). All subjects were premenopausal women. Table 1 shows the demographics of the study population. Additional file 1 shows subjects' ages and histology findings.

Table 1 Subject demographics

Full size table

Measuring serum protein levels with ELISA

Blood sera were collected under the HIPAA-compliant protocol "Blood and Tissue Bank for the Discovery and Validation of Circulating Breast Cancer Markers." Blood was collected from subjects prior to surgical resection. All specimens were collected in red-stoppered tubes and processed generally within 4 hours (but not greater than 12 hours) after collection and stored at -80°C. Sera were assayed using the Enzyme-Linked ImmunoSorbent Assay (ELISA, Luminex platform) and reagents in the Luminex Core Facility of University of Pittsburgh Cancer Institute. The Luminex protocol was optimized for analytical performance as described by Gorelik et al. [12]. One replicate per patient sample was performed with reactions from 100 beads measured and averaged. All samples were analyzed on the same day using the same lot of reagents. Complete information about characteristics of individual assays including inter- and intra-assay coefficients of variation (CVs) is available from the manufacturers of assays [see Additional file 2] and from the Luminex Core website http://www.upci.upmc.edu/luminex/sources.cfm. Based on our analysis of assays performed monthly within one month interval for 3 months using the same lot of reagents, the intra-assay CV for different analytes was in the range of 0.7–11% (typically < 5%) and inter-assay 3.7–19% (<11% for same lot reagents).

Biomarkers were selected based on the known literature reports about their association with breast cancer. The 98 assayed proteins are shown in Table 2, with further details in Additional file 2. In addition to the protein levels, patient age and race were also recorded.

Table 2 List of the 98 serum proteins measured by ELISA assay (Luminex platform)

Full size table

Regression with variable selection

In order to incorporate these proteins into a breast cancer screening tool, we built statistical models linking the protein levels to the probability of malignancy. We used the following three common regression models: linear regression Y _i= X _i β +ε _i, ε ~N(0, σ ²), logistic regression , and probit regression Pr(Y _i= 1|β) = Φ(X _i, β), where Y is the response vector (breast cancer diagnosis), X is the matrix of observed data (protein levels), β is the vector of coefficients, ε is additive noise, and Φ(·) is the cumulative distribution function of the normal distribution. These classical models become unstable and predict poorly if there are relatively few observations (curse of dataset sparsity [40]) and many features (curse of dimensionality [41]). It is better to choose a subset of useful features. We used stepwise feature selection [42–46] to choose the set of proteins that optimized model fit.

However, choosing only one feature subset for prediction comes with an inherent risk. When multiple possible statistical models fit the observed data similarly well, it is risky to make inferences and predictions based only on a single model [47]. In this case predictive performance suffers, because standard statistical inference typically ignores model uncertainty.

Accounting for model uncertainty with Bayesian model averaging

We accounted for model-selection ambiguity by using a Bayesian approach to average over the possible models. We considered a set of models M ₁,..., M _B, where each model M _kconsists of a family of distributions p(D|θ _k, M _k) indexed by the parameter vector θ _k, where D = (X, Y) is the observed data. Y is the response vector (breast cancer diagnosis), and X is the matrix of observed data (protein levels). Using a Bayesian method to average over the set of considered models [47–52], we first assigned a prior probability distribution p(D|M _k) to the parameters of each model M _k. This formulation allows a conditional factorization of the joint distribution,

(1)

Splitting the joint distribution in this way allowed us to implicitly embed the various models inside one large hierarchical mixture model. This form allowed us to fit these models using the computational machinery of Bayesian model averaging (BMA).

BMA accounts for model uncertainty by averaging over the posterior distributions of multiple models, allowing for more robust predictive performance. If we are interested in predicting a future observation D _ffrom the same process that generated the observed data D, then we can represent the predictive posterior distribution p(D _f|D) as an average over the models, weighted by their posterior probabilities [47, 53, 54]:

(2)

where the sum's first term p(D _f|D, M _k) is a posterior weighted mixture of conditional predictive distributions

(3)

and the sum's second term p(M _k|D) is a model's posterior distribution

(4)

which incorporates the model's marginal likelihood

(5)

Promoting computational efficiency with iterated BMA

BMA allows us to average over all possible models, containing all possible subsets of features. However, considering many models would require extensive computations, especially when computing the posterior predictive distributions. Such computations would be prohibitively long for a quick screening tool that is intended not to impede clinicians' workflow during routine breast cancer screening. Because it was computationally infeasible to consider all possible 2¹⁰⁰≈1.26*10³⁰ models, we first chose a set of the best fitting models. For computational efficiency in model selection, this study followed Yeung et al. [55] and used a deterministic search based on an Occam's window approach [54] and the "leaps and bounds" algorithm [56] to identify models with higher posterior probabilities.

In addition to choosing the best models, we also chose the best proteins. We applied an iterative adaptation of BMA [55]. This method initially ranks each feature separately by the ratio of between-group to within-group sum of squares (BSS/WSS) [57]. For protein j the ratio is

(6)

where I(·) is an indicator function, X _ijis the level of protein j in sample i, and are respectively the average levels of protein j in the normal and cancer groups, and is the average level of protein j over all samples.

Ordered by this BSS/WSS ranking, the features were iteratively fit into BMA models, which generated posterior probability distributions for the proteins' coefficients. We then discarded proteins that had low posterior probabilities of relevance, Pr(b _j≠ 0| D) < 1%, where

(7)

where Pr(b _j≠ 0| D) is the posterior probability that protein j's coefficient is nonzero, Γ is the subset of the considered models M ₁,..., M _Bthat include protein j. By discarding proteins that have small influence on classification, this iterative procedure keeps only the most relevant proteins.

Other models for high-dimensional data

To compare iterated BMA's classification and generalization performance, we also classified the data using two other dimensionality-reducing methods: a support vector machine (SVM) [58] with recursive feature selection [59, 60] and least-angle regression (LAR, a development of LASSO) [61].

All modeling was performed using the R statistical software (version 2.6.2), and specifically the BMA package (version 3.0.3) for iterated BMA, the packages e1071 (version 1.5–16) and R-SVM [31] for the SVM with recursive feature selection, the lars package (version 0.9–5) for least angle regression, and the ROCR package (version 1.0–2) for ROC analysis. We extended the BMA package to compute the full predictive distributions (See Equations 2–6) within cross-validation using an MCMC approach. Additional file 3 contains the R code and Additional file 4 contains the data in comma-delimited format.

Evaluating classification performance

The classifiers' performances were analyzed and compared using receiver operating characteristic (ROC) analysis [62]. ROC curve metrics were compared statistically using a nonparametric bootstrap method [63]. To estimate generalization performance on future cases, we first defined independent train and test sets by randomly choosing 70% of samples for training and optimizing the classifiers, and then we tested the classifiers on the remaining 30% of the samples. To compare with the train/test split, we also performed leave-one-out cross-validation (LOOCV). Feature selection was performed within each fold of the cross-validation.

Evaluating feature-Selection methods

We compared models with feature-selection techniques using two methods: feature concentration and classifier performance. For feature concentration, we performed feature selection within each fold of a LOOCV. We recorded how many times each feature was chosen. This method distinguished the feature selection methods that chose few versus many features. Using classifier performance, we investigated the effect of feature selection and sampling. We compared linear models run on the data with these four techniques: 1) no feature selection (using all the proteins in the model), 2) preselecting the features (using all the data to choose the best features, and then running the model using only those preselected features in LOOCV), 3) stepwise feature selection, and 4) iterated BMA. For each case, we ran the classifier in LOOCV.

Results

Classifier performance

The classifiers achieved moderate classification performance for both normal vs. malignant (AUC = 0.77 ± 0.02 on the test set, and AUC = 0.80 ± 0.02 for LOOCV) and normal vs. benign (AUC = 0.75 ± 0.03 on the test set, and AUC = 0.77 ± 0.02 for LOOCV), but very poor performance for malignant vs. benign tumors (AUC = 0.57 ± 0.05 on the test set, and AUC = 0.53 ± 0.03 for LOOCV). The classification performance is shown as ROC curves in Figure 1. Whereas the ROC curves show the classification performance over the entire range of prediction thresholds, we also considered the threshold of Pr(Y _i= 1|β) = 0.5 in particular. For this threshold, Table 3 shows the classification error. The models performed similarly, with approximately 20 false negatives and 10 false positives. All classifiers were run with leave-one-out cross-validation (LOOCV). The classifiers chose the best subsets of proteins for classification.

Table 3 Cross-validation classification errors, normal versus cancer

Full size table

Figure 2 plots the full posterior predictive distributions for BMA of probit models run with LOOCV. In general, the predictive distributions were more "decided" (concentrated further from the 0.5 probability line) for the tasks of normal vs. cancer and normal vs. benign, but they were less "decided" (concentrated closer to the 0.5 probability line) for benign vs. cancer. This trend indicated that the serum protein levels were very similar for malignant and benign lesions.

Selected serum proteins

The iterated BMA algorithm chose the best-fitting probit models. The chosen models and their proteins are shown in Figure 3. The best proteins for each classification task are listed in Table 4. The top protein for both normal vs. cancer and normal vs. benign was macrophage migration inhibitory factor (MIF), a known inflammatory agent [64–66]. Other selected proteins also play roles in inflammation and immune response, such as MMP-9 [67, 68], MPO [69], sVCAM-1 [70], ACTH [71], MICA [72], IL-5 [73], IL-12 p40 [74–76], MCP-1 [77], and IFNa [78–80]. For benign vs. cancer, the top protein was CA-125, which is used as a biomarker for ovarian cancer [12, 81–83]. However, the greater presence of CA-125 in cancer tissue was still too subtle to allow the classifiers to achieve good classification performance.

Table 4 Proteins chosen by BMA of linear models

Full size table

Complementary to the models' matrix plots for feature strength are the coefficients' marginal posterior probability distribution functions (PDFs), which the BMA technique calculates by including information from all considered models. Figure 4 shows the marginal posterior PDFs for the top coefficients for the BMA models. The coefficients' distributions are mixture models of a normal distribution and a point mass at zero. This point mass is much larger for the benign vs. cancer models than for normal vs. cancer and normal vs. benign models. The higher weight at zero indicates that the proteins are less suitable for distinguishing benign from malignant lesions than they are for distinguishing lesions from normal tissue.

Feature selection and Bayesian model averaging

While doing feature selection for normal vs. cancer within LOOCV, we recorded the counts of how many times each protein was selected. The selection frequencies are shown as a heatmap in Figure 5. Iterated BMA and least-angle regression selected the fewest proteins, whereas stepwise feature selection chose many more proteins. The strongest proteins were chosen consistently across all feature selection techniques, as shown by the horizontal lines in the figure.

We also investigated the effect of feature selection upon classifier generalization. Figure 6 shows the ROC and accuracy curves for linear models with various feature selection strategies. Preselecting the features generated a very optimistically biased classification performance. When the same feature selection technique (stepwise) was applied within each fold of LOOCV, the classification performance fell dramatically. Using no feature selection (using all proteins) had extremely poor classification performance – no better than guessing. The poor performance demonstrated that the linear model needs feature selection for good classification performance when the number of features is roughly the same as the number of samples in noisy data. Iterated BMA of linear models significantly outperformed the stepwise, single-model method. This performance increase demonstrated the better predictive ability of the BMA models; averaging a set of the most promising models was better than using only the single best model.

Discussion

The serum proteins allowed the classifiers to detect lesions (malignant or benign) from normal tissue moderately well, but they were very poor for distinguishing benign from malignant lesions. These classification results were consistent in both the test set and the leave-one-out cross-validation. This consistency implies that the classification results are not highly dependent on the sampling scheme but rather highlight consistent trends in our data set. The classification results show that the proteins were not specific for cancer and suggest that they may indicate states of biological or immunological stress. A good candidate for the dominating biological effect is inflammation, since the top protein selected for both normal vs. cancer and normal vs. benign was macrophage migration inhibitory factor (MIF), which is active in inflammation [64–66]. The best protein for distinguishing malignant from benign tumors was cancer antigen 125 (CA-125), which is a prominent serum marker for ovarian cancer [12, 81, 83]. However, CA-125 levels are influenced by other factors, such as age, hormone-replacement therapy, smoking habits, and use of oral contraceptives [84]. In general it is very difficult to ascertain whether biomarker proteins are generated by the cancer itself or by secondary effects, such as immune response. Once potential biomarker proteins are identified in initial studies, follow-up studies can focus on those proteins to discover their origin and role in the disease process. Helpful experimental study designs would control for known secondary causes of biomarker activity and would collect enough samples to average over unintended secondary causes. Longitudinal studies would also lessen the effect of transient secondary causes.

To quantify and compare classification performances, we used ROC analysis, which fairly compares classifiers that may be operating at different sensitivities due to arbitrary decision thresholds applied to the classifiers' output values. Although our data set comprised three classes (normal, benign, and cancer), current ROC methods required us to split an inherently three-class classification problem into three different two-class tasks: normal vs. benign, normal vs. cancer, and benign vs. cancer. The field of ROC analysis is still in development for the three-class problem; no consensus has yet been reached about how to quantitatively score the resulting six-dimensional ROC hypersurface [85–88]. However, for other methods of classifier comparison, such as the generalized Brier score or discrete counts of classification errors, full three-class models could have been used, albeit with decision thresholds.

This study investigated a group of 98 serum proteins (Table 2), which is relatively small sample of all detectable serum proteins. Future studies may identify other proteins with stronger relationships to breast cancer. Rather than relying on only a few proteins, upcoming protein technologies will allow the screening of large populations with protein-based tests that require a larger set of proteins. Microfluidics chips would simplify the process of automating blood tests in a high-throughput fashion. However, with current assay technology and cost-benefit analysis of screening programs, the fixed cost per protein assayed essentially limits the number of proteins that can be used for screening. To lower screening costs, we chose small subsets of the features via feature-selection methods. Iterated BMA and least-angle regression were able to classify well using a far smaller set of features than those chosen by stepwise feature selection.

High feature correlation impedes many feature-selection techniques. For stochastic feature-selection methods, two highly correlated features are each likely to be chosen in alternation. Similarly, a cluster of highly correlated features causes the feature selection technique to spread the feature selection rate among each feature in the cluster, essentially diluting each feature's selection rate. Severe dilution of selection rates can cause none of the cluster's features to be selected. Future work will entail adding cluster-based methods to the iterated BMA algorithm.

The currently proposed serum biomarkers for breast cancer are not sensitive or specific enough for breast cancer screening. However, better biomarkers may be identified with newer protein assay technology and larger data sets. A protein's subtle diagnostic ability may be enhanced by the assimilation of other medical information, such as gene expression and medical imaging. The proteins will boost diagnostic performance only if they provide complementary and non-redundant information with the clinical practice of mammograms, sonograms, and physical examination. The relationship of medical imaging and protein screening remains to be determined in future work.

Conclusion

We have performed feature-selection and classification techniques to identify blood serum proteins that are indicative of breast cancer in premenopausal women. The best features to detect breast cancer were MIF, MMP-9, and MPO. While the proteins could distinguish normal tissue from cancer and normal tissue from benign lesions, they could not distinguish benign from malignant lesions. Since the same protein (MIF) was chosen for both normal vs. cancer and normal vs. benign lesions, it is likely that this protein plays a role in the inflammatory response to a lesion, whether benign or malignant, rather than in a role specific for cancer. While the current set of proteins show moderate ability for detecting breast cancer, their true usefulness in a screening program remains to be seen in their integration with imaging-based screening practices.

Abbreviations

AUC:: for Area under the ROC curve
BMA:: for Bayesian model averaging
CV:: for coefficient of variation
ELISA:: for enzyme-linked immunosorbent assay
HIPAA:: for Health Insurance Portability and Accountability Act
LOOCV:: for leave-one-out cross-validation
MICA:: for human major histocompatibility complex class I chain-related A
MIF:: for macrophage migration inhibitory factor
MMP-9:: for matrix metalloproteinase 9
PDF:: for probability density function
and ROC:: for receiver operating characteristic.

References

Ferrini R, et al: Screening mammography for breast cancer: American College of Preventive Medicine practice policy statement. Am J Prev Med. 1996, 12 (5): 340-1.
CAS PubMed Google Scholar
Meyer JE, et al: Occult breast abnormalities: percutaneous preoperative needle localization. Radiology. 1984, 150: 335-337.
Article CAS PubMed Google Scholar
Rosenberg AL, et al: Clinically occult breast lesions: localization and significance. Radiology. 1987, 162: 167-170.
Article CAS PubMed Google Scholar
Yankaskas BC, et al: Needle localization biopsy of occult lesions of the breast. Radiology. 1988, 23: 729-733.
CAS Google Scholar
Kreunin P, et al: Proteomic profiling identifies breast tumor metastasis-associated factors in an isogenic model. Proteomics. 2007, 7 (2): 299-312.
Article CAS PubMed PubMed Central Google Scholar
Isaacs C, Stearns V, Hayes DF: New prognostic factors for breast cancer recurrence. Semin Oncol. 2001, 28 (1): 53-67.
Article CAS PubMed Google Scholar
Duffy MJ: Urokinase plasminogen activator and its inhibitor, PAI-1, as prognostic markers in breast cancer: from pilot to level 1 evidence studies. Clin Chem. 2002, 48 (8): 1194-7.
CAS PubMed Google Scholar
Malkas LH, et al: A cancer-associated PCNA expressed in breast cancer has implications as a potential biomarker. Proc Natl Acad Sci USA. 2006, 103 (51): 19472-7.
Article CAS PubMed PubMed Central Google Scholar
Hye A, et al: Proteome-based plasma biomarkers for Alzheimer's disease. Brain. 2006, 129 (Pt 11): 3042-50.
Article CAS PubMed Google Scholar
Wang TJ, et al: Multiple biomarkers for the prediction of first major cardiovascular events and death. N Engl J Med. 2006, 355 (25): 2631-9.
Article CAS PubMed Google Scholar
Polascik TJ, Oesterling JE, Partin AW: Prostate specific antigen: a decade of discovery–what we have learned and where we are going. J Urol. 1999, 162 (2): 293-306.
Article CAS PubMed Google Scholar
Gorelik E, et al: Multiplexed immunobead-based cytokine profiling for early detection of ovarian cancer. Cancer Epidemiol Biomarkers Prev. 2005, 14 (4): 981-7.
Article CAS PubMed Google Scholar
Duffy MJ: Serum tumor markers in breast cancer: are they of clinical value?. Clin Chem. 2006, 52 (3): 345-51.
Article CAS PubMed Google Scholar
Duffy MJ, et al: High preoperative CA 15-3 concentrations predict adverse outcome in node-negative and node-positive breast cancer: study of 600 patients with histologically confirmed breast cancer. Clin Chem. 2004, 50 (3): 559-63.
Article CAS PubMed Google Scholar
Cheung KL, Graves CR, Robertson JF: Tumour marker measurements in the diagnosis and monitoring of breast cancer. Cancer Treat Rev. 2000, 26 (2): 91-102.
Article CAS PubMed Google Scholar
Li J, et al: Proteomics and bioinformatics approaches for identification of serum biomarkers to detect breast cancer. Clin Chem. 2002, 48 (8): 1296-304.
CAS PubMed Google Scholar
Mathelin C, et al: Serum biomarkers for detection of breast cancers: A prospective study. Breast Cancer Res Treat. 2006, 96 (1): 83-90.
Article CAS PubMed Google Scholar
Molina R, et al: Tumor markers in breast cancer- European Group on Tumor Markers recommendations. Tumour Biol. 2005, 26 (6): 281-93.
Article PubMed Google Scholar
Lumachi F, et al: Relationship between tumor markers CEA and CA 15-3, TNM staging, estrogen receptor rate and MIB-1 index in patients with pT1–2 breast cancer. Anticancer Res. 2004, 24 (5B): 3221-4.
CAS PubMed Google Scholar
Skates SJ, et al: Pooling of case specimens to create standard serum sets for screening cancer biomarkers. Cancer Epidemiol Biomarkers Prev. 2007, 16 (2): 334-41.
Article CAS PubMed Google Scholar
Kiang DT, Greenberg LJ, Kennedy BJ: Tumor marker kinetics in the monitoring of breast cancer. Cancer. 1990, 65 (2): 193-9.
Article CAS PubMed Google Scholar
Yasasever V, et al: Utility of CA 15-3 and CEA in monitoring breast cancer patients with bone metastases: special emphasis on "spiking" phenomena. Clin Biochem. 1997, 30 (1): 53-6.
Article CAS PubMed Google Scholar
Pentheroudakis G, et al: The neutrophil, not the tumor: serum CA 15-3 elevation as a result of granulocyte–colony-stimulating factor-induced neutrophil MU1C overexpression and neutrophilia in patients with breast carcinoma receiving adjuvant chemotherapy. Cancer. 2004, 101 (8): 1767-75.
Article CAS PubMed Google Scholar
Colomer R, et al: Circulating CA 15-3 levels in the postsurgical follow-up of breast cancer patients and in non-malignant diseases. Breast Cancer Res Treat. 1989, 13 (2): 123-33.
Article CAS PubMed Google Scholar
Hashimoto T, Matsubara F: Changes in the tumor marker concentration in female patients with hyper-, eu-, and hypothyroidism. Endocrinol Jpn. 1989, 36 (6): 873-9.
Article CAS PubMed Google Scholar
Symeonidis A, et al: Increased serum CA-15.3 levels in patients with megaloblastic anemia due to vitamin B12 deficiency. Oncology. 2004, 67 (5–6): 359-67.
CAS PubMed Google Scholar
Symeonidis A, et al: Increased CA-15.3 levels in the serum of patients with homozygous beta-thalassaemia and sickle cell/beta-thalassaemia. Br J Haematol. 2006, 133 (6): 692-4.
Article CAS PubMed Google Scholar
Sahab ZJ, Semaan SM, Sang QXAS: Methodology and Applications of Disease Biomarker Identification in Human Serum. Biomarker Insights. 2007, 2: 21-43.
PubMed PubMed Central Google Scholar
Zissimopoulos A, et al: [Procollagen-I, collagen telopeptide I, CEA, CA 15-3 as compared to bone scintigraphy in patients with breast cancer]. Hell J Nucl Med. 2006, 9 (1): 60-4.
PubMed Google Scholar
Belluco C, et al: Serum proteomic analysis identifies a highly sensitive and specific discriminatory pattern in stage 1 breast cancer. Ann Surg Oncol. 2007, 14 (9): 2470-6.
Article PubMed Google Scholar
Bouchal P, et al: Biomarker discovery in low-grade breast cancer using isobaric stable isotope tags and two-dimensional liquid chromatography-tandem mass spectrometry (iTRAQ-2DLC-MS/MS) based quantitative proteomic analysis. J Proteome Res. 2009, 8 (1): 362-73.
Article CAS PubMed Google Scholar
Callesen AK, et al: Combined experimental and statistical strategy for mass spectrometry based serum protein profiling for diagnosis of breast cancer: a case-control study. J Proteome Res. 2008, 7 (4): 1419-26.
Article CAS PubMed Google Scholar
Laronga C, Drake RR: Proteomic approach to breast cancer. Cancer Control. 2007, 14 (4): 360-8.
PubMed Google Scholar
Rui Z, et al: Use of serological proteomic methods to find biomarkers associated with breast cancer. Proteomics. 2003, 3 (4): 433-9.
Article PubMed Google Scholar
Wulfkuhle JD, et al: New approaches to proteomic analysis of breast cancer. Proteomics. 2001, 1 (10): 1205-15.
Article CAS PubMed Google Scholar
Wulfkuhle JD, et al: Proteomics of human breast ductal carcinoma in situ. Cancer Res. 2002, 62 (22): 6740-9.
CAS PubMed Google Scholar
Callesen AK, et al: Reproducibility of mass spectrometry based protein profiles for diagnosis of breast cancer across clinical studies: a systematic review. J Proteome Res. 2008, 7 (4): 1395-402.
Article CAS PubMed Google Scholar
Gast MC, et al: SELDI-TOF MS serum protein profiles in breast cancer: assessment of robustness and validity. Cancer Biomark. 2006, 2 (6): 235-48.
CAS PubMed Google Scholar
Wulfkuhle JD, et al: Multiplexed cell signaling analysis of human breast cancer applications for personalized therapy. J Proteome Res. 2008, 7 (4): 1508-17.
Article CAS PubMed Google Scholar
Somorjai RL, Dolenko B, Baumgartner R: Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions. Bioinformatics. 2003, 19 (12): 1484-91.
Article CAS PubMed Google Scholar
Bellman RE: Adaptive control processes: a guided tour. 1961, Princeton, N.J.,: Princeton University Press, 255-
Chapter Google Scholar
Sahiner B, et al: Stepwise linear discriminant analysis in computer-aided diagnosis: the effect of finite sample size. Medical Imaging 1999: Image Processing. 1999, San Diego, CA, USA: SPIE
Google Scholar
Sahiner B, et al: Feature selection and classifier performance in computer-aided diagnosis: the effect of finite sample size. Medical Physics. 2000, 27 (7): 1509-22.
Article CAS PubMed Google Scholar
Lachenbruch PA: Discriminant analysis. 1975, New York,: Hafner Press, ix: 128-
Google Scholar
Tatsuoka MM: Multivariate analysis; techniques foreducational and psychological research. 1971, New York,:Wiley, xiii: 310-
Google Scholar
Draper NR, Smith H: Applied regression analysis. Wiley series in probability and statistics. Texts and references section. 1998, New York: Wiley, xvii: 706-3
Google Scholar
Hoeting JA, et al: Bayesian model Averaging: A Tutorial. Statistical Science. 1999, 14 (4): 382-417.
Article Google Scholar
Hodges JS: Uncertainty, Policy Analysis and Statistics. Statistical Science. 1987, 2 (3): 259-275.
Article Google Scholar
Draper D: Assessment and Propagation of Model Uncertainty. Journal of the Royal Statistical Society. Series B (Methodological). 1995, 57 (1): 45-97.
Google Scholar
Clyde M, George EI: Model Uncertainty. Statistical Science. 2004, 19 (1): 81-94.
Article Google Scholar
Berger J, Pericchi L: Objective Bayesian methods for model section. Model Section. Edited by: Lahiri P. 2001, IMS: Beachwood, OH, 135-207.
Chapter Google Scholar
Chipman H, George EI, McCulloch R: The Practical Implementation of Bayesian Model Selection. Model Selection. Edited by: Lahiri P. 2001, IMS: Beachwood, OH, 65-134.
Chapter Google Scholar
Raftery AE: Bayesian Model Selection in Social Research. Sociological Methodology. 1995, 25: 111-163.
Article Google Scholar
Madigan D, Raftery AE: Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window. Journal of the American Statistical Association. 1994, 89 (428): 1535-1546.
Article Google Scholar
Yeung KY, Bumgarner RE, Raftery AE: Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data. Bioinformatics. 2005, 21 (10): 2394-2402.
Article CAS PubMed Google Scholar
Furnival GM, Wilson RW: Regressions by Leaps and Bounds. Technometrics. 1974, 16 (4): 499-511.
Article Google Scholar
Dudoit S, Fridlyand J, Speed TP: Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. Journal of the American Statistical Association. 2002, 97 (457): 77-87.
Article CAS Google Scholar
Vapnik NV: The Nature of Statistical Learning Theory. Statistics for Engineering and Information Science. 1995, New York, NY: Springer-Verlag New York, Inc, 188-2
Google Scholar
Guyon I, et al: Gene Selection for Cancer Classification using Support Vector Machines. Machine Learning. 2002, 46 (1): 389-422.
Article Google Scholar
Zhang X, et al: Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data. BMC Bioinformatics. 2006, 7: 197-
Article PubMed PubMed Central Google Scholar
Efron B, et al: Least Angle Regression. The Annals of Statistics. 2004, 32 (2): 407-451.
Article Google Scholar
Obuchowski NA: Receiver operating characteristic curves and their use in radiology. Radiology. 2003, 229 (1): 3-8.
Article PubMed Google Scholar
Efron B, Tibshirani RJ: An Introduction to the Bootstrap. Monographs on Statistics and Applied Probability. Edited by: Cox DR, et al. 1993, New York, NY: Chapman & Hall, 436-
Google Scholar
Lolis E, Bucala R: Macrophage migration inhibitory factor. Expert Opin Ther Targets. 2003, 7 (2): 153-64.
Article CAS PubMed Google Scholar
Morand EF: New therapeutic target in inflammatory disease: macrophage migration inhibitory factor. Intern Med J. 2005, 35 (7): 419-26.
Article CAS PubMed Google Scholar
Bucala R: MIF rediscovered: cytokine, pituitary hormone, and glucocorticoid-induced regulator of the immune response. Faseb J. 1996, 10 (14): 1607-13.
CAS PubMed Google Scholar
Poitevin S, et al: Type I collagen induces tissue factor expression and matrix metalloproteinase 9 production in human primary monocytes through a redox-sensitive pathway. J Thromb Haemost. 2008, 6 (9): 1586-94.
Article CAS PubMed Google Scholar
Vu TH, et al: MMP-9/gelatinase B is a key regulator of growth plate angiogenesis and apoptosis of hypertrophic chondrocytes. Cell. 1998, 93 (3): 411-22.
Article CAS PubMed PubMed Central Google Scholar
Eiserich JP, et al: Myeloperoxidase, a leukocyte-derived vascular NO oxidase. Science. 2002, 296 (5577): 2391-4.
Article CAS PubMed Google Scholar
Gearing AJ, Newman W: Circulating adhesion molecules in disease. Immunol Today. 1993, 14 (10): 506-12.
Article CAS PubMed Google Scholar
Gwynne JT, et al: Adrenal cholesterol uptake from plasma lipoproteins: regulation by corticotropin. Proc Natl Acad Sci USA. 1976, 73 (12): 4329-33.
Article CAS PubMed PubMed Central Google Scholar
Bauer S, et al: Activation of NK cells and T cells by NKG2D, a receptor for stress-inducible MICA. Science. 1999, 285 (5428): 727-9.
Article CAS PubMed Google Scholar
Clutterbuck EJ, Hirst EM, Sanderson CJ: Human interleukin-5 (IL-5) regulates the production of eosinophils in human bone marrow cultures: comparison and interaction with IL-1, IL-3, IL-6, and GMCSF. Blood. 1989, 73 (6): 1504-12.
CAS PubMed Google Scholar
Jana M, et al: Induction of tumor necrosis factor-alpha (TNF-alpha) by interleukin-12 p40 monomer and homodimer in microglia and macrophages. J Neurochem. 2003, 86 (2): 519-28.
Article CAS PubMed PubMed Central Google Scholar
Mattner F, et al: Treatment with homodimeric interleukin-12 (IL-12) p40 protects mice from IL-12-dependent shock but not from tumor necrosis factor alpha-dependent shock. Infect Immun. 1997, 65 (11): 4734-7.
CAS PubMed PubMed Central Google Scholar
Ma X, et al: The interleukin 12 p40 gene promoter is primed by interferon gamma in monocytic cells. J Exp Med. 1996, 183 (1): 147-57.
Article CAS PubMed Google Scholar
Taub DD, et al: Monocyte chemotactic protein-1 (MCP-1), -2, and -3 are chemotactic for human T lymphocytes. J Clin Invest. 1995, 95 (3): 1370-6.
Article CAS PubMed PubMed Central Google Scholar
Braganca J, Civas A: Type I interferon gene expression: differential expression of IFN-A genes induced by viruses and double-stranded RNA. Biochimie. 1998, 80 (8–9): 673-87.
Article CAS PubMed Google Scholar
Prummel MF, Laurberg P: Interferon-alpha and autoimmune thyroid disease. Thyroid. 2003, 13 (6): 547-51.
Article CAS PubMed Google Scholar
Imagawa A, et al: Autoimmune endocrine disease induced by recombinant interferon-alpha therapy for chronic active type C hepatitis. J Clin Endocrinol Metab. 1995, 80 (3): 922-6.
CAS PubMed Google Scholar
Nossov V, et al: The early detection of ovarian cancer: from traditional methods to proteomics. Can we really do better than serum CA-125?. Am J Obstet Gynecol. 2008, 199 (3): 215-23.
Article CAS PubMed Google Scholar
Mano A, et al: CA-125 AUC as a predictor for epithelial ovarian cancer relapse. Cancer Biomark. 2008, 4 (2): 73-81.
PubMed Google Scholar
Mano A, et al: CA-125 AUC as a new prognostic factor for patients with ovarian cancer. Gynecol Oncol. 2005, 97 (2): 529-34.
Article CAS PubMed Google Scholar
Dehaghani AS, et al: Factors influencing serum concentration of CA125 and CA15-3 in Iranian healthy postmenopausal women. Pathol Oncol Res. 2007, 13 (4): 360-4.
Article CAS PubMed Google Scholar
Dreiseitl S, Ohno-Machado L, Binder M: Comparing three-class diagnostic tests by three-way ROC analysis. Med Decis Making. 2000, 20 (3): 323-31.
Article CAS PubMed Google Scholar
Xin H, et al: Three-class ROC analysis-a decision theoretic approach under the ideal observer framework. Medical Imaging, IEEE Transactions on. 2006, 25 (5): 571-581.
Article Google Scholar
Edwards DC, et al: Estimating three-class ideal observer decision variables for computerized detection and classification of mammographic mass lesions. Med Phys. 2004, 31 (1): 81-90.
Article PubMed Google Scholar
Chan HP, et al: Design of three-class classifiers in computer-aided diagnosis: Monte Carlo simulation study. Medical Imaging 2003: Image Processing. 2003, San Diego, CA, USA: SPIE
Google Scholar

Pre-publication history

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2407/9/164/prepub

Download references

Acknowledgements

The work was supported in part by the NIH (NIH CA 84955 and R01 CA-112437-01) and the U.S. Army Breast Cancer Research Program (Grant No. W81XWH-05-1-0292).

Author information

Authors and Affiliations

Cancer Program, Broad Institute of Harvard University and Massachusetts Institute of Technology, Cambridge, MA, 02142, USA
Jonathan L Jesneck
Department of Pediatric Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Jonathan L Jesneck
Department of Statistical Science, Duke University, Durham, NC, 27708, USA
Sayan Mukherjee & Merlise Clyde
Institute for Genome Sciences and Policy, Duke University, Durham, NC, 27708, USA
Sayan Mukherjee & Jeffrey R Marks
Department of Computer Science, Duke University, Durham, NC, 27708, USA
Sayan Mukherjee
Division of Hematology/Oncology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213, USA
Zoya Yurkovetsky & Anna E Lokshin
University of Pittsburgh Cancer Institute, Pittsburgh, PA, 15213, USA
Zoya Yurkovetsky & Anna E Lokshin
Department of Experimental Surgery, Duke University Medical Center, Durham, North Carolina, 27710, USA
Jeffrey R Marks
Duke Advanced Imaging Labs, Department of Radiology, Duke University Medical Center, Durham, North Carolina, 27710, USA
Joseph Y Lo
Department of Biomedical Engineering, Duke University, Durham, NC, 27708, USA
Joseph Y Lo
Medical Physics Graduate Program, Duke University, Durham, NC, 27708, USA
Joseph Y Lo

Authors

Jonathan L Jesneck
View author publications
You can also search for this author in PubMed Google Scholar
Sayan Mukherjee
View author publications
You can also search for this author in PubMed Google Scholar
Zoya Yurkovetsky
View author publications
You can also search for this author in PubMed Google Scholar
Merlise Clyde
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey R Marks
View author publications
You can also search for this author in PubMed Google Scholar
Anna E Lokshin
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Y Lo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Joseph Y Lo.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JRM coordinated the sample collection and study design. AEL and ZY conducted the ELISA assays. JLJ performed the computations and statistical analysis. SM and MC guided the use of Bayesian Model Averaging in the study. JLJ and JYL contributed to the drafting of the manuscript, which was approved by all authors.

Electronic supplementary material

12885_2008_1498_MOESM1_ESM.xls

Additional File 1: Subject diagnoses and protein levels. These data respresent this study's selected serum proteins and their antibodies. (XLS 135 KB)

12885_2008_1498_MOESM2_ESM.xls

Additional File 2: Proteins and antibodies. These data show sample information with the serum protein levels measured by ELISA. (XLS 57 KB)

12885_2008_1498_MOESM3_ESM.zip

Additional File 3: Classification code in R. This file is a compressed folder containing this study's source code for the statistical program R. (ZIP 24 KB)

12885_2008_1498_MOESM4_ESM.csv

Additional File 4: Serum protein data. This file is a comma-delimited file of blood serum ELISA data intended for computational analysis in the R scripts of Additional file 1. (CSV 101 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Jesneck, J.L., Mukherjee, S., Yurkovetsky, Z. et al. Do serum biomarkers really measure breast cancer?. BMC Cancer 9, 164 (2009). https://doi.org/10.1186/1471-2407-9-164

Download citation

Received: 06 September 2008
Accepted: 28 May 2009
Published: 28 May 2009
DOI: https://doi.org/10.1186/1471-2407-9-164

Do serum biomarkers really measure breast cancer?

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Enrolling subjects

Measuring serum protein levels with ELISA

Regression with variable selection

Accounting for model uncertainty with Bayesian model averaging

Promoting computational efficiency with iterated BMA

Other models for high-dimensional data

Evaluating classification performance

Evaluating feature-Selection methods

Results

Classifier performance

Selected serum proteins

Feature selection and Bayesian model averaging

Discussion

Conclusion

Abbreviations

References

Pre-publication history

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Cancer

Contact us