<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1755-8794-5-63</ui>
	<ji>1755-8794</ji>
	<fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>Clinical and multiple gene expression variables in survival analysis of breast cancer: Analysis with the hypertabastic survival model</p>
			</title>
			<aug>
				<au id="A1"><snm>Tabatabai</snm><mi>A</mi><fnm>Mohammad</fnm><insr iid="I1"/><email>mtabatabai@cameron.edu</email></au>
				<au id="A2" ca="yes"><snm>Eby</snm><mi>M</mi><fnm>Wayne</fnm><insr iid="I1"/><email>weby@cameron.edu</email></au>
				<au id="A3"><snm>Nimeh</snm><fnm>Nadim</fnm><insr iid="I2"/><email>nadim.nimeh@ccswok.org</email></au>
				<au id="A4"><snm>Li</snm><fnm>Hong</fnm><insr iid="I1"/><email>lhong@cameron.edu</email></au>
				<au id="A5"><snm>Singh</snm><mi>P</mi><fnm>Karan</fnm><insr iid="I3"/><email>kpsingh@uab.edu</email></au>
			</aug>
			<insg>
				<ins id="I1"><p>Department of Mathematical Sciences, Cameron University, Lawton, OK, 73505, USA</p></ins>
				<ins id="I2"><p>Cancer Centers of Southwest Oklahoma, Lawton, OK, 73505, USA</p></ins>
				<ins id="I3"><p>Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, 35295, USA</p></ins>
			</insg>
			<source>BMC Medical Genomics</source>
			<issn>1755-8794</issn>
			<pubdate>2012</pubdate>
			<volume>5</volume>
			<issue>1</issue>
			<fpage>63</fpage>
			<url>http://www.biomedcentral.com/1755-8794/5/63</url>
			<xrefbib><pubidlist><pubid idtype="doi">10.1186/1755-8794-5-63</pubid><pubid idtype="pmpid">23241496</pubid></pubidlist></xrefbib>
		</bibl>
		<history><rec><date><day>28</day><month>10</month><year>2011</year></date></rec><acc><date><day>27</day><month>11</month><year>2012</year></date></acc><pub><date><day>14</day><month>12</month><year>2012</year></date></pub></history>
		<cpyrt><year>2012</year><collab>Tabatabai et al.; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
		<kwdg>
			<kwd>Hypertabastic survival models</kwd>
			<kwd>Gene expression variables</kwd>
			<kwd>Breast cancer biomarkers</kwd>
			<kwd>Seventy gene signature</kwd>
			<kwd>ErbB2 overexpression</kwd>
			<kwd>Fibroblast core serum response</kwd>
		</kwdg>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>We explore the benefits of applying a new proportional hazard model to analyze survival of breast cancer patients. As a parametric model, the hypertabastic survival model offers a closer fit to experimental data than Cox regression, and furthermore provides explicit survival and hazard functions which can be used as additional tools in the survival analysis. In addition, one of our main concerns is utilization of multiple gene expression variables. Our analysis treats the important issue of interaction of different gene signatures in the survival analysis.</p>
				</sec>
				<sec>
					<st>
						<p>Methods</p>
					</st>
					<p>The hypertabastic proportional hazards model was applied in survival analysis of breast cancer patients. This model was compared, using statistical measures of goodness of fit, with models based on the semi-parametric Cox proportional hazards model and the parametric log-logistic and Weibull models. The explicit functions for hazard and survival were then used to analyze the dynamic behavior of hazard and survival functions.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>The hypertabastic model provided the best fit among all the models considered. Use of multiple gene expression variables also provided a considerable improvement in the goodness of fit of the model, as compared to use of only one. By utilizing the explicit survival and hazard functions provided by the model, we were able to determine the magnitude of the maximum rate of increase in hazard, and the maximum rate of decrease in survival, as well as the times when these occurred. We explore the influence of each gene expression variable on these extrema. Furthermore, in the cases of continuous gene expression variables, represented by a measure of correlation, we were able to investigate the dynamics with respect to changes in gene expression.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st>
					<p>We observed that use of three different gene signatures in the model provided a greater combined effect and allowed us to assess the relative importance of each in determination of outcome in this data set. These results point to the potential to combine gene signatures to a greater effect in cases where each gene signature represents some distinct aspect of the cancer biology. Furthermore we conclude that the hypertabastic survival models can be an effective survival analysis tool for breast cancer patients.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>A number of important papers in recent years have used gene expression as a predictor of outcome in cancer patients, and it has become clear that this genomic information will greatly improve prognostic capabilities. For the statistical survival analysis, these papers have used the semi-parametric Cox proportional hazards model and the Kaplan-Meier estimator for the survival and hazard curves. One purpose of this paper is to show the advantages gained by using a parametric model, which provides explicitly defined, continuous hazard and survival functions as tools for analysis. Parametric models are in general more accurate when their distributional assumptions hold, and the recently introduced hypertabastic model 
				<abbrgrp>
					<abbr bid="B1">1</abbr>
				</abbrgrp> is shown to provide a better fit for the data set under consideration than the competing Weibull and log-logistic parametric models. Although there may be concern about using a parametric model rather than the semi-parametric Cox model when the distribution of the data is unknown, parametric models have greater accuracy and provide more detailed information when they are applicable. The hypertabastic model has been shown to be robust with respect to departure of the data from the assumed distribution 
				<abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>, making it an appropriate model to use in describing a wide variety of survival data. This model has also been shown to provide a good fit to breast cancer survival data in a recent paper 
				<abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp>. Using the explicit hazard and survival functions provided by this model we demonstrate some of the potential for analysis of temporal dynamics of the progression of hazard and decrease in survival. We are able to use the survival function to explicitly compute probability of survival to a given time, and this prediction takes into account an individual patient&#8217;s profile with respect to any significant variables included in the model.</p>
			<p>Breast cancer patients with similar clinical profiles may experience widely differing outcomes and different responses to therapy, so more accurate means of prognosis would fill an important need. Finding variables with greater prognostic power was a primary goal in the development of gene expression signatures for breast cancer outcome. Early papers using gene expression to predict the progression of breast cancer identified several distinct categories 
				<abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp>, which have become linked to molecular subtype. The different molecular subtypes had different prognoses, with basal-like and ErbB2+ tumors being more invasive and carrying an increased risk of recurrence, while luminal tumors are characterized by less invasiveness and a better response to treatment. Luminal tumors were later subdivided 
				<abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp> into Luminal A and Luminal B, with distinct prognoses. The authors of 
				<abbrgrp>
					<abbr bid="B5">5</abbr>
				</abbrgrp> used microarrays and statistical methods to determine a list of genes whose expression correlated strongly with a positive outcome for the patients, based on short term distant metastasis. This research established a 70 gene signature which could be used to classify tumors as having a poor or good prognosis. Many other teams of researchers, such as 
				<abbrgrp>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
				</abbrgrp>, have also used similar methods to establish a gene expression signature highly correlated with patient outcome. Based on the older idea 
				<abbrgrp>
					<abbr bid="B8">8</abbr>
				</abbrgrp> that tumors and wounds produce a similar microenvironment which facilitates proliferation and migration of cells and stimulates angiogenesis, the papers of Chang and collaborators 
				<abbrgrp>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
				</abbrgrp> determined the prognostic capabilities of gene expression signatures associated with wound healing.</p>
			<p>More recently researchers 
				<abbrgrp>
					<abbr bid="B11">11</abbr>
					<abbr bid="B12">12</abbr>
				</abbrgrp> have addressed issues of developing these methods for use together with standard variables for prognosis in clinical cases. In particular, 
				<abbrgrp>
					<abbr bid="B12">12</abbr>
				</abbrgrp> used model selection with Cox regression to determine the best set of predictors from among the standard clinical variables and a collection of hundreds of gene signatures. These researchers came to the conclusion that gene expression variables are the most powerful predictors, and that most of these gene signatures are comparable to the others in prognostic power. However, addition of clinical variables to the model yielded a small increase in the power of the model. Other researchers 
				<abbrgrp>
					<abbr bid="B13">13</abbr>
					<abbr bid="B14">14</abbr>
					<abbr bid="B15">15</abbr>
				</abbrgrp> have also noted that different gene expression signatures carry much of the same information. These researchers do not expect use of several different signatures to yield much improvement in prognosis. However, we note that Chang et al. 
				<abbrgrp>
					<abbr bid="B10">10</abbr>
				</abbrgrp> proposed use of both the seventy gene signature and the wound expression gene signature to a combined effect in prediction of patient risk. Furthermore the work of 
				<abbrgrp>
					<abbr bid="B16">16</abbr>
				</abbrgrp> develops a computational approach for prognosis which uses both gene expression and a means of classification into molecular subtype. The current study investigates the interaction between clinical variables and several gene signatures as predictors for outcome in breast cancer patients. We have found that combining several gene expression variables provides a model that best fits the survival data. Consistent with the results of Chang et al. 
				<abbrgrp>
					<abbr bid="B10">10</abbr>
				</abbrgrp> the model uses the seventy gene signature of 
				<abbrgrp>
					<abbr bid="B5">5</abbr>
				</abbrgrp> together with core serum response, a wound healing signature developed in 
				<abbrgrp>
					<abbr bid="B9">9</abbr>
				</abbrgrp>. In addition one of the gene expression signatures from 
				<abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp> for classification into molecular subtype is shown to be statistically significant. This particular gene signature for ErbB2+ overexpression also relates to important aspects of the underlying breast cancer tumor biology explored by numerous researchers. The issue of what happens in the interactions of several significant gene expression variables also arises inherently in these considerations.</p>
			<p>Clinical trials have begun for gene expression signatures in breast cancer 
				<abbrgrp>
					<abbr bid="B17">17</abbr>
					<abbr bid="B18">18</abbr>
				</abbrgrp>, and these biomarkers can be expected to soon become available for use in the clinical setting. Furthermore researchers have begun development of a second generation of gene expression signatures, including analysis of signatures from nearby stromal cells 
				<abbrgrp>
					<abbr bid="B19">19</abbr>
				</abbrgrp>, immune response 
				<abbrgrp>
					<abbr bid="B20">20</abbr>
				</abbrgrp>, and mutations in cancer related pathways 
				<abbrgrp>
					<abbr bid="B21">21</abbr>
				</abbrgrp>. Gene expression profiles have additionally been developed for other aspects of breast cancer therapy response 
				<abbrgrp>
					<abbr bid="B22">22</abbr>
				</abbrgrp>, including response to radiotherapy and response to chemotherapy 
				<abbrgrp>
					<abbr bid="B23">23</abbr>
					<abbr bid="B24">24</abbr>
					<abbr bid="B25">25</abbr>
					<abbr bid="B26">26</abbr>
				</abbrgrp>.</p>
			<p>The combined model we form in this paper illustrates how a quantitative prediction of hazard and survival can be formed that incorporates the predictive capabilities of these three gene expression variables. Note that each of these variables has medical significance in breast cancer progression. In our discussion of this model in the Results and discussion section, we explore the role of these variables, how they affect one another in the context of the model, and what information can be gained from variation in the levels of CSR correlation, ErbB2+ correlation, and good or poor seventy gene signature. This analysis addresses the important issue of how multiple gene expression signatures representing different aspects of the underlying biology can be combined and how they may interact. We have found a partial answer in the context of the given model; however, it is far from a complete answer to this important question. We believe this issue should receive further attention and possibly alternative approaches in modeling.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<p>Here we present the proportional hazard form of the hypertabastic model, which will be applied in the survival analysis of the breast cancer patients. One important feature of the hypertabastic survival model is the ability of the hazard function to assume many different shapes, in contrast to the Weibull, lognormal, and log-logistic distributions. The hypertabastic distribution function is defined as</p>
			<p>
				<display-formula>
					<m:math name="1755-8794-5-63-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi>F</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi>t</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mo stretchy="true">{</m:mo>
   <m:mtable columnalign="center">
      <m:mtr columnalign="center">
         <m:mtd columnalign="center">
            <m:mrow>
               <m:mn>1</m:mn>
               <m:mo>&#8722;</m:mo>
               <m:mi>s</m:mi>
               <m:mi>e</m:mi>
               <m:mi>c</m:mi>
               <m:mi>h</m:mi>
               <m:mfenced open="(" close=")">
                  <m:mrow>
                     <m:mi>&#945;</m:mi>
                     <m:mfenced open="[" close="]">
                        <m:mrow>
                           <m:mn>1</m:mn>
                           <m:mo>&#8722;</m:mo>
                           <m:msup>
                              <m:mi>t</m:mi>
                              <m:mi>&#946;</m:mi>
                           </m:msup>
                           <m:mi>c</m:mi>
                           <m:mi>o</m:mi>
                           <m:mi>t</m:mi>
                           <m:mi>h</m:mi>
                           <m:mfenced open="(" close=")">
                              <m:msup>
                                 <m:mi>t</m:mi>
                                 <m:mi>&#946;</m:mi>
                              </m:msup>
                           </m:mfenced>
                        </m:mrow>
                     </m:mfenced>
                     <m:mo>/</m:mo>
                     <m:mi>&#946;</m:mi>
                  </m:mrow>
               </m:mfenced>
            </m:mrow>
         </m:mtd>
      </m:mtr>
      <m:mtr columnalign="center">
         <m:mtd columnalign="center">
            <m:mn>0</m:mn>
         </m:mtd>
      </m:mtr>
   </m:mtable>
   <m:mtable columnalign="center">
      <m:mtr columnalign="center">
         <m:mtd columnalign="center">
            <m:mrow>
               <m:mi>t</m:mi>
               <m:mo>></m:mo>
               <m:mn>0</m:mn>
            </m:mrow>
         </m:mtd>
      </m:mtr>
      <m:mtr columnalign="center">
         <m:mtd columnalign="center">
            <m:mrow>
               <m:mi>t</m:mi>
               <m:mo>&#8804;</m:mo>
               <m:mn>0</m:mn>
               <m:mo>.</m:mo>
            </m:mrow>
         </m:mtd>
      </m:mtr>
   </m:mtable>
</m:mrow>
</m:math>
				</display-formula>
			</p>
			<p>The hypertabastic proportional hazard model has a hazard function of the form</p>
			<p>
				<display-formula id="M1">
					<m:math name="1755-8794-5-63-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi>h</m:mi>
   <m:mfenced open="(" close=")">
      <m:mrow>
         <m:mi>t</m:mi>
         <m:mo stretchy="true">|</m:mo>
         <m:mi>x</m:mi>
         <m:mo>,</m:mo>
         <m:mi>&#952;</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:msub>
      <m:mi>h</m:mi>
      <m:mn>0</m:mn>
   </m:msub>
   <m:mfenced open="(" close=")">
      <m:mi>t</m:mi>
   </m:mfenced>
   <m:mi>g</m:mi>
   <m:mfenced open="(" close=")">
      <m:mrow>
         <m:mi>x</m:mi>
         <m:mo stretchy="true">|</m:mo>
         <m:mi>&#952;</m:mi>
      </m:mrow>
   </m:mfenced>
</m:mrow>
</m:math>
				</display-formula>
			</p>
			<p>where h<sub>0</sub>(t) is the baseline hazard function, given by</p>
			<p>
				<display-formula>
					<m:math name="1755-8794-5-63-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:msub>
      <m:mi>h</m:mi>
      <m:mn>0</m:mn>
   </m:msub>
   <m:mfenced open="(" close=")">
      <m:mi>t</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi>&#945;</m:mi>
   <m:mfenced open="[" close="]">
      <m:mrow>
         <m:msup>
            <m:mi>t</m:mi>
            <m:mrow>
               <m:mn>2</m:mn>
               <m:mi>&#946;</m:mi>
               <m:mo>&#8722;</m:mo>
               <m:mn>1</m:mn>
            </m:mrow>
         </m:msup>
         <m:mi>c</m:mi>
         <m:mi>s</m:mi>
         <m:mi>c</m:mi>
         <m:msup>
            <m:mi>h</m:mi>
            <m:mn>2</m:mn>
         </m:msup>
         <m:mfenced open="(" close=")">
            <m:msup>
               <m:mi>t</m:mi>
               <m:mi>&#946;</m:mi>
            </m:msup>
         </m:mfenced>
         <m:mo>&#8722;</m:mo>
         <m:msup>
            <m:mi>t</m:mi>
            <m:mrow>
               <m:mi>&#946;</m:mi>
               <m:mo>&#8722;</m:mo>
               <m:mn>1</m:mn>
            </m:mrow>
         </m:msup>
         <m:mi>c</m:mi>
         <m:mi>o</m:mi>
         <m:mi>t</m:mi>
         <m:mi>h</m:mi>
         <m:mfenced open="(" close=")">
            <m:msup>
               <m:mi>t</m:mi>
               <m:mi>&#946;</m:mi>
            </m:msup>
         </m:mfenced>
      </m:mrow>
   </m:mfenced>
   <m:mi>t</m:mi>
   <m:mi>a</m:mi>
   <m:mi>n</m:mi>
   <m:mi>h</m:mi>
   <m:mfenced open="[" close="]">
      <m:mrow>
         <m:mi>W</m:mi>
         <m:mfenced open="(" close=")">
            <m:mi>t</m:mi>
         </m:mfenced>
      </m:mrow>
   </m:mfenced>
</m:mrow>
</m:math>
				</display-formula>
			</p>
			<p>and where <it>W</it>(<it>t</it>)&#8201;=&#8201;<it>&#945;</it>[1&#8201;&#8722;&#8201;<it>t</it>
				<sup>
					<it>&#946;</it>
				</sup>
				<it>coth</it>(<it>t</it>
				<sup>
					<it>&#946;</it>
				</sup>)]/<it>&#946;</it>,&#8201;and <it>&#945;</it>,&#8201;<it>&#946;</it>&#8201;&gt;&#8201;0. These parameters &#945; and &#946; provide the flexibility of the hazard function to conform to the given data set. See 
				<abbrgrp>
					<abbr bid="B1">1</abbr>
				</abbrgrp> for examples of different distribution shapes associated with different values of these parameters. The function <it>g</it>(<it>x</it>|<it>&#952;</it>) is given by <it>g</it>(<it>x</it>|<it>&#952;</it>)&#8201;=&#8201;<it>Exp</it>[&#8721;<sub>
					<it>k</it>&#8201;=&#8201;1</sub>
				<sup>
					<it>p</it>
				</sup>
				<it>&#952;</it>
				<sub>
					<it>k</it>
				</sub>
				<it>x</it>
				<sub>
					<it>k</it>
				</sub>], where the x<sub>k</sub> are covariates and the &#952;<sub>k</sub> are the associated parameters. Similarly, the hypertabastic survival function <it>S</it>(<it>t</it>|<it>x</it>,&#8201;<it>&#952;</it>) for the proportional hazards model has the form</p>
			<p>
				<display-formula id="M2">
					<m:math name="1755-8794-5-63-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi>S</m:mi>
   <m:mfenced open="(" close=")">
      <m:mrow>
         <m:mi>t</m:mi>
         <m:mo stretchy="true">|</m:mo>
         <m:mi>x</m:mi>
         <m:mo>,</m:mo>
         <m:mi>&#952;</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:msup>
      <m:mfenced open="[" close="]">
         <m:mrow>
            <m:msub>
               <m:mi>S</m:mi>
               <m:mn>0</m:mn>
            </m:msub>
            <m:mfenced open="(" close=")">
               <m:mi>t</m:mi>
            </m:mfenced>
         </m:mrow>
      </m:mfenced>
      <m:mrow>
         <m:mi>g</m:mi>
         <m:mfenced open="(" close=")">
            <m:mrow>
               <m:mi>x</m:mi>
               <m:mo stretchy="true">|</m:mo>
               <m:mi>&#952;</m:mi>
            </m:mrow>
         </m:mfenced>
      </m:mrow>
   </m:msup>
</m:mrow>
</m:math>
				</display-formula>
			</p>
			<p>where S<sub>0</sub>(t) is the baseline survival function, given by</p>
			<p>
				<display-formula>
					<m:math name="1755-8794-5-63-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:msub>
      <m:mi>S</m:mi>
      <m:mn>0</m:mn>
   </m:msub>
   <m:mfenced open="(" close=")">
      <m:mi>t</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi>s</m:mi>
   <m:mi>e</m:mi>
   <m:mi>c</m:mi>
   <m:mi>h</m:mi>
   <m:mfenced open="{" close="}">
      <m:mrow>
         <m:mi>&#945;</m:mi>
         <m:mfenced open="[" close="]">
            <m:mrow>
               <m:mn>1</m:mn>
               <m:mo>&#8722;</m:mo>
               <m:msup>
                  <m:mi>t</m:mi>
                  <m:mi>&#946;</m:mi>
               </m:msup>
               <m:mi>c</m:mi>
               <m:mi>o</m:mi>
               <m:mi>t</m:mi>
               <m:mi>h</m:mi>
               <m:mfenced open="(" close=")">
                  <m:msup>
                     <m:mi>t</m:mi>
                     <m:mi>&#946;</m:mi>
                  </m:msup>
               </m:mfenced>
            </m:mrow>
         </m:mfenced>
         <m:mo>/</m:mo>
         <m:mi>&#946;</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mtext>.</m:mtext>
</m:mrow>
</m:math>
				</display-formula>
			</p>
			<p>For further detail, see 
				<abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>. Simulation studies with this model 
				<abbrgrp>
					<abbr bid="B2">2</abbr>
				</abbrgrp> have demonstrated some degree of robustness with respect to variations in the distribution of the data.</p>
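			<p>The definitions above can be sketched in code. The following is a minimal illustration, not the SAS or Mathematica code used in this study: it evaluates the baseline functions and the proportional hazards forms (1) and (2) for t greater than 0. The function names, the single covariate, and its coefficient are hypothetical; only the values 0.7247 and 0.6205 are taken from the estimates a and b reported in Table 2, and the time point is arbitrary.</p>

```python
import math


def W(t, alpha, beta):
    """W(t) = alpha * (1 - t**beta * coth(t**beta)) / beta, for t > 0."""
    x = t ** beta
    return alpha * (1.0 - x / math.tanh(x)) / beta


def baseline_survival(t, alpha, beta):
    """Baseline survival S0(t) = sech(W(t))."""
    return 1.0 / math.cosh(W(t, alpha, beta))


def baseline_hazard(t, alpha, beta):
    """Baseline hazard h0(t) as given in the text."""
    x = t ** beta
    csch2 = 1.0 / math.sinh(x) ** 2
    coth = 1.0 / math.tanh(x)
    bracket = t ** (2 * beta - 1) * csch2 - t ** (beta - 1) * coth
    return alpha * bracket * math.tanh(W(t, alpha, beta))


def g(x, theta):
    """Covariate term g(x | theta) = exp(sum_k theta_k * x_k)."""
    return math.exp(sum(th * xk for th, xk in zip(theta, x)))


def survival(t, x, theta, alpha, beta):
    """Equation (2): S(t | x, theta) = S0(t) ** g(x | theta)."""
    return baseline_survival(t, alpha, beta) ** g(x, theta)


def hazard(t, x, theta, alpha, beta):
    """Equation (1): h(t | x, theta) = h0(t) * g(x | theta)."""
    return baseline_hazard(t, alpha, beta) * g(x, theta)


alpha, beta = 0.7247, 0.6205  # estimates a and b from Table 2
theta = [0.5]                 # hypothetical coefficient, not from the paper
x = [1.0]                     # hypothetical covariate value
t = 5.0                       # arbitrary time point
print(survival(t, x, theta, alpha, beta), hazard(t, x, theta, alpha, beta))
```

			<p>Because the baseline hazard equals the negative derivative of the logarithm of the baseline survival, a numerical derivative of log S<sub>0</sub>(t) offers a quick consistency check on any implementation of these formulas.</p>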
			<p>This model is applied to the 295 patient study from the Netherlands Cancer Institute which is presented in 
				<abbrgrp>
					<abbr bid="B27">27</abbr>
				</abbrgrp> as a validation set for the seventy gene signature. All of these patients had stage I or II breast cancer and no previous history of cancer. The study combined both lymph-node positive and lymph-node negative patients. All of these patients had been treated by modified radical mastectomy or breast-conserving surgery. Of the patients with lymph-node positive disease, 120 were treated with adjuvant chemotherapy and/or hormonal therapy. For more information regarding this study, see 
				<abbrgrp>
					<abbr bid="B27">27</abbr>
				</abbrgrp>.</p>
			<p>Here we further discuss the different variables that were included as potential covariates in the model. The first class of variables was the clinical variables, including the following: estrogen receptor status (ERS), tumor grade (TG1 and TG2), age (AGE), diameter (DIAM), and lymph node status (LN1 and LN2). The primary gene expression variable we tested was the seventy gene signature (70G) of 
				<abbrgrp>
					<abbr bid="B5">5</abbr>
				</abbrgrp> which selected genes for prediction of early distant metastasis. From the study of the wound healing microenvironment by Chang et al. 
				<abbrgrp>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
				</abbrgrp>, the wound response signature (WRS) and the core serum response correlation (CSR) were included as potential gene expression variables. The core serum response is developed in 
				<abbrgrp>
					<abbr bid="B9">9</abbr>
				</abbrgrp> to represent a canonical expression of fibroblasts activated by serum, and it is a cell-cycle independent set of genes in areas including vascularization, cell motility, and matrix remodeling, common to both the wound healing and tumor microenvironments. Finally, in the area of gene expression for classification of molecular subtype, we considered correlation used for validation in 
				<abbrgrp>
					<abbr bid="B27">27</abbr>
				</abbrgrp> (CVal), and with centroids for normal (CNorm), ErbB2+ (CERBB), Luminal A (CLumA), Luminal B (CLumB), and basal (CBas) from 
				<abbrgrp>
					<abbr bid="B6">6</abbr>
				</abbrgrp>.</p>
			<p>In applying the hypertabastic survival model to this data set, we considered the clinical, gene expression, and classification variables described above. We applied a standard forward stepwise variable selection procedure. In addition, since some of the variables are highly correlated, we used a procedure that ensured no two of the variables considered had a pairwise correlation of 0.5 or higher. The parameters were estimated using a SAS program, and these parameter estimates were double-checked using Mathematica. A SAS program for the hypertabastic proportional hazard model using log-time is provided in the Additional file 
				<supplr sid="S1">1</supplr>: Documents.</p>
			<suppl id="S1">
				<title>
					<p>Additional file 1</p>
				</title>
				<text>
					<p>
						<b>Data cancer.</b>
					</p>
				</text>
				<file name="1755-8794-5-63-S1.docx">
   <p>Click here for file</p>
</file>
			</suppl>
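			<p>The text does not specify how the pairwise correlation limit was enforced. One simple possibility, shown here only as a sketch under that assumption, is a greedy screen that walks through the candidates in a chosen order and skips any variable whose absolute correlation with an already retained variable reaches 0.5. The function name, the variable labels V1 to V3, and the correlation values are all hypothetical.</p>

```python
def screen_by_correlation(names, corr, threshold=0.5):
    """Keep variables in order, skipping any whose absolute pairwise
    correlation with an already kept variable reaches the threshold."""
    kept = []
    for name in names:
        if all(threshold > abs(corr[frozenset((name, other))]) for other in kept):
            kept.append(name)
    return kept


# Hypothetical candidates and correlations, for illustration only.
candidates = ["V1", "V2", "V3"]
hypothetical_corr = {
    frozenset(("V1", "V2")): 0.8,
    frozenset(("V1", "V3")): 0.2,
    frozenset(("V2", "V3")): 0.3,
}
print(screen_by_correlation(candidates, hypothetical_corr))
```

			<p>With these made-up values the screen retains V1 and V3 and drops V2. The outcome of such a screen depends on the order in which candidates are visited, so in practice the order would need to reflect the selection criterion in use.</p>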
			<p>Once the parameters had been estimated, these values were substituted into the survival function (2) and hazard function (1). Mathematica was then used to plot the hazard and survival functions for the desired cases. Further dynamic analysis of these curves and their derivatives was also carried out using Mathematica.</p>
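			<p>As an illustration of this kind of dynamic analysis, the sketch below locates the time at which the baseline survival curve falls fastest, using a central difference on a grid. It is not the Mathematica analysis used in this study. The grid bounds, step sizes, and function names are assumptions, the parameter values are the estimates a and b from Table 2, and the time scale of the result should not be read as matching the fitted model, which also depends on the covariates.</p>

```python
import math


def baseline_survival(t, alpha, beta):
    """Baseline survival S0(t) = sech(alpha * (1 - x * coth(x)) / beta), x = t**beta."""
    x = t ** beta
    w = alpha * (1.0 - x / math.tanh(x)) / beta
    return 1.0 / math.cosh(w)


def steepest_survival_drop(alpha, beta, t_max=20.0, n=4000, eps=1e-5):
    """Grid search for the time at which -dS0/dt is largest.

    Returns (time, rate), where rate approximates -dS0/dt at that time
    by a central difference with half-width eps.
    """
    best_t, best_rate = None, -math.inf
    for i in range(1, n + 1):
        t = t_max * i / n
        rate = -(baseline_survival(t + eps, alpha, beta)
                 - baseline_survival(t - eps, alpha, beta)) / (2 * eps)
        if rate > best_rate:
            best_t, best_rate = t, rate
    return best_t, best_rate


t_star, rate_star = steepest_survival_drop(0.7247, 0.6205)
print(t_star, rate_star)
```

			<p>The same grid search applied to a numerical derivative of the hazard function would give the time of the maximum rate of increase in hazard.</p>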
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<sec>
				<st>
					<p>Model based on gene expression and clinical variables</p>
				</st>
				<p>In this section we apply the model selection procedure to determine an effective model to represent the survival of the breast cancer patients in the Netherlands study of 
					<abbrgrp>
						<abbr bid="B27">27</abbr>
					</abbrgrp>, described briefly above. In selecting from among the hypertabastic, log-logistic, and Weibull proportional hazard models, we compare these models using the &#8722;2 log-likelihood score and the Akaike Information Criterion (AIC) 
					<abbrgrp>
						<abbr bid="B28">28</abbr>
					</abbrgrp>. The Akaike Information Criterion is commonly used when selecting among several competing models, with the smallest value corresponding to the best-fitting model. See Table 
					<tblr tid="T1">1</tblr>, where we compare the three parametric models mentioned above. For purposes of comparison, we also include Cox regression. The covariates in the model are AGE, 70G, CSR, and CERBB.</p>
				<table id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>
							<b>Comparison of models</b>
						</p>
					</caption>
					<tgroup align="left" cols="4">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2">
									<p>
										<b>&#8722;2 Log likelihood</b>
									</p>
								</entry>
								<entry colname="c3">
									<p>
										<b>AIC</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>&#8722;2 Log likelihood without covariates</b>
									</p>
								</entry>
							</row>
						</thead>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>Hypertabastic</p>
								</entry>
								<entry colname="c2">
									<p>387.755</p>
								</entry>
								<entry colname="c3">
									<p>399.755</p>
								</entry>
								<entry colname="c4">
									<p>467.952</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>Weibull</p>
								</entry>
								<entry colname="c2">
									<p>399.000</p>
								</entry>
								<entry colname="c3">
									<p>411.000</p>
								</entry>
								<entry colname="c4">
									<p>474.089</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>Log Logistic</p>
								</entry>
								<entry colname="c2">
									<p>502.126</p>
								</entry>
								<entry colname="c3">
									<p>514.126</p>
								</entry>
								<entry colname="c4">
									<p>544.930</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>Cox Regression</p>
								</entry>
								<entry colname="c2">
									<p>764.001</p>
								</entry>
								<entry colname="c3">
									<p>772.001</p>
								</entry>
								<entry colname="c4">
									<p>836.598</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
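				<p>The AIC column of Table 1 can be reproduced from the &#8722;2 log-likelihood column as AIC = &#8722;2 log L + 2k. The sketch below does this, and also forms the difference between the two &#8722;2 log-likelihood columns as a likelihood ratio statistic for the covariates. The parameter counts k are inferred from the table rather than stated in the text: the gap of 12 for the parametric models is consistent with two distribution parameters plus four covariates, and the gap of 8 for Cox regression with four covariates only.</p>

```python
def aic(neg2_loglik, n_params):
    """AIC = -2 log L + 2k, where k is the number of estimated parameters."""
    return neg2_loglik + 2 * n_params


# (-2 log L with covariates, -2 log L without covariates, inferred k)
table1 = {
    "Hypertabastic": (387.755, 467.952, 6),
    "Weibull": (399.000, 474.089, 6),
    "Log Logistic": (502.126, 544.930, 6),
    "Cox Regression": (764.001, 836.598, 4),
}

scores = {name: aic(full, k) for name, (full, _null, k) in table1.items()}
lr_stats = {name: null - full for name, (full, null, _k) in table1.items()}
best = min(scores, key=scores.get)

for name in table1:
    print(f"{name}: AIC = {scores[name]:.3f}, LR statistic = {lr_stats[name]:.3f}")
print("Smallest AIC:", best)
```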
				<p>In Table 
					<tblr tid="T2">2</tblr> we give the estimates a and b for the parameters &#945; and &#946; of the hypertabastic distribution and the coefficient estimates for each of the model variables AGE, seventy gene signature (70G), CSR correlation (CSR), and ErbB2+ correlation (CERBB), together with the standard error, Wald test statistic, and p-value. Among the three gene expression variables included in the model, CSR correlation is clearly the most significant, with the highest hazard ratio and smallest p-value.</p>
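				<p>The Wald statistics and p-values in Table 2 are consistent with the usual construction: the squared ratio of the estimate to its standard error, referred to a chi-square distribution with one degree of freedom. The sketch below applies that construction to the rows for a, b, and AGE. It assumes this is how the table was computed, so small differences from the reported values reflect rounding of the inputs; the function names are illustrative.</p>

```python
import math


def wald_statistic(estimate, std_error):
    """Wald chi-square statistic: (estimate / standard error) squared."""
    return (estimate / std_error) ** 2


def wald_p_value(stat):
    """Upper tail of a chi-square with 1 df: P(Z**2 > stat) = erfc(sqrt(stat / 2))."""
    return math.erfc(math.sqrt(stat / 2.0))


# (estimate, standard error) as reported in Table 2
rows = {
    "a (model)": (0.7247, 0.2888),
    "b (model)": (0.6205, 0.1244),
    "c (AGE)": (-0.07350, 0.01480),
}

for name, (est, se) in rows.items():
    stat = wald_statistic(est, se)
    print(f"{name}: Wald = {stat:.3f}, p = {wald_p_value(stat):.3e}")
```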
				<table id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>
							<b>Parameter estimates and statistical significance for combined model</b>
						</p>
					</caption>
					<tgroup align="left" cols="6">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="left" colname="c5" colnum="5" colwidth="1*"/>
						<colspec align="left" colname="c6" colnum="6" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Parameter</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>
										<b>Estimate</b>
									</p>
								</entry>
								<entry colname="c3">
									<p>
										<b>Standard error</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Wald test</b>
									</p>
								</entry>
								<entry colname="c5">
									<p>
										<b>P-value</b>
									</p>
								</entry>
								<entry colname="c6">
									<p>
										<b>Hazard ratio</b>
									</p>
								</entry>
							</row>
						</thead>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>a (model)</p>
								</entry>
								<entry colname="c2">
									<p>0.7247</p>
								</entry>
								<entry colname="c3">
									<p>0.2888</p>
								</entry>
								<entry colname="c4">
									<p>6.298</p>
								</entry>
								<entry colname="c5">
									<p>0.01209</p>
								</entry>
								<entry colname="c6">
									<p>NA</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>b (model)</p>
								</entry>
								<entry colname="c2">
									<p>0.6205</p>
								</entry>
								<entry colname="c3">
									<p>0.1244</p>
								</entry>
								<entry colname="c4">
									<p>24.873</p>
								</entry>
								<entry colname="c5">
									<p>6.125 &#215; 10^-7</p>
								</entry>
								<entry colname="c6">
									<p>NA</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>c (AGE)</p>
								</entry>
								<entry colname="c2">
									<p>&#8722;0.07350</p>
								</entry>
								<entry colname="c3">
									<p>0.01480</p>
								</entry>
								<entry colname="c4">
									<p>24.645</p>
								</entry>
								<entry colname="c5">
									<p>6.891 &#215; 10^-7</p>
								</entry>
								<entry colname="c6">
									<p>0.9291</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>d (70G)</p>
								</entry>
								<entry colname="c2">
									<p>1.199</p>
								</entry>
								<entry colname="c3">
									<p>0.3872</p>
								</entry>
								<entry colname="c4">
									<p>9.585</p>
								</entry>
								<entry colname="c5">
									<p>0.001962</p>
								</entry>
								<entry colname="c6">
									<p>3.316</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>e (CSR)</p>
								</entry>
								<entry colname="c2">
									<p>2.661</p>
								</entry>
								<entry colname="c3">
									<p>0.7025</p>
								</entry>
								<entry colname="c4">
									<p>14.343</p>
								</entry>
								<entry colname="c5">
									<p>0.0001524</p>
								</entry>
								<entry colname="c6">
									<p>14.305</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>f (CERBB)</p>
								</entry>
								<entry colname="c2">
									<p>1.561</p>
								</entry>
								<entry colname="c3">
									<p>0.7285</p>
								</entry>
								<entry colname="c4">
									<p>4.594</p>
								</entry>
								<entry colname="c5">
									<p>0.03208</p>
								</entry>
								<entry colname="c6">
									<p>4.766</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
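				<p>The Wald test, p-value, and hazard ratio columns of Table 2 can be recomputed from the estimate and standard error columns. The following is a minimal Python sketch, assuming that the Wald statistic is the squared ratio of estimate to standard error, that the p-value is the upper tail of a chi-square distribution with one degree of freedom, and that the hazard ratio is the exponential of the coefficient. These are standard conventions with which the tabulated values agree to rounding; the names table2, wald, wald_p_value, and hazard_ratio are illustrative only.</p>

```python
import math

# (estimate, standard error) pairs copied from Table 2
table2 = {
    "a (model)": (0.7247, 0.2888),
    "b (model)": (0.6205, 0.1244),
    "c (AGE)": (-0.07350, 0.01480),
    "d (70G)": (1.199, 0.3872),
    "e (CSR)": (2.661, 0.7025),
    "f (CERBB)": (1.561, 0.7285),
}


def wald(estimate, std_err):
    # Assumed form: squared z-score
    return (estimate / std_err) ** 2


def wald_p_value(statistic):
    # Upper tail of chi-square with 1 d.f., written via the
    # complementary error function: erfc(sqrt(w / 2))
    return math.erfc(math.sqrt(statistic / 2.0))


def hazard_ratio(estimate):
    # Assumed form: multiplicative change in hazard per unit of covariate
    return math.exp(estimate)


for name, (est, se) in table2.items():
    w = wald(est, se)
    line = f"{name:10s} Wald={w:7.3f} p={wald_p_value(w):.3e}"
    # Table 2 reports no hazard ratio for the distribution parameters a, b
    if "model" not in name:
        line += f" HR={hazard_ratio(est):.4f}"
    print(line)
```

				<p>Small discrepancies in the last digit are to be expected, since the estimates and standard errors in Table 2 are themselves rounded.</p>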
				<p>Inclusion of the clinical variables improved the goodness of fit of the model for each of the gene signatures considered, consistent with the results of 
					<abbrgrp>
						<abbr bid="B12">12</abbr>
					</abbrgrp>. Although the seventy gene signature gives the best-fitting model of all the gene expression variables when each is considered alone, including multiple gene expression variables improves the fit considerably. The combined model features the gene expression variables 70G, representing distant early metastasis; CSR, representing relation to a wound healing microenvironment which promotes cell migration and vascularization; and CERBB, representing ErbB2+/Her2 over-expression and relating to molecular subtype. The individual gene signatures 70G, CSR, and CERBB yield models with AIC values of 423.142, 436.056, and 448.248, respectively, whereas the combined model improves markedly on all three, with an AIC of 399.755. Since these signatures represent different aspects of the underlying cancer biology, it is perhaps not surprising that combining the variables produces a model with a better fit to the data.</p>
				<p>Researchers and clinicians are already aware that, in the absence of a combined model, several important variables may point toward different conclusions. Our combined model addresses the question of how much weight to assign to each of several significant variables, and offers a scientific approach to this issue based on statistical techniques and quantitative analysis. An added advantage of using a well-fitting parametric model, such as the hypertabastic survival model, is the ability to analyze the temporal dynamics of the hazard and survival functions, as we illustrate in the remainder of this section. Since two of the gene expression variables are continuous, given by levels of correlation to an established gene expression profile, we are also able to investigate the dynamics of hazard and survival with respect to changes in the level of gene expression.</p>
			</sec>
			<sec>
				<st>
					<p>Dynamics of survival and hazard</p>
				</st>
				<p>The temporal dynamics of hazard and survival curves for the combined model follow from the above determination of parameter values. In the following we work out the details of this time course, as well as the influence of the covariates, with particular attention to the gene expression variables and their interactions. In order to isolate the effects of one or two of the variables within the combined model we will hold all other variables at a fixed level, usually the median. We begin with the seventy gene signature 70G, both in relation to the other gene expression variables CSR and CERBB, and also in comparison to 70G as a single variable model.</p>
				<p>We now analyze the interaction between the seventy gene signature and CSR correlation within our multivariable model, while holding our other variable of ErbB2+ correlation fixed at its median value. The graphs in Figure 
					<figr fid="F1">1</figr> show the interrelation between the seventy gene signature and the CSR correlation for survival functions and their derivatives. The axes on the left contain the curves for a seventy gene signature with a good prognosis, while the curves on the axes on the right have a seventy gene signature for a poor prognosis. The graphs on each set of axes represent a passage from the minimum CSR correlation at the top to the maximum CSR correlation at the bottom, while the curve in the middle represents the survival curve when only the seventy gene signature is considered. These are followed by the graphs of the rate of change of survival.</p>
				<fig id="F1"><title><p>Figure 1</p></title><caption><p>Survival function for varying CSR correlation and seventy gene signature</p></caption><text>
   <p>
      <b>Survival function for varying CSR correlation and seventy gene signature.</b>
   </p>
</text><graphic file="1755-8794-5-63-1"/></fig>
				<p>Notice that when the seventy gene signature has a poor prognosis, the effect of CSR correlation on survival is also magnified. We can determine the maximum rate of decrease in survival probability for each of the cases, and these are given in Table 
					<tblr tid="T3">3</tblr>.</p>
				<table id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>
							<b>Maximum rates of decrease in survival and increase in hazard with varying CSR</b>
						</p>
					</caption>
					<tgroup align="left" cols="4">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2"/>
								<entry colname="c3">
									<p>
										<b>Time of extremum</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Velocity at extremum</b>
									</p>
								</entry>
							</row>
						</thead>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Survival</b>
									</p>
								</entry>
								<entry colname="c2"/>
								<entry colname="c3"/>
								<entry colname="c4"/>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Good prognosis (70G = 0)</p>
								</entry>
								<entry colname="c2">
									<p>CSR min</p>
								</entry>
								<entry colname="c3">
									<p>4.003</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.004703</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>CSR max</p>
								</entry>
								<entry colname="c3">
									<p>3.446</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.04187</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Only 70 gene sig.</p>
								</entry>
								<entry colname="c3">
									<p>8.332</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.007713</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Poor prognosis (70G = 1)</p>
								</entry>
								<entry colname="c2">
									<p>CSR min</p>
								</entry>
								<entry colname="c3">
									<p>3.814</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.01516</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>CSR max</p>
								</entry>
								<entry colname="c3">
									<p>2.682</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.1198</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Only 70 gene sig.</p>
								</entry>
								<entry colname="c3">
									<p>3.743</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.05187</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Hazard</b>
									</p>
								</entry>
								<entry colname="c2"/>
								<entry colname="c3"/>
								<entry colname="c4"/>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Good prognosis (70G = 0)</p>
								</entry>
								<entry colname="c2">
									<p>CSR min</p>
								</entry>
								<entry colname="c3">
									<p>2.187</p>
								</entry>
								<entry colname="c4">
									<p>0.006365</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>CSR max</p>
								</entry>
								<entry colname="c3">
									<p>2.187</p>
								</entry>
								<entry colname="c4">
									<p>0.05976</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Only 70 gene sig.</p>
								</entry>
								<entry colname="c3">
									<p>5.107</p>
								</entry>
								<entry colname="c4">
									<p>0.009010</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Poor prognosis (70G = 1)</p>
								</entry>
								<entry colname="c2">
									<p>CSR min</p>
								</entry>
								<entry colname="c3">
									<p>2.187</p>
								</entry>
								<entry colname="c4">
									<p>0.02111</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>CSR max</p>
								</entry>
								<entry colname="c3">
									<p>2.187</p>
								</entry>
								<entry colname="c4">
									<p>0.1982</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Only 70 gene sig.</p>
								</entry>
								<entry colname="c3">
									<p>5.107</p>
								</entry>
								<entry colname="c4">
									<p>0.07695</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
				<p>We note that in the case of a poor prognosis for the seventy gene signature, the maximum rate of decrease in the survival function occurs sooner in all of the cases. Furthermore, this rate of change has a larger magnitude, indicating a faster decrease in the survival function, when there is a poor prognosis. These graphs also compare the curve in the middle, where 70G is the only covariate, with the curves on the outside. For the outer curves all four variables are included in the model, and the focus is on the variation in CSR correlation from its minimum value to its maximum value, with the other variables held at their median levels. Here the differences in shape also arise from the variation in the values of &#945; and &#946; between these cases, a feature of the hypertabastic distribution that allows greater variability in the location and magnitude of the maximum rate of decrease for the survival functions.</p>
				<p>Figure 
					<figr fid="F2">2</figr> contains the hazard curves, together with their derivatives, for the same set of covariates. A marked difference again appears between the graphs on the left, where the seventy gene signature shows a good prognosis, and the graphs on the right, where it shows a poor prognosis. This again shows the much larger effect of CSR correlation in the case of the poor prognosis. For instance, at 20 years the difference in hazard values between minimum and maximum CSR is 0.5014 for a good prognosis, while this difference increases to 1.663 for a poor prognosis in the seventy gene signature. Table 
					<tblr tid="T3">3</tblr> shows the time and magnitude for the maximum rate of change of the hazard value.</p>
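				<p>The hazard rows of Table 3 can be cross-checked against Table 2. As a hedged sketch, assuming the combined model has proportional hazards form, so that a covariate rescales the baseline hazard without shifting it in time, the ratio of hazard velocities between poor and good seventy gene signature at a fixed CSR level should equal the 70G hazard ratio, and the ratio between maximum and minimum CSR should be the same for both prognosis groups. The Python fragment below performs this check on the tabulated values; all variable names are illustrative.</p>

```python
import math

# Maximum rate of increase in hazard, copied from Table 3 (combined model)
velocity = {
    ("good", "CSR min"): 0.006365,
    ("good", "CSR max"): 0.05976,
    ("poor", "CSR min"): 0.02111,
    ("poor", "CSR max"): 0.1982,
}

# Coefficient of the seventy gene signature, copied from Table 2
coef_70g = 1.199
hr_70g = math.exp(coef_70g)

# Poor versus good prognosis at the same CSR level
ratio_at_min = velocity[("poor", "CSR min")] / velocity[("good", "CSR min")]
ratio_at_max = velocity[("poor", "CSR max")] / velocity[("good", "CSR max")]

# Maximum versus minimum CSR within each prognosis group
csr_ratio_good = velocity[("good", "CSR max")] / velocity[("good", "CSR min")]
csr_ratio_poor = velocity[("poor", "CSR max")] / velocity[("poor", "CSR min")]

print(f"70G hazard ratio from Table 2:   {hr_70g:.3f}")
print(f"velocity ratio at CSR min:       {ratio_at_min:.3f}")
print(f"velocity ratio at CSR max:       {ratio_at_max:.3f}")
print(f"CSR max/min ratio, good 70G:     {csr_ratio_good:.3f}")
print(f"CSR max/min ratio, poor 70G:     {csr_ratio_poor:.3f}")
```

				<p>Both poor-to-good ratios come out at approximately 3.32, in agreement with the 70G hazard ratio in Table 2. Under the same assumption the time of the maximum does not depend on the covariates, which is consistent with the common value of 2.187 reported for all four combined-model cases.</p>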
				<fig id="F2"><title><p>Figure 2</p></title><caption><p>Hazard function for varying CSR correlation and seventy gene signature</p></caption><text>
   <p>
      <b>Hazard function for varying CSR correlation and seventy gene signature.</b>
   </p>
</text><graphic file="1755-8794-5-63-2"/></fig>
				<p>For the two correlation variables (CSR and CERBB), an increased level of correlation is associated with a poor outcome, and both exhibit the same general profile of greater invasiveness, greater resistance to treatment, and shorter times until recurrence. In the following we compare the effect of the ErbB2+ correlation (CERBB) to that of the CSR correlation (CSR) treated above. We note that, although there are some similarities, the biological processes measured by the two gene expression variables play different roles in tumor progression. The CSR correlation treated above deals with the role of fibroblasts in both wound healing and tumor progression in cancer and relates to the proposed wound-like phenotype that has been observed in a number of human cancers 
					<abbrgrp>
						<abbr bid="B10">10</abbr>
					</abbrgrp>. The CSR gene signature includes genes for cell motility, matrix remodeling, and angiogenesis, which correspond to increased risk of metastasis and the potential for a more invasive cancer. This signature gives a strong prediction of outcome in several cancer types. The role of ErbB2 in determining outcome has been established in numerous studies 
					<abbrgrp>
						<abbr bid="B29">29</abbr>
					</abbrgrp> and is independent of other prognostic factors. The protein tyrosine kinases in the HER (ErbB) signaling network play critical roles in cell signaling that regulates proliferation, migration, and survival 
					<abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>. Disruption of the signaling network of tyrosine kinases figures prominently in many known oncogenic mutations leading to neoplasms, including cases of breast carcinomas. HER2/neu has also been shown to disrupt the p53 tumor suppression pathway 
					<abbrgrp>
						<abbr bid="B31">31</abbr>
					</abbrgrp>. The action of this signaling network and its role in cancer progression continue to be studied in order to discover new therapies.</p>
				<p>Because ErbB2 and CSR act through different mechanisms, both variables can contribute to determining the probability of survival. The effect of ErbB2+ correlation (CERBB) in the survival model follows approximately the same pattern as that of the CSR correlation (CSR) described above, although its magnitude is somewhat smaller, as described below. The hazard ratios and p-values for these two variables are comparable when each is considered individually, with hazard ratios of 45.489 and 30.036 for CSR correlation and ErbB2+ correlation, respectively, and p-values of 1.462 &#215; 10^-9 and 2.990 &#215; 10^-7, respectively. However, when considered with all the other variables in the model, the hazard ratios become 14.305 and 4.766 for CSR correlation and ErbB2+ correlation, respectively, with p-values of 0.0001524 and 0.03208, respectively. The effect of the seventy gene signature on the ErbB2+ correlation will be comparable to its effect on the CSR correlation, as demonstrated above. Thus the ErbB2+ correlation will display the same pattern as the CSR correlation, with a somewhat smaller magnitude due to the difference in hazard ratios. In the following we will also investigate each of these correlations, CSR and ErbB2+, as continuous variables within our overall model. We will also consider the relation between these variables below, where an increase in correlation of one variable can be expected to amplify the effects of the other, as observed above for the seventy gene signature.</p>
				<p>The graphs in Figure 
					<figr fid="F3">3</figr> show the survival curves, together with the derivatives, with the case where only the seventy gene signature (solid curve in center) is considered compared with the four variable model for varying levels of ErbB2+ correlation. The curves on the outside represent the minimum level of ErbB2+ (dotted curve at top) and the maximum level of ErbB2+ correlation (dashed curve at bottom), with the good seventy gene signature in the axes on the left and the poor seventy gene signature in the axes on the right. The location and velocities of the minima for the rate of change of the survival curve are given in Table 
					<tblr tid="T4">4</tblr>.</p>
				<fig id="F3"><title><p>Figure 3</p></title><caption><p>Survival function with varying ErbB2+ correlation</p></caption><text>
   <p>
      <b>Survival function with varying ErbB2+ correlation.</b>
   </p>
</text><graphic file="1755-8794-5-63-3"/></fig>
				<table id="T4">
					<title>
						<p>Table 4</p>
					</title>
					<caption>
						<p>
							<b>Maximum rate of decrease of survival function with varying ErbB2+ correlation</b>
						</p>
					</caption>
					<tgroup align="left" cols="4">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2"/>
								<entry colname="c3">
									<p>
										<b>Time</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Velocity</b>
									</p>
								</entry>
							</row>
						</thead>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Good prognosis</p>
								</entry>
								<entry colname="c2">
									<p>Min ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>3.929</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.008628</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Max ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>3.627</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.02704</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>70G only</p>
								</entry>
								<entry colname="c3">
									<p>8.332</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.007713</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Poor prognosis</p>
								</entry>
								<entry colname="c2">
									<p>Min ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>3.624</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.02722</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Max ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>3.024</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.07859</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>70G only</p>
								</entry>
								<entry colname="c3">
									<p>3.743</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.05187</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
				<p>The effect of the ErbB2+ correlation is comparable to that of the CSR correlation observed above, although the magnitude is smaller. The differences in 20 year survival rates between the minimum and maximum ErbB2+ correlations are 0.2316 in the case of a good seventy gene signature and 0.4097 in the case of a poor seventy gene signature. These are just over half of the corresponding differences between minimum and maximum CSR correlation, which are 0.4235 for the good seventy gene signature and 0.7021 for the poor seventy gene signature.</p>
				<p>In the remainder of the study we further describe interactions between our three gene expression variables, 70G, CSR, and CERBB, in determining the survival function. As the variables for CSR correlation and ErbB2+ correlation are continuous variables, we study the effect of variation of the level of correlation on the survival function. We first investigate separately the effects of each of these correlations, CSR and ErbB2+, in determining the probability of survival beyond ten years. Then, as a function of two variables we are able to investigate the combined effect of these two correlations on the probability of survival beyond ten years. We also use two variables to consider the effect of each of these individual variables in combination with time. In each case we analyze the survival function to explore quantitatively how change in the level of correlation will affect the prognosis and the probability of survival beyond a given time. It is also possible to determine at what time a given correlation will display its largest impact on survival. This analysis will further allow us to compare the influence of these two variables, CSR correlation and ErbB2+ correlation, and how they affect the survival and hazard curves, over time.</p>
				<p>We first investigate the role of CSR correlation (CSR) while holding the other variables at their median levels and assuming a poor prognosis in the seventy gene signature (70G). We consider the probability of survival past three fixed times: 5 years, 10 years, and 20 years. These survival curves, followed by their rates of change, are given in Figure 
					<figr fid="F4">4</figr>. The horizontal axis for CSR correlation varies from the minimum CSR correlation to the maximum CSR correlation for the data set, and our interest is primarily in this range of values for CSR correlation.</p>
				<fig id="F4"><title><p>Figure 4</p></title><caption><p>Survival and hazard at 5, 10, and 20 years, as functions of CSR correlation</p></caption><text>
   <p>
      <b>Survival and hazard at 5, 10, and 20 years, as functions of CSR correlation.</b>
   </p>
</text><graphic file="1755-8794-5-63-4"/></fig>
				<p>As expected, survival drops off with increasing CSR correlation. The effect of the CSR correlation increases with time, as may also be expected. For survival beyond 5 years, the decrease in survival with increasing CSR correlation occurs at an increasing rate throughout the experimental range of CSR correlations, reaching a maximum rate of decrease of &#8722;0.8387 at the maximum correlation. At 10 and 20 years, however, the effect of CSR correlation in decreasing survival is even larger, with the maximum rate of decrease occurring at correlations within the experimental range. The specific values are given in Table 
					<tblr tid="T5">5</tblr>. Clearly, as time increases the CSR correlation has a larger effect, with significant effects noticeable at much lower levels of correlation. Similarly, at the minimum values of correlation the effect of time is much less significant, and the survival rates are much higher.</p>
				<table id="T5">
					<title>
						<p>Table 5</p>
					</title>
					<caption>
						<p>
							<b>Maximum rate of decrease for survival function with CSR correlation vs. ErbB2+ correlation</b>
						</p>
					</caption>
					<tgroup align="left" cols="6">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="left" colname="c5" colnum="5" colwidth="1*"/>
						<colspec align="left" colname="c6" colnum="6" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2">
									<p>
										<b>Time</b>
									</p>
								</entry>
								<entry colname="c3">
									<p>
										<b>Correlation (unrestricted)</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Velocity</b>
									</p>
								</entry>
								<entry colname="c5">
									<p>
										<b>Correlation (within data range)</b>
									</p>
								</entry>
								<entry colname="c6">
									<p>
										<b>Velocity</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>Note: Max[CSR] = 0.455306 and Max[CERBB] = 0.451045 for this data set.</p>
						</tfoot>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Effect of variation of CSR correlation</p>
								</entry>
								<entry colname="c2">
									<p>5 years</p>
								</entry>
								<entry colname="c3">
									<p>0.6855</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.9788</p>
								</entry>
								<entry colname="c5">
									<p>Max</p>
								</entry>
								<entry colname="c6">
									<p>&#8722;0.8387</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>10 years</p>
								</entry>
								<entry colname="c3">
									<p>0.3855</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.9788</p>
								</entry>
								<entry colname="c5">
									<p>0.3855</p>
								</entry>
								<entry colname="c6">
									<p>&#8722;0.9788</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>20 years</p>
								</entry>
								<entry colname="c3">
									<p>0.1498</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.9788</p>
								</entry>
								<entry colname="c5">
									<p>0.1498</p>
								</entry>
								<entry colname="c6">
									<p>&#8722;0.9788</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="2">
									<p>Effect of variation of ErbB2+ correlation</p>
								</entry>
								<entry colname="c2">
									<p>5 years</p>
								</entry>
								<entry colname="c3">
									<p>1.000</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.5652</p>
								</entry>
								<entry colname="c5">
									<p>Max</p>
								</entry>
								<entry colname="c6">
									<p>&#8722;0.3868</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>10 years</p>
								</entry>
								<entry colname="c3">
									<p>0.6063</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.5744</p>
								</entry>
								<entry colname="c5">
									<p>Max</p>
								</entry>
								<entry colname="c6">
									<p>&#8722;0.5590</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>20 years</p>
								</entry>
								<entry colname="c3">
									<p>0.2063</p>
								</entry>
								<entry colname="c4">
									<p>&#8722;0.5744</p>
								</entry>
								<entry colname="c5">
									<p>0.2063</p>
								</entry>
								<entry colname="c6">
									<p>&#8722;0.5744</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
				<p>The hazard function increases with both time and correlation, as we observe in the hazard graphs found in Figure 
					<figr fid="F4">4</figr>.</p>
				<p>We now investigate how ErbB2+ correlation affects the probability of survival beyond times of 5, 10, and 20 years. The graphs representing these survival curves appear in Figure 
					<figr fid="F5">5</figr>. The general effect is the same as that just observed with CSR correlation, but of a smaller magnitude. Along with a smaller overall magnitude of effect, the correlation must also reach higher levels in order to achieve its level of maximal effect. Table 
					<tblr tid="T5">5</tblr> also describes the maximum rate of decrease in these survival functions and the corresponding ErbB2+ correlations. In comparison with the CSR correlation, the rates of decrease of survival with respect to ErbB2+ correlation are considerably lower, with the maximum rate of decrease for the CERBB variable being approximately half that of the CSR variable, and requiring a higher level of correlation.</p>
				<fig id="F5"><title><p>Figure 5</p></title><caption><p>Survival at 5, 10, and 20 years, as functions of ErbB2+ correlation</p></caption><text>
   <p>
      <b>Survival at 5, 10, and 20 years, as functions of ErbB2+ correlation.</b>
   </p>
</text><graphic file="1755-8794-5-63-5"/></fig>
				<p>To further illustrate the quantitative difference for these two variables, we give Table 
					<tblr tid="T6">6</tblr> below, representing the probability of survival beyond 5 and 10 years at several levels of CSR correlation and ErbB2+ correlation. For the CSR columns, the ErbB2+ correlation is held at its median, and likewise the CSR correlation is fixed at its median level for the ErbB2+ columns. The stronger influence of the CSR correlation on survival can be seen in the wider variation in the range of survival probabilities with CSR correlation.</p>
				<table id="T6">
					<title>
						<p>Table 6</p>
					</title>
					<caption>
						<p>
							<b>Probabilities of 5 year and 10 year survival with varying CSR and ErbB2+ correlations</b>
						</p>
					</caption>
					<tgroup align="left" cols="6">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="left" colname="c5" colnum="5" colwidth="1*"/>
						<colspec align="left" colname="c6" colnum="6" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1" nameend="c6" namest="c1">
									<p>
										<b>5 year survival:</b>
									</p>
								</entry>
							</row>
						</thead>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Correlation</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>
										<b>CSR</b>
									</p>
								</entry>
								<entry colname="c3">
									<p>
										<b>ErbB2+</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Correlation</b>
									</p>
								</entry>
								<entry colname="c5">
									<p>
										<b>CSR</b>
									</p>
								</entry>
								<entry colname="c6">
									<p>
										<b>ErbB2+</b>
									</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>&#8722;0.3</p>
								</entry>
								<entry colname="c2">
									<p>0.9299</p>
								</entry>
								<entry colname="c3">
									<p>0.8967</p>
								</entry>
								<entry colname="c4">
									<p>0.1</p>
								</entry>
								<entry colname="c5">
									<p>0.8101</p>
								</entry>
								<entry colname="c6">
									<p>0.8157</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>&#8722;0.2</p>
								</entry>
								<entry colname="c2">
									<p>0.9095</p>
								</entry>
								<entry colname="c3">
									<p>0.8803</p>
								</entry>
								<entry colname="c4">
									<p>0.2</p>
								</entry>
								<entry colname="c5">
									<p>0.7597</p>
								</entry>
								<entry colname="c6">
									<p>0.7881</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>&#8722;0.1</p>
								</entry>
								<entry colname="c2">
									<p>0.8836</p>
								</entry>
								<entry colname="c3">
									<p>0.8615</p>
								</entry>
								<entry colname="c4">
									<p>0.3</p>
								</entry>
								<entry colname="c5">
									<p>0.6987</p>
								</entry>
								<entry colname="c6">
									<p>0.7570</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>0</p>
								</entry>
								<entry colname="c2">
									<p>0.8509</p>
								</entry>
								<entry colname="c3">
									<p>0.8401</p>
								</entry>
								<entry colname="c4">
									<p>0.4</p>
								</entry>
								<entry colname="c5">
									<p>0.6263</p>
								</entry>
								<entry colname="c6">
									<p>0.7223</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" nameend="c6" namest="c1">
									<p>
										<b>10 year survival:</b>
									</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Correlation</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>
										<b>CSR</b>
									</p>
								</entry>
								<entry colname="c3">
									<p>
										<b>ErbB2+</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Correlation</b>
									</p>
								</entry>
								<entry colname="c5">
									<p>
										<b>CSR</b>
									</p>
								</entry>
								<entry colname="c6">
									<p>
										<b>ErbB2+</b>
									</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>&#8722;0.3</p>
								</entry>
								<entry colname="c2">
									<p>0.8506</p>
								</entry>
								<entry colname="c3">
									<p>0.7844</p>
								</entry>
								<entry colname="c4">
									<p>0.1</p>
								</entry>
								<entry colname="c5">
									<p>0.6256</p>
								</entry>
								<entry colname="c6">
									<p>0.6354</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>&#8722;0.2</p>
								</entry>
								<entry colname="c2">
									<p>0.8097</p>
								</entry>
								<entry colname="c3">
									<p>0.7528</p>
								</entry>
								<entry colname="c4">
									<p>0.2</p>
								</entry>
								<entry colname="c5">
									<p>0.5423</p>
								</entry>
								<entry colname="c6">
									<p>0.5885</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>&#8722;0.1</p>
								</entry>
								<entry colname="c2">
									<p>0.7592</p>
								</entry>
								<entry colname="c3">
									<p>0.7177</p>
								</entry>
								<entry colname="c4">
									<p>0.3</p>
								</entry>
								<entry colname="c5">
									<p>0.4500</p>
								</entry>
								<entry colname="c6">
									<p>0.5380</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>0</p>
								</entry>
								<entry colname="c2">
									<p>0.6981</p>
								</entry>
								<entry colname="c3">
									<p>0.6784</p>
								</entry>
								<entry colname="c4">
									<p>0.4</p>
								</entry>
								<entry colname="c5">
									<p>0.3528</p>
								</entry>
								<entry colname="c6">
									<p>0.4845</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
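				<p>The comparison drawn from Table 
					<tblr tid="T6">6</tblr> can be reproduced with a short computation. The following Python sketch is illustrative only and was not part of the original analysis. The names SURVIVAL, survival_spread, and largest_step_drop are hypothetical, and the values are transcribed from the table and rounded to four decimals.</p>

```python
# Survival probabilities transcribed from Table 6, rounded to four decimals.
# Each list runs over correlation levels -0.3, -0.2, ..., 0.4 (steps of 0.1).
# Keys are (variable, years); all names here are illustrative assumptions.
SURVIVAL = {
    ("CSR", 5): [0.9299, 0.9095, 0.8836, 0.8509, 0.8101, 0.7597, 0.6987, 0.6263],
    ("ErbB2+", 5): [0.8967, 0.8803, 0.8615, 0.8401, 0.8157, 0.7881, 0.7570, 0.7223],
    ("CSR", 10): [0.8506, 0.8097, 0.7592, 0.6981, 0.6256, 0.5423, 0.4500, 0.3528],
    ("ErbB2+", 10): [0.7844, 0.7528, 0.7177, 0.6784, 0.6354, 0.5885, 0.5380, 0.4845],
}


def survival_spread(values):
    """Largest minus smallest tabulated survival probability."""
    return max(values) - min(values)


def largest_step_drop(values):
    """Largest decrease in survival between adjacent correlation levels."""
    return max(a - b for a, b in zip(values, values[1:]))


if __name__ == "__main__":
    for (variable, years), values in SURVIVAL.items():
        print(variable, years,
              round(survival_spread(values), 4),
              round(largest_step_drop(values), 4))
```

				<p>Under these assumptions, the spread of 10 year survival is about 0.50 across the tabulated CSR correlations and about 0.30 across the tabulated ErbB2+ correlations.</p>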
				<p>We consider how the survival function depends on both of these continuous variables. Note that since Table 
					<tblr tid="T6">6</tblr> always fixes one of these variables at its median level, it does not show either the highest or the lowest extremes of survival. To study the dependence of survival on both CSR correlation and ErbB2+ correlation, it is necessary to consider the survival function S[x,y,t] as a function of the variables x (CSR), y (ErbB2+), and time t. We can represent S[x,y,t<sub>0</sub>] as a three-dimensional graph for any fixed value of t<sub>0</sub>. In Figure 
					<figr fid="F6">6</figr> we consider survival beyond 10 years, letting t<sub>0</sub> = 10. The other variables are fixed at the median age and a seventy-gene signature representing a poor prognosis.</p>
				<fig id="F6"><title><p>Figure 6</p></title><caption><p>Survival beyond 10 years for CSR and ErbB2+ correlation</p></caption><text>
   <p>
      <b>Survival beyond 10 years for CSR and ErbB2+ correlation.</b>
   </p>
</text><graphic file="1755-8794-5-63-6"/></fig>
				<p>The dotted and dashed curves along the surface of this graph correspond to the 10 year (dotted) survival curves in Figures 
					<figr fid="F4">4</figr> and 
					<figr fid="F5">5</figr>, respectively. These are the cases of varying CSR correlation (CSR) at the median level of ErbB2+ correlation (CERBB), and of varying ErbB2+ correlation (CERBB) at the median level of CSR correlation (CSR). The values in Table 
					<tblr tid="T6">6</tblr> above correspond to the appropriate points along these curves. Inspection of the surface of the graph in Figure 
					<figr fid="F6">6</figr> shows clearly that a much wider range of interaction of these variables CSR and CERBB is possible beyond the points on the two curves.</p>
				<p>The graph in Figure 
					<figr fid="F6">6</figr> and the above computations describe the interaction of the two correlation variables for the fixed time of 10 years. In Figure 
					<figr fid="F7">7</figr> we explore how each of the variables CSR and CERBB interacts with time in predicting survival. In each of these three-dimensional graphs a poor prognosis is assumed from the seventy-gene signature, while the other variables are held at their median levels.</p>
				<fig id="F7"><title><p>Figure 7</p></title><caption><p>Survival as a function of time and correlation</p></caption><text>
   <p>
      <b>Survival as a function of time and correlation.</b>
   </p>
</text><graphic file="1755-8794-5-63-7"/></fig>
				<p>The comparative effects of CSR correlation and ErbB2+ correlation are evident from these graphs. At each time, a change in CSR correlation has a much larger impact on survival than the same change in ErbB2+ correlation. Similarly, for each given level of correlation, the decrease in survival probability with respect to time is much larger for CSR correlation.</p>
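				<p>The relative strength of the two variables can also be summarized by a single number per variable. The sketch below assumes the proportional hazards form S(t|x) = S<sub>0</sub>(t)^exp(bx), under which log(-log S) at a fixed time is linear in the covariate x with slope b when the other variables are held fixed. It estimates that slope by least squares from the 10 year values in Table 
					<tblr tid="T6">6</tblr>, rounded to four decimals. The function names are hypothetical, and the result is a rough check on the tabulated values, not a substitute for the fitted coefficients of the model.</p>

```python
import math

# Correlation levels and 10 year survival probabilities transcribed from
# Table 6 (rounded to four decimals). Names are illustrative assumptions.
CORRELATION = [-0.3, -0.2, -0.1, 0.0, 0.1, 0.2, 0.3, 0.4]
SURVIVAL_10Y = {
    "CSR": [0.8506, 0.8097, 0.7592, 0.6981, 0.6256, 0.5423, 0.4500, 0.3528],
    "ErbB2+": [0.7844, 0.7528, 0.7177, 0.6784, 0.6354, 0.5885, 0.5380, 0.4845],
}


def cloglog(survival):
    """Complementary log-log transform: log of the cumulative hazard."""
    return math.log(-math.log(survival))


def least_squares_slope(xs, ys):
    """Ordinary least squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def implied_log_hazard_slope(survival_values):
    """Slope of log(-log S) against correlation, assuming proportional hazards."""
    transformed = [cloglog(s) for s in survival_values]
    return least_squares_slope(CORRELATION, transformed)


if __name__ == "__main__":
    for name, values in SURVIVAL_10Y.items():
        print(name, round(implied_log_hazard_slope(values), 2))
```

				<p>With the tabulated values, the implied slope for CSR correlation is roughly 2.7 and that for ErbB2+ correlation is roughly 1.6, consistent with the larger effect of CSR correlation described above.</p>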
				<p>Since the function (2) with the parameter values estimated by the model contains all of this information, it is possible to compute probabilities of survival to any time for any given combination of the variables. As representative examples of the types of computations that can be made, in Table 
					<tblr tid="T7">7</tblr> we give probability of survival beyond 10 years, probability of survival beyond 20 years, and the conditional probability of survival beyond 20 years given survival to 10 years. The variables are at median level unless otherwise mentioned. Low levels of CSR or ErbB2+ correlation correspond to the tenth percentile, while high levels correspond to the ninetieth percentile.</p>
				<table id="T7">
					<title>
						<p>Table 7</p>
					</title>
					<caption>
						<p>
							<b>Explicit computation of survival probabilities for representative cases</b>
						</p>
					</caption>
					<tgroup align="left" cols="5">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="left" colname="c5" colnum="5" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2"/>
								<entry colname="c3">
									<p>
										<b>10 years</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>20 years</b>
									</p>
								</entry>
								<entry colname="c5">
									<p>
										<b>20 years | 10 years</b>
									</p>
								</entry>
							</row>
						</thead>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2">
									<p>Good prognosis</p>
								</entry>
								<entry colname="c3">
									<p>0.8988</p>
								</entry>
								<entry colname="c4">
									<p>0.8193</p>
								</entry>
								<entry colname="c5">
									<p>0.9116</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1"/>
								<entry colname="c2">
									<p>Poor prognosis</p>
								</entry>
								<entry colname="c3">
									<p>0.7020</p>
								</entry>
								<entry colname="c4">
									<p>0.5164</p>
								</entry>
								<entry colname="c5">
									<p>0.7357</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="3">
									<p>Good prognosis</p>
								</entry>
								<entry colname="c2">
									<p>Low CSR</p>
								</entry>
								<entry colname="c3">
									<p>0.9428</p>
								</entry>
								<entry colname="c4">
									<p>0.8958</p>
								</entry>
								<entry colname="c5">
									<p>0.9502</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>High CSR</p>
								</entry>
								<entry colname="c3">
									<p>0.8114</p>
								</entry>
								<entry colname="c4">
									<p>0.6769</p>
								</entry>
								<entry colname="c5">
									<p>0.8342</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Low ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>0.9214</p>
								</entry>
								<entry colname="c4">
									<p>0.8583</p>
								</entry>
								<entry colname="c5">
									<p>0.9315</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>High ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>0.8420</p>
								</entry>
								<entry colname="c4">
									<p>0.7253</p>
								</entry>
								<entry colname="c5">
									<p>0.8614</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1" morerows="3">
									<p>Poor prognosis</p>
								</entry>
								<entry colname="c2">
									<p>Low CSR</p>
								</entry>
								<entry colname="c3">
									<p>0.8225</p>
								</entry>
								<entry colname="c4">
									<p>0.6942</p>
								</entry>
								<entry colname="c5">
									<p>0.8440</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>High CSR</p>
								</entry>
								<entry colname="c3">
									<p>0.5001</p>
								</entry>
								<entry colname="c4">
									<p>0.2742</p>
								</entry>
								<entry colname="c5">
									<p>0.5482</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>Low ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>0.7624</p>
								</entry>
								<entry colname="c4">
									<p>0.6024</p>
								</entry>
								<entry colname="c5">
									<p>0.7903</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c2">
									<p>High ErbB2+</p>
								</entry>
								<entry colname="c3">
									<p>0.5654</p>
								</entry>
								<entry colname="c4">
									<p>0.3447</p>
								</entry>
								<entry colname="c5">
									<p>0.6097</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
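				<p>The last column of Table 
					<tblr tid="T7">7</tblr> follows from the other two, since the conditional probability of survival beyond 20 years given survival to 10 years is S(20)/S(10). The following Python sketch checks this relation against the tabulated values. It is illustrative only; the names TABLE7 and conditional_survival are hypothetical, and small discrepancies in the fourth decimal are expected because the inputs are themselves rounded.</p>

```python
# Rows of Table 7: (10 year survival, 20 year survival, reported conditional).
# Keys are (seventy-gene prognosis, profile); names are illustrative assumptions.
TABLE7 = {
    ("Good", "median"): (0.8988, 0.8193, 0.9116),
    ("Poor", "median"): (0.7020, 0.5164, 0.7357),
    ("Good", "Low CSR"): (0.9428, 0.8958, 0.9502),
    ("Good", "High CSR"): (0.8114, 0.6769, 0.8342),
    ("Good", "Low ErbB2+"): (0.9214, 0.8583, 0.9315),
    ("Good", "High ErbB2+"): (0.8420, 0.7253, 0.8614),
    ("Poor", "Low CSR"): (0.8225, 0.6942, 0.8440),
    ("Poor", "High CSR"): (0.5001, 0.2742, 0.5482),
    ("Poor", "Low ErbB2+"): (0.7624, 0.6024, 0.7903),
    ("Poor", "High ErbB2+"): (0.5654, 0.3447, 0.6097),
}


def conditional_survival(s_later, s_earlier):
    """P(T > t2 | T > t1) = S(t2) / S(t1) for t2 >= t1."""
    if s_earlier > 0:
        return s_later / s_earlier
    raise ValueError("survival at the earlier time must be positive")


if __name__ == "__main__":
    for key, (s10, s20, reported) in TABLE7.items():
        computed = conditional_survival(s20, s10)
        print(key, round(computed, 4), reported)
```

				<p>The same ratio can be formed for any pair of times once the survival function has been evaluated at both.</p>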
				<p>In this four-variable model we observed how each of the three gene expression variables influenced the survival and hazard functions for breast cancer patients. For the two continuous gene expression variables, CSR correlation and ErbB2+ correlation, we analyzed the effect of changes in the level of gene expression. We were able to assess the combined effect of these variables, and also to examine them separately and compare their effects, as in the above comparison of the effects of changes in CSR correlation and ErbB2+ correlation. Because the hypertabastic survival model produces explicit hazard and survival functions, we were able to analyze these dynamics. Additionally, we were able to compute explicit survival probabilities for any given patient profile. In concluding this survival analysis using several clinical and gene expression variables, we mention our recent work 
					<abbrgrp>
						<abbr bid="B3">3</abbr>
					</abbrgrp>, in which we investigate the role of metastasis in survival analysis and its interactions with the other covariates.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st>
			<p>The new model presented in this article combines several features not included in previous models in survival analysis of breast cancer patients. Through use of the hypertabastic survival model, a parametric model, we attain a better-fitting model. It furthermore offers explicitly defined hazard and survival functions for use as tools in analysis. As demonstrated in this article, these functions can be used for computation of probabilities, such as those given in the tables above. Furthermore, analysis of these functions over time allows scientists to study the progression of hazard and the decline in survival for these patients. The influence of the variables, collectively or individually, on this time course can also be investigated. This analysis illustrates the value of parametric models in survival analysis in cases where a suitable distribution can be found that is close enough to the underlying distribution of the data. We recommend consideration of the hypertabastic distribution, as it is shown in 
				<abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp> and in the current paper to have a good fit to breast cancer survival data. Furthermore, simulations 
				<abbrgrp>
					<abbr bid="B2">2</abbr>
				</abbrgrp> have shown it to be robust with respect to departures from the assumed distribution. The ability of the hypertabastic distribution to adjust its shape allows a more accurate representation of the time course of the hazard and survival functions. In the context of current work by scientists developing gene expression variables for clinical use, these features of the model become even more significant.</p>
			<p>A novel feature of the current model, the investigation of the collective behavior of distinct gene expression variables, offers an important new direction of research. The three gene expression variables included in this model originate from three distinct types of gene expression signatures: one representing early distant metastasis, one representing the relation of the wound healing microenvironment to that of tumor progression, and one representing the classification of breast cancer tumors into molecular subtypes. Furthermore, the model gives a means to determine quantitatively the relative contribution of each variable in determining survival and hazard. For the two continuous gene expression variables we were also able to investigate the rate of change of hazard and survival with respect to change in the level of gene expression.</p>
			<p>By considering a wider range of gene expression variables together with clinical variables, this model has moved beyond previous models toward a quantitative assessment of hazard and survival involving all relevant information. These results show the potential to use multiple gene expression signatures to greater combined effect when the signatures represent different aspects of the cancer biology. We note, however, that the current model has limitations in its representation of potential interactions between the various gene expression signatures. We consider interactions among gene expression variables, as well as with other variables, to be a critical issue for current research. We propose further investigations in this direction, as well as development of new and more refined models designed for this purpose. Certainly the new generation of gene signatures being developed for clinical use 
				<abbrgrp>
					<abbr bid="B17">17</abbr>
					<abbr bid="B18">18</abbr>
				</abbrgrp> should also be explored for their potential interactions and combined effects. As an extension of this work, we have explored the effect of an additional variable representing metastasis in a recent paper 
				<abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp>, particularly in relation to the other variables in the model. We also propose to make a similar analysis after dividing the breast cancer cases into several different classes, such as estrogen receptor positive versus estrogen receptor negative cancers, or for the molecular subtypes based on the correlation variables CNorm, CERBB, CLumA, CLumB, and CBas. Another important direction for future research will be identification and analysis of variables that either cause the metastasis of tumors or that accelerate this process.</p>
		</sec>
		<sec>
			<st>
				<p>Abbreviations</p>
			</st>
			<p>ErbB2: v-erb-b2 erythroblastic leukemia viral oncogene homolog 2; HER2: Human epidermal growth factor receptor 2; CSR: Core Serum Response; AIC: Akaike Information Criterion; ER: Estrogen Receptor.</p>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>The authors declare they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors&#8217; contributions</p>
			</st>
			<p>The work presented in this paper was carried out in collaboration among all authors. M.A.T. and W.M.E. applied the hypertabastic proportional hazards model to the breast cancer data, analyzed and interpreted the data, and wrote the paper. N.N. and K.P.S. participated in the interpretation and analysis of the data and gave technical assistance. H.L. assisted with running the SAS aspects of the program for the hypertabastic proportional hazards model, as well as the log-logistic, Weibull, and Cox regression cases. H.L. also participated in discussion of the results. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>This research was partially supported by the National Institutes of Health grant P30 CA13148.</p>
			</sec>
		</ack>
		<refgrp><bibl id="B1"><title><p>Hypertabastic survival model</p></title><aug><au><snm>Tabatabai</snm><fnm>MA</fnm></au><au><snm>Bursac</snm><fnm>Z</fnm></au><au><snm>Williams</snm><fnm>DK</fnm></au><au><snm>Singh</snm><fnm>KP</fnm></au></aug><source>Theor. Biol. and Med. Model</source><pubdate>2007</pubdate><volume>4</volume><fpage>40</fpage><xrefbib><pubid idtype="doi">10.1186/1742-4682-4-40</pubid></xrefbib></bibl><bibl id="B2"><title><p>A simulation study of performance of hypertabastic and hyperbolastic survival models in comparison with classic survival models</p></title><aug><au><snm>Bursac</snm><fnm>Z</fnm></au><au><snm>Tabatabai</snm><fnm>M</fnm></au><au><snm>Williams</snm><fnm>DK</fnm></au><au><snm>Singh</snm><fnm>K</fnm></au></aug><source>Proc. 2008 American statistical assoc. Biometrics section (CD-ROM)</source><publisher>Alexandria, VA: American Statistical Association</publisher><pubdate>2009</pubdate><fpage>617</fpage><lpage>622</lpage><note>Alexandria, VA</note></bibl><bibl id="B3"><title><p>Role of metastasis in hypertabastic survival analysis of breast cancer: Interactions with clinical and gene expression variables</p></title><aug><au><snm>Tabatabai</snm><fnm>M</fnm></au><au><snm>Eby</snm><fnm>W</fnm></au><au><snm>Nimeh</snm><fnm>N</fnm></au><au><snm>Singh</snm><fnm>K</fnm></au></aug><source>Cancer Growth and Metastasis</source><pubdate>2012</pubdate><volume>5</volume><fpage>1</fpage><lpage>17</lpage><xrefbib><pubid idtype="doi">10.4137/CGM.S8821</pubid></xrefbib></bibl><bibl id="B4"><title><p>Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications</p></title><aug><au><snm>S&#511;rlie</snm><fnm>T</fnm></au><au><snm>Perou</snm><fnm>CM</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au><etal/></aug><source>PNAS</source><pubdate>2001</pubdate><volume>98</volume><fpage>10869</fpage><lpage>10874</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.191367098</pubid><pubid 
idtype="pmcid">58566</pubid><pubid idtype="pmpid" link="fulltext">11553815</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Gene expression profiling predicts clinical outcome of breast cancer</p></title><aug><au><snm>Van&#8217;t Veer</snm><fnm>LJ</fnm></au><au><snm>Dai</snm><fnm>H</fnm></au><au><snm>van de Vijver</snm><fnm>MJ</fnm></au><etal/></aug><source>Nature</source><pubdate>2002</pubdate><volume>415</volume><fpage>530</fpage><lpage>536</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/415530a</pubid><pubid idtype="pmpid" link="fulltext">11823860</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Breast cancer classification and prognosis based on gene expression profiles from a population-based study</p></title><aug><au><snm>Sotiriou</snm><fnm>C</fnm></au><au><snm>Neo</snm><fnm>SY</fnm></au><au><snm>McShane</snm><fnm>LM</fnm></au><etal/></aug><source>PNAS</source><pubdate>2003</pubdate><volume>100</volume><fpage>10393</fpage><lpage>10398</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.1732912100</pubid><pubid idtype="pmcid">193572</pubid><pubid idtype="pmpid" link="fulltext">12917485</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer</p></title><aug><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Klijn</snm><fnm>JGM</fnm></au><au><snm>Zhang</snm><fnm>Y</fnm></au><etal/></aug><source>Lancet</source><pubdate>2005</pubdate><volume>365</volume><fpage>671</fpage><lpage>679</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15721472</pubid></xrefbib></bibl><bibl id="B8"><title><p>Tumors: Wounds that do not heal</p></title><aug><au><snm>Dvorak</snm><fnm>HF</fnm></au></aug><source>NEJM</source><pubdate>1986</pubdate><volume>315</volume><fpage>1650</fpage><lpage>1659</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1056/NEJM198612253152606</pubid><pubid idtype="pmpid" 
link="fulltext">3537791</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Gene expression signature of fibroblast serum response predicts human cancer progression: Similarities between tumors and wounds</p></title><aug><au><snm>Chang</snm><fnm>HY</fnm></au><au><snm>Sneddon</snm><fnm>JB</fnm></au><au><snm>Alizadeh</snm><fnm>AA</fnm></au><etal/></aug><source>PLoS Biology</source><pubdate>2004</pubdate><volume>2</volume><fpage>206</fpage><lpage>214</lpage><xrefbib><pubid idtype="doi">10.1371/journal.pbio.0020206</pubid></xrefbib></bibl><bibl id="B10"><title><p>Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival</p></title><aug><au><snm>Chang</snm><fnm>HY</fnm></au><au><snm>Nuyten</snm><fnm>DS</fnm></au><au><snm>Sneddon</snm><fnm>JB</fnm></au><etal/></aug><source>PNAS</source><pubdate>2005</pubdate><volume>102</volume><fpage>3738</fpage><lpage>3743</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0409462102</pubid><pubid idtype="pmcid">548329</pubid><pubid idtype="pmpid" link="fulltext">15701700</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Taking gene-expression profiling to the clinic: when will molecular signatures become relevant to patient care?</p></title><aug><au><snm>Sotiriou</snm><fnm>C</fnm></au><au><snm>Piccart</snm><fnm>MJ</fnm></au></aug><source>Nat Rev Cancer</source><pubdate>2007</pubdate><volume>7</volume><fpage>545</fpage><lpage>553</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrc2173</pubid><pubid idtype="pmpid" link="fulltext">17585334</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Building prognostic models for breast cancer patients using clinical variables and hundreds of gene expression signatures</p></title><aug><au><snm>Fan</snm><fnm>C</fnm></au><au><snm>Prat</snm><fnm>A</fnm></au><au><snm>Parker</snm><fnm>JS</fnm></au><etal/></aug><source>BMC Medical 
Genomics</source><pubdate>2011</pubdate><volume>4</volume><issue>3</issue><fpage>1</fpage><lpage>15</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3023653</pubid><pubid idtype="pmpid" link="fulltext">21208432</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Treatment of pT1N0 breast cancer: multigene predictors to assess risk of relapse</p></title><aug><au><snm>Fumagalli</snm><fnm>D</fnm></au><au><snm>Sotiriou</snm><fnm>C</fnm></au></aug><source>Annals Oncol</source><pubdate>2010</pubdate><volume>21</volume><fpage>vii103</fpage><lpage>vii106</lpage><xrefbib><pubid idtype="doi">10.1093/annonc/mdq423</pubid></xrefbib></bibl><bibl id="B14"><title><p>Comparison of prognostic gene expression signatures for breast cancer</p></title><aug><au><snm>Haibe-Kains</snm><fnm>B</fnm></au><au><snm>Desmedt</snm><fnm>C</fnm></au><au><snm>Piette</snm><fnm>F</fnm></au><etal/></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>394</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-394</pubid><pubid idtype="pmcid">2533026</pubid><pubid idtype="pmpid" link="fulltext">18717985</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Concordance among gene-expression based predictors for breast cancer</p></title><aug><au><snm>Fan</snm><fnm>C</fnm></au><au><snm>Oh</snm><fnm>DS</fnm></au><au><snm>Wessels</snm><fnm>L</fnm></au><etal/></aug><source>NEJM</source><pubdate>2006</pubdate><volume>355</volume><fpage>560</fpage><lpage>569</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1056/NEJMoa052933</pubid><pubid idtype="pmpid" link="fulltext">16899776</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>A fuzzy gene expression-based computational approach improves breast cancer prognostication</p></title><aug><au><snm>Haibe-Kains</snm><fnm>B</fnm></au><au><snm>Desmedt</snm><fnm>C</fnm></au><au><snm>Roth&#233;</snm><fnm>F</fnm></au><etal/></aug><source>Genome 
Biol</source><pubdate>2010</pubdate><volume>11</volume><fpage>R18</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-2-r18</pubid><pubid idtype="pmcid">2872878</pubid><pubid idtype="pmpid" link="fulltext">20156340</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Clinical application of the 70-gene profile: the MINDACT trial</p></title><aug><au><snm>Cardoso</snm><fnm>F</fnm></au><au><snm>van&#8217;t Veer</snm><fnm>L</fnm></au><au><snm>Rutgers</snm><fnm>E</fnm></au><etal/></aug><source>J Clin Oncol</source><pubdate>2008</pubdate><volume>26</volume><fpage>729</fpage><lpage>735</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1200/JCO.2007.14.3222</pubid><pubid idtype="pmpid" link="fulltext">18258980</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Development of the 21-gene assay and its application in clinical practice and clinical trials</p></title><aug><au><snm>Sparano</snm><fnm>JA</fnm></au><au><snm>Paik</snm><fnm>S</fnm></au></aug><source>J Clin Oncol</source><pubdate>2008</pubdate><volume>26</volume><fpage>721</fpage><lpage>728</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1200/JCO.2007.15.1068</pubid><pubid idtype="pmpid" link="fulltext">18258979</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Stromal gene expression predicts clinical outcome in breast cancer</p></title><aug><au><snm>Finak</snm><fnm>G</fnm></au><au><snm>Bertos</snm><fnm>N</fnm></au><au><snm>Pepin</snm><fnm>F</fnm></au><etal/></aug><source>Nature Med</source><pubdate>2008</pubdate><volume>14</volume><fpage>518</fpage><lpage>527</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nm1764</pubid><pubid idtype="pmpid" link="fulltext">18438415</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>An immune response gene expression module identifies a good prognosis subtype in oestrogen receptor negative breast 
cancer</p></title><aug><au><snm>Teschendorff</snm><fnm>AE</fnm></au><au><snm>Miremadi</snm><fnm>A</fnm></au><au><snm>Pinder</snm><fnm>SE</fnm></au><etal/></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><fpage>R157</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2007-8-8-r157</pubid><pubid idtype="pmcid">2374988</pubid><pubid idtype="pmpid" link="fulltext">17683518</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>PIK3CA mutations associated with gene signature of low mTORC1 signaling and better outcomes in estrogen receptor-positive breast cancer</p></title><aug><au><snm>Loi</snm><fnm>S</fnm></au><au><snm>Haibe-Kains</snm><fnm>B</fnm></au><au><snm>Majjaj</snm><fnm>S</fnm></au><etal/></aug><source>PNAS</source><pubdate>2010</pubdate><volume>107</volume><fpage>10208</fpage><lpage>10213</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0907011107</pubid><pubid idtype="pmcid">2890442</pubid><pubid idtype="pmpid" link="fulltext">20479250</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Application of microarrays for the prediction of therapy response in breast cancer</p></title><aug><au><snm>Gy&#246;rffy</snm><fnm>B</fnm></au><au><snm>Surowiak</snm><fnm>P</fnm></au><au><snm>Lage</snm><fnm>H</fnm></au></aug><source>Cancer Genom &amp; Proteom</source><pubdate>2005</pubdate><volume>2</volume><fpage>255</fpage><lpage>264</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">22706117</pubid></xrefbib></bibl><bibl id="B23"><title><p>A population-based gene signature is predictive of breast cancer survival and chemoresponse</p></title><aug><au><snm>Rathnagiriswaran</snm><fnm>S</fnm></au><au><snm>Wan</snm><fnm>Y</fnm></au><au><snm>Abraham</snm><fnm>J</fnm></au><etal/></aug><source>Intl J Oncol</source><pubdate>2010</pubdate><volume>36</volume><fpage>607</fpage><lpage>616</lpage></bibl><bibl id="B24"><title><p>Gene expression profiles derived from fine needle aspiration correlate with response to systemic 
chemotherapy in breast cancer</p></title><aug><au><snm>Sotiriou</snm><fnm>C</fnm></au><au><snm>Powles</snm><fnm>TJ</fnm></au><au><snm>Dowsett</snm><fnm>M</fnm></au><etal/></aug><source>Breast Cancer Res</source><pubdate>2002</pubdate><volume>4</volume><issue>R3</issue><fpage>1</fpage><lpage>8</lpage><xrefbib><pubidlist><pubid idtype="pmcid">138710</pubid><pubid idtype="pmpid" link="fulltext">11879551</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>A genomic predictor of response and survival following taxane-anthracycline chemotherapy for invasive breast cancer</p></title><aug><au><snm>Hatzis</snm><fnm>C</fnm></au><au><snm>Pusztai</snm><fnm>L</fnm></au><au><snm>Valero</snm><fnm>V</fnm></au><etal/></aug><source>JAMA</source><pubdate>2011</pubdate><volume>305</volume><issue>18</issue><fpage>1873</fpage><lpage>1881</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1001/jama.2011.593</pubid><pubid idtype="pmpid" link="fulltext">21558518</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Patterns of resistance and incomplete resistance to docetaxel (Taxotere) by gene expression profiling in breast cancer patients</p></title><aug><au><snm>Chang</snm><fnm>JC</fnm></au><au><snm>Wooten</snm><fnm>EC</fnm></au><au><snm>Tsimelzon</snm><fnm>A</fnm></au><etal/></aug><source>J Clin Oncol</source><pubdate>2005</pubdate><volume>23</volume><fpage>1169</fpage><lpage>1177</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1200/JCO.2005.03.156</pubid><pubid idtype="pmpid" link="fulltext">15718313</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>A gene-expression signature as a predictor of survival in breast cancer</p></title><aug><au><snm>van de Vijver</snm><fnm>MJ</fnm></au><au><snm>He</snm><fnm>YD</fnm></au><au><snm>van&#8217;t Veer</snm><fnm>LJ</fnm></au><etal/></aug><source>NEJM</source><pubdate>2002</pubdate><volume>347</volume><fpage>1999</fpage><lpage>2009</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1056/NEJMoa021967</pubid><pubid 
idtype="pmpid" link="fulltext">12490681</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>New look at statistical model identification</p></title><aug><au><snm>Akaike</snm><fnm>H</fnm></au></aug><source>IEEE Trans Autom Control</source><pubdate>1974</pubdate><volume>19</volume><issue>6</issue><fpage>716</fpage><lpage>723</lpage><xrefbib><pubid idtype="doi">10.1109/TAC.1974.1100705</pubid></xrefbib></bibl><bibl id="B29"><title><p>The Her-2/neu gene and protein in breast cancer 2003: Biomarker and target of therapy</p></title><aug><au><snm>Ross</snm><fnm>JS</fnm></au><au><snm>Fletcher</snm><fnm>JA</fnm></au><au><snm>Linette</snm><fnm>GP</fnm></au><etal/></aug><source>Oncologist</source><pubdate>2003</pubdate><volume>8</volume><fpage>307</fpage><lpage>325</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1634/theoncologist.8-4-307</pubid><pubid idtype="pmpid" link="fulltext">12897328</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>HER (erbB) tyrosine kinase inhibitors in the treatment of breast cancer</p></title><aug><au><snm>Arteaga</snm><fnm>CL</fnm></au><au><snm>Moulder</snm><fnm>SL</fnm></au><au><snm>Yakes</snm><fnm>FM</fnm></au></aug><source>Semin Oncol</source><pubdate>2002</pubdate><volume>29</volume><issue>3 Suppl. 11</issue><fpage>4</fpage><lpage>10</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12170445</pubid></xrefbib></bibl><bibl id="B31"><title><p>Dysregulation of cellular signaling by HER2/neu in breast cancer</p></title><aug><au><snm>Zhou</snm><fnm>BP</fnm></au><au><snm>Hung</snm><fnm>MC</fnm></au></aug><source>Semin Oncol</source><pubdate>2003</pubdate><volume>30</volume><issue>Suppl 16</issue><fpage>38</fpage><lpage>48</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">14613025</pubid></xrefbib></bibl></refgrp>
	<sec><st><p>Pre-publication history</p></st><p>The pre-publication history for this paper can be accessed here:</p><p><url>http://www.biomedcentral.com/1755-8794/5/63/prepub</url></p></sec></bm>
</art>