<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-18</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Incorporating pathway information into boosting estimation of high-dimensional risk prediction models</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Binder</snm>
               <fnm>Harald</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>binderh@fdm.uni-freiburg.de</email>
            </au>
            <au id="A2">
               <snm>Schumacher</snm>
               <fnm>Martin</fnm>
               <insr iid="I1"/>
               <email>ms@imbi.uni-freiburg.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Medical Biometry and Statistics, University Medical Center Freiburg, Stefan-Meier-Str 26, 79104 Freiburg, Germany</p>
            </ins>
            <ins id="I2">
               <p>Freiburg Center for Data Analysis and Modeling, University of Freiburg, Eckerstr 1, 79104 Freiburg, Germany</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>18</fpage>
         <url>http://www.biomedcentral.com/1471-2105/10/18</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19144132</pubid>
               <pubid idtype="doi">10.1186/1471-2105-10-18</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>11</day>
               <month>6</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>13</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>13</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Binder and Schumacher; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Several techniques exist for fitting risk prediction models to high-dimensional data such as those arising from microarrays. However, biological knowledge about relations between genes is only rarely taken into account. One recent approach incorporates pathway information, available, e.g., from the KEGG database, by augmenting the penalty term in Lasso estimation for continuous response models.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>As an alternative, we extend componentwise likelihood-based boosting techniques for incorporating pathway information into a larger number of model classes, such as generalized linear models and the Cox proportional hazards model for time-to-event data. In contrast to Lasso-like approaches, no further assumptions for explicitly specifying the penalty structure are needed, as pathway information is incorporated by adapting the penalties for single microarray features in the course of the boosting steps. This is shown to result in improved prediction performance when the coefficients of connected genes have opposite sign. The properties of the fitted models resulting from this approach are then investigated in two application examples with microarray survival data.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The proposed approach results not only in improved prediction performance but also in structurally different model fits. Incorporating pathway information in the suggested way is therefore seen to be beneficial in several ways.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>When using microarray data for analyzing connections between gene expression and a clinical response, such as survival time, additional knowledge is often available, e.g., on pathway or ontology relations. While several proposals take such knowledge into account for statistical testing, only a few techniques consider such meta-information when building predictive models.</p>
         <p>One prominent source of knowledge on genes is the KEGG database <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Several authors have demonstrated that it can be highly beneficial to incorporate the pathway information found there into approaches for statistical testing <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. While pathways can directly provide information on relations of genes, annotation databases, such as Gene Ontology <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, can also be employed for testing for the association between a clinical response and groups of genes (see <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, for example).</p>
         <p>When building predictive models, Gene Ontology information, or the knowledge that two microarray features belong to the same pathway, can be incorporated by approaches that allow for explicit grouping of features <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B7">7</abbr></abbrgrp>. Alternatively, pathway signatures can be developed. For example, in <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, pathway signatures are determined by experimental techniques, and it is shown that these are related to survival in several independent cancer data sets.</p>
         <p>However, simple grouping of features discards information on specific relations between genes within a pathway. A recent approach <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> not only uses the information that two genes are in the same pathway, but also allows information on specific gene relations to be incorporated. This is implemented by augmenting the log-likelihood criterion, to be maximized for estimating the parameters of a predictive model, by a penalty term that explicitly takes differences between the coefficients of linked genes into account.</p>
         <p>As a basis for the approach in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, the Lasso <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> is used, which provides sparse estimates, i.e., predictive models where only a few microarray features have non-zero influence. Similar to the fused Lasso <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, an additional term is added to the Lasso penalty. While there are techniques for fitting models to various response types when employing the original Lasso penalty <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, often only continuous response techniques are available for approaches which extend the Lasso penalty. Likewise, the approach in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> comes only with an algorithm for estimation with a continuous response. However, binary and time-to-event responses are of primary interest for predictive microarray models.</p>
         <p>Another problem with extensions of the Lasso approach is that several assumptions have to be made when choosing the structure of the penalty term. For example, the criterion employed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> penalizes the squared difference between (standardized) parameter estimates, which might be problematic when the true parameters have opposite sign. This is, e.g., the case when in a pair of connected genes one is up-regulated and the other one is down-regulated for patients with increased risk.</p>
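         <p>The sign problem described above can be illustrated with a minimal sketch. It assumes a simplified, unstandardized penalty that sums squared coefficient differences over linked gene pairs; the helper name and the toy coefficients are hypothetical and are not taken from the cited approach.</p>

```python
def squared_difference_penalty(beta, edges):
    """Sum of squared coefficient differences over linked gene pairs.

    Hypothetical, unstandardized simplification used for illustration only.
    """
    return sum((beta[u] - beta[v]) ** 2 for u, v in edges)


edges = [(0, 1)]              # one pair of connected genes
same_sign = [1.0, 1.0]        # both genes up-regulated with increased risk
opposite_sign = [1.0, -1.0]   # one gene up-regulated, the other down-regulated

print(squared_difference_penalty(same_sign, edges))      # 0.0
print(squared_difference_penalty(opposite_sign, edges))  # 4.0
```

         <p>Under this simplification, two connected genes with equally strong effects incur no penalty when the signs agree, but a large penalty when the signs differ, even though both are equally relevant for prediction.</p>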
         <p>Boosting is an alternative technique for fitting high-dimensional predictive models (see, e.g., <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> for an overview). It uses a stepwise approach that builds up an overall model from many simple fits, refining the overall fit in every boosting step. When only the parameter estimate for one covariate is updated in each boosting step, componentwise boosting is obtained, resulting in sparse fits similar to the Lasso <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. In addition, likelihood-based componentwise boosting allows for adequate consideration of clinical covariates in predictive microarray models <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. The latter approach is available for all response types where estimation can be performed by Newton-Raphson steps for maximization of a likelihood, which are then adapted for penalized estimation in every boosting step.</p>
         <p>For incorporating pathway information into boosting algorithms, one approach is to dedicate each single boosting step to the genes in one specific pathway <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. However, just like grouping Lasso approaches, this does not take into account specific relations between genes.</p>
         <p>As an alternative, we adapt the componentwise likelihood-based boosting approach <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> for specifically incorporating pathway knowledge about gene relations into estimation of predictive models from gene expression data. The proposed <it>PathBoost </it>approach can be used for various response types, including binary and time-to-event responses. As pathway information is incorporated by adapting penalty parameters of connected genes in the course of the boosting steps, the approach also does not require an explicit specification of a penalty structure.</p>
         <p>After outlining the details of the PathBoost algorithm in the following, it will be evaluated in a small simulation study, where it will be compared to the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Its advantages in terms of prediction performance and interpretability are furthermore illustrated in two application examples with microarray survival data.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>The PathBoost algorithm</p>
            </st>
            <p>There are different response types for predictive models built from microarray data, the two most prominent being binary responses, employed, e.g., when classification of tumors is the goal, and time-to-event responses, employed when prediction of survival is the goal. Our proposal for incorporating pathway information is based on likelihood-based boosting <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. It is therefore suitable for all settings where parameter estimation can be performed by maximization of a likelihood via Newton-Raphson steps. For generalized linear models, the response, which might be continuous, binary or a count response, is taken to be from an exponential family. Given observations (<it>y</it><sub><it>i</it></sub>, <it>x</it><sub><it>i</it></sub>), <it>i </it>= 1,..., <it>n</it>, with response <it>y</it><sub><it>i </it></sub>and covariate vector <it>x</it><sub><it>i </it></sub>= (<it>x</it><sub><it>i</it>1</sub>,..., <it>x</it><sub><it>ip</it></sub>)', the structural part of such models is</p>
            <p>
               <display-formula><it>E</it>(<it>y</it><sub><it>i</it></sub>|<it>x</it><sub><it>i</it></sub>) = <it>h</it>(<it>&#951;</it><sub><it>i</it></sub>),</display-formula>
            </p>
            <p>where <it>h </it>is a known response function (the inverse of the link function) and <it>&#951;</it><sub><it>i </it></sub>is the linear predictor</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-18-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>&#951;</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:mi>&#946;</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>n</m:mi>
                                 <m:mi>t</m:mi>
                                 <m:mi>e</m:mi>
                                 <m:mi>r</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>+</m:mo>
                           <m:msub>
                              <m:msup>
                                 <m:mi>x</m:mi>
                                 <m:mo>&#8242;</m:mo>
                              </m:msup>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mi>&#946;</m:mi>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4TdG2aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqaHYoGydaWgaaWcbaGaemyAaKMaemOBa4MaemiDaqNaemyzauMaemOCaihabeaakiabgUcaRiqbdIha4zaafaWaaSbaaSqaaiabdMgaPbqabaGccqaHYoGycqGGSaalaaa@3FA7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>with intercept parameter <it>&#946;</it><sub><it>inter </it></sub>and parameter vector <it>&#946; </it>= (<it>&#946;</it><sub>1</sub>,..., <it>&#946;</it><sub><it>p</it></sub>)', which are estimated by maximization of the log-likelihood <it>l</it>(<it>&#946;</it>) (see, e.g., <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> for more details).</p>
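            <p>As a minimal sketch of the structural part of a generalized linear model, the following NumPy code evaluates the linear predictor and the resulting conditional mean for a binary response. The logistic function is assumed here as one possible choice of <it>h</it>; the function names and the toy values are hypothetical.</p>

```python
import numpy as np


def linear_predictor(X, beta_inter, beta):
    """eta_i = beta_inter + x_i' beta for every row x_i of X."""
    return beta_inter + X @ beta


def logistic_response(eta):
    """One possible h for a binary response: maps eta to a probability."""
    return 1.0 / (1.0 + np.exp(-eta))


X = np.array([[0.5, -1.0], [2.0, 0.0]])  # n = 2 observations, p = 2 covariates
beta = np.array([1.0, 0.5])              # toy parameter vector
eta = linear_predictor(X, beta_inter=0.0, beta=beta)
mu = logistic_response(eta)              # E(y_i | x_i) = h(eta_i)
print(eta, mu)
```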
            <p>In a time-to-event setting, observations (<it>t</it><sub><it>i</it></sub>, <it>&#948;</it><sub><it>i</it></sub>, <it>x</it><sub><it>i</it></sub>), <it>i </it>= 1,..., <it>n</it>, typically consist of an observed time <it>t</it><sub><it>i</it></sub>, a censoring indicator <it>&#948;</it><sub><it>i </it></sub>that takes value 1 if the observed time is the time of the event of interest and value 0 if it is the time of censoring, and a covariate vector <it>x</it><sub><it>i</it></sub>. Due to censoring, direct modeling of <it>t</it><sub><it>i </it></sub>as a continuous response is problematic. Models for the hazard <it>&#955;</it>(<it>t</it>|<it>x</it><sub><it>i</it></sub>), i.e., the instantaneous risk of having an event at time <it>t</it>, given the covariate information, are preferred.</p>
            <p>The Cox proportional hazards model has the form</p>
            <p>
               <display-formula><it>&#955;</it>(<it>t</it>|<it>x</it><sub><it>i</it></sub>) = <it>&#955;</it><sub>0</sub>(<it>t</it>) exp(<it>&#951;</it><sub><it>i</it></sub>),</display-formula>
            </p>
            <p>where <it>&#955;</it><sub>0</sub>(<it>t</it>) is an unspecified baseline hazard and <it>&#951;</it><sub><it>i </it></sub>is a linear predictor of the form</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-18-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>&#951;</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:msup>
                                 <m:mi>x</m:mi>
                                 <m:mo>&#8242;</m:mo>
                              </m:msup>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mi>&#946;</m:mi>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4TdG2aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcuWG4baEgaqbamaaBaaaleaacqWGPbqAaeqaaOGaeqOSdiMaeiilaWcaaa@35FD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>with parameter vector <it>&#946;</it>. Estimation of <it>&#946; </it>is performed by maximizing the partial log-likelihood</p>
            <p>
               <display-formula>
                  <m:math name="1471-2105-10-18-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>l</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>&#946;</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>n</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#948;</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#951;</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:mstyle displaystyle="true">
                                                <m:munderover>
                                                   <m:mo>&#8721;</m:mo>
                                                   <m:mrow>
                                                      <m:mi>k</m:mi>
                                                      <m:mo>=</m:mo>
                                                      <m:mn>1</m:mn>
                                                   </m:mrow>
                                                   <m:mi>n</m:mi>
                                                </m:munderover>
                                                <m:mrow>
                                                   <m:mi>I</m:mi>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:msub>
                                                      <m:mi>t</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>&#8804;</m:mo>
                                                   <m:msub>
                                                      <m:mi>t</m:mi>
                                                      <m:mi>k</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mi>exp</m:mi>
                                                   <m:mo>&#8289;</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:msub>
                                                      <m:mi>&#951;</m:mi>
                                                      <m:mi>k</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:mstyle>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                                 <m:mo>,</m:mo>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiBaWMaeiikaGIaeqOSdiMaeiykaKIaeyypa0ZaaabCaeaacqaH0oazdaWgaaWcbaGaemyAaKgabeaakmaabmaabaGaeq4TdG2aaSbaaSqaaiabdMgaPbqabaGccqGHsislcyGGSbaBcqGGVbWBcqGGNbWzdaqadaqaamaaqahabaGaemysaKKaeiikaGIaemiDaq3aaSbaaSqaaiabdMgaPbqabaGccqGHKjYOcqWG0baDdaWgaaWcbaGaem4AaSgabeaakiabcMcaPiGbcwgaLjabcIha4jabcchaWjabcIcaOiabeE7aOnaaBaaaleaacqWGRbWAaeqaaOGaeiykaKcaleaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGUbGBa0GaeyyeIuoaaOGaayjkaiaawMcaaaGaayjkaiaawMcaaiabcYcaSaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdaaaa@62FC@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>I</it>( ) is an indicator function that takes value 1 if its argument is true and value 0 otherwise. Because the baseline hazard cancels out of the partial likelihood, it does not have to be estimated.</p>
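            <p>The partial log-likelihood can be evaluated directly from the observed times, the censoring indicators and the linear predictor values. The following NumPy sketch is one way to do so; the function and argument names are hypothetical, and tied event times are simply handled by letting all tied observations share the same risk set.</p>

```python
import numpy as np


def cox_partial_loglik(time, status, eta):
    """Partial log-likelihood l(beta), given linear predictor values eta.

    time   : observed times t_i
    status : censoring indicators delta_i (1 = event, 0 = censored)
    eta    : linear predictor values eta_i = x_i' beta
    """
    time = np.asarray(time, dtype=float)
    status = np.asarray(status, dtype=float)
    eta = np.asarray(eta, dtype=float)
    # at_risk[i, k] is 1 when t_k >= t_i, i.e. k is still at risk at time t_i
    at_risk = (time[None, :] >= time[:, None]).astype(float)
    log_risk_sum = np.log(at_risk @ np.exp(eta))
    # censored observations (delta_i = 0) contribute no term of their own
    return float(np.sum(status * (eta - log_risk_sum)))
```

            <p>Note that only the ordering of the observed times enters the computation, and that no quantity related to the baseline hazard appears.</p>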
            <sec>
               <st>
                  <p>Componentwise likelihood-based boosting</p>
               </st>
               <p>The basic idea of boosting is to fit several models to the data in a stepwise manner. In each boosting step, a new model is fitted, which gives larger weight to those observations that were fitted poorly in the previous boosting steps <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. All individual fits are then combined into one overall model. It has been recognized that this procedure is in specific settings equivalent to gradient descent in function space <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, which in turn is equivalent to repeated fitting of residuals for the continuous response case with squared error loss function <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
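               <p>For the continuous response case with squared error loss, repeated fitting of residuals can be sketched as follows, assuming centered covariates. This is an illustration of componentwise boosting by residual fitting only, not the likelihood-based algorithm of this article; the function name and the shrinkage factor <it>nu</it> are assumptions introduced for the example.</p>

```python
import numpy as np


def componentwise_l2_boost(X, y, n_steps=100, nu=0.1):
    """Componentwise boosting for squared error loss (illustrative sketch).

    In every step, each covariate is fitted separately to the current
    residuals; only the best-fitting covariate has its coefficient updated.
    nu is a hypothetical shrinkage factor; columns of X are assumed centered.
    """
    n, p = X.shape
    beta = np.zeros(p)
    intercept = y.mean()
    residual = y - intercept
    col_ss = (X ** 2).sum(axis=0)        # sum of squares of each covariate
    for _ in range(n_steps):
        gamma = X.T @ residual / col_ss  # least-squares fit per covariate
        rss_drop = gamma ** 2 * col_ss   # reduction in residual sum of squares
        j = int(np.argmax(rss_drop))     # covariate that improves the fit most
        beta[j] += nu * gamma[j]
        residual -= nu * gamma[j] * X[:, j]
    return intercept, beta


# Toy data: 50 observations, 200 covariates, two of them informative.
rng = np.random.default_rng(0)
X = rng.standard_normal((50, 200))
X -= X.mean(axis=0)
y = 2.0 * X[:, 0] - 1.5 * X[:, 1] + 0.1 * rng.standard_normal(50)
intercept, beta = componentwise_l2_boost(X, y, n_steps=200)
print(np.count_nonzero(beta), "covariates received a non-zero coefficient")
```

               <p>Because only one coefficient changes per step, covariates that are never selected keep a coefficient of exactly zero, which is the source of the sparsity mentioned above.</p>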
               <p>In <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, the latter idea is extended to generalized linear models by incorporating the previous boosting steps as an offset into the linear predictor <it>&#951;</it><sub><it>i</it></sub>. In <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, a similar approach for boosting estimation of the Cox proportional hazards model is suggested. The basic likelihood-based boosting algorithm is given in the following for both types of models.</p>
               <p>Starting with parameter estimate <inline-formula><m:math name="1471-2105-10-18-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#946;</m:mi><m:mo>^</m:mo></m:mover><m:mn>0</m:mn></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafqOSdiMbaKaadaWgaaWcbaGaeGimaadabeaaaaa@2EA0@</m:annotation></m:semantics></m:math></inline-formula> = (0,...,0), in each boosting step <it>k</it>, <it>k </it>= 1,..., <it>M</it>, and for each covariate <it>x</it><sub><it>ij</it></sub>, <it>j </it>= 1,..., <it>p</it>, candidate models with linear predictor</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-10-18-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>&#951;</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#951;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:mi>&#947;</m:mi>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:msub>
                                 <m:mi>x</m:mi>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:mrow>
                              </m:msub>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4TdG2aaSbaaSqaaiabdMgaPjabdQgaQjabcYcaSiabdUgaRbqabaGccqGH9aqpcuaH3oaAgaqcamaaBaaaleaacqWGPbqAcqGGSaalcqWGRbWAcqGHsislcqaIXaqmaeqaaOGaey4kaSIaeq4SdC2aaSbaaSqaaiabdQgaQjabcYcaSiabdUgaRbqabaGccqWG4baEdaWgaaWcbaGaemyAaKMaemOAaOgabeaaaaa@4623@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>are fitted by estimating parameters <it>&#947;</it><sub><it>j</it>, <it>k</it></sub>. The offset <inline-formula><m:math name="1471-2105-10-18-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#951;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4TdGMbaKaadaWgaaWcbaGaemyAaKMaeiilaWIaem4AaSMaeyOeI0IaeGymaedabeaaaaa@3334@</m:annotation></m:semantics></m:math></inline-formula> incorporates the information from the previous boosting steps, i.e.,</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-10-18-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#951;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:msup>
                                    <m:mi>x</m:mi>
                                    <m:mo>&#8242;</m:mo>
                                 </m:msup>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#946;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msub>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4TdGMbaKaadaWgaaWcbaGaemyAaKMaeiilaWIaem4AaSMaeyOeI0IaeGymaedabeaakiabg2da9iqbdIha4zaafaWaaSbaaSqaaiabdMgaPbqabaGccuaHYoGygaqcamaaBaaaleaacqWGRbWAcqGHsislcqaIXaqmaeqaaaaa@3CC1@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>for the Cox model and</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-10-18-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#951;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#946;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mi>n</m:mi>
                                    <m:mi>t</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mi>r</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>+</m:mo>
                              <m:msub>
                                 <m:msup>
                                    <m:mi>x</m:mi>
                                    <m:mo>&#8242;</m:mo>
                                 </m:msup>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#946;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                              </m:msub>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4TdGMbaKaadaWgaaWcbaGaemyAaKMaeiilaWIaem4AaSMaeyOeI0IaeGymaedabeaakiabg2da9iqbek7aIzaajaWaaSbaaSqaaiabdMgaPjabd6gaUjabdsha0jabdwgaLjabdkhaYbqabaGccqGHRaWkcuWG4baEgaqbamaaBaaaleaacqWGPbqAaeqaaOGafqOSdiMbaKaadaWgaaWcbaGaem4AaSMaeyOeI0IaeGymaedabeaaaaa@467B@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>for generalized linear models. The intercept parameter <inline-formula><m:math name="1471-2105-10-18-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#946;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>n</m:mi><m:mi>t</m:mi><m:mi>e</m:mi><m:mi>r</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafqOSdiMbaKaadaWgaaWcbaGaemyAaKMaemOBa4MaemiDaqNaemyzauMaemOCaihabeaaaaa@34A3@</m:annotation></m:semantics></m:math></inline-formula> is updated before each boosting step by fitting an intercept-only model.</p>
               <p>For estimation of the <it>&#947;</it><sub><it>j</it>, <it>k</it></sub><it>s</it>, a penalized log-likelihood criterion</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-10-18-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>l</m:mi>
                                 <m:mrow>
                                    <m:mi>p</m:mi>
                                    <m:mi>e</m:mi>
                                    <m:mi>n</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>&#947;</m:mi>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mi>l</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>&#947;</m:mi>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>&#8722;</m:mo>
                              <m:mfrac>
                                 <m:mn>1</m:mn>
                                 <m:mn>2</m:mn>
                              </m:mfrac>
                              <m:msub>
                                 <m:mi>&#955;</m:mi>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:msubsup>
                                 <m:mi>&#947;</m:mi>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                                 <m:mn>2</m:mn>
                              </m:msubsup>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiBaW2aaSbaaSqaaiabdchaWjabdwgaLjabd6gaUbqabaGccqGGOaakcqaHZoWzdaWgaaWcbaGaemOAaOMaeiilaWIaem4AaSgabeaakiabcMcaPiabg2da9iabdYgaSjabcIcaOiabeo7aNnaaBaaaleaacqWGQbGAcqGGSaalcqWGRbWAaeqaaOGaeiykaKIaey4kaSscfa4aaSaaaeaacqaIXaqmaeaacqaIYaGmaaGccqaH7oaBdaWgaaWcbaGaemOAaOMaeiilaWIaem4AaSgabeaakiabeo7aNnaaDaaaleaacqWGQbGAcqGGSaalcqWGRbWAaeaacqaIYaGmaaaaaa@51EC@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>is employed, where <it>&#955;</it><sub><it>j</it>, <it>k </it></sub>is a penalty parameter that determines the size of the boosting steps. Typically, the same value <it>&#955;</it><sub><it>j</it>, <it>k </it></sub>= <it>&#955; </it>is used for all covariates and all boosting steps. The number of boosting steps <it>M</it>, which can be determined, e.g., by cross-validation, is the more important tuning parameter. The penalty parameter <it>&#955; </it>is therefore chosen only coarsely, such that the resulting number of boosting steps is not too small (say, larger than 50).</p>
               <p>Using the score function <it>U</it>(<it>&#947;</it>) = &#8706;<it>l</it>(<it>&#947;</it>)/&#8706;<it>&#947; </it>and the information matrix <it>I</it>(<it>&#947;</it>) = &#8722;&#8706;<sup>2</sup><it>l</it>(<it>&#947;</it>)/&#8706;<it>&#947;</it><sup>2</sup>, both evaluated at <it>&#947; </it>= 0 to give the scalar values <it>U</it><sub><it>j</it>, <it>k </it></sub>= <it>U</it>(0) and <it>I</it><sub><it>j</it>, <it>k </it></sub>= <it>I</it>(0), we employ Newton-Raphson steps, resulting in estimates</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-10-18-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#947;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>U</m:mi>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>I</m:mi>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>+</m:mo>
                                    <m:msub>
                                       <m:mi>&#955;</m:mi>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaWgaaWcbaGaemOAaOMaeiilaWIaem4AaSgabeaakiabg2da9KqbaoaalaaabaGaemyvau1aaSbaaeaacqWGQbGAcqGGSaalcqWGRbWAaeqaaaqaaiabdMeajnaaBaaabaGaemOAaOMaeiilaWIaem4AaSgabeaacqGHRaWkcqaH7oaBdaWgaaqaaiabdQgaQjabcYcaSiabdUgaRbqabaaaaOGaeiOla4caaa@4459@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>This is based on only one Newton-Raphson step, as further refinements can potentially be performed in later boosting steps.</p>
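               <p>The form of this estimate can be checked with a short derivation. The sketch below is an illustration rather than a formula taken from the text: it assumes that the criterion is maximized with the penalty entering as a subtracted term, and it replaces the log-likelihood by its second-order Taylor expansion around <it>&#947; </it>= 0.</p>
               <p>
```latex
l_{pen}(\gamma) \approx l(0) + U_{j,k}\,\gamma
  - \tfrac{1}{2}\, I_{j,k}\,\gamma^{2}
  - \tfrac{1}{2}\,\lambda_{j,k}\,\gamma^{2},
\qquad
\frac{\partial l_{pen}(\gamma)}{\partial \gamma}
  = U_{j,k} - (I_{j,k} + \lambda_{j,k})\,\gamma = 0
\;\Longrightarrow\;
\hat{\gamma}_{j,k} = \frac{U_{j,k}}{I_{j,k} + \lambda_{j,k}}.
```
               </p>
               <p>Under this reading, a larger penalty shrinks the step towards zero, and setting the penalty to zero recovers the ordinary Newton-Raphson step <it>U</it><sub><it>j</it>, <it>k</it></sub>/<it>I</it><sub><it>j</it>, <it>k</it></sub>.</p>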
               <p>The estimate <inline-formula><m:math name="1471-2105-10-18-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#947;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:msup><m:mi>j</m:mi><m:mo>&#8727;</m:mo></m:msup><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaWgaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAaeqaaaaa@327C@</m:annotation></m:semantics></m:math></inline-formula> for the covariate with index <it>j</it>* which improves the fit the most (in terms of log-likelihood for generalized linear models or according to the penalized score statistic <inline-formula><m:math name="1471-2105-10-18-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>U</m:mi><m:mrow><m:mi>j</m:mi><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyvau1aa0baaSqaaiabdQgaQjabcYcaSiabdUgaRbqaaiabikdaYaaaaaa@31C3@</m:annotation></m:semantics></m:math></inline-formula>/(<it>I</it><sub><it>j</it>, <it>k </it></sub>+ <it>&#955;</it><sub><it>j</it>, <it>k</it></sub>) for the Cox model) is then used to update the elements of the overall parameter vector via</p>
               <p>
                  <display-formula>
                     <m:math name="1471-2105-10-18-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>&#946;</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>j</m:mi>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mrow>
                                 <m:mo>{</m:mo>
                                 <m:mrow>
                                    <m:mtable columnalign="left">
                                       <m:mtr columnalign="left">
                                          <m:mtd columnalign="left">
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>&#946;</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>k</m:mi>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:mn>1</m:mn>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>j</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo>+</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>&#947;</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:msup>
                                                         <m:mi>j</m:mi>
                                                         <m:mo>&#8727;</m:mo>
                                                      </m:msup>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>k</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mtd>
                                          <m:mtd columnalign="left">
                                             <m:mrow>
                                                <m:mtext>for&#160;</m:mtext>
                                                <m:mi>j</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:msup>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>&#8727;</m:mo>
                                                </m:msup>
                                             </m:mrow>
                                          </m:mtd>
                                       </m:mtr>
                                       <m:mtr columnalign="left">
                                          <m:mtd columnalign="left">
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>&#946;</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>k</m:mi>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:mn>1</m:mn>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>j</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mtd>
                                          <m:mtd columnalign="left">
                                             <m:mrow>
                                                <m:mtext>otherwise</m:mtext>
                                             </m:mrow>
                                          </m:mtd>
                                       </m:mtr>
                                    </m:mtable>
                                    <m:mo>.</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafqOSdiMbaKaadaWgaaWcbaGaem4AaSMaeiilaWIaemOAaOgabeaakiabg2da9maaceaabaqbaeaabiGaaaqaaiqbek7aIzaajaWaaSbaaSqaaiabdUgaRjabgkHiTiabigdaXiabcYcaSiabdQgaQbqabaGccqGHRaWkcuaHZoWzgaqcamaaBaaaleaacqWGQbGAdaahaaadbeqaaiabgEHiQaaaliabcYcaSiabdUgaRbqabaaakeaacqqGMbGzcqqGVbWBcqqGYbGCcqqGGaaicqWGQbGAcqGH9aqpcqWGQbGAdaahaaWcbeqaaiabgEHiQaaaaOqaaiqbek7aIzaajaWaaSbaaSqaaiabdUgaRjabgkHiTiabigdaXiabcYcaSiabdQgaQbqabaaakeaacqqGVbWBcqqG0baDcqqGObaAcqqGLbqzcqqGYbGCcqqG3bWDcqqGPbqAcqqGZbWCcqqGLbqzaaGaeiOla4cacaGL7baaaaa@614F@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>This componentwise boosting approach results in sparse fits, i.e., fits in which many elements of the estimated parameter vector are equal to zero.</p>
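               <p>The boosting steps described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a binary response with a logistic link (one instance of a generalized linear model), a common penalty for all covariates, and selection of the covariate by log-likelihood. The function names <it>componentwise_boost</it>, <it>fit_intercept</it> and <it>log_lik</it>, the arguments <it>n_steps</it> and <it>lam</it>, and the simulated data are hypothetical and serve only this illustration.</p>
               <p>
```python
import numpy as np


def log_lik(y, eta):
    """Bernoulli log-likelihood for a vector of linear predictors eta."""
    return float(np.sum(y * eta - np.logaddexp(0.0, eta)))


def fit_intercept(y, offset, b0=0.0, n_newton=25):
    """Intercept-only logistic fit, with the current predictor as offset."""
    for _ in range(n_newton):
        mu = 1.0 / (1.0 + np.exp(-(b0 + offset)))
        b0 += np.sum(y - mu) / np.sum(mu * (1.0 - mu))
    return b0


def componentwise_boost(X, y, n_steps=50, lam=100.0):
    """Componentwise likelihood-based boosting (illustrative sketch only)."""
    n_cov = X.shape[1]
    beta = np.zeros(n_cov)
    b0 = 0.0
    for _ in range(n_steps):
        # Update the intercept before the boosting step.
        offset = X @ beta
        b0 = fit_intercept(y, offset, b0)
        eta = b0 + offset
        mu = 1.0 / (1.0 + np.exp(-eta))
        # Score and information of each candidate, evaluated at gamma = 0.
        U = X.T @ (y - mu)
        I = (X ** 2).T @ (mu * (1.0 - mu))
        # One penalized Newton-Raphson step per candidate covariate.
        gamma = U / (I + lam)
        # Select the candidate whose update gives the largest log-likelihood.
        cand = eta[:, None] + X * gamma[None, :]
        ll = np.sum(y[:, None] * cand - np.logaddexp(0.0, cand), axis=0)
        j_star = int(np.argmax(ll))
        # Only the selected element of the parameter vector changes.
        beta[j_star] += gamma[j_star]
    b0 = fit_intercept(y, X @ beta, b0)
    return b0, beta


# Simulated example: only the first of 30 covariates carries signal.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 30))
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-2.0 * X[:, 0]))).astype(float)
b0, beta = componentwise_boost(X, y, n_steps=20, lam=100.0)
print(np.flatnonzero(beta))  # indices of covariates with non-zero estimates
```
               </p>
               <p>Because each step changes a single coefficient, the number of non-zero estimates can never exceed the number of boosting steps, which is the source of the sparseness noted above.</p>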
               <p>One of the advantages of likelihood-based boosting is that it is very easy to incorporate mandatory, unpenalized covariates (see <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, for example). This is useful when clinical covariates have to be incorporated in addition to microarray features, in order to compare the resulting model fit to a purely clinical model. The clinical covariates are then added to the linear predictor <it>&#951;</it><sub><it>i</it></sub>, and their coefficients are updated in or after every boosting step, but they do not enter into the penalty term.</p>
            </sec>
            <sec>
               <st>
                  <p>Incorporating pathway information</p>
               </st>
               <p>The sparseness of the fits resulting from approaches such as the Lasso or componentwise boosting is a desirable property in settings with many microarray features, as it potentially results in a short list of genes that are deemed influential. It can, however, also have a negative effect on interpretability. For example, if the level of activity of (parts of) a specific pathway is related to the response, the microarray features associated with that pathway will be highly correlated and have similar predictive power. Sparse fitting techniques, however, will probably pick out only one of these features, which makes it difficult to identify the underlying pathway. Model fits might also be less stable when they rely on only one measurement instead of several features.</p>
               <p>To discourage the selection of only a single microarray feature associated with a pathway, we suggest increasing the penalty <it>&#955;</it><sub><it>j</it>*, <it>l</it></sub>, <it>l </it>> <it>k</it>, used for a specific covariate <it>x</it><sub><it>ij</it>*</sub>, after it has been selected in boosting step <it>k</it>. This decreases the size of the boosting steps for this covariate and makes it less likely that this covariate will be selected in future boosting steps. In turn, the penalties for the microarray features that belong to genes that are directly connected in the respective pathway are decreased, making it more likely that they will be selected in future steps.</p>
               <p>This approach requires specification of two rules, one for increasing the penalty of a selected covariate and one for decreasing the penalties for connected covariates. In the following, we provide such rules for penalty updates, which, in combination with componentwise likelihood-based boosting, constitute the <it>PathBoost </it>algorithm.</p>
               <sec>
                  <st>
                     <p>Increasing the penalty for a selected covariate</p>
                  </st>
                  <p>In order to provide a rule for penalty updates, a common metric for all covariates is needed. Therefore, we quantify the size of the boosting step <it>k</it>, performed for a covariate with index <it>j</it>* that has been selected in this step, by considering the estimate <inline-formula><m:math name="1471-2105-10-18-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#947;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:msup><m:mi>j</m:mi><m:mo>&#8727;</m:mo></m:msup><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaWgaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAaeqaaaaa@327C@</m:annotation></m:semantics></m:math></inline-formula> relative to the estimate</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msubsup>
                                    <m:mover accent="true">
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>u</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>p</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>n</m:mi>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>=</m:mo>
                                 <m:msub>
                                    <m:mi>U</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>/</m:mo>
                                 <m:msub>
                                    <m:mi>I</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaqhaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAaeaacqWG1bqDcqWGUbGBcqWGWbaCcqWGLbqzcqWGUbGBaaGccqGH9aqpcqWGvbqvdaWgaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAaeqaaOGaei4la8IaemysaK0aaSbaaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaem4AaSgabeaaaaa@47F2@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>obtained from unpenalized estimation, i.e., for <it>&#955;</it><sub><it>j</it>*, <it>k </it></sub>= 0. The step-size factor <it>&#957;</it><sub><it>j</it>*, <it>k </it></sub>is then given by</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#947;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>j</m:mi>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msup>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#947;</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>j</m:mi>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msup>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>u</m:mi>
                                             <m:mi>n</m:mi>
                                             <m:mi>p</m:mi>
                                             <m:mi>e</m:mi>
                                             <m:mi>n</m:mi>
                                          </m:mrow>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mfrac>
                                 <m:mo>=</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>I</m:mi>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>j</m:mi>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msup>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>I</m:mi>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>j</m:mi>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msup>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>&#955;</m:mi>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>j</m:mi>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msup>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mfrac>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyVd42aaSbaaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaem4AaSgabeaakiabg2da9KqbaoaalaaabaGafq4SdCMbaKaadaWgaaqaaiabdQgaQnaaCaaabeqaaiabgEHiQaaacqGGSaalcqWGRbWAaeqaaaqaaiqbeo7aNzaajaWaa0baaeaacqWGQbGAdaahaaqabeaacqGHxiIkaaGaeiilaWIaem4AaSgabaGaemyDauNaemOBa4MaemiCaaNaemyzauMaemOBa4gaaaaakiabg2da9KqbaoaalaaabaGaemysaK0aaSbaaeaacqWGQbGAdaahaaqabeaacqGHxiIkaaGaeiilaWIaem4AaSgabeaaaeaacqWGjbqsdaWgaaqaaiabdQgaQnaaCaaabeqaaiabgEHiQaaacqGGSaalcqWGRbWAaeqaaiabgUcaRiabeU7aSnaaBaaabaGaemOAaO2aaWbaaeqabaGaey4fIOcaaiabcYcaSiabdUgaRbqabaaaaOGaeiOla4caaa@5E4F@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>To incorporate pathway information, we suggest decreasing the step-size factor of a selected covariate by a constant step-size modification factor 0 &lt; <it>c</it><sub><it>smf </it></sub>&#8804; 1. That is, after the covariate with index <it>j</it>* has been selected in boosting step <it>k</it>, the new step-size factor for subsequent boosting steps <it>l </it>&gt; <it>k </it>becomes</p>
                  <p>
                     <display-formula><it>&#957;</it><sub><it>j</it>*, <it>l </it></sub>= <it>c</it><sub><it>smf </it></sub>&#183; <it>&#957;</it><sub><it>j</it>*, <it>k</it></sub>,</display-formula>
                  </p>
                  <p>implying a penalty increase via</p>
                  <p>
                     <display-formula id="M1">
                        <m:math name="1471-2105-10-18-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#955;</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mrow>
                                                   <m:mi>s</m:mi>
                                                   <m:mi>m</m:mi>
                                                   <m:mi>f</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                                 <m:msub>
                                    <m:mi>I</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>+</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#955;</m:mi>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>j</m:mi>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msup>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mrow>
                                             <m:mi>s</m:mi>
                                             <m:mi>m</m:mi>
                                             <m:mi>f</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mfrac>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4UdW2aaSbaaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaemiBaWgabeaakiabg2da9maabmaajuaGbaWaaSaaaeaacqaIXaqmaeaacqWGJbWydaWgaaqaaiabdohaZjabd2gaTjabdAgaMbqabaaaaiabgkHiTiabigdaXaGccaGLOaGaayzkaaGaemysaK0aaSbaaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaemiBaWgabeaakiabgUcaRKqbaoaalaaabaGaeq4UdW2aaSbaaeaacqWGQbGAdaahaaqabeaacqGHxiIkaaGaeiilaWIaem4AaSgabeaaaeaacqWGJbWydaWgaaqaaiabdohaZjabd2gaTjabdAgaMbqabaaaaOGaeiOla4caaa@530C@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>For computational simplicity, we use the fixed value <it>I</it><sub><it>j</it>*, <it>k</it>+1 </sub>in place of the step-dependent term <it>I</it><sub><it>j</it>*, <it>l </it></sub>in this penalty update rule. The new penalty for a covariate can therefore be calculated immediately after the covariate has been selected in a boosting step, and it stays the same until the covariate, or a covariate that is connected to it, is selected again.</p>
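                  <p>The penalty update rule can be checked numerically. The following is a minimal sketch, not the authors' implementation: the function names and the numerical values are hypothetical, and the term <it>I</it> is held fixed between the two steps, as assumed in the text. It illustrates that the increased penalty reproduces the step-size factor <it>c</it><sub><it>smf</it></sub> &#183; <it>&#957;</it><sub><it>j</it>*, <it>k</it></sub>.</p>

```python
def step_size_factor(info, penalty):
    """nu = I / (I + lambda): fraction of the unpenalized update that is kept."""
    return info / (info + penalty)


def increased_penalty(info, penalty, c_smf):
    """Penalty after selection: (1/c_smf - 1) * I + lambda / c_smf."""
    if not (1 >= c_smf > 0):
        raise ValueError("c_smf must lie in (0, 1]")
    return (1.0 / c_smf - 1.0) * info + penalty / c_smf


# Hypothetical values, chosen only for illustration.
info = 4.0       # term I from the step-size formula, held fixed here
penalty = 36.0   # current penalty lambda, so that nu = 0.1
c_smf = 0.5      # step-size modification factor

nu_old = step_size_factor(info, penalty)
new_penalty = increased_penalty(info, penalty, c_smf)
nu_new = step_size_factor(info, new_penalty)

# nu_new should equal c_smf * nu_old under the fixed-I assumption.
print(nu_old, new_penalty, nu_new)
```

                  <p>With <it>c</it><sub><it>smf</it></sub> = 1 the function returns the penalty unchanged, which corresponds to boosting without a penalty update.</p>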
               </sec>
               <sec>
                  <st>
                     <p>Decreasing the penalty for connected covariates</p>
                  </st>
                  <p>If the penalty for a covariate <it>x</it><sub><it>ij</it>* </sub>is increased, and the covariate is then selected again in a later boosting step, the variability explained by this covariate, and by the pathways it belongs to, will be decreased. To maintain the amount of variability explained by a pathway, the loss in explained variability for covariate <it>x</it><sub><it>ij</it>* </sub>is distributed to related covariates, e.g., to covariates that are connected to covariate <it>x</it><sub><it>ij</it>* </sub>in the pathway. The amount of potentially lost explained variability that is to be distributed after a boosting step therefore has to be quantified. A proposal for this is given in the following.</p>
                  <p>If <it>k </it>is the first boosting step where covariate <it>x</it><sub><it>ij</it>* </sub>is selected, then the unpenalized estimate <inline-formula><m:math name="1471-2105-10-18-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#947;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:msup><m:mi>j</m:mi><m:mo>&#8727;</m:mo></m:msup><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow><m:mrow><m:mi>u</m:mi><m:mi>n</m:mi><m:mi>p</m:mi><m:mi>e</m:mi><m:mi>n</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaqhaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAaeaacqWG1bqDcqWGUbGBcqWGWbaCcqWGLbqzcqWGUbGBaaaaaa@3976@</m:annotation></m:semantics></m:math></inline-formula> (obtained with <it>&#955;</it><sub><it>j</it>*, <it>k </it></sub>= 0) will be approximately equal to the (unpenalized) maximum likelihood estimate <inline-formula><m:math name="1471-2105-10-18-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#947;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:msup><m:mi>j</m:mi><m:mo>&#8727;</m:mo></m:msup></m:mrow><m:mrow><m:mi>m</m:mi><m:mi>l</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaqhaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaaaleaacqWGTbqBcqWGSbaBaaaaaa@3302@</m:annotation></m:semantics></m:math></inline-formula> obtained from standard non-boosting estimation. As the relative step size, not realized due to penalized estimation, in boosting step <it>k </it>is given by 1 - <it>&#957;</it><sub><it>j</it>*, <it>k</it></sub>, for boosting step <it>k </it>+ 1, the unpenalized estimate <inline-formula><m:math name="1471-2105-10-18-i20" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#947;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:msup><m:mi>j</m:mi><m:mo>&#8727;</m:mo></m:msup><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>+</m:mo><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>u</m:mi><m:mi>n</m:mi><m:mi>p</m:mi><m:mi>e</m:mi><m:mi>n</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaqhaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAcqGHRaWkcqaIXaqmaeaacqWG1bqDcqWGUbGBcqWGWbaCcqWGLbqzcqWGUbGBaaaaaa@3B48@</m:annotation></m:semantics></m:math></inline-formula> will be approximately equal to</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i21" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msubsup>
                                    <m:mover accent="true">
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>u</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>p</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>n</m:mi>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>&#8776;</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:msubsup>
                                    <m:mover accent="true">
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>m</m:mi>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaqhaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAcqGHRaWkcqaIXaqmaeaacqWG1bqDcqWGUbGBcqWGWbaCcqWGLbqzcqWGUbGBaaGccqGHijYUcqGGOaakcqaIXaqmcqGHsislcqaH9oGBdaWgaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAaeqaaOGaeiykaKIafq4SdCMbaKaadaqhaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaaaleaacqWGTbqBcqWGSbaBaaGccqGGUaGlaaa@4FAD@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>Thus, the penalized estimate <inline-formula><m:math name="1471-2105-10-18-i22" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#947;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:msup><m:mi>j</m:mi><m:mo>&#8727;</m:mo></m:msup><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>+</m:mo><m:mn>1</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaWgaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAcqGHRaWkcqaIXaqmaeqaaaaa@344E@</m:annotation></m:semantics></m:math></inline-formula> will be</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i23" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8901;</m:mo>
                                 <m:msubsup>
                                    <m:mover accent="true">
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>u</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>p</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>n</m:mi>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>&#8776;</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:msubsup>
                                    <m:mover accent="true">
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mi>j</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>m</m:mi>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafq4SdCMbaKaadaWgaaWcbaGaemOAaO2aaWbaaWqabeaacqGHxiIkaaWccqGGSaalcqWGRbWAcqGHRaWkcqaIXaqmaeqaaOGaeyypa0JaeqyVd42aaSbaaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaem4AaSMaey4kaSIaeGymaedabeaakiabgwSixlqbeo7aNzaajaWaa0baaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaem4AaSMaey4kaSIaeGymaedabaGaemyDauNaemOBa4MaemiCaaNaemyzauMaemOBa4gaaOGaeyisISRaeqyVd42aaSbaaSqaaiabdQgaQnaaCaaameqabaGaey4fIOcaaSGaeiilaWIaem4AaSMaey4kaSIaeGymaedabeaakiabcIcaOiabigdaXiabgkHiTiabe27aUnaaBaaaleaacqWGQbGAdaahaaadbeqaaiabgEHiQaaaliabcYcaSiabdUgaRbqabaGccqGGPaqkcuaHZoWzgaqcamaaDaaaleaacqWGQbGAdaahaaadbeqaaiabgEHiQaaaaSqaaiabd2gaTjabdYgaSbaakiabc6caUaaa@6C88@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
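                  <p>The two approximations above can be chained: at each selection of the covariate, the fraction <it>&#957;</it> of the not yet realized part of the maximum likelihood estimate is added. The following minimal sketch assumes that this pattern carries over to all later selections. The function name and the step-size factors are hypothetical and serve only as an illustration.</p>

```python
def realized_fractions(step_size_factors):
    """Cumulative fraction of the ML estimate realized after each selection.

    Each selection is assumed to add the fraction nu of whatever part of the
    maximum likelihood estimate has not been realized so far.
    """
    fractions = []
    realized = 0.0
    for nu in step_size_factors:
        realized = realized + (1.0 - realized) * nu
        fractions.append(realized)
    return fractions


# Hypothetical step-size factors: nu = 0.1 at the first selection,
# halved after each selection (corresponding to c_smf = 0.5).
nus = [0.1, 0.05, 0.025]
fractions = realized_fractions(nus)
print(fractions)
```

                  <p>After two selections the sketch yields <it>&#957;</it><sub>1</sub> + (1 &#8722; <it>&#957;</it><sub>1</sub>)<it>&#957;</it><sub>2</sub>, i.e., the first realized step plus the penalized estimate of the second step, both expressed relative to the maximum likelihood estimate.</p>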
                  <p>The approximate fraction <it>&#960;</it><sub><it>j</it>,(<it>m</it>) </sub>of the maximum likelihood estimate that has been realized for covariate <it>x</it><sub><it>ij </it></sub>after the <it>m</it><sub><it>th </it></sub>boosting step in which this covariate has been selected is then</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i24" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mo>=</m:mo>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#957;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>2</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mo>=</m:mo>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#957;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#957;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:msub>
                                                <m:mi>&#957;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>2</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow/>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mo>&#8943;</m:mo>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow/>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>m</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mo>=</m:mo>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>m</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>m</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:msub>
                                                <m:mi>&#957;</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>m</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>.</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabqWaaaaabaGaeqiWda3aaSbaaSqaaiabdQgaQjabcYcaSiabcIcaOiabigdaXiabcMcaPaqabaaakeaacqGH9aqpaeaacqaH9oGBdaWgaaWcbaGaemOAaOMaeiilaWIaeiikaGIaeGymaeJaeiykaKcabeaaaOqaaiabec8aWnaaBaaaleaacqWGQbGAcqGGSaalcqGGOaakcqaIYaGmcqGGPaqkaeqaaaGcbaGaeyypa0dabaGaeqyVd42aaSbaaSqaaiabdQgaQjabcYcaSiabcIcaOiabigdaXiabcMcaPaqabaGccqGHRaWkcqGGOaakcqaIXaqmcqGHsislcqaH9oGBdaWgaaWcbaGaemOAaOMaeiilaWIaeiikaGIaeGymaeJaeiykaKcabeaakiabcMcaPiabe27aUnaaBaaaleaacqWGQbGAcqGGSaalcqGGOaakcqaIYaGmcqGGPaqkaeqaaaGcbaaabaGaeS47IWeabaaabaGaeqiWda3aaSbaaSqaaiabdQgaQjabcYcaSiabcIcaOiabd2gaTjabcMcaPaqabaaakeaacqGH9aqpaeaacqaHapaCdaWgaaWcbaGaemOAaOMaeiilaWIaeiikaGIaemyBa0MaeyOeI0IaeGymaeJaeiykaKcabeaakiabgUcaRiabcIcaOiabigdaXiabgkHiTiabec8aWnaaBaaaleaacqWGQbGAcqGGSaalcqGGOaakcqWGTbqBcqGHsislcqaIXaqmcqGGPaqkaeqaaOGaeiykaKIaeqyVd42aaSbaaSqaaiabdQgaQjabcYcaSiabcIcaOiabd2gaTjabcMcaPaqabaGccqGGUaGlaaaaaa@84A8@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
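                  <p>The recursion above can be checked numerically. The following is a minimal Python sketch, assuming the step-size factors of the successive updates of a single covariate are supplied as a list; the function name cumulative_fraction is illustrative only and is not part of any published implementation.</p>
                  <p>
```python
def cumulative_fraction(step_sizes):
    """Fraction of the maximum likelihood estimate reached after all updates.

    Applies pi_m = pi_(m-1) + (1 - pi_(m-1)) * nu_m, starting from pi_0 = 0,
    so that pi_1 equals nu_1 as in the first line of the recursion.
    """
    pi = 0.0
    for nu in step_sizes:
        pi = pi + (1.0 - pi) * nu
    return pi


# Two updates with step-size factor 0.5 each: 0.5 after the first update,
# then 0.5 + (1 - 0.5) * 0.5 = 0.75 after the second.
fraction_after_two_updates = cumulative_fraction([0.5, 0.5])
```
                  </p>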
                  <p>Now let <it>j</it><sub>1 </sub>be the index of a covariate that has been selected in boosting step <it>k</it>, and let <it>j</it><sub>2 </sub>be the index of the covariate to which a potential loss in explained variability is to be transferred. A potential loss is incurred for <inline-formula><m:math name="1471-2105-10-18-i25" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:msub><m:mi>j</m:mi><m:mn>1</m:mn></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiEaG3aaSbaaSqaaiabdMgaPjabdQgaQnaaBaaameaacqaIXaqmaeqaaaWcbeaaaaa@315A@</m:annotation></m:semantics></m:math></inline-formula> in a future boosting step <it>l </it>by employing a penalty that is updated via (1), with corresponding step-size factor <inline-formula><m:math name="1471-2105-10-18-i26" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>&#957;</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>1</m:mn></m:msub><m:mo>,</m:mo><m:mi>l</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaemiBaWgabeaaaaa@327F@</m:annotation></m:semantics></m:math></inline-formula>, instead of not modifying the penalty, i.e., keeping the step-size factor <inline-formula><m:math name="1471-2105-10-18-i27" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>&#957;</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>1</m:mn></m:msub><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaem4AaSgabeaaaaa@327D@</m:annotation></m:semantics></m:math></inline-formula>. In terms of the fraction of the maximum likelihood estimate this loss is given by</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i28" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#960;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeiikaGIaeGymaeJaeyOeI0IaeqiWda3aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaem4AaSgabeaakiabcMcaPiabcIcaOiabe27aUnaaBaaaleaacqWGQbGAdaWgaaadbaGaeGymaedabeaaliabcYcaSiabdUgaRbqabaGccqGHsislcqaH9oGBdaWgaaWcbaGaemOAaO2aaSbaaWqaaiabigdaXaqabaWccqGGSaalcqWGSbaBaeqaaOGaeiykaKIaeiOla4caaa@4752@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>The aim is now to choose the penalty <inline-formula><m:math name="1471-2105-10-18-i29" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>&#955;</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>2</m:mn></m:msub><m:mo>,</m:mo><m:mi>l</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4UdW2aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaWgabeaaaaa@327D@</m:annotation></m:semantics></m:math></inline-formula>, or correspondingly the step-size factor <inline-formula><m:math name="1471-2105-10-18-i30" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>&#957;</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>2</m:mn></m:msub><m:mo>,</m:mo><m:mi>l</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaWgabeaaaaa@3281@</m:annotation></m:semantics></m:math></inline-formula>, for the covariate with index <it>j</it><sub>2 </sub>for a future boosting step <it>l </it>(compared to step <it>k</it>), such that the loss for covariate <inline-formula><m:math name="1471-2105-10-18-i25" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:msub><m:mi>j</m:mi><m:mn>1</m:mn></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiEaG3aaSbaaSqaaiabdMgaPjabdQgaQnaaBaaameaacqaIXaqmaeqaaaWcbeaaaaa@315A@</m:annotation></m:semantics></m:math></inline-formula> is compensated by covariate <inline-formula><m:math name="1471-2105-10-18-i31" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:msub><m:mi>j</m:mi><m:mn>2</m:mn></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiEaG3aaSbaaSqaaiabdMgaPjabdQgaQnaaBaaameaacqaIYaGmaeqaaaWcbeaaaaa@315C@</m:annotation></m:semantics></m:math></inline-formula>. Equating</p>
                  <p>
                     <display-formula id="M2">
                        <m:math name="1471-2105-10-18-i32" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#960;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#960;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#960;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeiikaGIaeGymaeJaeyOeI0IaeqiWda3aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaem4AaSgabeaakiabcMcaPiabe27aUnaaBaaaleaacqWGQbGAdaWgaaadbaGaeGOmaidabeaaliabcYcaSiabdYgaSbqabaGccqGHsislcqGGOaakcqaIXaqmcqGHsislcqaHapaCdaWgaaWcbaGaemOAaO2aaSbaaWqaaiabikdaYaqabaWccqGGSaalcqWGRbWAaeqaaOGaeiykaKIaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaem4AaSgabeaakiabg2da9iabcIcaOiabigdaXiabgkHiTiabec8aWnaaBaaaleaacqWGQbGAdaWgaaadbaGaeGymaedabeaaliabcYcaSiabdUgaRbqabaGccqGGPaqkcqGGOaakcqaH9oGBdaWgaaWcbaGaemOAaO2aaSbaaWqaaiabigdaXaqabaWccqGGSaalcqWGRbWAaeqaaOGaeyOeI0IaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaemiBaWgabeaakiabcMcaPaaa@6A5B@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>and solving for the step-size factor of covariate <it>j</it><sub>2 </sub>in step <it>l </it>results in the update</p>
                  <p>
                     <display-formula>
                        <m:math name="1471-2105-10-18-i33" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>+</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>&#960;</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>j</m:mi>
                                                <m:mn>1</m:mn>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>&#960;</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>j</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mfrac>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#957;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>1</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaWgabeaakiabg2da9iabe27aUnaaBaaaleaacqWGQbGAdaWgaaadbaGaeGOmaidabeaaliabcYcaSiabdUgaRbqabaGccqGHRaWkjuaGdaWcaaqaaiabigdaXiabgkHiTiabec8aWnaaBaaabaGaemOAaO2aaSbaaeaacqaIXaqmaeqaaiabcYcaSiabdUgaRbqabaaabaGaeGymaeJaeyOeI0IaeqiWda3aaSbaaeaacqWGQbGAdaWgaaqaaiabikdaYaqabaGaeiilaWIaem4AaSgabeaaaaGccqGGOaakcqaH9oGBdaWgaaWcbaGaemOAaO2aaSbaaWqaaiabigdaXaqabaWccqGGSaalcqWGRbWAaeqaaOGaeyOeI0IaeqyVd42aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaemiBaWgabeaakiabcMcaPiabc6caUaaa@5DD8@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
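                  <p>As a worked illustration of this update, the following minimal Python sketch computes the new step-size factor for covariate <it>j</it><sub>2 </sub>from the loss incurred for covariate <it>j</it><sub>1</sub>. The function name, argument names and numerical values are assumptions chosen for illustration; they are not taken from the data or from any published implementation.</p>
                  <p>
```python
def transferred_step_size(nu_j2_k, pi_j1_k, pi_j2_k, nu_j1_k, nu_j1_l):
    """Step-size factor for covariate j2 in step l after the transfer.

    The loss for covariate j1 is (1 - pi_j1_k) * (nu_j1_k - nu_j1_l).
    Dividing it by (1 - pi_j2_k) and adding it to nu_j2_k gives the update.
    Assumes pi_j2_k is strictly below 1, otherwise the ratio is undefined.
    """
    loss_j1 = (1.0 - pi_j1_k) * (nu_j1_k - nu_j1_l)
    return nu_j2_k + loss_j1 / (1.0 - pi_j2_k)


# Hypothetical values: covariate j1 has reached half of its maximum
# likelihood estimate and its step-size factor drops from 0.1 to 0.05.
nu_j2_l = transferred_step_size(
    nu_j2_k=0.1, pi_j1_k=0.5, pi_j2_k=0.2, nu_j1_k=0.1, nu_j1_l=0.05
)

# Both sides of equation (2), which the update is meant to balance.
gain_j2 = (1.0 - 0.2) * nu_j2_l - (1.0 - 0.2) * 0.1
loss_j1 = (1.0 - 0.5) * (0.1 - 0.05)
```
                  </p>
                  <p>In this sketch the gain for covariate <it>j</it><sub>2 </sub>and the loss for covariate <it>j</it><sub>1 </sub>agree up to floating-point rounding, which is the balance that equation (2) requires.</p>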
                  <p>This implies a decrease of the penalty parameter <inline-formula><m:math name="1471-2105-10-18-i34" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>&#955;</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>2</m:mn></m:msub><m:mo>,</m:mo><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4UdW2aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaem4AaSgabeaaaaa@327B@</m:annotation></m:semantics></m:math></inline-formula> via</p>
                  <p>
                     <display-formula id="M3">
                        <m:math name="1471-2105-10-18-i35" xmlns:m="http://www.w3.org/1998/Math/MathML">
                           <m:semantics>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#955;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>&#960;</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>j</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:msub>
                                          <m:mi>I</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>j</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mi>l</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mrow>
                                             <m:mi>s</m:mi>
                                             <m:mi>m</m:mi>
                                             <m:mi>f</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>1</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>1</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>l</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>1</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>l</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>&#955;</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>1</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>+</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>2</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>2</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>l</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>2</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>l</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>&#955;</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>j</m:mi>
                                                      <m:mn>2</m:mn>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mfrac>
                                    </m:mrow>
                                 </m:mfrac>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>I</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>j</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                              <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4UdW2aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaWgabeaakiabg2da9KqbaoaalaaabaGaeiikaGIaeGymaeJaeyOeI0IaeqiWda3aaSbaaeaacqWGQbGAdaWgaaqaaiabikdaYaqabaGaeiilaWIaem4AaSgabeaacqGGPaqkcqWGjbqsdaWgaaqaaiabdQgaQnaaBaaabaGaeGOmaidabeaacqGGSaalcqWGSbaBaeqaaaqaaiabcIcaOiabigdaXiabgkHiTiabdogaJnaaBaaabaGaem4CamNaemyBa0MaemOzaygabeaacqGGPaqkdaWcaaqaaiabcIcaOiabigdaXiabgkHiTiabec8aWnaaBaaabaGaemOAaO2aaSbaaeaacqaIXaqmaeqaaiabcYcaSiabdUgaRbqabaGaeiykaKIaemysaK0aaSbaaeaacqWGQbGAdaWgaaqaaiabigdaXaqabaGaeiilaWIaemiBaWgabeaaaeaacqWGjbqsdaWgaaqaaiabdQgaQnaaBaaabaGaeGymaedabeaacqGGSaalcqWGSbaBaeqaaiabgUcaRiabeU7aSnaaBaaabaGaemOAaO2aaSbaaeaacqaIXaqmaeqaaiabcYcaSiabdUgaRbqabaaaaiabgUcaRmaalaaabaGaeiikaGIaeGymaeJaeyOeI0IaeqiWda3aaSbaaeaacqWGQbGAdaWgaaqaaiabikdaYaqabaGaeiilaWIaem4AaSgabeaacqGGPaqkcqWGjbqsdaWgaaqaaiabdQgaQnaaBaaabaGaeGOmaidabeaacqGGSaalcqWGSbaBaeqaaaqaaiabdMeajnaaBaaabaGaemOAaO2aaSbaaeaacqaIYaGmaeqaaiabcYcaSiabdYgaSbqabaGaey4kaSIaeq4UdW2aaSbaaeaacqWGQbGAdaWgaaqaaiabikdaYaqabaGaeiilaWIaem4AaSgabeaaaaaaaOGaeyOeI0IaemysaK0aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaWgabeaakiabc6caUaaa@914E@</m:annotation>
                           </m:semantics>
                        </m:math>
                     </display-formula>
                  </p>
                  <p>Again, for computational simplicity, we use a fixed value of <inline-formula><m:math name="1471-2105-10-18-i36" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>I</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>1</m:mn></m:msub><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>+</m:mo><m:mn>1</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemysaK0aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaem4AaSMaey4kaSIaeGymaedabeaaaaa@33B2@</m:annotation></m:semantics></m:math></inline-formula> instead of <inline-formula><m:math name="1471-2105-10-18-i37" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>I</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>1</m:mn></m:msub><m:mo>,</m:mo><m:mi>l</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemysaK0aaSbaaSqaaiabdQgaQnaaBaaameaacqaIXaqmaeqaaSGaeiilaWIaemiBaWgabeaaaaa@31E2@</m:annotation></m:semantics></m:math></inline-formula>, and a value of <inline-formula><m:math name="1471-2105-10-18-i38" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>I</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>2</m:mn></m:msub><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>+</m:mo><m:mn>1</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemysaK0aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaem4AaSMaey4kaSIaeGymaedabeaaaaa@33B4@</m:annotation></m:semantics></m:math></inline-formula> instead of <inline-formula><m:math name="1471-2105-10-18-i39" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>I</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mn>2</m:mn></m:msub><m:mo>,</m:mo><m:mi>l</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemysaK0aaSbaaSqaaiabdQgaQnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaWgabeaaaaa@31E4@</m:annotation></m:semantics></m:math></inline-formula> in this update rule. Therefore, the new penalties of connected covariates can be calculated immediately after the boosting step, avoiding recalculation after every boosting step and storage of results from past boosting steps.</p>
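                  <p>The arithmetic of update rule (2) with the fixed information values can be sketched as follows. This is only an illustration: the function name, the argument names and the numerical inputs are assumptions that are not part of the original algorithm description, and the quantities <it>&#960;</it> are assumed to be smaller than 1.</p>
                  <p><![CDATA[
```python
def updated_connected_penalty(pi_sel, pi_con, info_sel, info_con,
                              lam_sel, lam_con, c_smf):
    """Penalty of a connected covariate j2 after covariate j1 was selected.

    pi_sel, pi_con     : the quantities pi_{j1,k} and pi_{j2,k} of update rule (2)
    info_sel, info_con : fixed information values I_{j1,k+1} and I_{j2,k+1}
    lam_sel, lam_con   : current penalties lambda_{j1,k} and lambda_{j2,k}
    c_smf              : step-size modification factor
    """
    # Term of the selected covariate j1, scaled by (1 - c_smf).
    transferred = (1.0 - c_smf) * (1.0 - pi_sel) * info_sel / (info_sel + lam_sel)
    # Term of the connected covariate j2 under its current penalty.
    own = (1.0 - pi_con) * info_con / (info_con + lam_con)
    # Right-hand side of (2): numerator over the summed terms, minus I_{j2}.
    return (1.0 - pi_con) * info_con / (transferred + own) - info_con


# Hypothetical input values, chosen only to show the direction of the change.
unchanged = updated_connected_penalty(0.2, 0.1, 100.0, 100.0, 900.0, 900.0, c_smf=1.0)
lowered = updated_connected_penalty(0.2, 0.1, 100.0, 100.0, 900.0, 900.0, c_smf=0.9)
print(unchanged, lowered)
```
]]></p>
                  <p>With <it>c</it><sub><it>smf </it></sub>= 1 the first term in the denominator vanishes and the penalty of the connected covariate stays at its current value, while values below 1 result in a smaller penalty.</p>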
                  <p>As an increase of the penalty via (1) would leave the potential loss in explained variability undistributed for a covariate without connections, the penalty update is only performed for covariates that correspond to genes that have a connection to another gene, with corresponding covariate, in a pathway. For connected genes, however, the question remains whether the total amount should be transferred to every connected covariate or whether the right-hand side of (2) should be divided by the number of connections. As componentwise boosting results in very sparse fits, it can be expected that only a few connected covariates will be selected in the remaining boosting steps. It therefore seems reasonable to assign the full amount to each connected covariate.</p>
                  <p>While a measure of uncertainty is not available for connections in a pathway in the KEGG pathway database, it might be available from other sources. Such information could easily be incorporated into the PathBoost algorithm by multiplying the right-hand side of (2) by the measure of uncertainty (given that the latter has values between 0 and 1). Also, information on the direction of relations could be incorporated by propagating changes of a penalty in one direction only.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>Choice of tuning parameters</p>
               </st>
               <p>The proposed PathBoost algorithm has three flexible parameters: an initial penalty <it>&#955;</it><sub><it>j</it>,1 </sub>= <it>&#955;</it>, <it>j </it>= 1,..., <it>p</it>, common to all covariates, the number of boosting steps <it>M</it>, and the step-size modification factor <it>c</it><sub><it>smf</it></sub>. The initial penalty parameter is of minor importance and can be chosen very coarsely. A value that roughly corresponds to initial step-size factors of about 0.01 works very well in our experience. For determining the step-size modification factor <it>c</it><sub><it>smf</it></sub>, a coarse line search is performed. For each value of <it>c</it><sub><it>smf</it></sub>, the optimal number of boosting steps is determined by 10-fold cross-validation. Then the value of <it>c</it><sub><it>smf</it></sub> that results in the overall maximum of the cross-validated (partial) log-likelihood is chosen.</p>
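               <p>The selection scheme for <it>c</it><sub><it>smf </it></sub>and the number of boosting steps can be sketched as below. The callable <it>cv_loglik_path</it> is a hypothetical stand-in for a routine that returns the 10-fold cross-validated (partial) log-likelihood for every number of boosting steps; the toy curve and the candidate grid are invented for illustration and do not stem from any data analysis.</p>
               <p><![CDATA[
```python
def coarse_line_search(c_smf_grid, cv_loglik_path):
    """Pick the c_smf and number of boosting steps with maximal CV log-likelihood.

    cv_loglik_path is a hypothetical callable: given c_smf, it returns a list
    whose entry m-1 is the cross-validated (partial) log-likelihood after m steps.
    """
    best = None
    for c_smf in c_smf_grid:
        path = cv_loglik_path(c_smf)
        # Optimal number of boosting steps for this value of c_smf.
        steps = max(range(len(path)), key=path.__getitem__) + 1
        candidate = (path[steps - 1], c_smf, steps)
        # Keep the value of c_smf with the overall maximum.
        if best is None or candidate[0] > best[0]:
            best = candidate
    loglik, c_smf, steps = best
    return c_smf, steps, loglik


def toy_path(c_smf, max_steps=50):
    # Invented curve for illustration only: maximal at step 20 and c_smf = 0.9.
    return [-(m - 20) ** 2 / 100.0 - (c_smf - 0.9) ** 2
            for m in range(1, max_steps + 1)]


print(coarse_line_search([0.7, 0.8, 0.9, 1.0], toy_path))
```
]]></p>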
            </sec>
         </sec>
         <sec>
            <st>
               <p>Simulation study</p>
            </st>
            <p>To evaluate the performance of the PathBoost approach, we perform a small simulation study that is identical, in terms of design, to the study employed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Models for a continuous response are built from <it>p </it>= 2200 covariates. Of these, 200 take the role of transcription factors. The remaining 2000 covariates consist of blocks of 10 covariates, where the covariates in each block are correlated with one specific transcription factor. The connection information, required for the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and for the PathBoost approach, is chosen such that there is a bidirectional connection between each transcription factor and each of the 10 covariates associated with it.</p>
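            <p>The connection structure of this design can be written down as a simple adjacency list. The sketch below assumes a particular ordering of the covariates (transcription factors first, followed by their blocks), which is not specified in the text; the function and variable names are likewise only illustrative.</p>
            <p><![CDATA[
```python
N_TF = 200        # covariates taking the role of transcription factors
BLOCK_SIZE = 10   # covariates correlated with each transcription factor


def build_connections(n_tf=N_TF, block_size=BLOCK_SIZE):
    """Return {covariate index: set of connected covariate indices}.

    The index layout is an assumption: transcription factors occupy
    0..n_tf-1, followed by their blocks in the same order.
    """
    p = n_tf * (1 + block_size)
    connections = {j: set() for j in range(p)}
    for tf in range(n_tf):
        first = n_tf + tf * block_size
        for member in range(first, first + block_size):
            # Bidirectional connection between factor and block member.
            connections[tf].add(member)
            connections[member].add(tf)
    return connections


connections = build_connections()
print(len(connections), len(connections[0]), connections[200])  # 2200 10 {0}
```
]]></p>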
            <p>The true parameter vector in the generating linear model is chosen such that only four transcription factors (and the corresponding blocks of correlated covariates) have an effect on the response. There are six types of generating models with varying size and type of effect.</p>
            <p>In Model 1, the true parameters of the covariates that are related to a transcription factor have the same sign as the parameter of the transcription factor itself. This is expected to be favorable for the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, as the penalty term employed there penalizes the squared (standardized) differences of parameters. However, for true parameters with opposite sign, this difference will be large, making it rather unlikely that the true values are recovered. Model 2 features such a setting, where in each block of 10 informative covariates, the parameters of three covariates have a sign opposite to that of the associated transcription factor. In <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> it was found that this considerably affected the performance of the approach with an explicit penalty structure. In contrast, we do not expect a performance degradation for the PathBoost approach as it does not rely on differences of parameters.</p>
            <p>Model 3 is similar to Model 1, and Model 4 is similar to Model 2, the only difference being a smaller effect of the covariates. Extending the design given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, we added two further settings, Model 5 and Model 6, which are based on Model 2 and Model 4 respectively. In these settings, only the first and the third block of informative covariates contain effects with opposite sign. Therefore, only six of a total of 40 informative connected covariates have an effect with a sign opposite to the associated transcription factor.</p>
            <p>As a minimal performance reference, an intercept-only model, i.e., a model that does not use any covariate information, is fitted. A more specific performance reference for the PathBoost approach is provided by componentwise likelihood-based boosting without pathway information <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The main tuning parameter there is the number of boosting steps, which is determined by 10-fold cross-validation. As already suggested, the additional parameter <it>c</it><sub><it>smf </it></sub>for the PathBoost approach is determined by a coarse line search.</p>
            <p>As a performance reference for the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, models are fitted by the Lasso <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, which also penalizes the absolute values of the parameters, but does not incorporate pathway information. For both approaches, fitting is performed by the least angle regression technique <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, which allows for fast computation of solutions for a large range of values for the penalty parameter that governs the absolute value term in the penalty. For the Lasso, only the latter has to be chosen, which is done by 10-fold cross-validation. For the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, a second penalty parameter is required, which, similar to the PathBoost approach, is determined by a coarse line search.</p>
            <p>All approaches are fitted to training sets of size <it>n </it>= 100, and prediction performance is evaluated on a test set of the same size. This is repeated 50 times. Table <tblr tid="T1">1</tblr> shows the corresponding mean values and standard errors of the predictive mean squared error.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Results of the simulation study.</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>Model</p>
                     </c>
                     <c ca="left">
                        <p>intercept</p>
                     </c>
                     <c ca="left">
                        <p>Lasso</p>
                     </c>
                     <c ca="left">
                        <p>Li&amp;Li</p>
                     </c>
                     <c ca="left">
                        <p>lik.boost</p>
                     </c>
                     <c ca="left">
                        <p>PathBoost</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>762.5 (14.4)</p>
                     </c>
                     <c ca="left">
                        <p>83.6 (2.6)</p>
                     </c>
                     <c ca="left">
                        <p>42.5 (1.1)</p>
                     </c>
                     <c ca="left">
                        <p>83.4 (2.4)</p>
                     </c>
                     <c ca="left">
                        <p>61.0 (1.7)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>305.8 (5.1)</p>
                     </c>
                     <c ca="left">
                        <p>91.0 (2.7)</p>
                     </c>
                     <c ca="left">
                        <p>80.8 (1.9)</p>
                     </c>
                     <c ca="left">
                        <p>89.7 (2.7)</p>
                     </c>
                     <c ca="left">
                        <p>64.8 (1.8)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>215.6 (4.1)</p>
                     </c>
                     <c ca="left">
                        <p>32.6 (0.9)</p>
                     </c>
                     <c ca="left">
                        <p>24.9 (0.8)</p>
                     </c>
                     <c ca="left">
                        <p>32.1 (0.9)</p>
                     </c>
                     <c ca="left">
                        <p>26.5 (0.7)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>131.1 (2.4)</p>
                     </c>
                     <c ca="left">
                        <p>32.6 (0.9)</p>
                     </c>
                     <c ca="left">
                        <p>29.9 (0.7)</p>
                     </c>
                     <c ca="left">
                        <p>32.5 (0.9)</p>
                     </c>
                     <c ca="left">
                        <p>26.9 (0.7)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>525.7 (9.9)</p>
                     </c>
                     <c ca="left">
                        <p>87.9 (2.6)</p>
                     </c>
                     <c ca="left">
                        <p>61.6 (1.5)</p>
                     </c>
                     <c ca="left">
                        <p>85.6 (2.2)</p>
                     </c>
                     <c ca="left">
                        <p>62.2 (1.6)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>171.6 (3.3)</p>
                     </c>
                     <c ca="left">
                        <p>32.9 (0.9)</p>
                     </c>
                     <c ca="left">
                        <p>27.6 (0.7)</p>
                     </c>
                     <c ca="left">
                        <p>32.2 (0.9)</p>
                     </c>
                     <c ca="left">
                        <p>26.9 (0.8)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Predictive mean squared error, mean and standard errors (in parentheses), for an intercept-only model, the Lasso, the pathway-based procedure proposed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> (Li&amp;Li), componentwise likelihood-based boosting (lik.boost), and boosting with pathway information (PathBoost) for six types of generating models.</p>
               </tblfn>
            </tbl>
            <p>The predictive mean squared error for all approaches is far below that of the intercept-only model, indicating that the prediction problems are very simple. As would be expected, the performance of the Lasso and componentwise boosting is very similar. So, there is no disadvantage in choosing either of the two as a basis for an approach that incorporates pathway information.</p>
            <p>The approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> outperforms the Lasso in all six settings. However, the performance difference is greatly diminished with Models 2 and 4, where several of the parameters of connected covariates have opposite sign. This highlights the difficulties potentially arising from an explicitly specified penalty structure. In contrast, the PathBoost approach is seen to result in a consistent improvement over boosting without pathway information in all settings. As would be expected from the design of the algorithm, the sign of the true parameters does not matter.</p>
            <p>Comparing the PathBoost approach to that given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, the latter shows better prediction performance for Models 1 and 3, i.e., where its penalty structure matches the sign of the true parameters. However, for Models 2 and 4, where the sign of parameters of connected covariates may be different, the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> performs worse. The performance of the two approaches is similar for Models 5 and 6, implying that already a small mismatch in sign information can nullify potential performance advantages gained by explicitly specifying the penalty structure in the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Application examples</p>
            </st>
            <p>In the following, we investigate the properties of the PathBoost approach in two application examples with microarray survival data, where a Cox proportional hazards model is fitted. When applying a technique for fitting predictive models that incorporates pathway information in a real application setting, there are two objectives. The first is to get better interpretability of the model fit, but the interpretation of a fit will only be credible if the second objective, that of improved prediction performance, is met. For adequately evaluating a potential gain in prediction performance from incorporating pathway information in a time-to-event setting, we employ bootstrap .632+ prediction error curve estimates <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
            <p>Pathway information is extracted from the KEGG pathway database <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Similar to <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, we restrict analyses to regulatory pathways, but also include cancer pathways. As a restriction to gene-gene relations would have resulted in a very small number of connections, any genes that are linked by some kind of KEGG relation are considered to be connected.</p>
            <p>While the glioblastoma data analyzed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> has a time-to-event response, closer inspection showed that the genes which have predictive power are not represented in KEGG pathways. Therefore, an approach focussed on the latter cannot improve over a null model that does not use any microarray information <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We investigate two other data sets, one from patients with large B-cell lymphoma <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and a second from patients with ovarian cancer <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
            <sec>
               <st>
                  <p>Diffuse large B-cell lymphoma</p>
               </st>
               <p>The data from patients with diffuse large B-cell lymphoma (DLBCL) has already been used for illustrating prediction error curve techniques <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and the likelihood-based boosting technique for the Cox proportional hazards model <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, on which the PathBoost approach is based. Details of preprocessing are described there. There are <it>n </it>= 240 observations with <it>p </it>= 7399 microarray features. Only 1281 of the latter could be related to KEGG pathways, based on the information available. To avoid restriction to a (relatively) small number of microarray features and to maintain comparability to previous analyses, the features not represented in KEGG pathways are also considered.</p>
               <p>A coarse line search, in combination with 10-fold cross-validation, results in selection of a step-size modification factor of <it>c</it><sub><it>smf </it></sub>= 0.9, which indicates that there might be some predictive pathway information in the data. Use of this factor results in 47 non-zero coefficients. In comparison, application of boosting without pathway information results in only 27 non-zero coefficients. There is an overlap of 20 non-zero coefficients, indicating that seven microarray features are no longer deemed important when pathway information is included, with 27 new features being added to the model.</p>
               <p>For checking whether the 27 added features just contain information similar to the seven features not found in the PathBoost fit, we applied componentwise boosting to a data set where the seven features were removed. If the 27 added features were a substitute for the seven removed features, some of the former should now be included. However, the resulting model has 20 non-zero coefficients, which all belong to the same covariates as the overlapping coefficients above, i.e., none of the 27 microarray features identified by PathBoost are included in the model. The prediction performance decreases (not shown), indicating that the seven microarray features contain information which is useful in combination with componentwise boosting. However, as PathBoost does not utilize these seven microarray features and nevertheless performs better, this underlines that PathBoost results in structurally different model fits.</p>
               <p>While in the model, fitted by boosting without pathway information, only two connected microarray features receive non-zero coefficient estimates, PathBoost results in 12 connected microarray features that receive non-zero estimates. This indicates that the fit from the latter algorithm reflects pathway knowledge. The coefficients of connected microarray features have different sign in several instances. As such a constellation did not influence the performance of PathBoost in the simulation study, an impact is also not expected in this application example.</p>
               <p>The change in structure of the fitted models is also seen from the coefficient paths, i.e., the parameter estimates plotted against the boosting steps. Figure <figr fid="F1">1</figr> shows the coefficient paths for boosting without pathway information (left panel) and PathBoost (right panel). While they are rather similar, there are some features with strong effect that appear only in the PathBoost fit (e.g., UNIQIDs 29911 and 27573). As the PathBoost algorithm increases the penalty for a covariate after it has been selected, it could be expected that the estimates are somewhat shrunken compared to the CoxBoost fit. This is seen, e.g., for the microarray features with UNIQIDs 32238 and 32679, which are no longer selected by PathBoost after a certain boosting step, as the penalty for them has become too large. This is different from approaches that use an explicit shrinkage term in the penalized (partial) log-likelihood criterion, as there it would be expected that the whole path is shrunken.</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>Coefficient paths for the DLBCL data</p>
                  </caption>
                  <text>
                     <p><b>Coefficient paths for the DLBCL data</b>. Coefficient paths for boosting without pathway information (left panel) and PathBoost (right panel), applied to DLBCL data. The models selected by 10-fold cross validation are indicated by vertical lines. Microarray features common to both models are indicated by solid curves, the others by dotted curves.</p>
                  </text>
                  <graphic file="1471-2105-10-18-1"/>
               </fig>
               <p>While use of pathway information is seen to have influenced the model fit, interpretation of the latter can only be assumed to be more valid, compared to the fit obtained without pathway information, if prediction performance is also improved. The thick curves in Figure <figr fid="F2">2</figr> indicate .632+ prediction error curve estimates (based on 100 bootstrap samples of size 0.632<it>n</it>, drawn without replacement). The Kaplan-Meier benchmark (grey curve) that does not use any covariate information is given as a reference. All procedures are seen to improve over the Kaplan-Meier benchmark, where PathBoost (solid curve) seems to have a slight advantage over boosting without pathway information (dashed curve). While the difference is not very large, it nevertheless improves the credibility of the PathBoost fit.</p>
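               <p>The resampling scheme underlying these estimates, i.e., drawing 0.632<it>n</it> observations without replacement, can be sketched as follows. Only the splitting step is shown, not the .632+ estimator itself; the function name, the rounding of 0.632<it>n</it> to an integer and the use of seeds are assumptions made for illustration.</p>
               <p><![CDATA[
```python
import random


def subsample_split(n, fraction=0.632, seed=None):
    """Split indices 0..n-1 into a training subsample and the left-out rest."""
    rng = random.Random(seed)
    size = int(round(fraction * n))  # the rounding rule is an assumption
    # random.sample draws without replacement.
    train = sorted(rng.sample(range(n), size))
    chosen = set(train)
    left_out = [i for i in range(n) if i not in chosen]
    return train, left_out


# 100 subsamples for n = 240 observations, as in the DLBCL example.
splits = [subsample_split(240, seed=b) for b in range(100)]
train, left_out = splits[0]
print(len(train), len(left_out))  # 152 88
```
]]></p>
               <p>In each repetition, a model would be fitted to the training subsample and evaluated on the left-out observations.</p>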
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Prediction error curves for the DLBCL data</p>
                  </caption>
                  <text>
                     <p><b>Prediction error curves for the DLBCL data</b>. Bootstrap .632+ prediction error curve estimates for boosting without pathway information (dashed curves) and PathBoost (solid curves), applied to DLBCL data, without (thick curves) and with clinical covariates (thin curves). The Kaplan-Meier benchmark (grey curve) and a purely clinical model (dotted curve) are given as a reference.</p>
                  </text>
                  <graphic file="1471-2105-10-18-2"/>
               </fig>
               <p>For 222 patients, a clinical predictor, the International Prognostic Index (IPI), is available. As it is typically of interest how much microarray information can improve over purely clinical models, we include the clinical covariate as a mandatory, unpenalized covariate, as described in <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The corresponding prediction error curve estimates are indicated by thin curves in Figure <figr fid="F2">2</figr>. The prediction performance of a purely clinical model is indicated by the dotted curve. It is seen that the combined models can improve over the purely clinical model. However, PathBoost (solid curve) can no longer improve over boosting without pathway information (dashed curve). The lack of additional value of pathway information in this setting is also reflected by the step-size modification factor, chosen by a line search, which is <it>c</it><sub><it>smf </it></sub>= 1. Therefore it seems that, in the present example, pathway information is most useful in describing phenomena that are already reflected by the clinical covariate.</p>
            </sec>
            <sec>
               <st>
                  <p>Ovarian cancer</p>
               </st>
               <p>The second data set used to illustrate the PathBoost approach is from patients with ovarian cancer. The original analysis of these data <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> already showed that there is a connection between pathway activity and survival, where pathway signatures were derived from prior experiments. In contrast, we investigate whether pathway knowledge derived from the KEGG database can also add to prediction of survival.</p>
               <p>For the 133 patients for whom time-to-event information is available, we preprocessed the microarray data using the RMA approach <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, resulting in 21801 microarray features. We restrict the analysis to those 4868 features that are related to any of the human KEGG pathways.</p>
               <p>The connections between genes, just as for the DLBCL data, are extracted from the regulatory KEGG pathways, including the cancer pathways. The step-size modification factor, selected by a line search in combination with 10-fold cross-validation, then is <it>c</it><sub><it>smf </it></sub>= 1, i.e., pathway information would not be expected to be useful for prediction of survival. However, when only the connections from the cancer pathways are considered, the resulting factor is <it>c</it><sub><it>smf </it></sub>= 0.9. This indicates that targeted pathway information might be useful, while use of too many pathways is detrimental to prediction performance. Figure <figr fid="F3">3</figr> shows bootstrap .632+ prediction error curve estimates for boosting without pathway information (thick dashed curve) and for the PathBoost approach (thick solid curve), when considering only the cancer pathways. We also investigate models that incorporate the clinical covariate "tumor stage" as a mandatory unpenalized covariate (thin curves). All models perform considerably better than the Kaplan-Meier benchmark. Just as for the DLBCL data, there is an advantage of PathBoost over boosting without pathway information, albeit a smaller one, indicating usefulness of pathway information for prediction. In contrast to the DLBCL example, PathBoost also performs better when the clinical covariate is included. This indicates that the pathways provide information in addition to the clinical covariate.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Prediction error curves for the ovarian cancer data</p>
                  </caption>
                  <text>
                     <p><b>Prediction error curves for the ovarian cancer data</b>. Bootstrap .632+ prediction error curve estimates for boosting without pathway information (dashed curves) and PathBoost (solid curves), applied to ovarian cancer data, without (thick curves) and with clinical covariates (thin curves). The Kaplan-Meier benchmark (grey curve) is given as a reference.</p>
                  </text>
                  <graphic file="1471-2105-10-18-3"/>
               </fig>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Integration of different sources of information promises to result in improved predictive models built from microarray data. For example, the potential of experimentally derived pathway signatures was already demonstrated in <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> for various independent cancer data sets.</p>
         <p>Another source of pathway knowledge is the KEGG pathway database <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, an approach was presented that utilizes this source for tailoring the penalty term in Lasso-like estimation. However, such approaches are not readily available for binary response and time-to-event data. Furthermore, they require explicit specification of a penalty structure, which is problematic, e.g., when the parameters of connected genes have different signs.</p>
         <p>As an alternative, we proposed a new likelihood-based boosting approach that also incorporates pathway information. Penalties are adapted after every boosting step, such that a microarray feature that is connected to another feature that has already received a non-zero parameter estimate is more likely to also receive a non-zero estimate. This avoids specification of a penalty structure, and therefore is not affected by parameters with opposite signs.</p>
         <p>The proposed PathBoost was seen to perform well in various settings of a simulation study, using the design employed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. While the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> performed better in settings where the signs of the true parameters matched its penalty structure, PathBoost showed equal or better performance in the other settings. This pattern of prediction performance might have been expected, as knowledge of the true signs of the parameters (in this case incorporated into the penalty structure) should result in increased prediction performance. However, in typical application settings such knowledge will rarely be available. Therefore, the PathBoost approach should be preferred. There still is a certain arbitrariness with respect to the suggested update rules, i.e., other rules that might also work could be devised. However, the good performance resulting from the suggested rules provides at least some justification.</p>
         <p>We employed the simulation design used in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> to allow for better comparison to the results there. However, the design itself has some limitations, making it difficult to draw conclusions on performance with real data. For example, the pathway information employed does not contain inaccuracies, which will probably be present in sources such as the KEGG database. Also, the signal-to-noise ratios are large, resulting in simple prediction problems that are atypical for microarray data. Furthermore, the simulation study is limited to continuous response settings, due to the lack of an algorithm for the approach given in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> for other response types. However, in most microarray applications the response is binary or a time-to-event response. Fitting predictive models for these is more difficult, and, therefore, less benefit from incorporating pathway information might be expected.</p>
         <p>The proposed boosting approach is easily adapted to different response types. Variants for generalized linear models and the Cox proportional hazards model were given. The latter was employed in two application examples, where the gain in prediction performance from incorporating pathway information was more moderate than in the simulation study. As indicated, this might, e.g., be due to inaccuracies in the KEGG database. The estimated parameters of several connected microarray features had opposite signs, indicating similarity to those scenarios of the simulation study where only PathBoost could fully utilize pathway information.</p>
         <p>In comparison to models fitted without pathway information, application of PathBoost resulted in structurally different model fits, now honoring knowledge from external sources such as the KEGG database. Credibility of the interpretation of the new model fits was underlined by improved prediction performance. Given more detailed pathway knowledge, e.g., with information on the direction of gene relations and measures of uncertainty being available, further improvement of model fits could be expected. As demonstrated, the proposed boosting algorithm is highly flexible in terms of being able to incorporate additional sources of knowledge. While further refinements could be devised, e.g., for including information from Gene Ontology, it can already be expected to provide better model fits with better prediction performance in many applications.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>HB developed and implemented the initial version of the proposed algorithm, performed the simulation study, applied the algorithm to the example data, and wrote most of the manuscript. MS contributed design decisions for the algorithm, helped with interpretation of the results for the simulation study and the example data, and revised the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We gratefully acknowledge support from Deutsche Forschungsgemeinschaft (DFG Forschergruppe FOR 534).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>KEGG: Kyoto Encyclopedia of Genes and Genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>27</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102409</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592173</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.27</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Nonparametric Pathway-Based Regression Models for Analysis of Genomic Data</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>2</issue>
            <fpage>265</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxl007</pubid>
                  <pubid idtype="pmpid" link="fulltext">16772399</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A Hidden Spatial-Temporal Markov Random Field Model for Network-Based Analysis of Time Course Gene Expression Data</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Annals of Applied Statistics</source>
            <pubdate>2008</pubdate>
            <volume>2</volume>
            <fpage>408</fpage>
            <lpage>429</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/07-AOAS145</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Incorporating Gene Networks into Statistical Tests for Genomic Data via a Spatially Correlated Mixture Model</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <issue>3</issue>
            <fpage>404</fpage>
            <lpage>411</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">18083717</pubid>
                  <pubid idtype="doi">10.1093/bioinformatics/btm612</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Gene Ontology: Tool for the Unification of Biology</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Multiple Testing on the Directed Acyclic Graph of Gene Ontology</p>
            </title>
            <aug>
               <au>
                  <snm>Goeman</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Mansmann</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <issue>4</issue>
            <fpage>537</fpage>
            <lpage>544</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm628</pubid>
                  <pubid idtype="pmpid" link="fulltext">18203773</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Group Additive Regression Models for Genomic Data Analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Luan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>100</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxm015</pubid>
                  <pubid idtype="pmpid" link="fulltext">17513311</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Oncogenic Pathway Signatures in Human Cancers as a Guide to Targeted Therapies</p>
            </title>
            <aug>
               <au>
                  <snm>Bild</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Potti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chasse</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Joshi</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Harpole</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lancaster</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Berchuck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Dressman</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>439</volume>
            <issue>7074</issue>
            <fpage>353</fpage>
            <lpage>357</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04296</pubid>
                  <pubid idtype="pmpid" link="fulltext">16273092</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Network-constrained Regularization and Variable Selection for Analysis of Genomic Data</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <issue>9</issue>
            <fpage>1175</fpage>
            <lpage>1182</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btn081</pubid>
                  <pubid idtype="pmpid" link="fulltext">18310618</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Regression Shrinkage and Selection via the Lasso</p>
            </title>
            <aug>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society B</source>
            <pubdate>1996</pubdate>
            <volume>58</volume>
            <fpage>267</fpage>
            <lpage>288</lpage>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Sparsity and Smoothness Via the Fused Lasso</p>
            </title>
            <aug>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosset</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society B</source>
            <pubdate>2005</pubdate>
            <volume>67</volume>
            <fpage>91</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1467-9868.2005.00490.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>L<sub>1</sub>-Regularization Path Algorithms for Generalized Linear Models</p>
            </title>
            <aug>
               <au>
                  <snm>Park</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society B</source>
            <pubdate>2007</pubdate>
            <volume>69</volume>
            <issue>4</issue>
            <fpage>659</fpage>
            <lpage>677</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1467-9868.2007.00607.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Boosting Algorithms: Regularization, Prediction and Model Fitting</p>
            </title>
            <aug>
               <au>
                  <snm>B&#252;hlmann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hothorn</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Statistical Science</source>
            <pubdate>2007</pubdate>
            <volume>22</volume>
            <issue>4</issue>
            <fpage>477</fpage>
            <lpage>505</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/07-STS242</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Boosting With the L2 Loss: Regression and Classification</p>
            </title>
            <aug>
               <au>
                  <snm>B&#252;hlmann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Journal of the American Statistical Association</source>
            <pubdate>2003</pubdate>
            <volume>98</volume>
            <fpage>324</fpage>
            <lpage>339</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/016214503000125</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Boosting Ridge Regression</p>
            </title>
            <aug>
               <au>
                  <snm>Tutz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Binder</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Computational Statistics &amp; Data Analysis</source>
            <pubdate>2007</pubdate>
            <volume>51</volume>
            <issue>12</issue>
            <fpage>6044</fpage>
            <lpage>6059</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.csda.2006.11.041</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Allowing for Mandatory Covariates in Boosting Estimation of Sparse High-Dimensional Survival Models</p>
            </title>
            <aug>
               <au>
                  <snm>Binder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schumacher</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>14</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2245904</pubid>
                  <pubid idtype="pmpid" link="fulltext">18186927</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-9-14</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Generalized Additive Modelling with Implicit Variable Selection by Likelihood Based Boosting</p>
            </title>
            <aug>
               <au>
                  <snm>Tutz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Binder</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Biometrics</source>
            <pubdate>2006</pubdate>
            <volume>62</volume>
            <fpage>961</fpage>
            <lpage>971</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1541-0420.2006.00578.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17156269</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Generalized Linear Models</p>
            </title>
            <aug>
               <au>
                  <snm>McCullagh</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Nelder</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <publisher>London, U.K.: Chapman &amp; Hall</publisher>
            <edition>2</edition>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Experiments with a new boosting algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Freund</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Schapire</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Machine Learning: Proc. Thirteenth International Conference</source>
            <publisher>San Francisco, CA: Morgan Kaufmann</publisher>
            <pubdate>1996</pubdate>
            <fpage>148</fpage>
            <lpage>156</lpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Greedy Function Approximation: A Gradient Boosting Machine</p>
            </title>
            <aug>
               <au>
                  <snm>Friedman</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>The Annals of Statistics</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>1189</fpage>
            <lpage>1232</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/aos/1013203451</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Least Angle Regression</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Johnstone</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>The Annals of Statistics</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>2</issue>
            <fpage>407</fpage>
            <lpage>499</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/009053604000000067</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Efron-type measures of prediction error for survival analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Gerds</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Schumacher</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Biometrics</source>
            <pubdate>2007</pubdate>
            <volume>63</volume>
            <issue>4</issue>
            <fpage>1283</fpage>
            <lpage>1287</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17651459</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Assessment of Survival Prediction Models Based on Microarray Data</p>
            </title>
            <aug>
               <au>
                  <snm>Schumacher</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Binder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gerds</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>14</issue>
            <fpage>1768</fpage>
            <lpage>1774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm232</pubid>
                  <pubid idtype="pmpid" link="fulltext">17485430</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Adapting Prediction Error Estimates for Biased Complexity Selection in High-Dimensional Bootstrap Samples</p>
            </title>
            <aug>
               <au>
                  <snm>Binder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schumacher</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Stat Appl Genet Mol Biol</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>Article 12</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid">18384265</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Comment on 'Network-Constrained Regularization and Variable Selection for Analysis of Genomic Data'</p>
            </title>
            <aug>
               <au>
                  <snm>Binder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schumacher</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <issue>21</issue>
            <fpage>2566</fpage>
            <lpage>2568</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btn412</pubid>
                  <pubid idtype="pmpid" link="fulltext">18682424</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The Use of Molecular Profiling to Predict Survival After Chemotherapy for Diffuse Large-B-cell Lymphoma</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenwald</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Connors</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Campo</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Fisher</snm>
                  <fnm>RI</fnm>
               </au>
               <au>
                  <snm>Gascoyne</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Muller-Hermelink</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Smeland</snm>
                  <fnm>EB</fnm>
               </au>
               <au>
                  <snm>Staudt</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>The New England Journal of Medicine</source>
            <pubdate>2002</pubdate>
            <volume>346</volume>
            <issue>25</issue>
            <fpage>1937</fpage>
            <lpage>1946</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1056/NEJMoa012914</pubid>
                  <pubid idtype="pmpid" link="fulltext">12075054</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data</p>
            </title>
            <aug>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Hobbs</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Collin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Beazer-Barclay</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>Antonellis</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Scherf</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>2</issue>
            <fpage>249</fpage>
            <lpage>264</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/4.2.249</pubid>
                  <pubid idtype="pmpid" link="fulltext">12925520</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
