<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1472-6963-2-1</ui>
   <ji>1472-6963</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Quality and methods of developing practice guidelines</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Cruse</snm>
               <fnm>Hugh</fnm>
               <insr iid="I1"/>
               <email>crusehh@moffitt.usf.edu</email>
            </au>
            <au id="A2">
               <snm>Winiarek</snm>
               <fnm>Magdalena</fnm>
               <insr iid="I1"/>
               <email>winiarmb@moffitt.usf.edu</email>
            </au>
            <au id="A3">
               <snm>Marshburn</snm>
               <fnm>Jan</fnm>
               <insr iid="I1"/>
               <email>marshbmj@moffitt.usf.edu</email>
            </au>
            <au id="A4">
               <snm>Clark</snm>
               <fnm>Otavio</fnm>
               <insr iid="I1"/>
               <email>clarkoa@moffitt.usf.edu</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Djulbegovic</snm>
               <fnm>Benjamin</fnm>
               <insr iid="I1"/>
               <email>djulbebm@moffitt.usf.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>H Lee Moffitt Cancer Center and Research institute, the University of South Florida, Tampa, FL, USA</p>
            </ins>
         </insg>
         <source>BMC Health Services Research</source>
         <issn>1472-6963</issn>
         <pubdate>2002</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>1</fpage>
         <url>http://www.biomedcentral.com/1472-6963/2/1</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1472-6963-2-1</pubid>
               <pubid idtype="pmpid">11825346</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>20</day>
               <month>10</month>
               <year>2001</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>11</day>
               <month>1</month>
               <year>2002</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>11</day>
               <month>1</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>Cruse et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>It is not known whether there are differences in the quality and recommendations between evidence-based (EB) and consensus-based (CB) guidelines. We used breast cancer guidelines as a case study to assess for these differences.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>Five different instruments to evaluate the quality of guidelines were identified by a literature search. We also searched MEDLINE and the Internet to locate 8 breast cancer guidelines. These guidelines were classified in three categories: evidence based, consensus based and consensus based with no explicit consideration of evidence (CB-EB). Each guideline was evaluated by three of the authors using each of the instruments. For each guideline we assessed the agreement among 14 decision points which were selected from the NCCN (National Cancer Comprehensive Network) guidelines algorithm. For each decision point we recorded the level of the quality of the information used to support it. A regression analysis was performed to assess if the percentage of high quality evidence used in the guidelines development was related to the overall quality of the guidelines.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Three guidelines were classified as EB, three as CB-EB and two as CB. The EB guidelines scored better than CB, with the CB-EB scoring in the middle among all instruments for guidelines quality assessment. No major disagreement in recommendations was detected among the guidelines regardless of the method used for development, but the EB guidelines had a better agreement with the benchmark guideline for any decision point. When the source of evidence used to support decision were of high quality, we found a higher level of full agreement among the guidelines' recommendations. Up to 94% of variation in the quality score among guidelines could be explained by the quality of evidence used for guidelines development.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>EB guidelines have a better quality than CB guidelines and CB-EB guidelines. Explicit use of high quality evidence can lead to a better agreement among recommendations. However, no major disagreement among guidelines was noted regardless of the method for their development.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The objective of guidelines development is to assist physicians and patients in making optimal health care decisions, which in turn should improve the quality of clinical practice <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
         <p>Different methods are used to develop guidelines. Some are developed by a consensus of experts while others also use a formal way to appraise the literature and create evidence-based (EB) guidelines. In general, evidence-based guidelines are considered to provide better recommendations for practice than consensus-based guidelines but are time consuming and expensive to create <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. This belief that EB guidelines are superior to other types of guideline is based on our normative views of methods for guidelines development <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and not on empirical comparison of practice recommendations using different methods for development of guidelines. To date no formal evaluation has been performed to detect if there are differences in the quality and recommendations between evidence-based and consensus-based (CB) guidelines.</p>
         <p>If guidelines developed by using consensus or evidence-based methods have the same quality and agree in the recommendations, then obviously resources spent on the laborious and time-consuming process of locating and appraising evidence can be used elsewhere. Otherwise, if evidence based guidelines have a better quality and their recommendations differ from those guidelines produced by consensus, then creation of evidence based guidelines may become the only acceptable method of guideline development.</p>
         <p>In this paper, we explore if there are differences in the quality and recommendations between EB and CB guidelines.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>To enable meaningful comparison, multiple recommendations produced by a given guideline method should be available. This objective is best met by focusing on the guidelines that comprehensively attempt to guide clinicians in the management of one disorder. Since breast cancer is an important disease and various organizations have produced guidelines using different methods <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, we conducted a comparison study of comprehensive breast cancer guidelines. We assessed both the differences in the quality as measured by using different quality instruments assessment and the level of agreement among guidelines according to the method of development.</p>
         <p>1. Identification and assessment of instruments for measurement of the quality of guidelines</p>
         <p>Since there is no uniformly accepted instrument for evaluation of the quality of guidelines, we first performed a comprehensive literature search to identify published tools for assessment of clinical practice guideline quality. We searched MEDLINE (1996&#8211;2000) using the keywords: guidelines, practice guidelines, quality, "weights and measures", "scale", psychometrics, reproducibility. Any article considered relevant to evaluate quality of guidelines was retrieved. The list of references of each article was also scanned. After an assessment of 14 papers by four of us, four instruments to assess the quality of guidelines were identified <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. An additional instrument (SIGN) was identified through Evidence-based Health Discussion Group (Table <tblr tid="T1">1</tblr>). For additional details on the instruments for evaluation of guidelines readers are referred to the Appendix (see <supplr sid="S1">Additional file</supplr>). To assess their reliability and reproducibility, we applied all identified instruments to each guideline (see below). We calculated the coefficient of agreement (kappa) among evaluators for each guideline <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. A good interobserver agreement was considered if kappa value exceeded 0.4 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. In our evaluation, two instruments <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr></abbrgrp> had a kappa interobserver agreement K > 0.4 among all investigators in 6 of 8 guidelines (Table <tblr tid="T1">1</tblr>). When it comes to evaluation of the quality of breast cancer guidelines these instruments <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr></abbrgrp> performed better than others and probably can be recommended for future use.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Interobserver agreement of instruments for assessment of the guidelines quality</p>
            </caption>
            <tblbdy cols="2">
               <r>
                  <c ca="left">
                     <p>
                        <b>Instrument</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Guidelines with K > 0.4 among all four evaluators</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Cluzeau (8)</p>
                  </c>
                  <c ca="left">
                     <p>ACCC (19), CMA (7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Grilli (9)</p>
                  </c>
                  <c ca="left">
                     <p>ACCC (19), NHMRC (16), MPS (18)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sanders(11)</p>
                  </c>
                  <c ca="left">
                     <p>None</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Petrie (SIGN) (10)</p>
                  </c>
                  <c ca="left">
                     <p>NCCN (6), ACCC (19), SIGN (17), ICSI (15), MPS (18), SSO (5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Shaneyfelt 1999 (12)</p>
                  </c>
                  <c ca="left">
                     <p>ACCC (19), CMA (7), NHMRC (16), ICSI (15), MPS (18), SSO (5)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Acronyms: ACCC &#8211; Association of Community Cancer Centers; CMA-Canadian Medical Association; ICSI &#8211; Institute for Clinical Systems Improvement; MPS &#8211; Multi Professional Societies; NCCN &#8211; National Comprehensive Cancer Network; NHMRC &#8211; The National Health and Medical Research Council; SIGN &#8211; Scottish Intercollegiate Guidelines Network; SSO &#8211; Society of Surgical Oncology.</p>
            </tblfn>
         </tbl>
         <p>2. Identification and classification of breast cancer guidelines</p>
         <p>A literature search was conducted for published breast cancer guidelines using MEDLINE for the years 1996 &#8211; April 2000. The following keywords were used in combination: Guidelines, Practice Guidelines, recommendations, breast neoplasms. An Internet search was also performed, using the method described by Sanders et al <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. 131 articles were retrieved, and reviewed for their content. We considered any article that fit the definition of the National Library of Medicine for practice guidelines: directions or principles presenting current or future rules of policy for the health care practitioner to assist him in patient care decisions regarding diagnosis, therapy, or related clinical circumstances <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Eight papers referred to breast cancer guidelines <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>,<abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> and were selected for the analysis.</p>
         <p>Each guideline was classified as CB, when there was no consideration about the quality of evidence used to make practice recommendations; as EB, when there was an explicit consideration of the quality of evidence in the development of guidelines; or as consensus based with no explicit consideration of evidence (CB-EB) when there were considerations about the evidence, but not in explicit manner. From these eight guidelines, three were classified as EB <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B7">7</abbr><abbr bid="B16">16</abbr></abbrgrp> three as CB-EB <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B18">18</abbr><abbr bid="B6">6</abbr></abbrgrp> and two as CB <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B5">5</abbr></abbrgrp> (Table <tblr tid="T2">2</tblr>).</p>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Classification of Breast Cancer Guidelines according to the method of development.</p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c ca="left">
                     <p>
                        <b>Evidence Based</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Consensus Based with no explicit consideration of evidence</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Consensus Based</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>CMA (7)</p>
                  </c>
                  <c ca="left">
                     <p>NCCN (6)</p>
                  </c>
                  <c ca="left">
                     <p>ACCC(19)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>SIGN (17)</p>
                  </c>
                  <c ca="left">
                     <p>ICSI (15)</p>
                  </c>
                  <c ca="left">
                     <p>SSO (5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>NHMRC(16)</p>
                  </c>
                  <c ca="left">
                     <p>MPS(18)</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Acronyms: see footnote in Table <tblr tid="T1">1</tblr></p>
            </tblfn>
         </tbl>
         <p>3. Evaluation of guidelines</p>
         <p>Each guideline was evaluated independently by three of us using each of the instruments. All discordances were resolved by a consensus meeting. Each guideline was scored according to the instructions of each instrument. The quality and rank was determined by the quotient of items scored positively by the total items scored for each instrument.</p>
         <p>4. Evaluation of agreement among guidelines</p>
         <p>Using instruments to evaluate practice guidelines yields conclusions regarding normative aspects of the guidelines development <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, but does not necessarily mean that recommendations provided by guidelines using different methods will produce different management advice to our patients. To assess if recommendations among various guidelines differ, we need to determine the level of agreement among guidelines for each specific decision point.</p>
         <p>Since NCCN (National Comprehensive Cancer Network) guidelines <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> were presented in explicit, algorithmic format, we used this one to identify the decision points for matched comparison with other guidelines. These guidelines have been developed by the leading 18 cancer institutions in the US and have been constantly updated and re-evaluated. They have also been developed to closely mimic clinical practice. Therefore, we feel that selection of decision points based on the NCCN guidelines were appropriate. We identified fourteen decision points in the management of stage I and II breast cancer that were linked to specific recommendations in the other guidelines for our comparison. Comparison of recommendations for advanced stages of breast cancer has not been performed since there was only one guideline that included it <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>Subsequently, four of us evaluated each of these decision points in each guideline examining level of agreement among various guidelines. Since matching between recommendations in the guidelines that were presented in non-algorithmic format was poor, we decided to use NCCN guidelines as a benchmark. We classified agreement of each guideline with the NCCN guidelines as having full agreement, partial agreement and disagreement. It was considered that guidelines agree with the NCCN if the management recommendation was the same; the guidelines were considered to disagree if they provided different recommendations. A partial agreement was judged to exist if the guideline recommended the same management but in a broadly defined sense and not in explicit, clear manner.</p>
         <p>Each of these decision points was also classified as supported by high quality evidence or not. High quality evidence was considered to be based on randomized trials (RCT) or systematic reviews (SR)/meta-analysis (MA). If the quality evidence was not based on RCT or SR/MA or was not stated, it was classified as low quality evidence.</p>
         <p>Subsequently, we performed a regression analysis to assess the contribution of the quality of evidence to the total score obtained by each instrument for the evaluation of the guidelines quality. Independent variable was the proportion of decisions supported by high quality evidence while dependent variable was score obtained by each instrument. A regression analysis was performed after it has assessed that the distribution of the variables was normal by Wilks-Shapiro test.</p>
         <suppl id="S1">
            <title>
               <p>Additional file</p>
            </title>
            <text>
               <p>The file contains bibliographic details and description of the published instruments for evaluation of the quality of practice guidelines</p>
            </text>
            <file name="1472-6963-2-1-S1.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Evaluation of the quality of guidelines</p>
            </st>
            <p>The results of the quality of each guideline according to each instrument are shown in Table <tblr tid="T3">3</tblr>. Overall, EB guidelines had higher scores than CB, and the CB-EB category ranked in the middle (Fig <figr fid="F1">1</figr>). As expected, the instruments for the evaluation of quality are based on the number of desired built-in normative features of good guidelines development, as initially recommended by Institute of Medicine<abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. This is further confirmed by the evaluation of the contribution of the quality of evidence to the final quality score: the regression analysis performed showed that the quality of guidelines, as measured by these instruments, is a function of the percentage of high quality evidence that each guideline contains. This suggests that evidence plays a major role in the composition of the quality scales. If the quality of evidence is poor, paying attention to other quality domains in the development of guidelines will not result in higher quality scores. Fig <figr fid="F2">2</figr> illustrates a relationship between the quality of evidence and the total quality score using the two instruments that achieved best agreement among evaluators <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr></abbrgrp>. It is quite remarkable to note that up to >94% variation in the score could be explained by the quality of evidence alone.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Average score of each guideline according to the method of development</p>
               </caption>
               <text>
                  <p>Average score of each guideline according to the method of development Acronyms and abbreviations: ACCC &#8211; Association of Community Cancer Centers; CMA-Canadian Medical Association; ICSI &#8211; Institute for Clinical Systems Improvement; MPS &#8211; Multi Professional Societies; NCCN &#8211; National Comprehensive Cancer Network; NHMRC &#8211; The National Health and Medical Research Council; SIGN &#8211; Scottish Intercollegiate Guidelines Network; SSO &#8211; Society of Surgical Oncology. EB: evidence-based guidelines; CB: consensus-based guidelines EB-CB: consensus-based guidelines with no explicit considerations of evidence</p>
               </text>
               <graphic file="1472-6963-2-1-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>A relationship between quality of evidence and total guideline quality score.</p>
               </caption>
               <text>
                  <p>A relationship between quality of evidence and total guideline quality score. Note that up to 94% of variation in the quality score can be explained by the quality of evidence.</p>
               </text>
               <graphic file="1472-6963-2-1-2"/>
            </fig>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Quality of breast cancer guidelines</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c cspan="7" ca="center">
                        <p>Instruments for assessment of guidelines quality</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Cluzeau (8)</p>
                     </c>
                     <c ca="center">
                        <p>Grilly (9)</p>
                     </c>
                     <c ca="center">
                        <p>Sanders (11)</p>
                     </c>
                     <c ca="center">
                        <p>Shaneyfelt (12)</p>
                     </c>
                     <c ca="center">
                        <p>Petrie (SIGN) (10)</p>
                     </c>
                     <c ca="center">
                        <p>Average Score</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SIGN(17)</p>
                     </c>
                     <c ca="right">
                        <p>0.94</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>0.84</p>
                     </c>
                     <c ca="right">
                        <p>0.82</p>
                     </c>
                     <c ca="right">
                        <p>0.92</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NHMRC(16)</p>
                     </c>
                     <c ca="right">
                        <p>0.77</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>0.80</p>
                     </c>
                     <c ca="right">
                        <p>0.75</p>
                     </c>
                     <c ca="right">
                        <p>0.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CMA(7)</p>
                     </c>
                     <c ca="right">
                        <p>0.74</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>0.93</p>
                     </c>
                     <c ca="right">
                        <p>0.88</p>
                     </c>
                     <c ca="right">
                        <p>0.65</p>
                     </c>
                     <c ca="right">
                        <p>0.84</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ICSI(15)</p>
                     </c>
                     <c ca="right">
                        <p>0.77</p>
                     </c>
                     <c ca="right">
                        <p>0.67</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>0.64</p>
                     </c>
                     <c ca="right">
                        <p>0.55</p>
                     </c>
                     <c ca="right">
                        <p>0.72</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NCCN(6)</p>
                     </c>
                     <c ca="right">
                        <p>0.66</p>
                     </c>
                     <c ca="right">
                        <p>0.67</p>
                     </c>
                     <c ca="right">
                        <p>0.86</p>
                     </c>
                     <c ca="right">
                        <p>0.60</p>
                     </c>
                     <c ca="right">
                        <p>0.44</p>
                     </c>
                     <c ca="right">
                        <p>0.64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MPS(18)</p>
                     </c>
                     <c ca="right">
                        <p>0.37</p>
                     </c>
                     <c ca="right">
                        <p>0.67</p>
                     </c>
                     <c ca="right">
                        <p>0.80</p>
                     </c>
                     <c ca="right">
                        <p>0.40</p>
                     </c>
                     <c ca="right">
                        <p>0.37</p>
                     </c>
                     <c ca="right">
                        <p>0.52</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ACCC(19)</p>
                     </c>
                     <c ca="right">
                        <p>0.36</p>
                     </c>
                     <c ca="right">
                        <p>0.00</p>
                     </c>
                     <c ca="right">
                        <p>0.60</p>
                     </c>
                     <c ca="right">
                        <p>0.40</p>
                     </c>
                     <c ca="right">
                        <p>0.32</p>
                     </c>
                     <c ca="right">
                        <p>0.34</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SSO(5)</p>
                     </c>
                     <c ca="right">
                        <p>0.18</p>
                     </c>
                     <c ca="right">
                        <p>0.00</p>
                     </c>
                     <c ca="right">
                        <p>0.20</p>
                     </c>
                     <c ca="right">
                        <p>0.20</p>
                     </c>
                     <c ca="right">
                        <p>0.18</p>
                     </c>
                     <c ca="right">
                        <p>0.15</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Acronyms: see footnote in Table <tblr tid="T1">1</tblr></p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Evaluation of agreement among guidelines</p>
            </st>
            <p>The agreement among each guideline for the 14 decision points is shown in Table <tblr tid="T4">4</tblr>. We obtained no major disagreements among guidelines, but the EB guidelines had a better agreement with the decision points in any situation than CB-guidelines and CB-EB guidelines. The fact that no major disagreements were seen regardless the method of development can probably be explained by the vagueness of recommendations by CB guidelines. As shown in Table <tblr tid="T4">4</tblr>, the number of decision points supported by high quality evidence is highest in the EB guidelines and zero in CB guidelines. The use of high quality evidence was significantly associated with a higher level of concordance among the decision points. When the source of evidence was of good quality (RCT or SR), we had 18 full agreements and 23 partial agreements (Chi square = 0.610, degrees of freedom = 1, p = 0.435). When the source of evidence was not stated or was of lower quality, we had 17 full agreements and 40 partial agreements (Chi-Square 9.281, Degrees of freedom 2, p= 0.002). This means that recommendations based on high quality evidence may lead to less disagreement and potentially less practice variation.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Level of agreement between NCCN guideline and other breast cancer guidelines.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Guideline</p>
                     </c>
                     <c ca="left">
                        <p>Agreement</p>
                     </c>
                     <c ca="left">
                        <p>Partial Agreement</p>
                     </c>
                     <c ca="left">
                        <p>Statistical significance (p)</p>
                     </c>
                     <c ca="left">
                        <p>N. of decision points with high quality evidence</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SIGN(17)</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>.285</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NHMRC(16)</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>.593</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CMA(7)</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>.285</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ICSI(15)</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>.109</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NCCN(6)</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MPS(18)</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>&lt;0.0001</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ACCC(19)</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>.285</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SSO(5)</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>&lt;0.0001</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Acronyms: see footnote table <tblr tid="T1">1</tblr></p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Guidelines have been increasingly used in medical decision-making. Different methods have been used in guideline development. Does it matter how guidelines were produced? Most authors believe that it matters very much <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and that guidelines produced using evidence-based methods are superior to other methodologies of development <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr><abbr bid="B9">9</abbr></abbrgrp>. However, empirical investigations to assess if guidelines produced by different methods have different quality and result in different recommendations have not been performed. Here, we report such a study.</p>
         <p>Using formal instruments for evaluation of the quality of guidelines we found that EB-guidelines had substantially higher score than CB-guidelines or guideline that considered evidence in a less formal way (CB-EB). As discussed above (see Results), this is not a surprising result, since the instruments for the guidelines evaluation measure the quality based on the number of desired normative characteristics in a particular guideline. Since appraisal of evidence is considered inherently important for the development of a good guideline, one would then expect that the guidelines that pay more attention to its evidence basis (i.e., those that are evidence-based) would receive higher quality score than other types of the guidelines (i.e. guidelines developed solely by a consensus process) (see Fig <figr fid="F1">1</figr>). This is also evident in our finding that variation in the total quality score can be up to 94% explained by the quality of evidence (see Fig <figr fid="F2">2</figr>).</p>
         <p>Not all instruments for evaluation of guidelines performed equally well. Only two of the instruments available to address the quality of guidelines had a good level of agreement among evaluators (k > 0.4) in most of guidelines. This result raises concern about the reproducibility of results using the other instruments reported in the literature. In general, a few studies have been done to evaluate reproducibility of the instruments for assessment of the guidelines quality. Any future study attempting to address the quality of guidelines should take this finding into account.</p>
         <p>A more interesting question is to assess if the recommendations among guidelines produced by different methods actually differ. We found no instance of total disagreement among guidelines regardless of the method of development. We also found that EB and CB-EB guidelines had more points of agreement with our benchmark guidelines (NCCN)<abbrgrp><abbr bid="B6">6</abbr></abbrgrp> than guidelines developed using exclusively consensus method. We also found that when high-quality evidence existed in the literature (see Results) less disagreement was found among various guidelines. This is not completely surprising because formulation of guidelines does not happen in a vacuum. Most guideline developers are experts in the field who have knowledge of the literature. When evidence is unequivocal, less disagreement may be expected. Consequently, less practice variation may be found when high-quality evidence exists.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>In conclusion, EB guidelines have a better quality than CB guidelines as measured by the quality assessment instruments used in this study. The explicit use of high quality evidence is desirable and can lead to a better agreement among recommendations. However, no major disagreement among guidelines was noted regardless of the method for their development.</p>
      </sec>
      <sec>
         <st>
            <p>Competing Interest</p>
         </st>
         <p>none declared</p>
      </sec>
      <sec>
         <st>
            <p>Acknowledgements</p>
         </st>
         <p>We thank Dr.Stephen Edge for reviewing our paper and his helpful comments and constructive critique.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Institute of Medicine. Guidelines for clinical practice: from development to use.</p>
            </title>
            <source>Washigton DC: National Academic Press;</source>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Evidence-based medicine and practice guidelines: an overview.</p>
            </title>
            <aug>
               <au>
                  <snm>Woolf</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Cancer Control</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <issue>4</issue>
            <fpage>362</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10895131</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Development of practice guidelines.</p>
            </title>
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Petrie</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Lancet</source>
            <pubdate>2000</pubdate>
            <volume>355</volume>
            <issue>9198</issue>
            <fpage>82</fpage>
            <lpage>3</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0140-6736(99)90326-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">10675160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Clinical decision making: from theory to practice. Practice policies-guidelines for methods.</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>JAMA</source>
            <pubdate>1990</pubdate>
            <volume>263</volume>
            <issue>13</issue>
            <fpage>1839</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/jama.263.13.1839</pubid>
                  <pubid idtype="pmpid">2313855</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Breast cancer surgical practice guidelines. Society of Surgical Oncology practice guidelines.</p>
            </title>
            <aug>
               <au>
                  <snm>Morrow</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bland</snm>
                  <fnm>KI</fnm>
               </au>
               <au>
                  <snm>Foster</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Oncology (Huntingt)</source>
            <pubdate>1997</pubdate>
            <volume>11</volume>
            <issue>6</issue>
            <fpage>877</fpage>
            <lpage>81</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9189942</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Update: NCCN practice guidelines for the treatment of breast cancer. National Comprehensive Cancer Network.</p>
            </title>
            <aug>
               <au>
                  <snm/>
                  <fnm/>
               </au>
            </aug>
            <source>Oncology (Huntingt)</source>
            <pubdate>1999</pubdate>
            <volume>13</volume>
            <issue>11A</issue>
            <fpage>187</fpage>
            <lpage>212</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10079469</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The Steering Committee on Clinical Practice Guidelines for the Care and Treatment of Breast Cancer.</p>
            </title>
            <source>CMAJ</source>
            <pubdate>1998</pubdate>
            <volume>158</volume>
            <issue>Suppl 3</issue>
            <fpage>S1</fpage>
            <lpage>2</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9484271</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Appraisal Instrument for Clinical Guidelines.</p>
            </title>
            <aug>
               <au>
                  <snm>Cluzeau</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Littlejohns</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Grimshaw</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Feder</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>London: St. George's Hospital Medical School;</source>
            <pubdate>1997</pubdate>
            <note>Available from: St. George's Hospital Medical School web site <url>http://www.sghms.ac.uk/depts/phs/hceu/clinguid.htm</url>. Accessed 11 June 2001.</note>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Practice guidelines developed by specialty societies: the need for a critical appraisal.</p>
            </title>
            <aug>
               <au>
                  <snm>Grilli</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Magrini</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Penna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mura</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liberati</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Lancet</source>
            <pubdate>2000</pubdate>
            <volume>355</volume>
            <issue>9198</issue>
            <fpage>103</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0140-6736(99)02171-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10675167</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Criteria for Appraisal for National Use &#8211; Scottish Intercollegiate Guidelines Network (SIGN).</p>
            </title>
            <aug>
               <au>
                  <snm>Petrie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barnwell</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Grimshaw</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>SIGN Publication Number 39, 1995. Edinburgh: Scottish Intercollegiate Guidelines Network (SIGN);</source>
            <pubdate>1995</pubdate>
            <note>Available from SIGN web site <url>http://www.sign.ac.uk/guidelines/fulltext/50/index.html</url> (Version 2001). Accessed 11 June 2001.</note>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Design and pilot evaluation of a system to develop computer-based site-specific practice guidelines from decision models.</p>
            </title>
            <aug>
               <au>
                  <snm>Sanders</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Nease</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Owens</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>Med Decis Making</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <issue>2</issue>
            <fpage>145</fpage>
            <lpage>59</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10772353</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Are guidelines following guidelines? The methodological quality of clinical practice guidelines in the peer-reviewed medical literature.</p>
            </title>
            <aug>
               <au>
                  <snm>Shaneyfelt</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Mayo-Smith</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Rothwangl</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>JAMA</source>
            <pubdate>1999</pubdate>
            <volume>281</volume>
            <issue>20</issue>
            <fpage>1900</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/jama.281.20.1900</pubid>
                  <pubid idtype="pmpid" link="fulltext">10349893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>A measurement of observer agreement for categorical data.</p>
            </title>
            <aug>
               <au>
                  <snm>Landis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Koch</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Biometrics</source>
            <pubdate>1977</pubdate>
            <volume>33</volume>
            <fpage>159</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubid idtype="pmpid">843571</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>NLM PubMed resources page.</p>
            </title>
            <pubdate>2001</pubdate>
            <note>Available via internet <url>http://www.ncbi.nlm.nih.gov/entrez/meshbrowser.cgi?term=Practice+Guidelines&amp;retrievestring=&amp;mbdetail=n</url> Accessed 11 June 2001.</note>
         </bibl>
         <bibl id="B15">
            <title>
               <p>ICSI Institute for Clinical Systems Improvement. ICSI Health Care Guideline: Breast Cancer Treatment.</p>
            </title>
            <source>ISCI,</source>
            <pubdate>2000</pubdate>
            <note>Available from: ICSI web site <url>http://www.icsi.org/guidelst.htm</url>. Accessed 11 June 2001</note>
         </bibl>
         <bibl id="B16">
            <title>
               <p>NHMRC-AU. NHMRC National Breast Cancer Centre &#8211; Clinical Practice Guidelines For The Management of Early Breast Cancer 1999.</p>
            </title>
            <source>NHMRC-AU</source>
            <pubdate>1999</pubdate>
            <note>Available from: NHMRC-AU web site <url>http://www.health.gov.au/nhmrc/advice/pdf/earlybrs.pdf</url> (Version 2000). Accessed 11 June 2001.</note>
         </bibl>
         <bibl id="B17">
            <title>
               <p>SIGN &#8211; Scottish Intercollegiate Guidelines Network. Breast Cancer in Women &#8211; A National Clinical Guideline. SIGN Publication Number 29,1998.</p>
            </title>
            <source>Edinburgh: Scottish Intercollegiate Guidelines Network (SIGN);</source>
            <pubdate>1998</pubdate>
            <note>Available from SIGN web site <url>http://www.sign.ac.uk/pdf/sign29.pdf</url>. Accessed 11 June 2001.</note>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Standards for diagnosis and management of invasive breast carcinoma. American College of Radiology. American College of Surgeons. College of American Pathologists. Society of Surgical Oncology.</p>
            </title>
            <aug>
               <au>
                  <snm>Winchester</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>CA Cancer J Clin</source>
            <pubdate>1998</pubdate>
            <volume>48</volume>
            <issue>2</issue>
            <fpage>83</fpage>
            <lpage>107</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9522824</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>ACCC &#8211; Association of Community Cancer Centers. Oncology Patient Management Guidelines. Breast Carcinoma version 3.0.</p>
            </title>
            <source>ACCC</source>
            <pubdate>1999</pubdate>
         </bibl>
      </refgrp>
      <sec>
         <st>
            <p>Pre-publication history</p>
         </st>
         <p>The pre-publication history for this paper can be accessed here:</p>
         <p>
            <url>http://www.biomedcentral.com/1472-6963/2/1/prepub</url>
         </p>
      </sec>
   </bm>
</art>

