<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1472-6947-4-21</ui>
   <ji>1472-6947</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Quantitative evaluation of recall and precision of CAT Crawler, a search engine specialized on retrieval of Critically Appraised Topics</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Dong</snm>
               <fnm>Peng</fnm>
               <insr iid="I1"/>
               <email>cindy_dongpeng@yahoo.com</email>
            </au>
            <au id="A2">
               <snm>Wong</snm>
               <mnm>Ling</mnm>
               <fnm>Ling</fnm>
               <insr iid="I1"/>
               <email>lingling@raffles.org</email>
            </au>
            <au id="A3">
               <snm>Ng</snm>
               <fnm>Sarah</fnm>
               <insr iid="I1"/>
               <email>sarahngxl@yahoo.com.sg</email>
            </au>
            <au id="A4">
               <snm>Loh</snm>
               <fnm>Marie</fnm>
               <insr iid="I1"/>
               <email>marie_lohcs@yahoo.com</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Mondry</snm>
               <fnm>Adrian</fnm>
               <insr iid="I1"/>
               <email>mondry@hotmail.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Medical and Clinical Informatics Group, Bioinformatics Institute, BMRC, A*STAR, Singapore</p>
            </ins>
         </insg>
         <source>BMC Medical Informatics and Decision Making</source>
         <issn>1472-6947</issn>
         <pubdate>2004</pubdate>
         <volume>4</volume>
         <issue>1</issue>
         <fpage>21</fpage>
         <url>http://www.biomedcentral.com/1472-6947/4/21</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15588311</pubid>
               <pubid idtype="doi">10.1186/1472-6947-4-21</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>20</day>
               <month>8</month>
               <year>2004</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>10</day>
               <month>12</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>10</day>
               <month>12</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Dong et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Critically Appraised Topics (CATs) are a useful tool that helps physicians to make clinical decisions as the healthcare moves towards the practice of Evidence-Based Medicine (EBM). The fast growing World Wide Web has provided a place for physicians to share their appraised topics online, but an increasing amount of time is needed to find a particular topic within such a rich repository.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>A web-based application, namely the CAT Crawler, was developed by Singapore's Bioinformatics Institute to allow physicians to adequately access available appraised topics on the Internet. A meta-search engine, as the core component of the application, finds relevant topics following keyword input. The primary objective of the work presented here is to evaluate the quantity and quality of search results obtained from the meta-search engine of the CAT Crawler by comparing them with those obtained from two individual CAT search engines. From the CAT libraries at these two sites, all possible keywords were extracted using a keyword extractor. Of those common to both libraries, ten were randomly chosen for evaluation. All ten were submitted to the two search engines individually, and through the meta-search engine of the CAT Crawler. Search results were evaluated for relevance both by medical amateurs and professionals, and the respective recall and precision were calculated.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>While achieving an identical recall, the meta-search engine showed a precision of 77.26% (&#177;14.45) compared to the individual search engines' 52.65% (&#177;12.0) (p &lt; 0.001).</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The results demonstrate the validity of the CAT Crawler meta-search engine approach. The improved precision due to inherent filters underlines the practical usefulness of this tool for clinicians.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Healthcare has been steadily moving towards Evidence-Based Medicine (EBM) since the term was formally introduced in 1992 by a group led by Gordon Guyatt at McMaster University, Canada <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. EBM promotes systematic literature review, critical appraisal skills and integrates scientific evidence with clinical expertise in the daily management of patients. The first three steps involved in the practice of EBM can comprehensively be summarized as a one-page written paper on a particular clinical topic, which is most commonly called a 'Critically Appraised Topic' (CAT) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Different acronyms have emerged in various specialties, such as Best Evidence Topics (BET) <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> in emergency medicine and Evidence-Based Journal Club Reviews (EBJCR) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> in pediatric critical care medicine. All these essentially provide physicians with a systematic method of formulating a clinical question and then critically evaluating the literature to answer the question posed.</p>
         <p>With the use of resources on the World Wide Web becoming common practice, several academic and healthcare organizations have built online CAT libraries for knowledge sharing with peer physicians. The repository of CATs has been growing steadily since the setup of the first accessible CATBank developed by the Centre for Evidence Based Medicine, Oxford in 1992 <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Among those, BestBETs developed by the Emergency Department, Manchester Royal Infirmary <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and UMHS by the Department of Pediatric, University of Michigan Health System, Ann Arbor <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> hold hundreds of distinct topics. They are furnished with individual search engines for fast and direct access to a particular topic. Given the wealth of such medical information scattered in cyberspace, the effectiveness of locating the correct information has become an important issue <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <sec>
            <st>
               <p>The CAT Crawler application</p>
            </st>
            <p>It is believed that more CATs will be added into the repositories as more people participate in EBM practice. However, the non-standardized electronic format of CATs has created much difficulty for physicians to access a particular topic. Accordingly, the CAT Crawler was developed at the Bioinformatics Institute, Singapore <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> to provide a one-stop search and download site for physicians by setting up a common platform to access eight popular online CAT libraries. CAT Crawler is freely accessible online <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
            <p>The core component of the CAT Crawler is a meta-search engine. Its search is currently based on CAT resources from eight public online libraries <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Once the user chooses the libraries he intends to use in the search, information tailored to his needs can be produced. The matched results are sorted according to their origins.</p>
            <p>Following the user input of a query keyword, a partial search is done through information extracted during an off-line process from six websites that do not hold search engines.</p>
            <p>The remaining search is carried out by querying the two individual search engines at BestBETs and UMHS. Use of the CAT Crawler is expected to have a quantitative and qualitative improvement of the retrieved results by post-processing obtained raw results from both libraries.</p>
         </sec>
         <sec>
            <st>
               <p>Motivation of the evaluation</p>
            </st>
            <p>The work presented here aims to evaluate the quantity and quality of the obtained results from the CAT Crawler meta-search engine, and thus to evaluate the validity and the usefulness of the application. Recall and precision were estimated to measure the performance of this meta-search engine versus the two individual search engines at BestBETs and UMHS.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>The workflow of this study is demonstrated in Figure <figr fid="F1">1</figr>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Workflow for evaluation of the CAT Crawler meta-search engine</p>
            </caption>
            <text>
               <p>Workflow for evaluation of the CAT Crawler meta-search engine</p>
            </text>
            <graphic file="1472-6947-4-21-1"/>
         </fig>
         <sec>
            <st>
               <p>Selection of ten query keywords</p>
            </st>
            <p>To find a viable sample of keywords for a test search, the titles of all CATs stored in the two CAT libraries, namely BestBETs and UMHS were submitted to <it>AnalogX Keyword Extractor</it>, which is freely available online <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. This led to a list of around 2000 keywords, of which approximately 500 were present in both libraries, of which ten were randomly chosen. In a second step, that list was curated so that only medically relevant keywords remained, excluding words such as <it>and </it>and <it>day</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Search for technically relevant documents in the dataset</p>
            </st>
            <p>In order to be able to calculate recall as detailed below, the <it>technical relevance </it>of all documents in the dataset must be assessed. In this study, a document is called technically relevant for a given search term if it contains this term in the full-text. <it>Perl </it>scripts were developed to examine all CATs in the two libraries BestBETs and UMHS and the total number of relevant documents as per the above definition in each library was collected for further calculation. This was done for each selected keyword and the process was independent from the search using the three search engines: the CAT Crawler, BestBETs and UMHS.</p>
         </sec>
         <sec>
            <st>
               <p>Relevance evaluation of the retrieval results</p>
            </st>
            <p>In the next step, those ten keywords were submitted to the search engines at BestBETs and UMHS, and to the CAT Crawler meta-search engine. The retrieved links were evaluated for their relevance by 13 volunteers, who are categorized into three groups. Among them, one physician in Group I represents medical professionals, six persons in Group II represent people who were trained in biology or medicine, and six persons in Group III represent people who do not have any medical background.</p>
         </sec>
         <sec>
            <st>
               <p>Calculation of recall and precision</p>
            </st>
            <p>Recall and precision are two accepted measurements to determine the utility of an information retrieval system or search strategy <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. They are defined as:</p>
            <p>
               <graphic file="1472-6947-4-21-i1.gif"/>
            </p>
            <p>Despite the relevance evaluation from 13 volunteers, it is necessary to know the total number of the relevant documents in a database for each query keyword in order to estimate the recall. In the present study, a particular CAT in a database was defined as technically relevant if the keyword could be found in its full-text article.</p>
            <p>The CAT Crawler is designed not to hold permanently any full-text CATs <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. When a query is done choosing the option to search only BestBETs and UMHS, the total number of relevant document in its acute database is equivalent to the sum of the number of relevant documents in the two libraries BestBETs and UMHS. Accordingly, the recall and precision of the CAT Crawler meta-search engine are revised as:</p>
            <p>
               <graphic file="1472-6947-4-21-i2.gif"/>
            </p>
            <p>Similarly, the recall and precision of the search engines at BestBETs and UMHS are estimated based on the combined repository of the two individual sites. The revised formula are shown below:</p>
            <p>
               <graphic file="1472-6947-4-21-i3.gif"/>
            </p>
         </sec>
         <sec>
            <st>
               <p>Performance evaluation of the CAT Crawler versus BestBETs and UMHS</p>
            </st>
            <p>The averaged precision and recall over all evaluators are used to evaluate the performance of the CAT Crawler meta-search engine. These values are compared to the estimate based on the search results from the two individual search engines at BestBETs and UMHS.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Ten keywords for the search engine evaluation</p>
            </st>
            <p>According to the predefined selection criteria, the ten keywords listed in Table <tblr tid="T1">1</tblr> were selected as the seed for a test search. The number of retrieved results from each search engine was gathered with respect to each keyword query. For the selected ten medically relevant keywords, the total number of matched results are 116, 65 and132 corresponding to the three search engines at BestBETs, UMHS and CAT Crawler. The difference of 49 retrievals between the CAT Crawler and the sum of BestBETs and UMHS reflects the meta-search engine's inherent filter function which is described previously <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Ten random keywords and corresponding number of retrieved results from search engine at BestBETs, UMHS and CAT Crawler</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Keyword</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Search Engine</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>BestBETs</p>
                     </c>
                     <c ca="center">
                        <p>UMHS</p>
                     </c>
                     <c ca="center">
                        <p>CAT Crawler</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Appendicitis</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Colic</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Intubation</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ketoacidosis</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Octreotide</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Palsy</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prophylaxis</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sleep</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tape</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ultrasound</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>116</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                     <c ca="center">
                        <p>132</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Performance evaluation of the CAT Crawler versus BestBETs and UMHS</p>
            </st>
            <p>To compare the performance of the CAT Crawler meta-search engine to that of the two individual search engines, recall and precision were computed and averaged over the evaluation of all 13 participators. The data recorded are shown in Table <tblr tid="T2">2</tblr>. As the CAT Crawler meta-search engine is built upon the two individual search engines, the document collection for evaluation is the combined repository of BestBETs and UMHS. The retrieved relevant documents from the CAT Crawler are the same as that from the individual search engines. This leads to the identical recall for both cases (Table <tblr tid="T2">2</tblr>). The average precision is increased from the individual search engines' 52.65% (&#177;12.0) to the CAT Crawler's 77.26% (&#177;14.45). Figure <figr fid="F2">2</figr> provides a more intuitive comparison corresponding to each keyword.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Numerical recall and precision for the CAT Crawler meta-search engine and two individual search engines at BestBETs and UMHS</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Recall (%)</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Precision (%)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>BestBETs &amp; UMHS</p>
                     </c>
                     <c ca="center">
                        <p>CAT Crawler</p>
                     </c>
                     <c ca="center">
                        <p>BestBETs &amp; UMHS</p>
                     </c>
                     <c ca="center">
                        <p>CAT Crawler</p>
                     </c>
                     <c ca="center">
                        <p>p-value</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Appendicitis</p>
                     </c>
                     <c ca="center">
                        <p>96.15</p>
                     </c>
                     <c ca="center">
                        <p>96.15</p>
                     </c>
                     <c ca="center">
                        <p>76.92 (&#177;4.80)</p>
                     </c>
                     <c ca="center">
                        <p>96.15 (&#177;6.00)</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Colic</p>
                     </c>
                     <c ca="center">
                        <p>54.81</p>
                     </c>
                     <c ca="center">
                        <p>54.81</p>
                     </c>
                     <c ca="center">
                        <p>51.58 (&#177;2.58)</p>
                     </c>
                     <c ca="center">
                        <p>97.44 (&#177;4.87)</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Intubation</p>
                     </c>
                     <c ca="center">
                        <p>44.12</p>
                     </c>
                     <c ca="center">
                        <p>44.12</p>
                     </c>
                     <c ca="center">
                        <p>48.39 (&#177;13.56)</p>
                     </c>
                     <c ca="center">
                        <p>68.18 (&#177;19.10)</p>
                     </c>
                     <c ca="center">
                        <p>0.130</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ketoacidosis</p>
                     </c>
                     <c ca="center">
                        <p>48.72</p>
                     </c>
                     <c ca="center">
                        <p>48.72</p>
                     </c>
                     <c ca="center">
                        <p>36.54 (&#177;12.97)</p>
                     </c>
                     <c ca="center">
                        <p>73.08 (&#177;25.94)</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Octreotide</p>
                     </c>
                     <c ca="center">
                        <p>59.62</p>
                     </c>
                     <c ca="center">
                        <p>59.62</p>
                     </c>
                     <c ca="center">
                        <p>47.69 (&#177;10.13)</p>
                     </c>
                     <c ca="center">
                        <p>79.49 (&#177;16.88)</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Palsy</p>
                     </c>
                     <c ca="center">
                        <p>70.77</p>
                     </c>
                     <c ca="center">
                        <p>70.77</p>
                     </c>
                     <c ca="center">
                        <p>64.34 (&#177;16.37)</p>
                     </c>
                     <c ca="center">
                        <p>70.77 (&#177;18.01)</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prophylaxis</p>
                     </c>
                     <c ca="center">
                        <p>67.03</p>
                     </c>
                     <c ca="center">
                        <p>67.03</p>
                     </c>
                     <c ca="center">
                        <p>63.41 (&#177;11.60)</p>
                     </c>
                     <c ca="center">
                        <p>78.21 (&#177;14.31)</p>
                     </c>
                     <c ca="center">
                        <p>0.074</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sleep</p>
                     </c>
                     <c ca="center">
                        <p>57.95</p>
                     </c>
                     <c ca="center">
                        <p>57.95</p>
                     </c>
                     <c ca="center">
                        <p>48.29 (&#177;19.82)</p>
                     </c>
                     <c ca="center">
                        <p>54.33 (&#177;22.30)</p>
                     </c>
                     <c ca="center">
                        <p>0.038</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tape</p>
                     </c>
                     <c ca="center">
                        <p>46.15</p>
                     </c>
                     <c ca="center">
                        <p>46.15</p>
                     </c>
                     <c ca="center">
                        <p>46.15 (&#177;7.31)</p>
                     </c>
                     <c ca="center">
                        <p>92.31 (&#177;14.62)</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ultrasound</p>
                     </c>
                     <c ca="center">
                        <p>42.22</p>
                     </c>
                     <c ca="center">
                        <p>42.22</p>
                     </c>
                     <c ca="center">
                        <p>43.22 (&#177;8.06)</p>
                     </c>
                     <c ca="center">
                        <p>62.60 (&#177;11.68)</p>
                     </c>
                     <c ca="center">
                        <p>0.017</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Average</p>
                     </c>
                     <c ca="center">
                        <p>58.75 (&#177;16.25)</p>
                     </c>
                     <c ca="center">
                        <p>58.75 (&#177;16.25)</p>
                     </c>
                     <c ca="center">
                        <p>52.65 (&#177;12.0)</p>
                     </c>
                     <c ca="center">
                        <p>77.26 (&#177;14.45)</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Precision plot of the CAT Crawler meta-search engine and two individual search engines</p>
               </caption>
               <text>
                  <p>Precision plot of the CAT Crawler meta-search engine and two individual search engines</p>
               </text>
               <graphic file="1472-6947-4-21-2"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The performance evaluation clearly places the CAT Crawler meta-search engine on par with the individual search engines at BestBETs and UMHS as far as recall is concerned, and well above them for precision (see Table <tblr tid="T2">2</tblr> and Figure <figr fid="F2">2</figr>). According to these results, the application can be called successful: by using the CAT Crawler to look for relevant information at specific sites, the medical professional will obtain as much information as by going to the sites directly, but the precision of the obtained results will be higher.</p>
         <p>Benoit <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> has analyzed various methods of information retrieval and their impact on user behavior. He finds that users wish for greater interactive opportunities to determine for themselves the potential relevance of documents, and that a parts-of-document approach is preferable for many information retrieval situations. At present, the CAT Crawler allows a number of interactive opportunities <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, but their implementation would have no impact on the calculation of recall and precision under the condition of the present study. Benoit's reasoning should be kept in mind, however, for improving the user friendliness in the sense that some further useful filter functions can be included in future versions of the application. While such advanced search functions will be profitable when large datasets are studied, the currently still manageable information in the online CAT libraries <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> will serve the user better if initially displayed in a broader way. For example, some of the information displayed here may be older than 18 months, which makes it undesirable according to the strict rules for CAT updating as defined by Sackett et al <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Formally outdated information, however, may in a given situation still be "best evidence" and positively influence the decision-making. Use of filters to block aged information will certainly influence this process.</p>
         <p>Despite the encouraging results, some fundamental questions regarding the evaluation of this meta-search engine in particular, and also meta-search engines in general remain unsolved.</p>
         <p>With regard to recall, there is the theoretical possibility that manually searching all documents at a given repository will yield a higher recall for a given search term. In view of hundreds of CAT documents per repository, however, it seems unlikely that a human evaluator's attention will not wander, leading to less than optimal scrutiny of the documents and introducing a non-quantifiable error to the evaluation. This is a general problem of knowledge databases, especially when indexing is done by humans, whose decisions are not consistent. In a study of 700 Medline references indexed in duplicate, the consistency of main subject-heading indexing was only 68% and that for heading-subheading combinations was significantly less <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Also, in two studies <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> on Medline searching, there was considerable disagreement by those judging relevance of the retrieved documents regarding which documents were relevant to a given query.</p>
         <p>In order to overcome this problem, the number of documents that contained a given keyword as found by the keyword extractor was used as the basis for calculating the technical recall. This may (or may not) lead to numerical results for recall that differ from the absolute true value as determined above. As the same numbers are used throughout, however, the comparison of search results obtained by the individual search engines and the CAT Crawler meta-search engine remains valid.</p>
         <p>Critics have pointed out the over-reliance of researchers on the use of recall and precision in evaluation studies <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and the difficulty to design an experiment that allows both laboratory-style control and operational realism <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. For instance, recall may be of only little consequence once the user has found a useful document. Rhodes and Maes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> evaluated both with a traditional field user test and then asked for relevance feedback. In their experiment, users gave a score 1&#8211;5 to each document that was delivered to calculate an overall average value for perceived precision. While a document can get a high score for precision, it may at the same time get a low score for practical usefulness. This was often due to the fact that the documents were already known to the users, in some cases had even been written by them. Accordingly, Rhodes and Maes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> added features to the system that weeded out relevant documents that by some predefined criteria would not be useful. As a result, the measurable precision could be worse, but the overall usefulness could be better. In the study presented here, a similar approach was chosen in the instructions to the evaluators in the sense that they could make the distinction between 'irrelevant' (e.g. the retrieved document was only a web hosted clinical question) and 'medically irrelevant' (e.g. the word <it>Appendicitis </it>appeared only in the reference section of a document dealing with questions of abdominal pain relief). Due to the relatively small number, no difference could be detected between the various grades of relevance, and results were pooled to relevant/irrelevant and used for calculating recall and precision as described above. If a larger number of volunteers could be recruited, repetition of this evaluation might yield interesting results.</p>
         <p>Other approaches have been spawned to evaluating system effectiveness in order to minimize these problems with recall and precision. One example are task-oriented methods that measure how well the user can perform certain tasks <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. These different approaches were not chosen in this study for a reason: the primary aim was to compare the search engines. Under the present restrictions, recall and precision allow to answer this question.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>In summary, the data obtained from the analysis of search results obtained from identical queries submitted to the two CAT libraries at BestBETs and UMHS, using either their respective search engines or the CAT Crawler meta-search engine, showed a competitive recall, and superior precision of the meta-search engine compared to the individual search engines.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>PD participated in the design of the study, data analysis and drafting of the manuscript. LLW and SN generated raw data for the study. ML was involved in drafting the manuscript. AM designed the study and participated in the drafting of the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank the staff and students of the Bioinformatics Institute for volunteering to evaluate the performance of search.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Evidence-Based Medicine. A new approach to teaching the practice of medicine.</p>
            </title>
            <aug>
               <au>
                  <snm>Group</snm>
                  <fnm>EBMW</fnm>
               </au>
            </aug>
            <source>JAMA</source>
            <pubdate>1992</pubdate>
            <volume>268</volume>
            <fpage>2420</fpage>
            <lpage>2425</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/jama.268.17.2420</pubid>
                  <pubid idtype="pmpid">1404801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Evidence based medicine: what it is and what it isn't</p>
            </title>
            <aug>
               <au>
                  <snm>Sackett</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Haynes</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>BMJ</source>
            <pubdate>1996</pubdate>
            <volume>312</volume>
            <fpage>71</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8555924</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evidence-Based Medicine: How to practice and teach EBM</p>
            </title>
            <aug>
               <au>
                  <snm>Sackett</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Straus</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Haynes</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <publisher> London, Churchill Livingstone</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The critically appraised topic: a practical approach to learning critical appraisal.</p>
            </title>
            <aug>
               <au>
                  <snm>Sauve</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Meade</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Farkouh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Sackett</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Ann Roy Soc Phys Surg Canada</source>
            <pubdate>1995</pubdate>
            <volume>28</volume>
            <fpage>396</fpage>
            <lpage>398</lpage>
         </bibl>
         <bibl id="B5">
            <title>
               <p>BETs and CATs, Emergency Department, Manchester Royal Infirmary</p>
            </title>
            <fpage> [http://www.bestbets.org/background/betscats.html]</fpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Pediatric Critical Care Medicine, Evidence-Based Journal Club Review</p>
            </title>
            <fpage> [http://pedsccm.wustl.edu/ebjournal_club.html]</fpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>CAT Library, Oxford-Centre for Evidence Based Medicine</p>
            </title>
            <fpage> [http://www.minervation.com/cebm2/cats/allcats.html]</fpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>BET Database Search Engine, Emergency Department, Manchester Royal Infirmary</p>
            </title>
            <fpage> [http://www.bestbets.org/database/search.html]</fpage>
         </bibl>
         <bibl id="B9">
            <title>
               <p>CAT Library Search Engine, University of Michigan</p>
            </title>
            <fpage> [http://www.med.umich.edu/pediatrics/ebm/Search.htm]</fpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The retrieval effectiveness of medical information on the web.</p>
            </title>
            <aug>
               <au>
                  <snm>Bin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lun</snm>
                  <fnm>KC</fnm>
               </au>
            </aug>
            <source>Int J Med Inf</source>
            <pubdate>2001</pubdate>
            <volume>62</volume>
            <fpage>155</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1386-5056(01)00159-9</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Enhanced quality and quantity of retrieval of Critically Appraised Topics using the CAT Crawler</p>
            </title>
            <aug>
               <au>
                  <snm>Dong</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mondry</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Med Inform Internet Med</source>
            <pubdate>2004</pubdate>
            <volume>29</volume>
            <fpage>43</fpage>
            <lpage>55</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/14639230310001655849</pubid>
                  <pubid idtype="pmpid" link="fulltext">15204609</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>CAT Crawler - an online resource for Critically Appraised Topics (CATs)</p>
            </title>
            <url>http://www.bii.as-tar.edu.sg/research/mig/cat.asp</url>
         </bibl>
         <bibl id="B13">
            <title>
               <p>AnalogX Keyword Extractor</p>
            </title>
            <url>http://www.analogx.com</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Information-Retrieval Systems</p>
            </title>
            <aug>
               <au>
                  <snm>Hersh</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Detmer</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Frisse</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Medical Informatics</source>
            <publisher>New York, Springer</publisher>
            <editor>H SE and E PL</editor>
            <pubdate>2001</pubdate>
            <fpage>539</fpage>
            <lpage>572</lpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Properties-based retrieval and user decision states: User control and behavior modeling.</p>
            </title>
            <aug>
               <au>
                  <snm>Benoit</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>JASIST</source>
            <pubdate>2004</pubdate>
            <volume>55</volume>
            <fpage>488</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/asi.10399</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Indexing consistency in MEDLINE</p>
            </title>
            <aug>
               <au>
                  <snm>Funk</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Reid</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Bull Med Libr Assoc</source>
            <pubdate>1983</pubdate>
            <volume>71</volume>
            <fpage>176</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">227138</pubid>
                  <pubid idtype="pmpid">6344946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Online access to MEDLINE in clinical settings: A study of use and usefulness</p>
            </title>
            <aug>
               <au>
                  <snm>Haynes</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>McKibbon</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Ryan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fitzgerald</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ramsden</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Ann Intern Med</source>
            <pubdate>1990</pubdate>
            <volume>112</volume>
            <fpage>78</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2403476</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Relevance and retrieval evaluation: perspectives from medicine</p>
            </title>
            <aug>
               <au>
                  <snm>Hersh</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>J Am Soc Inform Sci</source>
            <pubdate>1994</pubdate>
            <volume>45</volume>
            <fpage>201</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/(SICI)1097-4571(199404)45:3&lt;201::AID-ASI9>3.0.CO;2-W</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Evaluation in information retrieval</p>
            </title>
            <aug>
               <au>
                  <snm>Robertson</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>ESSIR LNCS</source>
            <publisher>, Springer-Verlag</publisher>
            <editor>Agosti M, Crestani F and Pasi G</editor>
            <pubdate>2000</pubdate>
            <fpage>81</fpage>
            <lpage>92</lpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Just-in-time information retrieval agents</p>
            </title>
            <aug>
               <au>
                  <snm>Rhodes</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Maes</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>IBM Systems Journal</source>
            <pubdate>2000</pubdate>
            <volume>39</volume>
            <fpage>685</fpage>
            <lpage>704</lpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Formative design-evaluation of Superbook</p>
            </title>
            <aug>
               <au>
                  <snm>Egan</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Remde</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Gomez</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Landauer</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Eberhardt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lochbaum</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>ACM Trans Inf Syst</source>
            <pubdate>1989</pubdate>
            <volume>7</volume>
            <fpage>30</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1145/64789.64790</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Towards new measures of information retrieval evaluation: Jul 9-13; Seattle, Washington, USA.</p>
            </title>
            <aug>
               <au>
                  <snm>Hersh</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Elliot</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Hickam</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Molnar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Leichtenstein</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <publisher>New York: ACM Press</publisher>
            <pubdate>1995</pubdate>
            <fpage>164</fpage>
            <lpage>170</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The scientific community's response to evidence of fraudulent publication. The Robert Slutsky case.</p>
            </title>
            <aug>
               <au>
                  <snm>Whitely</snm>
                  <fnm>WP</fnm>
               </au>
               <au>
                  <snm>Rennie</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hafner</snm>
                  <fnm>AW</fnm>
               </au>
            </aug>
            <source>JAMA</source>
            <pubdate>1994</pubdate>
            <volume>272</volume>
            <fpage>170</fpage>
            <lpage>173</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/jama.272.2.170</pubid>
                  <pubid idtype="pmpid">8015137</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A task-oriented approach to information retrieval evaluation</p>
            </title>
            <aug>
               <au>
                  <snm>Hersh</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Pentecost</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hickham</snm>
                  <fnm>DH</fnm>
               </au>
            </aug>
            <source>J Am Soc Inform Sci</source>
            <pubdate>1996</pubdate>
            <volume>47</volume>
            <fpage>50</fpage>
            <lpage>56</lpage>
         </bibl>
      </refgrp>
      <sec>
         <st>
            <p>Pre-publication history</p>
         </st>
         <p>The pre-publication history for this paper can be accessed here:</p>
         <p>
            <url>http://www.biomedcentral.com/1472-6947/4/21/prepub</url>
         </p>
      </sec>
   </bm>
</art>

