Can electronic search engines optimize screening of search results in systematic reviews: an empirical study
1 Chalmers Research Group, Children's Hospital of Eastern Ontario Research Institute, Ottawa, Canada
2 Department of Pediatrics, Faculty of Medicine, University of Ottawa, Ottawa, Canada
3 School of Mathematics and Statistics, Carleton University, Ottawa, Canada
4 Canadian Coordinating Office for Health Technology Assessment, Ottawa, Canada
5 Departments of Pediatrics and of Epidemiology and Biostatistics, McGill University, Montreal, QC, Canada
6 Department of Pediatrics, University of Alberta, Edmonton, Canada
7 Natural Sciences Library, University of Saskatchewan, Saskatoon, Canada
BMC Medical Research Methodology 2006, 6:7 doi:10.1186/1471-2288-6-7
Published: 24 February 2006
Most electronic search efforts directed at identifying primary studies for inclusion in systematic reviews rely on the optimal Boolean search features of search interfaces such as DIALOG® and Ovid™. Our objective is to test the ability of an Ultraseek® search engine to rank MEDLINE® records of the included studies of Cochrane reviews within the top half of all the records retrieved by the Boolean MEDLINE search used by the reviewers.
Collections were created using the MEDLINE bibliographic records of included and excluded studies listed in the review and all records retrieved by the MEDLINE search. Records were converted to individual HTML files. Collections of records were indexed and searched through a statistical search engine, Ultraseek, using review-specific search terms. Systematic reviews published in the Cochrane Library served as our data sources. Reviews were included if they reported using at least one phase of the Cochrane Highly Sensitive Search Strategy (HSSS), provided citations for both included and excluded studies, and conducted a meta-analysis using a binary outcome measure. Reviews were selected if they yielded between 1000 and 6000 records when the MEDLINE search strategy was replicated.
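The conversion of records to individual HTML files could be sketched roughly as follows. This is only an illustration, not the procedure used in the study: the record fields (`pmid`, `title`, `abstract`), the page layout and the function names are all assumptions, since the abstract does not describe the file format.

```python
import html
from pathlib import Path


def record_to_html(record: dict) -> str:
    """Render one bibliographic record as a minimal HTML page.

    The keys "title" and "abstract" are hypothetical field names.
    """
    # Escape the text so characters such as & and < do not break the markup.
    title = html.escape(record.get("title", ""))
    abstract = html.escape(record.get("abstract", ""))
    return (
        "<html><head><title>" + title + "</title></head>"
        "<body><h1>" + title + "</h1><p>" + abstract + "</p></body></html>"
    )


def write_collection(records: list, out_dir: str) -> list:
    """Write one HTML file per record, named after its (assumed) "pmid" key."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for rec in records:
        path = out / (str(rec["pmid"]) + ".html")
        path.write_text(record_to_html(rec), encoding="utf-8")
        paths.append(path)
    return paths
```

A directory produced this way holds one file per record, which a search engine can then index and rank as a collection.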
Nine Cochrane reviews were included. Included studies within the Cochrane reviews were found within the first 500 retrieved studies more often than would be expected by chance. Across all reviews, recall of included studies into the top 500 was 0.70. There was no statistically significant difference in ranking when comparing included studies with just the subset of excluded studies listed as excluded in the published review.
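The recall figure reported above can be illustrated with a short sketch. It assumes that recall into the top 500 means the fraction of a review's included studies whose records appear among the first 500 ranked records, and that "chance" corresponds to a uniformly random ordering of the retrieved set. The function names and inputs are hypothetical and are not taken from the study.

```python
def recall_at_k(ranked_ids, included_ids, k=500):
    """Fraction of included studies found among the first k ranked records."""
    included = set(included_ids)
    if not included:
        raise ValueError("included_ids must not be empty")
    top_k = set(ranked_ids[:k])
    return len(included & top_k) / len(included)


def chance_recall(n_records, k=500):
    """Expected recall at k if the n_records were ordered uniformly at random."""
    if n_records <= 0:
        raise ValueError("n_records must be positive")
    return min(k, n_records) / n_records
```

Under the random-ordering assumption, a collection of 1000 records gives an expected recall of 0.5 at a cut-off of 500, and a collection of 6000 records gives about 0.083, so the chance baseline depends strongly on the size of each review's retrieved set.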
The relevance ranking provided by the search engine was better than expected by chance and shows promise for the preliminary evaluation of large result sets from Boolean searches. A statistical search engine does not appear to be able to make fine discriminations concerning the relevance of bibliographic records that have been pre-screened by systematic reviewers.