<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2288-6-2</ui>
   <ji>1471-2288</ji>
   <fm>
      <dochead>Study protocol</dochead>
      <bibl>
         <title>
            <p>Protocol of the COSMIN study: COnsensus-based Standards for the selection of health Measurement INstruments</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Mokkink</snm>
               <fnm>LB</fnm>
               <insr iid="I1"/>
               <email>w.mokkink@vumc.nl</email>
            </au>
            <au id="A2">
               <snm>Terwee</snm>
               <fnm>CB</fnm>
               <insr iid="I1"/>
               <email>cb.terwee@vumc.nl</email>
            </au>
            <au id="A3">
               <snm>Knol</snm>
               <fnm>DL</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>d.knol@vumc.nl</email>
            </au>
            <au id="A4">
               <snm>Stratford</snm>
               <fnm>PW</fnm>
               <insr iid="I3"/>
               <email>stratford@mcmaster.ca</email>
            </au>
            <au id="A5">
               <snm>Alonso</snm>
               <fnm>J</fnm>
               <insr iid="I4"/>
               <email>jalonso@imim.es</email>
            </au>
            <au id="A6">
               <snm>Patrick</snm>
               <fnm>DL</fnm>
               <insr iid="I5"/>
               <email>donald@u.washington.edu</email>
            </au>
            <au id="A7">
               <snm>Bouter</snm>
               <fnm>LM</fnm>
               <insr iid="I1"/>
               <email>lm.bouter@vumc.nl</email>
            </au>
            <au id="A8">
               <snm>de Vet</snm>
               <fnm>HCW</fnm>
               <insr iid="I1"/>
               <email>hcw.devet@vumc.nl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute for Research in Extramural Medicine (EMGO Institute; www.emgo.nl), VU University Medical Center (VUmc), Amsterdam, The Netherlands</p>
            </ins>
            <ins id="I2">
               <p>Department of Clinical Epidemiology and Biostatistics, VU University Medical Center (VUmc), Amsterdam, The Netherlands</p>
            </ins>
            <ins id="I3">
               <p>School of Rehabilitation Science and Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Canada</p>
            </ins>
            <ins id="I4">
               <p>Health Services Research Unit, Institute Municipal d'Investigacio Medica (IMIM-IMAS), Barcelona, Spain</p>
            </ins>
            <ins id="I5">
               <p>Department of Health Services, University of Washington, Seattle, USA</p>
            </ins>
         </insg>
         <source>BMC Medical Research Methodology</source>
         <issn>1471-2288</issn>
         <pubdate>2006</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>2</fpage>
         <url>http://www.biomedcentral.com/1471-2288/6/2</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16433905</pubid>
               <pubid idtype="doi">10.1186/1471-2288-6-2</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>07</day>
               <month>10</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>24</day>
               <month>1</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>24</day>
               <month>1</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Mokkink et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Choosing an adequate measurement instrument depends on the proposed use of the instrument, the concept to be measured, the measurement properties (e.g. internal consistency, reproducibility, content and construct validity, responsiveness, and interpretability), the requirements, the burden for subjects, and costs of the available instruments. As far as measurement properties are concerned, there are no sufficiently specific standards for the evaluation of measurement properties of instruments to measure health status, and also no explicit criteria for what constitutes good measurement properties. In this paper we describe the protocol for the COSMIN study, the objective of which is to develop a checklist that contains COnsensus-based Standards for the selection of health Measurement INstruments, including explicit criteria for satisfying these standards. We will focus on evaluative health related patient-reported outcomes (HR-PROs), i.e. patient-reported health measurement instruments used in a longitudinal design as an outcome measure, excluding health care related PROs, such as satisfaction with care or adherence. The COSMIN standards will be made available in the form of an easily applicable checklist.</p>
            </sec>
            <sec>
               <st>
                  <p>Method</p>
               </st>
               <p>An international Delphi study will be performed to reach consensus on which measurement properties should be assessed and how, and on criteria for good measurement properties. Two sources of input will be used for the Delphi study: (1) a systematic review of properties, standards and criteria of measurement properties found in systematic reviews of measurement instruments, and (2) an additional literature search of methodological articles presenting a comprehensive checklist of standards and criteria. The Delphi study will consist of four (written) Delphi rounds, with approximately 30 expert panel members with different backgrounds in clinical medicine, biostatistics, psychology, and epidemiology. The final checklist will subsequently be field-tested by assessing the inter-rater reproducibility of the checklist.</p>
            </sec>
            <sec>
               <st>
                  <p>Discussion</p>
               </st>
               <p>Since the study will mainly be anonymous, problems that are commonly encountered in face-to-face group meetings, such as the dominance of certain persons in the communication process, will be avoided. By performing a Delphi study and involving many experts, the likelihood that the checklist will have sufficient credibility to be accepted and implemented will increase.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="refman"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Choosing the appropriate health status measurement instrument for a specific purpose is a difficult and time-consuming task. The choice depends on the proposed use of the instrument, the concept to be measured, the readability of the questions, the requirements and costs associated with the use of the instrument, the burden on the subjects, and, last but not least, the measurement properties of the instruments. The measurement properties concern internal consistency, reproducibility, content and construct validity, responsiveness, and interpretability. Kirshner and Guyatt distinguished three kinds of health status measures, i.e. discriminative, predictive and evaluative measures <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Not all measurement properties are equally important for each purpose. For example, responsiveness is only important for evaluative measurement instruments.</p>
         <p>Although there is consensus that measurement instruments must have good measurement properties, only general guidelines are available, and there are no explicit and comprehensive criteria for what constitutes good measurement properties. Without clear standards the evidence-based selection of measurement instruments is strongly hampered.</p>
         <p>Several authors have suggested standards for the development and evaluation of instruments to measure health status. One of the most elaborate lists was proposed by the Scientific Advisory Committee (SAC) of the Medical Outcomes Trust <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. The SAC defined a set of eight key attributes of instruments to measure health status and (health-related) quality of life and standards with which these attributes should be assessed. Most of the standards concern information that authors should provide when reporting on a reproducibility study or a validation study, e.g. a clear description of the methods of data-collection and reporting of specific estimates and standard errors. In addition, they gave some standards, for example for assessing reliability, and some criteria for good measurement properties, such as cut-off points for ICCs. Another list of standards has been compiled by Bombardier and Tugwell, who developed a checklist to compare and evaluate the usefulness of instruments to measure functional status <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. They propose 12 rules, referring to 6 major issues: comprehensiveness, credibility, accuracy, sensitivity to change, biological sense, and feasibility. Andresen has also defined standards for assessing instruments to measure disability outcomes. Her standards for measurement properties include validity, reliability, and sensitivity to change, as well as statistical methods such as Rasch analysis for assessing the scaling properties <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. More guidelines for developing or evaluating measurement instruments are given, e.g. by Chassany et al. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, McDowell and Jenkinson <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, and by country specific organizations such as the American Psychological Association and the Dutch Professional Association of Psychologists.</p>
         <p>However, these standards often lack explicit criteria for what constitutes good measurement properties. For example, an intraclass correlation coefficient (ICC) has often been recommended as the most appropriate measure of reliability. There have been some suggestions about what constitutes a minimal ICC for good reliability, but it is unclear whether this concerns the point estimate or the lower limit of the confidence interval, and whether a minimal sample size is required. For the assessment of construct validity it is often recommended that explicit hypotheses about the expected results should be tested <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. However, there are no criteria for how many hypotheses should be defined, how specific these hypotheses should be, the extent to which these hypotheses should be confirmed for good validity, or the characteristics, representativeness and size of the sample for a validation study. This may lead to situations in which one is satisfied with the construct validity when instrument A correlates with a similar instrument B in a sample of 20 patients, but with no justification of the expected magnitude of the correlation, or the precision of its estimate. Another problem is that there is a lack of consensus with regard to the method of assessment for some measurement properties. For example, although there is consensus on the importance of responsiveness, there is no consensus on the best way to assess it <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>This has often resulted in a battery of change coefficients being applied to the same data. A possible explanation for this is that often investigators have not conceptualized and declared the anticipated change characteristic (i.e., whether patients in the sample are likely to undergo homogeneous or heterogeneous change) of the sample <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>With the rapid increase in the number of instruments that are being developed to measure health status there is also an increase in the publication of systematic reviews in which the measurement properties of these instruments are evaluated and compared. These systematic reviews are important tools for the evidence-based selection of instruments. These reviews focus for instance on outcome measures in specific patient groups <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, on instruments to measure (general or disease-specific) quality of life in specific patient groups <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>, on functional disability questionnaires for patients with upper extremity disorders <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp> and rheumatoid arthritis <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, and on instruments to assess co-morbidity <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. In most systematic reviews evidence concerning the measurement properties of the instruments is summarized, but only a few authors use explicit and comprehensive criteria to define good measurement properties. Without such explicit criteria, however, it is difficult to decide on the best instrument.</p>
         <p>We recently developed a checklist for the evaluation and comparison of the measurement properties of instruments to measure health status <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, which we used in two systematic reviews <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B30">30</abbr></abbrgrp>. This checklist was based on the SAC criteria <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> and the Bombardier and Tugwell method <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, supplemented with explicit criteria for good measurement properties, typically as defined within our research group <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. However, other researchers have used different criteria. There is still no consensus with regard to the best criteria.</p>
         <p>In this paper we describe the protocol for the COSMIN study. The aim of this study is to develop COnsensus-based Standards for the selection of health Measurement INstruments. We will first focus on a homogeneous set of measurement instruments, since it is not clear whether these standards and criteria can be applied to all kinds of instruments that measure health status. The initial focus of these standards will be on evaluative health related patient-reported outcomes (HR-PROs). We define evaluative instruments as those that are applied to measure HR-PROs in a longitudinal study to assess change over time. With this definition we exclude measurement instruments which are (1) only used as discriminative instruments, (2) only used for predictive purposes, such as diagnostic or screening instruments, or (3) only used as an independent (prognostic) variable, such as a determinant, confounder or effect modifier in a longitudinal study design. PROs include any endpoint derived from patient reports, whether collected in the clinic, in a diary, or by other means, including single-item outcome measures, event logs, symptom reports, formal instruments to measure health-related quality of life (HRQL), health status, adherence, and satisfaction with treatment <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. By restricting the study to health-related PROs, we exclude, for example, health care related PROs, such as satisfaction with care and adherence. If these standards and criteria seem applicable to HR-PROs, the next step will be to examine whether they can be applied to other measurement instruments, such as performance-based instruments or health care related PROs.</p>
         <p>Note that one and the same measurement instrument can be used for different purposes, such as discriminative, evaluative and predictive purposes. The COSMIN standards focus on the evaluative application of measurement instruments.</p>
         <p>The COSMIN standards will be made available in the form of an easily applicable checklist. The project consists of a preparatory phase (a systematic review and an additional literature search), an international Delphi procedure, and field-testing of the resulting checklist.</p>
         <sec>
            <st>
               <p>Aim of the study</p>
            </st>
            <p>The aim of the COSMIN study is to develop consensus-based standards for the assessment of evaluative HR-PROs, including explicit criteria for good measurement properties. To develop these standards, the following research questions will be addressed:</p>
            <p>1. Which measurement properties should be included in the assessment of evaluative HR-PROs, and how should they be defined?</p>
            <p>2. How should these measurement properties be assessed in terms of study design and statistical analysis (i.e. standards)?</p>
            <p>3. Which criteria should be applied to define what good measurement properties are?</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Method</p>
         </st>
         <sec>
            <st>
               <p>The Delphi procedure</p>
            </st>
            <p>The Delphi procedure is basically a series of sequential questionnaires or 'rounds', interspersed by controlled feedback, that seeks to achieve consensus of opinion among a panel of experts <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The Delphi procedure is a tool that can be used to generate a debate and to structure and organize a group communication process <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. It is not a method for creating new knowledge, but rather a process for making the best use of available information <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The first round consists of a questionnaire with a large item pool, to identify issues to be addressed in later rounds. An item pool is a set of items regarding all possible issues on a subject. In our study, for example, these issues concern all proposed measurement properties, and the standards and criteria with which to judge them. All panel members are asked to give their opinion about each item, and they may also add further items. The second and subsequent questionnaires are more specific and aim to converge opinions and to reach consensus.</p>
         </sec>
         <sec>
            <st>
               <p>Preparation for the Delphi procedure: a systematic review and an additional literature search</p>
            </st>
            <p>To prepare the questionnaires for the Delphi procedure, a systematic literature review will be performed to search for systematic reviews of evaluative health status measurement instruments that describe measurement properties, quality standards and criteria. To find as many standards and criteria as possible, we will not yet restrict the search to the narrow concept of HR-PROs. Health status measurement instruments include HR-PROs, performance based measures, clinical ratings, etc. We will search in PubMed, Embase and PsycINFO to find systematic reviews of health status measurement instruments. Articles will be included that meet the following inclusion criteria: (1) 'systematic review', (2) the purpose of the review is to systematically identify all available 'health status instruments' regarding a specific topic or a specific population, i.e. all questionnaires or performance based measures, etc. or a combination of any of these, (3) the health status instrument has to be applicable as an evaluative measure (i.e. not discriminative or predictive (i.e. diagnostic or screening)), and (4) the purpose of the review is to report on the clinimetric properties of the measurement instruments. Systematic reviews of diagnostic or screening instruments will be excluded. Two of the authors (LM and CT) will independently select the articles, based on abstracts and, if necessary, on full text.</p>
            <p>An additional literature search in PubMed, Embase and PsycINFO of methodological articles and textbooks presenting comprehensive checklists for standards and criteria will be carried out. The purpose of both the review and the additional literature search is to determine which measurement properties are reported in systematic reviews and in existing standards published in methodological papers and textbooks for the evaluation of health status measurement instruments, which measurement properties should be assessed and how (standards), and which criteria are used to define good measurement properties.</p>
         </sec>
         <sec>
            <st>
               <p>Design of the Delphi procedure of the COSMIN study</p>
            </st>
            <p>In the Delphi procedure of the COSMIN study, four Delphi rounds are planned, as outlined in Figure <figr fid="F1">1</figr>. Based on the results of the two searches described above, we will develop a pool of all measurement properties, standards and criteria for measurement properties that were found in the literature, and ask for the explicit opinions of the panel members on each of these issues.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The Delphi procedure of the COSMIN study</p>
               </caption>
               <text>
                  <p>The Delphi procedure of the COSMIN study.</p>
               </text>
               <graphic file="1471-2288-6-2-1"/>
            </fig>
            <p>The first round will focus on (1) agreement on which measurement properties of HR-PROs should be assessed, and how these measurement properties should be defined. Subsequent rounds will focus on (2) how the selected measurement properties should be assessed (standards), and on (3) defining criteria for good measurement properties.</p>
            <p>Each of these three subjects will be addressed in at least two consecutive rounds, in order to offer panel members the opportunity to reconsider and, if appropriate, to change their previous opinion in light of the anonymous responses and considerations of the other panel members. Each subsequent questionnaire will also contain a feedback report.</p>
            <p>In a Delphi procedure the panel members are carefully selected for their knowledge and interest in a specific field. The panel members will be selected by the Steering Committee (LM, CT, DK, PS, JA, DP, LB and HdV), based on the following inclusion criteria: they should be experts on the development and evaluation of health status measurement instruments, and should have credibility with the target audience, indicated by authorship of multiple frequently cited publications on (the methodology of) this subject in important journals, such as Quality of Life Research, the Journal of Clinical Epidemiology, Medical Care, and BMC Medical Research Methodology.</p>
            <p>Panel members with the following scientific backgrounds will be selected: clinical medicine, biostatistics, psychology (psychometrics), and epidemiology (clinimetrics). They will also be selected to represent important organizations, and to facilitate dissemination and implementation of the checklist, i.e. members of the International Society for Quality of Life (ISOQOL), the Mapi Research Institute, Cochrane PRO methods group, the Patient-Reported Outcomes Group and the European Research Group on Health Outcomes (ERGHO), and editors of important relevant journals, such as Quality of Life Research, and the Journal of Clinical Epidemiology.</p>
            <p>Approximately 30 panel members (6&#8211;7 per category of scientific background) will be considered appropriate <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Based on our previous experience <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>, we expect that approximately 70% of the invited experts will agree to participate, 80% of the participants will return the first questionnaire, and 65% of these will also return the second and subsequent questionnaires. Therefore, we will initially invite 80 experts to participate (20 per category of scientific background). Those who are invited will be asked to inform us whether or not they wish to participate. If fewer than 55 are willing to participate, more persons will be invited, until 55 have agreed to participate. If one of the categories of scientific background is under-represented, additional experts will be invited from that category. The identity of the panel members will be kept unknown to the other panel members until all rounds have been completed. Furthermore, the responses will be distributed anonymously among the panel members and the Steering Committee (except for the member of the Steering Committee who is responsible for the correspondence).</p>
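            <p>As a rough illustration of the recruitment arithmetic above, the following Python sketch multiplies the number of invited experts by the expected participation rates. The function name expected_panel and its parameter names are hypothetical and are not part of the protocol; the rates are the estimates quoted in the text, not observed data.</p>

```python
def expected_panel(invited, p_agree=0.70, p_first=0.80, p_later=0.65):
    """Return the expected head counts at each stage of recruitment.

    invited  -- number of experts invited
    p_agree  -- expected share of invited experts who agree to participate
    p_first  -- expected share of participants who return the first questionnaire
    p_later  -- expected share of first-round responders who also return
                the second and subsequent questionnaires
    The default rates are the estimates stated in the protocol text.
    """
    agree = invited * p_agree
    first = agree * p_first
    later = first * p_later
    return agree, first, later


if __name__ == "__main__":
    agree, first, later = expected_panel(80)
    print(f"agree to participate: {agree:.1f}")
    print(f"return first questionnaire: {first:.1f}")
    print(f"return later questionnaires: {later:.1f}")
```

            <p>With 80 invitations this gives roughly 56 experts agreeing to take part, about 45 returning the first questionnaire and about 29 completing the later rounds, which is close to the intended panel of approximately 30.</p>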
         </sec>
         <sec>
            <st>
               <p>Consensus</p>
            </st>
            <p>To reach consensus on each of the three issues outlined above, the same strategy will be carried out in each round. In the first questionnaire the panel members will be asked to rate how strongly they agree or disagree with including each measurement property in the checklist (e.g. to what extent do you agree that internal consistency should be included in the assessment of evaluative HR-PROs?), and what the most appropriate definition for each measurement property is. Ratings will be scored on a 5-point Likert scale (ranging from strongly agree to strongly disagree). The panel members will be asked to give considerations and arguments to support their opinion. They will also be given the opportunity to suggest alternative wordings, to suggest additional measurement properties, or to make any other comments.</p>
            <p>In the second questionnaire the aim is to reach consensus on the items included in the first questionnaire, i.e. to reach consensus on which relevant measurement properties should be included in the COSMIN checklist, and on their definition. An anonymous feedback report of the results of the first round will be distributed among the panel members with the second questionnaire. The results of Delphi round 1 will be presented both quantitatively (the distribution and mean (or median) scores on the 5-point Likert scales) and qualitatively (the suggestions and comments of the panel members concerning each measurement property and the definitions). The panel members will again be asked to give their opinion on each item. In principle, only measurement properties for which at least 67% of the panel members scored at least 3 points in the second round will be selected for inclusion, but the Steering Committee has the right to make alternative decisions after reviewing the responses. If different terminology is proposed by certain panel members, the Steering Committee will choose one term, and will provide a description of synonyms. The decisions made by the Steering Committee will be presented and justified in the feedback report between the Delphi rounds. The panel members will be given the opportunity to react to, and to approve or disapprove of, these decisions.</p>
            <p>This procedure will be repeated for each issue, i.e. which measurement properties should be assessed, how these properties should be assessed, and which criteria should be used to define what good measurement properties are.</p>
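            <p>The quantitative part of such a feedback report could be produced along the following lines. This is only a sketch under stated assumptions: the function name summarise_item is hypothetical, ratings are assumed to be coded as integers from 1 to 5, and the example ratings are invented purely for illustration.</p>

```python
from collections import Counter
from statistics import mean, median


def summarise_item(scores):
    """Summarise the Likert ratings given to one item by the panel.

    scores -- list of integer ratings coded 1 to 5 (assumed coding)
    Returns the number of ratings, the count per scale point,
    and the mean and median rating.
    """
    if not scores:
        raise ValueError("at least one rating is required")
    if any(s not in range(1, 6) for s in scores):
        raise ValueError("ratings must be integers from 1 to 5")
    counts = Counter(scores)
    # Report every scale point, including those nobody selected.
    distribution = {point: counts.get(point, 0) for point in range(1, 6)}
    return {
        "n": len(scores),
        "distribution": distribution,
        "mean": mean(scores),
        "median": median(scores),
    }


if __name__ == "__main__":
    # Invented ratings for a single item, for illustration only.
    print(summarise_item([5, 4, 4, 3, 2]))
```

            <p>The qualitative part of the report (comments and suggested wordings) would still have to be compiled by hand.</p>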
            <p>The Steering Committee will decide whether or not consensus was reached. In general, consensus will be defined as "a general agreement among a substantial majority (i.e. 67% had a score of at least 3 points on the 5-point Likert scale) of panel members". It is expected that it will be possible to reach consensus on which measurement properties should be assessed and how they should be assessed. However, we anticipate that it will be much more difficult to reach consensus on the criteria for good measurement properties, because this often depends on the situation in which the criteria will be applied. A possible outcome of our study will therefore be a checklist with consensus-based standards for the evaluation of the measurement properties of HR-PROs, but with no explicit criteria for good measurement properties for all the measurement properties.</p>
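            <p>The numerical part of this consensus definition can be sketched as a simple threshold check. The function name consensus_reached is hypothetical, and the sketch assumes that ratings are coded from 1 to 5 with higher scores indicating stronger agreement, which the protocol does not state explicitly. The Steering Committee's final judgement is not modelled.</p>

```python
def consensus_reached(scores, min_score=3, threshold=0.67):
    """Check the numerical consensus rule for one item.

    scores    -- list of integer Likert ratings (assumed coding: 1 to 5,
                 higher meaning stronger agreement)
    min_score -- lowest rating that counts towards agreement (3 in the text)
    threshold -- required share of panel members (67% in the text)
    Returns a pair: (rule satisfied?, share of ratings at or above min_score).
    """
    if not scores:
        raise ValueError("at least one rating is required")
    agreeing = sum(1 for s in scores if s >= min_score)
    share = agreeing / len(scores)
    return share >= threshold, share


if __name__ == "__main__":
    # Invented ratings from ten panel members, for illustration only.
    reached, share = consensus_reached([5, 4, 4, 3, 3, 2, 1, 5, 4, 3])
    print(reached, share)
```

            <p>In this invented example eight of the ten ratings are 3 or higher, so the 67% threshold is met.</p>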
         </sec>
         <sec>
            <st>
               <p>Field-testing</p>
            </st>
            <p>After the four Delphi rounds, a first version of the checklist and a user's manual will be prepared by the Steering Committee. This checklist will be tested in an inter-rater reproducibility study, in which a number of raters will be asked to judge a selection of validation studies of a variety of measurement instruments. Inter-rater variability will be determined on each item of the checklist.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The Delphi procedure is particularly suitable for the development of consensus-based standards for the evaluation of HR-PROs for several reasons:</p>
         <p>- A Delphi approach is especially useful for situations in which there is a lack of empirical evidence and decisive factors are rather subjective, and not knowledge based.</p>
         <p>- It is useful for situations in which strong differences of opinion exist, e.g. due to differences in expertise and scientific backgrounds, as is anticipated in this study.</p>
         <p>- The checklist can be developed in co-operation with many experts in the field of health status measurement from different scientific backgrounds and from different countries. By providing feedback from previous rounds, the Delphi technique provides the advantage of a group process of building on the work and expertise of all panel members.</p>
         <p>- The Delphi technique avoids problems that are commonly encountered in face-to-face group meetings, such as the dominance of certain persons in the communication process, and the geographical constraints and expenses of bringing together a group of experts. The panel members can be kept unaware of the identity and opinions of the other panel members, which allows them to express their personal views freely.</p>
         <p>- Checklists developed by individual experts or small research groups from one institute, such as our checklist <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, do not have sufficient credibility to be accepted and implemented. Only checklists developed in international collaborations, such as the OMERACT initiative <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, will have a fair chance of becoming widely used.</p>
         <sec>
            <st>
               <p>Final remark</p>
            </st>
            <p>The purpose of the COSMIN study is to develop consensus-based standards to assess the quality of evaluative HR-PROs, so that in the future this quality can be assessed more uniformly. The checklist (containing these standards) can be used by researchers, reviewers of journals or professionals for the development, evaluation and selection of measurement instruments, and for the planning and critical appraisal of validation studies. Our standards should contribute to the improvement of the quality of (validation studies of) HR-PROs.</p>
            <p>We expect the final version of the checklist to be ready in 2008/2009.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LM is the principal investigator of the study described in this article. CT, DK, LB and HdV developed the initial study protocol. All authors participated in the design and preparation of the study. LM wrote the first draft of the manuscript. All other authors commented on this draft and contributed to the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This study is financially supported by the Institute for Research in Extramural Medicine (EMGO Institute), VU University Medical Center, Amsterdam.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>A methodological framework for assessing health indices</p>
            </title>
            <aug>
               <au>
                  <snm>Kirshner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Guyatt</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Chronic Dis</source>
            <pubdate>1985</pubdate>
            <volume>38</volume>
            <fpage>27</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0021-9681(85)90005-0</pubid>
                  <pubid idtype="pmpid">3972947</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Assessing health status and quality-of-life instruments: attributes and review criteria</p>
            </title>
            <source>Qual Life Res</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <fpage>193</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1015291021312</pubid>
                  <pubid idtype="pmpid" link="fulltext">12074258</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evaluating quality-of-life and health status instruments: development of scientific review criteria</p>
            </title>
            <aug>
               <au>
                  <snm>Lohr</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Aaronson</snm>
                  <fnm>NK</fnm>
               </au>
               <au>
                  <snm>Alonso</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Burnam</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Patrick</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Perrin</snm>
                  <fnm>EB</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Clinical Therapeutics</source>
            <pubdate>1996</pubdate>
            <volume>18</volume>
            <fpage>979</fpage>
            <lpage>992</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0149-2918(96)80054-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">8930436</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Methodological considerations in functional assessment</p>
            </title>
            <aug>
               <au>
                  <snm>Bombardier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tugwell</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Rheumatol Suppl</source>
            <pubdate>1987</pubdate>
            <volume>14 Suppl 15</volume>
            <fpage>6</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3498841</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Health-related quality of life outcomes measures</p>
            </title>
            <aug>
               <au>
                  <snm>Andresen</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Meyers</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>Arch Phys Med Rehabil</source>
            <pubdate>2000</pubdate>
            <volume>81</volume>
            <fpage>S30</fpage>
            <lpage>S45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1053/apmr.2000.20621</pubid>
                  <pubid idtype="pmpid">11128902</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Patient-reported outcomes: the example of health-related quality of life - a European guidance document for the improved integration of health-related quality of life assessment in the drug regulatory process.</p>
            </title>
            <aug>
               <au>
                  <snm>Chassany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Sagnier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Marquis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Fullerton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Aaronson</snm>
                  <fnm>NK</fnm>
               </au>
               <au>
                  <cnm>for the European Regulatory Issues on Quality of Life Assessment Group</cnm>
               </au>
            </aug>
            <source>Drug Information Journal</source>
            <pubdate>2002</pubdate>
            <volume>36</volume>
            <fpage>209</fpage>
            <lpage>238</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Development standards for health measures</p>
            </title>
            <aug>
               <au>
                  <snm>McDowell</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jenkinson</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Health Serv Res Policy</source>
            <pubdate>1996</pubdate>
            <volume>1</volume>
            <fpage>238</fpage>
            <lpage>246</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10180877</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <aug>
               <au>
                  <snm>Streiner</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Norman</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Health measurement scales. A practical guide to their development and use</source>
            <publisher>Oxford University Press</publisher>
            <edition>second</edition>
            <pubdate>1995</pubdate>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Methods for assessing responsiveness: a critical review and recommendations</p>
            </title>
            <aug>
               <au>
                  <snm>Husted</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Farewell</snm>
                  <fnm>VT</fnm>
               </au>
               <au>
                  <snm>Gladman</snm>
                  <fnm>DD</fnm>
               </au>
            </aug>
            <source>J Clin Epidemiol</source>
            <pubdate>2000</pubdate>
            <volume>53</volume>
            <fpage>459</fpage>
            <lpage>468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0895-4356(99)00206-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">10812317</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>On assessing responsiveness of health-related quality of life instruments: guidelines for instrument evaluation</p>
            </title>
            <aug>
               <au>
                  <snm>Terwee</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Dekker</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Wiersinga</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Prummel</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Bossuyt</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Qual Life Res</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <fpage>349</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1023499322593</pubid>
                  <pubid idtype="pmpid" link="fulltext">12797708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Assessing sensitivity to change: choosing the appropriate change coefficient</p>
            </title>
            <aug>
               <au>
                  <snm>Stratford</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Riddle</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Health Qual Life Outcomes</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>23</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1084357</pubid>
                  <pubid idtype="pmpid" link="fulltext">15811176</pubid>
                  <pubid idtype="doi">10.1186/1477-7525-3-23</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A systematic review of the content and quality of wrist outcome instruments</p>
            </title>
            <aug>
               <au>
                  <snm>Bialocerkowski</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Grimmer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Bain</snm>
                  <fnm>GI</fnm>
               </au>
            </aug>
            <source>Int J Qual Health Care</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>149</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/intqhc/12.2.149</pubid>
                  <pubid idtype="pmpid" link="fulltext">10830672</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Reliable and valid self-report outcome measures in sexual (dys)function: a systematic review</p>
            </title>
            <aug>
               <au>
                  <snm>Daker-White</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Arch Sex Behav</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>197</fpage>
            <lpage>209</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1014743304566</pubid>
                  <pubid idtype="pmpid" link="fulltext">11974645</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Outcome measures for adult critical care: a systematic review</p>
            </title>
            <aug>
               <au>
                  <snm>Hayes</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Black</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Jenkinson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Rowan</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ridley</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Health Technol Assess</source>
            <pubdate>2000</pubdate>
            <volume>4</volume>
            <fpage>1</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11074394</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Outcome measures in palliative care for advanced cancer patients: a review</p>
            </title>
            <aug>
               <au>
                  <snm>Hearn</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Higginson</snm>
                  <fnm>IJ</fnm>
               </au>
            </aug>
            <source>J Public Health Med</source>
            <pubdate>1997</pubdate>
            <volume>19</volume>
            <fpage>193</fpage>
            <lpage>199</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9243435</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Reliability and validity of clinical outcome measurements of osteoarthritis of the hip and knee: a review of the literature</p>
            </title>
            <aug>
               <au>
                  <snm>Sun</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sturmer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gunther</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Clin Rheumatol</source>
            <pubdate>1997</pubdate>
            <volume>16</volume>
            <fpage>185</fpage>
            <lpage>198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02247849</pubid>
                  <pubid idtype="pmpid">9093802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Evaluation of measures used to assess quality of life after stroke</p>
            </title>
            <aug>
               <au>
                  <snm>Buck</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jacoby</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Massey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ford</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Stroke</source>
            <pubdate>2000</pubdate>
            <volume>31</volume>
            <fpage>2004</fpage>
            <lpage>2010</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10926971</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Quality of life instruments in studies of menorrhagia: a systematic review</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Foon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pattison</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bryan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>JK</fnm>
               </au>
            </aug>
            <source>Eur J Obstet Gynecol Reprod Biol</source>
            <pubdate>2002</pubdate>
            <volume>104</volume>
            <fpage>96</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0301-2115(02)00076-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">12206918</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Measuring quality of life in paediatric patients</p>
            </title>
            <aug>
               <au>
                  <snm>Connolly</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Pharmacoeconomics</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>605</fpage>
            <lpage>625</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2165/00019053-199916060-00002</pubid>
                  <pubid idtype="pmpid">10724790</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A comparative review of generic quality-of-life instruments</p>
            </title>
            <aug>
               <au>
                  <snm>Coons</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Keininger</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Hays</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Pharmacoeconomics</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>13</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2165/00019053-200017010-00002</pubid>
                  <pubid idtype="pmpid">10747763</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Psychometric properties of vision-related quality of life questionnaires: a systematic review</p>
            </title>
            <aug>
               <au>
                  <snm>De Boer</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Moll</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>De Vet</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Terwee</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Volker-Dieben</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Van Rens</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>Ophthalmic Physiol Opt</source>
            <pubdate>2004</pubdate>
            <volume>24</volume>
            <fpage>257</fpage>
            <lpage>273</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1475-1313.2004.00187.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15228503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The suitability of quality-of-life questionnaires for psoriasis research: a systematic literature review</p>
            </title>
            <aug>
               <au>
                  <snm>De Korte</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mombers</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Sprangers</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bos</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Arch Dermatol</source>
            <pubdate>2002</pubdate>
            <volume>138</volume>
            <fpage>1221</fpage>
            <lpage>1227</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/archderm.138.9.1221</pubid>
                  <pubid idtype="pmpid" link="fulltext">12224984</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A method to select an instrument for measurement of HR-QOL for cross-cultural adaptation applied to dermatology</p>
            </title>
            <aug>
               <au>
                  <snm>De Tiedra</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Mercadal</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Badia</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Mascaro</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lozano</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Pharmacoeconomics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>405</fpage>
            <lpage>422</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2165/00019053-199814040-00007</pubid>
                  <pubid idtype="pmpid">10344908</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Quality of life instruments for caregivers of patients with cancer: a review of their psychometric properties</p>
            </title>
            <aug>
               <au>
                  <snm>Edwards</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ung</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Cancer Nurs</source>
            <pubdate>2002</pubdate>
            <volume>25</volume>
            <fpage>342</fpage>
            <lpage>349</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00002820-200210000-00002</pubid>
                  <pubid idtype="pmpid" link="fulltext">12394561</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Spinal cord injury and quality of life measures: a review of instrument psychometric quality</p>
            </title>
            <aug>
               <au>
                  <snm>Hallin</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kreuter</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Spinal Cord</source>
            <pubdate>2000</pubdate>
            <volume>38</volume>
            <fpage>509</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.sc.3101054</pubid>
                  <pubid idtype="pmpid">11035471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Vision-specific instruments for the assessment of health-related quality of life and visual functioning: a literature review</p>
            </title>
            <aug>
               <au>
                  <snm>Margolis</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Coyne</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kennedy-Martin</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schein</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Revicki</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Pharmacoeconomics</source>
            <pubdate>2002</pubdate>
            <volume>20</volume>
            <fpage>791</fpage>
            <lpage>812</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2165/00019053-200220120-00001</pubid>
                  <pubid idtype="pmpid">12236802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Health related quality of life in Parkinson's disease: a systematic review of disease specific instruments</p>
            </title>
            <aug>
               <au>
                  <snm>Marinus</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ramaker</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Van Hilten</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Stiggelbout</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>J Neurol Neurosurg Psychiatry</source>
            <pubdate>2002</pubdate>
            <volume>72</volume>
            <fpage>241</fpage>
            <lpage>248</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/jnnp.72.2.241</pubid>
                  <pubid idtype="pmpid" link="fulltext">11796776</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Instruments for quality of life assessment in patients with inflammatory bowel disease</p>
            </title>
            <aug>
               <au>
                  <snm>Pallis</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Mouzas</snm>
                  <fnm>IA</fnm>
               </au>
            </aug>
            <source>Dig Liver Dis</source>
            <pubdate>2000</pubdate>
            <volume>32</volume>
            <fpage>682</fpage>
            <lpage>688</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1590-8658(00)80330-8</pubid>
                  <pubid idtype="pmpid">11142577</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A structured review of quality of life instruments for head and neck cancer patients</p>
            </title>
            <aug>
               <au>
                  <snm>Ringash</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bezjak</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Head Neck</source>
            <pubdate>2001</pubdate>
            <volume>23</volume>
            <fpage>201</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/1097-0347(200103)23:3&lt;201::AID-HED1019&gt;3.0.CO;2-M</pubid>
                  <pubid idtype="pmpid" link="fulltext">11428450</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Clinimetric evaluation of shoulder disability questionnaires: a systematic review of the literature</p>
            </title>
            <aug>
               <au>
                  <snm>Bot</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Terwee</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Van der Windt</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bouter</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Dekker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>De Vet</snm>
                  <fnm>HC</fnm>
               </au>
            </aug>
            <source>Ann Rheum Dis</source>
            <pubdate>2004</pubdate>
            <volume>63</volume>
            <fpage>335</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/ard.2003.007724</pubid>
                  <pubid idtype="pmpid" link="fulltext">15020324</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>A review of self-report scales for the assessment of functional limitation and disability of the shoulder</p>
            </title>
            <aug>
               <au>
                  <snm>Michener</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Leggin</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>J Hand Ther</source>
            <pubdate>2001</pubdate>
            <volume>14</volume>
            <fpage>68</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11382257</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A review of functional status measures for workers with upper extremity disorders</p>
            </title>
            <aug>
               <au>
                  <snm>Salerno</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Copley-Merriman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>TN</fnm>
               </au>
               <au>
                  <snm>Shinogle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schulz</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Occup Environ Med</source>
            <pubdate>2002</pubdate>
            <volume>59</volume>
            <fpage>664</fpage>
            <lpage>670</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/oem.59.10.664</pubid>
                  <pubid idtype="pmpid" link="fulltext">12356925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Which are the best instruments for measuring disabilities in gait and gait-related activities in patients with rheumatic disorders</p>
            </title>
            <aug>
               <au>
                  <snm>Swinkels</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Oostendorp</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Bouter</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Clin Exp Rheumatol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <fpage>25</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15005000</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>How to measure comorbidity: a critical review of available methods</p>
            </title>
            <aug>
               <au>
                  <snm>De Groot</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Beckerman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lankhorst</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Bouter</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>J Clin Epidemiol</source>
            <pubdate>2003</pubdate>
            <volume>56</volume>
            <fpage>221</fpage>
            <lpage>229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0895-4356(02)00585-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12725876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Psychometric evaluation of self-report questionnaires - the development of a checklist</p>
            </title>
            <aug>
               <au>
                  <snm>Bot</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Terwee</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Van der Windt</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bouter</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Dekker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>De Vet</snm>
                  <fnm>HC</fnm>
               </au>
            </aug>
            <source>Second workshop on research methodology</source>
            <publisher>Amsterdam: VU University, June 25-27</publisher>
            <editor>Ader HJ and Mellenbergh GJ</editor>
            <pubdate>2003</pubdate>
            <fpage>161</fpage>
            <lpage>168</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Measuring treatment impact: a review of patient-reported outcomes and other efficacy endpoints in approved product labels</p>
            </title>
            <aug>
               <au>
                  <snm>Willke</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Erickson</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Control Clin Trials</source>
            <pubdate>2004</pubdate>
            <volume>25</volume>
            <fpage>535</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cct.2004.09.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">15588741</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The Delphi technique: myths and realities</p>
            </title>
            <aug>
               <au>
                  <snm>Powell</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Adv Nurs</source>
            <pubdate>2003</pubdate>
            <volume>41</volume>
            <fpage>376</fpage>
            <lpage>382</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2648.2003.02537.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12581103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <aug>
               <au>
                  <snm>Moore</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Group techniques for idea building. Applied Social Research Methods Series.</source>
            <publisher>Newbury Park: Sage Publications, Inc.</publisher>
            <pubdate>1987</pubdate>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Criteria list for assessment of methodological quality of economic evaluations: Consensus on Health Economic Criteria</p>
            </title>
            <aug>
               <au>
                  <snm>Evers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goossens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>De Vet</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Van Tulder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ament</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Int J Technol Assess Health Care</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>240</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15921065</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The Delphi list: a criteria list for quality assessment of randomized clinical trials for conducting systematic reviews developed by Delphi consensus</p>
            </title>
            <aug>
               <au>
                  <snm>Verhagen</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>De Vet</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>De Bie</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Kessels</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Boers</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bouter</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Knipschild</snm>
                  <fnm>PG</fnm>
               </au>
            </aug>
            <source>J Clin Epidemiol</source>
            <pubdate>1998</pubdate>
            <volume>51</volume>
            <fpage>1235</fpage>
            <lpage>1241</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0895-4356(98)00131-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">10086815</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A method for achieving consensus on rheumatoid arthritis outcome measures: the OMERACT conference process</p>
            </title>
            <aug>
               <au>
                  <snm>Fried</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Boers</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>J Rheumatol</source>
            <pubdate>1993</pubdate>
            <volume>20</volume>
            <fpage>548</fpage>
            <lpage>551</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8478870</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
      <sec>
         <st>
            <p>Pre-publication history</p>
         </st>
         <p>The pre-publication history for this paper can be accessed here:</p>
         <p>
            <url>http://www.biomedcentral.com/1471-2288/6/2/prepub</url>
         </p>
      </sec>
   </bm>
</art>
