<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1477-7525-5-14</ui>
   <ji>1477-7525</ji>
   <fm>
      <dochead>Review</dochead>
      <bibl>
         <title>
            <p>Theoretical framework and methodological development of common subjective health outcome measures in osteoarthritis: a critical review</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Pollard</snm>
               <fnm>Beth</fnm>
               <insr iid="I1"/>
               <email>beth.pollard@abdn.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Johnston</snm>
               <fnm>Marie</fnm>
               <insr iid="I1"/>
               <email>m.johnston@abdn.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Dixon</snm>
               <fnm>Diane</fnm>
               <insr iid="I2"/>
               <email>diane.dixon@stir.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Psychology, University of Aberdeen, Aberdeen, AB24 2UB, UK</p>
            </ins>
            <ins id="I2">
               <p>Department of Psychology, University of Stirling, Stirling, FK9 4LA, UK</p>
            </ins>
         </insg>
         <source>Health and Quality of Life Outcomes</source>
         <issn>1477-7525</issn>
         <pubdate>2007</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>14</fpage>
         <url>http://www.hqlo.com/content/5/1/14</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17343739</pubid>
               <pubid idtype="doi">10.1186/1477-7525-5-14</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>07</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>07</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Pollard et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Subjective measures involving clinician ratings or patient self-assessments have become recognised as an important tool for the assessment of health outcome. The value of a health outcome measure is usually assessed by a psychometric evaluation of its reliability, validity and responsiveness. However, psychometric testing involves an accumulation of evidence and has recognised limitations. It has been suggested that an evaluation of how well a measure has been developed would be a useful additional criteria in assessing the value of a measure. This paper explored the theoretical background and methodological development of subjective health status measures commonly used in osteoarthritis research. Fourteen subjective health outcome measures commonly used in osteoarthritis research were examined. Each measure was explored on the basis of their i) theoretical framework (was there a definition of what was being assessed and was it part of a theoretical model?) and ii) methodological development (what was the scaling strategy, how were the items generated and reduced, what was the response format and what was the scoring method?). Only the AIMS, SF-36 and WHOQOL defined what they were assessing (i.e. the construct of interest) and no measure assessed was part of a theoretical model. None of the clinician report measures appeared to have implemented a scaling procedure or described the rationale for the items selected or scoring system. Of the patient self-report measures, the AIMS, MPQ, OXFORD, SF-36, WHOQOL and WOMAC appeared to follow a standard psychometric scaling method. The DRP and EuroQol used alternative scaling methods. The review highlighted the general lack of theoretical framework for both clinician report and patient self-report measures. This review also drew attention to the wide variation in the methodological development of commonly used measures in OA. While, in general the patient self-report measures had good methodological development, the clinician report measures appeared less well developed. It would be of value if new measures defined the construct of interest and, that the construct, be part of theoretical model. By ensuring measures are both theoretically and empirically valid then improvements in subjective health outcome measures should be possible.</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Review</p>
         </st>
         <p>There has been a huge increase in the use and development of subjective health outcome measures <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Consequently, it is increasingly important to ensure that the measures are assessing what they intend to measure, as accurately as possible. If measures do not adequately sample the specified outcomes, or they are not accurate, then any conclusions drawn about the effectiveness of, for example, a new treatment may be misleading.</p>
         <p>The standard approach to assessing the 'value' of a health outcome measure is to be satisfied that a measure has adequate psychometric properties in terms of reliability, validity, and responsiveness <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. However, there are many known limitations with the most commonly reported methods of psychometric testing. For example, Cronbach's alpha <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> is widely used to evaluate internal reliability, but there is often an over-emphasis on achieving a high alpha. Selecting items for a measure based on alpha may result in almost identical items or might exclude important items, and only tap a narrow part of the underlying construct. In addition, alpha can be increased by simply increasing the number of items <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr></abbrgrp>. Further, the validity of a measure is often explored by correlating it with a similar existing measure. There is concern about whether the 'similar' measures are actually similar or not. A facet of this problem is known as the 'jingle-jangle fallacies': the jingle fallacy being that just because things are called the same name it does not mean that they are the same thing; the jangle refers to the issue that because things are called different things it does not necessarily mean they are different <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. These problems are illustrated in a systematic review that found that only 16% of the identified impairment measures for rheumatic disorders were validated against a similar construct <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Another common problem is that claims of validity are made if a significant correlation coefficient is achieved without any reference to acceptable levels <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Finally, reliability and validity can never be proved. A single study can only provide support towards establishing reliability or validity as there needs to be an accumulation of ongoing and evolving evidence <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>Due to the limitations of psychometric testing, other considerations may add to the assessment of the 'value' of a measure. The Scientific Advisory Committee of the Medical Outcomes Trust, 2002 <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> have suggested that the rationale for, and description of, the conceptual and measurement model of health status measures should be reported. Such theoretical and methodological criteria have generally been overlooked when evaluating health outcome measures. It is suggested therefore these criteria could be the starting point for evaluating measures <it>before </it>time, and probably costs, are involved in psychometrically evaluating the measure. Thus, an evaluation of how well a measure has been developed would appear to be a useful additional criteria in assessing the 'value' of a measure. Therefore this review explores how well measures have been developed in terms of i) theoretical framework and ii) methodological development.</p>
         <sec>
            <st>
               <p>i) The theoretical framework</p>
            </st>
            <p>It is advantageous if a measure <it>defines </it>what it is supposed to be assessing (i.e. the construct of interest). For example, if we consider a measure that states it is measuring disability as a health outcome, there are many different interpretations of a what 'disability' encompasses. Disability may mean to some, limitations in physical function, but to others, it may represent a broader measure encompassing the social impact of a condition. Hence, a definition of the intended focus of a measure enhances compatibility, comparisons and understanding between studies.</p>
            <p>Measures that are developed within a <it>theoretical framework </it>or model have the advantage of allowing underlying processes to be investigated, and interventions appropriately targeted. The dominant theoretical models of health outcomes or the consequence of disease have been the biomedical models developed by the World Health Organisation <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. The most recent version is the International Classification of Functioning, Disability, and Health that identifies three distinct outcomes, impairment, activity limitations and participation restrictions <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Using this model, we may find that analgesics influence all three outcomes, whereas modifying the structure of the home might only alter activity limitations. Failure to adequately measure each distinguishable outcome might result in failure to detect benefit or harm occurring due to an intervention or to a disease. Further, with distinguishable outcomes, it is possible to postulate relationships between them, e.g. in the analgesic example, pain relief might affect impairment with consequent reductions in activity limitations. In this review, considerations are given to whether the underlying construct has been defined and whether the construct is part of a theoretical model.</p>
         </sec>
         <sec>
            <st>
               <p>ii) The methodological development</p>
            </st>
            <p>The use of a standard <it>scaling </it>procedure (i.e. the method of attributing numerical values to responses) is advantageous as it prescribes a standard, theoretically sound method for developing and scoring measures. Standard scaling methods usually start by collecting a large number of items, and then use defined methods to reduce the number of items, attach a response format, and score the final scale. The most common standard scaling techniques in health status measures have been derived from the scaling of attitudes &#8211; Likert <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, Guttman <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and Thurstone scaling <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. These methods ensure that the scoring, scaling, and the response format for items will be consistent. For example, if a Likert scaling technique is used then all items will conform to a Likert scale with Likert response formats (5 point with agree to disagree response stems) and use an additive scoring method, whereas Guttman scaling requires a binary response format, and the score reflects the 'highest' item endorsed. However, if only some aspects of the scaling method are followed, it is possible that problems with the scale will arise. For example, it has been shown that problems with a 'gold standard' measure, the Sickness Impact Profile, were due to an inconsistency between the scoring method (additive) and the scaling method (Thurstone scaling) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>; as a result, an individual with small limitations could have a higher score than someone who was completely incapacitated.</p>
            <p>If a standard method is not implemented, it is preferable if the method for <it>selecting items </it>is broad enough to sample the full range and not restricted to just one source or domain. For example, in a thorough selection process items, may have been derived from previous measures, research literature, expert judges, patients, and healthy individuals. The resultant pool may then be reduced by going through a systematic sorting or <it>item reduction </it>process. The resultant items may then be explored empirically through item analysis, enabling poor items to be identified and eliminated from the final measure.</p>
            <p>Therefore in this review, considerations are given to the scaling strategy, item generation and reduction, scaling, response format, and scoring method of each of the measures. Additionally, the explanations given for the rationale for the response categories and scoring method are reviewed.</p>
            <p>In summary, the aim of this review is to explore the theoretical framework and methodological development of common subjective health outcome measures using the criteria specified in Table <tblr tid="T1">1</tblr>. The context of osteoarthritis has been chosen as the focus of this review.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Criteria used to assess the theoretical framework and methodological development of health outcome measures</p>
               </caption>
               <tblbdy cols="1">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Theoretical framework</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1. What construct is being measured?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2. Has the construct been defined?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3. Was the construct part of a (specified) theoretical model?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Methodological development</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4. What scaling strategy was adopted?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5. How were the items generated (to tap the construct)?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6. How was item reduction conducted?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>7. What was the response format?</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>8. What was the scoring method?</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Measures</p>
            </st>
            <p>The measures selected were commonly used to assess subjective health outcome in hip or knee osteoarthritis (OA). The measures were identified as part of a review of interventions used for the treatment of OA <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. In addition, citation-based searches (using Web of Science) for other subjective health outcome measures were undertaken to identify any very widely used measures not already selected. Nine hundred and forty abstracts were examined and all named measures noted. Any measure with 10 or more citations was included in this review.</p>
            <p>This resulted in the addition of two measures: the Hospital for Special Surgery knee score (HSS) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and the Merle d'Aubigne Hip Rating <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. An in depth theoretical review of one of the measures, the Sickness Impact Profile, <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> has already been carried out <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, and so was not included here. This resulted in 10 disease-specific measures (clinician report or patient self-report) and 4 generic measures (all patient self-report). The measures are specified in Table <tblr tid="T2">2</tblr>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Outcome instruments assessed in this study</p>
               </caption>
               <tblbdy cols="1">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Generic</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Patient self-report: EuroQol [21]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Medical Outcomes Study Short Form-36 (SF-36) [22-25]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>McGill Pain Questionnaire (MPQ) [26-28]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>World Health Organisation Quality of life Assessment (WHOQOL) [29,30]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Disease Specific &#8211; Clinician report</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>American Knee Society Score (AKS) [31]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Harris Hip Score [32]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Hospital for Special Surgery Knee Score (HSS) [18]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lequesne Hip and Knee Indices [33]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Merle d'Aubigne Hip Rating [19]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Disease Specific &#8211; Patient self-report</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arthritis Impact Measurement Scale (AIMS) [34,35]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Disease Repercussion Profile (DRP) [36-38]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Health Assessment Questionnaire- Disability Index (HAQ-DI) [39-42]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Oxford Hip and Knee Questionnaires [43,44]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Western Ontario and MacMaster Universities Osteoarthritis Index (WOMAC) [45-48]</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Analysis</p>
            </st>
            <p>A literature search was conducted for published papers relating to the development of each measure and they were examined (a complete search may not have been carried out where papers were published prior to electronic database searches limits, where papers were unavailable in English, or where the paper could not be traced). The focus of this review was on the original measure rather than modified versions (e.g. short forms).</p>
            <p>The information extracted from the literature for this review was:</p>
            <p>a) For the basic description of measures: the number of items and item content areas.</p>
            <p>b) For the theoretical framework: was the underlying construct defined and was the construct part of a theoretical model?</p>
            <p>c) For the methodological development: what was the scaling strategy, how were the items generated and reduced, what was the response format and what was the scoring method?</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>A summary of the basic measure information is in 'Additional file <supplr sid="S1">1</supplr>' and a summary of the review is in 'Additional file <supplr sid="S2">2</supplr>'.</p>
         <suppl id="S1">
            <title>
               <p>Additional file 1</p>
            </title>
            <text>
               <p>Summary information on each measure. Table of summary information on each measure (number of items and item content areas)</p>
            </text>
            <file name="1477-7525-5-14-S1.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <text>
               <p>Summary of the theoretical review. Summary table of the review</p>
            </text>
            <file name="1477-7525-5-14-S2.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <sec>
            <st>
               <p>i) Theoretical framework</p>
            </st>
            <p>The clinician report measures stated what the measure was about but none defined what it was supposed to be assessing. These measures also lacked an underlying theoretical framework. The American Knee Society Score (derived to measure knee and patient function), Harris Hip Score (pain and functional capacity), Hospital for Special Surgery Knee Score (disability), Lequesne Hip and Knee Indices (an indices of severity of disease), Merle d'Aubigne Hip Rating (function of the hip) are all measures which, while of value clinically, did not have a well defined construct, nor were they derived from a strong theoretical framework.</p>
            <p>Some self-report measures were based on conceptual frameworks proposed by the author(s) of the measure. The McGill Pain Questionnaire (MPQ) was based on a Melzack's theory of pain <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. This review focuses on the Pain Rating Index (PRI) and the present pain intensity (PPI) item of the McGill Pain Questionnaire. The Health Assessment Questionnaire (HAQ) was based on a hierarchical model of death, disability, discomfort, drug toxicity and dollar cost <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. This most commonly used part of the HAQ, the Disability Index (HAQ-DI) is focussed on in this review. Much consideration was given to the conceptual meaning of handicap in the process of developing the Disease Repercussion Profile. The Disease Repercussion Profile measures individualised patient-perceived handicap in a broader manner than the WHO defined dimensions of handicap <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Other measures were based on an existing defined construct. The SF-36 was derived to measure health status based on the identification and definition of five generic health concepts <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> plus two other concepts identified from empirical evidence <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The Arthritis Impact Measurement Scale was developed to reflect the WHO definition of health <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, and the WHOQOL from the definition of quality of life devised by the WHOQOL group <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
            <p>Other measures stated the construct measured but without explicit definition. The EuroQol was developed as a standardised non-disease specific measure for describing and valuing health-related quality of life <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The dimensions were selected primarily from existing health status measures. The WOMAC was based on the objective of defining the dimensionality of pain and disability, with five dimensions being initially identified <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The final version had three subscales of pain, stiffness, and physical function <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. The underlying aim of the Oxford Hip and Knee Questionnaires was to measure "patients' perception of a single disease entity" <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>.</p>
            <p>Thus although three measures defined the construct of interest, <it>no </it>measure was based on both a defined construct and a theoretical framework.</p>
         </sec>
         <sec>
            <st>
               <p>ii) Methodological development</p>
            </st>
            <sec>
               <st>
                  <p>Scaling strategy</p>
               </st>
               <p>Six of the fourteen measures appeared to use standard psychometric scaling methods. The stated scaling methodology of the SF-36, WOMAC and WHOQOL was Likert scaling. The WOMAC could, alternatively, be implemented using a 0&#8211;100 mm visual analogue scale for each item, with descriptive anchors of none and extreme. A numeric rating scale version of the WOMAC has also been developed, with response categories between 0 (none) and 10 (extreme) <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. While the authors of the Oxford Hip and Knee Questionnaires did not state that Likert scaling was used, the resultant questionnaire had the appearance of a Likert-type scale. Two scaling methods were used for the Arthritis Impact Measurement Scale: first, items were grouped into subscales and each subscale was examined using Guttman scaling procedures, and then Likert scaling was used to form an additive scale for each subscale. Thurstone's Categorical Judgement model <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> was used to obtain weightings of pain intensity for each descriptor of pain in the McGill Pain Questionnaire-PRI. This procedure results in an interval scale. The McGill Pain Questionnaire-PPI was a single item with five response categories that were considered equally far apart as to represent an interval scale.</p>
               <p>An econometric scaling method was used for the development of the EuroQol. This method involved subjects rating health states (from combining different levels from each item) and results in values being attached to each health state. The Disease Repercussion Profile used a combination of open questions and 10-point graphical rating scales to create a graphical profile score. The HAQ-DI did not appear to have been developed using a standard scaling technique.</p>
               <p>None of the clinician report measures appeared to have been developed using a standard scaling technique nor did they explain their scaling strategy.</p>
            </sec>
            <sec>
               <st>
                  <p>Item generation technique</p>
               </st>
               <p>A range of techniques was used to generate the items within a measure. There was no information on the item selection techniques for the Harris Hip Score, Hospital for Special Surgery Knee Score, Lequesne Hip and Knee Indices and Merle d'Aubigne Hip Rating. The items for the American Knee Society Score were generated by consensus by members of the American Knee Society. Some measures were based on items from existing instruments (Arthritis Impact Measurement Scale, EuroQol, HAQ-DI, SF-36). Some items were selected from literature, e.g. McGill Pain Questionnaire. Others started by gathering items from patients, e.g. Oxford Hip and Knee Questionnaires, WOMAC and Disease Repercussion Profile. Some measures took a comprehensive approach and used all these techniques and additional ones (e.g. extensive focus groups and question writing panels were additionally used for the WHOQOL). In summary, the method of item generation for the patient self-report measures was generally comprehensive, with most measures using appropriate methods to generate a pool of items that cover the domain of interest. In contrast, there was little information about the choice of items in the clinician report measures.</p>
            </sec>
            <sec>
               <st>
                  <p>Item reduction</p>
               </st>
               <p>The Arthritis Impact Measurement Scale, McGill Pain Questionnaire, WHOQOL and WOMAC used psychometric methods of item reduction to reduce the number of items. The SF-36 used specific methods to construct short-form measures from the 'parent' longer Medical Outcomes Study measure <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B52">52</abbr></abbrgrp>. The method details were not found; however, if the methods were similar to those for the SF-20 <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> then it would imply comprehensive testing where item-scale correlations, reliability and validity were examined. Subsequently, the Likert scaling assumptions of the SF-36, were explored with all scales passing tests for item-internal consistency, item-discrimination, and internal consistency of each scale score <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The main item reduction for the HAQ-DI was carried out by correlational analyses that identified redundant items <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The methods of item reduction for the Oxford Hip and Knee Questionnaires and EuroQol were not explained in detail in the published literature. The item reduction procedures were described in detail for the measures where a stated psychometric scaling strategy was followed, illustrating the advantage of using a psychometric scaling method with an explicit predefined methodology.</p>
            </sec>
            <sec>
               <st>
                  <p>Response formats</p>
               </st>
               <p>The Disease Repercussion Profile used open questions for each domain, with severity being rated on a ten point graphical rating scale. For the McGill Pain Questionnaire-PRI, the respondents select from each of the 20 categories, the individual descriptive words that best represent their pain. If none of the words in a category apply then the respondent leaves the category out. For the present pain intensity item, the respondent selects one of five response categories.</p>
               <p>All the other twelve measures had ordered response categories with the Arthritis Impact Measurement Scale &amp; the EuroQol additionally including a visual analogue scale. Six of these twelve measures had items with different numbers of response categories (American Knee Society Score, Lequesne Hip and Knee Indices, Hospital for Special Surgery Knee Score, Harris Hip Score, SF-36 &amp; the Arthritis Impact Measurement Scale with between 1 and 6 response categories depending on the measure and item). However, the number of response categories was only discussed for the SF-36 and then only for some items <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The other six measures had the same number of response categories for all the items throughout the measure (EuroQol, HAQ-DI, Merle D'Aubigne Hip Rating, Oxford Hip and Knee Questionnaires, WHOQOL, WOMAC). Of these, only the WOMAC and HAQ-DI had the same response continuum (i.e. same wording) for all the items. The HAQ-DI response formats were based on the American Rheumatism Association (ARA) functional classes.</p>
               <p>Therefore most of the measures used ordinal (ordered) response formats but there was little consistency of the response format and response continuum within measures. There is much discussion on the problems in performing arithmetic operations and statistical analysis on ordinal scales, mainly due to the unknown interval between categories <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. The PRI index of the McGill Pain Questionnaire was the only measure on an interval scale and therefore was without these problems. Likert scales are ordinal, although there is much debate as to whether they can be assumed to be interval (i.e., with equal intervals between responses <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>). The response format for the Likert-type measures (SF-36, WOMAC, WHOQOL, Arthritis Impact Measurement Scale, Oxford Hip and Knee Questionnaires) were not true Likert scales as the response continuum was not 'agree' to 'disagree'. This may have an impact on the resultant scale as any changes in the response categories, e.g., changing the usual agree-disagree to favourable-unfavourable, may have an impact on the intervals between the categories. In addition, all the items within a true Likert scale usually have either five or seven response categories, but the Arthritis Impact Measurement Scale and the SF-36 did not use a constant number of response categories, which again may impact the scale. However, it is not clear whether these changes from a traditional Likert scale have a significant impact as there was empirical support for the scaling assumptions of traditional Likert scales in the SF-36 subscales <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Scoring method</p>
               </st>
               <p>The McGill Pain Questionnaire-PRI used three possible scoring methods for the list of pain descriptors: the number of items chosen (NWC), the mean scale values (PRI(S)), or the summed rank values of items chosen ((PRI(R)). An alternative weighted-rank method of scoring was also developed <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The PPI score was simply the value selected from the 1&#8211;5 response scale. The Disease Repercussion Profile used profile scores, where the handicap rating for each domain was plotted on a bar chart to obtain a handicap profile for each patient.</p>
               <p>Two measures containing items with different numbers response categories addressed this in their scoring. The Arthritis Impact Measurement Scale used a standardised additive scale. The SF-36 recalibrated the additive scores for linearity and transformed the scores. The American Knee Society Score, Harris Hip Score, Hospital for Special Surgery Knee Score, and Lequesne Hip and Knee Indices (all with varying numbers of response categories) used summated scale systems with the Hospital for Special Surgery Knee Score and American Knee Society Score having items that result in deductions from the point score, e.g., Hospital for Special Surgery Knee Score uses a one point deduction for using a cane. It is unclear how this scoring method was derived and why responses to certain items were allocated their particular points with some items having more weighting than others.</p>
               <p>The scoring of the measures with constant numbers of response categories varies; an additive score was used for the Likert-type scales of the Oxford Hip and Knee Questionnaires and WHOQOL. An additive scale is also most commonly used for the WOMAC, however other weighting and aggregation methods were proposed (i.e. normalisation, pooled index, weighting by relative importance, response criteria) <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. In addition, the WOMAC can be scored using a signal method where patients are asked to select the most important item from each subscale. However, there are concerns about the stability of using the signal method and is not currently recommended <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. The score for the HAQ-DI items was based on the highest score on any item within each of the eight subscales. The subscale scores were adjusted to take account of the use of aids. An overall disability score was calculated as the average of the subscale scores. The EuroQol could be scored as a profile or a weighted health index based on a table of values from general population samples. A table was used for the Merle D'Aubigne Hip Rating to allow classification of the functional grading of the hip, and an algorithm was provided to calculate improvement after surgery on the hip.</p>
               <p>Three of the measures (Oxford Hip and Knee Questionnaires, Merle D'Aubigne Hip Rating and Lequesne Hip and Knee Indices) had only an overall score. All the others also had subscale scores. The SF-36 and American Knee Society Score only had subscale scores and not an overall score. All other measures had an overall score.</p>
               <p>In sum, the measures use a wide range of scoring procedures, from the complex weightings in the EuroQol to the simple method of the HAQ-DI (using the highest score within each subclass) that does not fully utilise all the information collected. Jenkinson, 1991 <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> demonstrated that complex weighting methods gain little over a simple scoring system, and thus a simple additive method is generally recommended</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Although most measures gave some indication of what they were measuring, few defined the construct or linked it to a theoretical model. The clinician report measures were generally the poorest measures in this respect. The Arthritis Impact Measurement Scale, SF-36 and WHOQOL defined their construct of interest, but it was not related to a theoretical model. The Disease Repercussion Profile and McGill Pain Questionnaire discussed, in detail, their underlying construct (although without a stated definition of terms).</p>
         <p>The measures that appeared to have the weakest methodological development were the clinician report measures with none defining a scaling strategy. The item selection for the American Knee Society Score was by 'consensus' with no other clinician report measure describing the item selection method. No clinician report measure explained their choice of response categories or scoring method.</p>
         <p>Of the patient self-report measures, only the McGill Pain Questionnaire-PRI was completely developed from a standard scaling procedure. The McGill Pain Questionnaire-PRI was also the only measure with an interval scale, and hence has mathematical and statistical advantages over all the other measures. The other measures that appeared to use a standard scaling procedure were the Arthritis Impact Measurement Scale, Oxford Hip and Knee Questionnaires, SF-36, WOMAC and WHOQOL. The Disease Repercussion Profile and EuroQol used alternative scaling methods, while the HAQ-DI did not appear to have a specific scaling strategy.</p>
         <p>The method of item selection was generally good for the patient self-report measures, although the item reduction methods were not always explained, except for those that used a defined scaling procedure. In addition, the reasoning for the choice of response formats was not often explained. The scoring method was generally appropriate for the scaling method (where used) and for the item response format, although the HAQ-DI used a method that did not maximise the information available.</p>
         <p>In summary, the clinician report measures were poor in terms of both their theoretical framework and methodological development. The patient self-report measures appeared to have acceptable methodological development, although there were some limitations with the HAQ-DI. However only the Arthritis Impact Measurement Scale, SF-36 &amp; WHOQOL defined the construct that they were assessing and no measure was part of a theoretical model.</p>
         <p>While this review has focussed on specific theoretical criteria, it is appreciated that there are other theoretical factors that should be explored such as the rationale for the grouping of items into subscales.</p>
         <p>This review was based on peer reviewed published literature on the development of the measures, and some theoretical aspects of the development may have been unpublished. However, it is important for users of measures to have this background information, and electronic publishing methods may facilitate access to this more detailed information.</p>
         <p>The review was based on OA measures that were frequently referenced in the literature and hence some of the newer measures such as the Knee injury and Osteoarthritis Outcome Score (KOOS) <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, Hip disability and Osteoarthritis Outcome Score (HOOS) <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, Musculoskeletal Functional Assessment Questionnaire (MFA) <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> were not evaluated here. The uptake and utility of these newer measures remains to be seen.</p>
         <p>Further, this review has focussed on measures used as outcome for osteoarthritis and different conclusions may be reached for other health outcomes or for other conditions. Where outcomes are psychologically theorised, e.g. mood measurements such as anxiety, it is likely that they are more theoretically based and would have used development procedures derived from psychometric theory. However, many health outcomes, especially those involving self-report, require a similar level of attention to measurement issues. They assess patients' experience of their health condition and healthcare and therefore relate to unobservable phenomena rather than phenomena that can be observed by others. One reason for the limited development of some of the measures in osteoarthritis may be that such outcomes have not been articulated as psychological in nature and as a result not subjected to normal psychometric evaluation.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>This review has highlighted the general lack of attention given to the theoretical framework of the health outcome measures. It would be valuable if new measures could define what they are measuring and be a construct within a theoretical model.</p>
         <p>The review also demonstrates the large variation in the methodological development of commonly used measures in OA. While patient self-report measures had, in general, good methodological development, this review has also highlighted the relatively poor development of clinician report measures.</p>
         <p>It is suggested that to improve the quality and performance of new measures, the foundations of their theoretical development should be considered before psychometric evaluation is performed. By ensuring measures are both theoretically and empirically valid, improvements in subjective health outcome measures should be possible.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BP participated in the conception and design of the study, the analysis and the drafting and revision of the manuscript. MJ participated in the conception and design of the study and the drafting and revision of the manuscript. DD contributed to the interpretation of the data and revision of the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This study was funded by the Medical Research Council &#8211; Health Services Research Collaboration (MOBILE research programme)</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Measurement in subjective health assessment: themes and prospects</p>
            </title>
            <aug>
               <au>
                  <snm>Jenkinson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bardsley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lawence</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Measuring Health and Medical Outcomes</source>
            <publisher>London: UCL Press</publisher>
            <editor>Jenkinson C</editor>
            <pubdate>1994</pubdate>
            <fpage>176</fpage>
            <lpage>185</lpage>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Streiner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Norman</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Health measurement scales: a practical guide to their development and use</source>
            <publisher>Oxford: Oxford University Press</publisher>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Coefficient alpha and the internal structure of tests</p>
            </title>
            <aug>
               <au>
                  <snm>Cronbach</snm>
                  <fnm>LJ</fnm>
               </au>
            </aug>
            <source>Psychometrika</source>
            <pubdate>1951</pubdate>
            <volume>16</volume>
            <fpage>297</fpage>
            <lpage>334</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF02310555</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Evaluating patient-based outcome measures for use in clinical trials</p>
            </title>
            <aug>
               <au>
                  <snm>Fitzpatrick</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Davey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Buxton</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Health Technol Assess</source>
            <pubdate>1998</pubdate>
            <volume>2</volume>
            <issue>14</issue>
            <fpage>i-iv</fpage>
            <lpage>1-74</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9812244</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <aug>
               <au>
                  <snm>Pedhazur</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Schmelkin</snm>
                  <fnm>LP</fnm>
               </au>
            </aug>
            <source>Measurement, Design and Analysis</source>
            <publisher>New Jersey: Lawrence Erlbaum Associates</publisher>
            <pubdate>1991</pubdate>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Construct validity of instruments measuring impairments in body structures and function in rheumatic disorders: Which constructs are selected for validation? A systematic review</p>
            </title>
            <aug>
               <au>
                  <snm>Swinkels</snm>
                  <fnm>RAHM</fnm>
               </au>
               <au>
                  <snm>Bouter</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Oostendorp</snm>
                  <fnm>RAB</fnm>
               </au>
               <au>
                  <snm>Swinkels-Meewisse</snm>
                  <fnm>IJCM</fnm>
               </au>
               <au>
                  <snm>Dijkstra</snm>
                  <fnm>PU</fnm>
               </au>
               <au>
                  <snm>de Vet</snm>
                  <fnm>HCW</fnm>
               </au>
            </aug>
            <source>Clin Exp Rheumatol</source>
            <pubdate>2006</pubdate>
            <volume>24</volume>
            <fpage>93</fpage>
            <lpage>102</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16539827</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <aug>
               <au>
                  <snm>Bowling</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Measuring health: a review of quality of life measurement scales</source>
            <publisher>Milton Keynes: OUP</publisher>
            <pubdate>1991</pubdate>
         </bibl>
         <bibl id="B8">
            <aug>
               <au>
                  <snm>McDowell</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Newell</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Measuring health: a guide to rating scales and questionnaires</source>
            <publisher>New York: OUP</publisher>
            <pubdate>1987</pubdate>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Validity of psychological assessment &#8211; validation of inferences from persons responses and performances as scientific inquiry into score meaning</p>
            </title>
            <aug>
               <au>
                  <snm>Messick</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Am Psychol</source>
            <pubdate>1995</pubdate>
            <volume>50</volume>
            <fpage>741</fpage>
            <lpage>749</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1037/0003-066X.50.9.741</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Assessing health status and quality-of-life instruments: Attributes and review criteria</p>
            </title>
            <source>Qual Life Res</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <fpage>193</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1015291021312</pubid>
                  <pubid idtype="pmpid" link="fulltext">12074258</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <aug>
               <au>
                  <cnm>World Health Organisation</cnm>
               </au>
            </aug>
            <source>International Classification of Impairments, Disabilities and Handicaps</source>
            <publisher>Geneva: World Health Organisation</publisher>
            <pubdate>1980</pubdate>
         </bibl>
         <bibl id="B12">
            <aug>
               <au>
                  <cnm>World Health Organisation</cnm>
               </au>
            </aug>
            <source>The International Classification of Functioning, Disability and Health</source>
            <publisher>Geneva: World Health Organisation</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B13">
            <title>
               <p>A technique for the measurement of attitudes</p>
            </title>
            <aug>
               <au>
                  <snm>Likert</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Archives of Psychology</source>
            <pubdate>1932</pubdate>
            <volume>140</volume>
            <fpage>44</fpage>
            <lpage>60</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A basis for the scaling of quantitative data</p>
            </title>
            <aug>
               <au>
                  <snm>Guttman</snm>
                  <fnm>LL</fnm>
               </au>
            </aug>
            <source>Am Sociol Rev</source>
            <pubdate>1944</pubdate>
            <volume>9</volume>
            <fpage>139</fpage>
            <lpage>150</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2086306</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <aug>
               <au>
                  <snm>Thurstone</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chave</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>The measurement of attitude</source>
            <publisher>Chicago: University of Chicago Press</publisher>
            <pubdate>1929</pubdate>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Problems with the Sickness Impact Profile: a theoretically based analysis and a proposal for a new method of implementation and scoring</p>
            </title>
            <aug>
               <au>
                  <snm>Pollard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Soc Sci Med</source>
            <pubdate>2001</pubdate>
            <volume>52</volume>
            <fpage>921</fpage>
            <lpage>934</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0277-9536(00)00194-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">11234865</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Epidemiology of research into interventions for the treatment of osteoarthritis of the knee joint</p>
            </title>
            <aug>
               <au>
                  <snm>Chard</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Tallon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dieppe</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Ann Rheum Dis</source>
            <pubdate>2000</pubdate>
            <volume>59</volume>
            <fpage>414</fpage>
            <lpage>418</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1753176</pubid>
                  <pubid idtype="pmpid" link="fulltext">10834855</pubid>
                  <pubid idtype="doi">10.1136/ard.59.6.414</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Duo-Condylar knee arthroscopy</p>
            </title>
            <aug>
               <au>
                  <snm>Ranawat</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Insall</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shine</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Clin Orthop Relat Res</source>
            <pubdate>1976</pubdate>
            <volume>120</volume>
            <fpage>76</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">975669</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Hip arthroplasty and acrylic prosthesis</p>
            </title>
            <aug>
               <au>
                  <snm>D'Aubigne</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Postel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Journal of Bone and Joint Surgery-American Volume</source>
            <pubdate>1954</pubdate>
            <volume>36-A</volume>
            <fpage>451</fpage>
            <lpage>465</lpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The Sickness Impact Profile &#8211; development and final revision of a health-status measure</p>
            </title>
            <aug>
               <au>
                  <snm>Bergner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bobbitt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>WB</fnm>
               </au>
               <au>
                  <snm>Gilson</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>Med Care</source>
            <pubdate>1981</pubdate>
            <volume>19</volume>
            <fpage>787</fpage>
            <lpage>805</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00005650-198108000-00001</pubid>
                  <pubid idtype="pmpid">7278416</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Euroqol-A New Facility for the Measurement of Health-Related Quality-Of-Life</p>
            </title>
            <source>Health Policy</source>
            <pubdate>1990</pubdate>
            <volume>16</volume>
            <fpage>199</fpage>
            <lpage>208</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-8510(90)90421-9</pubid>
                  <pubid idtype="pmpid">10109801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Standards for Validating Health Measures &#8211; Definition and Content</p>
            </title>
            <aug>
               <au>
                  <snm>Ware</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>J Chronic Dis</source>
            <pubdate>1987</pubdate>
            <volume>40</volume>
            <fpage>473</fpage>
            <lpage>480</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0021-9681(87)90003-8</pubid>
                  <pubid idtype="pmpid">3298292</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The MOS 36-item short-form health survey (SF-36) .1. Conceptual-framework and item selection</p>
            </title>
            <aug>
               <au>
                  <snm>Ware</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Sherbourne</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Med Care</source>
            <pubdate>1992</pubdate>
            <volume>30</volume>
            <fpage>473</fpage>
            <lpage>483</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00005650-199206000-00002</pubid>
                  <pubid idtype="pmpid">1593914</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The MOS 36-Item short-form health survey (SF-36) .3. Tests of data quality, scaling assumptions, and reliability across diverse patient groups</p>
            </title>
            <aug>
               <au>
                  <snm>McHorney</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Ware</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>JFR</fnm>
               </au>
               <au>
                  <snm>Sherbourne</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Med Care</source>
            <pubdate>1994</pubdate>
            <volume>32</volume>
            <fpage>40</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00005650-199401000-00004</pubid>
                  <pubid idtype="pmpid">8277801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <aug>
               <au>
                  <snm>Ware</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Snow</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Kosinski</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Gandek</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>SF-36 Health Survey: Manual and interpretation guide</source>
            <publisher>Boston: The Health Institute, New England Medical Center</publisher>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B26">
            <title>
               <p>On the language of pain</p>
            </title>
            <aug>
               <au>
                  <snm>Melzack</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Torgerson</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>Anesthesiology</source>
            <pubdate>1971</pubdate>
            <volume>34</volume>
            <fpage>50</fpage>
            <lpage>59</lpage>
            <xrefbib>
               <pubid idtype="pmpid">4924784</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The McGillI Pain Questionnaire: major properties and scoring methods</p>
            </title>
            <aug>
               <au>
                  <snm>Melzack</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Pain</source>
            <pubdate>1975</pubdate>
            <volume>1</volume>
            <fpage>277</fpage>
            <lpage>299</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0304-3959(75)90044-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">1235985</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The Role of Compensation in Chronic Pain &#8211; Analysis Using A New Method of Scoring the Mcgill Pain Questionnaire</p>
            </title>
            <aug>
               <au>
                  <snm>Melzack</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Katz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jeans</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Pain</source>
            <pubdate>1985</pubdate>
            <volume>23</volume>
            <fpage>101</fpage>
            <lpage>112</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0304-3959(85)90052-1</pubid>
                  <pubid idtype="pmpid">2933623</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The World-Health-Organization Quality-Of-Life Assessment (Whoqol) &#8211; Position Paper from the World-Health-Organization</p>
            </title>
            <source>Soc Sci Med</source>
            <pubdate>1995</pubdate>
            <volume>41</volume>
            <fpage>1403</fpage>
            <lpage>1409</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0277-9536(95)00112-K</pubid>
                  <pubid idtype="pmpid" link="fulltext">8560308</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The World Health Organisation quality of life assessment (WHOQOL): Development and general psychometric properties</p>
            </title>
            <source>Soc Sci Med</source>
            <pubdate>1998</pubdate>
            <volume>46</volume>
            <fpage>1569</fpage>
            <lpage>1585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0277-9536(98)00009-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9672396</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Rationale of the knee-society clinical rating system</p>
            </title>
            <aug>
               <au>
                  <snm>Insall</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Dorr</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>WN</fnm>
               </au>
            </aug>
            <source>Clin Orthop Relat Res</source>
            <pubdate>1989</pubdate>
            <fpage>13</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2805470</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Traumatic Arthritis of the hip after dislocation and acetabular fractures: treatment by Mold arthroplasty</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Journal of Bone and Joint Surgery-American Volume</source>
            <pubdate>1969</pubdate>
            <volume>51_A</volume>
            <fpage>737</fpage>
            <lpage>755</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Indexes of Severity for Osteo-Arthritis of the Hip and Knee &#8211; Validation Value in Comparison with Other Assessment Tests</p>
            </title>
            <aug>
               <au>
                  <snm>Lequesne</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Mery</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Samson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gerard</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Scand J Rheumatol Suppl</source>
            <pubdate>1987</pubdate>
            <volume>65</volume>
            <fpage>85</fpage>
            <lpage>89</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3479839</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Measuring health status in arthritis &#8211; The Arthritis Impact Measurement Scales</p>
            </title>
            <aug>
               <au>
                  <snm>Meenan</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Gertman</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Arthritis Rheum</source>
            <pubdate>1980</pubdate>
            <volume>23</volume>
            <fpage>146</fpage>
            <lpage>152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/art.1780230203</pubid>
                  <pubid idtype="pmpid">7362665</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The Arthritis Impact Measurement Scales &#8211; Further Investigations of A Health-Status Measure</p>
            </title>
            <aug>
               <au>
                  <snm>Meenan</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Gertman</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Dunaif</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Arthritis Rheum</source>
            <pubdate>1982</pubdate>
            <volume>25</volume>
            <fpage>1048</fpage>
            <lpage>1053</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/art.1780250903</pubid>
                  <pubid idtype="pmpid">7126289</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Towards a measure of patient-perceived handicap in rheumatoid-arthritis</p>
            </title>
            <aug>
               <au>
                  <snm>Carr</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Br J Rheumatol</source>
            <pubdate>1994</pubdate>
            <volume>33</volume>
            <fpage>378</fpage>
            <lpage>382</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/rheumatology/33.4.378</pubid>
                  <pubid idtype="pmpid" link="fulltext">8156312</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>A patient-centred approach to evaluation and treatment in rheumatoid arthritis: The development of a clinical tool to measure patient-perceived handicap</p>
            </title>
            <aug>
               <au>
                  <snm>Carr</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Br J Rheumatol</source>
            <pubdate>1996</pubdate>
            <volume>35</volume>
            <fpage>921</fpage>
            <lpage>932</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/rheumatology/35.10.921</pubid>
                  <pubid idtype="pmpid">8883429</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Beyond disability: measuring the social and personal consequences of osteoarthritis</p>
            </title>
            <aug>
               <au>
                  <snm>Carr</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Osteoarthritis Cartilage</source>
            <pubdate>1999</pubdate>
            <volume>7</volume>
            <fpage>230</fpage>
            <lpage>238</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1053/joca.1998.0154</pubid>
                  <pubid idtype="pmpid" link="fulltext">10222222</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Measurement of patient outcome in arthritis</p>
            </title>
            <aug>
               <au>
                  <snm>Fries</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Spitz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kraines</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Holman</snm>
                  <fnm>HR</fnm>
               </au>
            </aug>
            <source>Arthritis Rheum</source>
            <pubdate>1980</pubdate>
            <volume>23</volume>
            <fpage>137</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/art.1780230202</pubid>
                  <pubid idtype="pmpid">7362664</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The dimensions of health outcomes: The Health Assessment Questionnaire, disability and pain scales</p>
            </title>
            <aug>
               <au>
                  <snm>Fries</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Spitz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>DY</fnm>
               </au>
            </aug>
            <source>J Rheumatol</source>
            <pubdate>1982</pubdate>
            <volume>9</volume>
            <fpage>789</fpage>
            <lpage>793</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7175852</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The Health Assessment Questionnaire 1992</p>
            </title>
            <aug>
               <au>
                  <snm>Ramey</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Raynauld</snm>
                  <fnm>J-P</fnm>
               </au>
               <au>
                  <snm>Fries</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Arthritis Care Res</source>
            <pubdate>1992</pubdate>
            <volume>5</volume>
            <fpage>119</fpage>
            <lpage>129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/art.1790050303</pubid>
                  <pubid idtype="pmpid">1457486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The Stanford Health Assessment Questionnaire: A review of its history, issues, progress and documentation</p>
            </title>
            <aug>
               <au>
                  <snm>Bruce</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fries</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>J Rheumatol</source>
            <pubdate>2003</pubdate>
            <volume>30</volume>
            <fpage>167</fpage>
            <lpage>178</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12508408</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Questionnaire on the perceptions of patients about total hip replacement</p>
            </title>
            <aug>
               <au>
                  <snm>Dawson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fitzpatrick</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Carr</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Bone Joint Surg Br</source>
            <pubdate>1996</pubdate>
            <volume>78</volume>
            <fpage>185</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8666621</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Questionnaire on the perceptions of patients about total knee replacement</p>
            </title>
            <aug>
               <au>
                  <snm>Dawson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fitzpatrick</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Carr</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Bone Joint Surg Br</source>
            <pubdate>1998</pubdate>
            <volume>80</volume>
            <fpage>63</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1302/0301-620X.80B1.7859</pubid>
                  <pubid idtype="pmpid" link="fulltext">9460955</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>A preliminary evaluation of the dimensionality and clinical importance of pain and disability in osteoarthritis of the hip and knee</p>
            </title>
            <aug>
               <au>
                  <snm>Bellamy</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Buchanan</snm>
                  <fnm>WW</fnm>
               </au>
            </aug>
            <source>Clin Rheumatol</source>
            <pubdate>1986</pubdate>
            <volume>5</volume>
            <fpage>231</fpage>
            <lpage>241</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02032362</pubid>
                  <pubid idtype="pmpid">3731718</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Validation-study of WOMAC &#8211; A health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug-therapy in patients with osteo-arthritis of the hip or knee</p>
            </title>
            <aug>
               <au>
                  <snm>Bellamy</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Buchanan</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Goldsmith</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>LW</fnm>
               </au>
            </aug>
            <source>J Rheumatol</source>
            <pubdate>1988</pubdate>
            <volume>15</volume>
            <fpage>1833</fpage>
            <lpage>1840</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3068365</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>A Comparative study of signal versus aggregate methods of outcome measurement based on the WOMAC Osteoarthritis Index</p>
            </title>
            <aug>
               <au>
                  <snm>Barr</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bellamy</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Buchanan</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Chalmers</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ford</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Kean</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Kraag</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Gerecz-Simon</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Rheumatol</source>
            <pubdate>1994</pubdate>
            <volume>21</volume>
            <fpage>2106</fpage>
            <lpage>2112</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7869318</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <aug>
               <au>
                  <snm>Bellamy</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Womac Osteoarthritis Index User Guide VII</source>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B49">
            <title>
               <p>On the language of pain</p>
            </title>
            <aug>
               <au>
                  <snm>Melzack</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Anesthesiology</source>
            <pubdate>1971</pubdate>
            <volume>34</volume>
            <fpage>50</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid">4924784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <aug>
               <au>
                  <cnm>World Health Organisation</cnm>
               </au>
            </aug>
            <source>The First Ten Years of the World Health Organisation</source>
            <publisher>Geneva: World Health Organisation</publisher>
            <pubdate>1958</pubdate>
         </bibl>
         <bibl id="B51">
            <aug>
               <au>
                  <snm>Torgerson</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>Theory and Methods of Scaling</source>
            <publisher>New York: Wiley and Son</publisher>
            <pubdate>1958</pubdate>
         </bibl>
         <bibl id="B52">
            <aug>
               <au>
                  <snm>Stewart</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Ware</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Measuring functioning and well being: the medical outcomes study approach</source>
            <publisher>London: Duke University Press</publisher>
            <pubdate>1999</pubdate>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Ordinal scales and foundations of misinference</p>
            </title>
            <aug>
               <au>
                  <snm>Merbitz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grip</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Arch Phys Med Rehabil</source>
            <pubdate>1989</pubdate>
            <volume>70</volume>
            <fpage>308</fpage>
            <lpage>312</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2535599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Observations Are Always Ordinal &#8211; Measurements, However, Must be Interval</p>
            </title>
            <aug>
               <au>
                  <snm>Wright</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Linacre</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Arch Phys Med Rehabil</source>
            <pubdate>1989</pubdate>
            <volume>70</volume>
            <fpage>857</fpage>
            <lpage>860</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2818162</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Why are we weighting &#8211; A critical-examination of the use of item weights in a health-status measure</p>
            </title>
            <aug>
               <au>
                  <snm>Jenkinson</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Soc Sci Med</source>
            <pubdate>1991</pubdate>
            <volume>32</volume>
            <fpage>1413</fpage>
            <lpage>1416</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0277-9536(91)90202-N</pubid>
                  <pubid idtype="pmpid">1871612</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Knee injury and osteoarthritis outcome score (KOOS) &#8211; Development of a self-administered outcome measure</p>
            </title>
            <aug>
               <au>
                  <snm>Roos</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Lohmander</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Ekdahl</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Beynnon</snm>
                  <fnm>BD</fnm>
               </au>
            </aug>
            <source>J Orthop Sports Phys Ther</source>
            <pubdate>1998</pubdate>
            <volume>28</volume>
            <fpage>88</fpage>
            <lpage>96</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9699158</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Hip disability and osteoarthritis outcome score &#8211; An extension of the Western Ontario and McMaster Universities Osteoarthritis Index</p>
            </title>
            <aug>
               <au>
                  <snm>Klassbo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mannevik</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Scand J Rheumatol</source>
            <pubdate>2003</pubdate>
            <volume>32</volume>
            <fpage>46</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/03009740310000409</pubid>
                  <pubid idtype="pmpid">12635946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Musculoskeletal function assessment instrument: Criterion and construct validity</p>
            </title>
            <aug>
               <au>
                  <snm>Engelberg</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Agel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Obremsky</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Coronado</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Swiontkowski</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>J Orthop Res</source>
            <pubdate>1996</pubdate>
            <volume>14</volume>
            <fpage>182</fpage>
            <lpage>192</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jor.1100140204</pubid>
                  <pubid idtype="pmpid" link="fulltext">8648494</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
