Skip to main content

Effectiveness of Pilates exercise in treating people with chronic low back pain: a systematic review of systematic reviews

Abstract

Background

Systematic reviews provide clinical practice recommendations that are based on evaluation of primary evidence. When systematic reviews with the same aims have different conclusions, it is difficult to ascertain which review reported the most credible and robust findings.

Methods

This study examined five systematic reviews that have investigated the effectiveness of Pilates exercise in people with chronic low back pain. A four-stage process was used to interpret findings of the reviews. This process included comparison of research questions, included primary studies, and the level and quality of evidence of systematic reviews. Two independent reviewers assessed the level of evidence and the methodological quality of systematic reviews, using the National Health and Medical Research Council hierarchy of evidence, and the Revised Assessment of Multiple Systematic Reviews respectively. Any disagreements were resolved by a third researcher.

Results

A high level of consensus was achieved between the reviewers. Conflicting findings were reported by the five systematic reviews regarding the effectiveness of Pilates in reducing pain and disability in people with chronic low back pain. Authors of the systematic reviews included primary studies that did not match their questions in relation to treatment or population characteristics. A total of ten primary studies were identified across five systematic reviews. Only two of the primary studies were included in all of the reviews due to different inclusion criteria relating to publication date and status, definition of Pilates, and methodological quality. The level of evidence of reviews was low due to the methodological design of the primary studies. The methodological quality of reviews varied. Those which conducted a meta-analysis obtained higher scores.

Conclusion

There is inconclusive evidence that Pilates is effective in reducing pain and disability in people with chronic low back pain. This is due to the small number and poor methodological quality of primary studies. The Revised Assessment of Multiple Systematic Reviews provides a useful method of appraising the methodological quality of systematic reviews. Individual item scores, however, should be examined in addition to total scores, so that significant methodological flaws of systematic reviews are not missed, and results are interpreted appropriately. (348 words)

Peer Review reports

Background

Systematic reviews are ranked as the most valid form of research in several hierarchies of evidence [1, 2]. They provide evidence-based recommendations from the synthesis and critically appraisal of primary studies [3]. Within health care, systematic reviews are used to efficiently obtain advice regarding client management [4]. Conflicting results of systematic reviews, however, creates confusion for readers [5].

Several recently published systematic reviews have investigated the effectiveness of Pilates in people with chronic low back pain (CLBP) [610]. Pilates is a mind-body exercise that targets core stability, strength, flexibility, posture, breathing, and muscle control [11]. It has been recommended in the management of people with CLBP, as this type of exercise may strengthen deep, stabilising muscles that support the lumbar spine, such as transverses abdominis [6, 12]. These muscles are inhibited in people with CLBP [13, 14].

Reviews examining the efficacy of Pilates in people with CLBP, however, report different conclusions. La Touche et al. (2008) [6] suggested that Pilates reduces pain and disability, while Lim et al. (2011) [7] reported that Pilates reduces pain when compared to minimal treatments, but not disability. In contrast, Pereira et al. (2012) [8] concluded that Pilates is ineffective in reducing pain and disability, and Posadzki et al. (2011) [9] suggested that evidence was inconclusive. Aladro-Gonzalvo et al. (2012) also provided conflicting results reporting that Pilates may reduce pain only when compared to minimal intervention, and disability only when compared to other physiotherapeutic treatments [10]. These contradictory findings make it difficult to conclude on the efficacy of Pilates in people with CLBP and to direct use in clinical settings.

A systematic review of reviews was conducted to critically evaluate and summarise the results of all published systematic reviews that have investigated the effectiveness of Pilates exercise in reducing pain and disability in people with CLBP. Areas for improvement for systematic reviews were subsequently identified, and an evidence-based conclusion provided regarding the efficacy of Pilates exercise in people with CLBP.

Methods

A four-stage process was used to determine the appropriateness of systematic review conclusions. This involved comparison of reviews with respect to research questions, included primary studies, their level of evidence and methodological quality (Figure 1). The level of quality of the reviews was assessed using the National Health and Medical Research Council hierarchy of evidence (2009) [1], while the methodological quality was assessed using the Revised Assessment of Multiple Systematic Reviews (R-AMSTAR) [15]. Systematic review findings were then interpreted with respect to these factors.

Figure 1
figure 1

Review process for systematic reviews.

Study design

A systematic review design was chosen over a narrative review as it limits bias in the selection and appraisal of evidence [1618]. In a systematic review, a comprehensive search of the literature is undertaken to answer a focused research question; the search strategy, criterion for selection and critical appraisal of literature is defined; quantitative rather than qualitative results are reported and evidence-based inferences are made [18]. This systematic review was written to meet Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA) guidelines [3].

Search strategy

A comprehensive literature search was undertaken using ten databases including Cumulative Index to Nursing and Allied Health Literature (CINAHL), Cochrane Library, Medline, Physiotherapy Evidence Database (PEDro), ProQuest:Health and Medical Complete, Proquest: Nursing and Allied Health Source, Proquest Research Library: Health and Medicine, Scopus, Sport Discus, and Web of Science. The standardised search strategy included the use of Medical Subject Headings (MeSH) terms “Pilates” and “Low Back Pain”, and search term “Review” in the title, abstract, and as able, the keyword fields within maximal date ranges of each database up until November 4, 2012 (Table 1).

Table 1 Search strategy: using medical search headings (MeSH) “Pilates” and “Low Back Pain”, and search term “Review”

Preliminary searching revealed that expanding the search to include “exercise”, “motor control”, and “core stability” did not identify any additional reviews, nor did changing the Boolean operator to “or”. Removing “Low Back Pain” and “Review” also did not help identify any additional systematic reviews. Secondary searching of reference lists of included papers was undertaken to identify any additional, relevant studies that met the inclusion criteria.

Selection procedures

Selection of relevant papers was based on the title, and if required, review of the abstract or full text of the document. Papers identified from the search process were assessed against inclusion and exclusion criteria by two independent reviewers (CW, BH). If there were any discrepancies in selected papers between the two reviewers, a third reviewer (AB) independently reviewed the papers and through discussion, obtained a consensus.

Selection criteria

To be included in this systematic review, systematic reviews needed to:

  • Be identified as a systematic review of 2 or more intervention studies. In a systematic review, a comprehensive search of the literature is undertaken to answer a focused research question; the search strategy, criterion for selection and critical appraisal of literature is defined; quantitative rather than qualitative results are reported and evidence-based inferences are made [16, 17]. Narrative reviews or expert commentaries did not meet inclusion requirements [17].

  • Be published in the English language. For ease of interpretation and access, reviews that were unpublished or published in another language were excluded.

  • Include human participants with chronic low back pain, that is, localised pain in the lumbar region that lasts for more than three months [19]. If reviews only included participants with low back pain lasting less than three months, they were excluded.

  • Assess the effectiveness of Pilates, where the term “Pilates” was used to describe the type of prescribed exercise being investigated. Exercises described as “motor control” or “lumbar stabilisation” did not suffice for Pilates. This is because Pilates may include features in addition to these exercise approaches [11].

  • Use outcome measures to evaluate disability, that is, impairments, activity limitations or participation restrictions according to the International Classification of Health, Functioning, and Disability (ICF) [20]. Pain is considered a functional impairment in the ICF.

Level of evidence

According to the NHMRC hierarchy, the level of evidence of a systematic review depends on the methodological design of included primary studies [1]. Systematic reviews that include only randomised controlled trials are rated as the highest form of evidence. Systematic reviews that include studies other than randomised controlled trials are rated only as high as the lowest level of evidence represented by primary studies (Table 2). Two independent reviewers graded the level of evidence of systematic reviews according to the NHRMC hierarchy of evidence [1]. Any discrepancies between the two reviewers were discussed with a third reviewer to obtain a consensus.

Table 2 Modified national health and medical research council (NHMRC) hierarchy of evidence

Methodological quality

The methodological quality of included systematic reviews was evaluated using the R-AMSTAR [15]. The R-AMSTAR rates the methodological quality of systematic reviews by providing a numerical score for 11 items (Table 3). For each item, the methodological quality is scored out of 4 where one indicates poor methodological quality, and four indicates excellent methodological quality [15]. R-AMSTAR items originate from the Assessment of Multiple Systematic Reviews (AMSTAR). While the AMSTAR has been shown to be valid and reliable in assessing the methodological quality of reviews, the numerical score provided by the R-AMSTAR provides an additional quantitative score that is easy to interpret [15, 21, 22].

Table 3 R-AMSTAR scores for methodological quality of systematic reviews

Two independent reviewers graded the reviews, with any discrepancies being resolved by discussion with a third reviewer. R-AMSTAR items were graded as per guidelines provided by Kung et al. (2010) [15]. Percentile ranks were not calculated in this systematic review due to the small number of reviews being considered. Following grading of the methodological quality of the three systematic reviews, the percentage agreement and kappa score of agreement, and 95% confidence interval, between the two independent reviewers were calculated.

Data extraction and syntheses

The following data were extracted and synthesised from selected papers:

  1. 1.

    Author(s), year of publication, and reference of systematic reviews. Descriptive statistics were used to summarise the number of systematic reviews and dates of publication.

  2. 2.

    The findings and conclusions of systematic reviews in relation to pain and disability, including effect sizes and 95% confidence intervals provided by meta-analyses.

  3. 3.

    Author(s), year of publication, and reference of primary studies included in the systematic reviews. Descriptive statistics were used to summarise the number of primary studies, and differences in included primary studies across systematic reviews.

  4. 4.

    The NHMRC level of evidence and R-AMSTAR scores for methodological quality were calculated for each review and tabulated alongside author(s) and year of publication.

  5. 5.

    The research questions of systematic reviews in terms of study population, intervention, comparisons, and outcome measures. This included consideration of systematic review aims, and corresponding included primary study details.

Results

A total of 44 papers were identified using the search strategy described in the methods. Five of these papers fulfilled selection criteria [610]. There was 100% agreement among the two independent reviewers on the selection of the systematic reviews. Most papers were excluded due to being duplicates, or not using a systematic review methodology (Figure 2).

Figure 2
figure 2

Results of literature search.

Findings of systematic reviews

The five reviews had conflicting conclusions regarding the effectiveness of Pilates in reducing pain and disability in people with CLBP (Table 7). Three of the reviews conducted meta-analyses [7, 8, 10]. Aladro-Gonzalvo et al. (2012) [10] also conducted a meta-regression analysis to identify co-variants that may have contributed to the heterogeneity of treatment effect across studies [23]. No predictor variable, however, was identified.

Research questions

  1. a)

    Population

    The authors of all reviews, apart from Posadzki et al. (2011) [9], failed to ensure the duration of symptoms reported by participants in primary studies matched with their research questions. For example, La Touche et al. (2011) [6] and Pereira et al. (2012) [8] aimed to focus on participants with CLBP, and Aladro-Gonzalvo et al. (2012) [10] and Lim et al. (2011) [7] on participants with low back pain lasting more than 6 weeks. The authors of these reviews, however, included primary studies with participants with acute, subacute, recurrent or chronic low back pain (Table 4).

  2. b)

    Intervention

    Diverse Pilates exercise protocols for people with low back pain were reported across reviews (Table 4). In the majority of primary studies, authors prescribed Pilates mat exercises, although Anderson (2005) [24] and Rydeard et al. (2006) [25] suggested use of specialised Pilates equipment. Only 60% of primary studies described home exercises as part of the Pilates protocol [2429].

    The validity of Pilates exercise interventions in reviews also varied. La Touche et al. (2008) [6], Lim et al. (2011) [7], Pereira et al. (2012) [8], and Aladro-Gonzalvo et al. (2012) [10], ensured that treatments in primary studies were described solely as Pilates exercise. Posadzki et al. (2011) [9], however, included a primary study where treatment involved yoga, rehabilitation, and physical therapy as well [30].

  3. c)

    Comparison

    Comparison treatments varied considerably, ranging from no exercise, usual care, massage, physiotherapy, and alternative exercises (Table 4). Usual care comparison treatments also differed, ranging from education and medication, to physiotherapy and bracing [25, 30, 31]. Co-interventions were also evident in two primary studies [29, 31].

    There was also inconsistency across reviews regarding the description of comparison physiotherapy treatment within the Obrien et al. 2006 [32] study. Pereira et al. (2012) [8] defined the type of physiotherapy as lumbar stabilisation exercise, while Lim et al. (2006) [7] reported that the physiotherapy treatment included other modalities as well.

  4. d)

    Outcome measures

    Similar outcome measures were used across primary studies and in the systematic reviews (Table 4). Lim et al. (2011) [7], Aladro-Gonzalvo et al. (2012) [10], and Pereira et al. (2012) [8], however, elected to use different outcome measures for pain given in the same primary study (Anderson, 2005) [24]. That is, Lim et al. (2011) [7] and Aladro-Gonzalvo et al. (2012) [10] used the Miami Back Pain Index scores [33], while Pereira et al. (2012) used pain scores given within the Short Form Health Survey (SF-36) [34].

    Although similar outcome measures were used across reviews, participants were evaluated at different points in time across primary studies. Timing of evaluation was dependent on the duration of the Pilates treatment and the length of follow up. The shortest follow up was at 6 weeks [28, 31, 32] and longest follow up assessment was at 12 months following the completion of Pilates treatment [24, 25, 30].

Table 4 Description of population, intervention, comparison, outcomes measures in systematic reviews

Included primary studies

There were ten different primary studies identified across the five systematic reviews [2432, 35] (Table 5). La Touche et al. (2008) [6] and Posadzki et al. (2011) [9] included only studies published in full, as opposed to Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7], and Pereira et al. (2012) [8] who included studies that were unpublished, or part-published [24, 28, 29, 32, 35]. Pereira et al. (2012) [8] also only included studies that had low risk of bias as defined by the Cochrane Back Review Group [36]. This meant that Donzelli et al. (2006) [27] and Quinn (2005) [35] were not included in this review.

Table 5 Primary studies included in systematic reviews

Level of evidence

There was 100% agreement between reviewers regarding the methodological design, and level of evidence of the primary studies and the systematic reviews. Primary studies consisted of randomised controlled trials (n=4), pseudo-randomised controlled trials (n=5), and a parallel case series (n=1). According to the National Health and Medical Research Council (NHMRC) hierarchy, the level of evidence represented by these primary studies ranges from Level II to Level IV evidence [1] (Table 6).

Table 6 Primary studies: level of evidence and methodological design

Aladro-Gonzalvo et al. (2012) [10], La Touche et al. (2008) [6], Lim et al. (2011) [7], and Posadzki et al. (2011) [9] included Donzelli et al. (2006) [27], a parallel case series article. These three reviews consequently represent Level IV evidence on the NHMRC hierarchy [1]. Pereira et al. (2012) [8] excluded Donzelli et al. (2006) [27], but included two pseudo-randomised controlled trials [31, 32]. This means that the systematic review by Pereira et al. (2012) [8] represents Level III evidence on the NHMRC hierarchy [1].

Methodological quality

The two reviewers agreed on 84% of R-AMSTAR scores across the systematic reviews (46/55). Different scores were obtained for criterion 9 and 10 for Aladro-Gonzalvo et al. (2012) [10], criterion 1, 2 and 6 for La Touche et al. (2008) [6], criterion 3 for Lim et al. (2011) [7], criterion 7 and 9 for Pereira et al. (2012) [8], criterion 8 for Posadzki et al. (2011) [9]. The inter-rater agreement for R-AMSTAR scores remained “substantial” when chance agreement was eliminated (kappa: 0.78, 95% confidence interval: 0.71-0.85) [37]. All disagreements were resolved through discussion with a third reviewer.

The R-AMSTAR scores of methodological quality of systematic reviews ranged from 19–37 out of 44 (Table 3). Aladro-Gonzalvo et al. (2012) [10] achieved the highest total score (37/44), followed by Lim et al. (2011) [7] (35/44), Pereira et al. (2012) [8] (32/44), Posadzki et al. (2011) [9] (30/44), and La Touche et al. (2008) [6] (19/44). The R-AMSTAR scores indicate that all reviews lacked a thorough assessment of publication bias and statement regarding conflict of interest. Duplicate data selection and extraction, inclusion of grey literature, listing of excluded studies, and documentation of study characteristics were also insufficient in several reviews [610].

Finally, R-AMSTAR scores identified that La Touche et al. (2008) [6] and Pereira et al. (2012) [8] needed to improve consideration of the methodological quality of the primary studies when formulating conclusions. Also, La Touche et al. (2008) [6] and Posadzki et al. (2011) [9] did not provide a justification for not undertaking a meta-analysis, and Lim et al. 2011 [7] and Aladro-Gonzalvo et al. (2012) [10] needed to improve their method of combining findings of primary studies in their meta-analyses.

Discussion

This systematic review identified five published reviews that have investigated the efficacy of Pilates exercise in the treatment of people with CLBP [610]. These reviews have different conclusions, despite having similar research aims. To interpret results of reviews, a comparison of research questions, included primary studies, the level of evidence, and the methodological quality of systematic reviews was undertaken (Figure 1). This process assisted in identifying and understanding the reasons for the different review findings, and in considering the validity of those findings [36].

Research questions

La Touche et al. (2008) [6] and Posadzki et al. (2011) [9] included primary studies with participants with sub acute, chronic or recurrent low back pain. Meanwhile, Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7] and Pereira et al. (2012) [8] incorporated an additional primary study that included participants with acute low back pain as well [28]. Outcomes reported by and Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7] and Pereira et al. (2012)[8] therefore may be conservative and underestimate the effects of Pilates in people with CLBP, as people with acute low back pain tend to respond less favourably to exercise [38].

The findings of Aladro-Gonzalvo et al. (2012) [10], La Touche et al. (2008) [6], Lim et al. (2011) [7], and Pereira et al. (2012) [8] relate to people with non-specific low back pain. Non-specific low back pain is pain in the lower back without an identifiable pathology [39]. In contrast, Posadski et al. (2011) [9] included an additional primary study with participants with low back pain related to disc pathology in the lumbar spine [30]. Further research into the effectiveness of Pilates in people with low back pain with specific pathologies should be undertaken so that conclusions can be made regarding the efficacy Pilates in people with all forms of low back pain [36].

With regards to treatment, Aladro-Gonzalvo et al. (2012) [10], La Touche et al. (2008) [6], Lim et al. (2011) [7], and Pereira et al. (2012) [8] included primary studies that investigated only Pilates exercise. Posadzki et al. (2011) [9], however, included a primary study that evaluated the effectiveness of an intervention that was only part-Pilates [30]. Treatment effects reported by this review may consequently relate to other therapies provided other than Pilates to the intervention group [40].

Pilates exercise protocols varied considerably across primary studies (Table 4). Authors of reviews reported Pilates exercise sessions of 15–60 minutes duration, 1–7 times per week, for 10 days and up to 12 months [610]. There was also variation in the use of mat versus specialised equipment, and incorporation of home exercises [7]. Further research is therefore required to define the essential elements of Pilates exercise in people with chronic low back pain [10].

In terms of comparison treatments, usual care was defined differently across the primary studies [25, 30, 31]. This may have resulted in an inaccurate measurement of Pilates treatment effect as participants had variable types and amounts of “usual care” in both treatment and comparison groups [40]. Pereira et al. (2012) [8] and Lim et al. (2011) [7] also described physiotherapy interventions provided by O’Brien et al. (2006) [32] differently. Pereira et al. (2012) [8] considered physiotherapy to consist of only lumbar stabilisation exercise, however, Lim et al. (2011) [7] reported physiotherapy treatment as also involving other modalities. This may have also contributed to inaccurate measurements of treatment effect with the pooling of primary studies with variable comparison treatments [40].

Similar outcome measures were used in primary studies to assess the effect of Pilates on pain and disability. The majority of these outcome measures are validated for use in people with low back pain, and have been found to be reliable [33, 34, 41]. The different treatment effects reported by Lim et al. (2011) [7] and Pereira et al. (2012) [8], however, could relate to the use of different outcome measures for pain intensity provided for Anderson (2005) [24].

Different findings between meta-analyses could also relate to different grouping of primary studies. For example, Aladro-Gonzalvo et al. (2012) [10] considered alternative exercise to Pilates to be a minimal intervention, while Lim et al. (2011) [7] and Pereira et al. (2012) [8] did not. Classifying alternative exercise to Pilates as a “minimal intervention” could be considered inappropriate as exercise has been found to reduce pain and disability in people with CLBP [38]. Effect sizes for Pilates may therefore be more conservative in Aladro-Gonzalvo et al. (2012) [40].

Included primary studies

A comparison of included primary studies in reviews was undertaken as incorporating additional evidence can lead to different results [42]. Nine of the primary studies were available at the time of publication of the first systematic review [6]. La Touche et al. (2008) [6] and Posadzki et al. (2011) [9], however, chose to exclude unpublished primary studies and abstract articles (Table 7). This means that the findings of these reviews could be inflated as unpublished studies often have outcomes that are less positive or statistically insignificant [43].

Table 7 Findings of systematic reviews: effectiveness of Pilates in people with chronic low back pain

In contrast, Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7] and Pereira et al. (2012) [8] included several unpublished theses and an abstract study in their reviews (Table 5). These reviews, then, are likely to have less publication bias and more realistic findings [43]. Pereira et al. (2012) [8] also excluded primary studies that had a high risk of bias as defined by the Cochrane Back Review Group [36]. This review’s findings may therefore have greater credibility than other reviews [44].

The meta-regression analysis undertaken by Aladro-Gonzalvo et al. (2012) [10] did not identify any predictor variables that could explain differences in treatment effects across studies. This is not surprising, however, as the power of meta-regression was limited due to too few studies, and their heterogeneity [23, 45]. The rationale for examining several co-variants is also questionable, and aggregation bias likely as client-specific characteristics such as the duration of complaint were taken from the mean results of studies rather than individual statistics [23, 46, 47].

Level of evidence

The NHMRC level of evidence of all reviews was lower than expected for systematic reviews due to the inclusion of primary studies that were not randomised controlled trials. Aladro-Gonzalvo et al. (2012) [10], La Touche et al. (2011) [6], Lim et al. (2011) [7], and Posadzki et al. (2011) [9] represent the lowest level of evidence (Level IV) on the NHMRC hierarchy [1]. This is because these reviews included Donzelli et al. (2006) [27], a parallel case series article. Pereira et al. (2012) [8], however, represents Level III evidence on the NHMRC hierarchy as this review included only pseudo-randomised and randomised controlled trials. This means the findings of all reviews may contain bias related to the methodological design of primary studies, but Pereira et al. (2012) [8] may be less biased than other reviews [1, 48].

Methodological quality

The methodological quality of reviews was analysed to assist in the interpretation of findings [5]. The R-AMSTAR provided a numerical score of methodological quality for each review based on AMSTAR criteria [11]. The AMSTAR is reported as valid and reliable in assessing methodological quality of systematic reviews [5, 15, 21, 22]. The inter-rater agreement for R-AMSTAR scores remained “substantial” as indicated by a kappa score of 0.78, 95% confidence interval: 0.71-0.85 [37]. This is similar to other scores reported for AMSTAR in the literature [22].

R-AMSTAR scores provide an indication of level of bias in review findings with high scores indicating greater credibility of findings [15]. Findings of Aladro-Gonzalvo et al. (2012) [10] which scored 37/44, can therefore be considered to be the most robust in relation to the methodological quality of systematic reviews. Examining individual item scores with the R-AMSTAR, however, is also critical to identify factors that influence the credibility of findings.

La Touche et al. (2008) [6] and Pereira et al. (2012) [8], for example, did not consider the methodological quality of primary studies in forming their conclusions. This is despite significant methodological flaws being identified in primary studies, such as small sample sizes, baseline differences between treatment and control groups, high drop-out rates, lack of assessor blinding, and intention to treat analyses [6, 7, 9]. The conclusions of La Touche et al. (2008) [6] and Pereira et al. (2012) [8], therefore, need to be interpreted with caution as these factors were not considered [49].

There is also a concern that the high R-AMSTAR scores of Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7] and Pereira et al. (2012) [8] do not reflect the inappropriateness of conducting a meta-analysis. Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7] and Pereira et al. (2012) [8] pooled the results of primary studies that had similar comparison groups, but different treatment protocols, outcome measures, and timing of re-assessments (Table 2). This clinical heterogeneity should have indicated that conducting a meta-analysis was inappropriate [36]. This is because pooling heterogeneous studies can produce inaccurate treatment effects [15, 50, 51].

Significant statistical heterogeneity (for example I2>60%) was also reported in both reviews when Pilates was compared to usual care [7, 8, 10]. This again suggests meta-analysis is inappropriate [52]. Using a random effects model to compensate for heterogeneity may have assisted to improve the accuracy of findings, but it does not explain or remove the primary study differences [36]. Moreover, combining two few primary studies in a meta-analysis can also produce misleading results [53]. The findings of Aladro-Gonzalvo et al. (2012) [10], Lim et al. (2011) [7] and Pereira et al. (2012) [8] therefore need to be interpreted carefully due to the small number and heterogeneity of primary studies.

Conclusion

We are in agreement with Posadzki et al. (2011) [9], that there is inconclusive evidence that Pilates is effective in reducing pain and disability in people with CLBP. This conclusion relates to the insufficient number and methodological quality of available primary studies, rather than the methodological quality of reviews. These findings contrast to other review conclusions where Aladro-Gonzalvo et al. (2012) [10], La Touche et al. (2008) [6] and Lim et al. (2011) [7] report some effectiveness of Pilates, and Pereira et al. (2012) report no effectiveness.

Subsequent systematic reviews need to ensure that conclusions consider the methodological design and quality of primary studies. Meta-analyses and meta-regression analyses should also not be conducted when there is significant clinical and statistical heterogeneity across studies, and when primary studies are few in number. The Revised Assessment of Multiple Systematic Reviews provides a useful method of appraising the methodological quality of systematic reviews. Individual item scores, however, need to examined, in addition to total scores. This will ensure that significant methodological flaws are not missed, and results of reviews are interpreted appropriately.

Authors’ information

CW is a registered physiotherapist undertaking doctoral studies at the University of Western Sydney under the supervision of AB, GSK, and PM. AB is an Associate Professor of Physiotherapy at Griffith University, however, the majority of this work was undertaken while AB was the Foundation Associate Professor and Head of Physiotherapy program at the University of Western Sydney. GSK is the Professor of Health Science and Dean of Science and Health, PM a Senior Lecturer, and CW a Lecturer in the School of Science and Health at the University of Western Sydney. BM is a registered physiotherapist and doctoral student at the University of Western Sydney.

Abbreviations

AMSTAR:

Assessment of Multiple Systematic Reviews

CINAHL:

Cumulative index to Nursing and Allied Health Literature

CLBP:

Chronic Low Back Pain

ICF:

International Classification of Health Functioning, and Disability

NHMRC:

National Health and Medical Research Council

PEDRO:

Physiotherapy Evidence Database

PRISMA:

Preferred Reporting Items for Systematic Review and Meta-Analyses

R-AMSTAR:

Revised Assessment of Multiple Systematic Reviews

SF-36:

36 Item Short Form Health Survey.

References

  1. National Health and Medical Research Council: NHMRC levels of evidence and grades for recommendations for developers of guidelines. 2009, Canberra: National Health and Medical Research Council

    Google Scholar 

  2. Oxford Centre for Evidence-Based Medicine: Levels of Evidence. Retrieved February 12, 2012, from http://www.cebm.net/index.aspx?o=1025; 2009

  3. Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group: Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med. 2009, 6: e1000097-10.1371/journal.pmed.1000097. http://www.plosmedicine.org/article/info%3Adoi%2F10.1371%2Fjournal.pmed.1000097,

    Article  PubMed  PubMed Central  Google Scholar 

  4. Evans D: Hierarchy of evidence: A framework for the ranking of evidence evaluating nursing interventions. J Clin Nurs. 2003, 12: 77-84. 10.1046/j.1365-2702.2003.00662.x.

    Article  PubMed  Google Scholar 

  5. Smith V, Devan D, Begley CM, Clarke M: Methodology in conducting a systematic review of systematic reviews of healthcare interventions. BMC Med Res Methodol. 2011, 11: 15-10.1186/1471-2288-11-15.

    Article  PubMed  PubMed Central  Google Scholar 

  6. La Touche R, Escalante K, Linares MT: Treating non-specific chronic low back pain through the Pilates Method. J Bodyw Mov Ther. 2008, 12: 364-370. 10.1016/j.jbmt.2007.11.004.

    Article  PubMed  Google Scholar 

  7. Lim ECW, Poh RLC, Low AY, Wong WP: Effects of Pilates-based exercises on pain and disability in individuals with persistent non specific low back pain: A systematic review with meta-analysis. J Orthop Sports Phys Ther. 2011, 41: 70-80.

    Article  PubMed  Google Scholar 

  8. Pereira LM, Obara K, Dias JM, Menacho MO, Guariglia DA, Schiavoni D, Pereira HM, Cardoso JR: Comparing the Pilates method with no exercise or lumbar stabilisation for pain and functionality in patients with chronic low back pain: Systematic review and meta-analysis. Clin Rehabil. 2012, 26: 10-20. 10.1177/0269215511411113.

    Article  PubMed  Google Scholar 

  9. Posadzki P, Lizis P, Hagner-Derengowska M: Pilates for low back pain: A systematic review. Complement Ther Clin Pract. 2011, 17: 85-89. 10.1016/j.ctcp.2010.09.005.

    Article  PubMed  Google Scholar 

  10. Aladro-Gonzalvo AR, Araya-Vargas GA, Machado-Diaz M, Salazar-Rojas W: Pilates-based exercise for persistent, non specific low back pain and associated functional disability: A meta-analysis with meta-regression. J Bodyw Mov Ther. 2012, http://dx.doi.org/10.1016/j.jbmt.2012.08.003

    Google Scholar 

  11. Wells C, Bialocerkowski A, Kolt GS: Definition of Pilates: A systematic review. Complement Ther Med. 2012, 20: 253-262. 10.1016/j.ctim.2012.02.005.

    Article  PubMed  Google Scholar 

  12. Endleman I, Critchley DJ: Transversus abdominis and obliquus internus activity during pilates exercises: measurement with ultrasound scanning. Arch Phys Med Rehabil. 2008, 89: 2205-2212. 10.1016/j.apmr.2008.04.025.

    Article  PubMed  Google Scholar 

  13. Ferreira PH, Ferreira ML, Maher CG, Refshauge K, Herbert R, Hodges PW: Changes in recruitment of transversus abdominis correlate with disability in people with chronic low back pain. Br J Sports Med. 2010, 44: 1166-72. 10.1136/bjsm.2009.061515.

    Article  CAS  PubMed  Google Scholar 

  14. Wallwork T, Stanton W, Freke M, Hides J: The effect of chronic low back pain on size and contraction of the lumbar multifidus muscle. Man Ther. 2009, 14: 496-500. 10.1016/j.math.2008.09.006.

    Article  PubMed  Google Scholar 

  15. Kung J, Chiappelli F, Cajulis OO, Avezova R, Kossan G: From systematic reviews to clinical recommendations for evidence-based health care: Validation of revised assessment of multiple systematic reviews (R-AMSTAR) for grading of clinical relevance. Open Dent J. 2010, 4: 84-91.

    PubMed  PubMed Central  Google Scholar 

  16. Collins J, Fauser B, Bart CJM: Balancing the strengths of systematic and narrative reviews. Human Reprod Update. 2005, 11: 103-104.

    Article  Google Scholar 

  17. Cook D, Mulrow C, Haynes R: Systematic reviews: Synthesis of best evidence for clinical decisions. Ann Intern Med. 1997, 126: 376-380.

    Article  CAS  PubMed  Google Scholar 

  18. Schlesselman JJ, Collins JA: Evaluating systematic reviews and meta-analyses. Semin Reprod Med. 2003, 21: 95-105.

    Article  PubMed  Google Scholar 

  19. Charlton JE: Core Curriculum for Professional Education in Pain. 2005, Seattle: International Association of the Study of Pain (IASP) Press, 3

    Google Scholar 

  20. Steiner WA, Ryser L, Huber EO, Uebelhart D, Aeschlimann A, Stucki G: Use of the ICF model as a clinical problem-solving tool in physical therapy and rehabilitation medicine. Phys Ther. 2002, 82: 1098-1107.

    PubMed  Google Scholar 

  21. Shea B, Grimshaw J, Wells G, Boers M, Andersson N, Hamel C, Porter AC, Tugwell P, Moher D, Bouter LM: Development of AMSTAR: a measurement tool to assess the methodological quality of systematic reviews. BMC Med Res Methodol. 2007, 7: 10-10.1186/1471-2288-7-10.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Shea BJ, Hamel C, Wells GA, Bouter LM, Kristjanson E, Grimshaw J, Henry DA, Boers M: AMSTAR is a reliable and valid measurement tool to assess the methodological quality of systematic reviews. J Clin Epidemiol. 2009, 62: 1013-1020. 10.1016/j.jclinepi.2008.10.009.

    Article  PubMed  Google Scholar 

  23. Baker WL, White CM, Cappelleri JC, Kluger J, Coleman C: Understanding heterogeneity in meta-analysis: The role of meta-regression. Int J Clin Practice. 2009, 63: 1426-1434. 10.1111/j.1742-1241.2009.02168.x.

    Article  CAS  Google Scholar 

  24. Anderson B: Randomised clinical trial comparing active versus passive approaches to the treatment of recurrent and chronic low back pain. 2005, University of Miami, PhD thesis

    Google Scholar 

  25. Rydeard R, Leger A, Smith D: Pilates-based therapeutic exercise: Effect on subjects with nonspecific chronic low back pain and functional disability: A randomized controlled trial. J Orthop Sports Phys Ther. 2006, 36: 472-484.

    Article  PubMed  Google Scholar 

  26. da Fonseca JL, Magini M, de Freitas TH: Laboratory Gait Analysis in Patients with Low Back Pain Before and After a Pilates Intervention. J Sport Rehabil. 2009, 18: 269-282.

    PubMed  Google Scholar 

  27. Donzelli S, Di Domenica F, Cova AM, Galletti R, Giunta N: Two different techniques in the rehabilitation treatment of low back pain: a randomized controlled trial. Europa Medicophysica. 2006, 42: 205-210.

    CAS  PubMed  Google Scholar 

  28. Gagnon LH: Efficacy of Pilates exercises as therapeutic intervention in treating patients with low back pain. 2005, University of Tennessee, PhD thesis

    Google Scholar 

  29. MacIntyre L: The effect of Pilates on patients’ chronic low back pain: A pilot study. Master of Science in Physiotherapy thesis. 2006, Johannesburg: University of the Witwatersrand, Thesis

    Google Scholar 

  30. Vad VB, Bhat AL, Tarabichi Y: The role of back rx exercise program in diskogenic low back pain: A prospective randomized trial. Arch Phys Med Rehabil. 2007, 88: 577-582. 10.1016/j.apmr.2007.02.008.

    Article  PubMed  Google Scholar 

  31. Gladwell V, Head S, Haggar M, Beneke R: Does a program of Pilates improve chronic non-specific low back pain?. J Sport Rehabil. 2006, 15: 338-350.

    Google Scholar 

  32. O’Brien N, Hanlon N, Meldrum D: Randomised controlled trial comparing physiotherapy and Pilates in the treatment of ordinary low back pain[abstract]. Phys Ther Rev. 2006, 11: 224-225. 10.1179/174329306X84300.

    Google Scholar 

  33. Roach K, Carreras K, Lee A, Reed L, Zimmerman G: Development and reliability of the Miami Back Index. JOSPT. 2001, 31: 97-

    Article  Google Scholar 

  34. Chapman JR, Norvell DC, Hermsmeyer JT, Bransford RJ, DeVine J, McGirt MJ, Lee MJ: Evaluating common outcomes for measuring treatment success for chronic low back pain. Spine. 2011, 36 (Suppl): S54-68.

    Article  PubMed  Google Scholar 

  35. Quinn J: Influence of Pilates-based mat exercise on chronic lower back pain. 2005, Boca Raton, Florida: Florida Atlantic University, PhD thesis

    Google Scholar 

  36. Furlan AD, Pennick V, Bombardier C, van Tulder M: Updated method guidelines for systematic reviews in the Cochrane Back Review Group. Spine. 2009, 34: 1929-1941. 10.1097/BRS.0b013e3181b1c99f.

    Article  PubMed  Google Scholar 

  37. Viera AJ, Garrett JM: Understanding interobserver agreement: The kappa statistic. Fam Med. 2005, 37: 360-363.

    PubMed  Google Scholar 

  38. Maher C: Effective physical treatment for chronic low back pain. Orthop Clin North Am. 2004, 35: 57-64. 10.1016/S0030-5898(03)00088-9.

    Article  CAS  PubMed  Google Scholar 

  39. Krismer M, van Tulder M: Strategies for prevention and management of musculoskeletal conditions. Low back pain (non-specific). Best Pract Res Clin Rheumatol. 2007, 21: 77-91. 10.1016/j.berh.2006.08.004.

    Article  CAS  PubMed  Google Scholar 

  40. Godwin M, Ruhland L, Casson I, MacDonald S, Delva D, Birtwhistle R, Lam M, Seguin R: Pragmatic controlled clinical trials in primary care: The struggle between external and internal validity. BMC Med Res Methodol. 2003, 3: 28-10.1186/1471-2288-3-28.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Davidson M, Keating JL: A comparison of five low back disability questionnaires: Reliability and responsiveness. Phys Ther. 2002, 82: 8-24.

    PubMed  Google Scholar 

  42. Hopewell S, McDonald S, Clarke M, Egger M: Grey literature in meta-analyses of randomised trials of health care interventions. Cochrane Database Syst Rev. 2007, 10.1002/14651858.MR000010.pub3. Issue 2. Art No.: MR000010

    Google Scholar 

  43. Dwan K, Altman DG, Arnaiz JA, Bloom J, Chan A-W, Cronin E, Decullier E, Easterbrook PJ, Von Elm E, Gamble C, Ghersi D, Ioannidis JPA, Simes J, Williamson PR: Systematic Review of the Empirical Evidence of Study Publication Bias and Outcome Reporting Bias. PLoS One. 2008, 3: e3081-10.1371/journal.pone.0003081.

    Article  PubMed  PubMed Central  Google Scholar 

  44. van Tulder MW, Suttorp M, Morton S, Bouter LM, Shekella P: Empirical evidence of an association between internal validity and effect size in randomized controlled trials of low-back pain. Spine. 2009, 34: 1685-1692. 10.1097/BRS.0b013e3181ab6a78.

    Article  PubMed  Google Scholar 

  45. Sterne JAC, Gavaghan D, Egger M: Publication and related bias in meta-analysis: Power of statistical tests and prevalence in the literature. J Clin Epidemiol. 2000, 53: 1119-1129. 10.1016/S0895-4356(00)00242-0.

    Article  CAS  PubMed  Google Scholar 

  46. Thomson SG, Higgins JP: How should meta-regression analyses be undertaken and interpreted?. Stat Med. 2002, 21: 15591573-

    Google Scholar 

  47. Lambert PC, Sutton AJ, Abrams KR, Jones DR: A comparison of summary patient-level covariates in regression with individual patient data meta-analysis. J Clin Epidemiol. 2002, 55: 86-94. 10.1016/S0895-4356(01)00414-0.

    Article  CAS  PubMed  Google Scholar 

  48. Egger M, Bartlett C, Holenstein F, Sterne J: How important are comprehensive literature searches and the assessment of trial quality in systematic reviews? Empirical study. Health Technol Assess. 2003, 7: 1-76.

    CAS  PubMed  Google Scholar 

  49. Khad KS, Kunz R, Kleijnen J, Antes G: Five steps to conducting a systematic review. J R Soc Med. 2003, 96: 118-121. 10.1258/jrsm.96.3.118.

    Article  Google Scholar 

  50. Slavin RE: Best evidence synthesis: An intelligent alternative to meta-analysis. Clin Epidemiol. 1995, 48: 9-18. 10.1016/0895-4356(94)00097-A.

    Article  CAS  Google Scholar 

  51. Tobin MJ, Jabran A: Meta-analysis under the spotlight: Focused on a meta-analysis of ventilator weaning. Crit Care Med. 2008, 36: 1-7. 10.1097/01.CCM.0000297883.04634.11.

    Article  PubMed  Google Scholar 

  52. Noordzij M, Hooft L, Dekker FW, Zoccali C, Jager KJ: Systematic reviews and meta-analyses: When they are useful and when to be careful. Kidney Int. 2009, 76: 1130-1136. 10.1038/ki.2009.339.

    Article  PubMed  Google Scholar 

  53. Walker E, Hernandez AV, Kattan MW: Meta-analysis: Its strengths and limitations. Cleve Clin J Med. 2008, 75: 431-440. 10.3949/ccjm.75.6.431.

    Article  PubMed  Google Scholar 

Pre-publication history

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cherie Wells.

Additional information

Competing interests

There are no financial or non-financial competing interests for any of the authors of this review.

Authors’ contributions

CW designed, conducted, and drafted this systematic review with advice and guidance from AB, GSK, and PM. CW and BM evaluated the level of evidence and methodological quality of systematic reviews. AB acted as the third reviewer to obtain a consensus regarding methodological scores of reviews where there was disagreement. All authors reviewed the content of the manuscript prior to submission for its intellectual content. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wells, C., Kolt, G.S., Marshall, P. et al. Effectiveness of Pilates exercise in treating people with chronic low back pain: a systematic review of systematic reviews. BMC Med Res Methodol 13, 7 (2013). https://doi.org/10.1186/1471-2288-13-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1471-2288-13-7

Keywords