How to spot a statistical problem: advice for a non-statistical reviewer

Greenwood, Darren C.; Freeman, Jennifer V.

doi:10.1186/s12916-015-0510-5

Tutorial
Open access
Published: 02 November 2015

How to spot a statistical problem: advice for a non-statistical reviewer

Darren C. Greenwood¹ &
Jennifer V. Freeman¹

BMC Medicine volume 13, Article number: 270 (2015) Cite this article

21k Accesses
15 Citations
188 Altmetric
Metrics details

Abstract

Statistical analyses presented in general medical journals are becoming increasingly sophisticated. BMC Medicine relies on subject reviewers to indicate when a statistical review is required. We consider this policy and provide guidance on when to recommend a manuscript for statistical evaluation. Indicators for statistical review include insufficient detail in methods or results, some common statistical issues and interpretation not based on the presented evidence. Reviewers are required to ensure that the manuscript is methodologically sound and clearly written. Within that context, they are expected to provide constructive feedback and opinion on the statistical design, analysis, presentation and interpretation. If reviewers lack the appropriate background to positively confirm the appropriateness of any of the manuscript’s statistical aspects, they are encouraged to recommend it for expert statistical review.

Peer Review reports

Introduction

Most papers published in general medical journals, including BMC Medicine, contain some element of statistical methods, analysis and interpretation. There is evidence that statistical analyses are becoming increasingly sophisticated [1]. Expert statistical review has therefore become an integral part of the editorial process. Some journals send all manuscripts for statistical review. Other journals only send a manuscript for statistical review if it is considered necessary; for example, if the methods are particularly complex or if the Editor or subject reviewer has concerns. The approach taken by BMC Medicine is to ask subject reviewers if they are able to assess all the statistical aspects of the manuscript themselves or whether they recommend an additional statistical review.

One potential weakness of this approach is that it is a system that relies heavily upon the statistical expertise of subject reviewers, who may not have a formal qualification or professional accreditation in statistics. As such, the subject reviewer may be competent in a specific range of statistical methods applicable to their area of expertise, but may not necessarily be aware of more general statistical issues or more recent methodological developments and best practices. The subject reviewer may be able to spot the most egregious errors but is likely to miss the subtlety of inappropriate statistics that might be picked up by an appropriately qualified statistical expert. The aim of this paper is to provide subject reviewers with some help in deciding when a manuscript might benefit from undergoing a proper statistical review. Our comments mainly refer to review of primary research, rather than to systematic reviews and meta-analysis, for which a separate tutorial is available [2].

Statistical review is an important element of the peer-review process that has been shown to substantially improve the quality of manuscripts [3–5]. This relates not only to the statistical analysis, but also to other relevant areas, such as data sources, study design, presentation of results and interpretation of results [1, 6].

We argue that sending a paper for statistical review should not be limited to studies where the subject reviewer considers the methods to be potentially incorrect, or beyond their expertise. Rather, the subject reviewer should generally recommend expert statistical review unless they can positively confirm that there are no problems with the study design, statistical analysis, presentation and interpretation of results.

Although some statistical irregularities are subtle and only likely to be detected by a statistical expert, subject reviewers should consider some of the following indicators of the more common problems encountered in primary research:

Is there sufficient detail to review the statistical aspects?

Have the relevant reporting guidelines been followed (for example, CONSORT for randomized controlled trials [7] or STROBE for observational studies [8])?
Have the authors justified their sample size and made reasonable assumptions about the effect size they consider important to detect? Have they presented enough information to verify their calculations [9]?
Have the methods been provided in sufficient detail to replicate the results if the data were available [1, 10, 11]?
Is it clear how all the results were derived, such as the test or model used, including any covariates, and were the assumptions made in implementing the model reasonable?

Are there any common statistical issues?

Are there lots of P values, or subgroup analyses, particularly unplanned subgroup analyses that were not pre-specified, indicating multiple testing [12]?
Are the covariates adjusted for in models appropriate, without remaining confounding, or over-adjustment for covariates on the causal pathway (for example, longitudinal studies where a covariate is measured after the exposure)?
Are there any hierarchical data structures (for example, cluster randomized trials, repeated measures or matching of cases and controls), and if so has the analysis taken this into account?
Should the analysis address agreement rather than association [13]?
Has the intention-to-treat principle been appropriately applied in pragmatic effectiveness trials [14, 15]?
Have continuous variables been categorized? Have trends been ignored? This may not necessarily mean an inappropriate analysis, but may indicate that a full statistical review would be beneficial.

Is the presentation of results appropriate?

Is there any evidence of selective reporting? Do the main results focus on the main research question, or do they deviate to a secondary question or subgroup? This is particularly problematic if the subgroup analysis was not specified prior to undertaking the analysis [12].
Are results presented without estimates, just P values [16]?
Are estimates presented with no confidence intervals? Standard errors alone are rarely adequate for presenting the uncertainty in estimates, either in the text or graphically [16].

Is the interpretation of results appropriate?

Are limitations of observational studies correctly acknowledged, with no implication of causality in the wording of results and conclusions?
Are results over-extrapolated, beyond the range of the data, or to populations not represented by the study sample?
Is there an appropriate consideration of the impact of any incomplete or missing data?

Although there might be alternative approaches to statistical analysis or presentation, this does not necessarily imply the authors’ methods are invalid. What is important is that the methods chosen are appropriate for the research question and have been done correctly [17]. BMC Medicine allows comments under “discretionary revisions” where such observations can be made.

The same caution we recommend to non-statistical reviewers also applies to statistical experts. Statistical methods are many and varied, particularly in a general medical journal such as BMC Medicine. Some of the more specialist methods may be outside of the experience of a general statistical reviewer. Consequently they should be encouraged to recommend that the editorial office approach an additional specialist in those particular methods for further scrutiny of the article.

Conclusions

In advising the Editor on publication, reviewers are required to comment on whether a manuscript is methodologically sound and clearly written. Within that context, they are expected to provide clear, constructive feedback and opinion on study design, statistical analysis, presentation and interpretation of results. We have provided a number of indicators to assist the non-statistical reviewer in this task. If reviewers lack the appropriate background to positively confirm the appropriateness of any of the manuscript’s statistical aspects, they are encouraged to recommend it for expert statistical review.

Abbreviations

CONSORT:: Consolidated Standards of Reporting Trials
STROBE:: Strengthening the Reporting of Observational Studies in Epidemiology

References

Altman DG. Statistical reviewing for medical journals. Stat Med. 1998;17:2661–74.
Article CAS PubMed Google Scholar
Moher D. Optimal strategies to consider when peer reviewing a systematic review and meta-analysis. BMC Medicine 2015. doi: 10.1186/s12916-015-0509-y.
Chauvin A, Ravaud P, Baron G, Barnes C, Boutron I. The most important tasks for peer reviewers evaluating a randomized controlled trial are not congruent with the tasks most often requested by journal editors. BMC Med. 2015;13:158.
Article PubMed Central PubMed Google Scholar
Gardner MJ, Bond J. An exploratory study of statistical assessment of papers published in the British Medical Journal. J Am Med Assoc. 1990;263:1355–7.
Article CAS Google Scholar
Schor S, Karten I. Statistical evaluation of medical journal manuscripts. J Am Med Assoc. 1966;195:1123–8.
Article CAS Google Scholar
Murray GD. The task of a statistical referee. Br J Surg. 1988;75:664–7.
Article CAS PubMed Google Scholar
Schulz KF, Altman DG, Moher D. CONSORT 2010 Statement: updated guidelines for reporting parallel group randomised trials. BMC Med. 2010;8:18.
Article PubMed Central PubMed Google Scholar
von Elm E, Altman DG, Egger M, Pocock SJ, Gotzsche PC, Vandenbroucke JP. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. J Clin Epidemiol. 2008;61:344–9.
Article Google Scholar
Altman DG. Statistics and ethics in medical research: III How large a sample? Br Med J. 1980;281:1336–8.
Article PubMed Central CAS PubMed Google Scholar
International Committee of Medical Journal Editors. Uniform requirements for manuscripts submitted to biomedical journals. International Committee of Medical Journal Editors. Med Educ. 1999;33:66–78.
Article Google Scholar
Altman DG. Making research articles fit for purpose: structured reporting of key methods and findings. Trials. 2015;16:53.
Article PubMed Central PubMed Google Scholar
Assmann SF, Pocock SJ, Enos LE, Kasten LE. Subgroup analysis and other (mis)uses of baseline data in clinical trials. Lancet. 2000;355:1064–9.
Article CAS PubMed Google Scholar
Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;i:307–10.
Article Google Scholar
Hollis S, Campbell MJ. What is meant by intention to treat analysis? Survey of published randomised controlled trials. BMJ. 1999;319:670–4.
Article PubMed Central CAS PubMed Google Scholar
White IR, Carpenter J, Horton NJ. Including all individuals is not enough: lessons for intention-to-treat analysis. Clinical Trials. 2012;9:396–407.
Article PubMed Central PubMed Google Scholar
Gardner MJ, Altman DG. Confidence intervals rather than P values: estimation rather than hypothesis testing. Br Med J (Clin Res Ed). 1986;292:746–50.
Article CAS Google Scholar
Pyke DA. Writing and speaking in medicine. How I referee. Br Med J. 1976;2:1117–8.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

We are grateful for the detailed and constructive comments of the Editor, Sabina Alam, and the reviewers of this article, Andrea Tricco and Jaime Peters.

Author information

Authors and Affiliations

Division of Epidemiology and Biostatistics, University of Leeds, Leeds, LS2 9JT, UK
Darren C. Greenwood & Jennifer V. Freeman

Authors

Darren C. Greenwood
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer V. Freeman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Darren C. Greenwood.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

DCG wrote the first draft of the manuscript. Both authors contributed to further drafts. Both authors read and approved the final manuscript.

Authors’ information

DCG is a Senior Lecturer in Biostatistics at the University of Leeds, and a member of the Editorial Board of BMC Medicine and British Journal of Nutrition. JVF is a Senior Lecturer in Biostatistics, a Royal Statistical Society Chartered Statistician and current Vice-President of the Royal Statistical Society. Both authors have acted as subject reviewer or statistical reviewer of numerous manuscripts for a wide range of medical journals.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Greenwood, D.C., Freeman, J.V. How to spot a statistical problem: advice for a non-statistical reviewer. BMC Med 13, 270 (2015). https://doi.org/10.1186/s12916-015-0510-5

How to spot a statistical problem: advice for a non-statistical reviewer

Abstract

Introduction

Is there sufficient detail to review the statistical aspects?

Are there any common statistical issues?

Is the presentation of results appropriate?

Is the interpretation of results appropriate?

Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ information

Rights and permissions

About this article

Cite this article

Keywords

Comments

BMC Medicine

Contact us

How to spot a statistical problem: advice for a non-statistical reviewer

Abstract

Introduction

Is there sufficient detail to review the statistical aspects?

Are there any common statistical issues?

Is the presentation of results appropriate?

Is the interpretation of results appropriate?

Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Comments

BMC Medicine

Contact us