A comparison of two methods for estimating odds ratios: Results from the National Health Survey

Bakhshi, Enayatollah; Eshraghian, Mohammad R; Mohammad, Kazem; Seifi, Behjat

doi:10.1186/1471-2288-8-78

Research article
Open access
Published: 25 November 2008

BMC Medical Research Methodology volume 8, Article number: 78 (2008)

Abstract

Background

The practice of dichotomizing a continuous outcome variable does not make use of within-category information and therefore loses information. This study compared two approaches to modelling the association of sociodemographic factors and smoking with obesity in adult women in Iran.

Methods

We conducted a comparative study of two methods via an illustrative example, using data from the "National Health Survey in Iran (NHSI)" database. The sample included 14176 women aged 20–69 years. First, body mass index (BMI) was treated as a continuous variable, and odds ratios (ORs) and 95 per cent confidence intervals were calculated using the "without dichotomizing" method. Then subjects were classified as obese (BMI ≥ 30 kg/m²) or nonobese (BMI < 30 kg/m²), and a logistic regression model was used to estimate ORs and 95 per cent confidence intervals.

Results

The odds ratio estimates changed only slightly between the two methods, but the "without dichotomizing" method provided shorter confidence intervals for the odds ratio parameters than the dichotomizing method. All relative confidence interval lengths were greater than 1.15.

Conclusion

If responses are continuous then the "without dichotomizing" method is certainly more useful than the "dichotomizing" method and leads to more precise estimation of odds ratios.

Background

Over the past 20 years, the logistic regression model has become increasingly common. The parameter in logistic regression can be interpreted as a log odds ratio, which is easy for users such as physicians to understand. This model uses a categorical (dichotomous or polytomous) outcome variable. In many areas of research, however, the outcome data are continuous. Many researchers have no hesitation in dichotomizing a continuous variable, but this practice does not make use of within-category information. Several investigators have noted the disadvantages of dichotomizing [1–9].

Although Goldwasser and Fitzmaurice [10] stated that a 'direct comparison of the logistic and linear regression coefficients is not meaningful since they have different interpretations', Moser and Coombs [11] provided a closed-form relationship that allows a direct comparison between the logistic and linear regression coefficients. They also provided a procedure that allows the researcher to analyze the original continuous outcome without dichotomizing.

The aims of this paper are: (1) to demonstrate that the coefficient estimates from the "without dichotomizing (WDICH)" method have smaller variances and shorter confidence intervals than those from the dichotomizing (DICH) method; and (2) to find more efficient parameter estimates than those of the logistic regression model for the association of sociodemographic factors and smoking with obesity, using cross-sectional data from the 1999–2000 National Health Survey in Iran.

Methods

Overview of WDICH method

The WDICH method overcomes some of the disadvantages of the logistic regression model [11]. The linear regression model can be stated as follows:

y_{i} = x_{i}^{'} β + e_{i}

where e_{i} is a random error term with mean 0 and variance σ² > 0, and e_{i} and e_{j} are uncorrelated, so that cov(e_{i}, e_{j}) = 0 for all i ≠ j. Moser and Coombs supposed that the random terms e_{i} follow a logistic distribution and that the explanatory variables x_{i} follow a discrete uniform distribution. They provided an estimate of the same odds ratio parameter as the DICH method, but without loss of information [11]. The estimates obtained from WDICH are more efficient than those from the logistic model [11]. They also carried out an extensive simulation study to evaluate the robustness of this conclusion to changes in the distributions of e_{i} and x_{i} [11]. The reliability of these simulation results is assessed in this paper.
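The idea behind this relationship can be illustrated with a short simulation. The sketch below is not the code used in this study or in [11]; it assumes a single binary covariate and logistic errors with scale s, so that σ = sπ/√3. Under these assumptions the log odds of exceeding any cutoff is linear in x with slope β/s = βπ/(σ√3), so an odds ratio can be recovered from the linear regression slope and the residual standard deviation. All function names and parameter values are illustrative.

```python
import math
import random

def simulate(n, beta0, beta1, scale, seed=1):
    """Draw (x, y) pairs from y = beta0 + beta1*x + e with logistic errors."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        x = rng.randint(0, 1)                       # binary covariate
        u = min(max(rng.random(), 1e-12), 1 - 1e-12)
        e = scale * math.log(u / (1 - u))           # inverse-CDF logistic draw
        xs.append(x)
        ys.append(beta0 + beta1 * x + e)
    return xs, ys

def ols_simple(xs, ys):
    """Least-squares slope and residual standard deviation, one covariate."""
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    sxx = sum((x - xbar) ** 2 for x in xs)
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    sse = sum((y - intercept - slope * x) ** 2 for x, y in zip(xs, ys))
    sigma_hat = math.sqrt(sse / (n - 2))
    return slope, sigma_hat

def odds_ratio_without_dichotomizing(slope, sigma_hat):
    """OR implied by a linear slope when the errors are assumed logistic."""
    return math.exp(slope * math.pi / (sigma_hat * math.sqrt(3)))

# Hypothetical parameter values chosen only for illustration.
xs, ys = simulate(n=20000, beta0=25.0, beta1=2.0, scale=4.0)
slope, sigma_hat = ols_simple(xs, ys)
or_hat = odds_ratio_without_dichotomizing(slope, sigma_hat)
or_true = math.exp(2.0 / 4.0)                       # exp(beta1 / scale)
print(f"estimated OR = {or_hat:.3f}, OR implied by the simulation = {or_true:.3f}")
```

Note that the cutoff used to define the binary outcome does not appear in the formula: under the logistic-error assumption the odds ratio is the same for every cutoff.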

Data set examined

The NHSI was a survey designed to gain comprehensive knowledge and information about health care problems and difficulties throughout Iran in 1999–2000. Data from the NHSI were used in this investigation. In this study, 14176 women aged 20–69 years (8957 urban and 5219 rural) were investigated. We excluded pregnant women from the analyses. This study was approved by the Ethics Committee of the Tehran University of Medical Sciences.

Model variables

a) Response variable

Height and weight were measured rather than self-reported. BMI (Body Mass Index) was calculated as weight in kilograms divided by the square of the height in meters (kg/m²), and subjects were classified into obese (BMI ≥ 30 kg/m²) and nonobese (BMI < 30 kg/m²).
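The response definition can be written out as a small sketch. The function names are illustrative and the example measurements are hypothetical; only the formula and the 30 kg/m² cutoff come from the text.

```python
def bmi(weight_kg, height_m):
    """Body mass index: weight in kilograms over squared height in meters."""
    return weight_kg / height_m ** 2

def is_obese(weight_kg, height_m, cutoff=30.0):
    """Dichotomized response: True when BMI >= cutoff (30 kg/m^2 here)."""
    return bmi(weight_kg, height_m) >= cutoff

# Hypothetical measurements, not survey data.
print(bmi(80.0, 1.60), is_obese(80.0, 1.60))
print(bmi(65.0, 1.65), is_obese(65.0, 1.65))
```

The continuous value returned by bmi is what the WDICH method analyzes; the boolean returned by is_obese is what the DICH method analyzes.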

b) Independent variables

i. Place of residence: Urban (1) or Rural (0);

ii. Age (yr);

iii. Education: The total number of years of education;

iv. Smoking status: Smoker (1) or Nonsmoker (0);

v. Marital status: Married (1) or Non-married (0);

vi. Economic index: defined as the living area in square meters divided by the number of household members. Participants were classified by economic index into four classes: 1) low (economic index ≤ Quartile 1), 2) lower-middle (Quartile 1 < economic index ≤ Quartile 2), 3) upper-middle (Quartile 2 < economic index ≤ Quartile 3) and 4) high (economic index > Quartile 3).
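The quartile classification of the economic index can be sketched as follows. This is an illustration rather than the authors' code: the function names are hypothetical, and the quantile convention used by statistics.quantiles is an assumption, since the text does not say how the quartiles were computed.

```python
import statistics

def economic_index(area_m2, household_size):
    """Living area in square meters per household member."""
    return area_m2 / household_size

def quartile_cutpoints(values):
    """Return (Q1, Q2, Q3); the quantile convention is an assumption."""
    q1, q2, q3 = statistics.quantiles(values, n=4)
    return q1, q2, q3

def economic_class(index, q1, q2, q3):
    """Map an economic index to one of the four classes defined in the text."""
    if index <= q1:
        return "low"
    if index <= q2:
        return "lower-middle"
    if index <= q3:
        return "upper-middle"
    return "high"

# Hypothetical households: (living area in m^2, household size).
households = [(60, 6), (90, 6), (100, 5), (120, 4), (80, 2), (150, 3)]
indices = [economic_index(area, size) for area, size in households]
q1, q2, q3 = quartile_cutpoints(indices)
classes = [economic_class(value, q1, q2, q3) for value in indices]
print(list(zip(indices, classes)))
```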

Statistical analysis

First, BMI was treated as a continuous variable and expressed as a function of place of residence, age, education, smoking, marital status and economic index using the WDICH method, and ORs and 95 per cent confidence intervals were calculated. Then subjects were classified as obese (BMI ≥ 30 kg/m²) or nonobese (BMI < 30 kg/m²), and a logistic regression model was used to estimate ORs and 95 per cent confidence intervals.
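For intuition about the DICH step, consider the simplest case of one binary covariate and no adjustment. There the logistic regression estimate of the odds ratio reduces to the cross-product ratio of the 2×2 table, with a Wald interval computed on the log scale. The sketch below covers only this simplified case; the models in this study adjust for several covariates, and the counts shown are hypothetical.

```python
import math

def odds_ratio_2x2(exposed_cases, exposed_noncases,
                   unexposed_cases, unexposed_noncases, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table."""
    a, b = exposed_cases, exposed_noncases
    c, d = unexposed_cases, unexposed_noncases
    or_hat = (a * d) / (b * c)
    # Standard error of the log odds ratio.
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(or_hat) - z * se_log)
    upper = math.exp(math.log(or_hat) + z * se_log)
    return or_hat, lower, upper

# Hypothetical counts, not taken from the survey.
or_hat, lower, upper = odds_ratio_2x2(30, 70, 20, 80)
print(f"OR = {or_hat:.3f}, 95% CI: {lower:.3f} to {upper:.3f}")
```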

The two methods were compared with respect to the relative confidence interval length of the parameter estimates.
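The comparison criterion can be sketched as a ratio of interval lengths. The direction of the ratio (DICH length divided by WDICH length, so that values above 1 favour WDICH) is an assumption inferred from how the results are interpreted in this paper; the intervals below are hypothetical and are not values from Table 2.

```python
def ci_length(ci):
    """Length of a confidence interval given as (lower, upper)."""
    lower, upper = ci
    return upper - lower

def relative_ci_length(dich_ci, wdich_ci):
    """Length of the DICH interval relative to the WDICH interval."""
    return ci_length(dich_ci) / ci_length(wdich_ci)

# Hypothetical intervals for one covariate.
dich_ci = (1.00, 2.00)
wdich_ci = (1.10, 1.90)
ratio = relative_ci_length(dich_ci, wdich_ci)
print(f"relative CI length = {ratio:.2f}")
```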

Analyses were performed using Stata (Version 8.0) and R (Version 2.0.1).

Results

The distributions of age, BMI, education, marital status, economic index and smoking are shown in Table 1 in order to make the data presentation complete. The mean BMI of urban women was 26.02 kg/m² (95 percent CI: 25.92–26.12). Rural women had a mean BMI of 24.14 kg/m² (95 percent CI: 24.02–24.26).
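A mean with a 95 percent confidence interval of this kind can be computed as in the sketch below. It assumes a simple random sample and a normal approximation; the text does not state how the reported intervals were obtained, and the BMI values used here are hypothetical.

```python
import math
import statistics

def mean_with_ci(values, z=1.96):
    """Sample mean with a normal-approximation confidence interval."""
    n = len(values)
    mean = statistics.mean(values)
    se = statistics.stdev(values) / math.sqrt(n)    # standard error of the mean
    return mean, mean - z * se, mean + z * se

# Hypothetical BMI values in kg/m^2, not survey data.
sample = [22.4, 27.9, 31.2, 24.8, 26.1, 29.5, 23.3, 25.7]
mean, lower, upper = mean_with_ci(sample)
print(f"mean BMI = {mean:.2f} (95% CI: {lower:.2f} to {upper:.2f})")
```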

Table 1 Characteristics of the analytical sample by place of residence in 14176 Iranian women, 1999–2000

Results in Table 2 were obtained by fitting models with the DICH and WDICH methods. The two methods produced different confidence intervals, although the odds ratios were similar. The odds ratio estimates from the WDICH method had smaller variances and shorter confidence intervals than those from the DICH method. The mathematical proof and simulation results can be found in Moser and Coombs [11].

Table 2 Adjusted odds ratios for obesity and confidence intervals using two methods for the National Health Survey

Explanation of results from Table 2 (WDICH)

❑ Urban women had significantly higher odds of obesity than their rural counterparts (OR = 2.041, 95% CI: 1.916 – 2.914).

❑ Age was directly associated with obesity (OR = 1.03, 95% CI: 1.026–1.032).

❑ Education was inversely associated with obesity (OR = 0.99, 95% CI: 0.979–0.996).

❑ Nonsmoking women were more likely to be obese than smokers. The obesity odds ratio was 0.69 (95 percent CI: 0.553–0.856) for smokers compared with nonsmokers.

❑ Married women had significantly higher odds of obesity than their non-married counterparts (OR = 1.24, 95% CI: 1.150 – 1.323).

❑ An association was observed between economic index and obesity. Using low as the reference group, obesity odds ratios were 1.36 (95 percent CI: 1.246–1.475), 1.31 (95 percent CI: 1.205–1.426) and 1.29 (95 percent CI: 1.155–1.443) for the lower-middle, upper-middle and high groups respectively.
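Odds ratios for continuous covariates such as age and education refer to a one-unit (one-year) difference, which is why they lie close to 1. As a hedged illustration, an odds ratio per year can be rescaled to a larger increment by raising it to the corresponding power, assuming the effect is linear on the log-odds scale as the model implies. The function name is illustrative; the inputs are the age estimates quoted above.

```python
import math

def rescale_odds_ratio(or_per_unit, units):
    """Odds ratio for a difference of `units`, given the per-unit odds ratio."""
    return math.exp(units * math.log(or_per_unit))

# Age estimate and interval limits reported above (per year of age).
age_or_per_year = 1.03
age_ci_per_year = (1.026, 1.032)

age_or_per_decade = rescale_odds_ratio(age_or_per_year, 10)
age_ci_per_decade = tuple(rescale_odds_ratio(limit, 10) for limit in age_ci_per_year)
print(f"OR per 10 years = {age_or_per_decade:.2f}, "
      f"95% CI: {age_ci_per_decade[0]:.2f} to {age_ci_per_decade[1]:.2f}")
```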

Discussion

Dichotomizing the primary outcome variable may result in loss of information. We conducted a comparative study between two methods via an illustrative example, using data from the NHSI database. It included 14176 women aged 20–69 years. OR estimates and 95 per cent confidence intervals were calculated using both the DICH method and WDICH method. Overall, we obtained similar parameter estimates from DICH and WDICH methods. But the odds ratio estimate from the WDICH method had smaller variances and shorter confidence intervals than the DICH method. Our results indicated the improvement of the WDICH method over the DICH method because for all covariates the relative confidence interval length was greater than 1.15. Our results were consistent with the findings by Moser and Coombs [11] showing the greater efficiency of parameter estimates from WDICH method in comparison to DICH method.

In our study, there was a positive association between age and obesity. Our results are consistent with most studies [12–15].

In most studies, women with lower education were more obese than those with higher education. Our results were consistent with these studies [16–20].

We observed an inverse association between smoking and obesity. Most studies report that smoking is associated with lower relative weight [21–25]. Our findings are basically in line with these studies.

We found that non-married women were less likely to be obese than their married counterparts. Our results are consistent with most studies [26, 27].

We found a statistically significant association between economic index level and obesity in women. Women in the low economic index group were leaner than those in the other groups. Our findings are consistent with a study from a developing country [28].

One limitation of this study is the cross-sectional nature of the NHSI dataset, which means that we cannot draw definitive conclusions about the direction of causality. Another limitation is that physical activity and income were not included in our investigation. A further limitation is that marital status could only be categorized as legally married or non-married.

Our study had several strengths. It was performed in a nationally representative sample of Iranian women. Height and weight were measured rather than self-reported. It is well known that self-reports underestimate the prevalence of obesity [29, 30].

Conclusion

The WDICH method is useful for estimating odds ratios and provides more efficient parameter estimates than the DICH method when responses are continuous. When the outcome is a continuous variable, it should not be treated as a binary variable.

Abbreviations

BMI: body mass index
OR: odds ratio
NHSI: National Health Survey in Iran
WDICH: without dichotomizing
DICH: dichotomizing

References

1. Zhao LP, Kolonel LN: Efficiency loss from categorizing quantitative exposures into qualitative exposures in case-control studies. American Journal of Epidemiology. 1992, 136: 464-474.
2. MacCallum RC, Zhang S, Preacher KJ, Rucker DD: On the practice of dichotomization of quantitative variables. Psychological Methods. 2002, 7: 19-40. 10.1037/1082-989X.7.1.19.
3. Cohen J: The cost of dichotomization. Applied Psychological Measurement. 1983, 7: 249-253. 10.1177/014662168300700301.
4. Greenland S: Avoiding power loss associated with categorization and ordinal scores in dose-response and trend analysis. Epidemiology. 1995, 6: 450-454.
5. Austin PC, Brunner LJ: Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses. Statistics in Medicine. 2004, 23: 1159-1178. 10.1002/sim.1687.
6. Vargha A, Rudas T, Delaney HD, Maxwell SE: Dichotomization, partial correlation, and conditional independence. Journal of Educational and Behavioral Statistics. 1996, 21: 264-282.
7. Maxwell SE, Delaney HD: Bivariate median splits and spurious statistical significance. Psychological Bulletin. 1993, 113: 181-190. 10.1037/0033-2909.113.1.181.
8. Streiner DL: Breaking up is hard to do: the heartbreak of dichotomizing continuous data. Can J Psychiatry. 2002, 47 (3): 262-266.
9. Chen H, Cohen P, Chen S: Biased odds ratios from dichotomization of age. Statistics in Medicine. 2007, 26: 3487-3497. 10.1002/sim.2737.
10. Goldwasser MA, Fitzmaurice GM: Multivariate linear regression analysis of childhood psychopathology using multiple informant data. International Journal of Methods in Psychiatric Research. 2001, 10: 1-10. 10.1002/mpr.95.
11. Moser BK, Coombs LP: Odds ratios for a continuous outcome variable without dichotomizing. Statistics in Medicine. 2004, 23: 1843-60. 10.1002/sim.1776.
12. Ogden CL, Carroll MD, Curtin LR, McDowell MA, Tabak CJ, Flegal KM: Prevalence of overweight and obesity in the United States, 1999–2004. JAMA. 2006, 295 (13): 1549-55. 10.1001/jama.295.13.1549.
13. Flegal KM, Carroll MD, Kuczmarski RJ, Johnson CL: Overweight and obesity in the United States: prevalence and trends, 1960–1994. Int J Obes. 1998, 22: 39-47. 10.1038/sj.ijo.0800541.
14. Lewis CE, Jacobs DR, McCreath H, et al: Weight gain continues in the 1990s: 10 year trends in weight and overweight from the CARDIA Study. Am J Epidemiol. 2000, 151 (12): 1172-81.
15. Bagust A, Roberts BL, Haycox AK, Barrow S: The additional cost of obesity to the health service and the potential for resource savings from effective interventions. Eur J Public Health. 1999, 9: 258-64. 10.1093/eurpub/9.4.258.
16. Klumbien J, Petkeviciene J, Helasoja V, Prattala R, Kasmel A: Sociodemographic and health behaviour factors associated with obesity in adult populations in Estonia, Finland and Lithuania. Eur J Public Health. 2004, 14: 390-4. 10.1093/eurpub/14.4.390.
17. Sarlio-Lahteenkorva S, Lahelma E: The association of body mass index with social and economic disadvantage in women and men. Int J Epidemiol. 1999, 28: 445-9. 10.1093/ije/28.3.445.
18. Laurier D, Guiguet M, Chau NP, Wells JA, Valleron AJ: Prevalence of obesity: a comparative survey in France, United Kingdom and the United States. Int J Obes Relat Metab Disord. 1992, 16 (8): 565-572.
19. Molarius A, Seidell JC, Sans S, Tuomilehto J, Kuulasmaa K: Educational level, relative body weight, and changes in their association over 10 years: an international perspective from the WHO MONICA Project. Am J Public Health. 2000, 90: 1260-8. 10.2105/AJPH.90.8.1260.
20. Wamala SP, Wolk A, Orth-Gomer K: Determinants of obesity in relation to socioeconomic status among middle-aged Swedish women. Prev Med. 1997, 26: 734-44. 10.1006/pmed.1997.0199.
21. Jeffery RW, Forster JL, Folsom AR, Luepker RV, Jacobs DR, Blackburn H: The relationship between social status and body mass index in the Minnesota Heart Health Program. Int J Obes. 1989, 13: 59-67.
22. Kawada T: Difference of body mass index stratified by the period of smoking cessation from a cross-sectional study. Arch Med Res. 2004, 35: 181-4. 10.1016/j.arcmed.2003.09.012.
23. Laaksonen M, Rahkonen O, Prattala R: Smoking status and relative weight by educational level in Finland, 1978–1995. Prev Med. 1998, 27: 431-7. 10.1006/pmed.1998.0288.
24. Rasky E, Stronegger WJ, Freidl W: The relationship between body weight and patterns of smoking in women and men. Int J Epidemiol. 1996, 25: 1208-12. 10.1093/ije/25.6.1208.
25. Molarius A, Seidell JC, Sans S, Tuomilehto J, Kuulasmaa K: Smoking and relative body weight: an international perspective from the WHO MONICA Project. J Epidemiol Community Health. 1997, 51: 252-60. 10.1136/jech.51.3.252.
26. Jeffery RW, Rick AM: Cross-sectional and longitudinal associations between body mass index and marriage-related factors. Obes Res. 2002, 10: 809-15. 10.1038/oby.2002.109.
27. Sobal J, Rauschenbach B, Frongillo EA: Marital status changes and body weight changes: a US longitudinal analysis. Soc Sci Med. 2002, 56: 1543-55. 10.1016/S0277-9536(02)00155-7.
28. Griffiths P, Bently M: Women of higher socio-economic status are more likely to be overweight in Karnataka, India. Eur J Clin Nutr. 2005, 95: 1217-20. 10.1038/sj.ejcn.1602228.
29. Bostrom G, Diderichsen F: Socioeconomic differences in misclassification of height, weight and body mass index based on questionnaire data. Int J Epidemiol. 1997, 26: 860-6. 10.1093/ije/26.4.860.
30. Kuskowska-Wolk A, Bergstrom R, Bostrom G: Relationship between questionnaire data and medical records of height, weight and body mass index. Int J Obes Relat Metab Disord. 1992, 16 (1): 1-9.

Pre-publication history

The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2288/8/78/prepub

Acknowledgements

This study was financed by a grant from Tehran University/Medical Sciences. The authors acknowledge the National Health Survey for their data, coordinated at the Department of Biostatistics, School of Public Health and Institute of Public Health Research, Tehran University/Medical Science, Iran.

Author information

Authors and Affiliations

Department of Biostatistics, School of Public Health and Institute of Public Health Research, Tehran University/Medical Sciences, Iran
Enayatollah Bakhshi, Mohammad R Eshraghian & Kazem Mohammad
Department of Physiology, Medicine School, Tehran University/Medical Sciences, Iran
Behjat Seifi

Corresponding author

Correspondence to Kazem Mohammad.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

EB, KM and MRE originated the idea for this study, did the research proposal, data analysis and prepared the manuscript. BS helped and edited the final version as the medical consultant. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Bakhshi, E., Eshraghian, M.R., Mohammad, K. et al. A comparison of two methods for estimating odds ratios: Results from the National Health Survey. BMC Med Res Methodol 8, 78 (2008). https://doi.org/10.1186/1471-2288-8-78

Received: 06 July 2008
Accepted: 25 November 2008
Published: 25 November 2008
DOI: https://doi.org/10.1186/1471-2288-8-78
