Abstract
Background
Crosssectional studies with binary outcomes analyzed by logistic regression are frequent in the epidemiological literature. However, the odds ratio can importantly overestimate the prevalence ratio, the measure of choice in these studies. Also, controlling for confounding is not equivalent for the two measures. In this paper we explore alternatives for modeling data of such studies with techniques that directly estimate the prevalence ratio.
Methods
We compared Cox regression with constant time at risk, Poisson regression and logbinomial regression against the standard MantelHaenszel estimators. Models with robust variance estimators in Cox and Poisson regressions and variance corrected by the scale parameter in Poisson regression were also evaluated.
Results
Three outcomes, from a crosssectional study carried out in Pelotas, Brazil, with different levels of prevalence were explored: weightforage deficit (4%), asthma (31%) and mother in a paid job (52%). Unadjusted Cox/Poisson regression and Poisson regression with scale parameter adjusted by deviance performed worst in terms of interval estimates. Poisson regression with scale parameter adjusted by χ^{2 }showed variable performance depending on the outcome prevalence. Cox/Poisson regression with robust variance, and logbinomial regression performed equally well when the model was correctly specified.
Conclusions
Cox or Poisson regression with robust variance and logbinomial regression provide correct estimates and are a better alternative for the analysis of crosssectional studies with binary outcomes than logistic regression, since the prevalence ratio is more interpretable and easier to communicate to nonspecialists than the odds ratio. However, precautions are needed to avoid estimation problems in specific situations.
Keywords:
Cox regression; crosssectional studies; logistic regression; odds ratio; Poisson regression; prevalence ratio; robust variance; statistical modelsBackground
Epidemiologic studies found in the literature are frequently crosssectional, as this is a simple, fast and inexpensive design alternative. Often the outcomes are binary, and logistic regression is used for the analysis. This results in the odds ratio being frequently reported in situations where incidence or prevalence ratios are estimable, despite the fact that it is "biologically interpretable only insofar as it estimates the incidenceproportion or incidencedensity ratio" [1].
From a survey done by the authors in the International Journal of Epidemiology and in the Revista de Saúde Pública (São Paulo, Brazil) published in 1998, 221 original articles were found. Among these, 110 (50%) were based on crosssectional studies, and 45 (20%) on longitudinal studies. Logistic regression was used for the analysis of 37 (34%) and 10 (22%) of these studies, respectively. We have, therefore, that an important proportion of such studies end up reporting odds ratios, the effect measure yielded by logistic regression, rather than prevalence or incidence ratios.
The use of odds ratios is absolutely correct. There is nothing intrinsically wrong with them. But, when working with frequent outcomes, what is common in crosssectional studies, the odds ratio can strongly overestimate the prevalence ratio. Here resides the most common mistake associated with odds ratios in our experience: the authors "forget" what their measure of association is and make interpretations such as "the exposed group has a risk of illness four times greater than the nonexposed group". The relative risk interpretation given to the odds ratio can be misleading, in theoretical and practical terms, especially if used for definition of policy priorities in conjunction with other true relative risks [14].
Additionally, logistic regression is often used for the sake of control of confounding and adjustment of interactions. But confounding and interaction are dependent on the measure of effect, so that controlling for confounding for the odds ratio is not the same thing as doing so for the prevalence ratio [5,6]. Therefore, interpreting the odds ratio as if it were a prevalence ratio is inadequate not only in terms of the possible overestimation, but also because confounding may not be appropriately controlled.
Several alternatives have been discussed in the literature for the analysis of binary outcomes in crosssectional (or longitudinal) studies using the prevalence ratio rather than the odds ratio. The simplest way is to transform the odds ratios obtained by logistic regression into prevalence ratios [79]. Another possibility is to use a statistical model that estimates directly the prevalence ratio and its confidence interval. Alternatives explored in the epidemiological literature are Cox regression with equal times of followup assigned to all individuals [10], logbinomial regression (a generalized linear model with a logarithmic link function and binomial distribution for the residual) [7,1113], Poisson regression [13] and complementary loglog model, where the link function is log( log(1  π)) and the distribution is binomial [13,14].
The authors that contributed to the discussion have not reached a conclusion on which would be the best approach, and only three publications make some comparison among available alternatives [4,13,15]. Possible fixes for the problems related to confidence intervals in some of the techniques proposed were dealt with to some extent by one of the papers [13], where the conclusion is that "there are no valid reasons for the systematic choice of odds ratio and of the logistic regression model to estimate prevalence rate ratios unless the type of study imperatively requires their use."
We have applied, in the context of a crosssectional study, logbinomial regression, Cox regression, and Poisson regression. Corrections for the standard errors were included for Cox and Poisson regressions. Also, to increase the applicability of the results, a confounder was always included in the scenarios studied, which involved outcomes of varying prevalences. The point and interval estimates obtained with each approach were compared to the standard MantelHaenszellike prevalence ratios and confidence interval estimators.
Methods
Using Cox regression to analyze a binary outcome in a crosssectional study was suggested by Lee & Chia [10], and assessed by others [4,15]. Usually, Cox regression is used to analyze timetoevent data, that is, the response is the time an individual takes to present the outcome of interest. Individuals that never get ill are assigned the total length of time of the followup, and are treated as censored, meaning that it is not known when they will get ill, but at least until the time of the end of the followup they are well. Individuals lost to followup are treated in a similar way. Cox regression estimates the hazard rate function that expresses how the hazard rate depends upon a set of covariates. The model formulation is
h(t) = h_{0}(t) exp(β_{1}z_{1 }+ ... + β_{k}z_{k })(1)
where h_{0}(t) is the base hazard function of time, z_{i }are covariates and β_{i}, the coefficients for the k covariates. The Cox model treats h_{0}(t) as a nuisance function and actually does not estimate it [16].
When a constant risk period is assigned to everyone in the cohort, the hazard rate ratio estimated by Cox regression equals the cumulative incidence ratio in longitudinal studies, or the prevalence ratio in crosssectional studies [17,18]. Although this model can produce correct point estimates, the underlying distribution of the response is Poisson. As prevalence data in a crosssectional study follow a binomial distribution, the variance of the coefficients tends to be overestimated, resulting in wider confidence intervals compared to those based on the binomial distribution. This is easily explained by comparing the binomial variance, p(1p), with a maximum of 0,25 when p = 0,5 with Poisson variance, λ, that grows steadily with the intensity of the process. That is, the variance estimated by the Poisson model will be very close to the binomial variance when the outcome is rare, but will be increasingly greater as the outcome becomes more frequent. In such a situation we have underdispersion, the opposite to the more commonly observed overdispersion, where the data is more dispersed than the model predicts.
It is possible to improve the situation using the robust variance estimates proposed by Lin & Wei [19], similar to other robust sandwich estimators proposed for parametric models, such as Huber's sandwich estimator [20]. In this paper, Cox regression with equal followup times was assessed, with standard and robust variance estimates.
Poisson regression is commonly used in epidemiology to analyze longitudinal studies where the outcome is a count of episodes of an illness occurring over time (e.g. episodes of diarrhea). The model formulation is
where n is the count of events for a given individual, t the time it was followedup, and X_{i }the covariates. The model parameters (β_{i}) are log relative risks. In this context, Poisson regression is equivalent to Cox regression [21], and the parameters estimated are the same.
As described for Cox regression, the prevalence ratio is directly estimated by the model, and the confidence intervals are wider than those provided by a binomial model. A simple remedy is to multiply the estimated Poisson variance by some estimate of underdispersion (or overdispersion). These estimates can be based on the deviance or the chisquare of the model, dividing these quantities by the residual degrees of freedom [22,23]. In practice, this ratio is used as a scale parameter, replacing the original Poisson value of 1. A robust variance estimate is also available for the Poisson model, based on the Huber sandwich estimate [20] (which again yields results that are equal to Cox regression with robust variance). This alternative is known to underestimate the true variability with moderately sized samples, while adjusting the scale parameter tends to overestimate it. Other alternatives would be jackknife and bootstrap variance estimates [23]. We decided not to use the latter alternatives as they are not directly available in standard statistical software. Thus, Poisson regression was used in this paper with unadjusted variances, with scale parameter adjustment for both deviance and chisquare statistics, and with robust variance estimates.
The last model assessed was the logbinomial model [15] – a generalized linear model where the link function is the logarithm of the proportion under study and the distribution of the error is binomial [4,7,11,12,15]. The measure of effect in this model is also the relative risk.
For k covariates the model is written as
log(π) = β_{0 }+β_{1}X_{1 }+ ... + β_{k}X_{k }(3)
where π is the probability of success (e. g., the proportion of sick persons in a group), and X_{i }the covariates. The relative risk estimate of a given covariate is e^{β}.
Since log(π) must be in the interval ∞ to 0, restrictions in the estimation process have to be used to avoid predicting probabilities out of the [0,1] interval. When estimates are on the boundaries of the valid parameter space, the estimates of the NewtonRaphson method will not converge to the maximum likelihood estimates [24]. Convergence problems in the estimation process are most likely to happen when the model contains a continuous covariate or multiple politomic covariates, or the outcome prevalence is high [12,24]. When the estimates are not on the boundary of the parameter space, convergence problems may still happen, and better starting values for the estimation process than the default used by the software will help. Most logbinomial models fitted in this paper used the default Stata estimation options, without convergence problems. In one case, when the model failed to converge, the "search" option, which makes the procedure search for a better starting value, was used [25].
The results obtained from the various models were compared to the pooled MantelHaenszellike prevalence ratios (MHPR) and corresponding confidence intervals, used here as the reference results. MantelHaenszel estimates are easy to obtain in simple situations such as the ones dealt with in this work (one exposure and one confounder). However, for more complex situations, their estimation is more complicated and the use of statistical models is more efficient.
All the analyses were performed with Stata 7.0 [25], and the actual command lines used are listed below. Each outcomeexposureconfounder combination was represented by one row in the dataset and its frequency given by the variable freq.
*** MH relative risk
cs ill exposed [fweight = freq], by(confounder)
*** Poisson regression unadjusted
poisson ill exposed confounder [fweight = freq], irr
*** Poisson regression adjusted by chisquared
glm ill exposed confounder [fweight = freq], family(poisson) scale(x2) eform
*** Poisson regression adjusted by deviance
glm ill exposed confounder [fweight = freq], family(poisson) scale(dev) eform irls
*** Poisson regression with robust variance
poisson ill exposed confounder [fweight = freq], irr r
*** Logbinomial regression
glm ill exposed confounder [fweight = freq], family(binomial) link(log) eform
*** Odds ratio from logistic regression
logistic ill exposed confounder [fweight = freq]
For the comparison of the above techniques, real data from a populationbased survey were used. A birth cohort was initiated in 1993, including all births happening in Pelotas, Southern Brazil [26]. These children were seen at birth, and their mothers interviewed. At 1 and 3 months of age, a subsample of 655 were sought for followup information. At 6 and 12 months, a larger subsample (including the 655 seen at 1 and 3 months) was sought, that comprised all children born with low birthweight and 20% of the remaining children. At these points, 1363 children were sought. The data used in this work came from another visit done between November 1997 and April 1998, when the children were 4–5 yearsold. The children sought were the same as those in the 12month revisit, and 1273 (93%) were actually interviewed. From the 90 children lost to followup, 61 (68%) had moved to other towns, 18 (20%) could not be found, 6 (7%) had died, and 5 (6%) refused to participate. The children were submitted to a nutritional assessment (weight and height) and their mothers answered a standardized precoded questionnaire including information on socioeconomic, demographic, reproductive, and health characteristics.
Three outcomes with different prevalences were used in the analyses, each in conjunction with a risk factor and a confounding factor, in a way to form 3 distinct situations. The three sets of variables used respectively as outcome, risk factor and confounder were: situation 1 – underweight (weight for age Zscore < 2), previous hospitalization and birth weight; situation 2 – asthma (asthma or bronchitis reported by the person responsible for the child in study), whether mother smoked and social class; situation 3 – status of maternal employment (whether or not in a paid job), father living with the family and social class. All variables were made dichotomous in order to simplify the comparisons and understanding of the models.
In order to widen the scenarios available, each set of variables was manipulated to increase the level of confounding. This was achieved by arbitrary changes in the prevalences of the risk and confounding factors, reweighting the relevant strata in the data in a way to keep the sample size constant. The original and manipulated data are fully presented in the results section.
Confounding was measured by the proportional change from the crude prevalence ratio to the adjusted (MantelHaenszel) prevalence ratio using the expression , so that whenever the adjusted prevalence ratio was smaller than the crude, confounding was negative.
Results
Underweight was the least common outcome studied, with a prevalence of 4.1%. Asthma (prevalence = 31.2%), was intermediate and mother in a paid job was the commonest (prevalence = 51.4%). In the modified situation 1, underweight prevalence was increased in the low birth weight group from 7.8% to 10.4%, changing confounding from 14 to 18%. The overall prevalence of underweight changed to 4.9% (Tables 1 and 2). In situation 2, the prevalence of asthma was reduced from 22.7% to 12.7% in the upper class and increased from 38.1% to 52.1% in the lower class. Confounding changed from 8% to 17%. Overall asthma prevalence changed from 31.2% to 34.3% (Tables 3 and 4). In situation 3, confounding was increased by changing both the prevalences of the exposure and the outcome, achieving an increase from 4% to 25%. This was the only situation where the test for heterogeneity of prevalence ratios across strata was significant, indicating an interaction (Tables 5 and 6).
Table 1. Absolute frequencies, outcome prevalences, exposure prevalences, crude and pooled prevalence ratio (PR) estimates, and relative confounding for the analysis of the original data using underweight (weight for age Zscore < 2) as the outcome, previous hospitalization as the risk factor and low birth weight as confounder (situation 1 original).
Table 2. Absolute frequencies, outcome prevalences, exposure prevalences, crude and pooled prevalence ratio (PR) estimates, and relative confounding for the analysis of the modified data using underweight (weight for age Zscore <  2) as the outcome, previous hospitalization as the risk factor and low birth weight as confounder (situation 1 modified).
Table 3. Absolute frequencies, outcome prevalences, exposure prevalences, crude and pooled prevalence ratio (PR) estimates, and relative confounding for the analysis of the original data using asthma as the outcome, maternal smoking as the risk factor and social class as confounder (situation 2 original).
Table 4. Absolute frequencies, outcome prevalences, exposure prevalences, crude and pooled prevalence ratio (PR) estimates, and relative confounding for the analysis of the modified data using asthma as the outcome, maternal smoking as the risk factor and social class as confounder (situation 2 modified).
Comparing the results of the different models in situation 1 (Table 7), we see that the point estimates obtained with Cox, Poisson and logbinomial models are very close to the MantelHaenszel prevalence ratio (MHPR) for the original and modified data. In terms of the confidence intervals, the differences between Cox, Poisson and logbinomial models and the reference were less than 5%, except for Poisson scaled by deviance, where the CIs were approximately 50% narrower.
Situation 2 (Table 8) was similar to situation 1 in terms of point estimates. Confidence intervals were strongly overestimated by unadjusted Cox/Poisson models. In the modified data, the 95%CI was overestimated by 11% by Poisson regression with scale parameter adjusted by χ^{2}.
In situation 3 (Table 9), the outcome prevalence was highest, and there was a significant interaction between risk factor and confounder. Ignoring the interaction (i. e. using a misspecified model), the logbinomial model performed slightly worse than the Cox and Poisson models in relation to the point estimates. The latter presented a maximum difference of 2% compared to the MHPR, while the logbinomial estimates were up to 8.7% greater. In terms of interval estimates, only the Cox/Poisson models with robust variance presented differences less than 5% for both the original and modified data. An interaction term was included in the robust Poisson and logbinomial regressions. In the original situation, identical results (up to the third decimal place) were obtained from both models, matching the stratumspecific relative risks and confidence intervals. However, in the modified situation, the logbinomial model did not converge, while the robust Poisson model again reproduced the stratumspecific estimates. A common reason for nonconvergence is inappropriate starting values for model parameters. Stata's option "search", which specifies that the command "glm" should search for good starting values, solved the problem and, with this option, the results obtained from the logbinomial model were again virtually identical to Poisson regression.
A graphical summary of the results concerning the interval estimates is shown in Figure 1. It is clear from the figure that robust Poisson/Cox regression and logbinomial regression are the best performers, consistently producing the confidence intervals with amplitude closest to the reference. In third place, but with some confidence intervals nearly 20% wider than the reference, ranked Poisson regression with the scale parameter adjusted by χ^{2}.
Figure 1. Comparison of the relative differences between the 95% confidence intervals obtained by unadjusted Poisson/Cox regression, Poisson regression with scale factor adjusted by χ^{2 }and deviance, Poisson/Cox regression with robust variances and logbinomial regression and the Cornfield 95% confidence interval for each of the six situations studied. S1a (outcome prevalence / confounding): 4.1% / 14%; S1B: 4.9% / 18%; S2a: 31.2% / 8%; S2b: 34% / 17%; S3a: 51% / 4%; S3b: 54% / 25%.
Table 5. Absolute frequencies, outcome prevalences, exposure prevalences, crude and pooled prevalence ratio (PR) estimates, and relative confounding for the analysis of the original data using mother in a paid job as the outcome, father living with the family as the risk factor and social class as confounder (situation 3 original).
Table 6. Absolute frequencies, outcome prevalences, exposure prevalences, crude and pooled prevalence ratio (PR) estimates, and relative confounding for the analysis of the modified data using mother in a paid job as the outcome, father living with the family as the risk factor and social class as confounder (situation 3 modified).
Discussion
The literature on the different alternatives to analyze crosssectional or longitudinal data using prevalence (or cumulative incidence) ratios instead of odd ratios has not yet proposed a strategy that produces both point and interval acceptable estimates. To our knowledge this is the first paper to focus on different strategies and compare them to a suitable reference in terms of the prevalence ratios and confidence intervals obtained.
We have shown that there are several alternatives available that will provide very good results in terms of point estimates: Cox, Poisson and logbinomial regression. The case of interval estimates is more complicated, as some models will overestimate or underestimate them, in different situations. Even so, we are still left with three viable alternatives: logbinomial regression, Cox/Poisson regression with robust variance, and Poisson regression with scale parameter adjusted by χ^{2}.
One limitation of this work is not having dealt with continuous covariates. The main reason was the reference used. The MantelHaenszel techniques work for categorical variables only. Furthermore, most epidemiological analyses involve only categorical variables. The main problem with this omission is that continuous variables are a potential cause for model misbehavior, that is, the logbinomial model not converging, and the Poisson model producing estimates of individual probabilities greater than 1. This situation happens when the estimates are on the boundary of the parameter space, and is illustrated with the artificial data presented by Deddens in a paper where a simple strategy, the COPY method, was proposed to achieve convergence when fitting logbinomial models in such a case [27].
The logbinomial regression, used without any correction to the standard errors, presented results that were equivalent to those yielded by robust Poisson/Cox regression in situations 1 and 2. In situation 3, where an interaction was ignored, the model tended to present confidence intervals that were too narrow (up to 10.4%) compared to the reference, and slightly different point estimates. This situation was included in this exercise to present a scenario with a misspecified model, situation that is bound to happen in reported analyses, as failing to look for or correctly identifying interactions is not infrequent. When the correct model (including the interaction) was fitted, the results were again equivalent to those yielded by the reference and robust Poisson/Cox regression in the original data. In the modified data the model failed to converge, what was solved by using better starting values for the estimation procedure. In situations where the estimates are on the boundary of the parameter space the model will not converge, unless a strategy such as the COPY method is used [27].
Cox regression has been suggested as an alternative to logistic regression but the problems with the variance estimates were not dealt with [4,10,15]. As expected, we showed that confidence intervals can be strongly overestimated (up to 69% in our examples using real data). The use of robust variance estimates [19], as we proposed, improved variance estimation considerably, limiting the difference relative to the reference confidence interval to less than 3% in the studied examples. Poisson regression, as mentioned before, works similarly, and has the advantage over Cox regression of using a command syntax similar to linear and logistic regressions in Stata.
The use of Poisson regression offers still other alternatives by means of changing the scale parameter to correct the standard errors when over or underdispersion is observed [22]. In the set of situations we presented, correction by the Pearson χ^{2 }was superior to correction by the deviance, and, although not as good as robust estimates, represented a considerable improvement in relation to the uncorrected standard errors. The maximum observed difference relative to the reference confidence intervals was17%. Poisson regression, however, can also present problems when the estimates are on the boundaries of the parameter space, as mentioned above. It is strongly advisable that the individual probabilities are calculated (Stata's "predict" command will do that) and examined.
We have used the robust Poisson model in the analysis of several epidemiological studies, three of which have been already published [2830]. In all cases we have used the same modeling strategy with logistic regression and robust Poisson regression. In these real situations the final sets of selected variables were the same, and the differences in model parameters within the expected between odds ratios and prevalence ratios. Until more experience is gathered, this may be a useful strategy to help identify anomalous results with robust Poisson regression, along with assessing the predicted individual probabilities.
The rapid and continuous evolution of statistical software means that most packages will perform at least one of the analyses that performed best in this exercise. Stata 7.0, used here, and widely employed in epidemiology research groups, can perform them all. It was not possible for us to assess other packages in terms of what they can do, and how to do it, as software like SPSS, SAS, SPlus, among others, were not available to us.
Table 7. Comparison of prevalence ratios and respective confidence interval estimates (obtained by unadjusted Poisson/Cox regression, Poisson regression with scale factor adjusted by χ^{2 }and deviance, Poisson/Cox regression with robust variances, logbinomial regression and logistic regression) and odds ratio with confidence interval estimate (obtained by logistic regression) with the MantelHaenszel prevalence ratio in the analysis of the original and modified data using underweight (weight for age Zscore < 2) as the outcome, previous hospitalization as the risk factor and low birth weight as confounder (situation1).
Conclusions
We have shown that the use of Cox or Poisson regression without any adjustment for the analysis of crosssectional data, as suggested sometimes in the literature, may lead to large errors in interval estimates. On the other hand, taken the precautions discussed in the paper, the logbinomial model and the Cox or Poisson models with adjusted variances provide correct point and interval estimates. It is, therefore, not only possible, but actually easy to use other models than logistic regression to analyze crosssectional (or longitudinal) data with binary outcomes, the advantage being the prevalence (or cumulative incidence) ratio as the measure of association, more interpretable and easier to communicate, especially to nonepidemiologists. It is for the analyst to choose among these methods, based on software availability and the analyst's training.
Table 8. Comparison of prevalence ratios and respective confidence interval estimates (obtained by unadjusted Poisson/Cox regression, Poisson regression with scale factor adjusted by χ^{2 }and deviance, Poisson/Cox regression with robust variances, logbinomial regression and logistic regression) and odds ratio with confidence interval estimate (obtained by logistic regression) with the MantelHaenszel prevalence ratio in the analysis of the original and modified data using asthma as the outcome, maternal smoking as the risk factor and social class as confounder (situation 2).
Table 9. Comparison of prevalence ratios and respective confidence interval estimates (obtained by unadjusted Poisson/Cox regression, Poisson regression with scale factor adjusted by χ^{2 }and deviance, Poisson/Cox regression with robust variances, logbinomial regression and logistic regression) and odds ratio with confidence interval estimate (obtained by logistic regression) with the MantelHaenszel prevalence ratio in the analysis of the original and modified data using mother in a paid job as the outcome, father living with the family as the risk factor and social class as confounder (situation3).
Competing interests
None declared.
Author's contributions
AB proposed the idea, carried out part of the literature review and modeling, and drafted the manuscript. VH carried out most of the literature review, analyses, and prepared the tables and figures.
Acknowledgements
We thank the Brazilian Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for supporting this work, and Drs. Cesar G. Victora, Bernardo L. Horta and J. Norberto W. Dachs for the suggestions and encouragement.
References

Greenland S: Interpretation and choice of effect measures in epidemiologic analyses.
American Journal of Epidemiology 1987, 125:761768. PubMed Abstract

Savitz DA: Measurements, estimates, and inferences in reporting epidemiologic study results [editorial].
American Journal of Epidemiology 1992, 135:223224. PubMed Abstract

Nurminen M: To use or not to use the odds ratio in epidemiologic analyses.
European Journal of Epidemiology 1995, 11:365371. PubMed Abstract

Thompson ML, Myers JE, Kriebel D: Prevalence odds ratio or prevalence ratio in the analysis of cross sectional data: what is to be done?
Occupational and Environmental Medicine 1998, 55:272277. PubMed Abstract  Publisher Full Text

Miettinen OS, Cook EF: Confounding: essence and detection.
Am J Epidemiol 1981, 114:593603. PubMed Abstract

Axelson O, Fredriksson M, Ekberg K: Use of the prevalence ratio v the prevalence odds ratio as a measure of risk in cross sectional studies [letter; comment].
Occup Environ Med 1994, 51:574. PubMed Abstract

Zocchetti C, Consonni D, Bertazzi PA: Estimation of prevalence rate ratios from crosssectional data [letter; comment].
International Journal of Epidemiology 1995, 24:10641067. PubMed Abstract

Osborn J, Cattaruzza MS: Odds ratio and relative risk for crosssectional data [letter; comment].
International Journal of Epidemiology 1995, 24:464465. PubMed Abstract

Hirakata Vânia Naomi: Alternativas de análise para um desfecho binário em estudos transversais e longitudinais [MSc dissertation]. In Depto. Medicina Social. Pelotas, Brasil, Universidade Federal de Pelotas;; 1999.

Lee J, Chia KS: Estimation of prevalence rate ratios for cross sectional data: an example in occupational epidemiology [letter].
British Journal of Industrial Medicine 1993, 50:861862. PubMed Abstract

Victora Cesar Gomes, Vaughan JP, Kirkwood Betty R., Martines JC, Barcelos LB: Risk factors for malnutrition in Brazilian children. The role of social and environmental variables.
Bulletin of the World Health Organization 1986, 64:299309. PubMed Abstract

Wacholder S: Binomial regression in GLIM: estimating risk ratios and risk differences [see comments].
Am J Epidemiol 1986, 123:174184. PubMed Abstract

Traissac P, MartinPrevel Y, Delpeuch F, Maire B: Régression logistique vs autres modèles linéaires généralisés pour l'estimation de rapports de prévalences.
Rev Epidemiol Sante Publique 1999, 47:593604. PubMed Abstract  Publisher Full Text

Martuzzi M, Elliott P: Estimating the incidence rate ratio in crosssectional studies using a simple alternative to logistic regression.
Annals of Epidemiology 1998, 8:5255. PubMed Abstract  Publisher Full Text

Skov T, Deddens J, Petersen MR, Endahl L: Prevalence proportion ratios: estimation and hypothesis testing.
International Journal of Epidemiology 1998, 27:9195. PubMed Abstract  Publisher Full Text

Cox DR: Regression models and lifetables [with discussion].

Breslow NE: Covariance analysis of censored survival data.
Biometrics 1974, 30:8999. PubMed Abstract

Lee J: Odds ratio or relative risk for crosssectional data? [letter].
International Journal of Epidemiology 1994, 23:201203. PubMed Abstract

Lin DY, Wei LJ: The robust inference for the Cox proportional hazards model.
Journal of the American Statistical Association 1989, 84:10741078.

Huber PJ: The behavior of maximum likelihood estimates under nonstandard conditions. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. Vol. 1. Berkeley, CA, University of California Press; 1967:1, 221233.

Clayton David, Hills Michael: Statistical Models in Epidemiology. New York, Oxford University Press Inc.;; 1996:367.

McCullagh P, Nelder JA: Generalized linear models. 2nd edition. New York, Chapman and Hall;; 1989.

Breslow NE: Generalized linear models: checking assumptions and strengthening conclusions.

Lee J: Estimation of prevalence rate ratios from crosssectional data: a reply.

StataCorp.: Stata Statistical Software: Release 7.0. College Station, TX, Stata Corporation; 2001.

Victora CG, Barros FC, Halpern R, Menezes AM, Horta BL, Tomasi E, Weiderpass E, Cesar JA, Olinto MT, Guimaraes PR, Garcia MM, Vaughan JP: Estudo longitudinal da populacao maternoinfantil da regiao urbana do Sul do Brasil, 1993: aspectos metodologicos e resultados preliminares.

Deddens J, Petersen MR, Lei X: Estimation of prevalence ratios when PROC GENMOD does not converge. In Proceedings of SAS Users Group International 28 (SUGI28). Seattle, Washington; 2003:Paper 270.

Fonseca SS, Victora CG, Halpern R, Barros AJ, Lima RC, Monteiro LA, Barros FC: Fatores de risco para injúrias acidentais em préescolares.
Jornal de Pediatria 2002, 78:97104. PubMed Abstract  Publisher Full Text

MendozaSassi R, Beria JU, Barros AJ: Outpatient health service utilization and associated factors: a populationbased study.
Revista de Saúde Pública 2003, 37:372378. PubMed Abstract  Publisher Full Text

Hallal PC, Victora CG, Wells JC, Lima RC: Phisical inactivity: prevalence and associated variables in Brazilian adults.
Prepublication history
The prepublication history for this paper can be accessed here: