Exposure misclassification, selection bias, and confounding are important biases in epidemiologic studies, yet only confounding is routinely addressed quantitatively. We describe how to combine two previously described methods and adjust for multiple biases using logistic regression.
Weights were created from selection probabilities and predictive values for exposure classification and applied to multivariable logistic regression models in a case-control study of prepregnancy obesity (body mass index ≥30 versus <30 kg/m2) and cleft lip with or without cleft palate (CL/P) using data from the National Birth Defects Prevention Study (2,523 cases, 10,605 controls).
Adjusting for confounding by race/ethnicity, prepregnancy obesity and CL/P were weakly associated (odds ratio 1.10, 95% confidence interval: 0.98, 1.23). After weighting the data to account for exposure misclassification, missing exposure data, selection bias, and confounding, multiple bias-adjusted odds ratios ranged from 0.94 to 1.03 in non-probabilistic bias analyses and median multiple bias-adjusted odds ratios ranged from 0.93 to 1.02 in probabilistic analyses.
This approach, adjusting for multiple biases using a logistic regression model, suggested that the observed association between obesity and CL/P could be due to the presence of bias.
Bias can affect results of epidemiologic studies so that both the direction and magnitude of the observed association can be incorrect.
Dozens of studies have found associations between prepregnancy obesity and an increased risk of having a child with a birth defect.
These associations are small enough that exposure misclassification or selection bias could explain the results. One previous study investigated potential effects of nondifferential exposure misclassification on this association; the OR was 1.25 before accounting for misclassification and ranged from 1.38 to 2.94 after.
The purpose of this analysis is two-fold. The first is to explore how the association between prepregnancy obesity and CL/P might be affected by exposure misclassification and selection bias. The second is to demonstrate how to combine two previously described methods to adjust for misclassification and selection bias using both non-probabilistic and probabilistic multiple bias analysis.
We used data from the National Birth Defects Prevention Study (NBDPS), a population-based case-control study of birth defects.
The outcome of interest was nonsyndromic isolated CL/P; clinical geneticists reviewed medical records to exclude cases possibly caused by genetic or other syndromes.
We included study site, maternal race/ethnicity, and maternal education in the models as potential confounders. Only mothers reporting their race/ethnicity as non-Hispanic white, non-Hispanic black, or Hispanic were included, to correspond with available exposure misclassification validation data (details below).
Among 3,161 CL/P case mothers and 11,692 control mothers, we excluded 382 (12%) case mothers of infants with non-isolated CL/P, 199 (6%) case and 763 (7%) control mothers reporting race/ethnicities not meeting inclusion criteria, 1 (<1%) case and 7 (<1%) control mothers with missing race/ethnicity, and 56 (2%) case and 317 (3%) control mothers with missing data on maternal education. Following exclusions, we included 2,523 case mothers and 10,605 control mothers. Mothers with missing BMI were retained in the analysis so we could account for missing exposure data.
We used logistic regression to estimate crude and confounding-adjusted ORs and 95% CIs for associations between prepregnancy obesity and CL/P. All statistical analyses were conducted in SAS version 9.4 (Cary, NC).
In this analysis, we perform both non-probabilistic and probabilistic bias analysis. Probabilistic analyses take into account uncertainty in bias parameter estimates to be taken into account by conducting analyses using a range of values for the bias parameters.
We used the method of Lyles and Lin to adjust for exposure misclassification.
We had no internal validation data on exposure misclassification for NBDPS. We used external validation data from the 1999–2010 National Health and Nutrition Examination Surveys (NHANES), representative of the civilian noninstitutionalized population of the United States.
We restricted the NHANES analysis to nonpregnant females aged 16–49 years with height and weight measurements. We cross-tabulated self-reported and measured BMI categories conditional on race/ethnicity, accounting for the complex sampling design, to estimate Se and Sp. Although predictive values can be calculated from these data, we estimated Se and Sp to examine nondifferential and differential exposure misclassification (whether or not misclassification is differential depends directly on differences in Se and Sp between cases and controls, not predictive values). Reliable estimates from NHANES were available for non-Hispanic white, non-Hispanic black, and Mexican-American women, and therefore we restricted our NBDPS analysis to these racial/ethnic groups (because approximately two-thirds of Hispanics in the U.S. are of Mexican descent, we used the estimate for Mexican-Americans for all NBDPS Hispanic women).
We assumed that the NHANES Se and Sp were accurate estimates of the Se and Sp in NBDPS. Not knowing if exposure misclassification was differential or nondifferential, we performed three analyses, assuming: (1) nondifferential misclassification, (2) “differential A” misclassification (classification is better for cases than controls), and (3) “differential B” misclassification (classification is better for controls than cases). In the first, we assigned cases and controls to have the same Se and Sp values (NHANES Se and Sp). In the second, we assigned the NHANES Se and Sp to controls and Se + 0.05 and Sp + 0.03 to cases. In the third, we assigned the NHANES Se and Sp to controls and Se – 0.05 and Sp – 0.03 to cases. Se and Sp were restricted to lie between 0.5 and 1.0, inclusive. We converted Se and Sp to predictive values (restricted to lie between 0 and 1, inclusive). Bias parameters were calculated separately for non-Hispanic white, non-Hispanic black, and Hispanic/Mexican-American women. For simplicity, we assumed they did not differ by other variables.
We also used predictive values to account for missing BMI.
For the analysis, we created a dataset with two observations for each participant (participant “copies”): one copy was assigned to have prepregnancy obesity, and the other to not have prepregnancy obesity — these represent the two possible obesity statuses the participant could have had in the absence of exposure misclassification.
To conduct probabilistic bias analysis, we assigned triangular distributions to each predictive value using the values calculated above as the mode and +/− 0.10 of the mode as the upper and lower bounds (restricted to fall between 0 and 1). We sampled each parameter 5,000 times and calculated 5,000 ORs. The results were summarized as the median OR and 95% simulation interval (SI), the 2.5th and 97.5th percentile of the OR distribution.
We used the method of Lash et al. to account for random error, but other methods, such as bootstrapping, could also be used.
We used inverse probability of selection weights (IPSW) to adjust for selection bias.
NBDPS participation rates for cases and controls were 67% and 65%.
For the probabilistic analysis, we assigned triangular distributions to each selection probability, using +/− 0.10 of the mode as the upper and lower bounds, with values restricted to lie between 0 and 1. We selected 5,000 sets of selection probabilities, inverted them to calculate the IPSW, and used these to calculate 5,000 ORs. Results were summarized as median OR and 95% SI. We added random error, producing a median OR and 95% RESI.
To adjust for exposure misclassification, missing exposure data, selection bias, and confounding, we multiplied 5,000 simulated IPSW by 5,000 simulated predictive values to create 5,000 combined weights. Then, as before, to adjust for exposure misclassification we created a dataset with two observations per participant and assigned each observation the combined weight corresponding to the assigned exposure status. The multivariable model regressed assigned exposure (not reported exposure) on the outcome, adjusted for confounders (study site, maternal race/ethnicity, maternal education), and was weighted by the combined weight to estimate the OR. Probabilistic results were summarized as median OR and 95% SI. Random error was added to generate a median OR and 95% RESI.
When biases are adjusted serially in multiple bias analysis, the order of bias adjustment is important; if adjustment is done out of order, incorrect results could be obtained.
If we consider exposure misclassification and selection bias, there are four possible datasets: (1) both biases present, (2) selection bias only, (3) exposure misclassification only, and (4) no exposure misclassification or selection bias. Our goal is to move from dataset 1 (two types of bias) to dataset 4 (no bias). This can be done by removing exposure misclassification first (datasets 1 to 2 to 4) or selection bias first (datasets 1 to 3 to 4).
In NBDPS, we removed selection bias first. When estimating IPSWs, we obtained these values from a cohort study, which likely had exposure misclassification and selection bias (dataset 1). Once the IPSW were estimated and applied, this produced dataset 3 (exposure misclassification only). We estimated predictive values from a “dataset 3” (exposure misclassification, no selection bias); this was NHANES. Because NHANES-provided weights accounting for nonresponse and other selection effects, we assumed this represented what NBDPS would have been in absence of selection bias. Once predictive values were estimated and applied, this moved from dataset 3 to 4 (no exposure misclassification or selection bias). Because there was confounding in the underlying source population, we estimated bias parameters conditional on confounders.
The prevalence of prepregnancy obesity was similar between cases and controls (
The crude OR for the association between prepregnancy obesity and CL/P was 1.09 (95% CI: 0.97, 1.21). After adjusting for study site, maternal race/ethnicity, and maternal education, it was 1.10 (95% CI: 0.98, 1.23). Despite the confidence interval crossing the null, we continued the bias analysis because of evidence in the literature that this weak association might not be due to chance.
Using bias parameters from NHANES (
In non-probabilistic and probabilistic analyses, the OR adjusted for selection bias and confounding was 0.98 (
In the non-probabilistic multiple bias analyses for exposure misclassification, missing exposure, selection bias, and confounding, the adjusted OR ranged from 0.94 to 1.03 for the three misclassification scenarios. In the probabilistic analyses, it ranged from 0.93 to 1.02 (
Multiple bias analyses suggest that exposure misclassification and selection bias could account for the weak association between prepregnancy obesity and CL/P, with analyses based on realistic bias parameters compatible with no association. The median multiple bias-adjusted ORs were closer to the null than the confounding-adjusted OR, although the SIs and RESIs spanned values compatible with inverse, positive, or no associations. Selection bias and “differential B” misclassification had the greatest effects in moving the association towards the null.
The ORs for the non-probabilistic and probabilistic analyses were similar, likely because our triangular distributions centered on the bias parameters used in the non-probabilistic analysis. Conducting non-probabilistic bias analyses is a simple way to explore the effects of bias; however, probabilistic analyses might indicate if results are compatible with a wider range of values.
One previous bias analysis investigated the impact of nondifferential exposure misclassification on this association, finding bias towards the null.
Combining the methods of Lyles and Lin and Hernán et al. allowed for adjusting combinations of different biases.
We assumed that our parameters were applicable to the NBDPS study population, but had no evidence to support this. We tested few of many possible parameters, and if our assumptions were incorrect, our results might not reflect the true OR.
We made simplifying assumptions that predictive values varied only by race/ethnicity and case-control status and that selection probabilities varied by exposure and case-control status. Estimating bias parameters conditional on other variables could provide better adjustment for bias, if parameters do vary. However, if bias parameters are estimated from subgroups with small sample size, the random error introduced might bias the analysis, particularly if extreme weights are estimated and cause some participants to carry undue influence.
Additional complexities can be added to our analyses. Outcome misclassification, unmeasured confounding, or covariate misclassification might be integrated using other methods; for example, using the multiple bias-adjusted OR as input for the method of Ding and VanderWeele to examine unmeasured confounders.
Studies of obesity have been criticized when used for causal inference because obesity does not correspond to a well-defined causal question.
The association observed between prepregnancy obesity and CL/P could be attributable to exposure misclassification or selection bias. We encourage epidemiologists to incorporate multiple bias analysis into their research or teaching.
The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention. The authors wish to thank Regina Simeone and Annette Christianson for assistance with analysis.
This work was supported by the Centers for Disease Control and Prevention (cooperative agreements PA 96043, PA 02081, and FOA DD09–001 to the Centers for Birth Defects Research and Prevention participating in the National Birth Defects Prevention Study).
body mass index
confidence interval
cleft lip with or without cleft palate
inverse probability of selection weight
National Birth Defects Prevention Study
National Health and Nutrition Examination Survey
odds ratio
random error-added simulation interval
sensitivity
simulation interval
specificity
Characteristics of case and control mothers in the analysis, National Birth Defects Prevention Study, 1997–2011.
| Case Mothers | Control Mothers | Odds Ratio | |||
|---|---|---|---|---|---|
| Number | % | Number | % | ||
| Prepregnancy obesity | |||||
| No | 1,919 | 76 | 8,259 | 78 | 1.00 (Ref) |
| Yes | 478 | 19 | 1,896 | 18 | 1.09 (0.97, 1.21) |
| Missing | 126 | 5 | 450 | 4 | |
| Race/ethnicity | |||||
| Non-Hispanic white | 1,671 | 66 | 6,601 | 62 | 1.00 (Ref) |
| Non-Hispanic black | 154 | 6 | 1,251 | 12 | 0.49 (0.41, 0.58) |
| Hispanic | 698 | 28 | 2,753 | 26 | 1.00 (0.91, 1.11) |
| Maternal education | |||||
| 0–11 years | 503 | 20 | 1,800 | 17 | 1.41 (1.24, 1.60) |
| 12 years | 684 | 27 | 2,545 | 24 | 1.36 (1.21, 1.53) |
| 13–15 years | 659 | 26 | 2,843 | 27 | 1.17 (1.04, 1.32) |
| ≥16 years | 677 | 27 | 3,417 | 32 | 1.00 (Ref) |
| Study site | |||||
| Arkansas | 297 | 12 | 1,384 | 13 | 1.00 (Ref) |
| California | 388 | 15 | 1,118 | 11 | 1.62 (1.36, 1.92) |
| Georgia | 246 | 10 | 1,117 | 11 | 1.03 (0.85, 1.24) |
| Iowa | 270 | 11 | 1,225 | 12 | 1.03 (0.86, 1.23) |
| Massachusetts | 296 | 12 | 1,284 | 12 | 1.07 (0.90, 1.28) |
| New Jersey | 92 | 4 | 526 | 5 | 0.82 (0.63, 1.05) |
| New York | 208 | 8 | 891 | 8 | 1.09 (0.89, 1.32) |
| North Carolina | 173 | 7 | 862 | 8 | 0.94 (0.76, 1.15) |
| Texas | 283 | 11 | 1,279 | 12 | 1.03 (0.86, 1.23) |
| Utah | 270 | 11 | 919 | 9 | 1.37 (1.14, 1.65) |
Bias parameters used for adjustment of exposure misclassification by racial/ethnic group, National Health and Nutrition Examination Survey, 1999–2010.
| Sensitivity | Specificity | P(obese|missing) | P(not obese|missing) | |
|---|---|---|---|---|
| Mexican-American | 0.817 | 0.968 | 0.458 | 0.542 |
| Non-Hispanic black | 0.859 | 0.959 | 0.521 | 0.479 |
| Non-Hispanic white | 0.841 | 0.991 | 0.335 | 0.665 |
Proportion of women with missing data on body mass index who were truly obese based on body measurements.
Proportion of women with missing data on body mass index who were truly not obese based on body measurements.
Associations Between Prepregnancy Obesity and Cleft Lip With or Without Cleft Palate, Adjusting for Different Combinations of Biases, National Birth Defects Prevention Study, 1997–2011.
| Conventional Analysis | Non- | Probabilistic | Probabilistic Bias Analysis | ||||
|---|---|---|---|---|---|---|---|
| OR | 95% CI | OR | Median OR | 95% SI | Median OR | 95% RESI | |
| Unadjusted | 1.09 | 0.97, 1.21 | |||||
| Confounding only | 1.10 | 0.98, 1.23 | |||||
| Selection bias and confounding | 0.98 | 0.98 | 0.82, 1.17 | 0.98 | 0.80, 1.21 | ||
| Exposure misclassification and confounding | |||||||
| Nondifferential (Seca = Seco, Spca = Spco) | 1.12 | 1.11 | 0.84, 1.45 | 1.11 | 0.83, 1.48 | ||
| Differential A (Seca = Seco+0.05, Spca = Spco+0.03) | 1.09 | 1.09 | 0.84, 1.41 | 1.09 | 0.82, 1.44 | ||
| Differential B (Seca = Seco−0.05, Spca = Spco−0.03) | 1.02 | 1.01 | 0.76, 1.34 | 1.01 | 0.74, 1.36 | ||
| All biases combined | |||||||
| Nondifferential (Seca = Seco, Spca = Spco) | 1.03 | 1.02 | 0.76, 1.37 | 1.02 | 0.75, 1.39 | ||
| Differential A (Seca = Seco+0.05, Spca = Spco+0.03) | 1.01 | 1.01 | 0.76, 1.33 | 1.00 | 0.74, 1.36 | ||
| Differential B (Seca = Seco−0.05, Spca = Spco−0.03) | 0.94 | 0.93 | 0.69, 1.26 | 0.93 | 0.67, 1.28 | ||
Abbreviations: CI, confidence interval; OR, odds ratio; RESI, random error-added simulation interval; Seca, sensitivity for cases; Seco, sensitivity for controls; SI, simulation interval; Spca, specificity for cases; Spco, specificity for controls.
Fixed values of bias parameters chosen for the analysis.
Triangular distributions of bias parameters sampled over 5,000 iterations.
Adjusted for confounding by maternal race/ethnicity.
Adjusted for exposure misclassification and missing exposure data.
Adjusted for selection bias, exposure misclassification, missing exposure, and confounding.