In 1994, the U.S. Department of Health and Human Services mandated sufficient inclusion of racial and ethnic minorities in all federally funded research. This mandate requires researchers to monitor study samples for research participation and differential survey nonresponse. This study illustrates methods to assess differential survey nonresponse when population race data are incomplete, which is often the case when studies are conducted among members of health plans.
We collected data as part of the PRISM (Personally Relevant Information about Screening Mammography) study, a trial funded by the National Institutes of Health to increase rates of annual mammography adherence. We used two methods to estimate racial distribution of the PRISM study population. The first method, called E-Tech, estimated race of the sample frame by using individuals' names and zip codes. In the second method, we conducted interviews with a subsample of PRISM study refusals. We validated both estimation methods through comparisons with self-reported race. We used race information generated by E-Tech, interviewer estimates, and self-report to assess differential nonresponse in the PRISM study.
The E-Tech method had moderate sensitivity (48%) in estimating race of black participants but higher specificity (97%) and positive predictive value (71%). The interviewer-estimation method had high sensitivity (100%), high specificity (95%), and moderate positive predictive value (80%). Black women were less likely than white women to be reached for study participation.
There was slight differential nonresponse by race in the PRISM study. Techniques described here may be useful for assessing differential nonresponse in samples with incomplete data on race.
Differential nonresponse is a potential problem in all health survey research. It can be particularly problematic in studies that include low-income groups, racial and ethnic minority groups, or both. Differential nonresponse occurs when one sample subgroup has a lower survey response than other subgroups. Statistical strategies to compensate for differential nonresponse, such as weighting, attempt to attenuate the impact of differential nonresponse on survey error (
Blacks often have lower rates of participation in health survey research compared with whites. This discrepancy is attributed to factors such as socioeconomic status as well as challenges in research recruitment and participation. Studies have found black populations less likely to be located and reached and more likely to refuse participation (
In 1994, the U.S. Department of Health and Human Services mandated sufficient inclusion of racial and ethnic minorities in all federally funded research (
Our initial estimate was that approximately 23% of participants in the study's baseline telephone interviews would be black, based on the known racial composition of North Carolina state employees (
The primary aim of the research reported here was to determine whether differential nonresponse by race occurred. Because race data were not available on the frame used to select the sample, we tested our primary aim indirectly, using two approaches to estimate race of nonparticipants. We chose two approaches because each single approach has inherent weaknesses. The first approach, called E-Tech, estimated racial composition of the frame using algorithms based on the names and zip codes of individuals. In the second approach, we conducted brief interviews with a subsample of women who refused participation in the PRISM study. A secondary aim was to compare estimated race data with self-reported race data to validate these approaches.
PRISM, part of the National Institutes of Health's (NIH's) Health Maintenance Consortium, is an NIH-funded intervention trial to increase rates of mammography maintenance (
PRISM study recruitment occurred between October 2004 and April 2005. Researchers first mailed invitation letters to the sample of 9079 potential participants. The letters provided instructions for opting out of the study. In addition, potential participants were sent required HIPAA (Health Insurance Portability and Accountability Act of 1996) information about the types of personal health information that would be collected. Trained telephone interviewers from Battelle Centers for Public Health Research and Evaluation contacted potential participants to obtain their active consent. Following consent, women completed 30-minute baseline telephone interviews designed to collect sociodemographic data (including race) and information on mammography knowledge, beliefs, and practices. Interviewers made up to 12 attempts to contact women. The Institutional Review Boards for the University of North Carolina School of Public Health and Duke University Medical Center approved the research study.
PRISM telephone interviewers attempted to contact the random sample of 9079 individuals who met initial eligibility criteria (
PRISM (Personally Relevant Information about Screening Mammography) participant recruitment for baseline and refusal interviews.
Of the 3490 PRISM study participants for analysis in this study, 89.3% reported their race as white (n = 3116), and 10.7% reported their race as black or African American (n = 374). Fewer than 1% of participants (n = 53) were American Indian or Alaska Native (n = 34), Asian (n = 11), or Native Hawaiian or Other Pacific Islander (n = 1), or gave a response of "other" (n = 7). Twenty-one participants reported that they were Hispanic/Latina.
We employed Ethnic Technologies, LLC, a professional data encoding service (
As a second method to estimate race and to determine whether PRISM study refusals were disproportionately black, we conducted brief interviews, referred to as
Upon completion of 126 of the 150 target refusal interviews, we added a second element to the remaining attempted interviews: we asked interviewers to estimate race of women with whom they spoke, regardless of their participation in refusal interviews, based on verbal cues. The purpose of the interviewer-estimation component was to determine the accuracy of this method through comparisons with self-reported race. PRISM study researchers provided no training to interviewers about how to use verbal cues. Interviewers classified the 53 women with whom they spoke as black or white; none of the women was classified as "do not know" or "other." Of the 53 women contacted by interviewers, 24 agreed to participate in refusal interviews and provided self-reported race. Therefore, we validated the interviewer-estimation method by using a subsample of 24 women.
We dichotomized race as white and black because the distribution of PRISM study participants was predominately white (89.3%) or black (10.7%). Participants who represented other racial or ethnic groups (<1%) were removed from analyses as were participants who gave a self-reported race as "other" (<1%) because their numbers were too small for meaningful analysis.
We calculated each method's sensitivity, specificity, and positive predictive value to correctly estimate black race compared with self-reported race (
We used chi-square tests when comparing racial distributions of participants to nonparticipants. We used one-sample binomial tests (z scores) when making comparisons to the PRISM sample frame. Because we found that the E-Tech method tended to misclassify black participants as white, we applied ratio-weighted adjustments to the sample frame. We applied a ratio-weighted adjustment of 1.465 to each black woman identified by E-Tech to increase the proportion of estimated blacks in the frame and applied an adjustment of 0.953 to each estimated white woman to decrease their representation in the frame. We calculated these adjustments through comparisons with self-reported race data. We performed data analyses using SAS version 9.1 (SAS Institute Inc, Cary, NC). Statistical tests were considered significant at
We found a high level of agreement between interviewer estimates of race and self-reported race (κ = 0.86; 95% CI, 0.60–1.00) (
Comparison of refusal-interview participants with the weighted E-Tech sample frame showed slight nonsignificant differences in the percentage of black individuals (
Although E-Tech was nearly perfect in estimating white race when participants were self-reported white, it misclassified 52% of the sample's self-reported black participants as white, resulting in underestimation of black participants. This discrepancy might be explained by the E-Tech process for assigning race codes. For example, if a woman resided in a predominately white geographical area, she was coded as white unless her first or surname suggested otherwise. Black participants whose first or surnames were not ethnically unique (e.g., Melissa Smith) and lived in predominately white geographical areas were likely coded as white. Similarly, a study by Kwok and Yankaskas (
The method in which interviewers estimated race of women with whom they spoke was highly accurate. Only one white study participant was misclassified as black; the rest were accurately identified. This finding is consistent with literature suggesting that certain characteristics of African American vernacular English may make it distinguishable from non-African American speakers (
By triangulating results from multiple statistical comparisons, we assessed potential differential nonresponse in the PRISM study. Both unweighted and weighted E-Tech–estimated sample frames differed in their racial distributions compared with PRISM study participants, leading us to conclude there was slight differential nonresponse by race.
When we examined categories of nonparticipation using weighted E-Tech estimates, we found that study interviewers had more difficulty reaching black women compared with white women. That is, the category of nonparticipants who had no working telephones, reached the maximum number of call attempts without successful contact, requested call-backs but were not reached on subsequent attempts, or for whom gatekeepers refused participation was disproportionately black. The finding that black women were more difficult to reach is consistent with reports in the health survey literature (
Our findings were inconclusive as to whether study refusals were disproportionately black. Although racial distribution of refusal-interview participants was slightly different compared with the distribution of the weighted E-Tech sample frame, the differences were not statistically significant. Yet, racial distribution of refusal-interview participants was significantly different compared with PRISM study participants. Findings in the health-survey literature suggest that blacks may be more likely to refuse participation. For example, analysis of the 2003 Behavioral Risk Factor Surveillance System (BRFSS) data found refusal rates were significantly higher in counties with higher percentages of black residents (
The two described methods to estimate race of the study sample each had limitations. First, the E-Tech method tended to misclassify black women as white. We applied weighted adjustments to the E-Tech numbers to help overcome this limitation. Second, because we implemented the refusal-interview component as a supplemental study near the latter stages of participant recruitment, sample sizes used to assess the accuracy of the interviewer-estimation method were small and should be replicated with larger samples. Given the characteristics of our PRISM sample, our findings are limited in their generalizability. For example, we do not know the accuracy of the E-Tech and interviewer methods to estimate race for men or age groups such as adolescents. Also, our sample had very few participants who represented racial or ethnic groups other than black and white. Thus, we do not know how accurate these methods would be for estimating race or ethnicity for Hispanics, Asians, or other groups. Our findings as they relate to differential nonresponse are generalizable only to our target population of insured women who are adherent to mammography.
Adequate participation in health research from racial and ethnic minorities is essential to reveal potential health disparities, to ensure that results of intervention and other research can be generalized to these populations, and to comply with federal regulations. Monitoring recruitment is essential to determine whether study participants are disproportionate in their racial composition compared with the sample and, furthermore, whether conclusions drawn from study findings may be limited in their generalizability due to nonresponse bias. We illustrated two methods to assess differential nonresponse when race data are incomplete. Like many studies that rely on samples from government or health plan populations, racial or ethnic data were not available for our initial sample. Techniques described here may be useful to other researchers who wish to assess potential differential nonresponse when faced with incomplete race data. Use and improvement of methods, such as E-Tech and telephone-interviewer estimation of race, are important given U.S. Department of Health and Human Services requirements to achieve adequate participation from racial and ethnic minority groups in federally funded health research (
The authors thank Deborah Usinger and Tara Strigo from the PRISM study for their assistance with data collection. We are grateful to staff from Ethnic Technologies, LLC; Ingrid Morris, MPH, and Don Bradley, MD, of BlueCross BlueShield of North Carolina; and Nancy Henley, MD, MPH, and Casey Herget, MPH, MSW, of the North Carolina State Health Plan for making this research possible. The authors are also grateful to Dr Jo Anne Earp and members of her manuscript writing course for their review of earlier drafts and to Peter Nyangweso for his assistance with data analysis.
This study was supported by National Cancer Institute grant no. 5R01CA105786.
Agreement Between E-Tech Estimate of Race and Self-reported Race for PRISM Study Particpants (n = 3375), North Carolina, 2005
| Category | Black | White | American Indian,Asian,or Native Hawaiian | Total |
|---|---|---|---|---|
| Self-reported race | 365 | 3010 | 0 | 3375 |
| E-Tech–estimated race | 245 | 3106 | 24 | 3375 |
| Correctly identified | 174 | 2916 | 0 | 3090 |
| Incorrectly identified | 71 | 190 | 24 | 285 |
PRISM indicates Personally Relevant Information About Screening Mammography.
E-Tech did not identify race codes for 115 of the 3490 PRISM study participants who self-reported black or white, resulting in 3375 total participants. Sensitivity of E-Tech estimates of black participants = 47.7% (174/365); positive predictive value of E-Tech estimates of black participants = 71.0% (174/245); specificity of E-Tech estimates of white participants = 96.9% (2916/3010). κ = 0.53; 95% confidence interval, 0.48–0.58.
Of the 71 participants who were incorrectly identified as black, all self-reported as white. Of the 190 participants who were incorrectly identified as white, all self-reported as black. Of the 24 participants incorrectly identified as American Indian, Asian, or Native Hawaiian, 1 self-reported as black and 23 self-reported as white.
Agreement Between Telephone-Interviewer Estimate of Race and Self-reported Race Among Subsample (n = 24) of Refusal-Interview Participants, North Carolina, 2005
| Category | Black | White | Total |
|---|---|---|---|
| Self-reported race | 4 | 20 | 24 |
| Interviewer-estimated race | 5 | 19 | 24 |
| Correctly identified | 4 | 19 | 23 |
| Incorrectly identified | 1 | 0 | 1 |
PRISM indicates Personally Relevant Information About Screening Mammography.
Sensitivity of interviewer estimates of black participants = 100% (4/4); positive predictive value of interviewer estimates of black participants = 80% (4/5); specificity of interviewer estimates of white participants = 95% (19/20). κ = 0.86; 95% confidence interval, 0.60–1.00.
Self-Reported Race for PRISM Study Participants Compared With Unweighted and Weighted E-Tech–Estimated Sample Frames, North Carolina, 2005
| Race | PRISM Participants (Self-Reported Race) (n = 3490) % (95% CI) | Unweighted PRISM Frame (E-Tech-Estimated Race) | Weighted PRISM Frame (Adjusted E-Tech-Estimated Race) |
|---|---|---|---|
| Black | 10.7 (9.7-11.7) | 9.2 (8.8-9.5) | 13.4 (13.0-13.8) |
| White | 89.3 (88.3-90.3) | 90.8 (90.4-91.2) | 86.6 (86.2-87.0) |
PRISM indicates Personally Relevant Information About Screening Mammography; CI, confidence interval. N values are actual (unweighted) values whereas proportions are weighted proportions.
PRISM participants compared with E-Tech–estimated PRISM frame:
PRISM participants compared with weighted E-Tech–estimated PRISM frame:
Weighted E-Tech–Estimated Racial Distributions for Categories of Nonparticipation Compared With Weighted PRISM Frame, North Carolina, 2005
| Race | Weighted PRISM Frame (Adjusted E-Tech- Estimated Race) (N =26,688)% (95% CI) | Weighted E-Tech-Estimated Nonparticipants | |||
|---|---|---|---|---|---|
| Could Not Be Contacted | Ineligible for Study | Removed From Sample | Refused | ||
| Black | 13.4 (13.0-13.8) | 18.3 (15.6-20.9) | 10.7 (6.8-14.6) | 14.0 (12.6-15.4) | 13.5 (12.0-15.0) |
| White | 86.6 (86.2-87.0) | 81.7 (79.1-84.3) | 89.3 (85.4-93.2) | 86.0 (84.6-87.4) | 86.5 (85.0-88.0) |
PRISM indicates Personally Relevant Information About Screening Mammography; CI, confidence interval. N values are actual (unweighted) values whereas proportions are weighted proportions.
Comparison with weighted PRISM frame:
Comparison with weighted PRISM frame:
Comparison with weighted PRISM frame:
Comparison with weighted PRISM frame:
Analysis of Refusal-Interview Participants, PRISM Study, North Carolina, 2005
| Race | Refusal-Interview Participants (Self-Reported Race) (n = 150)% (95% CI) | Weighted PRISM Frame (Adjusted E-Tech Estimated Race) | PRISM Participants (Self-Reported Race) | Declined Refusal Interview, (Interviewer-Estimated Race) |
|---|---|---|---|---|
| Black | 16.7 (10.7-22.6) | 13.4 (13.0-13.8) | 10.7 (9.7-11.7) | 17.2 (3.5-31.0) |
| White | 83.3 (77.4-89.3) | 86.6 (86.2-87.0) | 89.3 (88.3-90.3) | 82.8 (69.0-96.5) |
PRISM indicates Personally Relevant Information About Screening Mammography; CI, confidence interval. N values are actual (unweighted) values whereas proportions are weighted proportions.
Refusal-interview participants compared with weighted PRISM frame:
Refusal-interview participants compared with PRISM participants: χ2 = 5.2;
Refusal-interview participants compared with those who declined refusal interview and for whom interviewer estimated race: χ2 = 0.06;
The opinions expressed by authors contributing to this journal do not necessarily reflect the opinions of the U.S. Department of Health and Human Services, the Public Health Service, Centers for Disease Control and Prevention, or the authors' affiliated institutions. Use of trade names is for identification only and does not imply endorsement by any of the groups named above.