Practice-Based Research Networks (PBRNs) and health systems may provide timely, reliable data to guide the development and distribution of public health resources to promote healthy behaviors, such as quitting smoking. The objective of this study was to determine if PBRN data could be used to make neighborhood-level estimates of smoking prevalence.
We estimated the smoking prevalence in 32 greater Boston neighborhoods (population = 877,943 adults) by using the electronic health record data of adults who in 2009 visited one of 26 Partners Primary Care PBRN practices (n = 77,529). We compared PBRN-derived estimates to population-based estimates derived from 1999–2009 Behavioral Risk Factor Surveillance System (BRFSS) data (n = 20,475).
The PBRN estimates of neighborhood smoking status ranged from 5% to 22% and averaged 11%. The 2009 neighborhood-level smoking prevalence estimates derived from the BRFSS ranged from 5% to 26% and averaged 13%. The difference in smoking prevalence between the PBRN and the BRFSS averaged −2 percentage points (standard deviation, 3 percentage points).
Health behavior data collected during routine clinical care by PBRNs and health systems could supplement or be an alternative to using traditional sources of public health data.
Population distribution measurements of health indicators for chronic diseases can be used to target health care and community resources to areas with greatest need. The Centers for Disease Control and Prevention (CDC) and state health departments expend time, money, and energy measuring the distribution of health conditions and behavioral risk factors throughout the United States. The Behavioral Risk Factor Surveillance System (BRFSS), a cornerstone of these efforts with an annual budget of about $15 million, collects data on smoking, weight, diet, exercise, preventive medical care, and other behaviors (
Practice-based research networks (PBRNs) and integrated health delivery systems also collect patient behavioral health data in the course of routine clinical care (
Tobacco use remains the leading cause of preventable death in the United States (
Our analysis focused on 32 neighborhoods in the greater Boston area that corresponded to the catchment area of the Partners Primary Care PBRN. We compared population-based smoking prevalence estimates derived from the BRFSS and the US Census with 2009 prevalence estimates derived from the Partners Primary Care PBRN. We limited our analysis to respondents and patients aged 18 years or older. The Partners Human Research Committee approved the study protocol.
We derived individual-level data for population-based smoking prevalence estimates from the BRFSS, a population-based telephone survey administered by the CDC and state health departments (
We combined individual-level data from the BRFSS and community-level data from the US Census to estimate community-level smoking prevalence using a mixed-effects logistic regression model described previously (
The Partners Primary Care PBRN includes 23 practices in eastern Massachusetts affiliated with Brigham and Women’s Hospital or Massachusetts General Hospital. The 23 practices include 5 hospital-based practices, 12 community-based practices, and 6 community health centers. We included patients aged at least 18 who made at least 1 visit to 1 of these PBRN practices in 2009 and had a zip code in 1 of the 32 neighborhoods in the catchment area of the PBRN. The Partners Primary Care PBRN practices use the Longitudinal Medical Record, an internally developed, web-based, fully functional EHR (
We calculated the practice-based smoking prevalence and the frequency with which smoking status was not documented. We calculated the “market share” of the Partners Primary Care PBRN in each of the 32 neighborhoods by dividing the number of PBRN patients by the US Census population 18 years or older in that neighborhood.
We estimated neighborhood smoking prevalence in 2 ways using PBRN data. First, we calculated the crude prevalence of smoking in the PBRN data in each neighborhood by dividing the number of smokers by the total number of patients — smokers, nonsmokers, and patients with undocumented smoking status — from that neighborhood. Second, we calculated the smoking prevalence standardized by the neighborhood makeup according to the 2000 US census using age in 4 categories (18–39, 40–59, 60–79, or ≥80), sex, and race/ethnicity in 4 categories (white, Latino, black, or other). There were no patients in 71 of 1,024 cells. To make standardized estimates for these cells, we calculated predicted probabilities using a logistic regression model with smoking as the outcome and 2-way and 3-way interactions between age, sex, and race/ethnicity. We then filled the empty cells by calculating a smoothed estimate of the smoking prevalence by fitting the same logistic regression model but leaving out the 3-way interaction term for that cell.
For BRFSS data, we calculated 95% confidence intervals using the variances of the random effects from the logistic regression model. For the PBRN data, we calculated exact binomial 95% confidence intervals. Because of the large sample sizes and our interest in how similar the prevalence of smoking was, we relied on clinical significance rather than formal statistical testing. For ease of interpretation and because smaller increments are unlikely to be clinically significant and may give a false sense of precision, we rounded all proportions to the nearest whole percentage. We assessed the relationship between the practice at which patients were seen and the neighborhood in which they lived using Cramer’s
According to the US Census, 877,943 adults lived in the 32 neighborhoods in 2009 (population range across neighborhoods, 4,143–60,028). From 1999 to 2009, there were 20,475 adult BRFSS respondents who lived in the target area of interest used to estimate 2009 smoking prevalence (sample range across neighborhoods, 85–1,731).
There were 77,516 adult patients seen in the Partners Primary Care PBRN in 2009 who lived in 1 of the 32 neighborhoods (range across neighborhoods, 430–7,960). There was a strong relationship between the practice patients attended and neighborhood in which they lived (Cramer’s
| Characteristic | 2000 Census | Practice-Based Research Network |
|---|---|---|
|
| ||
| N (%) | ||
|
| 877,943 (100) | 77,516 (100) |
|
| ||
| 18–39 | 457,634 (52) | 27,629 (36) |
| 40–59 | 253,104 (29) | 28,705 (37) |
| 60–79 | 128,800 (15) | 17,074 (22) |
| ≥80 | 38,405 (4) | 4,108 (5) |
|
| ||
| Female | 463,807 (53) | 47,831(62) |
| Male | 414,136 (47) | 29,685 (38) |
|
| ||
| White | 570,879 (65) | 44,877 (58) |
| Black | 114,940 (13) | 8,449 (11) |
| Latino | 83,701 (10) | 15,272 (20) |
| Other | 108,423 (12) | 8,918 (12) |
|
| ||
| English | 712,249 (69) | 63,463 (82) |
| Spanish | 108,275 (10) | 9,401 (12) |
| Other | 212,334 (21) | 4,652 (6) |
Language was available for the census only for the population age 5 or older. Total N for census language data was 1,032,858.
Overall, 12% of patients seen at the Partners PBRN practices were documented smokers and 27% did not have smoking status documented. Across the 23 practices, the documented smoking rate averaged 11% (range, 4%–24%). The proportion of patients without smoking status documented across the 23 practices averaged 27% (range, 1%–79%).
According to the BRFSS, smoking prevalence averaged 13%, ranging from 5% to 26% (
| Community Name | Adult Population | Population-Based Estimates | Practice-Based Research Network | Percentage Point Difference in Standardized Population Estimates | ||||
|---|---|---|---|---|---|---|---|---|
|
|
| |||||||
| N (1999 to 2009) | Smoking Prevalence, % (95% CI) | N (2009) | Market Share, | Crude Smoking Prevalence, | Standardized Smoking Prevalence, | |||
| South Boston | 24,569 | 872 | 21 (19–24) | 1,723 | 7 | 12 (10–13) | 11 (9–13) | −10 |
| Cambridge (141) | 10,204 | 85 | 15 (13–18) | 495 | 5 | 8 (6–10) | 8 (5–11) | −7 |
| Quincy (169) | 42,855 | 536 | 19 (16–21) | 1,190 | 3 | 12 (10–14) | 12 (10–15) | −6 |
| North Dorchester | 24,343 | 686 | 18 (16–21) | 1,257 | 5 | 13 (11–15) | 12 (10–14) | −6 |
| South End | 37,451 | 1,310 | 15 (14–17) | 3,009 | 8 | 10 (9–11) | 10 (9–11) | −5 |
| Revere | 37,363 | 426 | 26 (23–29) | 7,960 | 21 | 20 (19–21) | 22 (21–23) | −4 |
| Cambridge (140) | 13,907 | 218 | 9 (8–11) | 638 | 5 | 5 (3–7) | 6 (4–8) | −4 |
| Back Bay/Beacon Hill | 4143 | 394 | 12 (10–14) | 714 | 17 | 8 (6–10) | 9 (7–11) | −3 |
| Chelsea | 25,512 | 276 | 21 (18–24) | 6,113 | 24 | 15 (14–15) | 18 (16–20) | −3 |
| Quincy (170/171) | 29,789 | 336 | 14 (12–16) | 970 | 3 | 10 (9–12) | 11 (8–13) | −3 |
| Needham | 29,589 | 389 | 8 (6–9) | 2,393 | 8 | 5 (4–6) | 5 (4–6) | −2 |
| West Roxbury | 19,741 | 911 | 15 (13–16) | 3,126 | 16 | 11 (10–12) | 12 (10–14) | −2 |
| South Dorchester | 53,044 | 1,731 | 16 (14–17) | 2,768 | 5 | 13 (12–14) | 13 (12–15) | −2 |
| Brookline | 46,737 | 799 | 8 (7–10) | 3,949 | 8 | 6 (6–7) | 7 (6–8) | −2 |
| Cambridge (139/142) | 31,839 | 406 | 11 (9–12) | 1,457 | 5 | 8 (7–10) | 9 (7–11) | −2 |
| Fenway-Kenmore | 45,724 | 776 | 12 (11–14) | 1,763 | 4 | 11 (10–13) | 11 (9–13) | −1 |
| East Boston | 29,364 | 935 | 17 (16–19) | 2,182 | 7 | 17 (15–19) | 16 (13–20) | −1 |
| Cambridge (138) | 31,989 | 401 | 7 (6–9) | 1,124 | 4 | 6 (5–7) | 6 (5–8) | −1 |
| Roxbury | 32,908 | 1,060 | 17 (16–19) | 3,100 | 9 | 16 (15–18) | 17 (15–18) | −1 |
| Malden | 45,102 | 469 | 19 (17–22) | 2,471 | 5 | 16 (15–17) | 18 (16–20) | −1 |
| Newton (462/466) | 6,834 | 104 | 7 (6–8) | 430 | 6 | 6 (4–8) | 6 (4–8) | −1 |
| Allston/Brighton | 60,028 | 1,363 | 12 (11–14) | 1,968 | 3 | 11 (9–12) | 11 (9–14) | −1 |
| Mattapan | 19,623 | 495 | 14 (12–16) | 1,335 | 7 | 12 (10–14) | 13 (9–17) | 0 |
| Central Boston | 19,332 | 677 | 9 (8–10) | 2,547 | 13 | 9 (8–10) | 8 (7–10) | 0 |
| Newton (459/467) | 23,042 | 536 | 6 (5–7) | 3,274 | 14 | 5 (4–6) | 6 (5–7) | 0 |
| Newton (460/465) | 15,927 | 289 | 7 (6–8) | 1,281 | 8 | 6 (5–8) | 8 (5–10) | 1 |
| Newton (458) | 9,783 | 148 | 7 (6–9) | 1,019 | 10 | 8 (6–9) | 8 (6–11) | 1 |
| Newton (461/464/468) | 11,299 | 223 | 5 (4–6) | 1,036 | 9 | 7 (5–8) | 7 (5–9) | 2 |
| Roslindale | 24,474 | 905 | 13 (11–14) | 3,375 | 14 | 14 (13–15) | 15 (14–17) | 3 |
| Hyde Park | 20,911 | 774 | 15 (13–17) | 3,182 | 15 | 15 (14–16) | 18 (16–20) | 3 |
| Jamaica Plain | 38,146 | 1,402 | 11 (10–13) | 5,665 | 15 | 14 (13–15) | 15 (13–16) | 4 |
| Charlestown | 12,371 | 543 | 12 (11–14) | 4,015 | 32 | 19 (17–20) | 18 (16–20) | 6 |
| Total or average | 877,943 | 20,475 | 13 | 77,529 | 10 | 11 | 11 | −2 |
Abbreviation: CI, confidence interval.
Numbers following community names represent zip codes, which have a leading “02” omitted.
Adult population estimates derived from the 2000 US Census.
Population-based estimates derived from the Behavioral Risk Factor Surveillance System.
Market share was calculated by dividing the number of Practice-Based Research Network patients by the census population aged 18 years or older in that neighborhood.
Crude smoking prevalence was calculated by dividing the number of smokers by the total number of patients in each neighborhood.
Standardized smoking prevalence was calculated by standardizing the age, sex, and race/ethnicity of Practice Based Research Network patients according to the neighborhood makeup according to the 2000 Census.
Population-based smoking prevalence in 32 Boston neighborhoods. Data derived from the Behavioral Risk Factor Surveillance System, 1999–2009.
Community Name Zip Code Rate, % East Boston
02128
17.4
Charlestown
02129
12.2
South Boston
02210
21.3
South Boston
02127
21.3
Central
02109
8.7
Central
02110
8.7
Central
02111
8.7
Central
02114
8.7
Back Bay/Beacon Hill
02108
12.1
Back Bay/Beacon Hill
02199
12.1
South End
02116
15.0
South End
02118
15.0
Fenway-Kenmore
02115
11.9
Fenway-Kenmore
02215
11.9
Allston/Brighton
02134
12.2
Allston/Brighton
02135
12.2
Allston/Brighton
02136
12.2
Allston/Brighton
02146
12.2
Jamaica Plain
02120
11.1
Jamaica Plain
02130
11.1
Roxbury
02119
17.4
Roxbury
02121
17.4
North Dorchester
02125
18.2
South Dorchester
02122
15.6
South Dorchester
02124
15.6
Mattapan
02126
13.6
Roslindale
02131
12.7
West Roxbury
02132
14.5
Hyde Park
02136
14.8
Cambridge
02138
7.3
Cambridge
02139
10.5
Cambridge
02142
10.5
Cambridge
02140
9.1
Cambridge
02141
15.2
Newton
02458
7.1
Newton
02460
6.9
Newton
02465
6.9
Newton
02459
5.6
Newton
02467
5.6
Newton
02462
6.7
Newton
02466
6.7
Newton
02461
5.1
Newton
02464
5.1
Newton
02468
5.1
Quincy
02169
18.5
Quincy
02170
13.5
Quincy
02171
13.5
Brookline
02445
8.3
Brookline
02446
8.3
Malden
02148
18.9
Revere
02151
25.9
Chelsea
02150
20.8
Needham
02492
7.5
Needham
02493
7.5
Needham 02494 7.5
According to standardized PBRN data, the 32 neighborhoods had an overall smoking prevalence of 11%, ranging from 5% to 22% (
Practice-Based Research Network smoking prevalence in 32 Boston Neighborhoods.
Community Name Zip Code Rate, % East Boston
02128
16.4
Charlestown
02129
17.8
South Boston
02210
11.3
South Boston
02127
11.3
Central
02109
8.3
Central
02110
8.3
Central
02111
8.3
Central
02114
8.3
Back Bay/Beacon Hill
02108
8.7
Back Bay/Beacon Hill
02199
8.7
South End
02116
10
South End
02118
10
Fenway-Kenmore
02115
10.9
Fenway-Kenmore
02215
10.9
Allston/Brighton
02134
11.4
Allston/Brighton
02135
11.4
Allston/Brighton
02136
11.4
Allston/Brighton
02146
11.4
Jamaica Plain
02120
14.7
Jamaica Plain
02130
14.7
Roxbury
02119
16.5
Roxbury
02121
16.5
North Dorchester
02125
12.1
South Dorchester
02122
13.2
South Dorchester
02124
13.2
Mattapan
02126
13.2
Roslindale
02131
15.2
West Roxbury
02132
12.1
Hyde Park
02136
17.7
Cambridge
02138
6.4
Cambridge
02139
8.9
Cambridge
02142
8.9
Cambridge
02140
5.5
Cambridge
02141
8
Newton
02458
8.1
Newton
02460
7.5
Newton
02465
7.5
Newton
02459
5.7
Newton
02467
5.7
Newton
02462
5.9
Newton
02466
5.9
Newton
02461
6.8
Newton
02464
6.8
Newton
02468
6.8
Quincy
02169
12.2
Quincy
02170
10.7
Quincy
02171
10.7
Brookline
02445
6.7
Brookline
02446
6.7
Malden
02148
18
Revere
02151
21.9
Chelsea
02150
17.9
Needham
02492
5.1
Needham
02493
5.1
Needham 02494 5.1
Scatter plot of Practice-Based Research Network (PBRN) smoking prevalence and population-based neighborhood prevalence of smoking for 32 Boston neighborhoods (%).
Community Name Zip Code BRFSS Prevalence, % Partners PBRN Prevalence, % Newton
02461
5.1
6.8
Newton
02464
5.1
6.8
Newton
02468
5.1
6.8
Newton
02459
5.6
5.7
Newton
02467
5.6
5.7
Newton
02462
6.7
5.9
Newton
02466
6.7
5.9
Newton
02460
6.9
7.5
Newton
02465
6.9
7.5
Newton
02458
7.1
8.1
Cambridge
02138
7.3
6.4
Needham
02492
7.5
5.1
Needham
02493
7.5
5.1
Needham
02494
7.5
5.1
Brookline
02445
8.3
6.7
Brookline
02446
8.3
6.7
Central
02109
8.7
8.3
Central
02110
8.7
8.3
Central
02111
8.7
8.3
Central
02114
8.7
8.3
Cambridge
02140
9.1
5.5
Cambridge
02139
10.5
8.9
Cambridge
02142
10.5
8.9
Jamaica Plain
02120
11.1
14.7
Jamaica Plain
02130
11.1
14.7
Fenway-Kenmore
02115
11.9
10.9
Fenway-Kenmore
02215
11.9
10.9
Back Bay/Beacon Hill
02108
12.1
8.7
Back Bay/Beacon Hill
02199
12.1
8.7
Charlestown
02129
12.2
17.8
Allston/Brighton
02134
12.2
11.4
Allston/Brighton
02135
12.2
11.4
Allston/Brighton
02136
12.2
11.4
Allston/Brighton
02146
12.2
11.4
Roslindale
02131
12.7
15.2
Quincy
02170
13.5
10.7
Quincy
02171
13.5
10.7
Mattapan
02126
13.6
13.2
West Roxbury
02132
14.5
12.1
Hyde Park
02136
14.8
17.7
South End
02116
15.0
10
South End
02118
15.0
10
Cambridge
02141
15.2
8
South Dorchester
02122
15.6
13.2
South Dorchester
02124
15.6
13.2
East Boston
02128
17.4
16.4
Roxbury
02119
17.4
16.5
Roxbury
02121
17.4
16.5
North Dorchester
02125
18.2
12.1
Quincy
02169
18.5
12.2
Malden
02148
18.9
18
Chelsea
02150
20.8
17.9
South Boston
02210
21.3
11.3
South Boston
02127
21.3
11.3
Revere 02151 25.9 21.9
Neighborhood-level smoking prevalence can be estimated using EHR data from a PBRN, collected in the routine course of clinical care. Generally, PBRN estimates were slightly lower than the BRFSS estimates, but they were higher in some neighborhoods in which the PBRN had higher market penetration.
PBRN smoking prevalence estimates may have been generally lower for several reasons. Practices did not have smoking status documented for 100% of patients. Though not directly comparable, our 73% documentation rate compares favorably to the BRFSS response rate of 48%, a rate that is declining (
Other studies have noted similarities and differences between health system–based and population-based estimates of smoking prevalence. A study in Leicester, England, found that General Practice notes, computerized and manual, tended to overestimate population smoking prevalence (
Using PBRN or health system data to measure the neighborhood prevalence of smoking offers several advantages over traditional, population-based methods. First, the data are already collected as part of routine clinical care, so their collection is less expensive than using a separate infrastructure to conduct population-based surveys. Second, for small-area estimation, as our data show, the sample sizes available in PBRN data are much larger, resulting in smaller standard errors (although relative to neighborhood population, even the PBRN estimates might be considered small), which may allow for the targeting of community-based interventions in smaller areas than is feasible using the BRFSS. Third, given the larger sample size and collection during routine clinical care, the data are potentially more current than population-based surveys, for which time is needed for data to become available (
Despite these advantages to PBRN and health system data, advantages exist to population-based estimates, such as those derived from BRFSS. Obviously, despite low and declining response rates, the BRFSS is population-based and can provide estimates regardless of whether an individual seeks care through a particular health system or seeks health care at all. The BRFSS is not dependent on “market share” to get more accurate estimates, is not dependent on the presence of health care facilities in neighborhoods, and is not subject to health system peculiarities that may limit the generalizability of PBRN data. Second, although it was not intended to provide the small-scale prevalence we calculated, the BRFSS is consistently administered across the United States. Third, the BRFSS potentially provides greater consistency over time. Although standards are emerging for the structure of EHR data, the BRFSS data are collected with greater attention to consistency in definitions and measurement. Population-based surveys are not free from bias, however. For example, smokers generally have lower response rates to surveys than do nonsmokers (
Our analysis suggests that PBRNs and health system data can be used to guide community resources to neighborhoods with greater need. However, an understanding of the limitations of estimates derived from nonpopulation-based sources, for example, data from a single health system with low market penetration, cannot be used for small-area estimation, because the results are potentially biased because of insufficient population coverage and selection bias.
The true public health benefit of using health system–based risk factor information will come from combining data from multiple health systems, which requires a change in view of health system data as a public health resource. A convergence of health system data with population-based results should be seen more often as documentation improves (
This work was conducted with support from The Lung Cancer Disparities Center at the Harvard School of Public Health (National Cancer Institute Award no. P50 CA148596) and the Harvard Catalyst, the Harvard Clinical and Translational Science Center (NIH Grant no. 1 UL1 RR 025758-01 and financial contributions from participating institutions). The content is solely the responsibility of the authors and does not necessarily represent the official views of Harvard University and its affiliated academic health care centers or the National Institutes of Health.
The opinions expressed by authors contributing to this journal do not necessarily reflect the opinions of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors' affiliated institutions.