Public health programs can deliver benefits only if they are able to sustain programs, policies, and activities over time. Although numerous sustainability frameworks and models have been developed, there are almost no assessment tools that have demonstrated reliability or validity or have been widely disseminated. We present the Program Sustainability Assessment Tool (PSAT), a new and reliable instrument for assessing the capacity for program sustainability of various public health and other programs.
A measurement development study was conducted to assess the reliability of the PSAT. Program managers and staff (n = 592) representing 252 public health programs used the PSAT to rate the sustainability of their program. State and community-level programs participated, representing 4 types of chronic disease programs: tobacco control, diabetes, obesity prevention, and oral health.
The final version of the PSAT contains 40 items, spread across 8 sustainability domains, with 5 items per domain. Confirmatory factor analysis shows good fit of the data with the 8 sustainability domains. The subscales have excellent internal consistency; the average Cronbach’s α is 0.88, ranging from 0.79 to 0.92. Preliminary validation analyses suggest that PSAT scores are related to important program and organizational characteristics.
The PSAT is a new and reliable assessment instrument that can be used to measure a public health program’s capacity for sustainability. The tool is designed to be used by researchers, evaluators, program managers, and staff for large and small public health programs.
The new discipline of dissemination and implementation science has driven an increase in studies of how new scientific discoveries are translated and developed into programs, policies, and practices (
The concept of program sustainability is not new, and theoretical work has been done in many fields, including business, health care administration, social services, and public health (
Although much conceptual work has been done on defining sustainability, a dearth of tools is available to scientists, evaluators, and public health program managers to assess sustainability. Of all the frameworks for sustainability, few have been translated into valid and reliable tools for measuring sustainability. In an extensive literature search, Hutchinson found references to 33 tools measuring some aspect of sustainability (
We present a new assessment tool based on our program sustainability framework (
This measurement development study was designed to produce a reliable assessment instrument that could be easily used to measure capacity for program sustainability. The structure of the PSAT was based on our previous sustainability conceptual framework (
On the basis of our prior literature review and concept mapping study (
The pilot instrument development study was not designed as an in-depth validation study. However, we collected a small number of items from a sample of the participants that could be used for simple validation analyses. The focus of the PSAT is to accurately characterize the sustainability capacity of a public health program. As such, if the instrument is valid, the sustainability scores should reflect characteristics of the program, agency, or organization. Conversely, sustainability scores should not be strongly related to characteristics of the individual program directors or staff members who are filling out the PSAT.
We employed 2 primary tools to collect data: paper surveys administered in person at trainings and online surveys administered using Qualtrics (Qualtrics, Provo, Utah). The paper and electronic versions of the surveys were identical. Each was composed of 63 elements organized into the 9 original domains.
The sustainability project advisory group identified initiatives (comprising multiple grantees with separate programs) and individual programs for a pilot test of the instrument (
| Participating Initiatives and Programs | Program Level | Program Focus | No. of Programs | No. of Participants |
|---|---|---|---|---|
| Missouri Healthy and Active Communities grantees | Community | Obesity Prevention | 47 | 99 |
| Appalachia Diabetes Coalitions | Community | Diabetes | 31 | 31 |
| Missouri Tobacco Prevention and Cessation Initiative grantees | Community | Tobacco | 31 | 82 |
| Centers for Disease Control and Prevention (CDC), Division of Nutrition, Physical Activity, and Obesity grantees | State | Obesity Prevention | 50 | 114 |
| Missouri Council for Activity and Nutrition Coalition | State | Obesity Prevention | 1 | 8 |
| CDC, Fall Institute workshop grantees | State | Diabetes, tobacco | 21 | 24 |
| Missouri Tobacco Control Program | State | Tobacco | 1 | 11 |
| CDC, Office on Smoking and Health grantees | State | Tobacco | 53 | 142 |
| CDC, Office on Smoking and Health Sustaining States grantees | State | Tobacco | 4 | 46 |
| CDC, Division of Oral Health grantees | State | Oral Health | 13 | 35 |
|
|
|
| ||
Several variables that described the respondents’ programs were used as covariates in the validation measure. These variables were program level, program type, role in program, years in program, and perceived sustainability. Program level had 2 options, state or community. Program type described the public health issue that the program focused on: obesity prevention, diabetes, oral health, tobacco, or multiple health topics. Role in program indicated the respondent’s role in their organization: manager/director, evaluator, financial/operations/business manager, program staff, board member, community partner, or other. “Years in program” was a continuous variable that indicated how long the respondent had been with the organization. Perceived sustainability was measured on a 7-point Likert scale with anchors of 1 (not at all sustainable) and 7 (very sustainable) in response to the question “How would you rate the overall sustainability of your program?” The perceived sustainability question was added partway through the pilot study, and 205 participants were able to respond to this question. Although we were limited in the types of covariate questions we could include in this pilot study, these 4 were included to capture potential differences in program sustainability capacity. For example, whether a program is organized at the state or community level may influence its access to resources or partnerships that support sustainability capacity.
Using the lavaan structural equation modeling package version 0.4-11 in the R statistical software (R Foundation for Statistical Computing, Vienna, Austria) (
The item and confirmatory factor analyses resulted in the final structure of the PSAT.
| Phase | Subscales | Items | χ2/ | CFI | RMSEA | SRMR | AIC |
|---|---|---|---|---|---|---|---|
| Baseline | 1 | 63 | 15.3 | 0.58 | 0.102 | 0.087 | 114,884 |
| Pilot | 9 | 63 | 3.7 | 0.82 | 0.067 | 0.063 | 108,194 |
| Final | 8 | 40 | 3.6 | 0.89 | 0.066 | 0.055 | 69,518 |
Abbreviations: CFI, comparative fit index; RMSEA, root mean square error of approximation; SRMR, standardized root mean residual; AIC, Akaike Information Criterion.
| Subscale Definition and Items | Total Sample (n = 592) | Confirmatory Factor Analysis | |||
|---|---|---|---|---|---|
| Program Level | Program Type | ||||
| Community (n = 212) | State (n = 380) | Tobacco (n = 301) | Nontobacco (n = 291) | ||
|
| |||||
| 1. Political champions advocate for the program. | 0.84 | 0.84 | 0.84 | 0.82 | 0.84 |
| 2. The program has strong champions with the ability to garner resources. | 0.81 | 0.74 | 0.82 | 0.82 | 0.80 |
| 3. The program has political support within the larger organization. | 0.72 | 0.77 | 0.68 | 0.69 | 0.75 |
| 4. The program has political support from outside of the organization. | 0.84 | 0.87 | 0.82 | 0.81 | 0.86 |
| 5. The program has strong advocacy support. | 0.74 | 0.68 | 0.75 | 0.66 | 0.82 |
|
| |||||
| 6. The program exists in a supportive state economic climate. | 0.61 | 0.44 | 0.66 | 0.57 | 0.66 |
| 7. The program implements policies to help ensure sustained funding. | 0.66 | 0.66 | 0.64 | 0.64 | 0.68 |
| 8. The program is funded through a variety of sources. | 0.61 | 0.59 | 0.63 | 0.62 | 0.60 |
| 9. The program has a combination of stable and flexible funding. | 0.77 | 0.72 | 0.80 | 0.80 | 0.75 |
| 10. The program has sustained funding. | 0.75 | 0.79 | 0.73 | 0.74 | 0.76 |
|
| |||||
| 11. Diverse community organizations are invested in the success of the program. | 0.77 | 0.76 | 0.77 | 0.77 | 0.77 |
| 12. The program communicates with community leaders. | 0.85 | 0.82 | 0.86 | 0.86 | 0.85 |
| 13. Community leaders are involved with the program. | 0.85 | 0.81 | 0.88 | 0.86 | 0.85 |
| 14. Community members are passionately committed to the program. | 0.78 | 0.74 | 0.78 | 0.78 | 0.78 |
| 15. The community is engaged in the development of program goals. | 0.78 | 0.77 | 0.78 | 0.77 | 0.79 |
|
| |||||
| 16. The program is well integrated into the operations of the organization. | 0.77 | 0.68 | 0.80 | 0.75 | 0.79 |
| 17. Organizational systems are in place to support the various program needs. | 0.84 | 0.73 | 0.86 | 0.85 | 0.83 |
| 18. Leadership effectively articulates the vision of the program to external partners. | 0.81 | 0.78 | 0.78 | 0.84 | 0.79 |
| 19. Leadership efficiently manages staff and other resources. | 0.84 | 0.85 | 0.80 | 0.85 | 0.85 |
| 20. The program has adequate staff to complete the program’s goals. | 0.60 | 0.60 | 0.52 | 0.65 | 0.54 |
|
| |||||
| 21. The program has the capacity for quality program evaluation. | 0.78 | 0.74 | 0.78 | 0.78 | 0.78 |
| 22. The program reports short-term and intermediate outcomes. | 0.82 | 0.78 | 0.83 | 0.82 | 0.81 |
| 23. Evaluation results inform program planning and implementation. | 0.89 | 0.90 | 0.89 | 0.93 | 0.86 |
| 24. Program evaluation results are used to demonstrate successes to funders and other key stakeholders. | 0.84 | 0.84 | 0.84 | 0.86 | 0.83 |
| 25. The program provides strong evidence to the public that the program works. | 0.80 | 0.72 | 0.80 | 0.76 | 0.82 |
|
| |||||
| 26. The program periodically reviews the evidence base. | 0.78 | 0.78 | 0.81 | 0.81 | 0.74 |
| 27. The program adapts strategies as needed. | 0.89 | 0.87 | 0.89 | 0.92 | 0.85 |
| 28. The program adapts to new science. | 0.86 | 0.84 | 0.88 | 0.90 | 0.81 |
| 29. The program proactively adapts to changes in the environment. | 0.89 | 0.89 | 0.88 | 0.90 | 0.88 |
| 30. The program makes decisions about which components are ineffective and should not continue. | 0.75 | 0.73 | 0.74 | 0.77 | 0.74 |
|
| |||||
| 31. The program has communication strategies to secure and maintain public support. | 0.89 | 0.89 | 0.88 | 0.91 | 0.88 |
| 32. Program staff members communicate the need for the program to the public. | 0.86 | 0.85 | 0.84 | 0.85 | 0.88 |
| 33. The program is marketed in a way that generates interest. | 0.85 | 0.81 | 0.83 | 0.86 | 0.84 |
| 34. The program increases community awareness of the issue. | 0.83 | 0.74 | 0.81 | 0.85 | 0.81 |
| 35. The program demonstrates its value to the public. | 0.81 | 0.62 | 0.82 | 0.85 | 0.77 |
|
| |||||
| 36. The program plans for future resource needs. | 0.81 | 0.80 | 0.82 | 0.82 | 0.80 |
| 37. The program has a long-term financial plan. | 0.83 | 0.85 | 0.84 | 0.80 | 0.84 |
| 38. The program has a sustainability plan. | 0.82 | 0.83 | 0.81 | 0.77 | 0.84 |
| 39. The program’s goals are understood by all stakeholders. | 0.74 | 0.64 | 0.74 | 0.82 | 0.67 |
| 40. The program clearly outlines roles and responsibilities for all stakeholders. | 0.78 | 0.65 | 0.79 | 0.85 | 0.71 |
This domain is now called Environmental Support.
As a further test of the PSAT structure, we conducted multiple-group confirmatory factor analysis to test for factorial invariance across levels of 2 covariates: program level and program type. This allowed us to determine whether the subscale structure of the PSAT is the same for different types of programs. These analyses tested the hypothesis that there were equal item-factor loadings across different subgroups. The results indicated a significant difference between community and state programs (χ2 = 78.0, degrees of freedom [
The subscale reliabilities (internal consistency) for the PSAT were excellent, especially given the small size of each subscale (5 items) (
| Subscale | Cronbach’s α |
|---|---|
| Political Support | 0.88 |
| Funding Stability | 0.79 |
| Partnerships | 0.90 |
| Organizational Capacity | 0.87 |
| Program Evaluation | 0.90 |
| Program Adaptation | 0.91 |
| Communications | 0.92 |
| Strategic Planning | 0.88 |
This domain is now called Environmental Support.
Using the data obtained from the 592 participants of the PSAT pilot development, we created the final PSAT scale to be used for a small number of exploratory descriptive and validation analyses. The average total sustainability score across the 252 programs was 4.84, with a range of 1.32 to 7.00 and an interquartile range of 4.17 to 5.58. Scores had good coverage across the range of possible scores, although program participants were not likely to report extremely low sustainability scores (
Density plot (frequency) of the variability of Program Sustainability Assessment Tool (PSAT) scores across 252 public health programs participating in tests of the PSAT.
Analyses looking at the relationship between the overall sustainability scores and a small set of organizational and individual-level covariates showed that the sustainability scores obtained from the PSAT are significantly related to 2 important organizational predictors: type of program (F4,587 = 3.33,
Program Sustainability Assessment Tool domain scores by level of program and type of program among programs participating in tests of the tool.
Finally, we conducted a simple construct validity analysis. A sample of 205 of the pilot participants were asked, after filling out the PSAT and before receiving their results, to indicate their perception of their program’s general sustainability. If the PSAT were a valid measure of program sustainability, we would expect PSAT sustainability scores to be positively related to what program managers and staff perceive is the sustainability of the program. The correlations between the perceived sustainability scores and the PSAT overall sustainability scores as well as the 8 domain scores were as follows: overall = 0.68, Political Support = 0.48, Funding Stability = 0.67, Partnerships = 0.44, Organizational Capacity = 0.58, Program Evaluation = 0.45, Program Adaptation = 0.32, Communications = 0.55, Strategic Planning = 0.63. The formal PSAT sustainability scores had a moderate positive correlation with perceived program sustainability. The Funding Stability and Strategic Planning domains were most closely associated with general perceived program sustainability.
The results of our psychometric study of the PSAT indicated that the tool is reliable and ready to use for assessing a program’s capacity for sustainability by researchers, evaluators, and program managers and staff. The confirmatory factor analyses show that even with only 40 items, the PSAT is able to capture the distinct elements of program sustainability suggested by previous conceptual work (
The PSAT was designed to be easy to use by a wide variety of public health and other social service programs. It is short, has a consistent structure that facilitates training, and can be used for numerous purposes including program monitoring, program evaluation, and strategic planning. A companion article in
There are numerous research and evaluation next steps for the PSAT. A strength of this study is the diversity of public health program types used to test its psychometrics. However, despite this diversity, the tool has been used only with chronic disease programs. On the basis of confirmatory factor analysis, the poorly performing domain of Public Health Impacts was dropped. By removing this public health–specific domain, the tool can now be used by social service and clinical care programs as well. In particular, the promise of sustainability assessment for large-scale health systems is intriguing (
A second area of future work is to further validate the PSAT. Although the simple validation data presented here suggest that the sustainability scores obtained with the PSAT are associated with important organizational and program characteristics, this finding needs to be explored in more detail. For example, our data showed that across more than 250 programs, state programs have lower sustainability scores than community programs. Future work can tell us if this is a typical result and help to uncover the underlying causes. It is not surprising that state and local programs may differ in their sustainability. State programs are typically larger, exist in more diverse political environments, have broader partnership structures, and have typically been in existence longer than local programs, which tend to have shorter life cycles. More research is needed to identify the sustainability differences across program types.
The ultimate validation challenge for the PSAT is to use it to predict and understand long-term sustainability outcomes for public health and other types of programs (
This project was funded by the Centers for Disease Control and Prevention (CDC), Office on Smoking and Health, contract no. 200-2011-42065. The authors extend special thanks to Monica Eischen and Shawna Shields of CDC’s Office on Smoking and Health and the project’s advisory committee for their guidance during the development of the tool.
The opinions expressed by authors contributing to this journal do not necessarily reflect the opinions of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors' affiliated institutions.