<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.3 20210610//EN" "JATS-archivearticle1-3-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="1.3" xml:lang="en" article-type="research-article"><?properties manuscript?><processing-meta base-tagset="archiving" mathml-version="3.0" table-model="xhtml" tagset-family="jats"><restricted-by>pmc</restricted-by></processing-meta><front><journal-meta><journal-id journal-id-type="nlm-journal-id">101532881</journal-id><journal-id journal-id-type="pubmed-jr-id">37796</journal-id><journal-id journal-id-type="nlm-ta">Stat Methods Appt</journal-id><journal-id journal-id-type="iso-abbrev">Stat Methods Appt</journal-id><journal-title-group><journal-title>Statistical methods &#x00026; applications</journal-title></journal-title-group><issn pub-type="ppub">1618-2510</issn><issn pub-type="epub">1613-981X</issn></journal-meta><article-meta><article-id pub-id-type="pmid">39664166</article-id><article-id pub-id-type="pmc">11633323</article-id><article-id pub-id-type="doi">10.1007/s10260-022-00678-7</article-id><article-id pub-id-type="manuscript">HHSPA1980368</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>Hierarchical Bayes small area estimation for county-level health prevalence to having a personal doctor</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Andreea L.</surname><given-names>Erciulescu</given-names></name><xref rid="CR1" ref-type="corresp">*</xref><xref rid="A1" ref-type="aff">1</xref></contrib><contrib contrib-type="author"><name><surname>Jianzhu</surname><given-names>Li</given-names></name><xref rid="A1" ref-type="aff">1</xref></contrib><contrib contrib-type="author"><name><surname>Tom</surname><given-names>Krenzke</given-names></name><xref rid="A1" ref-type="aff">1</xref></contrib><contrib contrib-type="author"><name><surname>Machell</surname><given-names>Town</given-names></name><xref rid="A2" ref-type="aff">2</xref></contrib></contrib-group><aff id="A1"><label>1</label>Westat, Maryland, United States</aff><aff id="A2"><label>2</label>Population Health Surveillance Branch, Division of Population Health, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Georgia, United States</aff><author-notes><fn fn-type="con" id="FN1"><p id="P1">Author contributions</p><p id="P2">Dr. Erciulescu managed the development and implementation of the small area estimation models, reviewed and provided guidance on the implementation of weighting adjustments, imputation, survey direct estimation, and variable selection, and prepared the initial draft of this manuscript.</p><p id="P3">Dr. Li led the raking adjustments to survey weights, imputation, creation of survey estimates at various geographical levels, preparation of model covariates from different data sources, implementation of variable selection for the small area estimation models, as well as reviewed the small area predictions and diagnostics.</p><p id="P4">Mr. Krenzke managed the project in collaboration with the CDC, provided general guidance to align with available resources, and reviewed the various contributions to the project by Dr. Erciulescu and Dr. Li.</p><p id="P5">Dr. Town developed the initial scope of work, provided oversight from CDC and reviewed all manuscript revisions and reporting documents. She oversaw the production of data from the BRFSS on which this research is based.</p></fn><corresp id="CR1"><label>*</label><bold>Correspondence</bold>: Andreea L. Erciulescu, 1600 Research Blvd., Rockville, Maryland, 20850. <email>AndreeaErciulescu@westat.com</email></corresp></author-notes><pub-date pub-type="nihms-submitted"><day>2</day><month>4</month><year>2024</year></pub-date><pub-date pub-type="ppub"><month>12</month><year>2022</year></pub-date><pub-date pub-type="pmc-release"><day>11</day><month>12</month><year>2024</year></pub-date><volume>33</volume><fpage>1171</fpage><lpage>1191</lpage><abstract id="ABS1"><title>Summary</title><p id="P6">The complexity of survey data and the availability of data from auxiliary sources motivate researchers to explore estimation methods that extend beyond traditional survey-based estimation. The U.S. Centers for Disease Control and Prevention&#x02019;s Behavioral Risk Factor Surveillance System (BRFSS) collects a wide range of health information, including whether respondents have a personal doctor. While the BRFSS focuses on state-level estimation, there is demand for county-level estimation of health indicators using BRFSS data. A hierarchical Bayes small area estimation model is developed to combine county-level BRFSS survey data with county-level data from auxiliary sources, while accounting for various sources of error and nested geographical levels. To mitigate extreme proportions and unstable survey variances, a transformation is applied to the survey data. Model-based county-level predictions are constructed for prevalence of having a personal doctor for all the counties in the U.S., including those where BRFSS survey data were not available. An evaluation study using only the counties with large BRFSS sample sizes to fit the model versus using all the counties with BRFSS data to fit the model is also presented.</p></abstract><kwd-group><kwd>Behavioral Risk Factor Surveillance System</kwd><kwd>disaggregation</kwd><kwd>hierarchical Bayes</kwd><kwd>multiple data sources</kwd><kwd>nested levels</kwd></kwd-group></article-meta></front><body><sec id="S1"><label>1 |</label><title>INTRODUCTION</title><p id="P7">The U.S. Centers for Disease Control and Prevention&#x02019;s (CDC&#x02019;s) Behavioral Risk Factor Surveillance System (BRFSS) collects a wide range of health information, including whether respondents have a personal doctor. BRFSS data are used by state health departments to plan interventions, allocate scarce resources, and provide information to the general public. While the BRFSS focuses on state-level estimation, there is demand for county-level estimation of health indicators using BRFSS data. The ability to make the most productive use of the BRFSS data and to improve county-level planning based on BRFSS data is essential.</p><p id="P8">The overall large sample size of BRFSS allows for county-level survey-based estimation of health indicators. However, survey-based estimates can only be produced for counties with sample survey data available and are subject to high uncertainty for counties with small sample sizes. The sparsity of the survey data in some counties and the availability of data from auxiliary sources motivate researchers to explore estimation methods that extend beyond the traditional survey-based estimation. For example, model-based small area estimation (SAE) provides a principled way to combine survey and auxiliary data, while exploring the relationship between the outcome of interest and the auxiliary information, and accounting for the sources of uncertainty in the model components.</p><p id="P9">There are two main classes of SAE models: unit-level models, introduced in Battese, Harter, and Fuller (1988) <sup><xref rid="R1" ref-type="bibr">1</xref></sup>, and area-level models, introduced in Fay and Herriot (1979) <sup><xref rid="R2" ref-type="bibr">2</xref></sup>. The first class of SAE models take as input unit-level survey and auxiliary data, while the second class of SAE models take as input area-level survey estimates and auxiliary data. The area-level SAE models are often preferred because of their practical applicability, while overcoming challenges related to the availability of auxiliary data, linking between the survey unit and the auxiliary source unit, inclusion of survey design effects, and others (for example, definition of unit and computational resources). These models are fit to the set of areas with sample data, but estimates can be constructed for all the areas of interest.</p><p id="P10">In this paper, an area-level SAE model is developed to combine county-level BRFSS survey data with county-level data available from auxiliary sources, for the purpose of estimating the proportions of individuals who do not have a personal care provider or doctor. The areas of interest comprise of the set of U.S. counties (3,142 counties), but the model is fit only to the set of counties for which BRFSS data on having or not a personal care provider or doctor are available (3,115 counties). For evaluation purposes, the model is also fit to the set of counties with BRFSS sample size of at least 500, also known as the Selected Metropolitan/Micropolitan Area Risk Trends (SMART) counties (213 counties). Model-based county-level estimates are produced for all the U.S. counties.</p><p id="P11">SAE models have been considered in the past for estimating BRFSS quantities. We first review some of the unit-level models studies. For county-level estimation of prevalence of self-reported diagnosed diabetes, Cadwell et al. (2010)<sup><xref rid="R3" ref-type="bibr">3</xref></sup> developed a census region-specific model taking as input BRFSS survey data at a level defined by the cross-tabulation of age, sex, race/ethnicity, and county; this model fits under the unit-level modeling framework. For county-level estimation of chronic obstructive pulmonary disease prevalence, Zhang et al. (2014) <sup><xref rid="R4" ref-type="bibr">4</xref></sup> developed a model taking as input BRFSS survey data at a level defined by the cross-tabulation of age, sex, race/ethnicity, county, and state, with extensions to finer geographies including census blocks and tracts; this model fits under the unit-level modeling framework. The model in Zhang et al. (2014) <sup><xref rid="R4" ref-type="bibr">4</xref></sup> was then applied to BRFSS data on health status and access indicators, after removing sex and state from the definition of the domain level (see Pierannunzi et al., 2016<sup><xref rid="R5" ref-type="bibr">5</xref></sup>), to BRFSS data on colorectal cancer screening prevalence (see Berkowitz et al., 2018 <sup><xref rid="R6" ref-type="bibr">6</xref></sup>), and to BRFSS data on mammography screening rates, after removing sex from the definition of the domain level (see Berkowitz et al., 2019 <sup><xref rid="R7" ref-type="bibr">7</xref></sup>). Holt et al. (2019) <sup><xref rid="R8" ref-type="bibr">8</xref></sup> developed a similar model to the one in Zhang et al. (2014) <sup><xref rid="R4" ref-type="bibr">4</xref></sup> for estimating the number of at-risk community-dwelling adults with a chronic condition (chronic obstructive pulmonary disease), while combining BRFSS and National Hurricane Center (NHC) data. As documented in these studies, the current methods are not capturing the BRFSS complex survey design effects, hence are lacking at accounting for a key source of uncertainty in the survey data. Moreover, the pool of auxiliary data is constrained to BRFSS demographic variables for most of these studies. Unlike these studies, we explore a large pool of auxiliary data and adopt an area-level modeling approach that starts with BRFSS survey estimates and accounts for the sampling variability in the survey data.</p><p id="P12">Raghunathan et al. (2007) <sup><xref rid="R9" ref-type="bibr">9</xref></sup> developed a joint county-level model for BRFSS and the National Health Interview Survey (NHIS) current-smoking and mammography screening rates; this model fits under the area-level modeling framework. For estimating county-level smoking and screening for female breast cancer, cervical cancer, and colorectal cancer rates, Liu et al. (2019) <sup><xref rid="R10" ref-type="bibr">10</xref></sup> extended the model in Raghunathan et al. (2007) 9 that com bines BRFSS and NHIS data. Like in Raghunathan et al. (2007) <sup><xref rid="R9" ref-type="bibr">9</xref></sup>, we also model the arcsine-square-root-transformed survey-based county-level estimates, in order to mitigate unstable survey variances and help support a normality assumption for the survey-based county-level esti mates. As a result, the sampling variances for the transformed survey estimates are functions of the effective sample sizes. For counties with small sample sizes, the authors imputed the design effect and set a lower bound for the effective sample sizes. To mitigate such scenarios, as well as extreme survey-based proportions, we approximate the effective sample sizes using Kish&#x02019;s approximation (see Kish, 1965 <sup><xref rid="R11" ref-type="bibr">11</xref></sup>).</p><p id="P13">Multi-fold SAE models have been studied in the literature to further account for the nested data structure of the survey data and allow for reliable estimation at various levels of aggregation. Area-level multi-fold SAE models were initially introduced in an unpublished manuscript by Fuller and Goyeneche (1998)<sup><xref rid="R12" ref-type="bibr">12</xref></sup>, as two-fold models, and then considered in Torabi and Rao (2014)<sup><xref rid="R13" ref-type="bibr">13</xref></sup>. Both of these studies employed frequentist estimation of model parameters. Erciulescu, Cruze, and Nandram (2020)<sup><xref rid="R14" ref-type="bibr">14</xref></sup> specified a two-fold SAE model as a hierarchical Bayes model and derived closed-form expressions for model predictions for counties with survey data. Krenzke et al. (2020) <sup><xref rid="R15" ref-type="bibr">15</xref></sup> applied three-fold SAE models specified as hierarchical Bayes models to predict U.S. adult competency at the county-level. We consider Bayesian inference to be a straightforward inferential approach for multi-fold SAE models, while allowing for the construction of full posterior distributions for quantities of interest. In addition, we model county-level BRFSS and auxiliary data, while accounting for the nested structure of counties within states and states within census divisions and allowing for reliable estimation at these three levels: county, state, and census division. In this aspect, the hierarchical Bayes SAE model developed in this paper is similar in structure to the ones considered in Krenzke et al. (2020) <sup><xref rid="R15" ref-type="bibr">15</xref></sup>.</p><p id="P14">The rest of the paper is organized as follows. In <xref rid="S2" ref-type="sec">Section 2</xref>, we describe the data available for the application study and the initial survey estimation and variable selection steps necessary to prepare the model input data. The hierarchical Bayes SAE model is presented in <xref rid="S3" ref-type="sec">Section 3</xref>, along with a framework for model fit, internal validation, estimation and prediction, and comparison. Final selected results are provided in <xref rid="S6" ref-type="sec">Section 4</xref> and include external validation checks. A general discussion is given in <xref rid="S8" ref-type="sec">Section 5</xref>.</p></sec><sec id="S2"><label>2 |</label><title>DATA FOR THE APPLICATION STUDY</title><p id="P15">The survey data for the application consist of micro-level BRFSS data for the reference year 2018, subject to unit-level adjustments described next. Using the BRFSS information on whether or not an individual has a personal care provider or doctor, an indicator variable is constructed having these two categories. The item nonresponse rate for this indicator variable is 0.64%. Missing values are imputed using a hot-deck imputation approach. The survey weights are adjusted using a raking procedure to align the county-level population totals estimated from the survey to corresponding fixed controls available from the 2014&#x02013;2018 American Community Survey (ACS) Summary File (also known as Detailed Tables). The resulting micro-level data are used to produce county-level survey estimates for proportions of individuals who do not have a personal care provider or doctor. Specifically, H&#x000e1;jek-type point estimators and associated Taylor series variance estimators are constructed for all the counties with available survey data. Let these be denoted by county-level pairs <inline-formula><mml:math id="M1" display="inline"><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> where <inline-formula><mml:math id="M2" display="inline"><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>m</mml:mi></mml:math></inline-formula> indexes the census divisions, <inline-formula><mml:math id="M3" display="inline"><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> indexes the states in census division <inline-formula><mml:math id="M4" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M5" display="inline"><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> indexes the counties in state <inline-formula><mml:math id="M6" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> and census division <inline-formula><mml:math id="M7" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>. Let the county-level population totals from the 2014&#x02013;2018 ACS Summary File be denoted by <inline-formula><mml:math id="M8" display="inline"><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>.</p><p id="P16">County-level survey estimates and associated variances on the arcsine-square-root scale serve as inputs into the models described in the next section. Let these be denoted by county-level pairs <inline-formula><mml:math id="M9" display="inline"><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> where
<disp-formula id="FD1">
<mml:math id="M10" display="block"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:msup><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:msqrt><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:msqrt><mml:mo>,</mml:mo></mml:math>
</disp-formula>
and
<disp-formula id="FD2">
<mml:math id="M11" display="block"><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>&#x02254;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>4</mml:mn><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:math>
</disp-formula></p><p id="P17">The county-level effective sample sizes are approximated as
<disp-formula id="FD3">
<mml:math id="M12" display="block"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>e</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02248;</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>a</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:msubsup><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:msub><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>a</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M13" display="inline"><mml:msubsup><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is the adjusted survey weight associated with the individual <inline-formula><mml:math id="M14" display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula> in county <inline-formula><mml:math id="M15" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>.</p><p id="P18">The pool of variables related to the transformed proportion of individuals who do not have a personal care provider or doctor consists of demographic characteristics (i.e., race/ethnicity, age, sex, marital status), socioeconomic status (i.e., poverty, income, employment status, occupation), education (i.e., education, English-speaking ability), location (i.e., urbanicity), immigration status (i.e., length of stay for foreign-born people, migration), health (i.e., insurance, health professions, facilities), and other (i.e., journey to work, housing unit tenure, tax, birth rate, fertility rate, infant mortality rate, crime rate, Federal aid, energy consumption). A list of data sources and auxiliary variables is provided in the <xref rid="T5" ref-type="table">Appendix Table 1</xref>.</p><p id="P19">As the initial step in the variable selection process, we identify the most complete and less prone to error county-level auxiliary information. Next, a correlation analysis is conducted to reduce the pool of auxiliary variables to those identified to be highly correlated with the transformed proportion of individuals who do not have a personal care provider or doctor and minimally correlated with other auxiliary variables in the pool. Finally, a small set of auxiliary variables is selected using the least absolute shrinkage and selection operator (LASSO), with 20-fold cross-validation. Two choices are considered for the LASSO shrinkage/penalty parameters, both close to the optimal value that minimizes the mean cross-validated error but resulting in different numbers of selected variables. The two corresponding models constructed with these two sets of covariates will be referred to as the full model and reduced model, respectively. The full model contains a larger number of covariates than the reduced model does. Irrespective of the model, let the matrix of covariates be denoted by <inline-formula><mml:math id="M16" display="inline"><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>; its rows correspond to counties with survey sample and its columns correspond to an intercept, followed by the selected variables. The exact number of counties with survey sample and the exact number of selected variables used to fit the models are discussed in the next section.</p></sec><sec id="S3"><label>3 |</label><title>MODELS</title><p id="P20">The overall structure of the proposed small area model follows closely the structure of the small area model for average adult proficiency presented in Krenzke et al. (2020) <sup><xref rid="R15" ref-type="bibr">15</xref></sup>. First, a sampling level is specified for the survey estimates on the transformed scale:
<disp-formula id="FD4">
<mml:math id="M17" display="block"><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">v</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">|</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M18" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are the county-level quantities of interest on the transformed scale, i.e., the transformed proportions of individuals who do not have a personal care provider or doctor. Then, we borrow strength from auxiliary data and across small areas, while accounting for the nested structure of the data, via a linking level:
<disp-formula id="FD5">
<mml:math id="M19" display="block"><mml:mi mathvariant="normal">L</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">k</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">v</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mo>(</mml:mo><mml:mi>&#x003b2;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mi>&#x003b2;</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD6">
<mml:math id="M20" display="block"><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD7">
<mml:math id="M21" display="block"><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD8">
<mml:math id="M22" display="block"><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M23" display="inline"><mml:mi>&#x003b2;</mml:mi></mml:math></inline-formula> is a vector of unknown regression coefficients, and <inline-formula><mml:math id="M24" display="inline"><mml:msub><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and <inline-formula><mml:math id="M25" display="inline"><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are the county, state, and census division latent effects, respectively, assumed to follow normal distributions with zero means and unknown variances <inline-formula><mml:math id="M26" display="inline"><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula>, and <inline-formula><mml:math id="M27" display="inline"><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup></mml:math></inline-formula>, respectively.</p><p id="P21">To fully specify the model as a hierarchical Bayes model, we adopt independent weakly informative priors for the unknown model parameters <inline-formula><mml:math id="M28" display="inline"><mml:mi>&#x003b2;</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and <inline-formula><mml:math id="M29" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>:</p><p id="P22">
<inline-formula>
<mml:math id="M30" display="inline"><mml:mi mathvariant="normal">P</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mo>:</mml:mo><mml:mspace width="0.25em"/><mml:mi>&#x003b2;</mml:mi><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mn>10</mml:mn></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi mathvariant="normal">w</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">e</mml:mi></mml:math>
</inline-formula>
</p><p id="P23">
<inline-formula>
<mml:math id="M31" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>~</mml:mo><mml:mi mathvariant="normal">C</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">u</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">h</mml:mi><mml:mi mathvariant="normal">y</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mn>0,5</mml:mn></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi mathvariant="normal">w</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mo>.</mml:mo></mml:math>
</inline-formula>
</p><p id="P24">These choices of prior distributions ensure that little information about the values of these parameters is provided to the model, the data (likelihood) having the major role in shaping the posterior distributions. See Krenzke et al. (2020)<sup><xref rid="R15" ref-type="bibr">15</xref></sup>, Browne and Draper (2006)<sup><xref rid="R16" ref-type="bibr">16</xref></sup>, Gelman (2006) <sup><xref rid="R17" ref-type="bibr">17</xref></sup>, and Polson and Scott (2012)<sup><xref rid="R18" ref-type="bibr">18</xref></sup>, for some recent discussions about the choice of prior distribution for the scale parameter in a univariate hierarchical Bayes model.</p><sec id="S4"><label>3.1 |</label><title>Fit</title><p id="P25">Four models are developed, depending on the number of counties with survey sample used and on the number of variables selected. For evaluation purposes only, we consider two sets of counties with survey data to fit the model presented above: one set comprises of all but 27 counties in the United States and corresponds to all the counties with BRFSS survey data (3,114 counties and the District of Columbia), and the other set comprises of counties with BRFSS sample size of at least 500 and corresponds to all the BRFSS SMART counties (213 counties). The variable selection steps are specific to the model, so the process presented above is conducted twice, each variable selection process corresponding to the two sets of counties used in the model fit. To summarize, we fit a pair of full and reduced models for all the counties with survey sample and another pair of full and reduced models for the SMART counties. The two pairs of full and reduced model fits are compared using the widely applicable information criterion (WAIC); smaller values indicate better model fit.</p><p id="P26">Each model is specified as an R STAN object and it is fit using Markov chain Monte Carlo (MCMC), with three chains, each chain starting with 20,000 samples, of which 5,000 samples are burned in. To reduce the autocorrelation in the chains used to make inference, we thin each of the chains so that every 10<italic toggle="yes">th</italic> iteration is kept and the rest are discarded. The final set of <inline-formula><mml:math id="M32" display="inline"><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mn>4</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>500</mml:mn></mml:math></inline-formula> samples is used for inference.</p><p id="P27">Internal validation checks are conducted iteratively with the model development process for each model. In particular, we follow Erciulescu and Opsomer (2019) <sup><xref rid="R19" ref-type="bibr">19</xref></sup> and implement mixing and converge diagnostics for the MCMC sampler, residual diagnostics for the normality assumptions, and posterior predictive checks for the normality and linearity assumptions, as well as the correlation between the response variable and the first covariate. As a result, the tuning parameters used in the model fit are specific to each model. Also, the final model specification ensures that the <inline-formula><mml:math id="M33" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> posterior samples are valid for inference.</p></sec><sec id="S5"><label>3.2 |</label><title>Prediction</title><p id="P28">Define in-sample counties as the set of counties used to fit the model. The rest of the counties of interest comprise the not-in-sample counties. The model predictions for the in-sample counties are composites of survey estimates and model-synthetic predictions based on the hierarchical Bayes model presented above. Survey estimates with large associated variance estimates (or small approximated effective sample sizes), relative to all the variance estimates, are smoothed more than the others, towards a common trend prediction that depends on the linear relationship assumed between the survey estimates and the covariates. As a result, the model predictions for such counties deviate from the survey estimates: point estimates may be either smaller or larger, and variance estimates are smaller. At the same time, also a result of the smoothing, the model predictions for counties with small variance estimates (or large approximated effective sample sizes) remain close to the survey estimates; both the point estimates and their associated variance estimates. For example, the model predictions are closer to the corresponding survey estimates for SMART counties, compared to the rest of the counties. Using the <inline-formula><mml:math id="M34" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M35" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mi>R</mml:mi></mml:math></inline-formula>, from the posterior predictive distribution <inline-formula><mml:math id="M36" display="inline"><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">&#x003c3;</mml:mi></mml:mrow></mml:mfenced></mml:math></inline-formula> posterior summaries such as means, variances, credible intervals, are constructed for the transformed proportion of individuals without a personal doctor, for all the counties.</p><p id="P29">Compared to the prediction for in-sample counties, prediction for not-in-sample counties relies more heavily on the model structure and on the covariates available for these counties. The contribution of the survey estimates to these predictions is only present via the nested structure of the model, because there is no contribution from the survey (the county-level survey estimates are either not available for these counties, or are not used as is the case with our model fitted only to the SMART counties); see Erciulescu et al. (2020)<sup><xref rid="R14" ref-type="bibr">14</xref></sup> for derived expressions for the model predictions under a two-fold small area estimation model.</p><p id="P30">For a not-in-sample county <inline-formula><mml:math id="M37" display="inline"><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>, where the county belongs to a state <inline-formula><mml:math id="M38" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> already represented in the set of in-sample counties, we generate <inline-formula><mml:math id="M39" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M40" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mi>R</mml:mi></mml:math></inline-formula>, from the distribution assumed in the linking model with parameters evaluated at the <inline-formula><mml:math id="M41" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> posterior samples, <inline-formula><mml:math id="M42" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>&#x003b2;</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. If the not-in-sample county <inline-formula><mml:math id="M43" display="inline"><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> belongs to a state <inline-formula><mml:math id="M44" display="inline"><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> for which no other county component is represented in the in-sample counties, but it belongs to a census division <inline-formula><mml:math id="M45" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> already represented in the in-sample counties, we generate <inline-formula><mml:math id="M46" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M47" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mi>R</mml:mi></mml:math></inline-formula> in two steps: first, we generate state latent effect samples <inline-formula><mml:math id="M48" display="inline"><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> from <inline-formula><mml:math id="M49" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>; second, we generate <inline-formula><mml:math id="M50" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M51" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> from <inline-formula><mml:math id="M52" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>&#x003b2;</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Finally, if the not-in-sample county <inline-formula><mml:math id="M53" display="inline"><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> belongs to a state <inline-formula><mml:math id="M54" display="inline"><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> that belongs to a census division <inline-formula><mml:math id="M55" display="inline"><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> for which no other county component is represented in the in-sample set of counties, we generate <inline-formula><mml:math id="M56" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M57" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mi>R</mml:mi></mml:math></inline-formula> in three steps: first, we generate census division latent effect samples <inline-formula><mml:math id="M58" display="inline"><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> from <inline-formula><mml:math id="M59" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>; second, we generate state latent effect samples <inline-formula><mml:math id="M60" display="inline"><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> from <inline-formula><mml:math id="M61" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>; third, we generate <inline-formula><mml:math id="M62" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M63" display="inline"><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> from <inline-formula><mml:math id="M64" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi>&#x003b2;</mml:mi></mml:mrow><mml:mrow><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:msup><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">*</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Note that this final scenario is included here for methodology completion, but it was not encountered in the application modeling process.</p><p id="P31">To construct model predictions on the original scale, we back-transform the posterior samples using a squared-sine transformation, <inline-formula><mml:math id="M65" display="inline"><mml:msubsup><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msubsup><mml:mo>:</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, for all the in-sample and not-in-sample counties. Then, county-level posterior summaries such as means, variances, credible intervals are constructed using the <inline-formula><mml:math id="M66" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples <inline-formula><mml:math id="M67" display="inline"><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>.</p><p id="P32">State-level, census division-level, and nation-level model predictions are constructed as aggregates of the county-level model predictions, with ACS county-level population totals <inline-formula><mml:math id="M68" display="inline"><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> serving as aggregation weights. On the original scale, the state-level model predictions are constructed using the corresponding <inline-formula><mml:math id="M69" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> samples. The state-level samples are defined as <inline-formula><mml:math id="M70" display="inline"><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mo>=</mml:mo><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, the census division-level samples are defined as <inline-formula><mml:math id="M71" display="inline"><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mo>=</mml:mo><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and the nation-level samples are defined as <inline-formula><mml:math id="M72" display="inline"><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mo>=</mml:mo><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:msub><mml:mrow><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003b6;</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>.</p><p id="P33">The predictive power of the models is tested using cross-validation. For this, the in-sample counties are divided into two groups with similar distribution of sample sizes. Each model is fitted to one of the two groups of counties, in turn, and the counties in the other group are considered not-in-sample. The model predictions for the not-in-sample counties are then compared against the survey estimates, for groups of counties with large sample size.</p></sec></sec><sec id="S6"><label>4. |</label><title>RESULTS</title><p id="P34">Variable selection and cross-validation results are presented in <xref rid="T1" ref-type="table">Tables 1</xref> and <xref rid="T2" ref-type="table">2</xref>, for the models fitted to all the counties with sample data and for the models fitted to the SMART counties, respectively. The auxiliary variables selected in the models fitted to all the counties with sample data are available at the county-level from ACS, Small Area Health Insurance Estimates (SAHIE), or Statistics of Income (SOI). The first six variables listed in <xref rid="T2" ref-type="table">Table 2</xref> are available at the county level, but the rest are only available at the state level. Compared to the auxiliary data selected for the models fitted to all the counties with sample data, data from three additional government agencies were selected for the models fitted to the SMART counties: the National Center for Health Statistics (NCHS), the National Highway Traffic Safety Administration (NHTSA), and the Energy Information Administration (EIA). Recall <xref rid="T5" ref-type="table">Table 1</xref> in the <xref rid="APP1" ref-type="app">Appendix</xref> with the information on the data sources. The full models have better predictive power than the reduced models, having smaller sum of squared differences between the model predictions and the survey estimates. The results from the model comparison using WAIC are also in favor of the full models, as shown in <xref rid="T3" ref-type="table">Table 3</xref>.</p><p id="P35">For the counties with available survey estimates, we compare the distributions of the survey estimates and the model predictions produced using the four models under investigation. The results are illustrated in <xref rid="F1" ref-type="fig">Figure 1</xref>. There are minor differences between the full and reduced models for each of the two pairs considered. Based on these results and the results from the model comparison and cross-validation, we conclude that the reduced models are no longer of interest.</p><p id="P36">As illustrated in <xref rid="F1" ref-type="fig">Figure 1</xref>, the point estimates are comparable across the different sources considered in this comparison, with exception being the main mode of the distributions: the model predictions fall in a narrower range than the range of the survey estimates, as a result of smoothing, with the model predictions based on the models fitted to the SMART counties being smoothed towards slightly larger values than the other model predictions. The uncertainty in the model predictions based on the models fitted to the SMART counties is noticeably smaller than both the uncertainty in the survey estimates and the uncertainty in the model predictions based on the models fitted to all the counties with sample data. This result is a consequence of constructing a much larger number of not-in-sample model predictions under the approach where only the SMART counties are used in the model fit, while ignoring the survey data available for the non-SMART counties.</p><p id="P37">To further evaluate the model fits using different sets of counties, we compare the estimates for large geographies: census divisions and nation. Pairwise comparisons of the survey and model predictions for the high levels of aggregation are presented in <xref rid="T4" ref-type="table">Table 4</xref>. Note that for most of the census divisions, there is closer agreement between the survey estimates and the model predictions based on the model fitted to all the counties with sample data than the model predictions based on the model fitted to the SMART counties. In addition, for the nation and most of the census divisions, the standard errors of the model predictions based on the model fitted to the SMART counties are larger than the survey standard errors. This result is a consequence of constructing a large number of positively highly correlated not-in-sample model predictions under the model fitted to the SMART counties only, which impacts the variance estimates at aggregated levels. Hence, we identify the model fitted to all the counties with sample data to be the preferred or final model. A map of all the county-level model predictions constructed based on the final model is illustrated in <xref rid="F2" ref-type="fig">Figure 2</xref>.</p><p id="P38">Finally, we note that the covariates selected do not present a very strong linear relationship to our outcome of interest (the proportion of individuals who do not have a personal care provider or doctor); the R squared for a multiple linear regression fitted to the county-level survey estimates and using the set of covariates selected for the model fitted to all the counties with sample data is approximately 0.08. Hence, it is important to use all the available county-level survey estimates in fitting the model, so that the number of not-in-sample predictions is as small as possible.</p><sec id="S7"><label>4.1 |</label><title>External validation</title><p id="P39">Various external validation checks are considered for the final model. For the in-sample counties, we compare the distributions of the survey estimates and the model predictions in terms of point estimates and coefficients of variation (CVs). The results are illustrated in <xref rid="F3" ref-type="fig">Figure 3</xref> for both the transformed and the original scales. Note that the ranges of the model predictions are narrower than the ranges of the survey estimates as a result of smoothing. The main modes of the overall distributions of point estimates overlap, with no noticeable difference between the distributions for in-sample county-level model predictions and all model predictions (for in-sample counties and not-in-sample counties). On the other hand, the main modes of the overall distributions of CVs are shifted to the left for the model predictions, compared to the survey estimates, as a result of model-based decrease in uncertainty.</p><p id="P40">Next, we investigate the distribution of the differences between the survey estimates and model predictions, and the ratios of survey to model standard errors, both relative to the county sample sizes. The results are illustrated in <xref rid="F4" ref-type="fig">Figure 4</xref> for both the transformed and the original scales. A dotted line is included in the plots to indicate the group of SMART counties: points to the right of the line correspond to SMART counties. Note that the survey estimates and model predictions are comparable for larger counties, while for counties with smaller sample sizes, the estimates differ: model predictions deviate from the survey estimates because the latter are not sufficiently precise, and the model standard errors/CVs are smaller than the survey standard errors/CVs (on the transformed scale) as a result of model shrinkage. The shrinkage effects are observed on the scale on which the models are fitted, i.e., the transformed proportions using the arcsine-square-root transformation, and are diminished after the back transformation to the original scale, i.e. the proportions. As a consequence, the model standard errors/CVs are smaller than the survey standard errors/CVs for most, but not all of the counties used in the model fitting process.</p><p id="P41">Pairwise comparisons of the survey estimates and model predictions for the high levels of aggregation are presented in <xref rid="T4" ref-type="table">Table 4</xref> Note that there is close agreement between the point estimates and standard errors, with slight reduction in the standard errors for some areas: census divisions Middle Atlantic, West South Central, and Mountain. The agreement in the point estimates suggests there are no necessary changes in the model specification. The reduction in the standard errors is a result of the model shrinkage, and its small magnitude is expected because at these high levels (nation and census division) the survey estimates are precise.</p><p id="P42">As a final step in the external validation process, we construct the percentage of counties with non-overlapping 95 percent uncertainty intervals for the survey estimates and model predictions defined as
<disp-formula id="FD9">
<mml:math id="M73" display="block"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000b1;</mml:mo><mml:msub><mml:mrow><mml:mn>2</mml:mn><mml:mi>&#x003c3;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.25em"/><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="normal">t</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">f</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mi mathvariant="normal">d</mml:mi><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mspace width="0.25em"/><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000b1;</mml:mo><mml:mn>2</mml:mn><mml:msqrt><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:msqrt><mml:mspace width="0.25em"/><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">n</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mspace width="0.25em"/><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">c</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">e</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo></mml:math>
</disp-formula>
and
<disp-formula id="FD10">
<mml:math id="M74" display="block"><mml:mfenced separators="|"><mml:mrow><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mn>0.025</mml:mn></mml:mrow></mml:mfenced></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003b8;</mml:mi></mml:mrow><mml:mo>~</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mn>0.975</mml:mn></mml:mrow></mml:mfenced></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:math>
</disp-formula>
respectively. For large counties, one would expect to see few or no non-overlapping intervals because the survey estimates in these counties are considered sufficiently precise and the model predictions should not deviate significantly from them. Since the survey estimates for smaller counties are not as reliable as the ones for larger counties, one can expect to see more non-overlapping intervals. For the SMART counties, all the uncertainty intervals overlap. Among the counties with sample sizes greater than or equal to 30, there are less than 2 percent non-overlapping uncertainty intervals. Among all the counties with sample, there are less than 4 percent non-overlapping uncertainty intervals. On the transformed scale, most (31 out of 40) of the non-overlapping uncertainty intervals correspond to counties with survey estimates of 0 or 1 on the original scale. This is expected, because the model predictions for counties with smaller sample sizes are shrunk more toward the common trend, than the large counties are, resulting in significantly different estimates from the survey estimates.</p></sec></sec><sec id="S8"><label>5 |</label><title>DISCUSSION</title><p id="P43">In summary, we developed model-based small area estimation methodology for the proportion of individuals with no personal care provider or doctor in the 2018 population. The models combine survey data from the BRFSS with data from other sources, while accounting for the error in the survey data and the nested structure of the data. Model predictions were constructed for all the counties in the nation, irrespective of the availability of survey data. For counties where survey estimates are available, the model predictions are more precise than the survey estimates. For evaluation purposes, the models were developed using either all the counties with survey data or only the counties with sample sizes at least 500. The former was preferred because it resulted in more accurate point estimates than the latter.</p><p id="P44">As part of the model development, we investigated other models and compared their performance to the performance of the model presented in this manuscript. Among these alternative model specifications, we investigated area-level models for the BRFSS survey estimates on the original scale or transformed using other transformations: square root and logit. Similarly to the transformation adopted for the proposed model, the square root and the logit transformations help relax a normality assumption for the survey estimates on the original scale. However, the logit transformation is only applicable to survey estimates greater than zero and smaller than one. Moreover, for the original scale of these two alternative transformations, the estimated variances associated with the survey estimates would not be defined for all the counties, either due to sample sizes or due to the transformation. Finally, unlike the transformation adopted for the proposed model, none of these three alternative model specifications would stabilize the sampling variances, so a smoothing step would be necessary, too.</p><p id="P45">Lacking unit-level auxiliary data for the entire population, we did not investigate any unit-level models. However, we did consider a hybrid-level model, where the sampling level is specified at the unit level and the linking level is specified at the area (county) level. As an initial step, we ignored the survey design effect and the imputation. That is, we specified the sampling level for the BRFSS unit-level raw data. The computational challenges led us to quickly set aside a nation-wide model, and instead fit a model to all the data available for one state. As part of the internal validation process, we noticed residuals with heavy-tailed distribution, so the model specification indicated there was room for improvement. Also, the state-level (aggregated) model prediction was significantly smaller than the state-level survey estimate, again indicating the model specification can be improved.</p><p id="P46">Future investigations may include closer attention to the hybrid-modeling and extensions to the model presented in this manuscript: (1) multivariate specifications, for example for modeling the BRFSS proportion of individuals who do not have a personal care provider or doctor and the proportion of individuals without health insurance jointly; and (2) measurement error specifications, to account for the uncertainty in the county-level covariates.</p></sec></body><back><ack id="S9"><title>ACKNOWLEDGMENTS</title><p id="P47">This work was conducted under a CDC-Westat project. The authors thank Carol Pierannunzi, the CDC&#x02019;s main contact for the project, for helpful discussions and comments.</p></ack><fn-group><fn fn-type="COI-statement" id="FN2"><p id="P53">Conflict of interest</p><p id="P54">The authors declare no potential conflict of interests.</p></fn><fn id="FN3"><p id="P55">Financial disclosure</p><p id="P56">The work described in this paper was conducted under contract with the Centers for Disease Control and Prevention (CDC Contract #HHSD2002013M53968B Order #75D30120F09442).</p></fn><fn id="FN4"><p id="P57">Disclaimer: The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.</p></fn></fn-group><bio id="d67e3148"><title>AUTHOR BIOGRAPHY</title><p id="P49"><bold>Andreea L. Erciulescu.</bold> Dr. Andreea Erciulescu is a Senior Statistician at Westat, working on the interface between survey statistics and other areas of statistics, including combining multiple survey and non-survey sources to answer complex analytic questions. Her areas of expertise include Bayesian statistics, hierarchical models, measurement error, official statistics, resampling methods, small area estimation, statistical data integration, and survey statistics. Dr. Erciulescu is an Elected Member of the International Statistical Institute.</p></bio><bio id="d67e3153"><p id="P50"><bold>Jianzhu Li.</bold> Dr. Li is a Senior Statistician in Westat&#x02019;s Statistics and Evaluation Sciences Unit and has 20 years of experience. She leads and works on survey sampling, small area estimation and data confidentiality and disclosure protection for several large scale household and establishment surveys for government agencies.</p></bio><bio id="d67e3157"><p id="P51"><bold>Tom Krenzke.</bold> Mr. Tom Krenzke is a Vice President and Associate Director in Westat&#x02019;s Statistics and Evaluation Sciences Unit and has 30 years of experience. Mr. Krenzke leads research in survey sampling, statistical confidentiality, small area estimation and other statistical topics. Mr. Krenzke is a Fellow of the American Statistical Association (ASA) and a Westat Senior Statistical Fellow.</p></bio><bio id="d67e3161"><p id="P52"><bold>Machell Town</bold> is the Branch Chief for the Population Health and Surveillance Branch of the Division of Population Health at the CDC. She has 30 years of experience in population estimation and statistics.</p></bio><app-group><app id="APP1"><title>APPENDIX</title><table-wrap position="anchor" id="T5"><label>TABLE 1</label><caption><p id="P48">Auxiliary data pool</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1">Source</th><th align="left" valign="top" rowspan="1" colspan="1">Auxiliary variable</th></tr><tr><th colspan="2" align="left" valign="bottom" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">Census Bureau</td><td align="left" valign="top" rowspan="1" colspan="1">5-year estimates (2013&#x02013;2017) of socioeconomic</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">American Community Survey (ACS)</td><td align="left" valign="top" rowspan="1" colspan="1">demographic, and housing characteristics</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Census Bureau</td><td align="left" valign="top" rowspan="1" colspan="1">small area estimates of selected</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Small Area Income and Poverty Estimates (SAIPE)</td><td align="left" valign="top" rowspan="1" colspan="1">income and poverty statistics</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Census Bureau</td><td align="left" valign="top" rowspan="1" colspan="1">health insurance coverage status</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Small Area Health Insurance Estimates (SAHIE)</td><td align="left" valign="top" rowspan="1" colspan="1">by selected economic and demographic characteristics</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">U.S. Department of Agriculture (USDA)<break/>Economic Research Service</td><td align="left" valign="top" rowspan="1" colspan="1">classification of counties into metro and non-metro (OMB subdivided counties into three metro and</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">six non-metro categories)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Bureau of Labor Statistics (BLS)</td><td align="left" valign="top" rowspan="1" colspan="1">monthly and annual employment, unemployment,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">The Local Area Unemployment Statistics (LAUS) program</td><td align="left" valign="top" rowspan="1" colspan="1">and labor force data</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Bureau of Economic Analysis (BEA)</td><td align="left" valign="top" rowspan="1" colspan="1">estimates of personal income for local areas</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Centers for Disease Control and Prevention</td><td align="left" valign="top" rowspan="1" colspan="1">updated statistics about diabetes</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Division of Diabetes Translation (DDT)</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Centers for Medicare &#x00026; Medicaid Services (CMS)</td><td align="left" valign="top" rowspan="1" colspan="1">utilization and quality of health care services for the Medicare fee-for-service population</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Geographic variation public use file</td><td align="left" valign="top" rowspan="1" colspan="1">income and tax data such as number of tax returns,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">The Internal Revenue Service The Statistics of Income (SOI) Data</td><td align="left" valign="top" rowspan="1" colspan="1">returns with unemployment compensation, returns with taxable Social Security benefit,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">adjusted gross personal income,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">personal unemployment compensation amount, and</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">personal taxable Social Security benefit amount, etc.</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Health Resources and Services Administration</td><td align="left" valign="top" rowspan="1" colspan="1">data on health care professions, health facilities,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Area Health Resources Files (AHRF)</td><td align="left" valign="top" rowspan="1" colspan="1">population characteristics, economics,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">health professions training, hospital utilization,</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">hospital expenditures, and environment</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">National Center for Health Statistics (NCHS)</td><td align="left" valign="top" rowspan="1" colspan="1">birth rate and infant mortality rate</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Federal Bureau of Investigation (FBI)</td><td align="left" valign="top" rowspan="1" colspan="1">crime rates</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">National Highway Traffic Safety Administration (NHTSA)</td><td align="left" valign="top" rowspan="1" colspan="1">traffic fatalities</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">U.S. Energy Information Administration (EIA)</td><td align="left" valign="top" rowspan="1" colspan="1">energy consumption height</td></tr></tbody></table></table-wrap></app></app-group><glossary><title>Abbreviations:</title><def-list><def-item><term>ACS</term><def><p id="P58">American Community Survey</p></def></def-item><def-item><term>BRFSS</term><def><p id="P59">Behavioral Risk Factor Surveillance System</p></def></def-item><def-item><term>CDC</term><def><p id="P60">Centers for Disease Control and Prevention</p></def></def-item><def-item><term>EIA</term><def><p id="P61">Energy Information Administration</p></def></def-item><def-item><term>MCMC</term><def><p id="P62">Markov chain Monte Carlo</p></def></def-item><def-item><term>NCHS</term><def><p id="P63">National Center for Health Statistics</p></def></def-item><def-item><term>NHTSA</term><def><p id="P64">National Highway Traffic Safety Administration</p></def></def-item><def-item><term>SAE</term><def><p id="P65">small area estimation</p></def></def-item><def-item><term>SAHIE</term><def><p id="P66">Small Area Health Insurance Estimates</p></def></def-item><def-item><term>SMART</term><def><p id="P67">Selected Metropolitan/Micropolitan Area Risk Trends</p></def></def-item><def-item><term>SOI</term><def><p id="P68">Statistics of Income</p></def></def-item><def-item><term>WAIC</term><def><p id="P69">widely applicable information criterion</p></def></def-item></def-list></glossary><ref-list><title>References</title><ref id="R1"><label>1.</label><mixed-citation publication-type="journal"><name><surname>Battese</surname><given-names>G</given-names></name>, <name><surname>Harter</surname><given-names>R</given-names></name>, <name><surname>Fuller</surname><given-names>W</given-names></name>. <article-title>An error-components model for prediction of county crop areas using survey and satellite data</article-title>. <source>Journal of the American Statistical Association</source>
<year>1988</year>; <volume>83</volume>: <fpage>28</fpage>&#x02013;<lpage>36</lpage>.</mixed-citation></ref><ref id="R2"><label>2.</label><mixed-citation publication-type="journal"><name><surname>Fay</surname><given-names>R</given-names></name>, <name><surname>Herriot</surname><given-names>R</given-names></name>. <article-title>Estimates of income for small places: an application of James-Stein procedures to census data</article-title>. <source>Journal of the American Statistical Association</source>
<year>1979</year>; <volume>74</volume>(<issue>366a</issue>): <fpage>269</fpage>&#x02013;<lpage>277</lpage>.</mixed-citation></ref><ref id="R3"><label>3.</label><mixed-citation publication-type="journal"><name><surname>Cadwell</surname><given-names>B</given-names></name>, <name><surname>Thompson</surname><given-names>T</given-names></name>, <name><surname>Boyle</surname><given-names>J</given-names></name>, <name><surname>Baker</surname><given-names>L</given-names></name>. <article-title>Bayesian small area estimation of diabetes prevalence by U.S. county</article-title>, <year>2005</year>. <source>Journal of Data Science 2010</source>; <volume>8</volume>: <fpage>173</fpage>&#x02013;<lpage>188</lpage>.</mixed-citation></ref><ref id="R4"><label>4.</label><mixed-citation publication-type="journal"><name><surname>Zhang</surname><given-names>Z</given-names></name>, <name><surname>Holt</surname><given-names>J</given-names></name>, <name><surname>Lu</surname><given-names>H</given-names></name>, <etal/>
<article-title>Multilevel regression and poststratification for small-area estimation of population health outcomes: A case study of chronic obstructive pulmonary disease prevalence using the Behavioral Risk Factor Surveillance System</article-title>. <source>American Journal of Epidemiology</source>
<year>2014</year>; <volume>179</volume>(<issue>8</issue>): <fpage>1025</fpage>&#x02013;<lpage>1033</lpage>.<pub-id pub-id-type="pmid">24598867</pub-id>
</mixed-citation></ref><ref id="R5"><label>5.</label><mixed-citation publication-type="journal"><name><surname>Pierannunzi</surname><given-names>C</given-names></name>, <name><surname>Xu</surname><given-names>F</given-names></name>, <name><surname>Wallace</surname><given-names>R</given-names></name>, <etal/>
<article-title>A methodological approach to small area estimation for the Behavioral Risk Factor Surveillance System</article-title>. <source>Preventing Chronic Disease</source>
<year>2016</year>; <volume>13</volume>(<issue>E91</issue>): <fpage>150480</fpage>.</mixed-citation></ref><ref id="R6"><label>6.</label><mixed-citation publication-type="journal"><name><surname>Berkowitz</surname><given-names>Z</given-names></name>, <name><surname>Zhang</surname><given-names>X</given-names></name>, <name><surname>Richards</surname><given-names>T</given-names></name>, <name><surname>Nadel</surname><given-names>M</given-names></name>, <name><surname>Peipins</surname><given-names>L</given-names></name>, <name><surname>Holt</surname><given-names>J</given-names></name>. <article-title>Multilevel small-area estimation of colorectal cancer screening in the United States</article-title>. <source>Cancer Epidemiology, Biomarkers and Prevention</source>
<year>2018</year>; <volume>27</volume>(<issue>3</issue>): <fpage>245</fpage>&#x02013;<lpage>253</lpage>.</mixed-citation></ref><ref id="R7"><label>7.</label><mixed-citation publication-type="journal"><name><surname>Berkowitz</surname><given-names>Z</given-names></name>, <name><surname>Zhang</surname><given-names>X</given-names></name>, <name><surname>Richards</surname><given-names>T</given-names></name>, <etal/>
<article-title>Multilevel regression for small-area estimation of mammography use in the United States, 2014</article-title>. <source>Cancer Epidemiology, Biomarkers and Prevention</source>
<year>2019</year>; <volume>28</volume>(<issue>1</issue>): <fpage>32</fpage>&#x02013;<lpage>40</lpage>.</mixed-citation></ref><ref id="R8"><label>8.</label><mixed-citation publication-type="journal"><name><surname>Holt</surname><given-names>J</given-names></name>, <name><surname>Matthews</surname><given-names>K</given-names></name>, <name><surname>Lu</surname><given-names>H</given-names></name>, <etal/>
<article-title>Small Area Estimates of Populations With Chronic Conditions for Community Preparedness for Public Health Emergencies</article-title>. <source>American Journal of Public Health</source>
<year>2019</year>; <volume>109</volume>(<issue>S4</issue>): <fpage>S325</fpage>&#x02013;<lpage>S331</lpage>.<pub-id pub-id-type="pmid">31505141</pub-id>
</mixed-citation></ref><ref id="R9"><label>9.</label><mixed-citation publication-type="journal"><name><surname>Raghunathan</surname><given-names>T</given-names></name>, <name><surname>Xie</surname><given-names>D</given-names></name>, <name><surname>Schenker</surname><given-names>N</given-names></name>, <etal/>
<article-title>Combining information from two surveys to estimate county-level prevalence rates of cancer risk factors and screening</article-title>. <source>Journal of the American Statistical Association</source>
<year>2007</year>; <volume>102</volume>: <fpage>474</fpage>&#x02013;<lpage>486</lpage>.</mixed-citation></ref><ref id="R10"><label>10.</label><mixed-citation publication-type="journal"><name><surname>Liu</surname><given-names>B</given-names></name>, <name><surname>Parsons</surname><given-names>V</given-names></name>, <name><surname>Feuer</surname><given-names>E</given-names></name>, <etal/>
<article-title>Small area estimation of cancer risk factors and screening behaviors in U.S. counties by combining two large national health surveys</article-title>. <source>Preventing Chronic Disease</source>
<year>2019</year>; <volume>16</volume>(<issue>E119</issue>): <fpage>190013</fpage>.</mixed-citation></ref><ref id="R11"><label>11.</label><mixed-citation publication-type="book"><name><surname>Kish</surname><given-names>L</given-names></name>
<source>Survey Sampling</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>John Wiley &#x00026; Sons, Inc.</publisher-name>
<year>1965</year>.</mixed-citation></ref><ref id="R12"><label>12.</label><mixed-citation publication-type="journal"><name><surname>Fuller</surname><given-names>W</given-names></name>, <name><surname>Goyeneche</surname><given-names>J</given-names></name>. <article-title>Estimation of the state variance component</article-title>. <source>Unpublished manuscript</source>
<year>1998</year>.</mixed-citation></ref><ref id="R13"><label>13.</label><mixed-citation publication-type="journal"><name><surname>Torabi</surname><given-names>M</given-names></name>, <name><surname>Rao</surname><given-names>J</given-names></name>. <article-title>On small area estimation under a sub-area level model</article-title>. <source>Journal of Multivariate Analysis</source>
<year>2014</year>; <volume>127</volume>: <fpage>36</fpage>&#x02013;<lpage>55</lpage>.</mixed-citation></ref><ref id="R14"><label>14.</label><mixed-citation publication-type="journal"><name><surname>Erciulescu</surname><given-names>A</given-names></name>, <name><surname>Cruze</surname><given-names>N</given-names></name>, <name><surname>Nandram</surname><given-names>B</given-names></name>. <article-title>Statistical challenges in combining survey and auxiliary data to produce official statistics</article-title>. <source>Journal of Official Statistics</source>
<year>2020</year>; <volume>36</volume>(<issue>1</issue>): <fpage>63</fpage>&#x02013;<lpage>88</lpage>.</mixed-citation></ref><ref id="R15"><label>15.</label><mixed-citation publication-type="book"><name><surname>Krenzke</surname><given-names>T</given-names></name>, <name><surname>Mohadjer</surname><given-names>L</given-names></name>, <name><surname>Li</surname><given-names>J</given-names></name>, <etal/>
<part-title>Program for the International Assessment of Adult Competencies (PIAAC): State and County Estimation Methodology Report</part-title>. <source>Tech. Rep. NCES2020225, U.S. Department of Education</source>; <publisher-loc>Rockville, MD</publisher-loc>: <publisher-name>Westat</publisher-name>: <year>2020</year>. <comment>Available at <ext-link xlink:href="https://nces.ed.gov/pubsearch/pubsinfo.asp?pubid=2020225" ext-link-type="uri">https://nces.ed.gov/pubsearch/pubsinfo.asp?pubid=2020225</ext-link></comment></mixed-citation></ref><ref id="R16"><label>16.</label><mixed-citation publication-type="journal"><name><surname>Browne</surname><given-names>W</given-names></name>, <name><surname>Draper</surname><given-names>D</given-names></name>. <article-title>A comparison of Bayesian and likelihood based methods for fitting multilevel models</article-title>. <source>Bayesian Analysis</source>
<year>2006</year>; <volume>1</volume>(<issue>3</issue>): <fpage>473</fpage>&#x02013;<lpage>514</lpage>.</mixed-citation></ref><ref id="R17"><label>17.</label><mixed-citation publication-type="journal"><name><surname>Gelman</surname><given-names>A</given-names></name>
<article-title>Prior distributions for variance parameters in hierarchical models (Comment on an article by Browne and Draper)</article-title>. <source>Bayesian Analysis</source>
<year>2006</year>; <volume>1</volume>(<issue>3</issue>): <fpage>515</fpage>&#x02013;<lpage>534</lpage>.</mixed-citation></ref><ref id="R18"><label>18.</label><mixed-citation publication-type="journal"><name><surname>Polson</surname><given-names>N</given-names></name>, <name><surname>Scott</surname><given-names>J</given-names></name>. <article-title>On the Half-Cauchy Prior for a Global Scale Parameter</article-title>. <source>Bayesian Analysis</source>
<year>2012</year>; <volume>7</volume>(<issue>4</issue>): <fpage>887</fpage>&#x02013;<lpage>902</lpage>.</mixed-citation></ref><ref id="R19"><label>19.</label><mixed-citation publication-type="confproc"><name><surname>Erciulescu</surname><given-names>A</given-names></name>, <name><surname>Opsomer</surname><given-names>J</given-names></name>. <source>A model-based approach to predict employee compensation components</source>. <conf-name>In: Joint Statistical Meetings Proceedings</conf-name>. <publisher-name>Government Statistics Section. American Statistical Association.</publisher-name>; <comment>July 27 &#x02013; August 1</comment>, <conf-date>2019</conf-date>; <conf-loc>Alexandria, VA</conf-loc>: <fpage>1601</fpage>&#x02013;<lpage>1623</lpage>.</mixed-citation></ref></ref-list></back><floats-group><fig position="float" id="F1"><label>FIGURE 1</label><caption><p id="P70">Comparison of county-level point estimates, standard errors, and CVs; models fitted to all the counties with sample data, models fitted to the SMART counties, and survey</p></caption><graphic xlink:href="nihms-1980368-f0001" position="float"/></fig><fig position="float" id="F2"><label>FIGURE 2</label><caption><p id="P71">Map of county-level model predictions: proportions of individuals with no personal care provider or doctor</p></caption><graphic xlink:href="nihms-1980368-f0002" position="float"/></fig><fig position="float" id="F3"><label>FIGURE 3</label><caption><p id="P72">Comparison of county-level model predictions and survey estimates: point estimates and CVs; transformed and original scales</p></caption><graphic xlink:href="nihms-1980368-f0003" position="float"/></fig><fig position="float" id="F4"><label>FIGURE 4</label><caption><p id="P73">Comparison of county-level model predictions and survey estimates: difference in point estimates and ratio of standard errors, relative to the sample size; transformed and original scales</p></caption><graphic xlink:href="nihms-1980368-f0004" position="float"/></fig><table-wrap position="float" id="T1"><label>TABLE 1</label><caption><p id="P74">Variable selection and cross-validation results for the models fitted to all the counties with sample data</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="bottom" rowspan="1" colspan="1">Variable</th><th align="left" valign="bottom" rowspan="1" colspan="1">Source</th><th align="left" valign="bottom" rowspan="1" colspan="1">Full model</th><th align="left" valign="bottom" rowspan="1" colspan="1">Reduced model</th></tr><tr><th colspan="4" align="left" valign="bottom" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of population age 25+: with high school diploma, no college</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of Hispanics</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of Non-Hispanic Whites</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of Native Americans</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of owner occupied housing unit</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of population 18+: in different house in the past year</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of population 18+: in different state in the past year</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of uninsured population</td><td align="left" valign="top" rowspan="1" colspan="1">SAHIE</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of returns with taxable Social Security benefits per person</td><td align="left" valign="top" rowspan="1" colspan="1">SOI</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td colspan="4" align="left" valign="bottom" rowspan="1">
<hr/>
</td></tr><tr><td colspan="4" align="center" valign="bottom" rowspan="1">Sum of squared differences between model predictions and survey estimates</td></tr><tr><td colspan="4" align="left" valign="bottom" rowspan="1">
<hr/>
</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">213 counties with sample size &#x02265; 500</td><td align="right" valign="top" rowspan="1" colspan="1"/><td align="right" valign="top" rowspan="1" colspan="1">0.350</td><td align="right" valign="top" rowspan="1" colspan="1">0.416</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">963 counties with sample size &#x02265; 100</td><td align="right" valign="top" rowspan="1" colspan="1"/><td align="right" valign="top" rowspan="1" colspan="1">4.092</td><td align="right" valign="top" rowspan="1" colspan="1">4.244</td></tr></tbody></table><table-wrap-foot><fn id="TFN1"><p id="P75">The variables included in the model are indicated by X.</p></fn></table-wrap-foot></table-wrap><table-wrap position="float" id="T2"><label>TABLE 2</label><caption><p id="P76">Variable selection and cross-validation results for the models fitted to the SMART counties</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="bottom" rowspan="1" colspan="1">Variable</th><th align="left" valign="bottom" rowspan="1" colspan="1">Source</th><th align="right" valign="bottom" rowspan="1" colspan="1">Full model</th><th align="right" valign="bottom" rowspan="1" colspan="1">Reduced model</th></tr><tr><th colspan="4" align="left" valign="bottom" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of Hispanics</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of Non-Hispanic Whites</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of population 55&#x02013;64 years old</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of population 18+: in different state in the past year</td><td align="left" valign="top" rowspan="1" colspan="1">ACS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of uninsured population</td><td align="left" valign="top" rowspan="1" colspan="1">SAHIE</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Proportion of returns with taxable Social Security benefits per person</td><td align="left" valign="top" rowspan="1" colspan="1">SOI</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Birth rate (live birth as a proportion of total population)</td><td align="left" valign="top" rowspan="1" colspan="1">NCHS</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Per capita energy consumption</td><td align="left" valign="top" rowspan="1" colspan="1">EIA</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Traffic fatalities per 100 million vehicle miles</td><td align="left" valign="top" rowspan="1" colspan="1">NHTSA</td><td align="right" valign="top" rowspan="1" colspan="1">X</td><td align="right" valign="top" rowspan="1" colspan="1">X</td></tr><tr><td colspan="4" align="left" valign="bottom" rowspan="1">
<hr/>
</td></tr><tr><td colspan="4" align="center" valign="bottom" rowspan="1">Sum of squared differences between model predictions and survey estimates</td></tr><tr><td colspan="4" align="left" valign="bottom" rowspan="1">
<hr/>
</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">56 counties with sample size &#x02265; 1, 000</td><td align="right" valign="top" rowspan="1" colspan="1"/><td align="right" valign="top" rowspan="1" colspan="1">0.094</td><td align="right" valign="top" rowspan="1" colspan="1">0.109</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">213 counties with sample size &#x02265; 500</td><td align="right" valign="top" rowspan="1" colspan="1"/><td align="right" valign="top" rowspan="1" colspan="1">0.409</td><td align="right" valign="top" rowspan="1" colspan="1">0.483</td></tr></tbody></table><table-wrap-foot><fn id="TFN2"><p id="P77">The variables included in the model are indicated by X.</p></fn></table-wrap-foot></table-wrap><table-wrap position="float" id="T3"><label>TABLE 3</label><caption><p id="P78">Model comparison results</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1">Model</th><th align="left" valign="top" rowspan="1" colspan="1">WAIC</th></tr><tr><th colspan="2" align="left" valign="bottom" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">All counties with sample data - full</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02212;0.7198</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">All counties with sample data - reduced</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02212;0.7177</td></tr><tr><td colspan="2" align="left" valign="bottom" rowspan="1">
<hr/>
</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">SMART counties - full</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02212;2.0346</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">SMART counties - reduced</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02212;2.0117</td></tr></tbody></table></table-wrap><table-wrap position="float" id="T4"><label>TABLE 4</label><caption><p id="P79">Comparison of nation-level and census division-level estimates, standard errors, and CVs; models fitted to all the counties with sample data (Model Nation), models fitted to the SMART counties (Model SMART), and survey</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th rowspan="3" align="left" valign="top" colspan="1">Nation/Census Division</th><th colspan="2" align="center" valign="top" rowspan="1">Model Nation</th><th colspan="2" align="center" valign="top" rowspan="1">Model SMART</th><th colspan="2" align="center" valign="top" rowspan="1">Survey</th></tr><tr><th colspan="6" align="left" valign="bottom" rowspan="1">
<hr/>
</th></tr><tr><th align="right" valign="top" rowspan="1" colspan="1">Point estimate</th><th align="right" valign="top" rowspan="1" colspan="1">Standard error</th><th align="right" valign="top" rowspan="1" colspan="1">Point estimate</th><th align="right" valign="top" rowspan="1" colspan="1">Standard error</th><th align="right" valign="top" rowspan="1" colspan="1">Point estimate</th><th align="right" valign="top" rowspan="1" colspan="1">Standard error</th></tr><tr><th colspan="7" align="left" valign="bottom" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">Nation</td><td align="right" valign="top" rowspan="1" colspan="1">0.228</td><td align="right" valign="top" rowspan="1" colspan="1">0.001</td><td align="right" valign="top" rowspan="1" colspan="1">0.229</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.230</td><td align="right" valign="top" rowspan="1" colspan="1">0.001</td></tr><tr><td colspan="7" align="left" valign="bottom" rowspan="1">
<hr/>
</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">New England</td><td align="right" valign="top" rowspan="1" colspan="1">0.144</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.140</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.143</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Middle Atlantic</td><td align="right" valign="top" rowspan="1" colspan="1">0.182</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.181</td><td align="right" valign="top" rowspan="1" colspan="1">0.006</td><td align="right" valign="top" rowspan="1" colspan="1">0.185</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">West North Central</td><td align="right" valign="top" rowspan="1" colspan="1">0.224</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.222</td><td align="right" valign="top" rowspan="1" colspan="1">0.005</td><td align="right" valign="top" rowspan="1" colspan="1">0.228</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">South Atlantic</td><td align="right" valign="top" rowspan="1" colspan="1">0.240</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.240</td><td align="right" valign="top" rowspan="1" colspan="1">0.008</td><td align="right" valign="top" rowspan="1" colspan="1">0.242</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">East South Central</td><td align="right" valign="top" rowspan="1" colspan="1">0.221</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td><td align="right" valign="top" rowspan="1" colspan="1">0.241</td><td align="right" valign="top" rowspan="1" colspan="1">0.014</td><td align="right" valign="top" rowspan="1" colspan="1">0.229</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">West South Central</td><td align="right" valign="top" rowspan="1" colspan="1">0.289</td><td align="right" valign="top" rowspan="1" colspan="1">0.006</td><td align="right" valign="top" rowspan="1" colspan="1">0.306</td><td align="right" valign="top" rowspan="1" colspan="1">0.008</td><td align="right" valign="top" rowspan="1" colspan="1">0.295</td><td align="right" valign="top" rowspan="1" colspan="1">0.007</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Mountain</td><td align="right" valign="top" rowspan="1" colspan="1">0.280</td><td align="right" valign="top" rowspan="1" colspan="1">0.003</td><td align="right" valign="top" rowspan="1" colspan="1">0.277</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td><td align="right" valign="top" rowspan="1" colspan="1">0.280</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Pacific</td><td align="right" valign="top" rowspan="1" colspan="1">0.252</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td><td align="right" valign="top" rowspan="1" colspan="1">0.241</td><td align="right" valign="top" rowspan="1" colspan="1">0.005</td><td align="right" valign="top" rowspan="1" colspan="1">0.252</td><td align="right" valign="top" rowspan="1" colspan="1">0.004</td></tr></tbody></table></table-wrap></floats-group></article>