<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties open_access?><front><journal-meta><journal-id journal-id-type="nlm-ta">Prev Chronic Dis</journal-id><journal-id journal-id-type="iso-abbrev">Prev Chronic Dis</journal-id><journal-id journal-id-type="publisher-id">PCD</journal-id><journal-title-group><journal-title>Preventing Chronic Disease</journal-title></journal-title-group><issn pub-type="epub">1545-1151</issn><publisher><publisher-name>Centers for Disease Control and Prevention</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="pmid">31198162</article-id><article-id pub-id-type="pmc">6583819</article-id><article-id pub-id-type="publisher-id">18_0441</article-id><article-id pub-id-type="doi">10.5888/pcd16.180441</article-id><article-categories><subj-group subj-group-type="heading"><subject>Original Research</subject></subj-group><series-title>Peer Reviewed</series-title></article-categories><title-group><article-title>Estimating County-Level Mortality Rates Using Highly Censored Data From CDC WONDER</article-title></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name><surname>Quick</surname><given-names>Harrison</given-names></name><degrees>PhD</degrees></contrib></contrib-group><author-notes><corresp id="cor1">Corresponding Author: Harrison Quick, PhD, Department of Epidemiology and Biostatistics, Drexel University, Philadelphia, PA 19104. Email: <email xlink:href="hsq23@drexel.edu">hsq23@drexel.edu</email>.</corresp></author-notes><pub-date pub-type="collection"><year>2019</year></pub-date><pub-date pub-type="epub"><day>13</day><month>6</month><year>2019</year></pub-date><volume>16</volume><elocation-id>E76</elocation-id><abstract><sec><title>Introduction</title><p>CDC WONDER is a system developed to promote information-driven decision making and provide access to detailed public health information to the general public. Although CDC WONDER contains a wealth of data, any counts fewer than 10 are suppressed for confidentiality reasons, resulting in left-censored data. The objective of this analysis was to describe methods for the analysis of highly censored data.</p></sec><sec><title>Methods</title><p>A substitution approach was compared with 1) a simple, nonspatial Bayesian model that smooths rates toward their statewide averages and 2) a more complex Bayesian model that accounts for spatial and between-age sources of dependence. Age group&#x02013;specific county-level data on heart disease mortality were used for the comparisons.</p></sec><sec><title>Results</title><p>Although the substitution and nonspatial approach provided age-standardized rate estimates that were more highly correlated with the true rate estimates, the estimates from the spatial Bayesian model provided a superior compromise between goodness-of-fit and model complexity, as measured by the deviance information criterion. In addition, the spatial Bayesian model provided rate estimates with greater precision than the nonspatial approach; in contrast, the substitution approach did not provide estimates of uncertainty.</p></sec><sec><title>Conclusion</title><p>Because of the ability to account for multiple sources of dependence and the flexibility to include covariate information, the use of spatial Bayesian models should be considered when analyzing highly censored data from CDC WONDER.</p></sec></abstract></article-meta></front><body><boxed-text id="Ba" position="float" orientation="portrait"><caption><title>Summary</title></caption><sec sec-type="other1"><title>What is already known on this topic?</title><p>Ignoring the impact of suppression due to small counts leads to biased inference.</p></sec><sec sec-type="other2"><title>What is added by this report?</title><p>This work describes and compares multiple approaches for analyzing highly suppressed data from CDC WONDER. R and WinBUGS code are provided to conduct the analyses.</p></sec><sec sec-type="other3"><title>What are the implications for public health practice?</title><p>The use of spatial Bayesian models can yield improved inference from the analysis of highly suppressed data such as those available on CDC WONDER.</p></sec></boxed-text><sec sec-type="intro"><title>Introduction</title><p>CDC WONDER (Wide-ranging ONline Data for Epidemiologic Research) is a system developed by the Centers for Disease Control and Prevention (CDC) to promote information-driven decision making by public health practitioners and researchers and provide access to detailed public health information to the general public (<xref rid="R1" ref-type="bibr">1</xref>). Although CDC WONDER contains a wealth of data, it has limitations. Per CDC policy (<xref rid="R2" ref-type="bibr">2</xref>), any counts fewer than 10 should be suppressed for confidentiality reasons, resulting in left-censored data. Because of high rates of suppression, many chronic disease researchers opt to focus their inference in a few highly populated regions (<xref rid="R3" ref-type="bibr">3</xref>) or state- or national-level trends (<xref rid="R4" ref-type="bibr">4</xref>), despite known geographic disparities in many chronic disease outcomes (<xref rid="R5" ref-type="bibr">5</xref>,<xref rid="R6" ref-type="bibr">6</xref>). This suppression may also discourage research on disparities between subsets of the population (eg, race or sex disparities) to avoid reducing already small counts below suppression thresholds. In short, suppression of small counts exacerbates many issues commonly encountered in the field of small area estimation, where the term &#x0201c;small area&#x0201d; refers to a geographic scale (eg, county, census tract) at which the observed data alone do not provide reliable inference. Thus, when CDC WONDER data are used to conduct surveillance, the ability to estimate rates for rural areas and minority populations &#x02014; where the chronic disease burden is high (<xref rid="R7" ref-type="bibr">7</xref>) &#x02014; is significantly hindered by data suppression.</p><p>To address CDC WONDER&#x02019;s data suppression issue, Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>) proposed an algorithm for estimating age-standardized rates in which suppressed age-specific counts are replaced with estimates based on the county&#x02019;s age-specific population size and the state-wide average rate for that age-group. For example, suppose <italic>y<sub>ik</sub></italic> denotes the number of deaths from age-bracket <italic>k</italic> in county <italic>i</italic> of a population of size <italic>n<sub>ik</sub></italic> and our inferential interest lies in &#x003bb;<italic><sub>ik</sub></italic>, the corresponding mortality rate. Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>) proposed replacing the suppressed <italic>y<sub>ik</sub></italic> &#x0003c;10 with <mml:math id="M1"><mml:msubsup><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>*</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x000d7;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math>, where <italic>s<sub>i</sub></italic> denotes the state that county <italic>i</italic> belongs to and <mml:math id="M2"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> denotes the state-wide average rate for age-bracket <italic>k</italic> in state <italic>s<sub>i</sub></italic> such that</p><p><mml:math id="M3"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mrow><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>:</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x000a0;</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">&#x000a0;</mml:mi><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:mrow><mml:munder><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>:</mml:mo><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi mathvariant="normal">&#x000a0;</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">&#x000a0;</mml:mi><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:math> (Equation 1)</p><p>Because state-level totals are often 10 or greater, we will assume from this point forward that <mml:math id="M4"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> is known and publicly available; when this is not the case, rates could be smoothed toward an alternative value (eg, national estimates).</p><p>Although this approach may yield reasonable estimates, it has drawbacks. First and foremost, estimating the uncertainty in age-standardized rate estimates is not an exact science when the data are known (<xref rid="R9" ref-type="bibr">9</xref>,<xref rid="R10" ref-type="bibr">10</xref>), much less when the data are highly suppressed. Furthermore, the algorithm is not designed to account for heterogeneity in demographic information such as the racial/ethnic make-up and socioeconomic status of the counties&#x02019; populations. As a result, inference based on these substituted data may be both biased (ie, smoothing toward the wrong values) and too precise (ignoring the uncertainty due to data suppression).</p><p>When the goal is to assess geographic disparities in age-standardized rates between regions, overcoming the privacy protections to obtain trustworthy estimates of the age-specific rates and their levels of uncertainty is only half the battle. For instance, Fay (<xref rid="R11" ref-type="bibr">11</xref>) followed the work of Fay and Feuer (<xref rid="R9" ref-type="bibr">9</xref>) to construct interval estimates for ratios based on <italic>F</italic> distributions. Tiwari et al (<xref rid="R10" ref-type="bibr">10</xref>) modified this work to yield more efficient interval estimation for rates and ratios of rates from nonnested regions, work that was later extended by Tiwari et al (<xref rid="R12" ref-type="bibr">12</xref>) for when one subregion is nested within a larger region (eg, a county nested within a state); Zhu et al (<xref rid="R13" ref-type="bibr">13</xref>) extended these approaches to more accurately account for spatial autocorrelation. When the age-standardized rates must be estimated from suppressed data, further modifications must be made or these approaches will fail to adequately account for all sources of uncertainty, yielding interval estimates that may be too narrow (<xref rid="R14" ref-type="bibr">14</xref>,<xref rid="R15" ref-type="bibr">15</xref>).</p><p>Rather than develop the statistical theory to accurately account for substitution-based approaches to overcome CDC WONDER&#x02019;s privacy restrictions in variance calculations, we consider the use of Bayesian statistical models, which rely on data augmentation to make inference on the suppressed counts. As described by Fridley and Dixon (<xref rid="R14" ref-type="bibr">14</xref>), data augmentation approaches estimate the suppressed counts via multiple imputation (<xref rid="R16" ref-type="bibr">16</xref>) while simultaneously making inference on the parameters of interest &#x02014; for example, &#x003bb;<italic><sub>ik</sub></italic> and the effects of potential risk factors. As noted by Zhu et al (<xref rid="R13" ref-type="bibr">13</xref>), Bayesian methods for modeling spatial data (<xref rid="R17" ref-type="bibr">17</xref>) can yield improved rate estimates when data are limited while simultaneously providing a mechanism for estimating uncertainty in rate estimates &#x02014; uncertainty that can be seamlessly propagated into estimates such as age-standardized rates and rate ratios. That said, a key drawback of Bayesian methods is their tendency to rely on computationally burdensome Markov chain Monte Carlo (MCMC) methods.</p><p>The objective of this analysis was to illustrate 2 Bayesian approaches for estimating county-level mortality rates, by using heart disease mortality data from 1980 obtained from CDC WONDER (<xref rid="R18" ref-type="bibr">18</xref>), and to compare these results with those generated by the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>). In particular, we used a simple, nonspatial Bayesian model, which produces estimates similar to those from Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>), along with a more complex Bayesian model that accounts for spatial and between-age sources of dependence.</p></sec><sec sec-type="methods"><title>Methods</title><p>The study population for this analysis included all residents of the contiguous United States aged 35 or older during 1980. These data have multiple advantages. Because these data were collected before CDC&#x02019;s suppression guidelines (<xref rid="R2" ref-type="bibr">2</xref>) went into effect, the public-use data are complete and free of suppression. Furthermore, because county definitions changed in several ways during the 1980s, the choice of data from 1980 allowed use of readily available shapefiles from the US Census Bureau for the <italic>I</italic> = 3,109 counties (or county equivalents) in the contiguous United States. To replicate the analysis of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>), the data were separated into <italic>K</italic> = 6 groups: those aged 35 to 44, 45 to 54, 55 to 64, 65 to 74, 75 to 84, and 85 or older. Annual counts of heart disease&#x02013;related deaths per county per age-group were obtained via CDC WONDER (<xref rid="R18" ref-type="bibr">18</xref>) and were defined as those for which the underlying cause of death was &#x0201c;diseases of the heart&#x0201d; according to the <italic>International Classification of Diseases, Ninth Revision</italic> (codes 390&#x02013;398, 402, 404&#x02013;429). Of the more than 18,000 counts in this data set, nearly half were fewer than 10.</p><sec><title>Statistical model</title><p>Recall that <italic>y<sub>ik</sub></italic> and <italic>n<sub>ik</sub></italic> denote the number of deaths and the population size in age group <italic>k</italic> in county <italic>i</italic>. To model these data, we considered 2 approaches: a simple Poisson-gamma model and a multivariate spatial Bayesian model. Although the former illustrates how a Bayesian model with weakly informative priors can produce estimates similar to those obtained directly from the raw data &#x02014; but with accurate uncertainty measures &#x02014; the latter illustrates how Bayesian models can incorporate complex dependence structures to produce more reliable estimates. A formal definition of what constitutes a &#x0201c;reliable&#x0201d; rate and the implications of this definition are provided in the Web Appendix (<ext-link ext-link-type="uri" xlink:href="https://sites.google.com/site/harryq/wonder">https://sites.google.com/site/harryq/wonder</ext-link>). Because of the complexity of Bayesian models, the Web Appendix also provides technical details on the methods described in this article and includes R (<xref rid="R19" ref-type="bibr">19</xref>) and WinBUGS (<xref rid="R20" ref-type="bibr">20</xref>) code.</p><sec><title>Poisson-gamma model</title><p>Following the advice of Brillinger (<xref rid="R21" ref-type="bibr">21</xref>), we assumed</p><p><mml:math id="M5"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>~</mml:mo><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math> (Equation 2)</p><p>for <italic>i</italic> = 1, . . ., <italic>I</italic> and <italic>k</italic> = 1, &#x02026;, <italic>K</italic>. Because we wished to fit Equation 2 using a Bayesian framework, we had to specify a prior distribution for each &#x003bb;<italic><sub>ik</sub></italic>. A convenient choice was to let</p><p><mml:math id="M6"><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>~</mml:mo><mml:mi>G</mml:mi><mml:mi>a</mml:mi><mml:mi>m</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math> (Equation 3)</p><p>As described in the Web Appendix, <mml:math id="M7"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> can be interpreted as the prior number of events and <mml:math id="M8"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> as the prior population size, thereby providing a mechanism for comparing the informativeness of the prior to the amount of information contained in the data. For example, a prior with <mml:math id="M9"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:math> 1,000 would contain the same amount of information as the data when <italic>n<sub>ik</sub></italic> = 1,000, and the posterior mean would be equal to the average of <mml:math id="M10"><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x000a0;</mml:mi></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:math> (the estimate from the prior) and <mml:math id="M11"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:math> (the estimate from the data). Here, we can take an empirical Bayesian approach by letting <mml:math id="M12"><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> from Equation 1 and defining the informativeness of the prior to be such that <mml:math id="M13"><mml:mrow><mml:msub><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo>=</mml:mo></mml:math> 6 for all states under the restriction that the <mml:math id="M14"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:math> parameters respect the age distribution in the United States. To better accommodate low rates among the younger age groups, which produce a preponderance of zero counts, we modified the prior in Equation 3 based on the suggestion of Kerman (<xref rid="R22" ref-type="bibr">22</xref>) by letting</p><p><mml:math id="M15"><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>~</mml:mo><mml:mi>G</mml:mi><mml:mi>a</mml:mi><mml:mi>m</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:mrow><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math> (Equation 4)</p><p>This prior specification can be considered relatively noninformative because 96.4% of US counties had more than <mml:math id="M16"><mml:mrow><mml:msub><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>s</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>/</mml:mo><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:mrow></mml:mrow></mml:mfenced></mml:mrow></mml:mrow><mml:mo>=</mml:mo></mml:math> 8 heart disease&#x02013;related deaths in 1980. A more complete discussion of this model is provided in the Web Appendix.</p></sec><sec><title>Multivariate conditional autoregressive model</title><p>Although the prior specification in Equation 4 is a convenient choice, it does not take full advantage of the possibilities of Bayesian modeling. In particular, Equation 4 does not account for spatial relationships or the relationships between different age groups. To allow for such structures to be included in the model, we considered Poisson regression models, where</p><p><mml:math id="M17"><mml:mrow><mml:mrow><mml:mi mathvariant="normal">log</mml:mi></mml:mrow><mml:mo>&#x02061;</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold">&#x003b2;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> (Equation 5)</p><p>Here, <bold><italic>x</italic></bold><italic><sub>ik</sub></italic> denotes a vector of county-specific covariates with corresponding age-specific regression coefficients, <bold>&#x003b2;</bold><italic><sub>k</sub></italic>; for example, including state-level effects could help account for important health policy differences across state lines. For this analysis, we simply assumed <mml:math id="M18"><mml:msubsup><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="bold">&#x003b2;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x000a0;</mml:mi><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003b2;</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math>; that is, a model with age-specific intercept parameters. To account for spatial and between-age sources of dependence, we first followed the approach of Besag et al (<xref rid="R17" ref-type="bibr">17</xref>) and defined <mml:math id="M19"><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003b8;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>z</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x003c6;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math>, where <italic>z<sub>ik</sub></italic> accounts for spatial structure within each age-group and <mml:math id="M20"><mml:msub><mml:mrow><mml:mi>&#x003c6;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math> denotes an exchangeable (ie, nonspatial) random effect. More specifically, the conditional autoregressive (CAR) model of Besag et al (<xref rid="R17" ref-type="bibr">17</xref>) imposes spatial structure by shrinking each <italic>z<sub>ik</sub></italic> toward the values in neighboring counties (ie, counties that share a border), where the strength of this shrinkage is controlled by the number of neighboring counties.</p><p>Although the CAR model is a powerful tool for analyzing spatial data, it does not account for possible correlation between the multiple age groups. To account for this, we instead considered a multivariate extension of the CAR model: the multivariate CAR (MCAR) model of Gelfand and Vounatsou (<xref rid="R23" ref-type="bibr">23</xref>). As with the CAR model, the MCAR shrinks estimates toward their neighboring values; unlike the CAR model, however, the MCAR explicitly models the between-group correlation in the data and leverages these correlations to produce more precise age-specific rate estimates. MCAR models were used recently to model spatially referenced survival times in cancer data (<xref rid="R24" ref-type="bibr">24</xref>), temporal trends in county-level asthma hospitalization rates (<xref rid="R25" ref-type="bibr">25</xref>), temporal trends in heart disease mortality by race and sex (<xref rid="R26" ref-type="bibr">26</xref>), and temporal trends in age-specific stroke mortality (<xref rid="R27" ref-type="bibr">27</xref>), among many other applications. Full details, including a discussion of the prior distributions used, are provided in the Web Appendix.</p></sec><sec><title>Bayesian inference</title><p>Fitting the models in Equation 4 or Equation 5 while accounting for the suppression of counts fewer than 10 requires the use of MCMC algorithms. Because of the reliance on MCMC, inference from these Bayesian models is based on samples generated from the posterior distribution &#x02014; for example, <mml:math id="M21"><mml:msubsup><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:math> for <italic>l</italic> = 1, &#x02026;, <italic>L</italic>, where <italic>L</italic> denotes the number of samples. These samples can then be used to compute quantities such as the age-standardized mortality rate:</p><p><mml:math id="M22"><mml:msubsup><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x02219;</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mo stretchy="false">&#x02211;</mml:mo><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:msub><mml:mrow><mml:mi>&#x003c0;</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mrow><mml:mi mathvariant="normal">&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mrow></mml:math></p><p>where &#x003c0;<italic><sub>k</sub></italic> denotes a prespecified standard age distribution (eg, based on the 2010 US standard population). To summarize the posterior distribution, it is common to use the posterior median and the 95% credible interval (constructed from the 2.5 and 97.5 percentiles of the posterior samples and analogous to classical 95% confidence intervals).</p></sec></sec><sec><title>Comparison of approaches</title><p>To compare the various estimation approaches, we first considered simple correlations between the estimates and the rates obtained from the complete data (as considered by Tiwari et al [<xref rid="R8" ref-type="bibr">8</xref>]) and correlations between the age-standardized rates and the age-specific rates. The goal of these comparisons was not to demonstrate whether one approach is superior to another but rather to demonstrate the degree to which the approaches are similar to one another. In addition, we also compared the 2 Bayesian approaches by using the deviance information criterion (DIC) (<xref rid="R28" ref-type="bibr">28</xref>), which uses the posterior samples to produce a measure that is a compromise between model fit (denoted by <mml:math id="M23"><mml:mover accent="true"><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:math>) and model complexity, <italic>p<sub>D</sub></italic>. In particular, <italic>p<sub>D</sub></italic> is often interpreted as the effective number of parameters in the model. Additional details on DIC, including a discussion of its use with censored data, are provided in the Web Appendix.</p></sec><sec><title>Creation of maps</title><p>Maps were created by using the R statistical software (The R Foundation). Code is available in step 6 of the walkthrough in the Web Appendix (<ext-link ext-link-type="uri" xlink:href="https://sites.google.com/site/harryq/wonder">https://sites.google.com/site/harryq/wonder</ext-link>).</p></sec></sec><sec sec-type="results"><title>Results</title><p>The maps of the age-standardized rates generated from the raw data (<xref ref-type="fig" rid="F1">Figure 1A</xref>) and the maps generated by the Poisson-gamma model (<xref ref-type="fig" rid="F1">Figure 1C</xref>) have strong similarities, while artifacts of substituting state-wide averages for suppressed counts based on the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>) lead to elevated estimates in many rural counties in the upper Midwest (<xref ref-type="fig" rid="F1">Figure 1B</xref>). In contrast, the map of the estimates from the MCAR model (<xref ref-type="fig" rid="F1">Figure 1D</xref>) preserves the overall trends in the data while producing significantly smoother rate estimates.</p><fig id="F1" fig-type="figure" orientation="portrait" position="float"><label>Figure 1</label><caption><p>Estimates of age-standardized heart disease mortality rates from 1980. A, Crude age-standardized rates based solely on the data. B, Estimates obtained by using the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>). C, Estimated posterior medians from the Poisson-gamma model. D, Estimated posterior medians from the multivariate conditional autoregressive model (MCAR). Data source: Centers for Disease Control and Prevention (<xref rid="R18" ref-type="bibr">18</xref>).</p></caption><long-desc>This figure displays maps of the age-standardized rates from the raw data, the approach of Tiwari et al (8), the Poisson-gamma model of equation 4, and the MCAR model described in relation to equation 5. The maps of the estimates from the raw data and the Poisson-gamma model look similar and contain many extreme rates, ie, rates at the high end or low end of the spectrum. Because of the smoothing used in the approach of Tiwari et al (8), the map of its rates tends to show similar rates in each of a state&#x02019;s more rural counties. Finally, the map of the estimates from the MCAR model exhibit the same general trend observed in the other 3 maps, but with more gradual transitions between regions with high rates and regions with low rates.</long-desc><graphic xlink:href="PCD-16-E76s01"/></fig><p>The correlation results (<xref rid="T1" ref-type="table">Table 1</xref>) largely support this assessment. The Poisson-gamma approach produced age-standardized rate estimates that were the most highly correlated with the true rates, although the estimates obtained by using the substitution approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>) had nearly an identical correlation. These 2 approaches differed in age-specific rate estimates. In particular, although the Poisson-gamma approach appeared to struggle for adults aged 35 to 44 &#x02014; producing estimates that were less correlated with the truth &#x02014; it outperformed the substitution approach for all groups aged 55 or older. <xref ref-type="fig" rid="F2">Figure 2</xref>, which displays the age-specific rate estimates for adults aged 35 to 44 and adults 85 or older, explains how this occurred. Here, although the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>) gave every suppressed county in each state the same rate (by design), the Poisson-gamma model tended to overestimate rate estimates for those aged 35 to 44. According to Kerman (<xref rid="R22" ref-type="bibr">22</xref>), this overestimation of rates when counts are very small was to be expected. Furthermore, unlike the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>), the Poisson-gamma model produced full posterior distributions for each age-specific rate estimate, thereby allowing quantification of the uncertainty in these estimates. (Figure B.3 in the Web Appendix illustrates how only 4.5% of estimates for those aged 35 to 44 and 42.8% of all age-specific rate estimates from the Poisson-gamma model were deemed reliable.) When estimating rates for those 85 or older, the Poisson-gamma model permitted heterogeneity within states (<xref ref-type="fig" rid="F2">Figure 2E</xref>); the inability to permit such heterogeneity is a key weakness of the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>). Further evaluation of the low age-specific correlations is provided in the Web Appendix (Figures B.1 and B.2).</p><table-wrap id="T1" orientation="portrait" position="float"><label>Table 1</label><caption><title>Comparison of the Correlation Results of 3 Estimation Approaches, Analysis of County-Level Mortality Rates Using Highly Censored Data From CDC WONDER<xref rid="T1FN1" ref-type="table-fn">a</xref>
</title></caption><table frame="hsides" rules="groups"><col width="112" span="1"/><col width="41" span="1"/><col width="41" span="1"/><col width="41" span="1"/><col width="41" span="1"/><col width="41" span="1"/><col width="41" span="1"/><col width="82" span="1"/><thead><tr><th rowspan="2" valign="bottom" align="left" scope="col" colspan="1">Approach</th><th valign="bottom" colspan="6" align="center" scope="colgroup" rowspan="1">Age Group<hr/></th><th rowspan="2" valign="bottom" align="center" scope="col" colspan="1">Age-Standardized</th></tr><tr><th valign="bottom" colspan="1" align="center" scope="colgroup" rowspan="1">35&#x02013;44</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">45&#x02013;54</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">55&#x02013;64</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">65&#x02013;74</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">75&#x02013;84</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">&#x02265;85</th></tr></thead><tbody><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>)</td><td valign="top" align="center" rowspan="1" colspan="1">0.15</td><td valign="top" align="center" rowspan="1" colspan="1">0.73</td><td valign="top" align="center" rowspan="1" colspan="1">0.16</td><td valign="top" align="center" rowspan="1" colspan="1">0.07</td><td valign="top" align="center" rowspan="1" colspan="1">&#x02212;0.01</td><td valign="top" align="center" rowspan="1" colspan="1">0.08</td><td valign="top" align="center" rowspan="1" colspan="1">0.73</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">Poisson-gamma</td><td valign="top" align="center" rowspan="1" colspan="1">0.09</td><td valign="top" align="center" rowspan="1" colspan="1">0.74</td><td valign="top" align="center" rowspan="1" colspan="1">0.23</td><td valign="top" align="center" rowspan="1" colspan="1">0.25</td><td valign="top" align="center" rowspan="1" colspan="1">0.24</td><td valign="top" align="center" rowspan="1" colspan="1">0.27</td><td valign="top" align="center" rowspan="1" colspan="1">0.74</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">Multivariate conditional autoregressive model</td><td valign="top" align="center" rowspan="1" colspan="1">0.15</td><td valign="top" align="center" rowspan="1" colspan="1">0.65</td><td valign="top" align="center" rowspan="1" colspan="1">0.18</td><td valign="top" align="center" rowspan="1" colspan="1">0.15</td><td valign="top" align="center" rowspan="1" colspan="1">0.05</td><td valign="top" align="center" rowspan="1" colspan="1">0.14</td><td valign="top" align="center" rowspan="1" colspan="1">0.65</td></tr></tbody></table><table-wrap-foot><fn id="T1FN1"><label>a</label><p> Age-standardized correlation results were based on all 3,109 US counties, whereas age-specific correlation results were based only on the suppressed counties (counties with counts &#x0003c;10). Data source: Centers for Disease Control and Prevention (<xref rid="R18" ref-type="bibr">18</xref>).</p></fn></table-wrap-foot></table-wrap><fig id="F2" fig-type="figure" orientation="portrait" position="float"><label>Figure 2</label><caption><p>Comparison of 3 approaches for estimating age-standardized heart disease mortality rates for 2 age groups (adults aged 35 to 44 and adults aged &#x02265;85) from 1980. A, Estimates for adults aged 35 to 44 obtained by using the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>). B, Estimated posterior medians for adults aged 35 to 44 from the Poisson-gamma model. C, Estimated posterior medians for adults aged 35 to 44 from the multivariate conditional autoregressive model (MCAR). D, Estimates for adults aged &#x02265;85 obtained by using the approach of Tiwari et al (<xref rid="R8" ref-type="bibr">8</xref>). E, Estimated posterior medians for adults aged &#x02265;85 from the Poisson-gamma model. F, Estimated posterior medians for adults aged &#x02265;85 from the multivariate conditional autoregressive model (MCAR). Data source: Centers for Disease Control and Prevention (<xref rid="R18" ref-type="bibr">18</xref>).</p></caption><long-desc>Figure 2 compares maps of the rate estimates from the 3 approaches for the age groups 35 to 44 and 85 or older. Because of the high rate of suppression, the substitution approach of Tiwari et al (8) for those aged 35 to 44 assigns nearly every county in each state the same value. The Poisson-gamma model also struggles with this age-group, producing estimates that are quite high for many rural parts of the country. In contrast to these approaches, the MCAR model produces estimates that are much more spatially smooth (ie, gradual changes from high to low rates) for this age-group. Moving on to the estimates from the 85 and older age-group, we find a bit more similarity between the substitution approach and the Poisson-gamma model &#x02014; this is not too surprising, given the higher death counts and thus lower rate of suppression. Again, the MCAR model produces estimates for the 85 or older age-group that exhibit much more spatial smoothing. Finally, from an epidemiologic perspective, the MCAR estimates from the 35 to 44 age-group exhibit high rates in Appalachia and the Deep South, while the MCAR estimates from the 85 or older age-group exhibit high rates in New England and the Rust Belt region. These broader trends are more apparent in the maps of the MCAR estimates than in the maps from the other 2 approaches because of the lack of smoothing in the estimates they produce.</long-desc><graphic xlink:href="PCD-16-E76s02"/></fig><p>Looking at the correlation results (<xref rid="T1" ref-type="table">Table 1</xref>) and the maps in <xref ref-type="fig" rid="F1">Figure 1</xref>, one may wonder why we bother fitting the complex MCAR model. The DIC results (<xref rid="T2" ref-type="table">Table 2</xref>) explain why. Here, the MCAR model offered a model fit that is similar to the fit of the Poisson-gamma model (as measured by <mml:math id="M24"><mml:mover accent="true"><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:math>) while doing so with far fewer &#x0201c;effective model parameters&#x0201d; (<italic>p<sub>D</sub></italic>). To understand how this can be, recall that each &#x003bb;<italic><sub>ik</sub></italic> in Equation 4 had its own independent prior distribution; that is, the Poisson-gamma model did not shrink the &#x003bb;<italic><sub>ik</sub></italic> toward each other, producing estimates of the (<italic>p<sub>D</sub></italic>) for older age groups that approach the full <italic>I</italic> = 3,109 number of parameters. In contrast, the MCAR model explicitly imposed dependence between its model parameters, resulting in estimates of the (<italic>p<sub>D</sub></italic>) that were nearly 80% less than those from the Poisson-gamma model (eg, 10,785 vs 2,307). In addition, the estimates produced by the MCAR model were more precise (Web Appendix), and the smooth geographic patterns in <xref ref-type="fig" rid="F1">Figure 1D</xref>, <xref ref-type="fig" rid="F2">Figure 2C</xref>, and <xref ref-type="fig" rid="F2">Figure 2F</xref> may provide clearer insight into the underlying trends in heart disease mortality.</p><table-wrap id="T2" orientation="portrait" position="float"><label>Table 2</label><caption><title>Comparison of the Deviance Information Criterion<xref rid="T2FN1" ref-type="table-fn">a</xref> Results of 3 Estimation Approaches, Analysis of County-Level Mortality Rates Using Highly Censored Data From CDC WONDER<xref rid="T2FN2" ref-type="table-fn">b</xref>
</title></caption><table frame="hsides" rules="groups"><col width="66" span="1"/><col width="51" span="1"/><col width="51" span="1"/><col width="56" span="1"/><col width="56" span="1"/><col width="56" span="1"/><col width="56" span="1"/><col width="56" span="1"/><thead><tr><th rowspan="2" valign="bottom" align="left" scope="col" colspan="1">Approach</th><th valign="bottom" colspan="6" align="center" scope="colgroup" rowspan="1">Age Group<hr/>
</th><th rowspan="2" valign="bottom" align="center" scope="col" colspan="1">Overall</th></tr><tr><th valign="bottom" colspan="1" align="center" scope="colgroup" rowspan="1">35&#x02013;44</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">45&#x02013;54</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">55&#x02013;64</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">65&#x02013;74</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">75&#x02013;84</th><th valign="bottom" align="center" scope="col" rowspan="1" colspan="1">&#x02265;85</th></tr></thead><tbody><tr><td colspan="8" valign="top" align="left" scope="col" rowspan="1">
<bold>Poisson-gamma</bold>
</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">DIC</td><td valign="top" align="center" rowspan="1" colspan="1">2,204</td><td valign="top" align="center" rowspan="1" colspan="1">6,108</td><td valign="top" align="center" rowspan="1" colspan="1">12,393</td><td valign="top" align="center" rowspan="1" colspan="1">17,866</td><td valign="top" align="center" rowspan="1" colspan="1">19,005</td><td valign="top" align="center" rowspan="1" colspan="1">16,956</td><td valign="top" align="center" rowspan="1" colspan="1">74,533</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1"><mml:math id="M25"><mml:mover accent="true"><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:math></td><td valign="top" align="center" rowspan="1" colspan="1">1,663</td><td valign="top" align="center" rowspan="1" colspan="1">5,006</td><td valign="top" align="center" rowspan="1" colspan="1">10,509</td><td valign="top" align="center" rowspan="1" colspan="1">15,447</td><td valign="top" align="center" rowspan="1" colspan="1">16,506</td><td valign="top" align="center" rowspan="1" colspan="1">14,616</td><td valign="top" align="center" rowspan="1" colspan="1">63,748</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">
<italic>p<sub>D</sub></italic>
</td><td valign="top" align="center" rowspan="1" colspan="1">542</td><td valign="top" align="center" rowspan="1" colspan="1">1,102</td><td valign="top" align="center" rowspan="1" colspan="1">1,884</td><td valign="top" align="center" rowspan="1" colspan="1">2,419</td><td valign="top" align="center" rowspan="1" colspan="1">2,499</td><td valign="top" align="center" rowspan="1" colspan="1">2,339</td><td valign="top" align="center" rowspan="1" colspan="1">10,785</td></tr><tr><td colspan="8" valign="top" align="left" scope="col" rowspan="1">
<bold>Multivariate conditional autoregressive model</bold>
</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">DIC</td><td valign="top" align="center" rowspan="1" colspan="1">1,558</td><td valign="top" align="center" rowspan="1" colspan="1">5,242</td><td valign="top" align="center" rowspan="1" colspan="1">11,245</td><td valign="top" align="center" rowspan="1" colspan="1">16,201</td><td valign="top" align="center" rowspan="1" colspan="1">17,417</td><td valign="top" align="center" rowspan="1" colspan="1">15,904</td><td valign="top" align="center" rowspan="1" colspan="1">67,568</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1"><mml:math id="M26"><mml:mover accent="true"><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:math></td><td valign="top" align="center" rowspan="1" colspan="1">1,478</td><td valign="top" align="center" rowspan="1" colspan="1">5,030</td><td valign="top" align="center" rowspan="1" colspan="1">10,842</td><td valign="top" align="center" rowspan="1" colspan="1">15,743</td><td valign="top" align="center" rowspan="1" colspan="1">16,887</td><td valign="top" align="center" rowspan="1" colspan="1">15,281</td><td valign="top" align="center" rowspan="1" colspan="1">65,260</td></tr><tr><td valign="top" align="left" scope="row" rowspan="1" colspan="1">
<italic>p<sub>D</sub></italic>
</td><td valign="top" align="center" rowspan="1" colspan="1">80</td><td valign="top" align="center" rowspan="1" colspan="1">213</td><td valign="top" align="center" rowspan="1" colspan="1">403</td><td valign="top" align="center" rowspan="1" colspan="1">458</td><td valign="top" align="center" rowspan="1" colspan="1">530</td><td valign="top" align="center" rowspan="1" colspan="1">624</td><td valign="top" align="center" rowspan="1" colspan="1">2,307</td></tr></tbody></table><table-wrap-foot><fn id="T2FN1"><label>a</label><p> Spiegelhalter et al (<xref rid="R28" ref-type="bibr">28</xref>).</p></fn><fn id="T2FN2"><label>b</label><p> Where <mml:math id="M27"><mml:mover accent="true"><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover></mml:math> is a measure of model fit (lower is better), <italic>p<sub>D</sub></italic> is a measure of model complexity (lower indicating fewer effective model parameters), and <mml:math id="M28"><mml:mi>D</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mo>-</mml:mo></mml:mover><mml:mi mathvariant="normal">&#x000a0;</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo>.</mml:mo></mml:math>. Data source: Centers for Disease Control and Prevention (<xref rid="R18" ref-type="bibr">18</xref>).</p></fn></table-wrap-foot></table-wrap></sec><sec sec-type="discussion"><title>Discussion</title><p>This analysis highlighted some of the benefits of using Bayesian methods to account for left-censored data like those encountered in CDC WONDER. Although the Poisson-gamma model is a relatively simple approach, models (such as the MCAR model) that explicitly account for multivariate spatial dependence structures can lead to better inference by leveraging other sources of information to produce more reliable estimates.</p><p>The strengths of the MCAR model described in this analysis extend beyond modeling censored data to the broader field of small area estimation. As alluded to in the discussion of Equation 5, many benefits are associated with using the MCAR model in conjunction with covariate information when modeling chronic disease outcomes. Combining covariate information with spatial structure can produce more reliable estimates of the rates themselves, which is beneficial for disease surveillance, while simultaneously conducting inference on the potential risk factors that are included as covariates. When the covariates in the analysis are themselves spatially structured, it can be unclear if the covariate is effecting change in the outcome or vice versa, or if an unmeasured spatial confounder is influencing both the covariate <italic>and</italic> the outcome. In these settings, including a spatial random effect can lead to a phenomenon referred to as &#x0201c;spatial confounding&#x0201d; (<xref rid="R29" ref-type="bibr">29</xref>) and increase the standard errors associated with these covariates. Although the notion of spatial confounding has historically been considered a drawback of spatial models (<xref rid="R29" ref-type="bibr">29</xref>), others have argued (<xref rid="R30" ref-type="bibr">30</xref>) that inference from such models can help protect against type 1 error (ie, incorrectly rejecting the null hypothesis).</p><p>Finally, although we analyzed age-specific heart disease mortality as an illustration, the MCAR model is also well suited for analyzing rarer event data via its ability to jointly model multiple outcomes. This analysis leveraged information from older age groups with higher death counts to produce more reliable estimates for those aged 35 to 44. Similarly, one could jointly model a chronic disease outcome for multiple race/ethnicities, exploiting the shared factors that may lead to increased rates for non-Hispanic white persons and racial/ethnic minorities alike. Alternatively, one could use MCAR models to simultaneously analyze multiple chronic disease outcomes with similar etiologies to improve the reliability of all estimates.</p><p>Although the suppression of data creates an obstacle to conducting chronic disease surveillance, Bayesian statistical methods such as those described in this analysis can overcome these challenges while also producing more reliable estimates with valid uncertainty measures. By illustrating the benefits of and providing code for their implementation, we hope to ease the burden of using Bayesian models and broaden their application to censored data sets available from sources like CDC WONDER, thereby improving the inference made from public-use data.</p></sec></body><back><ack><title>Acknowledgments</title><p>This work received no outside financial support.</p></ack><fn-group><fn><p>The opinions expressed by authors contributing to this journal do not necessarily reflect the opinions of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors' affiliated institutions.</p></fn><fn><p>
<italic>Suggested citation for this article:</italic> Quick H. Estimating County-Level Mortality Rates Using Highly Censored Data From CDC WONDER. Prev Chronic Dis 2019;16:180441. DOI: <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5888/pcd16.180441">https://doi.org/10.5888/pcd16.180441</ext-link>.</p></fn></fn-group><ref-list><title>References</title><ref id="R1"><label>1. </label><mixed-citation publication-type="web">
<collab>Centers for Disease Control and Prevention</collab>. CDC WONDER, <year>2017</year>
<ext-link ext-link-type="uri" xlink:href="http://wonder.cdc.gov">http://wonder.cdc.gov</ext-link>. <comment>Accessed March 17, 2017</comment>.</mixed-citation></ref><ref id="R2"><label>2. </label><mixed-citation publication-type="web">
<collab>Centers for Disease Control and Prevention</collab>. CDC/ATSDR policy on releasing and sharing data. Manual; guide CDC-02. <year>2003</year>
<ext-link ext-link-type="uri" xlink:href="http://www.cdc.gov/maso/Policy/ReleasingData.pdf">http://www.cdc.gov/maso/Policy/ReleasingData.pdf</ext-link>. <comment>Accessed June 30, 2015</comment>.</mixed-citation></ref><ref id="R3"><label>3. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Rust</surname>
<given-names>G</given-names>
</string-name>, <string-name>
<surname>Zhang</surname>
<given-names>S</given-names>
</string-name>, <string-name>
<surname>Malhotra</surname>
<given-names>K</given-names>
</string-name>, <string-name>
<surname>Reese</surname>
<given-names>L</given-names>
</string-name>, <string-name>
<surname>McRoy</surname>
<given-names>L</given-names>
</string-name>, <string-name>
<surname>Baltrus</surname>
<given-names>P</given-names>
</string-name>, <etal/>
<article-title>Paths to health equity: local area variation in progress toward eliminating breast cancer mortality disparities, 1990&#x02013;2009.</article-title>
<source>Cancer</source>
<year>2015</year>;<volume>121</volume>(<issue>16</issue>):<fpage>2765</fpage>&#x02013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1002/cncr.29405</pub-id>
<pub-id pub-id-type="pmid">25906833</pub-id></mixed-citation></ref><ref id="R4"><label>4. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Wilmot</surname>
<given-names>KA</given-names>
</string-name>, <string-name>
<surname>O&#x02019;Flaherty</surname>
<given-names>M</given-names>
</string-name>, <string-name>
<surname>Capewell</surname>
<given-names>S</given-names>
</string-name>, <string-name>
<surname>Ford</surname>
<given-names>ES</given-names>
</string-name>, <string-name>
<surname>Vaccarino</surname>
<given-names>V</given-names>
</string-name>. <article-title>Coronary heart disease mortality declines in the United States from 1979 through 2011: evidence for stagnation in young adults, especially women.</article-title>
<source>Circulation</source>
<year>2015</year>;<volume>132</volume>(<issue>11</issue>):<fpage>997</fpage>&#x02013;<lpage>1002</lpage>. <pub-id pub-id-type="doi">10.1161/CIRCULATIONAHA.115.015293</pub-id>
<pub-id pub-id-type="pmid">26302759</pub-id></mixed-citation></ref><ref id="R5"><label>5. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Casper</surname>
<given-names>M</given-names>
</string-name>, <string-name>
<surname>Kramer</surname>
<given-names>MR</given-names>
</string-name>, <string-name>
<surname>Quick</surname>
<given-names>H</given-names>
</string-name>, <string-name>
<surname>Schieb</surname>
<given-names>LJ</given-names>
</string-name>, <string-name>
<surname>Vaughan</surname>
<given-names>AS</given-names>
</string-name>, <string-name>
<surname>Greer</surname>
<given-names>S</given-names>
</string-name>. <article-title>Changes in the geographic patterns of heart disease mortality in the United States 1973 to 2010.</article-title>
<source>Circulation</source>
<year>2016</year>;<volume>133</volume>(<issue>12</issue>):<fpage>1171</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1161/CIRCULATIONAHA.115.018663</pub-id>
<pub-id pub-id-type="pmid">27002081</pub-id></mixed-citation></ref><ref id="R6"><label>6. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Vaughan</surname>
<given-names>AS</given-names>
</string-name>, <string-name>
<surname>Quick</surname>
<given-names>H</given-names>
</string-name>, <string-name>
<surname>Pathak</surname>
<given-names>EB</given-names>
</string-name>, <string-name>
<surname>Kramer</surname>
<given-names>MR</given-names>
</string-name>, <string-name>
<surname>Casper</surname>
<given-names>M</given-names>
</string-name>. <article-title>Disparities in temporal and geographic patterns of declining heart disease mortality by race and sex in the United States, 1973&#x02013;2010.</article-title>
<source>J Am Heart Assoc</source>
<year>2015</year>;<volume>4</volume>(<issue>12</issue>):<elocation-id>e002567</elocation-id>. <pub-id pub-id-type="doi">10.1161/JAHA.115.002567</pub-id>
<pub-id pub-id-type="pmid">26672077</pub-id></mixed-citation></ref><ref id="R7"><label>7. </label><mixed-citation publication-type="web">
<collab>National Center for Health Statistics</collab>. Health, United States, 2017: with special feature on mortality. <ext-link ext-link-type="uri" xlink:href="https://www.cdc.gov/nchs/data/hus/hus17.pdf">https://www.cdc.gov/nchs/data/hus/hus17.pdf</ext-link>. Accessed February 27, 2019.</mixed-citation></ref><ref id="R8"><label>8. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Tiwari</surname>
<given-names>C</given-names>
</string-name>, <string-name>
<surname>Beyer</surname>
<given-names>K</given-names>
</string-name>, <string-name>
<surname>Rushton</surname>
<given-names>G</given-names>
</string-name>. <article-title>The impact of data suppression on local mortality rates: the case of CDC WONDER.</article-title>
<source>Am J Public Health</source>
<year>2014</year>;<volume>104</volume>(<issue>8</issue>):<fpage>1386</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.2105/AJPH.2014.301900</pub-id>
<pub-id pub-id-type="pmid">24922161</pub-id></mixed-citation></ref><ref id="R9"><label>9. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Fay</surname>
<given-names>MP</given-names>
</string-name>, <string-name>
<surname>Feuer</surname>
<given-names>EJ</given-names>
</string-name>. <article-title>Confidence intervals for directly standardized rates: a method based on the gamma distribution.</article-title>
<source>Stat Med</source>
<year>1997</year>;<volume>16</volume>(<issue>7</issue>):<fpage>791</fpage>&#x02013;<lpage>801</lpage>. <pub-id pub-id-type="doi">10.1002/(SICI)1097-0258(19970415)16:7&#x0003c;791::AID-SIM500&#x0003e;3.0.CO;2-#</pub-id>
<pub-id pub-id-type="pmid">9131766</pub-id></mixed-citation></ref><ref id="R10"><label>10. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Tiwari</surname>
<given-names>RC</given-names>
</string-name>, <string-name>
<surname>Clegg</surname>
<given-names>LX</given-names>
</string-name>, <string-name>
<surname>Zou</surname>
<given-names>Z</given-names>
</string-name>. <article-title>Efficient interval estimation for age-adjusted cancer rates.</article-title>
<source>Stat Methods Med Res</source>
<year>2006</year>;<volume>15</volume>(<issue>6</issue>):<fpage>547</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1177/0962280206070621</pub-id>
<pub-id pub-id-type="pmid">17260923</pub-id></mixed-citation></ref><ref id="R11"><label>11. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Fay</surname>
<given-names>MP</given-names>
</string-name>. <article-title>Approximate confidence intervals for rate ratios from directly standardized rates with sparse data.</article-title>
<source>Commun Stat Theory Methods</source>
<year>1999</year>;<volume>28</volume>(<issue>9</issue>):<fpage>2141</fpage>&#x02013;<lpage>60</lpage>. <pub-id pub-id-type="doi">10.1080/03610929908832411</pub-id>
</mixed-citation></ref><ref id="R12"><label>12. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Tiwari</surname>
<given-names>RC</given-names>
</string-name>, <string-name>
<surname>Li</surname>
<given-names>Y</given-names>
</string-name>, <string-name>
<surname>Zou</surname>
<given-names>Z</given-names>
</string-name>. <article-title>Interval estimation for ratios of correlated age-adjusted rates.</article-title>
<source>J Data Sci</source>
<year>2010</year>;<volume>8</volume>:<fpage>471</fpage>&#x02013;<lpage>82</lpage>.
<pub-id pub-id-type="pmid">22347884</pub-id></mixed-citation></ref><ref id="R13"><label>13. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Zhu</surname>
<given-names>L</given-names>
</string-name>, <string-name>
<surname>Pickle</surname>
<given-names>LW</given-names>
</string-name>, <string-name>
<surname>Pearson</surname>
<given-names>JB</given-names>
<suffix>Jr</suffix>
</string-name>. <article-title>Confidence intervals for rate ratios between geographic units.</article-title>
<source>Int J Health Geogr</source>
<year>2016</year>;<volume>15</volume>(<issue>1</issue>):<fpage>44</fpage>. <pub-id pub-id-type="doi">10.1186/s12942-016-0073-5</pub-id>
<pub-id pub-id-type="pmid">27978838</pub-id></mixed-citation></ref><ref id="R14"><label>14. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Fridley</surname>
<given-names>BL</given-names>
</string-name>, <string-name>
<surname>Dixon</surname>
<given-names>P</given-names>
</string-name>. <article-title>Data augmentation for a Bayesian spatial model involving censored observations.</article-title>
<source>Environmetrics</source>
<year>2007</year>;<volume>18</volume>(<issue>2</issue>):<fpage>107</fpage>&#x02013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1002/env.806</pub-id>
</mixed-citation></ref><ref id="R15"><label>15. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Quick</surname>
<given-names>H</given-names>
</string-name>, <string-name>
<surname>Groth</surname>
<given-names>C</given-names>
</string-name>, <string-name>
<surname>Banerjee</surname>
<given-names>S</given-names>
</string-name>, <string-name>
<surname>Carlin</surname>
<given-names>BP</given-names>
</string-name>, <string-name>
<surname>Stenzel</surname>
<given-names>MR</given-names>
</string-name>, <string-name>
<surname>Stewart</surname>
<given-names>PA</given-names>
</string-name>, <etal/>
<article-title>Exploration of the use of Bayesian modeling of gradients for censored spatiotemporal data from the <italic>Deepwater Horizon</italic> oil spill.</article-title>
<source>Spat Stat</source>
<year>2014</year>;<volume>9</volume>:<fpage>166</fpage>&#x02013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1016/j.spasta.2014.03.002</pub-id>
<pub-id pub-id-type="pmid">25599019</pub-id></mixed-citation></ref><ref id="R16"><label>16. </label><mixed-citation publication-type="book">
<string-name>
<surname>Rubin</surname>
<given-names>DB</given-names>
</string-name>. Multiple imputation for nonresponse in surveys. New York (NY): John Wiley and Sons; <year>1987</year>.</mixed-citation></ref><ref id="R17"><label>17. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Besag</surname>
<given-names>J</given-names>
</string-name>, <string-name>
<surname>York</surname>
<given-names>J</given-names>
</string-name>, <string-name>
<surname>Molli&#x000e9;</surname>
<given-names>A</given-names>
</string-name>. <article-title>Bayesian image restoration, with two applications in spatial statistics.</article-title>
<source>Ann Inst Stat Math</source>
<year>1991</year>;<volume>43</volume>(<issue>1</issue>):<fpage>1</fpage>&#x02013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1007/BF00116466</pub-id>
</mixed-citation></ref><ref id="R18"><label>18. </label><mixed-citation publication-type="web">
<collab>Centers for Disease Control and Prevention, National Center for Health Statistics</collab>. Compressed mortality file 1979&#x02013;1998. CDC WONDER on-line database, compiled from compressed mortality file CMF 1968&#x02013;1988, series 20, no. 2A, 2000 and CMF 1989&#x02013;1998, series 20, no. 2E, <year>2003</year>
<ext-link ext-link-type="uri" xlink:href="https://wonder.cdc.gov/controller/saved/D16/D10F745">https://wonder.cdc.gov/controller/saved/D16/D10F745</ext-link>. <comment>Accessed March 3, 2017</comment>.</mixed-citation></ref><ref id="R19"><label>19. </label><mixed-citation publication-type="web">
<collab>R Core Team</collab>. R: A Language and Environment for Statistical Computing. Vienna (AT): R Foundation for Statistical Computing; <year>2015</year>
<underline>http://www.R-project.org/</underline>. Accessed February 27, 2019. </mixed-citation></ref><ref id="R20"><label>20. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Lunn</surname>
<given-names>DJ</given-names>
</string-name>, <string-name>
<surname>Thomas</surname>
<given-names>A</given-names>
</string-name>, <string-name>
<surname>Best</surname>
<given-names>N</given-names>
</string-name>, <string-name>
<surname>Spiegelhalter</surname>
<given-names>D</given-names>
</string-name>. <article-title>WinBUGS &#x02014; a Bayesian modelling framework: concepts, structure, and extensibility.</article-title>
<source>Stat Comput</source>
<year>2000</year>;<volume>10</volume>(<issue>4</issue>):<fpage>325</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1023/A:1008929526011</pub-id>
</mixed-citation></ref><ref id="R21"><label>21. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Brillinger</surname>
<given-names>DR</given-names>
</string-name>. <article-title>The natural variability of vital rates and associated statistics.</article-title>
<source>Biometrics</source>
<year>1986</year>;<volume>42</volume>(<issue>4</issue>):<fpage>693</fpage>&#x02013;<lpage>734</lpage>. <pub-id pub-id-type="doi">10.2307/2530689</pub-id>
<pub-id pub-id-type="pmid">3814721</pub-id></mixed-citation></ref><ref id="R22"><label>22. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Kerman</surname>
<given-names>J</given-names>
</string-name>. <article-title>Neutral noninformative and informative conjugate beta and gamma prior distributions.</article-title>
<source>Electron J Stat</source>
<year>2011</year>;<volume>5</volume>(<issue>0</issue>):<fpage>1450</fpage>&#x02013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1214/11-EJS648</pub-id>
</mixed-citation></ref><ref id="R23"><label>23. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Gelfand</surname>
<given-names>AE</given-names>
</string-name>, <string-name>
<surname>Vounatsou</surname>
<given-names>P</given-names>
</string-name>. <article-title>Proper multivariate conditional autoregressive models for spatial data analysis.</article-title>
<source>Biostatistics</source>
<year>2003</year>;<volume>4</volume>(<issue>1</issue>):<fpage>11</fpage>&#x02013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1093/biostatistics/4.1.11</pub-id>
<pub-id pub-id-type="pmid">12925327</pub-id></mixed-citation></ref><ref id="R24"><label>24. </label><mixed-citation publication-type="book">
<string-name>
<surname>Carlin</surname>
<given-names>BP</given-names>
</string-name>, <string-name>
<surname>Banerjee</surname>
<given-names>S</given-names>
</string-name>. Hierarchical multivariate CAR models for spatio-temporally correlated survival data (with discussion). In: JM Bernardo, M Bayarri, JO Berger, AP Dawid, D Heckerman, AFM Smith, and M West, editors. Bayesian statistics 7: Proceedings of the Seventh Valencia International Meeting. 45&#x02013;63. Oxford (UK): Oxford Science Publications; <year>2003</year>. P. 45&#x02013;63.</mixed-citation></ref><ref id="R25"><label>25. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Quick</surname>
<given-names>H</given-names>
</string-name>, <string-name>
<surname>Banerjee</surname>
<given-names>S</given-names>
</string-name>, <string-name>
<surname>Carlin</surname>
<given-names>BP</given-names>
</string-name>. <article-title>Modeling temporal gradients in regionally aggregated California asthma hospitalization data.</article-title>
<source>Ann Appl Stat</source>
<year>2013</year>;<volume>7</volume>(<issue>1</issue>):<fpage>154</fpage>&#x02013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1214/12-AOAS600</pub-id>
<pub-id pub-id-type="pmid">29606992</pub-id></mixed-citation></ref><ref id="R26"><label>26. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Quick</surname>
<given-names>H</given-names>
</string-name>, <string-name>
<surname>Waller</surname>
<given-names>LA</given-names>
</string-name>, <string-name>
<surname>Casper</surname>
<given-names>M</given-names>
</string-name>. <article-title>A multivariate space-time model for analysing county-level heart disease death rates by race and sex.</article-title>
<source>J R Stat Soc</source>
<year>2018</year>;<volume>67</volume>(<issue>1</issue>):<fpage>291</fpage>&#x02013;<lpage>304</lpage>. <pub-id pub-id-type="doi">10.1111/rssc.12215</pub-id>
</mixed-citation></ref><ref id="R27"><label>27. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Quick</surname>
<given-names>H</given-names>
</string-name>, <string-name>
<surname>Waller</surname>
<given-names>LA</given-names>
</string-name>, <string-name>
<surname>Casper</surname>
<given-names>M</given-names>
</string-name>. <article-title>Multivariate spatiotemporal modeling of age-specific stroke mortality.</article-title>
<source>Ann Appl Stat</source>
<year>2017</year>;<volume>11</volume>(<issue>4</issue>):<fpage>2165</fpage>&#x02013;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.1214/17-AOAS1068</pub-id>
</mixed-citation></ref><ref id="R28"><label>28. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Spiegelhalter</surname>
<given-names>DJ</given-names>
</string-name>, <string-name>
<surname>Best</surname>
<given-names>N</given-names>
</string-name>, <string-name>
<surname>Carlin</surname>
<given-names>BP</given-names>
</string-name>, <string-name>
<surname>van der Linde</surname>
<given-names>A</given-names>
</string-name>. <article-title>Bayesian measures of model complexity and fit (with discussion).</article-title>
<source>J R Stat Soc B</source>
<year>2002</year>;<volume>64</volume>(<issue>4</issue>):<fpage>583</fpage>&#x02013;<lpage>639</lpage>. <pub-id pub-id-type="doi">10.1111/1467-9868.00353</pub-id>
</mixed-citation></ref><ref id="R29"><label>29. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Reich</surname>
<given-names>BJ</given-names>
</string-name>, <string-name>
<surname>Hodges</surname>
<given-names>JS</given-names>
</string-name>, <string-name>
<surname>Zadnik</surname>
<given-names>V</given-names>
</string-name>. <article-title>Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models.</article-title>
<source>Biometrics</source>
<year>2006</year>;<volume>62</volume>(<issue>4</issue>):<fpage>1197</fpage>&#x02013;<lpage>206</lpage>. <pub-id pub-id-type="doi">10.1111/j.1541-0420.2006.00617.x</pub-id>
<pub-id pub-id-type="pmid">17156295</pub-id></mixed-citation></ref><ref id="R30"><label>30. </label><mixed-citation publication-type="journal">
<string-name>
<surname>Hanks</surname>
<given-names>EM</given-names>
</string-name>, <string-name>
<surname>Schliep</surname>
<given-names>EM</given-names>
</string-name>, <string-name>
<surname>Hooten</surname>
<given-names>MB</given-names>
</string-name>, <string-name>
<surname>Hoeting</surname>
<given-names>JA</given-names>
</string-name>. <article-title>Restricted spatial regression in practice: geostatistical models, confounding, and robustness under model misspecification.</article-title>
<source>Environmetrics</source>
<year>2015</year>;<volume>26</volume>(<issue>4</issue>):<fpage>243</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1002/env.2331</pub-id>
</mixed-citation></ref></ref-list></back></article>