<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.3 20210610//EN" "JATS-archivearticle1-3-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="1.3" xml:lang="en" article-type="research-article"><?properties manuscript?><processing-meta base-tagset="archiving" mathml-version="3.0" table-model="xhtml" tagset-family="jats"><restricted-by>pmc</restricted-by></processing-meta><front><journal-meta><journal-id journal-id-type="nlm-journal-id">101612400</journal-id><journal-id journal-id-type="pubmed-jr-id">41598</journal-id><journal-id journal-id-type="nlm-ta">Spat Stat</journal-id><journal-id journal-id-type="iso-abbrev">Spat Stat</journal-id><journal-title-group><journal-title>Spatial statistics</journal-title></journal-title-group><issn pub-type="epub">2211-6753</issn></journal-meta><article-meta><article-id pub-id-type="pmid">37396190</article-id><article-id pub-id-type="pmc">10312012</article-id><article-id pub-id-type="doi">10.1016/j.spasta.2023.100757</article-id><article-id pub-id-type="manuscript">NIHMS1902698</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>A Hypothesis Test for Detecting Distance-Specific Clustering and Dispersion in Areal Data</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Self</surname><given-names>Stella</given-names></name><xref rid="CR1" ref-type="corresp">1</xref><xref rid="A2" ref-type="aff">2</xref></contrib><contrib contrib-type="author"><name><surname>Overby</surname><given-names>Anna</given-names></name><xref rid="A3" ref-type="aff">3</xref></contrib><contrib contrib-type="author"><name><surname>Zgodic</surname><given-names>Anja</given-names></name><xref rid="A2" ref-type="aff">2</xref></contrib><contrib contrib-type="author"><name><surname>White</surname><given-names>David</given-names></name><xref rid="A4" ref-type="aff">4</xref></contrib><contrib contrib-type="author"><name><surname>McLain</surname><given-names>Alexander</given-names></name><xref rid="A2" ref-type="aff">2</xref><xref rid="A5" ref-type="aff">5</xref></contrib><contrib contrib-type="author"><name><surname>Dyckman</surname><given-names>Caitlin</given-names></name><xref rid="A3" ref-type="aff">3</xref><xref rid="A5" ref-type="aff">5</xref></contrib></contrib-group><aff id="A2"><label>2</label>Arnold School of Public Health, University of South Carolina, 921 Assembly Street, Columbia, SC 29208, USA</aff><aff id="A3"><label>3</label>College of Architecture, Arts and Humanities, Clemson University, Fernow Street, Clemson, SC 29634, USA</aff><aff id="A4"><label>4</label>College of Behavioral, Social and Health Sciences, Clemson University, Epsilon Zeta Dr, Clemson, SC 29634, USA</aff><aff id="A5"><label>5</label>Shared Last Author</aff><author-notes><corresp id="CR1"><label>1</label>Corresponding Author <email>scwatson@mailbox.sc.edu</email> (Stella Self)</corresp></author-notes><pub-date pub-type="nihms-submitted"><day>11</day><month>6</month><year>2023</year></pub-date><pub-date pub-type="ppub"><month>6</month><year>2023</year></pub-date><pub-date pub-type="epub"><day>19</day><month>5</month><year>2023</year></pub-date><pub-date pub-type="pmc-release"><day>01</day><month>6</month><year>2024</year></pub-date><volume>55</volume><elocation-id>100757</elocation-id><abstract id="ABS1"><p id="P1">Spatial clustering detection has a variety of applications in diverse fields, including identifying infectious disease outbreaks, pinpointing crime hotspots, and identifying clusters of neurons in brain imaging applications. Ripley&#x02019;s K-function is a popular method for detecting clustering (or dispersion) in point process data at specific distances. Ripley&#x02019;s K-function measures the expected number of points within a given distance of any observed point. Clustering can be assessed by comparing the observed value of Ripley&#x02019;s K-function to the expected value under complete spatial randomness. While performing spatial clustering analysis on point process data is common, applications to areal data commonly arise and need to be accurately assessed. Inspired by Ripley&#x02019;s K-function, we develop the <italic toggle="yes">positive area proportion function (PAPF)</italic> and use it to develop a hypothesis testing procedure for the detection of spatial clustering and dispersion at specific distances in areal data. We compare the performance of the proposed PAPF hypothesis test to that of the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, and the spatial scan statistic with extensive simulation studies. We then evaluate the real-world performance of our method by using it to detect spatial clustering in land parcels containing conservation easements and US counties with high pediatric overweight/obesity rates.</p></abstract><kwd-group><kwd>cluster detection</kwd><kwd>clustering</kwd><kwd>areal data</kwd><kwd>Ripley&#x02019;s K-function</kwd></kwd-group></article-meta></front><body><sec id="S1"><label>1.</label><title>Introduction</title><p id="P2">The rapid rise in popularity of geographic information system (GIS) software over the past thirty years has led to an explosion of spatial data and associated analytical methods. Two of the most common types of spatial data are point process data and areal data. Point process data are associated with specific coordinate locations (such as a geocoded addresses), while areal data are associated with spatial regions (such as a counties or census tracts). The data may contain either location information only (e.g., the boundaries of a census tract that the United States Department of Agriculture has designated a food desert due to a paucity of stores selling healthy food) or location information combined with numerical attributes (e.g., the boundaries of a census tract with the number of healthy grocery stores). In this paper, we restrict our attention to areal data that contains only location information, that is, data consisting of a predefined set of spatial regions, some of which possess a characteristic of interest. We consider the set of spatial regions as fixed and only the possession of the characteristic as random. We consider the problem of assessing this type of data for spatial clustering and dispersion, loosely defined as an excess of regions with the characteristic of interest in part(s) of the study area (clustering) or semi-regular placement of such regions (dispersion). Census tracts designated as food deserts, tracts of land with development restrictions, or counties which required individuals to wear a mask in public during the COVID-19 pandemic are all examples of areal data that could be clustered. In this paper, we develop a method for detecting clustering and/or dispersion in areal data at specific distances.</p><p id="P3">Clustering can occur at different distances. For example, food desert census tracts might be clustered at close distances in metropolitan areas and at larger distances in more rural areas. Additionally, data may exhibit dispersion at one distance and clustering at another. For example, parks may exhibit small scale dispersion (e.g., city parks are unlikely to be within a quarter mile of another park), but large scale clustering (e.g., city parks are likely to be within 5 miles of a another park). In practice, the distance at which clustering and/or dispersion occur are generally informative about which processes may be causing the phenomena. Statistical methods that can detect clustering and/or dispersion at specific distances are desirable.</p><p id="P4">There are several existing methods to assess areal data for clustering, including the global Moran&#x02019;s I statistic [<xref rid="R45" ref-type="bibr">45</xref>], the Getis-Ord general G statistic [<xref rid="R28" ref-type="bibr">28</xref>], and the spatial scan statistic [<xref rid="R35" ref-type="bibr">35</xref>]. (These methods can also be used for point process data, with some modifications). However, obtaining the distance at which clustering/dispersion occurs is not straightforward for any of these methods. Ripley&#x02019;s K-function is able to detect clustering/dispersion at specific distances, but it is only suitable for point process data [<xref rid="R49" ref-type="bibr">49</xref>, <xref rid="R50" ref-type="bibr">50</xref>, <xref rid="R51" ref-type="bibr">51</xref>]. Ripley&#x02019;s K-function and related variants have been widely used in ecology [<xref rid="R30" ref-type="bibr">30</xref>, <xref rid="R39" ref-type="bibr">39</xref>, <xref rid="R37" ref-type="bibr">37</xref>], epidemiology [<xref rid="R19" ref-type="bibr">19</xref>, <xref rid="R26" ref-type="bibr">26</xref>, <xref rid="R10" ref-type="bibr">10</xref>], and spatial economics [<xref rid="R40" ref-type="bibr">40</xref>, <xref rid="R22" ref-type="bibr">22</xref>, <xref rid="R41" ref-type="bibr">41</xref>]. Despite the suitability issue, Ripley&#x02019;s K-function is commonly (mis)applied to areal data by mapping each spatial region to its centroid. For example, many researchers have attempted to assess spatial patterns in land parcel data using Ripley&#x02019;s K-function [<xref rid="R38" ref-type="bibr">38</xref>, <xref rid="R52" ref-type="bibr">52</xref>, <xref rid="R57" ref-type="bibr">57</xref>, <xref rid="R48" ref-type="bibr">48</xref>]. Other researchers have taken public health data associated with a geographical region (e.g., a city or health division) and computed Ripley&#x02019;s K-function using the centroids of the regions [<xref rid="R55" ref-type="bibr">55</xref>, <xref rid="R33" ref-type="bibr">33</xref>, <xref rid="R53" ref-type="bibr">53</xref>]. Ripley&#x02019;s K-function has also been used to assess areal data for clustering in a variety of ecological and geological applications [<xref rid="R34" ref-type="bibr">34</xref>, <xref rid="R16" ref-type="bibr">16</xref>, <xref rid="R43" ref-type="bibr">43</xref>].</p><p id="P5">Applying Ripley&#x02019;s K-function to the centroids of spatial regions is particularly problematic when the regions are vastly different sizes. For example, in one of our motivating data applications we wish to determine if land parcels with conservation easements (CEs) are clustered. Under the null hypothesis, all parcels are equally likely to have a CE. Sections of the study area with many small parcels (such as a metropolitan area) will have more parcels with CEs than portions of the study area with many large parcels, simply because there are more parcels per unit area. Put another way, centroids of smaller parcels will appear clustered relative to centroids of larger parcels simply because the size of the small parcels allows the centroids to be closer together (independent of the spatial pattern).</p><p id="P6">The (mis)application of Ripley&#x02019;s K-function to areal data is partially attributable to the lack of distance-specific cluster detection methods designed specifically for areal data. Further, it is enabled by popular spatial software packages like ArcGIS, which map areal data to their centroids in order to apply Ripley&#x02019;s K-function. Specifically, when performing hypothesis testing via Ripley&#x02019;s K-function with areal data in the Multidistance Spatial Cluster Analysis ArcGIS tool [<xref rid="R23" ref-type="bibr">23</xref>], the &#x02018;observed points&#x02019; (i.e., in continuous space) are defined as centroids of the polygons with the characteristic of interest [<xref rid="R23" ref-type="bibr">23</xref>]. This is done by default and without a warning message. We show in our simulation studies that such (mis)applications of Ripley&#x02019;s K-function to areal data often result in a severely inflated type I error rate.</p><p id="P7">In this paper, we develop a method for detecting distance-specific clustering/dispersion in areal data. Our method is motivated by Ripley&#x02019;s K-function and has a similar interpretation. In <xref rid="S2" ref-type="sec">Section 2</xref>, we introduce our method and explore some of its properties. <xref rid="S10" ref-type="sec">Section 3</xref> presents the results of an extensive simulation study in which we compare the ability of our method to detect clustering and dispersion to that of the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, the spatial scan statistic, and three point-process methods. In <xref rid="S17" ref-type="sec">Section 4</xref>, we demonstrate the use of our method on two real datasets. First, we use it to determine if there is spatialy clustering in land parcels that contain CEs in Boulder County, Colorado. Next, we use our method to determine if US counties with high childhood overweight rates are spatially clustered. <xref rid="S20" ref-type="sec">Section 5</xref> provides concluding remarks and suggestions for future work.</p></sec><sec id="S2"><label>2.</label><title>Methodology</title><p id="P8">To motivate our methods, we begin with a brief review of Ripley&#x02019;s K-function. Suppose we have a two-dimensional spatial point process <inline-formula><mml:math id="M1" display="inline"><mml:mi>&#x1d4ab;</mml:mi></mml:math></inline-formula> defined on a Borel set <inline-formula><mml:math id="M2" display="inline"><mml:mi>&#x1d49c;</mml:mi><mml:mo>&#x02286;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. For any Borel set <inline-formula><mml:math id="M3" display="inline"><mml:mi>&#x1d4ae;</mml:mi><mml:mo>&#x02286;</mml:mo><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>, let <inline-formula><mml:math id="M4" display="inline"><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mi>&#x1d4ae;</mml:mi><mml:mo>)</mml:mo></mml:math></inline-formula> count the number of points (events) in <inline-formula><mml:math id="M5" display="inline"><mml:mi>&#x1d4ae;</mml:mi></mml:math></inline-formula>. The intensity of the point process at a point <inline-formula><mml:math id="M6" display="inline"><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> is given by
<disp-formula id="FD1">
<mml:math id="M7" display="block"><mml:mi>&#x003bb;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:munder><mml:mrow><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">m</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>&#x1d4ae;</mml:mi><mml:mo>&#x02192;</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow></mml:munder><mml:mspace width="0.25em"/><mml:mfrac><mml:mrow><mml:mi>E</mml:mi><mml:mo>[</mml:mo><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mi>&#x1d4ae;</mml:mi><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>&#x1d4ae;</mml:mi></mml:mrow></mml:mfrac></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M8" display="inline"><mml:mi>&#x1d4ae;</mml:mi></mml:math></inline-formula> is an arbitrary neighborhood surrounding <inline-formula><mml:math id="M9" display="inline"><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:math></inline-formula>. A point process is said to be <italic toggle="yes">stationary</italic> if the intensity <inline-formula><mml:math id="M10" display="inline"><mml:mi>&#x003bb;</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo>)</mml:mo></mml:math></inline-formula> is a constant function. Note that for a stationary process, the expected number of points in an area depends only on the size of the area and not on its location. See [<xref rid="R15" ref-type="bibr">15</xref>] or [<xref rid="R3" ref-type="bibr">3</xref>] for further information on point processes.</p><sec id="S3"><label>2.1.</label><title>Ripley&#x02019;s K-function</title><p id="P9">For a stationary point process <inline-formula><mml:math id="M11" display="inline"><mml:mi>&#x1d4ab;</mml:mi></mml:math></inline-formula> with intensity <inline-formula><mml:math id="M12" display="inline"><mml:mi>&#x003bb;</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>&#x003bb;</mml:mi></mml:math></inline-formula> and a distance <inline-formula><mml:math id="M13" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>, Ripley&#x02019;s K-function is defined as
<disp-formula id="FD2">
<mml:math id="M14" display="block"><mml:mi>K</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi>&#x003bb;</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mi>E</mml:mi><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M15" display="inline"><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:math></inline-formula> is any point arising from <inline-formula><mml:math id="M16" display="inline"><mml:mi>&#x1d4ab;</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M17" display="inline"><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:math></inline-formula> is the circle centered at <inline-formula><mml:math id="M18" display="inline"><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:math></inline-formula> with radius <inline-formula><mml:math id="M19" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> [<xref rid="R49" ref-type="bibr">49</xref>]. Thus <inline-formula><mml:math id="M20" display="inline"><mml:mi>K</mml:mi><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:math></inline-formula> is the expected number of additional points within a distance of <inline-formula><mml:math id="M21" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of any point in <inline-formula><mml:math id="M22" display="inline"><mml:mi>&#x1d4ab;</mml:mi></mml:math></inline-formula>, re-scaled by the intensity of <inline-formula><mml:math id="M23" display="inline"><mml:mi>&#x1d4ab;</mml:mi></mml:math></inline-formula>. For a homogeneous Poisson process (realizations of which exhibit complete spatial randomness) on an infinite study area, <inline-formula><mml:math id="M24" display="inline"><mml:mi>K</mml:mi><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>&#x003c0;</mml:mi><mml:msup><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. For a high-level overview of Ripley&#x02019;s K-function, see [<xref rid="R20" ref-type="bibr">20</xref>]; for a more in depth treatment of Ripley&#x02019;s K-function and related topics in point processes, see [<xref rid="R8" ref-type="bibr">8</xref>] or [<xref rid="R12" ref-type="bibr">12</xref>].</p><p id="P10">While the original formulation of Ripley&#x02019;s K-function assumes a stationary point process, the definition has been extended to handle certain types of nonstationary processes [<xref rid="R4" ref-type="bibr">4</xref>], and many related fucntions have been developed to further quantify the behavior of point processes. For instance, the L-function is a rescaling of the <italic toggle="yes">K</italic>-function, and the <italic toggle="yes">K</italic><sub><italic toggle="yes">d</italic></sub>-function is a kernel estimator of the probability density function of distances between points [<xref rid="R22" ref-type="bibr">22</xref>]. Variants of Ripley&#x02019;s K-function have also been developed for <italic toggle="yes">marked point processes</italic> (MPP), i.e., point processes for which each point is associated with a random value called a <italic toggle="yes">mark</italic>. The D-function is defined for MPP with binary marks, and consists of the difference between the K-function computed on points with one type of mark and the K-function computed on the remaining points [<xref rid="R19" ref-type="bibr">19</xref>, <xref rid="R2" ref-type="bibr">2</xref>]. The M-function quantifies the pattern in points with a particular type of mark relative to other points by weighting the number of points of the type in question within a distance <inline-formula><mml:math id="M25" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of a given point by the total number of points within distance <inline-formula><mml:math id="M26" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> [<xref rid="R42" ref-type="bibr">42</xref>]. The cross <italic toggle="yes">K</italic>-function gives the expected number of observed marks within a distance <inline-formula><mml:math id="M27" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of a given mark, where the observed mark types must be different than the given mark. The <inline-formula><mml:math id="M28" display="inline"><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>-function quantifies the correlation between marks of points separated by a distance <inline-formula><mml:math id="M29" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> [<xref rid="R47" ref-type="bibr">47</xref>]. For an effective overview of the practical use and interpretation of these functions, see [<xref rid="R41" ref-type="bibr">41</xref>].</p><p id="P11">Suppose that we have a realization of <inline-formula><mml:math id="M30" display="inline"><mml:mi>&#x1d4ab;</mml:mi></mml:math></inline-formula> consisting of <inline-formula><mml:math id="M31" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> observations, <inline-formula><mml:math id="M32" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. We can estimate <inline-formula><mml:math id="M33" display="inline"><mml:mi>K</mml:mi><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>)</mml:mo></mml:math></inline-formula> with
<disp-formula id="FD3">
<mml:math id="M34" display="block"><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>&#x003bb;</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mrow><mml:munderover><mml:mo stretchy="true">&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mspace width="0.25em"/></mml:mrow></mml:mrow><mml:mrow><mml:munderover><mml:mo stretchy="true">&#x02211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>&#x02260;</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mspace width="0.25em"/></mml:mrow></mml:mrow><mml:mfrac><mml:mrow><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:mfrac></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M35" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>&#x003bb;</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mo>/</mml:mo><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, <inline-formula><mml:math id="M36" display="inline"><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> denotes the area of <inline-formula><mml:math id="M37" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M38" display="inline"><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is a weight associated with points <inline-formula><mml:math id="M39" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M40" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>. In the traditional approach, <inline-formula><mml:math id="M41" display="inline"><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> if the distance between points <inline-formula><mml:math id="M42" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M43" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> is less than <inline-formula><mml:math id="M44" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> and 0 otherwise [<xref rid="R49" ref-type="bibr">49</xref>, <xref rid="R50" ref-type="bibr">50</xref>]. In practice, the traditional estimator generally exhibits some bias due to <italic toggle="yes">edge effects</italic>. This phenomena arises because <inline-formula><mml:math id="M45" display="inline"><mml:mi>N</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:math></inline-formula> is often lower than expected for <inline-formula><mml:math id="M46" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> near the boundary of <inline-formula><mml:math id="M47" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>, as some points which would otherwise contributed to <inline-formula><mml:math id="M48" display="inline"><mml:mi>N</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:math></inline-formula> fall outside of <inline-formula><mml:math id="M49" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>. However, many adjusted estimators of Ripley&#x02019;s K-function exist which modify the <inline-formula><mml:math id="M50" display="inline"><mml:msub><mml:mrow><mml:mi>w</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> to account for edge effects [<xref rid="R49" ref-type="bibr">49</xref>, <xref rid="R18" ref-type="bibr">18</xref>, <xref rid="R27" ref-type="bibr">27</xref>, <xref rid="R1" ref-type="bibr">1</xref>].</p></sec><sec id="S4"><label>2.2.</label><title>Using Ripley&#x02019;s K-function to Clustering and Dispersion</title><p id="P12">Ripley&#x02019;s K-function is often used to determine if an observed collection of points exhibits complete spatial randomness (CSR) (i.e., to determine if the points arise from a two-dimensional homogeneous Poisson process). The distribution of <inline-formula><mml:math id="M51" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> under CSR for a finite study area <inline-formula><mml:math id="M52" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> can be approximated with Monte Carlo simulations, which are used to perform a hypothesis test with the null hypothesis being that the observed data arises from a homogeneous Poisson process with rate parameter <inline-formula><mml:math id="M53" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>&#x003bb;</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover></mml:math></inline-formula>. In practice, these Monte Carlo simulations are often carried out conditional on a fixed number of observations <inline-formula><mml:math id="M54" display="inline"><mml:mo stretchy="false">(</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, in which case the null hypothesis is that the data arise from a two dimensional continuous uniform distribution. Large values of <inline-formula><mml:math id="M55" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> indicate spatial clustering, that is, the number of points within a distance of <inline-formula><mml:math id="M56" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of any given point is larger than would be expected if the data exhibited CSR. Small values of <inline-formula><mml:math id="M57" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> indicate dispersion, that is, the number of points within a distance of <inline-formula><mml:math id="M58" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of any given point is smaller than would be expected under CSR. While Ripley&#x02019;s K-function is most commonly used to test a null hypothesis of CSR, it can be used to test more complicated null hypotheses, such as that the data arise from a Neyman-Scott process [<xref rid="R18" ref-type="bibr">18</xref>] or a Strauss process [<xref rid="R14" ref-type="bibr">14</xref>]. Ripley&#x02019;s K-function-based hypothesis tests are an attractive tool for spatial data analysis because of their flexibility and interpretability. Ripley&#x02019;s K-function can simultaneously detect different spatial patterns at different distances (e.g., small distance dispersion combined with large distance clustering), which is a highly desirable and somewhat rare property among cluster detection methods.</p></sec><sec id="S5"><label>2.3.</label><title>Areal Processes</title><p id="P13">In this work, we assume we have a set of spatial regions, referred to as areal units, whose boundaries are fixed and known (e.g. the census tracts in a particular state). We observe a binary response variable for each of these areal units (e.g. whether the census tract is a food desert) which we refer to as &#x02018;binary areal data&#x02019; to distinguish it from areal data that are associated with one or more non-binary numeric attributes. For brevity, we will refer to areal units for which this binary random variable is equal to 1 as <italic toggle="yes">positive units</italic>, as they are positive for the characteristic of interest. The location and boundaries of the areal units are considered fixed, with the binary random variable serving as a random &#x02018;mark&#x02019;. To parallel the point process case, we loosely define clustering for binary areal data as an excess of positive units in a particular area and dispersion as positive units occurring a semi-regular intervals. The terms &#x02018;clustered data&#x02019; or &#x02018;clustering&#x02019; are sometimes used for data with positive spatial autocorrelation. For example, census tracts with a high rates of food insecurity might tend to be closer to other tracts with high rates. In binary areal data, there is no meaningful distinction between spatial clustering (an excess of positive units) and spatial autocorrelation (a higher concentration of similar values of the random variable).</p><p id="P14">We will define an areal process <inline-formula><mml:math id="M59" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> on a Borel subset <inline-formula><mml:math id="M60" display="inline"><mml:mi>&#x1d49c;</mml:mi><mml:mo>&#x02286;</mml:mo><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> as a collection of <inline-formula><mml:math id="M61" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> disjoint Borel sets <inline-formula><mml:math id="M62" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> whose union covers <inline-formula><mml:math id="M63" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> and a vector <inline-formula><mml:math id="M64" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> where <inline-formula><mml:math id="M65" display="inline"><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is a binary random variable associated with <inline-formula><mml:math id="M66" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. We will refer to the <inline-formula><mml:math id="M67" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> as <italic toggle="yes">areal units</italic>, and the set of <inline-formula><mml:math id="M68" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> for which <inline-formula><mml:math id="M69" display="inline"><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> as <italic toggle="yes">positive areal units</italic>. For any Borel subset <inline-formula><mml:math id="M70" display="inline"><mml:mi>&#x1d4ae;</mml:mi></mml:math></inline-formula> of <inline-formula><mml:math id="M71" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>, define <inline-formula><mml:math id="M72" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d4ae;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:mspace width="0.25em"/><mml:mi>A</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mi>&#x1d4ae;</mml:mi><mml:mo>&#x02229;</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. Thus <inline-formula><mml:math id="M73" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d4ae;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is the amount of area in <inline-formula><mml:math id="M74" display="inline"><mml:mi>&#x1d4ae;</mml:mi></mml:math></inline-formula> which falls into positive areal units. Note that <inline-formula><mml:math id="M75" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> maybe thought of as the areal analog of <inline-formula><mml:math id="M76" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> in the point process case: <inline-formula><mml:math id="M77" display="inline"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> counts observed points, <inline-formula><mml:math id="M78" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> counts observed area.</p><sec id="S6"><label>2.3.1.</label><title>Extending the Concept of Stationarity</title><p id="P15">In the point process case, the intensity was a limit of the ratio of the number of points in a region divided by the area of the region. The analogous quantity for an areal process would be the ratio of the amount of positive area in a region divided by the area of that region. That is
<disp-formula id="FD4">
<mml:math id="M79" display="block"><mml:mrow><mml:mi>&#x003bb;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:munder><mml:mrow><mml:mi>lim</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>&#x1d4ae;</mml:mi><mml:mo>&#x02192;</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow></mml:munder><mml:mfrac><mml:mrow><mml:mi>E</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d4ae;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">]</mml:mo></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>&#x1d4ae;</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M80" display="inline"><mml:mi>&#x1d4ae;</mml:mi></mml:math></inline-formula> is an arbitrary neighborhood around <inline-formula><mml:math id="M81" display="inline"><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:math></inline-formula>. Thus for <inline-formula><mml:math id="M82" display="inline"><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>&#x003bb;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mfenced close="" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:mrow></mml:mfenced><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. We define an areal process as &#x02018;stationary&#x02019; if <inline-formula><mml:math id="M83" display="inline"><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced></mml:math></inline-formula> is the same for all <inline-formula><mml:math id="M84" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>. It can be shown that for a stationary areal process with no edges (for example, an areal process on <inline-formula><mml:math id="M85" display="inline"><mml:msup><mml:mrow><mml:mi mathvariant="double-struck">R</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>E</mml:mi><mml:mo>[</mml:mo><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d4ae;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>]</mml:mo></mml:math></inline-formula> depends only on the size of <inline-formula><mml:math id="M86" display="inline"><mml:mi>&#x1d4ae;</mml:mi></mml:math></inline-formula>. For a stationary areal process with <inline-formula><mml:math id="M87" display="inline"><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mi>&#x003bb;</mml:mi></mml:math></inline-formula> for all <inline-formula><mml:math id="M88" display="inline"><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>E</mml:mi><mml:mo>[</mml:mo><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>]</mml:mo><mml:mo>/</mml:mo><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>&#x003bb;</mml:mi></mml:math></inline-formula>. We also define an <italic toggle="yes">independent</italic> areal process as one for which the <inline-formula><mml:math id="M89" display="inline"><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> s are mutually independent.</p></sec></sec><sec id="S7"><label>2.4</label><title>. The Positive Area Proportion Function</title><p id="P16">Given an areal process <inline-formula><mml:math id="M90" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>, define the <italic toggle="yes">positive area proportion function for an areal unit</italic>
<inline-formula><mml:math id="M91" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>
<italic toggle="yes">at a distance</italic>
<inline-formula><mml:math id="M92" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula> as
<disp-formula id="FD5">
<mml:math id="M93" display="block"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>&#x1d4a9;</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msubsup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>+</mml:mo><mml:mspace width="0.25em"/><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mfenced open="" close="]" separators="|"><mml:mrow><mml:mi>A</mml:mi><mml:mfenced open="[" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:mi>&#x1d49c;</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac><mml:mo>&#x022c5;</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M94" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the centroid of <inline-formula><mml:math id="M95" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M96" display="inline"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the number of positive units. Note that conditioning on <inline-formula><mml:math id="M97" display="inline"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is necessary for the expectation to be defined, as <inline-formula><mml:math id="M98" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula> if <inline-formula><mml:math id="M99" display="inline"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>. The first term in the expectation is the proportion of positive area within a distance of <inline-formula><mml:math id="M100" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of <inline-formula><mml:math id="M101" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> where the numerator captures area that either falls in <inline-formula><mml:math id="M102" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>A</mml:mi><mml:mo>[</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo>]</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> or is positive <inline-formula><mml:math id="M103" display="inline"><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d4a9;</mml:mi><mml:mo>[</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo>]</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. The second term is the inverse of the total proportion of the study area which is positive.</p><p id="P17">Binary areal data is clustered if positive areas tend to occur near other positive areas. Here, <inline-formula><mml:math id="M104" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is used to quantify the expected amount of additional positive area near <inline-formula><mml:math id="M105" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. This quantity only meaningfully assesses clustering near <inline-formula><mml:math id="M106" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> if <inline-formula><mml:math id="M107" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is positive. Just estimators of Ripley&#x02019;s K-function are only computed at observed points, estimators of <inline-formula><mml:math id="M108" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> are only computed for positive units <inline-formula><mml:math id="M109" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. The inclusion of <inline-formula><mml:math id="M110" display="inline"><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mo>&#x02229;</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> in the numerator of the first term ensures that sample based estimators, which are only computed if <inline-formula><mml:math id="M111" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is positive, are unbiased for <inline-formula><mml:math id="M112" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Just as Ripley&#x02019;s K-function quantifies the number of points within a certain distance of <italic toggle="yes">an observed point</italic>, the positive area proportion function quantifies the amount of positive area within a certain distance of <italic toggle="yes">a positive unit&#x02019;s centroid</italic>. See <xref rid="F1" ref-type="fig">Figure 1</xref> for an illustration of the quantities involved in <inline-formula><mml:math id="M113" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>.</p><p id="P18">Heuristically, <inline-formula><mml:math id="M114" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is loosely analogous to an edge-corrected Ripley&#x02019;s K-function, with positive area playing the role of observed points. However, the inherent areal structure makes <inline-formula><mml:math id="M115" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> dependent on the choice of the positive unit <inline-formula><mml:math id="M116" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, while Ripley&#x02019;s K-function does not depend on the choice of the observed point. To remove this dependence on the choice of <inline-formula><mml:math id="M117" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, we can average the <inline-formula><mml:math id="M118" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> values over the positive units. Define the <italic toggle="yes">positive area proportion function at a distance</italic>
<inline-formula><mml:math id="M119" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> by
<disp-formula id="FD6">
<mml:math id="M120" display="block"><mml:mi>M</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mrow><mml:munderover><mml:mo stretchy="true">&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mspace width="0.25em"/></mml:mrow></mml:mrow><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
where the expectation is taken over <inline-formula><mml:math id="M121" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula>. Thus, <inline-formula><mml:math id="M122" display="inline"><mml:mi>M</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is the expected value of the average of the positive area proportion function of the positive units. Allow <inline-formula><mml:math id="M123" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and <inline-formula><mml:math id="M124" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> to denote <inline-formula><mml:math id="M125" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and <inline-formula><mml:math id="M126" display="inline"><mml:mi>M</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> for a stationary independent process, respectively. It can be shown that
<disp-formula id="FD7">
<mml:math id="M127" display="block"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:mfrac><mml:mrow><mml:munderover><mml:mo stretchy="true">&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mspace width="0.25em"/></mml:mrow></mml:mrow><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>.</mml:mo></mml:math>
</disp-formula>
See <xref rid="SD1" ref-type="supplementary-material">Web Appendix A</xref> for more details.</p><p id="P19">For an areal process realization with <inline-formula><mml:math id="M128" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M129" display="inline"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>, we can estimate <inline-formula><mml:math id="M130" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> with
<disp-formula id="FD8">
<mml:math id="M131" display="block"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>&#x1d4a9;</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msubsup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>+</mml:mo><mml:mspace width="0.25em"/><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:mi>&#x1d49c;</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:math>
</disp-formula>
Note that by definition <inline-formula><mml:math id="M132" display="inline"><mml:mi>E</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>&#x02223;</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mo stretchy="false">]</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Further, define
<disp-formula id="FD9">
<mml:math id="M133" display="block"><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:mrow><mml:munder><mml:mo stretchy="true">&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>:</mml:mo><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:mrow><mml:mspace width="0.25em"/></mml:mrow></mml:mrow><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:math>
</disp-formula>
as the sample mean of the <inline-formula><mml:math id="M134" display="inline"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> s for each positive unit. It can be shown that if <inline-formula><mml:math id="M135" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> is a stationary, independent process, then <inline-formula><mml:math id="M136" display="inline"><mml:mi>E</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mo stretchy="false">]</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. This fact forms the crux of the hypothesis testing procedure presented in the next section. See <xref rid="SD1" ref-type="supplementary-material">Web Appendix A</xref> for additional details.</p></sec><sec id="S8"><label>2.5.</label><title>Hypothesis Testing using the Positive Area Proportion Function at Specific Distances</title><p id="P20">Suppose we have a realization <inline-formula><mml:math id="M137" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> from an areal process <inline-formula><mml:math id="M138" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>, and we wish to test the null hypothesis that <inline-formula><mml:math id="M139" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> is a stationary, independent areal process against the alternative hypothesis that <inline-formula><mml:math id="M140" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> exhibits clustering and/or dispersion. To reduce the variability of the assumed null distribution, we will condition on the number of positive units in the realization. Conditional on <inline-formula><mml:math id="M141" display="inline"><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:math></inline-formula>, the null hypothesis of a stationary, independent process implies that <inline-formula><mml:math id="M142" display="inline"><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>N</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>n</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> for <inline-formula><mml:math id="M143" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>:</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M144" display="inline"><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula> otherwise. The null hypothesis is thus equivalent to the so-called &#x02018;random labeling hypothesis&#x02019;, under which all configurations of <inline-formula><mml:math id="M145" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> positive units are equally likely.</p><p id="P21">A clustered areal process might violate the stationarity assumption, or the independence assumption, or both. Formally, we consider an areal process to exhibit <italic toggle="yes">excess-clustering</italic> if there exists a subset of contiguous areal units indexed by <inline-formula><mml:math id="M146" display="inline"><mml:mi>&#x1d49e;</mml:mi></mml:math></inline-formula> such that for all <inline-formula><mml:math id="M147" display="inline"><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x1d49e;</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo>&#x0003e;</mml:mo><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced></mml:math></inline-formula> for <inline-formula><mml:math id="M148" display="inline"><mml:mi>j</mml:mi><mml:mo>&#x02209;</mml:mo><mml:mi>C</mml:mi></mml:math></inline-formula>. That is, <inline-formula><mml:math id="M149" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> exhibits excess clustering if the probability of any areal unit in <inline-formula><mml:math id="M150" display="inline"><mml:mi>&#x1d49e;</mml:mi></mml:math></inline-formula> being positive is higher the probability of a unit out of <inline-formula><mml:math id="M151" display="inline"><mml:mi>&#x1d49e;</mml:mi></mml:math></inline-formula> being positive. An areal process exhibiting excess-clustering is not stationary, though it could be independent.</p><p id="P22">We will consider an areal process to exhibit <italic toggle="yes">autocorrelated-clustering</italic> if there exists at least one areal unit <inline-formula><mml:math id="M152" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> with at least one neighbor <inline-formula><mml:math id="M153" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> such that <inline-formula><mml:math id="M154" display="inline"><mml:mfenced close="" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:mrow></mml:mfenced><mml:mfenced open="" separators="|"><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo>&#x0003e;</mml:mo><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced></mml:math></inline-formula>, that is <inline-formula><mml:math id="M155" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula> exhibits positive spatial autocorrelation. Here the definition of neighbor can be taken to be any desired measure of proximity (shared border, centroids within a certain distance, etc.). Autocorrelated-clustering violates the independence assumption, but not necessarily the stationarity assumption. For the purposes of developing a hypothesis testing procedure, we will consider an areal process to be clustered if it exhibits excess-clustering or autocorrelated-clustering or both. Note that unless more than one realization of the same areal process is observed (which is rare), it will not be possible to distinguish between the two types of clustering.</p><p id="P23">Consider the following test statistic
<disp-formula id="FD10">
<mml:math id="M156" display="block"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M157" display="inline"><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula><mml:math id="M158" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mrow><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> and <inline-formula><mml:math id="M159" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>&#x1d4a9;</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mo>&#x02229;</mml:mo><mml:msubsup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mo>&#x02229;</mml:mo><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mfenced open="" close="]" separators="|"><mml:mrow><mml:mi>A</mml:mi><mml:mfenced open="[" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mo>&#x02229;</mml:mo><mml:mi>&#x1d49c;</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac><mml:mo>&#x022c5;</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>&#x1d4a9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x1d49c;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mfenced></mml:math></inline-formula> for a stationary, independent process. Under the null hypothesis, <inline-formula><mml:math id="M160" display="inline"><mml:mi>E</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>n</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>. Positive values of <inline-formula><mml:math id="M161" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> suggest clustering at distance <inline-formula><mml:math id="M162" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>. Data which exhibits excess-clustering will have a larger-than-expected number of positive units in <inline-formula><mml:math id="M163" display="inline"><mml:mi>&#x1d49e;</mml:mi></mml:math></inline-formula>, which will tend to inflate the values of <inline-formula><mml:math id="M164" display="inline"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> for <inline-formula><mml:math id="M165" display="inline"><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x1d49e;</mml:mi></mml:math></inline-formula> and thus inflate <inline-formula><mml:math id="M166" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Data which exhibits autocorrelated-clustering will have a larger than expected number of positive units with other positive units nearby, which will also inflate <inline-formula><mml:math id="M167" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. When the null hypothesis is rejected in favor of clustering at distance <inline-formula><mml:math id="M168" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>, we conclude that there is more area within a distance of <inline-formula><mml:math id="M169" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of each positive unit centroid than expected under a stationary, independent process.</p><p id="P24">The null distribution of <inline-formula><mml:math id="M170" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> can be estimated using a Monte Carlo procedure in which data is generated under the random labeling hypothesis, that is, the <inline-formula><mml:math id="M171" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula> values are generated by selecting <inline-formula><mml:math id="M172" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> positive areal units with equal probability. Note that <inline-formula><mml:math id="M173" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> can be computed exactly, but doing so requires evaluating <inline-formula><mml:math id="M174" display="inline"><mml:mi>N</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi>N</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>n</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mfenced></mml:math></inline-formula>-dimensional sums. Additionally, obtaining the terms in these sum requires computing the area of polygon intersections, which is generally computationally intense. As an alternative, we propose approximating <inline-formula><mml:math id="M175" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> by averaging the <inline-formula><mml:math id="M176" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> values from the same Monte Carlo procedure used to estimate the null distribution of <inline-formula><mml:math id="M177" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. An <inline-formula><mml:math id="M178" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level test for clustering at distance <inline-formula><mml:math id="M179" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> can then be conducted by rejecting the null hypothesis if <inline-formula><mml:math id="M180" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> exceeds <inline-formula><mml:math id="M181" display="inline"><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula> quantile estimated from the simulated null distribution.</p><p id="P25">The same test statistic <inline-formula><mml:math id="M182" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> can be used to detect dispersion. Dispersion is a difficult phenomena to precisely quantify. Intuitively, an areal process is dispersed if positive areal units tend to be located further away from other positive units than would be expected for a stationary, independent process. To produce a working definition of dispersion, suppose we have defined a neighbor structure on the areal units. We will consider an areal process to exhibit <italic toggle="yes">buffered-dispersion</italic> if there exists a set of non-neighboring unit(s) indexed by <inline-formula><mml:math id="M183" display="inline"><mml:mi>&#x1d49f;</mml:mi></mml:math></inline-formula> such that for all <inline-formula><mml:math id="M184" display="inline"><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x1d49f;</mml:mi></mml:math></inline-formula> and all neighbors <inline-formula><mml:math id="M185" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of <inline-formula><mml:math id="M186" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo>&#x0003e;</mml:mo><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced></mml:math></inline-formula>. Here, the units in <inline-formula><mml:math id="M187" display="inline"><mml:mi>&#x1d49f;</mml:mi></mml:math></inline-formula> are surrounded by &#x02018;buffer units&#x02019; with a lower probability of being positive. An areal process which exhibits buffer-dispersion is not stationary, but it could be independent. We will consider an areal process to exhibit <italic toggle="yes">autocorrelated-dispersion</italic> if there exists at least one areal unit <inline-formula><mml:math id="M188" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> with at least one neighbor <inline-formula><mml:math id="M189" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> such that <inline-formula><mml:math id="M190" display="inline"><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x02223;</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mo>&#x0003c;</mml:mo><mml:mi>P</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced></mml:math></inline-formula>. Autocorrelated dispersed areal processes are not independent, though they may be stationary. For the purposes of hypothesis testing, we will consider an areal process dispersed if it is either buffer-dispersed or autocorrelated-dispersed. While this definition likely does not cover all processes which could give rise to dispersed areal data, it does cover two most common cases of areal unit(s) with a high probability of being positive surrounded by areal units with lower probability of being positive and the case of negative spatial autocorrelation.</p><p id="P26">If an areal process is dispersed, then at least some of the positive units have fewer positive units nearby than we would expect for a stationary, independent process. These units will tend have to have lower than expected <inline-formula><mml:math id="M191" display="inline"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> values, decreasing <inline-formula><mml:math id="M192" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. We can perform an <inline-formula><mml:math id="M193" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level test of the null hypothesis that <inline-formula><mml:math id="M194" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> is a stationary, independent process against the alternative hypothesis that <inline-formula><mml:math id="M195" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> exhibits dispersion by rejecting the null hypothesis if <inline-formula><mml:math id="M196" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> falls below the <inline-formula><mml:math id="M197" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-quantile of the null distribution of <inline-formula><mml:math id="M198" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, which can be estimated using the Monte Carlo procedure described previously. If we reject the null hypothesis in favor of dispersion, we conclude that the amount of positive areal within a distance <inline-formula><mml:math id="M199" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> of positive unit centroids is less than expected for stationary independent process. Finally, an <inline-formula><mml:math id="M200" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level two-tailed test for either clustering or dispersion can be conducted by rejecting the null hypothesis if <inline-formula><mml:math id="M201" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is less than the estimated <inline-formula><mml:math id="M202" display="inline"><mml:mi>&#x003b1;</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:math></inline-formula> quantile of the null distribution or if <inline-formula><mml:math id="M203" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is greater than the <inline-formula><mml:math id="M204" display="inline"><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mi>&#x003b1;</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:math></inline-formula> quantile.</p></sec><sec id="S9"><label>2.6.</label><title>A Global Hypothesis Test for Type I Error Rate Control</title><p id="P27">In many applications, the ideal distance at which to test for spatial patterns may not be known. Computing the PAPF test statistic at a variety of radii induces a multiple testing problem and the potential for an inflated type I error rate. To control the overall type I error rate associated with such a procedure, we propose the following global test for clustering over a range of distances <inline-formula><mml:math id="M205" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x0211b;</mml:mi><mml:mo>=</mml:mo><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>. Define the global test statistic <inline-formula><mml:math id="M206" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">x</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x0211b;</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.25em"/><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>/</mml:mo><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> where <inline-formula><mml:math id="M207" display="inline"><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">v</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mo>&#x02061;</mml:mo><mml:msup><mml:mrow><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. We can estimate the null distribution of <inline-formula><mml:math id="M208" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> within the Monte Carlo procedure discussed above. That is, <inline-formula><mml:math id="M209" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is computed for each <inline-formula><mml:math id="M210" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x0211b;</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M211" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is computed for each dataset. Note that we can estimate <inline-formula><mml:math id="M212" display="inline"><mml:mi mathvariant="normal">v</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfenced></mml:math></inline-formula> from the same Monte Carlo procedure. An upper tailed test is indicative of clustering. To test for dispersion, define the test statistic <inline-formula><mml:math id="M213" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x0211b;</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.25em"/><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>/</mml:mo><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>. The null distribution of <inline-formula><mml:math id="M214" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> can be estimated analogously to that of <inline-formula><mml:math id="M215" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>C</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and a lower tailed test is indicative of dispersion. To simultaneously test for either clustering or dispersion, both tests can be performed and a Bonferroni correction for two tests applied. Similar methods have been proposed to derive global tests from Ripley&#x02019;s K-function and Ripley&#x02019;s D-function [<xref rid="R19" ref-type="bibr">19</xref>].</p></sec></sec><sec id="S10"><label>3.</label><title>Simulation Study</title><sec id="S11"><label>3.1.</label><title>Simulation Specifications</title><p id="P28">In this section, we perform an extensive simulation study to compare the performance our proposed PAPF hypothesis testing method to the performance of the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, and the spatial scan statistic. We also consider the performance of the misapplication of three point process methods to the areal unit centroids: a edge-corrected Ripley&#x02019;s K-function test, a related test based on Ripley&#x02019;s D-function [<xref rid="R19" ref-type="bibr">19</xref>], and the average nearest neighbor method [<xref rid="R13" ref-type="bibr">13</xref>]. These misapplications are included to highlight the importance of analyzing areal data only with methods designed for areal data. After describing our data generation procedures, we provide details on the implementation of each method.</p><p id="P29">We consider the performance of our proposed hypothesis testing procedure using two study areas which are shown in <xref rid="F2" ref-type="fig">Figure 2</xref>:</p><p id="P30"><inline-formula><mml:math id="M216" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> A 20 by 20 regular grid of <italic toggle="yes">N</italic><sub>1</sub> = 400 cells</p><p id="P31"><inline-formula><mml:math id="M217" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> The <italic toggle="yes">N</italic><sub>2</sub> = 3, 108 counties (and county-equivalents) in the contiguous US.</p><p id="P32">We define three distributions that will be used to sample the observed units. For each, areal unit <inline-formula><mml:math id="M218" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is positive when <inline-formula><mml:math id="M219" display="inline"><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. First, <inline-formula><mml:math id="M220" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> indicates that the random variable <inline-formula><mml:math id="M221" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>=</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> arises by selecting <inline-formula><mml:math id="M222" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> elements from <inline-formula><mml:math id="M223" display="inline"><mml:mo>{</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>N</mml:mi><mml:mo>}</mml:mo></mml:math></inline-formula> via sampling without replacement (SWoR) where <inline-formula><mml:math id="M224" display="inline"><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> gives the probability of selecting each element, and <inline-formula><mml:math id="M225" display="inline"><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> if <inline-formula><mml:math id="M226" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> was selected and 0 otherwise. Second, <inline-formula><mml:math id="M227" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>q</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> denotes that <inline-formula><mml:math id="M228" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula> is generated using the following two-step process where <inline-formula><mml:math id="M229" display="inline"><mml:msub><mml:mrow><mml:mi>Y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> if element <inline-formula><mml:math id="M230" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> was selected in either step:
<list list-type="roman-lower" id="L1"><list-item><p id="P33"><inline-formula><mml:math id="M231" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> elements of <inline-formula><mml:math id="M232" display="inline"><mml:mo>{</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>N</mml:mi><mml:mo>}</mml:mo></mml:math></inline-formula> are randomly selected via <inline-formula><mml:math id="M233" display="inline"><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> where <inline-formula><mml:math id="M234" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> for all <inline-formula><mml:math id="M235" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>.</p></list-item><list-item><p id="P34"><inline-formula><mml:math id="M236" display="inline"><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mi>m</mml:mi></mml:math></inline-formula> elements of <inline-formula><mml:math id="M237" display="inline"><mml:mo>{</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>N</mml:mi><mml:mo>}</mml:mo></mml:math></inline-formula> are selected via <inline-formula><mml:math id="M238" display="inline"><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, where <inline-formula><mml:math id="M239" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula> if <inline-formula><mml:math id="M240" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> was selected in step (i), <inline-formula><mml:math id="M241" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>q</mml:mi><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:math></inline-formula> for <inline-formula><mml:math id="M242" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> such that <inline-formula><mml:math id="M243" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> shares a border with at least one unit selected in step (i), and <inline-formula><mml:math id="M244" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:math></inline-formula> otherwise where <inline-formula><mml:math id="M245" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> is such that <inline-formula><mml:math id="M246" display="inline"><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>.</p></list-item></list></p><p id="P35">Third we define <inline-formula><mml:math id="M247" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>M</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>q</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> if <inline-formula><mml:math id="M248" display="inline"><mml:mi mathvariant="bold-italic">Y</mml:mi></mml:math></inline-formula> is generated as follows:
<list list-type="roman-lower" id="L2"><list-item><p id="P36">Divide the study area <inline-formula><mml:math id="M249" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula> into three regions: a clustered region <inline-formula><mml:math id="M250" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, a dispersed region <inline-formula><mml:math id="M251" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and a random scatter region <inline-formula><mml:math id="M252" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>.</p></list-item><list-item><p id="P37">Select <inline-formula><mml:math id="M253" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> units from <inline-formula><mml:math id="M254" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> as follows:
<list list-type="alpha-lower" id="L3"><list-item><p id="P38">Select 1 unit via <inline-formula><mml:math id="M255" display="inline"><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, where <inline-formula><mml:math id="M256" display="inline"><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the number of units in <inline-formula><mml:math id="M257" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M258" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msubsup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>.</p></list-item><list-item><p id="P39">Set <inline-formula><mml:math id="M259" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>q</mml:mi><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:math></inline-formula> if <inline-formula><mml:math id="M260" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is adjacent to a previously selected unit, <inline-formula><mml:math id="M261" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula> if <inline-formula><mml:math id="M262" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is a previously selected unit and <inline-formula><mml:math id="M263" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:math></inline-formula> otherwise; <inline-formula><mml:math id="M264" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> is chosen so that <inline-formula><mml:math id="M265" display="inline"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msubsup><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>.</p></list-item><list-item><p id="P40">Select one unit via <inline-formula><mml:math id="M266" display="inline"><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula></p></list-item><list-item><p id="P41">Return to (b) until <inline-formula><mml:math id="M267" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> units have been selected.</p></list-item></list></p></list-item><list-item><p id="P42">Select <inline-formula><mml:math id="M268" display="inline"><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:math></inline-formula> units from <inline-formula><mml:math id="M269" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> via an analogous process to (ii) with <inline-formula><mml:math id="M270" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> now taken to be <inline-formula><mml:math id="M271" display="inline"><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mi>q</mml:mi><mml:mi>D</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> in step <inline-formula><mml:math id="M272" display="inline"><mml:mo stretchy="false">(</mml:mo><mml:mi>b</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> if unit <inline-formula><mml:math id="M273" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is adjacent to a previously selected unit.</p></list-item><list-item><p id="P43">Select <inline-formula><mml:math id="M274" display="inline"><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> units from <inline-formula><mml:math id="M275" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> via <inline-formula><mml:math id="M276" display="inline"><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, where <inline-formula><mml:math id="M277" display="inline"><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the number of units in <inline-formula><mml:math id="M278" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M279" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msubsup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>.</p></list-item></list></p><p id="P44">For the first distribution, <inline-formula><mml:math id="M280" display="inline"><mml:mi mathvariant="bold-italic">p</mml:mi></mml:math></inline-formula> can be used to to generate data under the null hypothesis or data with clustering in certain locations where the <inline-formula><mml:math id="M281" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>&#x02019;s are larger. For the second distribution, <inline-formula><mml:math id="M282" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> controls the degree of clustering or dispersion where larger <inline-formula><mml:math id="M283" display="inline"><mml:mi>q</mml:mi><mml:mo>&#x0003e;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> lead to more clustering while smaller <inline-formula><mml:math id="M284" display="inline"><mml:mi>q</mml:mi><mml:mo>&#x0003c;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> lead to more dispersion. The third distribution produces an area of small-scale clustering in <inline-formula><mml:math id="M285" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and an area of small-scale dispersion in <inline-formula><mml:math id="M286" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. When <inline-formula><mml:math id="M287" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> is sufficiently large relatively to <inline-formula><mml:math id="M288" display="inline"><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, this process will also produce clustering at larger distances in <inline-formula><mml:math id="M289" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>.</p><p id="P45">For each study area <inline-formula><mml:math id="M290" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>2</mml:mn></mml:math></inline-formula>, we generate data under 21 different scenarios. For brevity, we remove the subscript <inline-formula><mml:math id="M291" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> when describing these scenarios (all depend on <inline-formula><mml:math id="M292" display="inline"><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>). First we consider the null hypothesis of a stationary independent areal process (e.g. no spatial pattern in the positive units) via the following three scenarios:
<disp-formula id="FD13">
<mml:math id="M293" display="block"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD14">
<mml:math id="M294" display="block"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>4</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD15">
<mml:math id="M295" display="block"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
where <inline-formula><mml:math id="M296" display="inline"><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mi>N</mml:mi><mml:msup><mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>.</p><p id="P46">We also consider 12 scenarios in which the locations of observed units are clustered. The following 6 scenarios assess the ability to excess-clustering:
<disp-formula id="FD16">
<mml:math id="M297" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD17">
<mml:math id="M298" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD18">
<mml:math id="M299" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>4</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD19">
<mml:math id="M300" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="normal">SWoR</mml:mi></mml:mrow><mml:mo>&#x02061;</mml:mo><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mfenced open="&#x02308;" close="&#x02309;" separators="|"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mrow></mml:math>
</disp-formula>
<disp-formula id="FD20">
<mml:math id="M301" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD21">
<mml:math id="M302" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:math>
</disp-formula>
The <inline-formula><mml:math id="M303" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>-dimensional vectors <inline-formula><mml:math id="M304" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> for <inline-formula><mml:math id="M305" display="inline"><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mn>6</mml:mn></mml:math></inline-formula> are defined as follows: an entry of <inline-formula><mml:math id="M306" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is equal to <inline-formula><mml:math id="M307" display="inline"><mml:mi>q</mml:mi><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:math></inline-formula> if the unit is shown in blue in <xref rid="F3" ref-type="fig">Figure 3</xref> and equal to <inline-formula><mml:math id="M308" display="inline"><mml:mn>1</mml:mn><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:math></inline-formula> otherwise where <inline-formula><mml:math id="M309" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> is such that <inline-formula><mml:math id="M310" display="inline"><mml:msub><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. For <inline-formula><mml:math id="M311" display="inline"><mml:mi>l</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mo>{</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>5</mml:mn><mml:mo>}</mml:mo></mml:math></inline-formula> we take <inline-formula><mml:math id="M312" display="inline"><mml:mi>q</mml:mi><mml:mo>=</mml:mo><mml:mn>5</mml:mn></mml:math></inline-formula> and for <inline-formula><mml:math id="M313" display="inline"><mml:mi>l</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mo>{</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>4</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>6</mml:mn><mml:mo>}</mml:mo></mml:math></inline-formula> we take <inline-formula><mml:math id="M314" display="inline"><mml:mi>q</mml:mi><mml:mo>=</mml:mo><mml:mn>10</mml:mn></mml:math></inline-formula>. Thus the blue units are 5 times more likely to be observed than the white units under <inline-formula><mml:math id="M315" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M316" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, and 10 times more likely to be observed under <inline-formula><mml:math id="M317" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M318" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>.</p><p id="P47">Next, we assess the ability of our hypothesis test to autocorrelated-clustering by considering the following 6 scenarios:
<disp-formula id="FD22">
<mml:math id="M319" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>100</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>5</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD23">
<mml:math id="M320" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>100</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>10</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD24">
<mml:math id="M321" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>9</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>4</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>40</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>5</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD25">
<mml:math id="M322" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>4</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>40</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>10</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD26">
<mml:math id="M323" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>20</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>5</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD27">
<mml:math id="M324" display="block"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>20</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>10</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
Note that <inline-formula><mml:math id="M325" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>9</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M326" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> correspond to &#x02018;weaker&#x02019; clustering, in the sense that they tend to select fewer adjacent units than mechanisms <inline-formula><mml:math id="M327" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M328" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula></p><p id="P48">Next, we assess the ability of our hypothesis testing procedure to detect spatial dispersion under the following 4 scenarios.
<disp-formula id="FD28">
<mml:math id="M329" display="block"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mo stretchy="false">&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo stretchy="false">&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo stretchy="false">&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>100</mml:mn><mml:mo stretchy="false">&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD29">
<mml:math id="M330" display="block"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>100</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>0</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
<disp-formula id="FD30">
<mml:math id="M331" display="block"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mo stretchy="false">&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>6</mml:mn><mml:mo stretchy="false">&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo stretchy="false">&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>100</mml:mn><mml:mo stretchy="false">&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD31">
<mml:math id="M332" display="block"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>6</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>100</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>0</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
Here, in <inline-formula><mml:math id="M333" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M334" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> adjacent units are one tenth as likely to be observed as non-adjacent units, creating a mild dispersion effect. Under <inline-formula><mml:math id="M335" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M336" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> adjacent units cannot be selected at all, creating a stronger dispersion effect. Finally, we consider only two samples sizes when assessing dispersion <inline-formula><mml:math id="M337" display="inline"><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="M338" display="inline"><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>6</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> because it becomes increasingly difficult or impossible to select only non-adjacent units as number of selected units increases.</p><p id="P49">Finally, we assess the ability of the PAPF method to detect the simultaneous presence of spatial clustering and dispersion at different distances using the following two scenarios:
<disp-formula id="FD32">
<mml:math id="M339" display="block"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>M</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>10</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn>10</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>3</mml:mn></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
<disp-formula id="FD33">
<mml:math id="M340" display="block"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mi>M</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mo>&#x02308;</mml:mo><mml:mi>N</mml:mi><mml:mo>/</mml:mo><mml:mn>6</mml:mn><mml:mo>&#x02309;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn>10</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mn>3</mml:mn></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
We take <inline-formula><mml:math id="M341" display="inline"><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>9</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>21</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>13</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>60</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="M342" display="inline"><mml:msub><mml:mrow><mml:mi>m</mml:mi></mml:mrow><mml:mrow><mml:mn>22</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>100</mml:mn></mml:math></inline-formula>. The regions <inline-formula><mml:math id="M343" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M344" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> defined for our two study areas are shown in <xref rid="F3" ref-type="fig">Figure 3</xref>. Examples of data generated each scenario for <inline-formula><mml:math id="M345" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> are shown in <xref rid="F4" ref-type="fig">Figure 4</xref>. Web <xref rid="F1" ref-type="fig">Figure 1</xref> provides similar examples for <inline-formula><mml:math id="M346" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>.</p></sec><sec id="S12"><label>3.2.</label><title>Method Specifications</title><p id="P50">All hypothesis tests were performed with at a level of <inline-formula><mml:math id="M347" display="inline"><mml:mi>&#x003b1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula>. For each study area, both global and radii-specific PAPF tests, Ripley&#x02019;s K tests and Ripley&#x02019;s D tests were performed. The radii-specifc PAPF tests were performed for 10 radii, <inline-formula><mml:math id="M348" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x0211b;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mn>10</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>. The set of radii used for each study area are shown in <xref rid="F2" ref-type="fig">Figure 2</xref>. The smallest radius was equal the smallest distance between any areal unit centroids, and the radii were increased incrementally, with the largest radii being approximately one fourth of the width of the study area.</p><sec id="S13"><label>3.2.1.</label><title>PAPF Method</title><p id="P51">To perform the PAPF hypothesis test, Monte Carlo simulations were used. For each study area <inline-formula><mml:math id="M349" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, and observation size <inline-formula><mml:math id="M350" display="inline"><mml:mi>n</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:mfenced open="&#x02308;" close="&#x02309;" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mfenced open="&#x02308;" close="&#x02309;" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mn>6</mml:mn></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mfenced open="&#x02308;" close="&#x02309;" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mn>4</mml:mn></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mfenced open="&#x02308;" close="&#x02309;" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mi>G</mml:mi><mml:mo>=</mml:mo><mml:mn>200</mml:mn></mml:math></inline-formula> datasets were simulated under the null hypothesis of an stationary and independent areal process, conditional on <inline-formula><mml:math id="M351" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> positive units being observed. Specifically, for a given <inline-formula><mml:math id="M352" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula><mml:math id="M353" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M354" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msubsup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>&#x02032;</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> was sampled for <inline-formula><mml:math id="M355" display="inline"><mml:mi>g</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>G</mml:mi></mml:math></inline-formula>, and
<disp-formula id="FD34">
<mml:math id="M356" display="block"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:mfrac><mml:mrow><mml:munderover><mml:mo stretchy="true">&#x02211;</mml:mo><mml:mrow><mml:mi>g</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mspace width="0.25em"/></mml:mrow></mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math>
</disp-formula>
was calculated for each <inline-formula><mml:math id="M357" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x0211b;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. Then to approximate the distribution of the test statistic <inline-formula><mml:math id="M358" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>,
<disp-formula id="FD35">
<mml:math id="M359" display="block"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow><mml:mrow><mml:mi>g</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula>
was calculated for <inline-formula><mml:math id="M360" display="inline"><mml:mi>g</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:mi>G</mml:mi></mml:math></inline-formula> and the quantiles of <inline-formula><mml:math id="M361" display="inline"><mml:mfenced open="{" close="}" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mfenced separators="|"><mml:mrow><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:math></inline-formula> were used as approximate critical values for the hypothesis test. These Monte Carlo simulatons were also used to estimate the null distributions of <inline-formula><mml:math id="M362" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and <inline-formula><mml:math id="M363" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x022c5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>.</p><p id="P52">After approximating each null distribution, 500 instances of <inline-formula><mml:math id="M364" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> were generated under each of the 21 DGMs. For each generated <inline-formula><mml:math id="M365" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, <inline-formula><mml:math id="M366" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and <inline-formula><mml:math id="M367" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> for <inline-formula><mml:math id="M368" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x0211b;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> were computed and compared to critical values from the corresponding null distribution.</p></sec><sec id="S14"><label>3.2.2.</label><title>Comparison Methods</title><p id="P53"><xref rid="T1" ref-type="table">Table 1</xref> provides the assumed null, right-tailed and left-tailed alternative hypotheses for the PAPF, the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, the spatial scan statistic, and the misapplications of Ripley&#x02019;s K-function, Ripley&#x02019;s D-function, and the average nearest neighbor method. If the method requires a Monte Carlo procedure for estimating the null distribution of the test statistic, details of the data generation mechanism are also provided.</p><p id="P54">The global Moran&#x02019;s I statistic was computed for each dataset using the <monospace>Moran.I</monospace> function in the R package <monospace>ape</monospace> [<xref rid="R46" ref-type="bibr">46</xref>] using an adjacency-based spatial weights matrix. The Getis-Ord general G statistic was computed for each dataset using the <monospace>globalGtest</monospace> function in the R package <monospace>spdep</monospace> [<xref rid="R9" ref-type="bibr">9</xref>] using the same adjacency-based spatial weights matrix. The spatial scan statistic was computed using the <monospace>scan.test</monospace> function in the <monospace>spatstat</monospace> R package [<xref rid="R5" ref-type="bibr">5</xref>]. A binomial likelihood was assumed, with each areal unit having 1 trial. A set of circular zones with radii <inline-formula><mml:math id="M369" display="inline"><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x0211b;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> were considered. The Monte Carlo procedure for the spatial scan statistic is consistent with the null hypothesis of the statistic after conditioning on the number of observations <inline-formula><mml:math id="M370" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>.</p><p id="P55">Ripley&#x02019;s K-function was computed with the <monospace>Kest</monospace> function in the R <monospace>spatstat</monospace> package [<xref rid="R5" ref-type="bibr">5</xref>], using the centroids of positive units as the set of observation locations. The <monospace>envelope</monospace> function (also in the <monospace>spatstat</monospace> package) with nsims = 200 was used to perform hypothesis testing for the radii-specific tests. This corresponds to generating 200 simulated datasets consisting of <inline-formula><mml:math id="M371" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> observations generated from a two dimensional continuous uniform distribution on <inline-formula><mml:math id="M372" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula>. Note that neither the assumed null hypothesis nor the Monte Carlo procedure for Ripley&#x02019;s K-function is consistent with the random labelling hypothesis. In fact, the Monte Carlo procedure does not even produce &#x02018;centroids&#x02019; which are consistent with the assumed areal structure, that is, the points generated by the Monte Carlo procedure are not a subset of the areal unit centroids. Nevertheless, this is the approach used by ArcGIS to apply Ripley&#x02019;s K-function to areal data [<xref rid="R23" ref-type="bibr">23</xref>] and appears to be widely used in practice (see <xref rid="S1" ref-type="sec">Section 1</xref> for examples). As the centroids of the areal units are fully dependent on the assumed areal structure, it is highly unlikely that the centroids of positive units will be consistent with the assumed null hypothesis/Monte Carlo procedure of homogeneous point process (or for that matter, any stationary point process) even in the absence of clustering. We also applied a global Ripley&#x02019;s K test inspired by [<xref rid="R19" ref-type="bibr">19</xref>]. We used the following test statistic to detect clustering <inline-formula><mml:math id="M373" display="inline"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="normal">m</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">x</mml:mi><mml:mo stretchy="false">{</mml:mo><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>/</mml:mo><mml:msqrt><mml:mi mathvariant="normal">v</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mo>&#x02061;</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">]</mml:mo></mml:msqrt><mml:mo>:</mml:mo><mml:mi>r</mml:mi><mml:mo>&#x02208;</mml:mo><mml:msub><mml:mrow><mml:mi>&#x0211b;</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">}</mml:mo></mml:math></inline-formula> and defined <inline-formula><mml:math id="M374" display="inline"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> analogously to detect dispersion. The null distributions of both test statistics were estimated with Monte Carlo simulations for which data was generated using the <monospace>evelope</monospace> function as described above.</p><p id="P56">Diggle and Chetwynd propose a method for adapting Ripley&#x02019;s K-function for the detection of spatial clustering relative to a non-homogeneous null distribution, referred to as Ripley&#x02019;s D-function [<xref rid="R19" ref-type="bibr">19</xref>]. Observations are assumed to belong to one of two types (cases or controls). Ripley&#x02019;s D-function <inline-formula><mml:math id="M375" display="inline"><mml:mi>D</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is defined as the difference between Ripley&#x02019;s K-function computed using only the cases and Ripley&#x02019;s K-function computed using only the controls. To apply this method to our datasets, the centroids of positive areal units were treated as cases and the centroids of the other areal units were treated as the controls. The null hypothesis of random labeling is thus the same as an independent stationary areal process after conditioning on the number of positive areal units. The <monospace>Kest</monospace> function was used to compute Ripley&#x02019;s K-function for the cases and controls separately, and the test statistic was taken to be difference in Ripley&#x02019;s K-function between the two groups. An upper-tailed test was used to detect clustering, and a lower tailed test was used to detect dispersion. Monte Carlo simulations were used to estimate the null distribution of the test statistic. Again following [<xref rid="R19" ref-type="bibr">19</xref>], we also applied a global Ripley&#x02019;s D test. The test statistics <inline-formula><mml:math id="M376" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M377" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>n</mml:mi><mml:mi>D</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> were defined analogously to the global PAPF and global Ripley&#x02019;s K-function test statistics, with the estimated Ripley&#x02019;s D-function playing the role of <inline-formula><mml:math id="M378" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> or <inline-formula><mml:math id="M379" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mi>&#x002c6;</mml:mi></mml:mover><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. The null distributions of both test statistics were estimated with Monte Carlo simulations as described above.</p><p id="P57">Clark and Evans develop the average nearest neighbor method for detecting clustering in point process data [<xref rid="R13" ref-type="bibr">13</xref>]. The test statistic compares the average distance between each point and its nearest neighbor to the expected distance under a null hypothesis of complete spatial randomness. To apply this method to our data, the centroids of the positive areal units were treated as the observation locations. We note that the assumed null hypothesis of this method is not consistent with the random labeling hypothesis. The average nearest neighbor test statistic was computed using the <monospace>nni</monospace> function in the <monospace>spatialEco</monospace> R package [<xref rid="R24" ref-type="bibr">24</xref>].</p><p id="P58">For each method, a two-tailed test was performed in scenarios <inline-formula><mml:math id="M380" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>. For the spatial scan statistic and the Getis-Ord general G, these two-tailed test detect only clustering; for all other methods, the two-tailed test detects both clustering and dispersion. The two-tailed test for the global PAPF, global Ripley&#x02019;s K and global Ripley&#x02019;s D methods were conducting by performing separate tests for clustering and dispersion using the corresponding test statistics and applying a Bonferroni correction for 2 tests. As the empirical type I error rate for Ripley&#x02019;s K-function and the ANN method were severely inflated, they were not applied to the other scenarios. For scenarios <inline-formula><mml:math id="M381" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, an <inline-formula><mml:math id="M382" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level test for clustering was used for all the remaining methods. We note that a test for clustering is a single tailed test for all methods except the spatial scan statistic, for which it is a two-tailed test (see <xref rid="T1" ref-type="table">Table 1</xref>). For scenarios <inline-formula><mml:math id="M383" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, an <inline-formula><mml:math id="M384" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level test for dispersion was performed for the PAPF, Moran&#x02019;s I and Ripley&#x02019;s D methods. As neither the Getis-Ord general G statistic nor the spatial scan statistic can detect dispersion, these methods were not applied to these scenarios. Finally, for scenarios <inline-formula><mml:math id="M385" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, separate <inline-formula><mml:math id="M386" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level tests for clustering and dispersion were performed using the PAPF, Moran&#x02019;s I, and Ripley&#x02019;s D-function to assess the presence of both clustering and dispersion. An <inline-formula><mml:math id="M387" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>-level test for clustering was also performed using the Getis-Ord general G and the spatial scan statistic.</p></sec></sec><sec id="S15"><label>3.3.</label><title>Simulation Results</title><p id="P59"><xref rid="T2" ref-type="table">Tables 2</xref> and <xref rid="T3" ref-type="table">3</xref> summarize selected results from study area <inline-formula><mml:math id="M388" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (the regular grid) and study area <inline-formula><mml:math id="M389" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (the US counties), respectively. The remaining results can be found in Web <xref rid="T1" ref-type="table">Tables 1</xref> and <xref rid="T2" ref-type="table">2</xref>. Each table reports the empirical rate of rejection for the null hypothesis. For scenarios <inline-formula><mml:math id="M390" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M391" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, this quantity is the empirical type I error rate; for the other scenarios, this quantity is the empirical power. As the PAPF, Ripley&#x02019;s K and Ripley&#x02019;s D tests were performed at 10 different radii, the rejection rate for each radius is reported separately.</p><p id="P60">Under the null scenarios (<inline-formula><mml:math id="M392" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M393" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>), the empirical type I error rates of the global and radii-specific PAPF methods, the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, the spatial scan statistic and the global and radii-specific Ripley&#x02019;s D methods are within the Monte Carlo margin of error of their nominal levels. However the empirical type I error rate of global and radii-specific Ripley&#x02019;s K methods and the average nearest neighbor method were highly inflated for most scenarios. This is presumably due to the incongruities between the assumed null hypothesis of these tests and the null hypothesis under which data was generated.</p><p id="P61">Under the excess clustering scenarios <inline-formula><mml:math id="M394" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, the global PAPF test had 100% empirical power to detect clustering, with the exception of scenario <inline-formula><mml:math id="M395" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> under the regular grid. Additionally, the power of the radii-specific PAPF tests is fairly consistent across radii. For these scenarios, the performance of the global PAPF test is comparable to or better than the performance of the Moran&#x02019;s I statistic, the Getis-Ord general G statistic and the spatial scan statistic. Interestingly, while the performance of the global and radii-specific Ripley&#x02019;s D tests is comparable to or better than that of the global and radii-specific PAPF test for DGMs <inline-formula><mml:math id="M396" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, the Ripley&#x02019;s D test performs quite poorly for the larger sample sizes (DGMs <inline-formula><mml:math id="M397" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>) on the regular grid, while the PAPF test continues to perform well.</p><p id="P62">Under the autocorrelated-clustering DGMs <inline-formula><mml:math id="M398" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> the empirical power of the global PAPF test is generally high and the performance of the PAPF is comparable to that of Moran&#x02019;s I, Getis-Ord general G, and the scan statistic, with the exception scenarios <inline-formula><mml:math id="M399" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> on the regular grid and <inline-formula><mml:math id="M400" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> on the US counties, for which Moran&#x02019;s I and Getis-Ord are the top performers. The performance of the PAPF test and the Ripley&#x02019;s D test are generally comparable for scenarios <inline-formula><mml:math id="M401" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> on the regular grid, but the performance of the Ripley&#x02019;s D test deteriorates drastically for scenarios <inline-formula><mml:math id="M402" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, while the PAPF test continues to exhibit good performance. Both the global and local PAPF tests tends to outperform their Ripley&#x02019;s D counterparts for the US county scenarios.</p><p id="P63">Under the dispersion DGMs <inline-formula><mml:math id="M403" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> the performance of all methods was roughly comparable, except for the smaller sample sizes for the regular grid, for which the performance of the global PAPF statistic was noticeably worse than the others.</p><p id="P64">Under the mixture of clustering and dispersion scenarios <inline-formula><mml:math id="M404" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, the ability of the global PAPF test and global Ripley&#x02019;s D tests to detect clustering is quite good for the regular grid, but performance deteriorates noticeably for the US counties. The opposite is true for the spatial scan statistic, which exhibits poor performance (0% power) for the regular grid but excellent performance (100% power) for the US counties. The performance of the global Moran&#x02019;s I and Getis-Ord general G statistics to detect clustering is poor for all scenarios. The ability of the PAPF test, the Ripley&#x02019;s D test, and the global Moran&#x02019;s I statistic to detect dispersion is poor for the regular grid. However the ability of the PAPF test to detect dispersion on the US counties improves dramatically, while the performance of the other methods remain poor.</p><p id="P65">In summary, of the nine methods considered, only seven (the global and radii-specific PAPF tests, the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, the spatial scan statistic and the global and radii-specific Ripley&#x02019;s D tests) maintained their nominal type I error rates. The empirical power of the PAPF test was generally comparable to or better than all other methods, though there were a few exceptions to this rule.</p><p id="P66">Only Ripley&#x02019;s K-function, Ripley&#x02019;s D-function and the PAPF method allow one to detect distance-specific clustering or dispersion. Thus, if one wishes to test for spatial patterns at a single, specific distance, the radii-specific Ripley&#x02019;s K, Ripley&#x02019;s D and PAPF methods are the only potential options. However, the Ripley&#x02019;s K methods exhibited a severely inflated type I error rate and are thus unreliable. Notably, the performance of the global PAPF test is comparable to or better than that of the global Ripley&#x02019;s D test for all scenarios. In the some scenarios, the performance of the global PAPF test was 4&#x02013;6 times better than that of the global Ripley&#x02019;s D test. The Ripley&#x02019;s D test is designed for point process data and is here being misapplied to areal data, which may explain some of the strange behavior in it&#x02019;s results (e.g. worsening performance as the number of positive units increases). The Ripley&#x02019;s D test compares Ripley&#x02019;s K-function on the positive units to Ripley&#x02019;s K-function on the negative units. In scenarios <inline-formula><mml:math id="M405" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M406" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> on the regular grid, the global Ripley&#x02019;s D test performed very poorly. In these scenarios, almost all of the 200 negative units occur outside the region of excess clustering (shown in blue in <xref rid="F3" ref-type="fig">Figure 3</xref>). As there are 300 total units outside this region, this implies that probability that a unit in this region is negative is roughly 0.66, as opposed to 0.5 under the random labeling hypothesis. This implies that the negative units are <italic toggle="yes">also clustering-</italic> that is, they are more likely to be in the non-blue region than expected under the null hypothesis. Since Ripley&#x02019;s D-function compares the degree of clustering in the positive units to the degree of clustering in the negative units, the presence of clustering in both types of units may explain the lack of statistical significance in the global Ripley&#x02019;s D test. While this clustering of negative units also occurs in scenarios <inline-formula><mml:math id="M407" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, it is less extreme. For example, in scenario <inline-formula><mml:math id="M408" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, the probability of a unit outside the blue region being negative is approximately 0.97, versus 0.9 under the null hypothesis. The generally superior performance of the global PAPF method combined with the fact that Ripley&#x02019;s D-function is a point process method being misapplied to areal data compel use to recommend the use of the PAPF rather than Ripley&#x02019;s D-function to detect distance-specific patterns.</p></sec><sec id="S16"><label>3.4.</label><title>Supplementary Simulation Results</title><p id="P67">To assess the performance of the PAPF method under extremely small sample sizes, a simulation study was conducted using a 4&#x000d7;5 regular grid. In general, the global Ripley&#x02019;s D test had the highest power to detect clustering and the ability of all global methods to detect dispersion was roughly the same. However power of the radii-specific PAPF tests to detect both clustering and dispersion was generally higher than that of the radii-specific Ripley&#x02019;s D tests. The full simulation results can be found in Web <xref rid="T3" ref-type="table">Table 3</xref>.</p><p id="P68">Another simulation study was conducted to assess the sensitivity of the PAPF method to the dependence on areal unit centroids. In brief, the role of each areal unit&#x02019;s centroid in the definition of the PAPF was replaced with a randomly choosen point from each areal unit. For full details, see <xref rid="SD1" ref-type="supplementary-material">Web Appendix B</xref> of the <xref rid="SD1" ref-type="supplementary-material">Supplementary Material</xref>. The results of this simulation study are found in <xref rid="SD1" ref-type="supplementary-material">Web Tables 4</xref> and <xref rid="SD1" ref-type="supplementary-material">5</xref> in the <xref rid="SD1" ref-type="supplementary-material">Supplementary Material</xref>. The performance of this alternative PAPF method was comparable to that of the standard method.</p></sec></sec><sec id="S17"><label>4.</label><title>Data Application</title><p id="P69">In this section, we consider the performance of our method on real world applications from two different fields. First, we use the method to determine if land parcels with CEs are clustered in Boulder County, Colorado, using the 112,819 distinct land parcels in Boulder County as the areal structure. Next, we apply our method to determine if US counties with high childhood overweight rates are spatially clustered, using the 3,108 county and county-equivalents in the contiguous US as the areal structure.</p><sec id="S18"><label>4.1.</label><title>Application to Conservation Easements</title><p id="P70">CEs are a private and generally perpetual form of land conservation that legally severs aspects of private landownership (e.g., development rights, resource extraction, etc.) from a parcel of land [<xref rid="R44" ref-type="bibr">44</xref>]. Although a landowner makes an individual decision to place a CE, prior research has indicated spatial clustering of CEs over time, throughout the US [<xref rid="R36" ref-type="bibr">36</xref>]. Cumulative and clustered CE use may impact regional ecosystem character by altering the degree of CE parcels&#x02019; isolation or connectivity with other ecologically valuable parcels and may change ecologic quality on the CE parcel itself [<xref rid="R29" ref-type="bibr">29</xref>]. The greater the mass of clustering and ecological systems integrity, the more impact there may be on the land conversion rates at the county level, and on the decision to leave a parcel in open space (or not), potentially affecting placement of other socially valuable land uses. Recognizing if and where CEs are clustered and linking the social, political, biological, and geographical characteristics to the clustered areas may help elucidate the factors driving CE placement [<xref rid="R7" ref-type="bibr">7</xref>].</p><p id="P71">The Boulder County data consists of 112,819 land parcels in place in 2008. Of these land parcels, 817 were held as CEs. A parcel was considered to be part of a CE if any part of the parcel was part of an easement. <xref rid="F5" ref-type="fig">Figure 5</xref> depicts the land parcels; CE parcels are shown in blue. Global PAPF tests for clustering and dispersion were applied to the dataset using a Bonferroni correction to maintain an overall significance level of <inline-formula><mml:math id="M409" display="inline"><mml:mi>&#x003b1;</mml:mi></mml:math></inline-formula>, along with two-tailed radii-specific tests at 10 different radii, depicted in <xref rid="F5" ref-type="fig">Figure 5</xref>.</p><p id="P72">In order to apply our method, the distribution of the global and radii-specific PAPF test statistic under the null hypothesis was estimated using 200 Monte Carlo simulations. In each Monte Carlo simulation, 817 parcels were selected via simple random sampling without replacement. The observed global PAPF test statistic for clustering <inline-formula><mml:math id="M410" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mn>817</mml:mn><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfenced></mml:math></inline-formula> was larger than the 97.5th quantile of it&#x02019;s estimated null distribution. The radii-specific test for radius <inline-formula><mml:math id="M411" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (approximately 1.1 miles) was also larger than the 97.5th quantile of it&#x02019;s null distribution. No other radii-specific tests were significant. These results indicate that parcels which contain CEs are significantly clustered at small distances. The exact test statistics and estimated quantiles can be found in <xref rid="SD1" ref-type="supplementary-material">Web Table 6</xref> in the <xref rid="SD1" ref-type="supplementary-material">Supplementary Material</xref>.</p><p id="P73">In the context of CEs with a purpose of biological conservation, clustering easements close to one another is one reserve design principle to improve landscape connectivity and combat the adverse effects of habitat fragmentation from human land conversion [<xref rid="R17" ref-type="bibr">17</xref>, <xref rid="R31" ref-type="bibr">31</xref>]. Larger and higher quality habitats (particularly on CEs) increase the size and stability of source populations and subsequently increase species dispersal capabilities [<xref rid="R32" ref-type="bibr">32</xref>]. Clustering and structural connectivity between conservation areas are not always positive, however, as clustering may also leave these areas vulnerable to spatially autocorrelated extinction pressures, such as diseases, invasive species, stochastic environmental events, or negative effects from localized urban growth [<xref rid="R21" ref-type="bibr">21</xref>]. Given that the PAPF method indicated the spatial clustering of CEs at short distances in Boulder County, more detailed landscape connectivity studies focused on functional connectivity may be warranted [<xref rid="R6" ref-type="bibr">6</xref>, <xref rid="R54" ref-type="bibr">54</xref>].</p></sec><sec id="S19"><label>4.2.</label><title>Application to Counties with High Childhood Overweight/Obesity Rates</title><p id="P74">Next, we use the PAPF method to determine if US counties with high childhood overweight/obesity rates are spatially clustered. County-level childhood overweight rates were estimated from data collected in the 2016 National Survey of Children&#x02019;s Health using a multilevel small area estimation approach as described in [<xref rid="R56" ref-type="bibr">56</xref>]. A county was considered to have a high overweight rate if its estimated rate exceeded the 75th percentile of all county overweight rates. There are 3,108 counties and county-equivalents in the contiguous US, and 786 of these counties were found to have a high rate of childhood overweight. These counties are shown in blue in <xref rid="F5" ref-type="fig">Figure 5</xref>, along with the radii at which PAPF was applied. As for the CEs data application, an <inline-formula><mml:math id="M412" display="inline"><mml:mi>&#x003b1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula> global test for clustering or dispersion was conducted using <inline-formula><mml:math id="M413" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mn>786</mml:mn><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and <inline-formula><mml:math id="M414" display="inline"><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mn>786</mml:mn><mml:mi>D</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and applying a Bonferroni correction. A two-tailed <inline-formula><mml:math id="M415" display="inline"><mml:mi>&#x003b1;</mml:mi><mml:mo>=</mml:mo><mml:mo>.</mml:mo><mml:mn>05</mml:mn></mml:math></inline-formula> level test was also conducted for each radii.</p><p id="P75">The distribution of the global and radii-specific PAPF test statistics under the null hypothesis was estimated using 200 Monte Carlo simulations. In each Monte Carlo simulation, 786 counties were selected via simple random sampling without replacement. The observed global PAPF test statistic for clustering <inline-formula><mml:math id="M416" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mn>786</mml:mn><mml:mi>C</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfenced></mml:math></inline-formula> was larger than the 97.5th quantile of it&#x02019;s estimated null distribution. All 10 radii-specific test statistics were also greater than the 97.5th quantiles of their null distributions, indicating that counties with high rates of childhood overweight are significantly clustered at small and large distances. As Southeastern and Midwest states tend to have higher overweight and obesity rates than the rest of the country [<xref rid="R25" ref-type="bibr">25</xref>, <xref rid="R11" ref-type="bibr">11</xref>], these results are not surprising. The exact test statistics and estimated quantiles can be found in <xref rid="SD1" ref-type="supplementary-material">Web Table 7</xref> in the <xref rid="SD1" ref-type="supplementary-material">Supplementary Material</xref>.</p></sec></sec><sec id="S20"><label>5.</label><title>Conclusion</title><p id="P76">The problem of assessing binary areal data for distance-specific spatial clustering or dispersion has been the subject of relatively little attention. Existing methods such as the global Moran&#x02019;s I statistic, the Getis-Ord general G statistic, and the spatial scan statistic are global tests for clustering which are not directly amenable to determining the distance at which clustering is occurring. The global Moran&#x02019;s I and Getis-Ord general G statistics can be sometimes be tuned to pick up patterns at a specific distance by choosing an appropriate spatial weight matrix. However, the interpretation of the spatial scale induced by the weight matrix is often very complex, particularly for inverse-distance based weights (for which all observations are related to some degree) or for adjacency based weights (which ignore the size of the underlying units). One can attempt to deduce the distance at which clustering occurs from the spatial scan statistic by examining the size of the most likely cluster, but as the spatial scan statistic cannot detect dispersion, one cannot detect the simultaneous presence of clustering and dispersion operating at different distances. Further, the spatial scan statistic has some challenges with areal data since large neighboring units that can have centroids which are relatively far from each other. The PAPF method improves on the 0/1 indicator used by the spatial scan statistic by allowing observed units to be partially inside a zone of interest.</p><p id="P77">While the average nearest neighbor method and the traditional Ripley&#x02019;s K method are often used to assess areal data for clustering by mapping each areal unit to its centroid, these methods were not designed for areal data. Our simulation study shows that applying these methods in this manner results in a highly inflated type I error rate. In fact, in many settings these methods had a 100% type I error rate. Since such an approach is the default method used by ArcGIS software, these results are concerning.</p><p id="P78">To provide a means of testing binary areal data for clustering and dispersion at specific distances, we developed the positive area proportion function (PAPF). The PAPF is motivated by Ripley&#x02019;s K-function, and has a similar interpretation. The PAPF method quantifies the average proportion of positive area within a specified distance of each positive unit centroid. The PAPF can be used to perform a hypothesis test for the presence of spatial clustering or dispersion by comparing the observed PAPF test statistic to the distribution of the PAPF test statistic under the null hypothesis of a stationary and independent areal process, which is equivalent to the well-studied random labelling hypothesis after conditioning on the number of positive units. Simulation studies demonstrated that PAPF hypothesis testing procedure maintains its nominal type I error rate and has high power to detect a variety of spatial patterns, including clustering and dispersion at a variety of distances. The PAPF generally displayed comparable or higher power to other methods.</p><p id="P79">To our knowledge, the PAPF method is the only method for detecting distance-specific patterns in binary areal data. The ability to detect patterns at specific distances is of critical importance in many fields that use and evaluate spatial relationships including urban planning, regional science, conservation biology, public health, epidemiology and many others. For example, the ecological implications of small-distance clusters of conserved land are quite different than the implications of clustering across large distances; addressing many small clusters of food desert census tracts requires a different policy approach than addressing a single expansive swath of food desert tracts. Our method allows researchers to identify the spatial scale at which any clustering or dispersion is operating, facilitating a deeper understanding of the processes at work. Our method also provides a reliable alternative to the misapplication of Ripley&#x02019;s K-function to areal data.</p><p id="P80">To facilitate the use of our method, R code which implements the PAPF method and performs the necessary Monte Carlo simulations has been made available online at <ext-link xlink:href="https://github.com/scwatson812/PAPF" ext-link-type="uri">https://github.com/scwatson812/PAPF</ext-link>. The computational expense of the method increases with the number of positive units. When the number of observed areal units is large, the Monte Carlo simulations may be run in parallel to reduce computation time. The development of faster methods for approximating the null distribution is an excellent area for future work. Future work could also consider the extension of the PAPF to continuous data or categorical data with more than two categories. While this work considered the areal units as fixed and only the designation of being &#x02018;positive&#x02019; as random, it is possible to treat both the locations/boundaries of the areal units and the designation of being positive as random. Such an application might arise when partitioning US states into congressional districts, with the &#x02018;positive&#x02019; districts being those dominated by a particular political party. In such a case, one could treat the centiods of the areal units as a marked point process, with the <inline-formula><mml:math id="M417" display="inline"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> s as the associated marks. One could then analyze the resulting marked point process via the associated second order marked random measure and the marked <inline-formula><mml:math id="M418" display="inline"><mml:msub><mml:mrow><mml:mi>K</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>-function.</p></sec><sec sec-type="supplementary-material" id="SM1"><title>Supplementary Material</title><supplementary-material id="SD1" position="float" content-type="local-data"><label>MMC1</label><media xlink:href="NIHMS1902698-supplement-MMC1.pdf" id="d64e9068" position="anchor"/></supplementary-material></sec></body><back><ack id="S21"><title>Funding</title><p id="P81">SS and AM were supported in part by the Research Center for Child Well-Being [NIGMS P20GM130420]. SS, AZ, and AM were supported in part by the Centers for Disease Control [5 U19 DD 001218]. SS, AO, DW, and CD were supported in part by the National Science Foundation [CNH-L 1518455]. The funding sources played no role in study design, data collection, data analysis, or manuscript publication.</p></ack><fn-group><fn fn-type="COI-statement" id="FN1"><p id="P82">Competing Interests</p><p id="P83">Declarations of interest: none</p></fn><fn id="FN2"><p id="P84">CReDiT Author Statement</p><p id="P85">Stella Self: conceptualization, methodology, software, writing- original draft, writing-review &#x00026; editing visualization</p><p id="P86">Anna Overby: conceptualization, investigation, resources, data curation, writing- review &#x00026; editing</p><p id="P87">Anja Zgodic: conceptualization, investigation, data curation, writing- review &#x00026; editing</p><p id="P88">David White: resources, writing- review &#x00026; editing, supervision</p><p id="P89">Alexander McLain: conceptualization, data curation, writing- review &#x00026; editing, visualization, supervision, funding acquisition</p><p id="P90">Caitlin Dyckman: conceptualization, investigation, resources, data curation, writing- review &#x00026; editing, supervision, funding acquisition</p></fn><fn id="FN3"><p id="P91">Supplementary Material</p><p id="P92">The <xref rid="SD1" ref-type="supplementary-material">Supplementary Material</xref> contains derivations of properties of the PAPF (<xref rid="SD1" ref-type="supplementary-material">Web Appendix A</xref>), additional simulation results (<xref rid="SD1" ref-type="supplementary-material">Web Appendix B</xref>) and additional data application results (Web Appendix C).</p></fn><fn id="FN4"><p id="P93" content-type="publisher-disclaimer">This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.</p></fn></fn-group><ref-list><title>References</title><ref id="R1"><label>[1]</label><mixed-citation publication-type="journal"><name><surname>Andersen</surname><given-names>M</given-names></name> (<year>1992</year>). <article-title>Spatial analysis of two-species interaction</article-title>. <source>Oecologia</source>, <volume>91</volume>:<fpage>134</fpage>&#x02013;<lpage>140</lpage>.<pub-id pub-id-type="pmid">28313385</pub-id></mixed-citation></ref><ref id="R2"><label>[2]</label><mixed-citation publication-type="book"><name><surname>Arbia</surname><given-names>G</given-names></name>, <name><surname>Espa</surname><given-names>G</given-names></name>, and <name><surname>Quah</surname><given-names>D</given-names></name> (<year>2009</year>). <source>A class of spatial econometric methods in the empirical analysis of clusters of firms in the space</source>, pages <fpage>81</fpage>&#x02013;<lpage>103</lpage>. <publisher-name>Physica-Verlag HD</publisher-name>, <publisher-loc>Heidelberg</publisher-loc>.</mixed-citation></ref><ref id="R3"><label>[3]</label><mixed-citation publication-type="book"><name><surname>Baddeley</surname><given-names>A</given-names></name>, <name><surname>Gregori</surname><given-names>P</given-names></name>, <name><surname>Mateu</surname><given-names>J</given-names></name>, <name><surname>Stoica</surname><given-names>R</given-names></name>, and <name><surname>Stoyan</surname><given-names>D</given-names></name> (<year>2005</year>). <source>Case Studies in Spatial Point Process Modeling</source>. <publisher-name>Springer</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation></ref><ref id="R4"><label>[4]</label><mixed-citation publication-type="journal"><name><surname>Baddeley</surname><given-names>A</given-names></name>, <name><surname>M&#x000f8;ller</surname><given-names>J</given-names></name>, and <name><surname>Waagepetersen</surname><given-names>R</given-names></name> (<year>2000</year>). <article-title>Non- and semiparametric estimation of interaction in inhomogeneous point patterns</article-title>. <source>Statistica Neerlandica</source>, <volume>54</volume>(<issue>3</issue>):<fpage>329</fpage>&#x02013;<lpage>350</lpage>.</mixed-citation></ref><ref id="R5"><label>[5]</label><mixed-citation publication-type="book"><name><surname>Baddeley</surname><given-names>A</given-names></name>, <name><surname>Rubak</surname><given-names>E</given-names></name>, and <name><surname>Turner</surname><given-names>R</given-names></name> (<year>2015</year>). <source>Spatial Point Patterns: Methodology and Applications with R</source>. <publisher-name>Chapman and Hall/CRC Press</publisher-name>, <publisher-loc>London</publisher-loc>.</mixed-citation></ref><ref id="R6"><label>[6]</label><mixed-citation publication-type="journal"><name><surname>Balbi</surname><given-names>M</given-names></name>, <name><surname>Petit</surname><given-names>EJ</given-names></name>, <name><surname>Croci</surname><given-names>S</given-names></name>, <name><surname>Nabucet</surname><given-names>J</given-names></name>, <name><surname>Georges</surname><given-names>R</given-names></name>, <name><surname>Madec</surname><given-names>L</given-names></name>, and <name><surname>Ernoult</surname><given-names>A</given-names></name> (<year>2019</year>). <article-title>Ecological relevance of least cost path analysis: An easy implementation method for landscape urban planning</article-title>. <source>Journal of Environmental Management</source>, <volume>244</volume>:<fpage>61</fpage>&#x02013;<lpage>68</lpage>.<pub-id pub-id-type="pmid">31108311</pub-id></mixed-citation></ref><ref id="R7"><label>[7]</label><mixed-citation publication-type="journal"><name><surname>Baldwin</surname><given-names>R</given-names></name> and <name><surname>Leonard</surname><given-names>P</given-names></name> (<year>2015</year>). <article-title>Interacting social and environmental predictors for the spatial distribution of conservation lands</article-title>. <source>PLOS ONE</source>, <volume>10</volume>(<issue>10</issue>).</mixed-citation></ref><ref id="R8"><label>[8]</label><mixed-citation publication-type="book"><name><surname>Bene</surname><given-names>C</given-names></name> and <name><surname>Rataj</surname><given-names>J</given-names></name> (<year>2004</year>). <source>Stochastic Geometry: Selected Topics</source>. <publisher-name>Kluwer Academic Publishers</publisher-name>.</mixed-citation></ref><ref id="R9"><label>[9]</label><mixed-citation publication-type="book"><name><surname>Bivand</surname><given-names>RS</given-names></name>, <name><surname>Pebesma</surname><given-names>E</given-names></name>, and <name><surname>Gomez-Rubio</surname><given-names>V</given-names></name> (<year>2013</year>). <source>Applied spatial data analysis with R, Second edition</source>. <publisher-name>Springer</publisher-name>, <publisher-loc>NY</publisher-loc>.</mixed-citation></ref><ref id="R10"><label>[10]</label><mixed-citation publication-type="journal"><name><surname>Caprarelli</surname><given-names>G</given-names></name> and <name><surname>Fletcher</surname><given-names>S</given-names></name> (<year>2014</year>). <article-title>A brief review of spatial analysis concepts and tools used for mapping, containment and risk modelling of infectious diseases and other illnesses</article-title>. <source>Parasitology</source>, <volume>141</volume>(<issue>5</issue>):<fpage>581</fpage>&#x02013;<lpage>601</lpage>.<pub-id pub-id-type="pmid">24476672</pub-id></mixed-citation></ref><ref id="R11"><label>[11]</label><mixed-citation publication-type="book"><source>Centers for Disease Control and Prevention</source> (<year>2021</year>). <publisher-name>Trends and maps</publisher-name>, <comment><ext-link xlink:href="https://nccd.cdc.gov/dnpao_dtm/rdpage.aspx?rdreport=dnpao_dtm.explorebytopic&#x00026;islclass=ows&#x00026;isltopic=&#x00026;go=go" ext-link-type="uri">https://nccd.cdc.gov/dnpao_dtm/rdpage.aspx?rdreport=dnpao_dtm.explorebytopic&#x00026;islclass=ows&#x00026;isltopic=&#x00026;go=go</ext-link></comment>.</mixed-citation></ref><ref id="R12"><label>[12]</label><mixed-citation publication-type="book"><name><surname>Chiu</surname><given-names>SN</given-names></name>, <name><surname>Stoyan</surname><given-names>D</given-names></name>, <name><surname>Kendall</surname><given-names>W</given-names></name>, and <name><surname>Mecke</surname><given-names>J</given-names></name> (<year>2013</year>). <source>Stochastic Geometry and Its Applications</source>. <publisher-name>John Wiley &#x00026; Sons</publisher-name>.</mixed-citation></ref><ref id="R13"><label>[13]</label><mixed-citation publication-type="journal"><name><surname>Clark</surname><given-names>PJ</given-names></name> and <name><surname>Evans</surname><given-names>FC</given-names></name> (<year>1954</year>). <article-title>Distance to nearest neighbor as a measure of spatial relationships in populations</article-title>. <source>Ecology</source>, <volume>35</volume>(<issue>4</issue>):<fpage>445</fpage>&#x02013;<lpage>453</lpage>.</mixed-citation></ref><ref id="R14"><label>[14]</label><mixed-citation publication-type="journal"><name><surname>Cuzick</surname><given-names>J</given-names></name> and <name><surname>Edwards</surname><given-names>R</given-names></name> (<year>1990</year>). <article-title>Spatial clustering for inhomogeneous populations (with discussion)</article-title>. <source>Journal of teh Royal Statistical Society, Series B</source>, <volume>52</volume>.</mixed-citation></ref><ref id="R15"><label>[15]</label><mixed-citation publication-type="book"><name><surname>Daley</surname><given-names>DJ</given-names></name> and <name><surname>Vere-Jones</surname><given-names>D</given-names></name> (<year>1998</year>). <source>An Introduction to the Theory of Point Processes</source>. <publisher-name>Springer</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation></ref><ref id="R16"><label>[16]</label><mixed-citation publication-type="journal"><name><surname>Davarpanah</surname><given-names>A</given-names></name>, <name><surname>Babaie</surname><given-names>HA</given-names></name>, and <name><surname>Dai</surname><given-names>D</given-names></name> (<year>2018</year>). <article-title>Spatial autocorrelation of neogene-quaternary lava along the Snake River Plain, Idaho, USA</article-title>. <source>Earth Science Informatics</source>, <volume>11</volume>(<issue>1</issue>):<fpage>59</fpage>&#x02013;<lpage>75</lpage>.</mixed-citation></ref><ref id="R17"><label>[17]</label><mixed-citation publication-type="journal"><name><surname>Diamond</surname><given-names>J</given-names></name> (<year>1975</year>). <article-title>The island dilemma: lessons of modern biogeographic studies for the design of natural reserves</article-title>. <source>Biological Conservation</source>, <volume>7</volume>:<fpage>129</fpage>&#x02013;<lpage>146</lpage>.</mixed-citation></ref><ref id="R18"><label>[18]</label><mixed-citation publication-type="book"><name><surname>Diggle</surname><given-names>P</given-names></name> (<year>1983</year>). <source>Statistical analysis of spatial point patterns</source>. <publisher-name>Academic Press</publisher-name>, <publisher-loc>London</publisher-loc>.</mixed-citation></ref><ref id="R19"><label>[19]</label><mixed-citation publication-type="journal"><name><surname>Diggle</surname><given-names>P</given-names></name> and <name><surname>Chetwynd</surname><given-names>A</given-names></name> (<year>1991</year>). <article-title>Second-order analysis of spatial clustering for inhomogeneous populations</article-title>. <source>Biometrics</source>, <volume>47</volume>(<issue>3</issue>):<fpage>1155</fpage>&#x02013;<lpage>63</lpage>.<pub-id pub-id-type="pmid">1742435</pub-id></mixed-citation></ref><ref id="R20"><label>[20]</label><mixed-citation publication-type="book"><name><surname>Dixon</surname><given-names>PM</given-names></name> (<year>2014</year>). <source>Ripley&#x02019;s K Function</source>. <publisher-name>John Wiley &#x00026; Sons, Ltd</publisher-name>.</mixed-citation></ref><ref id="R21"><label>[21]</label><mixed-citation publication-type="journal"><name><surname>Donaldson</surname><given-names>L</given-names></name>, <name><surname>Wilson</surname><given-names>R</given-names></name>, and <name><surname>Maclean</surname><given-names>I</given-names></name> (<year>2016</year>). <article-title>Old concepts, new challenges: adapting landscape-scale conservation to the twenty-first century</article-title>. <source>Biodiversity and Conservation</source>, <volume>26</volume>(<issue>3</issue>):<fpage>527</fpage>&#x02013;<lpage>552</lpage>.<pub-id pub-id-type="pmid">32269427</pub-id></mixed-citation></ref><ref id="R22"><label>[22]</label><mixed-citation publication-type="journal"><name><surname>Duranton</surname><given-names>G</given-names></name> and <name><surname>Overman</surname><given-names>HG</given-names></name> (<year>2005</year>). <article-title>Testing for localization using micro-geographic data</article-title>. <source>The Review of Economic Studies</source>, <volume>72</volume>(<issue>4</issue>):<fpage>1077</fpage>&#x02013;<lpage>1106</lpage>.</mixed-citation></ref><ref id="R23"><label>[23]</label><mixed-citation publication-type="journal">ESRI (<year>2021</year>). <source>Arcmap 10.5.1: Multi-distance spatial cluster analysis (Ripley&#x02019;s K function) (spatial statistics)</source>.</mixed-citation></ref><ref id="R24"><label>[24]</label><mixed-citation publication-type="book"><name><surname>Evans</surname><given-names>JS</given-names></name> (<year>2021</year>). <source>spatialEco</source>. <publisher-name>R package version</publisher-name>
<volume>1</volume>.<fpage>3</fpage>&#x02013;<lpage>6</lpage>.</mixed-citation></ref><ref id="R25"><label>[25]</label><mixed-citation publication-type="journal"><name><surname>Gartner</surname><given-names>DR</given-names></name>, <name><surname>Taber</surname><given-names>DR</given-names></name>, <name><surname>Hirsch</surname><given-names>JA</given-names></name>, and <name><surname>Robinson</surname><given-names>WR</given-names></name> (<year>2016</year>). <article-title>The spatial distribution of gender differences in obesity prevalence differs from overall obesity prevalence among us adults</article-title>. <source>Annals of Epidemiology</source>, <volume>26</volume>(<issue>4</issue>):<fpage>293</fpage>&#x02013;<lpage>298</lpage><pub-id pub-id-type="pmid">27039046</pub-id></mixed-citation></ref><ref id="R26"><label>[26]</label><mixed-citation publication-type="journal"><name><surname>Gatrell</surname><given-names>AC</given-names></name>, <name><surname>Bailey</surname><given-names>TC</given-names></name>, <name><surname>Diggle</surname><given-names>PJ</given-names></name>, and <name><surname>Rowlingson</surname><given-names>BS</given-names></name> (<year>1996</year>). <article-title>Spatial point pattern analysis and its application in geographical epidemiology</article-title>. <source>Transactions of the Institute of British Geographers</source>, <volume>21</volume>(<issue>1</issue>):<fpage>256</fpage>&#x02013;<lpage>274</lpage>.</mixed-citation></ref><ref id="R27"><label>[27]</label><mixed-citation publication-type="journal"><name><surname>Getis</surname><given-names>A</given-names></name> and <name><surname>Franklin</surname><given-names>J</given-names></name> (<year>1987</year>). <article-title>Second-order neighborhood analysis of mapped point patterns</article-title>. <source>Ecology</source>, <volume>68</volume>(<issue>3</issue>):<fpage>473</fpage>&#x02013;<lpage>477</lpage>.</mixed-citation></ref><ref id="R28"><label>[28]</label><mixed-citation publication-type="journal"><name><surname>Getis</surname><given-names>A</given-names></name> and <name><surname>Ord</surname><given-names>JK</given-names></name> (<year>1992</year>). <article-title>The analysis of spatial association by use of distance statistics</article-title>. <source>Geographical Analysis</source>, <volume>24</volume>(<issue>3</issue>):<fpage>189</fpage>&#x02013;<lpage>206</lpage>.</mixed-citation></ref><ref id="R29"><label>[29]</label><mixed-citation publication-type="journal"><name><surname>Graves</surname><given-names>R</given-names></name>, <name><surname>Williamson</surname><given-names>M</given-names></name>, <name><surname>Belote</surname><given-names>T</given-names></name>, and <name><surname>Brandt</surname><given-names>J</given-names></name> (<year>2019</year>). <article-title>Quantifying the contribution of conservation easements to large-landscape conservation</article-title>. <source>Biological Conservation</source>, <volume>232</volume>:<fpage>83</fpage>&#x02013;<lpage>96</lpage>.</mixed-citation></ref><ref id="R30"><label>[30]</label><mixed-citation publication-type="journal"><name><surname>Haase</surname><given-names>P</given-names></name> (<year>1995</year>). <article-title>Spatial pattern analysis in ecology based on ripley&#x02019;s k-function: Introduction and methods of edge correction</article-title>. <source>Journal of Vegetation Science</source>, <volume>6</volume>(<issue>4</issue>):<fpage>575</fpage>&#x02013;<lpage>582</lpage>.</mixed-citation></ref><ref id="R31"><label>[31]</label><mixed-citation publication-type="journal"><name><surname>Haddad</surname><given-names>NM</given-names></name>, <name><surname>Brudvig</surname><given-names>LA</given-names></name>, <name><surname>Clobert</surname><given-names>J</given-names></name>, <name><surname>Davies</surname><given-names>KF</given-names></name>, <name><surname>Gonzalez</surname><given-names>A</given-names></name>, <name><surname>Holt</surname><given-names>RD</given-names></name>, <name><surname>Lovejoy</surname><given-names>TE</given-names></name>, <name><surname>Sexton</surname><given-names>JO</given-names></name>, <name><surname>Austin</surname><given-names>MP</given-names></name>, <name><surname>Collins</surname><given-names>CD</given-names></name>, <name><surname>Cook</surname><given-names>WM</given-names></name>, <name><surname>Damschen</surname><given-names>EI</given-names></name>, <name><surname>Ewers</surname><given-names>RM</given-names></name>, <name><surname>Foster</surname><given-names>BL</given-names></name>, <name><surname>Jenkins</surname><given-names>CN</given-names></name>, <name><surname>King</surname><given-names>AJ</given-names></name>, <name><surname>Laurance</surname><given-names>WF</given-names></name>, <name><surname>Levey</surname><given-names>DJ</given-names></name>, <name><surname>Margules</surname><given-names>CR</given-names></name>, <name><surname>Melbourne</surname><given-names>BA</given-names></name>, <name><surname>Nicholls</surname><given-names>AO</given-names></name>, <name><surname>Orrock</surname><given-names>JL</given-names></name>, <name><surname>Song</surname><given-names>D-X</given-names></name>, and <name><surname>Townshend</surname><given-names>JR</given-names></name> (<year>2015</year>). <article-title>Habitat fragmentation and its lasting impact on earth&#x02019;s ecosystems</article-title>. <source>Science Advances</source>, <volume>1</volume>(<issue>2</issue>):<fpage>e1500052</fpage>.<pub-id pub-id-type="pmid">26601154</pub-id></mixed-citation></ref><ref id="R32"><label>[32]</label><mixed-citation publication-type="journal"><name><surname>Hodgson</surname><given-names>JA</given-names></name>, <name><surname>Thomas</surname><given-names>CD</given-names></name>, <name><surname>Wintle</surname><given-names>BA</given-names></name>, and <name><surname>Moilanen</surname><given-names>A</given-names></name> (<year>2009</year>). <article-title>Climate change, connectivity and conservation decision making: back to basics</article-title>. <source>Journal of Applied Ecology</source>, <volume>46</volume>(<issue>5</issue>):<fpage>964</fpage>&#x02013;<lpage>969</lpage>.</mixed-citation></ref><ref id="R33"><label>[33]</label><mixed-citation publication-type="journal"><name><surname>Karunaweera</surname><given-names>ND</given-names></name>, <name><surname>Ginige</surname><given-names>S</given-names></name>, <name><surname>Senanayake</surname><given-names>S</given-names></name>, <name><surname>Silva</surname><given-names>H</given-names></name>, <name><surname>Manamperi</surname><given-names>N</given-names></name>, <name><surname>Samaranayake</surname><given-names>N</given-names></name>, <name><surname>Siriwardana</surname><given-names>Y</given-names></name>, <name><surname>Gamage</surname><given-names>D</given-names></name>, <name><surname>Senerath</surname><given-names>U</given-names></name>, and <name><surname>Zhou</surname><given-names>G</given-names></name> (<year>2020</year>). <article-title>Spatial epidemiologic trends and hotspots of leishmaniasis, sri lanka, 2001&#x02013;2018</article-title>. <source>Emerging infectious diseases</source>, <volume>26</volume>.</mixed-citation></ref><ref id="R34"><label>[34]</label><mixed-citation publication-type="journal"><name><surname>Kretser</surname><given-names>H</given-names></name>, <name><surname>Sullivan</surname><given-names>P</given-names></name>, and <name><surname>Knuth</surname><given-names>B</given-names></name> (<year>2008</year>). <article-title>Housing density as an indicator of spatial patterns of reported human-wildlife interactions in northern new york</article-title>. <source>Landscape and Urban Planning</source>, <volume>84</volume>:<fpage>282</fpage>&#x02013;<lpage>292</lpage>.</mixed-citation></ref><ref id="R35"><label>[35]</label><mixed-citation publication-type="journal"><name><surname>Kulldorff</surname><given-names>M</given-names></name> (<year>1997</year>). <article-title>A spatial scan statistic</article-title>. <source>Communications in Statistics - Theory and Methods</source>, <volume>26</volume>(<issue>6</issue>):<fpage>1481</fpage>&#x02013;<lpage>1496</lpage>.</mixed-citation></ref><ref id="R36"><label>[36]</label><mixed-citation publication-type="journal"><name><surname>Lamichhane</surname><given-names>S</given-names></name>, <name><surname>Sun</surname><given-names>C</given-names></name>, <name><surname>Gordon</surname><given-names>J</given-names></name>, <name><surname>Grado</surname><given-names>S</given-names></name>, and <name><surname>Poudel</surname><given-names>K</given-names></name> (<year>2021</year>). <article-title>Spatial dependence and determinants of conservation easement adoptions in the united states</article-title>. <source>Journal of Environmental Management</source>, <volume>296</volume>.</mixed-citation></ref><ref id="R37"><label>[37]</label><mixed-citation publication-type="journal"><name><surname>Law</surname><given-names>R</given-names></name>, <name><surname>Illian</surname><given-names>J</given-names></name>, <name><surname>Burslem</surname><given-names>DFRP</given-names></name>, <name><surname>Gratzer</surname><given-names>G</given-names></name>, <name><surname>Gunatilleke</surname><given-names>CVS</given-names></name>, and <name><surname>Gunatilleke</surname><given-names>IAUN</given-names></name> (<year>2009</year>). <article-title>Ecological information from spatial patterns of plants: Insights from point process theory</article-title>. <source>Journal of Ecology</source>, <volume>97</volume>(<issue>4</issue>):<fpage>616</fpage>&#x02013;<lpage>628</lpage></mixed-citation></ref><ref id="R38"><label>[38]</label><mixed-citation publication-type="journal"><name><surname>Lee</surname><given-names>S-K</given-names></name> and <name><surname>Lee</surname><given-names>B</given-names></name> (<year>2013</year>). <article-title>Assessing the appropriateness of the spatial distribution of standard lots using the l-index</article-title>. <source>Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography</source>, <volume>31</volume>(<issue>6.2</issue>):<fpage>601</fpage>&#x02013;<lpage>609</lpage>.</mixed-citation></ref><ref id="R39"><label>[39]</label><mixed-citation publication-type="journal"><name><surname>Loosmore</surname><given-names>NB</given-names></name> and <name><surname>Ford</surname><given-names>ED</given-names></name> (<year>2006</year>). <article-title>Statistical inference using the g or k point pattern spatial statistics</article-title>. <source>Ecology</source>, <volume>87</volume>(<issue>8</issue>):<fpage>1925</fpage>&#x02013;<lpage>1931</lpage>.<pub-id pub-id-type="pmid">16937629</pub-id></mixed-citation></ref><ref id="R40"><label>[40]</label><mixed-citation publication-type="journal"><name><surname>Marcon</surname><given-names>E</given-names></name> and <name><surname>Puech</surname><given-names>F</given-names></name> (<year>2003</year>). <article-title>Evaluating the geographic concentration of industries using distance-based methods</article-title>. <source>Journal of Economic Geography</source>, <volume>3</volume>(<issue>4</issue>):<fpage>409</fpage>&#x02013;<lpage>428</lpage>.</mixed-citation></ref><ref id="R41"><label>[41]</label><mixed-citation publication-type="journal"><name><surname>Marcon</surname><given-names>E</given-names></name> and <name><surname>Puech</surname><given-names>F</given-names></name> (<year>2017</year>). <article-title>A typology of distance-based measures of spatial concentration</article-title>. <source>Regional Science and Urban Economics</source>, <volume>62</volume>:<fpage>56</fpage>&#x02013;<lpage>67</lpage>.</mixed-citation></ref><ref id="R42"><label>[42]</label><mixed-citation publication-type="journal"><name><surname>Marcon</surname><given-names>E</given-names></name>, <name><surname>Puech</surname><given-names>F</given-names></name>, and <name><surname>Traissac</surname><given-names>S</given-names></name> (<year>2012</year>). <article-title>Characterizing the relative spatial structure of point patterns</article-title>. <source>International Journal of Ecology, 2012</source>.</mixed-citation></ref><ref id="R43"><label>[43]</label><mixed-citation publication-type="journal"><name><surname>Marj</surname><given-names>T</given-names></name> and <name><surname>Abellan</surname><given-names>A</given-names></name> (<year>2013</year>). <article-title>Rockfall detection from terrestrial LiDAR point clouds: A clustering approach using R</article-title>. <source>Journal of Spatial Information Science</source>, <volume>8</volume>.</mixed-citation></ref><ref id="R44"><label>[44]</label><mixed-citation publication-type="journal"><name><surname>McLaughlin</surname><given-names>N</given-names></name> and <name><surname>Weeks</surname><given-names>W</given-names></name> (<year>2009</year>). <article-title>In defense of conservation easements: A response to the end of perpetuity</article-title>. <source>Wyoming Law Review</source>, <volume>9</volume>:<fpage>1</fpage>&#x02013;<lpage>96</lpage>.</mixed-citation></ref><ref id="R45"><label>[45]</label><mixed-citation publication-type="journal"><name><surname>Moran</surname><given-names>PAP</given-names></name> (<year>1950</year>). <article-title>Notes on continuous stochastic phenomena</article-title>. <source>Biometrika</source>, <volume>37</volume>(<issue>1/2</issue>):<fpage>17</fpage>&#x02013;<lpage>23</lpage>.<pub-id pub-id-type="pmid">15420245</pub-id></mixed-citation></ref><ref id="R46"><label>[46]</label><mixed-citation publication-type="journal"><name><surname>Paradis</surname><given-names>E</given-names></name> and <name><surname>Schliep</surname><given-names>K</given-names></name> (<year>2019</year>). <article-title>ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R</article-title>. <source>Bioinformatics</source>, <volume>35</volume>:<fpage>526</fpage>&#x02013;<lpage>528</lpage>.<pub-id pub-id-type="pmid">30016406</pub-id></mixed-citation></ref><ref id="R47"><label>[47]</label><mixed-citation publication-type="journal"><name><surname>Penttinen</surname><given-names>A</given-names></name>, <name><surname>Stoyan</surname><given-names>D</given-names></name>, and <name><surname>Henttonen</surname><given-names>HM</given-names></name> (<year>1992</year>). <article-title>Marked Point Processes in Forest Statistics</article-title>. <source>Forest Science</source>, <volume>38</volume>(<issue>4</issue>):<fpage>806</fpage>&#x02013;<lpage>824</lpage>.</mixed-citation></ref><ref id="R48"><label>[48]</label><mixed-citation publication-type="journal"><name><surname>Qiao</surname><given-names>L</given-names></name>, <name><surname>Huang</surname><given-names>H</given-names></name>, and <name><surname>Tian</surname><given-names>Y</given-names></name> (<year>2019</year>). <article-title>The identification and use efficiency evaluation of urban industrial land based on multi-source data</article-title>. <source>Sustainability</source>, <volume>11</volume>(<issue>21</issue>).</mixed-citation></ref><ref id="R49"><label>[49]</label><mixed-citation publication-type="journal"><name><surname>Ripley</surname><given-names>B</given-names></name> (<year>1976</year>). <article-title>The second-order analysis of stationary point processes</article-title>. <source>Journal of Applied Probability</source>, <volume>13</volume>(<issue>2</issue>):<fpage>255</fpage>&#x02013;<lpage>266</lpage>.</mixed-citation></ref><ref id="R50"><label>[50]</label><mixed-citation publication-type="journal"><name><surname>Ripley</surname><given-names>B</given-names></name> (<year>1977</year>). <article-title>Modelling spatial patterns</article-title>. <source>Journal of the Royal Statistical Society. Series B (Methodological)</source>, <volume>39</volume>(<issue>2</issue>):<fpage>172</fpage>&#x02013;<lpage>212</lpage>.</mixed-citation></ref><ref id="R51"><label>[51]</label><mixed-citation publication-type="book"><name><surname>Ripley</surname><given-names>B</given-names></name> (<year>1981</year>). <source>Spatial Statistics</source>. <publisher-name>Wiley</publisher-name>, <publisher-loc>New York, NY</publisher-loc>.</mixed-citation></ref><ref id="R52"><label>[52]</label><mixed-citation publication-type="journal"><name><surname>Siordia</surname><given-names>C</given-names></name> (<year>2013</year>). <article-title>Benefits of small area measurements: a spatial clustering analysis on medicare beneficiaries in the usa</article-title>. <source>Human Geographies - Journal of Studies and Research in Human Geography</source>, <volume>7</volume>(<issue>&#x02018;</issue>):<fpage>53</fpage>&#x02013;<lpage>59</lpage>.</mixed-citation></ref><ref id="R53"><label>[53]</label><mixed-citation publication-type="journal"><name><surname>Skog</surname><given-names>L</given-names></name>, <name><surname>Linde</surname><given-names>A</given-names></name>, <name><surname>Palmgren</surname><given-names>H</given-names></name>, <name><surname>Hauska</surname><given-names>H</given-names></name>, and <name><surname>Elgh</surname><given-names>F</given-names></name> (<year>2014</year>). <article-title>Spatiotemporal characteristics of pandemic influenza</article-title>. <source>BMC Infectious Diseases</source>, <volume>14</volume>.</mixed-citation></ref><ref id="R54"><label>[54]</label><mixed-citation publication-type="journal"><name><surname>Tischendorf</surname><given-names>L</given-names></name> and <name><surname>Fahrig</surname><given-names>L</given-names></name> (<year>2000</year>). <article-title>On the usage and measurement of landscape connectivity</article-title>. <source>Oikos</source>, <volume>90</volume>(<issue>1</issue>):<fpage>7</fpage>&#x02013;<lpage>19</lpage>.</mixed-citation></ref><ref id="R55"><label>[55]</label><mixed-citation publication-type="journal"><name><surname>Wade</surname><given-names>BJ</given-names></name> (<year>2014</year>). <article-title>Spatial analysis of global prevalence of multiple sclerosis suggests need for an updated prevalence scale</article-title>. <source>Multiple Sclerosis International, 2014</source>.</mixed-citation></ref><ref id="R56"><label>[56]</label><mixed-citation publication-type="journal"><name><surname>Zgodic</surname><given-names>A</given-names></name>, <name><surname>Eberth</surname><given-names>JM</given-names></name>, <name><surname>Breneman</surname><given-names>CB</given-names></name>, <name><surname>Wende</surname><given-names>ME</given-names></name>, <name><surname>Kaczynski</surname><given-names>AT</given-names></name>, <name><surname>Liese</surname><given-names>AD</given-names></name>, and <name><surname>McLain</surname><given-names>AC</given-names></name> (<year>2021</year>). <article-title>Estimates of Childhood Overweight and Obesity at the Region, State, and County Levels: A Multilevel Small-Area Estimation Approach</article-title>. <source>American Journal of Epidemiology</source>. <fpage>kwab176</fpage>.</mixed-citation></ref><ref id="R57"><label>[57]</label><mixed-citation publication-type="journal"><name><surname>Zipp</surname><given-names>KY</given-names></name>, <name><surname>Lewis</surname><given-names>DJ</given-names></name>, and <name><surname>Provencher</surname><given-names>B</given-names></name> (<year>2017</year>). <article-title>Does the conservation of land reduce development? an econometric-based landscape simulation with land market feedbacks</article-title>. <source>Journal of Environmental Economics and Management</source>, <volume>81</volume>:<fpage>19</fpage>&#x02013;<lpage>37</lpage>.</mixed-citation></ref></ref-list></back><floats-group><fig position="float" id="F1"><label>Figure 1:</label><caption><p id="P94">An illustration of <inline-formula><mml:math id="M419" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msubsup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> for a realization of an areal process on a 20&#x000d7;20 regular grid. Positive areal units (i.e. units for which <inline-formula><mml:math id="M420" display="inline"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> are shown in color. The area of the shaded red region is equal to <inline-formula><mml:math id="M421" display="inline"><mml:mi>&#x1d4a9;</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msubsup><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi>A</mml:mi><mml:mfenced open="[" close="]" separators="|"><mml:mrow><mml:mi>c</mml:mi><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="bold-italic">&#x02113;</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfenced><mml:mspace width="0.25em"/><mml:mo>&#x02229;</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, where <inline-formula><mml:math id="M422" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the grid cell in the 12th row and 5 th column.</p></caption><graphic xlink:href="nihms-1902698-f0001" position="float"/></fig><fig position="float" id="F2"><label>Figure 2:</label><caption><p id="P95">The 2 study areas considered in the simulation study. The 10 radii at which the positive area proportion function, Ripley&#x02019;s K-function and Ripley&#x02019;s D-function are evaluated are shown for a single location in blue.</p></caption><graphic xlink:href="nihms-1902698-f0002" position="float"/></fig><fig position="float" id="F3"><label>Figure 3:</label><caption><p id="P96">The top row illustrates the clustered regions for DGMs <inline-formula><mml:math id="M423" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> for study areas <inline-formula><mml:math id="M424" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (top left), and <inline-formula><mml:math id="M425" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (top right). Blue units are <inline-formula><mml:math id="M426" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> times more likely to be selected than white units under spatial dependence configurations <inline-formula><mml:math id="M427" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>. The bottom row illustrates the clustered <inline-formula><mml:math id="M428" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>, dispersed <inline-formula><mml:math id="M429" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> and random scatter regions <inline-formula><mml:math id="M430" display="inline"><mml:mfenced separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula> for DGMs <inline-formula><mml:math id="M431" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M432" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> for <inline-formula><mml:math id="M433" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (bottom left) and <inline-formula><mml:math id="M434" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, (bottom right). Red denotes <inline-formula><mml:math id="M435" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>c</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, blue denotes <inline-formula><mml:math id="M436" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and white denotes <inline-formula><mml:math id="M437" display="inline"><mml:msub><mml:mrow><mml:mi>R</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>.</p></caption><graphic xlink:href="nihms-1902698-f0003" position="float"/></fig><fig position="float" id="F4"><label>Figure 4:</label><caption><p id="P97">Examples of observed units generated under each scenario for study area <inline-formula><mml:math id="M438" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>. The first row displays examples of data generated under the null hypothesis of equal probability sampling without replacement (left to right: <inline-formula><mml:math id="M439" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>). The second row displays examples of data generated with excess clustering (left to right <inline-formula><mml:math id="M440" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>). The third row displays examples of data generated with autocorrelated clustering (left to right <inline-formula><mml:math id="M441" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>9</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:mfenced open="" separators="|"><mml:mrow><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:math></inline-formula>. The fourth row displays examples of data generated with dispersion or a mixture of clustering and dispersion (left to right <inline-formula><mml:math id="M442" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.25em"/><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>).</p></caption><graphic xlink:href="nihms-1902698-f0004" position="float"/></fig><fig position="float" id="F5"><label>Figure 5:</label><caption><p id="P98">The top pane displays the 112,819 land parcels in Boulder County, Colorado in 2008. Parcels held as a CE are shown in blue. The radii at which the PAPF method was evaluated are shown in red. Note that areas with many small parcels appear black. The bottom pane displays the 3,108 counties in the contiguous US. Counties with a high rate of childhood overweight/obesity are shown in blue. The radii at which the PAPF method was evaluated are shown in red.</p></caption><graphic xlink:href="nihms-1902698-f0005" position="float"/></fig><table-wrap position="float" id="T1"><label>Table 1:</label><caption><p id="P99">The table provides an overview of the spatial clustering assessment methods considered in the simulation study: the positive area proportion function (PAPF), the global Moran&#x02019;s I statistic (MI), the Getis-Ord general G statistic (GG), the spatial scan statistic (SSS), Ripley&#x02019;s K-function (RK), Ripley&#x02019;s D-function (RD) and the average nearest neighbor method (ANN). The null and alternative hypotheses (both left- and right-tailed) are specified for each method, along with the Monte Carlo procedure used to estimate the null distribution, if applicable.</p></caption><table frame="hsides" rules="rows"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1">Method</th><th align="left" valign="bottom" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M443" display="inline"><mml:msub><mml:mrow><mml:mi mathvariant="normal">H</mml:mi></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th align="left" valign="top" rowspan="1" colspan="1"><inline-formula><mml:math id="M444" display="inline"><mml:msub><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (Left-tailed)</th><th align="left" valign="top" rowspan="1" colspan="1"><inline-formula><mml:math id="M445" display="inline"><mml:msub><mml:mrow><mml:mi>H</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (Right-tailed)</th><th align="left" valign="top" rowspan="1" colspan="1">Monte Carlo Procedure</th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">PAPF</td><td align="left" valign="top" rowspan="1" colspan="1">Random labeling (conditional on <italic toggle="yes">n</italic>)</td><td align="left" valign="top" rowspan="1" colspan="1">Dispersion</td><td align="left" valign="top" rowspan="1" colspan="1">Clustering</td><td align="left" valign="top" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M446" display="inline"><mml:mi mathvariant="bold">Y</mml:mi><mml:mo>~</mml:mo><mml:mi mathvariant="normal">S</mml:mi><mml:mi mathvariant="normal">W</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo></mml:math>
</inline-formula>
<break/>
<inline-formula>
<mml:math id="M447" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">P</mml:mi><mml:mo>=</mml:mo><mml:mfenced><mml:mrow><mml:msup><mml:mi>N</mml:mi><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mi>N</mml:mi><mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math>
</inline-formula>
</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">MI</td><td align="left" valign="top" rowspan="1" colspan="1">Random labeling</td><td align="left" valign="top" rowspan="1" colspan="1">Negative spatial autocorrelation</td><td align="left" valign="top" rowspan="1" colspan="1">Positive spatial autocorrelation</td><td align="left" valign="top" rowspan="1" colspan="1">NA</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">GG</td><td align="left" valign="top" rowspan="1" colspan="1">Random labeling</td><td align="left" valign="top" rowspan="1" colspan="1">Low-low clustering</td><td align="left" valign="top" rowspan="1" colspan="1">High-high clustering</td><td align="left" valign="top" rowspan="1" colspan="1">NA</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">SSS</td><td align="left" valign="top" rowspan="1" colspan="1"><inline-formula><mml:math id="M448" display="inline"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>&#x02019;s are iid Bernoulli(<italic toggle="yes">p</italic>)</td><td colspan="2" align="left" valign="bottom" rowspan="1">There exists a set <inline-formula><mml:math id="M449" display="inline"><mml:mi>&#x02110;</mml:mi></mml:math></inline-formula> indexing a (contiguous) cluster of <inline-formula><mml:math id="M450" display="inline"><mml:msub><mml:mrow><mml:mi>a</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mtext>s</mml:mtext></mml:math></inline-formula> such that <inline-formula><mml:math id="M451" display="inline"><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>&#x02019;s are iid Bernoulli(<inline-formula><mml:math id="M452" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>) for <inline-formula><mml:math id="M453" display="inline"><mml:mi>i</mml:mi><mml:mo>&#x02208;</mml:mo><mml:mi>&#x02110;</mml:mi></mml:math></inline-formula> and iid Bernoulli(<inline-formula><mml:math id="M454" display="inline"><mml:msub><mml:mrow><mml:mi>p</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>) for <inline-formula><mml:math id="M455" display="inline"><mml:mi>i</mml:mi><mml:mo>&#x02209;</mml:mo><mml:mrow><mml:mi>&#x02110;</mml:mi></mml:mrow></mml:math></inline-formula>
<xref rid="TFN1" ref-type="table-fn">*</xref></td><td align="left" valign="top" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M456" display="inline"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>~</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="normal">SWoR</mml:mi></mml:mrow><mml:mo>&#x02061;</mml:mo><mml:mrow><mml:mfenced separators="|"><mml:mrow><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mrow><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>=</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mtd></mml:mtr></mml:mtable></mml:math>
</inline-formula>
</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">RK</td><td align="left" valign="top" rowspan="1" colspan="1">centroids arise from HPP<sup><xref rid="TFN2" ref-type="table-fn">&#x02020;</xref></sup></td><td align="left" valign="top" rowspan="1" colspan="1">centroids dispersed</td><td align="left" valign="top" rowspan="1" colspan="1">centroids clustered</td><td align="left" valign="top" rowspan="1" colspan="1">Generate <inline-formula><mml:math id="M457" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> points from a continuous uniform distribution on <inline-formula><mml:math id="M458" display="inline"><mml:mi>&#x1d49c;</mml:mi></mml:math></inline-formula></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">RD</td><td align="left" valign="top" rowspan="1" colspan="1">Random labeling</td><td align="left" valign="top" rowspan="1" colspan="1">Controls more clustered than cases</td><td align="left" valign="top" rowspan="1" colspan="1">Cases more clustered than controls</td><td align="left" valign="bottom" rowspan="1" colspan="1">Select cases via <inline-formula><mml:math id="M459" display="inline"><mml:mi mathvariant="normal">SWoR</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">n</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mspace linebreak="newline"/><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>=</mml:mo><mml:mfenced separators="|"><mml:mrow><mml:msup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:math></inline-formula></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">ANN</td><td align="left" valign="top" rowspan="1" colspan="1">centroids arise from HPP<sup><xref rid="TFN2" ref-type="table-fn">&#x02020;</xref></sup></td><td align="left" valign="top" rowspan="1" colspan="1">centroids clustered</td><td align="left" valign="top" rowspan="1" colspan="1">centroids dispersed</td><td align="left" valign="top" rowspan="1" colspan="1">NA</td></tr></tbody></table><table-wrap-foot><fn id="TFN1"><label>*</label><p id="P100">For left-tailed test <inline-formula><mml:math id="M460" display="inline"><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> for right-tailed test, <inline-formula><mml:math id="M461" display="inline"><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>&#x0003e;</mml:mo><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula></p></fn><fn id="TFN2"><label>&#x02020;</label><p id="P101">HPP: homogeneous Poisson process</p></fn></table-wrap-foot></table-wrap><table-wrap position="float" id="T2" orientation="landscape"><label>Table 2:</label><caption><p id="P102">Simulation study results for study area <inline-formula><mml:math id="M462" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> Journal Pre-proof A1 (the regular grid). Results displayed include the empirical rejection rate (ERR) of the positive area proportion function (PAPF), the global Moran&#x02019;s I statistic (MI), the Getis-Ord general G statistic (GG), the spatial scan statistic method (SSS), Ripley&#x02019;s K-function (RK), Ripley&#x02019;s D-function (RD) and the average nearest neighbor method (ANN). For DGMs <inline-formula><mml:math id="M463" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, single-tailed test indicative of clustering are denoted with a C, while dispersion tests are denoted with a D. All tests were conducted at a level of <inline-formula><mml:math id="M464" display="inline"><mml:mi>&#x003b1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula>.</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th rowspan="2" align="center" valign="middle" colspan="1">DGM</th><th rowspan="2" align="center" valign="middle" colspan="1">Method</th><th rowspan="2" align="center" valign="middle" colspan="1">ERR</th><th rowspan="2" align="center" valign="middle" colspan="1">Method</th><th rowspan="2" align="center" valign="bottom" colspan="1">Global</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M465" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M466" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M467" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th colspan="2" align="center" valign="bottom" rowspan="1">ERR<hr/></th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M468" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M469" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M470" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M471" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>9</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M472" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th></tr><tr><th align="center" valign="middle" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M473" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th align="center" valign="middle" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M474" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th></tr><tr><th colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td rowspan="4" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M475" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">ANN</td><td align="left" valign="middle" rowspan="1" colspan="1">97.5</td><td align="left" valign="middle" rowspan="1" colspan="1">RK</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">25.8</td><td align="left" valign="middle" rowspan="1" colspan="1">4.8</td><td align="left" valign="middle" rowspan="1" colspan="1">18.6</td><td align="left" valign="middle" rowspan="1" colspan="1">11.2</td><td align="left" valign="middle" rowspan="1" colspan="1">13.8</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">7.4</td><td align="left" valign="middle" rowspan="1" colspan="1">4.0</td><td align="left" valign="middle" rowspan="1" colspan="1">1.8</td><td align="left" valign="middle" rowspan="1" colspan="1">18.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">3.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">7.6</td><td align="left" valign="middle" rowspan="1" colspan="1">7.6</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">7.6</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.6</td><td align="left" valign="middle" rowspan="1" colspan="1">10.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">5.8</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">4.2</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.8</td><td align="left" valign="middle" rowspan="1" colspan="1">4.4</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.2</td><td align="left" valign="middle" rowspan="1" colspan="1">3.4</td><td align="left" valign="middle" rowspan="1" colspan="1">3.2</td><td align="left" valign="middle" rowspan="1" colspan="1">3.0</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">4.8</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td rowspan="4" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M476" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">ANN</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RK</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.2</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">75.6</td><td align="left" valign="middle" rowspan="1" colspan="1">18.6</td><td align="left" valign="middle" rowspan="1" colspan="1">99.8</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">2.8</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">4.4</td><td align="left" valign="middle" rowspan="1" colspan="1">10.0</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.8</td><td align="left" valign="middle" rowspan="1" colspan="1">4.4</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">5.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">7.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.8</td><td align="left" valign="middle" rowspan="1" colspan="1">5.4</td><td align="left" valign="middle" rowspan="1" colspan="1">8.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.4</td><td align="left" valign="middle" rowspan="1" colspan="1">5.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td><td align="left" valign="middle" rowspan="1" colspan="1">4.6</td><td align="left" valign="middle" rowspan="1" colspan="1">4.8</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M477" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">99.8</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">97.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">91.6</td><td align="left" valign="middle" rowspan="1" colspan="1">95.2</td><td align="left" valign="middle" rowspan="1" colspan="1">99.0</td><td align="left" valign="middle" rowspan="1" colspan="1">99.6</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">96.4</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M478" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">0.4</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">1.4</td><td align="left" valign="middle" rowspan="1" colspan="1">1.4</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.2</td><td align="left" valign="middle" rowspan="1" colspan="1">1.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.4</td><td align="left" valign="middle" rowspan="1" colspan="1">0.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M479" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">78.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">91.6</td><td align="left" valign="middle" rowspan="1" colspan="1">37.2</td><td align="left" valign="middle" rowspan="1" colspan="1">93.4</td><td align="left" valign="middle" rowspan="1" colspan="1">93.4</td><td align="left" valign="middle" rowspan="1" colspan="1">91.6</td><td align="left" valign="middle" rowspan="1" colspan="1">91.6</td><td align="left" valign="middle" rowspan="1" colspan="1">79.9</td><td align="left" valign="middle" rowspan="1" colspan="1">72.2</td><td align="left" valign="middle" rowspan="1" colspan="1">60.2</td><td align="left" valign="middle" rowspan="1" colspan="1">54.4</td><td align="left" valign="middle" rowspan="1" colspan="1">53.2</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">97.6</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">93.6</td><td align="left" valign="middle" rowspan="1" colspan="1">91.8</td><td align="left" valign="middle" rowspan="1" colspan="1">95.8</td><td align="left" valign="middle" rowspan="1" colspan="1">97.6</td><td align="left" valign="middle" rowspan="1" colspan="1">96.0</td><td align="left" valign="middle" rowspan="1" colspan="1">92.2</td><td align="left" valign="middle" rowspan="1" colspan="1">87.2</td><td align="left" valign="middle" rowspan="1" colspan="1">75.0</td><td align="left" valign="middle" rowspan="1" colspan="1">65.8</td><td align="left" valign="middle" rowspan="1" colspan="1">60.8</td><td align="left" valign="middle" rowspan="1" colspan="1">56.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">97.2</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M480" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">93.6</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">73.6</td><td align="left" valign="middle" rowspan="1" colspan="1">20.6</td><td align="left" valign="middle" rowspan="1" colspan="1">83.2</td><td align="left" valign="middle" rowspan="1" colspan="1">83.2</td><td align="left" valign="middle" rowspan="1" colspan="1">67.2</td><td align="left" valign="middle" rowspan="1" colspan="1">67.2</td><td align="left" valign="middle" rowspan="1" colspan="1">39.0</td><td align="left" valign="middle" rowspan="1" colspan="1">53.6</td><td align="left" valign="middle" rowspan="1" colspan="1">41.8</td><td align="left" valign="middle" rowspan="1" colspan="1">38.4</td><td align="left" valign="middle" rowspan="1" colspan="1">34.2</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">99.6</td><td align="left" valign="middle" rowspan="1" colspan="1">96.6</td><td align="left" valign="middle" rowspan="1" colspan="1">90.0</td><td align="left" valign="middle" rowspan="1" colspan="1">86.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="2" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M481" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.6</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">77.4</td><td align="left" valign="middle" rowspan="1" colspan="1">77.4</td><td align="left" valign="middle" rowspan="1" colspan="1">56.6</td><td align="left" valign="middle" rowspan="1" colspan="1">54.2</td><td align="left" valign="middle" rowspan="1" colspan="1">25.4</td><td align="left" valign="middle" rowspan="1" colspan="1">24.2</td><td align="left" valign="middle" rowspan="1" colspan="1">28.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">98.0</td><td align="left" valign="middle" rowspan="1" colspan="1">69.8</td><td align="left" valign="middle" rowspan="1" colspan="1">52.8</td><td align="left" valign="middle" rowspan="1" colspan="1">41.2</td><td align="left" valign="middle" rowspan="1" colspan="1">30.0</td><td align="left" valign="middle" rowspan="1" colspan="1">32.6</td><td align="left" valign="middle" rowspan="1" colspan="1">24.4</td></tr><tr><td rowspan="2" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M482" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">82.2</td><td align="left" valign="middle" rowspan="1" colspan="1">82.2</td><td align="left" valign="middle" rowspan="1" colspan="1">53.6</td><td align="left" valign="middle" rowspan="1" colspan="1">82.4</td><td align="left" valign="middle" rowspan="1" colspan="1">61.6</td><td align="left" valign="middle" rowspan="1" colspan="1">57.8</td><td align="left" valign="middle" rowspan="1" colspan="1">30.8</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">98.4</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">98.6</td><td align="left" valign="middle" rowspan="1" colspan="1">97.4</td><td align="left" valign="middle" rowspan="1" colspan="1">92.0</td><td align="left" valign="middle" rowspan="1" colspan="1">91.8</td></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="4" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M483" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">MI D</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD D</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">4.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI C</td><td align="left" valign="middle" rowspan="1" colspan="1">30.6</td><td align="left" valign="middle" rowspan="1" colspan="1">RD C</td><td align="left" valign="middle" rowspan="1" colspan="1">98.8</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">42.6</td><td align="left" valign="middle" rowspan="1" colspan="1">42.6</td><td align="left" valign="middle" rowspan="1" colspan="1">99.4</td><td align="left" valign="middle" rowspan="1" colspan="1">99.4</td><td align="left" valign="middle" rowspan="1" colspan="1">99.2</td><td align="left" valign="middle" rowspan="1" colspan="1">99.0</td><td align="left" valign="middle" rowspan="1" colspan="1">98.0</td><td align="left" valign="middle" rowspan="1" colspan="1">86.8</td><td align="left" valign="middle" rowspan="1" colspan="1">73.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">SSS c</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF D</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG C</td><td align="left" valign="middle" rowspan="1" colspan="1">15.6</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF C</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">28.8</td><td align="left" valign="middle" rowspan="1" colspan="1">33.8</td><td align="left" valign="middle" rowspan="1" colspan="1">72.8</td><td align="left" valign="middle" rowspan="1" colspan="1">97.2</td><td align="left" valign="middle" rowspan="1" colspan="1">99.8</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">99.4</td><td align="left" valign="middle" rowspan="1" colspan="1">97.0</td></tr></tbody></table></table-wrap><table-wrap position="float" id="T3" orientation="landscape"><label>Table 3:</label><caption><p id="P103">Simulation study results for study area <inline-formula><mml:math id="M484" display="inline"><mml:msub><mml:mrow><mml:mi>&#x1d49c;</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> (the US counties). Results displayed include the empirical rejection rate of the positive are proportion function (PAPF), the global Moran&#x02019;s I statistic (MI), the Getis-Ord general G statistic (GG), the spatial scan statistic method (SSS) Ripley&#x02019;s K-function (RK), Ripley&#x02019;s D-function (RD) and the average nearest neighbor method (ANN). For DGMs <inline-formula><mml:math id="M485" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, single-tailed test indicative of clustering are denoted with a C, while dispersion tests are denoted with a D. All tests were conducted at a level of <inline-formula><mml:math id="M486" display="inline"><mml:mi>&#x003b1;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula>.</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th rowspan="2" align="left" valign="middle" colspan="1">DGM</th><th rowspan="2" align="left" valign="middle" colspan="1">Method</th><th rowspan="2" align="left" valign="middle" colspan="1">ERR</th><th rowspan="2" align="left" valign="middle" colspan="1">Method</th><th rowspan="2" align="left" valign="bottom" colspan="1">Global</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M487" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M488" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M489" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th colspan="2" align="center" valign="bottom" rowspan="1">ERR<hr/></th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M490" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M491" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>7</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M492" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M493" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>9</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th rowspan="2" align="center" valign="bottom" colspan="1">
<inline-formula>
<mml:math id="M494" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th></tr><tr><th align="center" valign="middle" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M495" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th><th align="center" valign="middle" rowspan="1" colspan="1">
<inline-formula>
<mml:math id="M496" display="inline"><mml:msub><mml:mrow><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mn>5</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</th></tr><tr><th colspan="15" align="left" valign="top" rowspan="1">
<hr/>
</th></tr></thead><tbody><tr><td rowspan="4" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M497" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">ANN</td><td align="left" valign="middle" rowspan="1" colspan="1">26.8</td><td align="left" valign="middle" rowspan="1" colspan="1">RK</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">5.2</td><td align="left" valign="middle" rowspan="1" colspan="1">0.8</td><td align="left" valign="middle" rowspan="1" colspan="1">5.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.8</td><td align="left" valign="middle" rowspan="1" colspan="1">5.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.6</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td><td align="left" valign="middle" rowspan="1" colspan="1">6.8</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">3.8</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">7.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.2</td><td align="left" valign="middle" rowspan="1" colspan="1">4.8</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">7.2</td><td align="left" valign="middle" rowspan="1" colspan="1">8.6</td><td align="left" valign="middle" rowspan="1" colspan="1">6.8</td><td align="left" valign="middle" rowspan="1" colspan="1">6.8</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.2</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">4.2</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td rowspan="4" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M498" display="inline"><mml:msub><mml:mrow><mml:mi>I</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">ANN</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RK</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">76.4</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">2.8</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">2.8</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">4.2</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.2</td><td align="left" valign="middle" rowspan="1" colspan="1">4.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.4</td><td align="left" valign="middle" rowspan="1" colspan="1">2.6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.8</td><td align="left" valign="middle" rowspan="1" colspan="1">2.6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.8</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">7.2</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">6.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.8</td><td align="left" valign="middle" rowspan="1" colspan="1">5.2</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.2</td><td align="left" valign="middle" rowspan="1" colspan="1">5.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.4</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M499" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">99.4</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M500" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>6</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M501" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>8</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">87.8</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">98.2</td><td align="left" valign="middle" rowspan="1" colspan="1">1.8</td><td align="left" valign="middle" rowspan="1" colspan="1">99.6</td><td align="left" valign="middle" rowspan="1" colspan="1">67.2</td><td align="left" valign="middle" rowspan="1" colspan="1">31.2</td><td align="left" valign="middle" rowspan="1" colspan="1">24.8</td><td align="left" valign="middle" rowspan="1" colspan="1">22</td><td align="left" valign="middle" rowspan="1" colspan="1">16.4</td><td align="left" valign="middle" rowspan="1" colspan="1">15.4</td><td align="left" valign="middle" rowspan="1" colspan="1">16.2</td><td align="left" valign="middle" rowspan="1" colspan="1">16.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">11.4</td><td align="left" valign="middle" rowspan="1" colspan="1">99.0</td><td align="left" valign="middle" rowspan="1" colspan="1">78</td><td align="left" valign="middle" rowspan="1" colspan="1">42.0</td><td align="left" valign="middle" rowspan="1" colspan="1">32.0</td><td align="left" valign="middle" rowspan="1" colspan="1">24.0</td><td align="left" valign="middle" rowspan="1" colspan="1">22.2</td><td align="left" valign="middle" rowspan="1" colspan="1">20.6</td><td align="left" valign="middle" rowspan="1" colspan="1">18.2</td><td align="left" valign="middle" rowspan="1" colspan="1">16.2</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td rowspan="3" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M502" display="inline"><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">98.6</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">23.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">16.6</td><td align="left" valign="middle" rowspan="1" colspan="1">19.8</td><td align="left" valign="middle" rowspan="1" colspan="1">17.2</td><td align="left" valign="middle" rowspan="1" colspan="1">17.6</td><td align="left" valign="middle" rowspan="1" colspan="1">19.6</td><td align="left" valign="middle" rowspan="1" colspan="1">20.4</td><td align="left" valign="middle" rowspan="1" colspan="1">18.6</td><td align="left" valign="middle" rowspan="1" colspan="1">18.6</td><td align="left" valign="middle" rowspan="1" colspan="1">19.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">97.2</td><td align="left" valign="middle" rowspan="1" colspan="1">8.0</td><td align="left" valign="middle" rowspan="1" colspan="1">98.2</td><td align="left" valign="middle" rowspan="1" colspan="1">57.2</td><td align="left" valign="middle" rowspan="1" colspan="1">36.2</td><td align="left" valign="middle" rowspan="1" colspan="1">28.0</td><td align="left" valign="middle" rowspan="1" colspan="1">25.4</td><td align="left" valign="middle" rowspan="1" colspan="1">24.0</td><td align="left" valign="middle" rowspan="1" colspan="1">22.6</td><td align="left" valign="middle" rowspan="1" colspan="1">21.6</td><td align="left" valign="middle" rowspan="1" colspan="1">21.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="2" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M503" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">93.8</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">98.0</td><td align="left" valign="middle" rowspan="1" colspan="1">39.8</td><td align="left" valign="middle" rowspan="1" colspan="1">15.6</td><td align="left" valign="middle" rowspan="1" colspan="1">5.4</td><td align="left" valign="middle" rowspan="1" colspan="1">4.2</td><td align="left" valign="middle" rowspan="1" colspan="1">3.8</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">98.8</td><td align="left" valign="middle" rowspan="1" colspan="1">1.2</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">40.8</td><td align="left" valign="middle" rowspan="1" colspan="1">17.2</td><td align="left" valign="middle" rowspan="1" colspan="1">7.0</td><td align="left" valign="middle" rowspan="1" colspan="1">6.8</td><td align="left" valign="middle" rowspan="1" colspan="1">5.4</td><td align="left" valign="middle" rowspan="1" colspan="1">3.8</td><td align="left" valign="middle" rowspan="1" colspan="1">2.6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.4</td></tr><tr><td rowspan="2" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M504" display="inline"><mml:msub><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">MI</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD</td><td align="left" valign="middle" rowspan="1" colspan="1">99.2</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">99.2</td><td align="left" valign="middle" rowspan="1" colspan="1">55</td><td align="left" valign="middle" rowspan="1" colspan="1">14.4</td><td align="left" valign="middle" rowspan="1" colspan="1">8.4</td><td align="left" valign="middle" rowspan="1" colspan="1">8.0</td><td align="left" valign="middle" rowspan="1" colspan="1">5.2</td><td align="left" valign="middle" rowspan="1" colspan="1">4.8</td><td align="left" valign="middle" rowspan="1" colspan="1">4.6</td><td align="left" valign="middle" rowspan="1" colspan="1">4.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1"/><td align="left" valign="middle" rowspan="1" colspan="1">PAPF</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">64.0</td><td align="left" valign="middle" rowspan="1" colspan="1">22.0</td><td align="left" valign="middle" rowspan="1" colspan="1">11.6</td><td align="left" valign="middle" rowspan="1" colspan="1">6.0</td><td align="left" valign="middle" rowspan="1" colspan="1">3.4</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">2.6</td><td align="left" valign="middle" rowspan="1" colspan="1">1.2</td></tr><tr><td colspan="15" align="left" valign="middle" rowspan="1">
<hr/>
</td></tr><tr><td rowspan="4" align="left" valign="middle" colspan="1">
<inline-formula>
<mml:math id="M505" display="inline"><mml:msub><mml:mrow><mml:mi>M</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math>
</inline-formula>
</td><td align="left" valign="middle" rowspan="1" colspan="1">MI D</td><td align="left" valign="middle" rowspan="1" colspan="1">30.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD D</td><td align="left" valign="middle" rowspan="1" colspan="1">20.4</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.4</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.2</td><td align="left" valign="middle" rowspan="1" colspan="1">0.8</td><td align="left" valign="middle" rowspan="1" colspan="1">1.8</td><td align="left" valign="middle" rowspan="1" colspan="1">4.2</td><td align="left" valign="middle" rowspan="1" colspan="1">14.6</td><td align="left" valign="middle" rowspan="1" colspan="1">40.2</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">MI C</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">RD C</td><td align="left" valign="middle" rowspan="1" colspan="1">4.8</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">3.2</td><td align="left" valign="middle" rowspan="1" colspan="1">31.4</td><td align="left" valign="middle" rowspan="1" colspan="1">9.8</td><td align="left" valign="middle" rowspan="1" colspan="1">2</td><td align="left" valign="middle" rowspan="1" colspan="1">0.4</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">SSS</td><td align="left" valign="middle" rowspan="1" colspan="1">100.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF D</td><td align="left" valign="middle" rowspan="1" colspan="1">87.0</td><td align="left" valign="middle" rowspan="1" colspan="1">90.8</td><td align="left" valign="middle" rowspan="1" colspan="1">14.8</td><td align="left" valign="middle" rowspan="1" colspan="1">0.2</td><td align="left" valign="middle" rowspan="1" colspan="1">0.6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.0</td><td align="left" valign="middle" rowspan="1" colspan="1">4.6</td><td align="left" valign="middle" rowspan="1" colspan="1">8.4</td><td align="left" valign="middle" rowspan="1" colspan="1">14.4</td><td align="left" valign="middle" rowspan="1" colspan="1">25.6</td><td align="left" valign="middle" rowspan="1" colspan="1">36.2</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">GG</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">PAPF C</td><td align="left" valign="middle" rowspan="1" colspan="1">24.8</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td><td align="left" valign="middle" rowspan="1" colspan="1">9.0</td><td align="left" valign="middle" rowspan="1" colspan="1">31.2</td><td align="left" valign="middle" rowspan="1" colspan="1">23.8</td><td align="left" valign="middle" rowspan="1" colspan="1">16.6</td><td align="left" valign="middle" rowspan="1" colspan="1">10.4</td><td align="left" valign="middle" rowspan="1" colspan="1">6.4</td><td align="left" valign="middle" rowspan="1" colspan="1">4.0</td><td align="left" valign="middle" rowspan="1" colspan="1">1.8</td><td align="left" valign="middle" rowspan="1" colspan="1">0.0</td></tr></tbody></table></table-wrap></floats-group></article>