A recent perspective in this journal (
The problem of regional differences in spatial resolution of county-referenced data is, unfortunately, reflected in counties, ZIP codes, and census tracts, as shown in plots of nearest-neighbor distances among unit centroids as a function of longitude (
Longitudinal patterns in nearest-neighbor distances for A) counties, B) ZIP codes, and C) census tracts across the lower 48 United States, showing trends toward greater spacing among districts in the western United States compared with the eastern United States in all 3 regionalizations.
The problem of coarse spatial resolution is only partially addressed by the ZIP code or census tract solution. ZIP codes and census tracts cover fixed areas and can misrepresent the spatial precision of epidemiologic records. A traveling salesperson who covers the state of Wyoming each week would be represented identically as his or her next-door neighbor who is housebound, although spatial precision differs considerably between the 2 persons. Precision of the housebound neighbor could be better represented than county, ZIP code, or census tract. ZIP codes and census tracts change periodically, and ZIP codes do not have defined spatial extents per se (
The biodiversity world has already addressed this challenge. The point-radius method for georeferencing locality descriptions (
As an example of how the point-radius method would be applied, the locality for our traveling salesperson would be assigned to his or her house, but the error radius would be 360 km (based on the corner-to-corner distance across Wyoming). The housebound neighbor might have a similar set of coordinates (next door), but the error radius might be 0.1 km (breadth of the house plus the imprecision of the global positioning system unit). When a researcher uses these data, he or she might wish to analyze occurrence of this disease with a spatial precision of 1 km; e.g., applying a filter to exclude those data records too imprecise for this study, he or she would exclude the data record for the salesperson (because the salesperson may have contracted the disease in another sector of the state) but include that for the housebound neighbor. Alternatively, the researcher may include variable degrees of precision in the analysis according each to record a precision or certainty corresponding to its error radius, as in recent spatial analyses of Marburg virus transmission risk (
How specifically would this method be implemented in public health surveillance? If data are to be captured initially on paper, the data recorder would simply record the focal point of the person’s activities (usually a residence) and an approximate description of the person’s movements (e.g., broadly across the state, housebound, within 20 miles). These descriptions are easily georeferenced post hoc by using recently developed software tools (e.g., Biogeomancer,
The point-radius approach is novel to most epidemiologic applications but offers considerable advantages. When fine-resolution data are available, researchers will have this more precise information and can distinguish it from coarser resolution data; when actual data are coarser, this information is also expressed. Researchers will be able to filter epidemiologic occurrence information to retain those data that are sufficiently precise for particular applications, thus offering a considerable improvement over any of the 3 polygon-based approaches (ZIP codes, census tracts, and counties). Thus, the recent publication cited (
In his comment, Peterson reiterates the need for improved methods for collecting and presenting spatial epidemiologic data for vector-borne diseases (
With regard to practical implementation of the point-radius method in a public health setting, Peterson states, “If data are to be captured initially on paper, the data recorder would simply record the focal point of the person’s activities (usually a residence) and an approximate description of the person’s movements (e.g., broadly across the state, housebound, within 20 miles)” (
Our first concern is that Peterson’s scenario does not distinguish between a car trip to the mall at noon and spending an evening on the golf course. In reality, one activity presents minimal risk for exposure to mosquitoes infected with West Nile virus, whereas the other is a potential high-risk activity. Giving equal weight to the movements represented by these activities will assuredly produce an unreliable result for probable pathogen exposure site. Other issues are patient recall and reluctance to provide information on movement patterns and specific activities. Peterson’s suggestion that the data recorder would simply record the focal point of the person’s activities and an approximate description of the person’s movements is therefore a grossly oversimplified solution to a complex public health problem.
With regard to the second concern, the average physician likely lacks the knowledge, time, and training in vector-borne disease epidemiology and ecology needed to accurately assess when and where risk for pathogen exposure occurred. To be of use, the method will require in-depth patient interviews by specially trained personnel from local or state health departments. Even then, we doubt that the quality of data gleaned would justify the cost incurred.
We fail to see that the quality of information gathered by using the point-radius method would be an improvement over our suggestion. In our original article, we suggested using sets of standardized questions that are tailored to a given vector-borne disease. We also indicated that a critical minimal need includes a basic assessment of whether pathogen exposure likely occurred in 1) the peridomestic environment, 2) outside the peridomestic environment but within the county of residence, or 3) outside the county of residence (
The challenge of how to most effectively collect and present spatial epidemiologic data is neither conceptual nor technologic; rather, it is logistic and legal. Any new method must 1) weigh the public health utility of the method against the time and cost required for the public health system to implement it and 2) comply with existing patient privacy laws. The point-radius method clearly fails on the first count and also likely will present substantial problems in terms of patient privacy.
We agree that presenting data for case counts and disease incidence by ZIP code or census tract falls short of the desired level of spatial precision. However, this realistic compromise 1) is a marked improvement over the current practice to display only county-based spatial patterns for case counts or incidence; 2) incurs only minimal added time and cost for the public health community; and 3) can be implemented, especially for census tracts, with minimal concerns regarding patient privacy.