Next Article in Journal
Enhancing Self-Esteem and Body Image of Breast Cancer Women through Interventions: A Systematic Review
Next Article in Special Issue
The Impact of Individual Mobility on Long-Term Exposure to Ambient PM2.5: Assessing Effect Modification by Travel Patterns and Spatial Variability of PM2.5
Previous Article in Journal
Repeated Antigen-Based Rapid Diagnostic Testing for Estimating the Coronavirus Disease 2019 Prevalence from the Perspective of the Workers’ Vulnerability before and during the Lockdown
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study

1
Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
2
Westat, 1600 Research Blvd., Rockville, MD 20850, USA
*
Author to whom correspondence should be addressed.
These authors have contributed equally to this work.
Int. J. Environ. Res. Public Health 2021, 18(4), 1637; https://doi.org/10.3390/ijerph18041637
Submission received: 22 December 2020 / Revised: 18 January 2021 / Accepted: 4 February 2021 / Published: 9 February 2021
(This article belongs to the Special Issue Spatial Data Uncertainty in Public Health Research)

Abstract

:
Geocoding is a powerful tool for environmental exposure assessments that rely on spatial databases. Geocoding processes, locators, and reference datasets have improved over time; however, improvements have not been well-characterized. Enrollment addresses for the Agricultural Health Study, a cohort of pesticide applicators and their spouses in Iowa (IA) and North Carolina (NC), were geocoded in 2012–2016 and then again in 2019. We calculated distances between geocodes in the two periods. For a subset, we computed positional errors using “gold standard” rooftop coordinates (IA; N = 3566) or Global Positioning Systems (GPS) (IA and NC; N = 1258) and compared errors between periods. We used linear regression to model the change in positional error between time periods (improvement) by rural status and population density, and we used spatial relative risk functions to identify areas with significant improvement. Median improvement between time periods in IA was 41 m (interquartile range, IQR: −2 to 168) and 9 m (IQR: −80 to 133) based on rooftop coordinates and GPS, respectively. Median improvement in NC was 42 m (IQR: −1 to 109 m) based on GPS. Positional error was greater in rural and low-density areas compared to in towns and more densely populated areas. Areas of significant improvement in accuracy were identified and mapped across both states. Our findings underscore the importance of evaluating determinants and spatial distributions of errors in geocodes used in environmental epidemiology studies.

1. Introduction

In environmental epidemiology studies of spatially- and temporally-referenced exposures, geocoding of residential addresses is typically used to assign latitude and longitude coordinates in a Geographic Information System (GIS) as a first step in an exposure assessment. However, relatively few studies have compared geocoding methods or assessed the accuracy of geocodes [1,2,3,4,5]. In rural areas, residential proximity to crop fields treated with pesticides can increase pesticide concentrations in homes, with implications for the risk of adverse health outcomes [6,7,8]. Positional errors and imprecision in geocoding may result in exposure misclassification, with implications for exposure assessment of drinking water sources [3,4], non-occupational agricultural pesticides [2,3,9], air pollution [5,10], and other environmental exposures [11,12].
Positional accuracy is defined as the distance between the residence’s location, best determined using a field survey method such as a Global Positioning System (GPS) device or digitally enhanced aerial orthoimagery [13,14,15], and the geocoded point location linked to the address [16]. Evaluating variability in the accuracy of geocoding methods allows investigators to evaluate the potential impact of this variability on their analyses of environmental exposures at the residence and human health [17,18,19]. The assessment of geocoding accuracy is particularly important in rural areas, where studies have shown greater positional error than in urban or suburban areas [20,21]. In addition to positional accuracy, geocoding quality is quantified by a match rate that is the proportion of addresses that can be geocoded [22]. Positional accuracy and match rates of rural addresses geocoded using street databases tend to be worse than urban or suburban addresses due to the larger street segments and distances between the house and public road, which may magnify interpolation errors [2]. Match rates also tend to be lower in rural areas compared to urban areas in part due to the prevalence of rural routes and Post Office (PO) boxes [16], which cannot be located accurately by geocoding [3,23].
The match rate and positional accuracy of geocoding depend on the assumptions behind the matching algorithms as well as the accuracy of the underlying reference (street) datasets [24,25]. The address is matched to a spatial coordinate in the street reference dataset through a matching process that enables matching at different levels of precision based on the address input information available, while a minimum match score that reflects the level of confidence in locating the address must be met or exceeded [26,27]. These algorithms and datasets have improved over time, highlighting the need to assess how older geocode coordinates correspond to newer geocode coordinates obtained with more accurate and complete street databases [25]. In this study, we compared coordinates obtained from an older geocoding effort (Geocode database Version 1: 2012–2016) to a newer geocoding effort (Geocode database Version 2: 2019) in the Agricultural Health Study (AHS), a large cohort of pesticide applicators and their spouses in Iowa (IA) and North Carolina (NC), USA. The primary objective of this study was to compare the geocodes to gold-standard coordinates (GPS and rooftop coordinates) in order to evaluate positional accuracy by geocode match status and rurality and determine if geocodes improved over time. We also conducted a spatial analysis to determine geographic areas with improvements in positional accuracy between the two geocoding efforts.

2. Materials and Methods

2.1. Study Population

The AHS is a prospective cohort of 52,394 licensed private pesticide applicators residing in Iowa and North Carolina, 32,345 of their spouses, and 4916 commercial applicators in Iowa. Details about the study design and cohort have been described [28]. Notably, over 80% of private pesticide applicators were enrolled in the study in Iowa and North Carolina, two states with large farming populations in the Midwest and Southeast, respectively [29]. To date, the AHS has included four phases, starting with enrollment (Phase 1) in 1993–1997 and followed by three follow-up interviews. At Phase 1, applicators and spouses provided the address of their current residence. Applicators and spouses also confirmed or provided an update to their address (including updating rural route addresses with street addresses) during follow-up interviews at Phase 2 (1999–2003), Phase 3 (2005–2010), and Phase 4 (2013–2015). Additionally, address changes were recorded during tracing efforts from 2012 to 2018. A street address from later follow-up surveys was substituted when an earlier address was a rural route or PO Box if the participant had not moved. As spouses resided at the same set of addresses as applicators at Phase 1, this analysis is limited to only private and commercial pesticide applicators (N = 36,792 in IA; N = 20,518 in NC).

2.2. Geocoding Process

Phase 1 enrollment address geocodes are being used to assess environmental exposures including nitrate in private wells and proximity to concentrated animal feeding operations for studies of cancer incidence [11,30,31]. The Phase 1 addresses have been geocoded several times and have been compiled into distinct geocode database versions. In Version 1, IA and NC addresses were batch- and interactively geocoded with the Esri ArcGIS Geocoding Engine software using the NAVTEQ 2011 street database in 2012 for IA and the HERE (formerly Navteq) 2015 street database in 2016 for NC [3]. During batch geocoding, an input table of participant addresses was automatically matched to the reference street database, primarily using the Point Address (not widely used until 2014) and Street Address locators. Point Address locators use a database of address points based on land parcel data; matches return latitude and longitude coordinates for an address, typically in the center of the land parcel for the property. Street Address locators use a spatially-referenced street database that provides the range of possible addresses for a street segment on the odd and even sides of the street. The input address number’s latitude and longitude coordinates are assigned by interpolation within the address range on the street segment. Street Address settings for batch-matched addresses included a street offset of 30 feet (9.1 m) from the street centerline and an end offset (squeeze factor) of 10% of the length of the street segment. The street centerline and end offsets are used to approximate the distance of the home away from the center of the street and to avoid addresses at the end of a street from being placed too close to an intersecting road, respectively [26]. The minimum match score/tolerance, a setting with a range of 0 to 100 to control how closely addresses match to their most likely candidate, was set to 80, the default value. Other match status types (Street Centroid, ZIP Centroid, City Centroid) were typically returned when only a partial address was available or if the street address did not exist within the range of existing address numbers on the street.
In the first geocoding effort, we conducted interactive geocoding when a street address was known but did not return a match with the Point Address or Street Address locators. Interactive geocoding is the manual process of correcting address information and assigning geographic coordinates. Addresses were searched using Google Maps® and other internet sites to identify potential spelling errors, formatting inconsistencies, or areas where the reference data were missing or incomplete. If the address was in the form of a rural route or PO Box, interactive geocoding was not used because the location could not be improved beyond the ZIP code centroid.
IA and NC Phase 1 addresses were batch geocoded again in 2019 (database Version 2). Batch geocoding in 2019 used the Esri ArcGIS Geocoding Engine software and HERE StreetMap Premium 2018 V1, a street reference dataset in which each locator had its own default match score and offset. The Point Address locator was used for any address meeting the minimum match score of 93. The minimum match score used for the Street Address locator was 85. The Street Address locator used a default offset of 25 feet (7.6 m) from the center line and an end offset of 10%. The quality of match statuses for both geocoding efforts was considered good if the status was Point Address, Interactive (Version 1 only), or Street Address; other match statuses were considered poor.

2.3. Positional Accuracy: Rooftop Coordinates for a Subset of Iowa Participants

Rooftop locations of enrollment addresses for IA AHS participants who resided in 14 counties in western Iowa (Supplemental Figure S1) were assigned by researchers at the University of Iowa/Iowa Field Station by comparing geocoded participant addresses (geocoded with ArcGIS) to latitude/longitude data from the Iowa Roof-top Coordinates project. Orthoimagery, ESRI’s street database, Iowa E911 address data, and Iowa spatial parcel information were also used to aid in the assignments [3]. A total of 3566 applicators had rooftop coordinates linked to their geocoded address. Of these participants, 3467 and 3368 had a good match status for their Version 1 and Version 2 geocodes, respectively.

2.4. Positional Accuracy: Comparison to Home GPS Readings

The Biomarkers of Exposure and Effect in Agriculture (BEEA) study is a molecular epidemiologic sub-study in the AHS (N = 1681), in which most participants had a GPS reading taken at the front entrance of their home at the time of an in-home interview [32]. Residence locations were primarily in eastern IA but covered most of the state in NC (Supplemental Figure S1). Quality assurance and quality control were performed on the GPS coordinates for participants if the geocode and GPS locations were ≥500 m apart (n = 304). GPS readings were checked against the street addresses to ensure they were located on the correct street and city. If address attributes did not match, the GPS locations were not considered a gold standard for computing positional error. After excluding participants who moved since enrollment and those without GPS coordinates or verified coordinates at the Phase 1 address, 1258 participants (991 in IA, 267 in NC) were eligible for inclusion in this analysis. Of these participants, 1234 and 1192 had a good match status for their Version 1 and Version 2 geocodes, respectively.

2.5. Statistical Analysis

For participants with a good geocode match status in both geocoding efforts (N = 32,035 or 87.1% in IA, N = 16,919 or 82.5% in NC), we calculated distances between Version 1 and Version 2 geocodes in ArcGIS. Separately for Version 1 and 2 geocodes with a good match status, we calculated positional error (Z), defined as the Euclidean distance between the geocode and one of the three gold-standard coordinate datasets (GPS readings in IA and NC and rooftop coordinates in IA). Means and distributions of these distances and positional errors were calculated by the type of good geocode match status. Improvement was assessed as the difference in positional error between Version 1 and Version 2 geocodes if both were a good match status (calculated as ZVersion1-ZVersion2). Linear regression models were used to evaluate whether rural status and population density were predictive of the positional error of Version 2 geocodes and improvement in positional error between geocoding efforts. For modeling, positional error improvement was parameterized as a natural logarithm-transformed ratio (ln(ZVersion2/ZVersion1)). Rural status was determined based on whether the GPS or rooftop coordinates were located within a Census 2000 Incorporated Place (considered non-rural) or located outside an Incorporated Place (rural). Block-level population density at the GPS or rooftop coordinates was obtained from the 2010 Census. Moran’s I statistic was used to examine the spatial autocorrelation of residuals from the linear regression models (Euclidean inverse-distance weighting in meters with p-values based on a normal approximation).

2.6. Spatial Analysis

Full details of the spatial analyses are presented in Supplemental Appendix A. For linear regression models with significant spatial autocorrelation of the residuals, a simultaneous autoregressive error model was used with an adjacency matrix based on k-nearest neighbors to determine the coefficients and p-values for the two predictor variables. To provide a visual representation of areas with low and high positional error without showing participant locations (which are protected identifiable information), positional errors of the geocodes were spatially interpolated through kriging. The best-fitting semivariogram of natural log-transformed positional error between each gold standard and geocoding effort was selected using the gstat package in R (Ver 3.5.3). The improvement in positional error was evaluated spatially using a spatial relative risk function. We extended the conventional approach of the spatial relative risk function to compare a binary temporal grouping (Version 2 geocodes vs. Version 1 geocodes). We incorporated spatial densities that were weighted by the positional error (in meters); the “risk” is characterizing the spatial distribution of positional error in the Version 2 geocodes relative to Version 1 geocodes across a study area. To accomplish this, the spatial density of study participants for each of the three gold-standard coordinate sets was weighted by their positional error (Z) for both Version 1 and Version 2 geocodes. Next, the spatial relative risk function of the weighted spatial densities was calculated using the spatstat package in R (Ver 1.64-1). We present the natural log-transformed relative risk estimate (ln(E)) and its standard error (σln(E)), which we approximate using the delta method (σln(E) = σE/E) where the standard error of the relative risk estimate (σE) is divided by the relative risk estimate (E) at each smoothed grid location in the study area. Significance in positional error improvement or deterioration, compared to an expectation of homogeneous relative risk (null value of E = 1), was determined as smoothed grid cells with a relative risk estimate (E) that exceeded a two-tailed 95% confidence interval under a normal approximation for the spatial relative risk function. This approach is based on similar assumptions by Hazelton and Davies (2009) but implemented for weighted spatial densities [33].

3. Results

3.1. Geocoding Process

Match status and match quality for both geocoding efforts are presented in Table 1, separately by state. Over 85% of addresses had a good geocode status with a higher percentage in Version 1 (IA = 91.5%, NC = 85.4%) compared to Version 2 (IA = 88.7%, NC = 82.5%) due to the interactive geocoding of Version 1 street addresses that did not have a good match status after batch geocoding. Though the Point Address locator was used minimally in the Version 1 effort, it was prioritized in batch geocoding processes by 2019, and thus most addresses were geocoded to this match type in 2019 (IA = 72.1%, NC = 68.8%).
Distances between Version 1 and Version 2 geocoded coordinates with good match statuses varied by state (Table 2). Overall, the median distance between geocode pairs was 170 m (interquartile range, IQR: 53–293 m) for IA and 87 m (IQR: 39–202 m) for NC. Median distance between geocodes was generally lowest when match statuses were the same, such as when both were Point Address (145 m in IA and 57 m in NC) or Street Address (2 m in IA and 0 m in NC). Though 95% of the distances between pairs were under 936 m for IA and 556 m for NC, there were several extreme values (maximums of 48.5 km in IA and 35.6 km in NC).

3.2. Positional Accuracy

Positional error calculations for Iowa participants with rooftop coordinates are presented in Table 3 for both geocoding efforts. Ninety-seven percent had a good match status in the Version 1 (mostly Street Address) and Version 2 (mostly Point Address) geocoding efforts. Overall, the median positional error was 124 m (IQR: 60–290 m) for the Version 1 geocodes and 65 m (IQR: 30–171 m) for the Version 2 geocodes. For Version 2, Point Address geocodes were more accurate than Street Address geocodes based on each percentile cutpoint (5, 25, 50, 75, 95%). For participants with good quality geocodes in both efforts, 67.4% of geocodes had less positional error in the Version 2 geocoding effort; median improvement between the geocoding efforts was 41 m (IQR: −2 to 168 m).
Positional errors computed from the GPS readings are presented in Table 4. Overall, geocodes were more accurate in the Version 2 geocoding effort than the Version 1 effort as quantified by the mean, median, and other percentile distributions of positional error. Geocodes were also more accurate for NC compared to IA for both Version 1 and Version 2 datasets. Additionally, in the Version 2 dataset, the median positional errors of geocodes assessed with the Point Address locator were more accurate than Street Address geocodes for both IA (Point Address = 166 m; Street Address = 193 m) and NC (Point Address = 42 m, Street Address = 178 m). For IA participants, the median improvement between efforts was 9 m (IQR: −80 to 133 m), and positional error decreased for 53.2% of the participants. For NC participants, the overall median improvement was 42 m (IQR: −1 to 109 m), and 71.4% had a more accurate Version 2 geocode.
Supplemental Tables S1 and S2 show the positional error of geocodes by rural status in both IA and NC as assessed by comparisons to rooftop or GPS coordinates, respectively. Across both states, both geocoding efforts and both reference coordinate sets (rooftop or GPS) geocodes for non-rural addresses were more accurate than geocodes for addresses in rural areas. For example, the median positional error for rural addresses geocoded in the Version 2 dataset in IA was 173 m (IQR: 73–242 m), but it was 25 m (IQR: 14–59 m) for the non-rural addresses geocoded in the same effort. Though the positional error of geocodes was comparable between non-rural addresses in IA and NC, the accuracy of rural geocodes was much higher in NC (Median = 48 m; IQR: 25–158 m) than in IA (Median = 173 m; IQR: 73–242 m) in the Version 2 dataset. Improvement in geocoding accuracy between the 1st and 2nd geocoding efforts was seen in both the rural and non-rural geocodes in both states.
Using linear regression models, we found that rural status and population density were significant predictors of positional error in the Version 2 geocodes and of improvement in positional error between geocoding efforts in IA (Table 5). For the IA rooftop coordinates, increased population density resulted in smaller positional errors (est = −4.78 m per 100 persons/km2 change; p = 0.05) as well as improvement in positional error between the two geocoding efforts (est = 0.98; p < 0.01). For the IA GPS comparisons, non-rural location was predictive of lower positional error (estimate = −507.8 m; p = 0.04) and improvement (improvement ratio = 0.69; p = 0.06). For the GPS comparisons in NC, a state with a higher overall population density, non-rural location and increased population density decreased positional error; however, neither variable was significant. Significant spatial autocorrelation of residuals was observed in the improvement models using the IA rooftop coordinates (Moran’s I = 0.076; p < 0.01) and IA GPS coordinates (Moran’s I = 0.012; p < 0.01); plots of the global Moran’s I are presented in Supplemental Figure S2.
Spatially smoothed maps of positional error and improvement for each of the three reference sets are provided in Figure 1. No clear area-wide patterns emerged from the positional error maps. We noted a significant improvement in positional error in several clusters in Iowa using the GPS reference set, with the largest cluster located in central Iowa, west of the city of Waterloo. There was also a significant improvement in positional error in one area of eastern NC located southeast of the city of Raleigh. Based on Iowa rooftop coordinates, there were multiple areas of significant improvement in positional error, including the entirety of Kossuth County in north-central Iowa.

4. Discussion

In this comparison of two geocoding efforts in the AHS, we found that the positional accuracy of the geocoded residential addresses improved in both IA and NC. We noted smaller positional errors for addresses in NC compared to Iowa, and we found greater positional error in rural and less densely populated areas. We found that positional errors were spatially autocorrelated in IA and that certain regions of both IA and NC had significantly improved geolocations between the two geocoding efforts.
Our findings show that the newer geocoding locators and street reference datasets can more accurately capture the location of residences, which is important for exposure classification in environmental epidemiology studies. Using gold-standard locations as assessed by rooftop coordinates or on-site GPS readings, batch-matched geocodes showed substantial improvement in the second geocoding effort in 2019 compared with the first effort in 2012 and 2016, for IA and NC, respectively. This improvement is partly explained by the shift from the predominant use of the Street Address locator for the Version 1 geocodes (86.5% for IA, 68.7% for NC) to the more accurate Point Address locator for Version 2 (72.1% for IA, 68.8% for NC). However, even addresses matched by the Street Address locator in 2019 showed an improvement in accuracy. Accuracy may have improved over time for several reasons, including data corrections or other updates, such as a change in address ranges for a street segment.
Use of the Point Address locator has increased the positional accuracy of geocodes, likely due to the inclusion of parcel data (property boundaries). When digital parcel data are used for geocoding, coordinates are assigned to an address based on the parcel centroid or the location of the major structure on the property [34]. However, our results may illustrate limitations of geocoding using parcel data in rural areas. The large positional errors for some Point Address geocodes in this study may have occurred due to the poor geocoding accuracy for addresses located in large parcels, which are common in rural areas [16,24,35].
The positional errors we observed for both time periods are comparable to geocoding errors observed in other studies that evaluated geocoded addresses in Iowa [2,36,37] and other US states and countries [16]. In a case-control study of non-Hodgkin lymphoma in Iowa, Ward et al. compared two geocoding methods to GPS measurements at homes and found similar positional errors (medians = 61 and 62 m) to those in our study [2]. Zimmerman et al. modeled the probability distribution of positional errors for 2354 rural addresses in Carroll County IA and found batch-geocoding procedures to have a median error of 168 m [36]. Ganguly et al. found median positional errors of 26 m for batch geocoding using both Bing Maps and ArcGIS for 160 homes exposed to traffic-related air pollution in Detroit, Michigan [35]. Comparing ArcGIS geocodes to residential coordinates obtained using aerial orthoimagery for 506 urban addresses in Saint Louis, Missouri, Schootman et al. found a median positional error of 31 m [13].
We observed greater positional error for rural addresses than for non-rural addresses (located within US Census incorporated places) in both IA and NC. Median positional error based on GPS for rural addresses in IA was more than three times that for non-rural addresses (173 m vs. 23 m), whereas rural and non-rural median errors were more similar in NC (48 m vs. 29 m). The positional error distributions by rural status that we observed were similar to a previous comparison of AHS geocoded addresses that were not limited to enrollment geocodes [3] and similar to the differences observed by Ward et al. for residences located in incorporated places (medians = 50 to 56 m) and rural areas (medians = 88 to 212 m). Others have also examined predictors that might explain the increased positional error observed in rural areas in Iowa and found that street intersection density was predictive of positional error [37]. Our regression models suggest that rural status and population density were predictive of improvement in geocoding accuracy between the two efforts in Iowa. Commercially available geocoding databases likely target improvements to areas with higher population density, which might partly explain the finding of lower positional error among rural NC addresses compared to those in IA. The higher population density in NC may also explain the lack of associations between rural status and population density with positional accuracy and improvement in our linear regression models.
Inaccurate geocodes contribute to exposure misclassification in epidemiological studies [2,3,4]. In an analysis within AHS, Jones et al. found that positional errors comparable to those in our evaluation reduced the sensitivity and specificity of environmental exposure estimates (proximity to row crops and animal feeding operations) even at large buffer sizes (5 km). In several exposure scenarios, they noted up to a 50% reduction in the odds ratio for a hypothetical nested case-control study when address-matched geocodes were used instead of gold-standard coordinates [3]. An analysis of proximity to corn and soybean crops in Iowa by Ward et al. using home GPS coordinates as gold-standard locations found that misclassification was greatest for distances of 100 m compared to 250 and 500 m [2]. An analysis of exposure to concentrated animal feeding operations in Carroll County, Iowa by Mazumdar et al. found median positional errors of 211 m and 46 m [12] that resulted in modest attenuation of a true odds ratio from 1.21 to 1.18 and 1.17 depending on the geocoding method [12]. In an investigation of geocoding methods in rural areas, Vieira et al. evaluated addresses that were erroneously geocoded to street segments not receiving public water supplies or to street segments serviced by a different public water system. Geocoding errors were primarily due to incorrect street numbers and did not result in much exposure misclassification as the entire lengths of the streets were serviced by the same public water system [4]. However, for areas serviced by small public water supplies such as the rural water cooperatives in IA and NC, connections to the public supply can vary from address to address on the same street. Thus, small positional errors could result in the assignment of the wrong drinking water source.
After accounting for population density and rural status, we found significant spatial autocorrelation of positional error improvement in IA. Although clear patterns did not emerge when mapping positional error in IA and NC, our findings of significant spatial autocorrelation in IA and a high degree of spatial heterogeneity are important observations, especially as this could result in spatial clustering of geocoding-related misclassification in future observational studies. Though we did not find significant spatial autocorrelation among models of positional error of Version 2 geocodes, other studies have shown spatial autocorrelation among positional errors from automated geocoding in Iowa [38] and Florida [24]. The clustering of improvements in positional error in certain areas in IA and NC suggests that improvements in geocoding processes or reference databases maintained by commercial geocoding firms are not uniform across large geographic areas.
The strengths of this study include the availability of gold-standard GPS and rooftop coordinates across the study area of both states and the use of both spatial and aspatial models to visualize and determine significant predictors of geocoding accuracy and positional error improvement. Though gold-standard coordinates were not available for all AHS participants, our large sample size of reference coordinates allowed for an examination of accuracy by state, rurality, and match status. The different years of geocodes for IA (2012) and NC (2016) in the Version 1 dataset limit direct comparisons of improvements over time between the states. We tested for spatial autocorrelation of model residuals to determine if another spatial predictor (i.e., beyond rurality and population density) was contributing to positional error and/or improvement. For the two models that showed spatial autocorrelation in the residuals, we performed simultaneous autoregressive error models in order to obtain unbiased coefficients and p-values for the effect of population density and rurality on geocode improvement. Though we were not able to provide maps with local Moran’s I because of our need to protect the confidentiality of participants’ locations, we visualized geocode improvement with spatial relative risk functions and provided global Moran’s I plots in the supplemental to further examine the spatial autocorrelation of the residuals. Our interpolated surface maps provide an overview of positional error across the study area; however, these maps should not be used to extract exact values. It should also be noted that the GPS receivers we used to verify geocodes have positional error that can range from 10 to 20 m [22]. Finally, this study was based in IA and NC in largely rural populations, and all of our results may not be generalizable to other US states. However, our findings of improvements in positional accuracy over time are likely to be relevant to studies in other locations.

5. Conclusions

Recent improvements in the spatial datasets and technology underlying geocoding processes have reduced the positional error in batch geocoded addresses, as demonstrated for AHS participants. Our results indicate that researchers maintaining long-running prospective cohorts may want to update participant geocodes as batch geocoding methods improve over time. Additionally, if available, rooftop coordinates and GPS readings are the most accurate locational data for residential addresses and should be utilized when available. These findings demonstrate the utility and importance of assessing the positional accuracy of geocoding methods (past and present) that are often used in exposure assessment for environmental epidemiology studies.

Supplementary Materials

The following are available online at https://www.mdpi.com/1660-4601/18/4/1637/s1, Figure S1: Number of AHS participants per county with gold-standard rooftop and GPS coordinates in Iowa and North Carolina, Figure S2: Global Moran’s I plots of residuals from Iowa GPS and Iowa Rooftop linear regression improvement ratio models, Table S1: Positional error (m) of Version 1 and Version 2 geocodes compared to rooftop coordinates for Iowa subcohort by rural status, Table S2: Positional error (m) of Version 1 and Version 2 geocodes compared to GPS by rural status.

Author Contributions

Conceptualization, J.A.F., M.S., I.D.B., R.R.J. and M.H.W.; Methodology, J.A.F., I.D.B. and M.H.W.; Formal Analysis, A.R.F., J.A.F., M.S. and I.D.B.; Investigation, J.A.F., M.S., I.D.B. and M.H.W.; Data Curation, A.R.F., L.E.B.F., J.N.H. and M.G.; Writing—Original Draft Preparation, J.A.F., M.S. and M.H.W.; Writing—Review and Editing, J.A.F., M.S., I.D.B., A.R.F., L.E.B.F., J.N.H., M.G., R.R.J. and M.H.W.; Visualization, J.A.F. and I.D.B.; Supervision, M.H.W.; Project Administration, L.E.B.F. and M.H.W.; Funding Acquisition, L.E.B.F. and M.H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the intramural research program of the National Institutes of Health, National Cancer Institute (Z01-CP010119).

Institutional Review Board Statement

All study protocols were reviewed and approved by the National Institutes of Health and relevant contractors.

Informed Consent Statement

Consistent with human subjects protection requirements in effect at the time of enrollment, implied consent for completion of questionnaires and follow-up was approved by all relevant institutional review boards.

Data Availability Statement

Requests for data, including the data used in this manuscript, are welcome as described on the Study Website (https://www.aghealth.nih.gov/collaboration/process.html). Data requests may be made directly at www.aghealthstars.com; registration is required. The Agricultural Health Study is an ongoing prospective study. The data sharing policy was developed to protect the privacy of study participants.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Faure, E.; Danjou, A.M.N.; Clavel-Chapelon, F.; Boutron-Ruault, M.-C.; Dossus, L.; Fervers, B. Accuracy of Two Geocoding Methods for Geographic Information System-Based Exposure Assessment in Epidemiological Studies. Environ. Health 2017, 16, 15. [Google Scholar] [CrossRef] [Green Version]
  2. Ward, M.H.; Nuckols, J.R.; Giglierano, J.; Bonner, M.R.; Wolter, C.; Airola, M.; Mix, W.; Colt, J.S.; Hartge, P. Positional Accuracy of Two Methods of Geocoding. Epidemiology 2005, 16, 542–547. [Google Scholar] [CrossRef]
  3. Jones, R.R.; DellaValle, C.T.; Flory, A.R.; Nordan, A.; Hoppin, J.A.; Hofmann, J.N.; Chen, H.; Giglierano, J.; Lynch, C.F.; Beane Freeman, L.E.; et al. Accuracy of Residential Geocoding in the Agricultural Health Study. Int. J. Health Geogr. 2014, 13, 37. [Google Scholar] [CrossRef] [Green Version]
  4. Vieira, V.M.; Howard, G.J.; Gallagher, L.G.; Fletcher, T. Geocoding Rural Addresses in a Community Contaminated by PFOA: A Comparison of Methods. Environ. Health 2010, 9, 18. [Google Scholar] [CrossRef] [Green Version]
  5. Kinnee, E.J.; Tripathy, S.; Schinasi, L.; Shmool, J.L.C.; Sheffield, P.E.; Holguin, F.; Clougherty, J.E. Geocoding Error, Spatial Uncertainty, and Implications for Exposure Assessment and Environmental Epidemiology. Int. J. Environ. Res. Public Health 2020, 17, 5845. [Google Scholar] [CrossRef]
  6. Ward, M.H.; Lubin, J.; Giglierano, J.; Colt, J.S.; Wolter, C.; Bekiroglu, N.; Camann, D.; Hartge, P.; Nuckols, J.R. Proximity to Crops and Residential Exposure to Agricultural Herbicides in Iowa. Environ. Health Perspect. 2006, 114, 893–897. [Google Scholar] [CrossRef] [PubMed]
  7. Gunier, R.B.; Ward, M.H.; Airola, M.; Bell, E.M.; Colt, J.; Nishioka, M.; Buffler, P.A.; Reynolds, P.; Rull, R.P.; Hertz, A.; et al. Determinants of Agricultural Pesticide Concentrations in Carpet Dust. Environ. Health Perspect. 2011, 119, 970–976. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Dereumeaux, C.; Fillol, C.; Quenel, P.; Denys, S. Pesticide Exposures for Residents Living Close to Agricultural Lands: A Review. Environ. Int. 2020, 134, 105210. [Google Scholar] [CrossRef] [PubMed]
  9. Deziel, N.C.; Friesen, M.C.; Hoppin, J.A.; Hines, C.J.; Thomas, K.; Freeman, L.E.B. A Review of Nonoccupational Pathways for Pesticide Exposure in Women Living in Agricultural Areas. Environ. Health Perspect. 2015, 123, 515–524. [Google Scholar] [CrossRef] [Green Version]
  10. Gilboa, S.M.; Mendola, P.; Olshan, A.F.; Harness, C.; Loomis, D.; Langlois, P.H.; Savitz, D.A.; Herring, A.H. Comparison of Residential Geocoding Methods in Population-Based Study of Air Quality and Birth Defects. Environ. Res. 2006, 101, 256–262. [Google Scholar] [CrossRef]
  11. Fisher, J.A.; Freeman, L.E.B.; Hofmann, J.N.; Blair, A.; Parks, C.G.; Thorne, P.S.; Ward, M.H.; Jones, R.R. Residential Proximity to Intensive Animal Agriculture and Risk of Lymphohematopoietic Cancers in the Agricultural Health Study. Epidemiology 2020, 31, 478–489. [Google Scholar] [CrossRef] [PubMed]
  12. Mazumdar, S.; Rushton, G.; Smith, B.J.; Zimmerman, D.L.; Donham, K.J. Geocoding Accuracy and the Recovery of Relationships between Environmental Exposures and Health. Int. J. Health Geogr. 2008, 7, 13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Schootman, M.; Sterling, D.A.; Struthers, J.; Yan, Y.; Laboube, T.; Emo, B.; Higgs, G. Positional Accuracy and Geographic Bias of Four Methods of Geocoding in Epidemiologic Research. Ann. Epidemiol. 2007, 17, 464–470. [Google Scholar] [CrossRef] [PubMed]
  14. Zhan, F.B.; Brender, J.D.; De Lima, I.; Suarez, L.; Langlois, P.H. Match Rate and Positional Accuracy of Two Geocoding Methods for Epidemiologic Research. Ann. Epidemiol. 2006, 16, 842–849. [Google Scholar] [CrossRef]
  15. Zhang, Z.; Manjourides, J.; Cohen, T.; Hu, Y.; Jiang, Q. Spatial Measurement Errors in the Field of Spatial Epidemiology. Int. J. Health Geogr. 2016, 15, 21. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Zandbergen, P.A. Geocoding Quality and Implications for Spatial Analysis. Geogr. Compass 2009, 3, 647–680. [Google Scholar] [CrossRef]
  17. Jacquez, G.M. A Research Agenda: Does Geocoding Positional Error Matter in Health GIS Studies? Spat. Spat. Epidemiol. 2012, 3, 7–16. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Krieger, N.; Waterman, P.; Lemieux, K.; Zierler, S.; Hogan, J.W. On the Wrong Side of the Tracts? Evaluating the Accuracy of Geocoding in Public Health Research. Am. J. Public Health 2001, 91, 1114–1116. [Google Scholar] [PubMed] [Green Version]
  19. Han, D.; Bonner, M.R.; Nie, J.; Freudenheim, J.L. Assessing Bias Associated with Geocoding of Historical Residence in Epidemiology Research. Geospat. Health 2013, 7, 369–374. [Google Scholar] [CrossRef] [Green Version]
  20. Khan, S.; Pinault, L.; Tjepkema, M.; Wilkins, R. Positional Accuracy of Geocoding from Residential Postal Codes versus Full Street Addresses. Health Rep. 2018, 29, 3–9. [Google Scholar]
  21. Cayo, M.R.; Talbot, T.O. Positional Error in Automated Geocoding of Residential Addresses. Int. J. Health Geogr. 2003, 2, 10. [Google Scholar] [CrossRef] [Green Version]
  22. Ribeiro, A.I.; Olhero, A.; Teixeira, H.; Magalhães, A.; Pina, M.F. Tools for Address Georeferencing-Limitations and Opportunities Every Public Health Professional Should Be Aware of. PLoS ONE 2014, 9, e114130. [Google Scholar] [CrossRef] [PubMed]
  23. Hurley, S.E.; Saunders, T.M.; Nivas, R.; Hertz, A.; Reynolds, P. Post Office Box Addresses: A Challenge for Geographic Information System-Based Studies. Epidemiology (Cambridge, Mass.) 2003, 14, 386–391. [Google Scholar] [CrossRef] [PubMed]
  24. Koo, H.; Chun, Y.; Griffith, D.A. Modeling Positional Uncertainty Acquired Through Street Geocoding. Int. J. Appl. Geosp. Res. 2018, 9, 1–22. [Google Scholar] [CrossRef] [Green Version]
  25. Zandbergen, P.A. A Comparison of Address Point, Parcel and Street Geocoding Techniques. Comput. Environ. Urban Syst. 2008, 32, 214–232. [Google Scholar] [CrossRef]
  26. ESRI Geocoding Options Properties—Help | ArcGIS for Desktop. Available online: https://desktop.arcgis.com/en/arcmap/10.3/guide-books/geocoding/geocoding-options-properties.htm (accessed on 21 December 2020).
  27. Patel, N. Geocoding: Delivering High Location Accuracy. Available online: https://www.esri.com/arcgis-blog/products/analytics/analytics/geocoding-delivering-high-location-accuracy/ (accessed on 18 January 2021).
  28. Alavanja, M.C.; Sandler, D.P.; McMaster, S.B.; Zahm, S.H.; McDonnell, C.J.; Lynch, C.F.; Pennybacker, M.; Rothman, N.; Dosemeci, M.; Bond, A.E.; et al. The Agricultural Health Study. Environ. Health Perspect. 1996, 104, 362–369. [Google Scholar] [CrossRef]
  29. Questionnaires & Study Data | Agricultural Health Study. Available online: https://aghealth.nih.gov/collaboration/questionnaires.html (accessed on 17 January 2021).
  30. Wheeler, D.C.; Nolan, B.T.; Flory, A.R.; DellaValle, C.T.; Ward, M.H. Modeling Groundwater Nitrate Concentrations in Private Wells in Iowa. Sci. Total Environ. 2015, 536, 481–488. [Google Scholar] [CrossRef]
  31. Messier, K.P.; Wheeler, D.C.; Flory, A.R.; Jones, R.R.; Patel, D.; Nolan, B.T.; Ward, M.H. Modeling Groundwater Nitrate Exposure in Private Wells of North Carolina for the Agricultural Health Study. Sci. Total Environ. 2019, 655, 512–519. [Google Scholar] [CrossRef]
  32. Hofmann, J.N.; Beane Freeman, L.E.; Lynch, C.F.; Andreotti, G.; Thomas, K.W.; Sandler, D.P.; Savage, S.A.; Alavanja, M.C. The Biomarkers of Exposure and Effect in Agriculture (BEEA) Study: Rationale, Design, Methods, and Participant Characteristics. J. Toxicol. Environ. Health Part A 2015, 78, 1338–1347. [Google Scholar] [CrossRef] [Green Version]
  33. Hazelton, M.L.; Davies, T.M. Inference Based on Kernel Estimates of the Relative Risk Function in Geographical Epidemiology. Biometr. J. 2009, 51, 98–109. [Google Scholar] [CrossRef]
  34. Rushton, G.; Armstrong, M.P.; Gittler, J.; Greene, B.R.; Pavlik, C.E.; West, M.M.; Zimmerman, D.L. Geocoding in Cancer Research: A Review. Am. J. Prev. Med. 2006, 30, S16–S24. [Google Scholar] [CrossRef]
  35. Ganguly, R.; Batterman, S.; Isakov, V.; Snyder, M.; Breen, M.; Brakefield-Caldwell, W. Effect of Geocoding Errors on Traffic-Related Air Pollutant Exposure and Concentration Estimates. J. Expo. Sci. Environ. Epidemiol. 2015, 25, 490–498. [Google Scholar] [CrossRef] [Green Version]
  36. Zimmerman, D.L.; Fang, X.; Mazumdar, S.; Rushton, G. Modeling the Probability Distribution of Positional Errors Incurred by Residential Address Geocoding. Int. J. Health Geogr. 2007, 6, 1. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Zimmerman, D.L.; Li, J. The Effects of Local Street Network Characteristics on the Positional Accuracy of Automated Geocoding for Geographic Health Studies. Int. J. Health Geogr. 2010, 9, 10. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Zimmerman, D.L.; Li, J.; Fang, X. Spatial Autocorrelation among Automated Geocoding Errors and Its Effects on Testing for Disease Clustering. Stat. Med. 2010, 29, 1025–1036. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Spatial distribution of positional error of Version 2 geocodes and positional error improvement between Version 1 and Version 2 geocodesa for Iowa rooftop coordinates and Iowa and North Carolina GPS coordinates. a Version 1: enrollment addresses were geocoded in 2012 for Iowa and in 2016 for North Carolina. Version 2: addresses were geocoded in 2019 for both states. b Figure denotes the spatially kriged positional error of Version 2 geocodes. Darker colors denote higher values of positional error. c Figure denotes significant improvement ratio, which is calculated as the estimated spatial relative risk of Version 2 geocode and Version 1 geocode (ZVersion2/ZVersion1; weighted by their respective positional error) that exceeds an asymptotic normal null expectation of homogenous relative risk. Blue-colored areas denote significant improvement in positional error between Version 1 and Version 2 geocodes, grey-colored areas denote insignificant change, and red-colored areas denote significant deterioration (which was not observed).
Figure 1. Spatial distribution of positional error of Version 2 geocodes and positional error improvement between Version 1 and Version 2 geocodesa for Iowa rooftop coordinates and Iowa and North Carolina GPS coordinates. a Version 1: enrollment addresses were geocoded in 2012 for Iowa and in 2016 for North Carolina. Version 2: addresses were geocoded in 2019 for both states. b Figure denotes the spatially kriged positional error of Version 2 geocodes. Darker colors denote higher values of positional error. c Figure denotes significant improvement ratio, which is calculated as the estimated spatial relative risk of Version 2 geocode and Version 1 geocode (ZVersion2/ZVersion1; weighted by their respective positional error) that exceeds an asymptotic normal null expectation of homogenous relative risk. Blue-colored areas denote significant improvement in positional error between Version 1 and Version 2 geocodes, grey-colored areas denote insignificant change, and red-colored areas denote significant deterioration (which was not observed).
Ijerph 18 01637 g001
Table 1. Geocode match status (number [N], percentage [%] for enrollment addresses, Agricultural Health Study applicators in Iowa (n = 36,792) and North Carolina (n = 20,518)).
Table 1. Geocode match status (number [N], percentage [%] for enrollment addresses, Agricultural Health Study applicators in Iowa (n = 36,792) and North Carolina (n = 20,518)).
Version 1 aVersion 2 a
Match StatusN%Match StatusN%
IowaGood Match Status
Interactive17304.7---------
Point Address1200.3Point Address26,52872.1
Street Address31,81186.5Street Address611916.6
Total33,66191.5Total32,64788.7
Poor Match Status
Street Centroid8282.3Street Centroid8952.4
ZIP Centroid22696.2ZIP Centroid32168.7
City Centroid240.1City Centroid240.1
Total31218.5Total413511.2
Unassigned b100.0Unassigned b100.0
Overall36,792 Overall36,792
North CarolinaGood Match Status
Interactive2811.4---------
Point Address313915.3Point Address14,11268.8
Street Address14,10068.7Street Address281513.7
Total17,52085.4Total16,92782.5
Poor Match Status
Street Centroid530.3Street Centroid2201.1
ZIP Centroid18048.8ZIP Centroid223010.9
City Centroid10.0City Centroid10.0
Total18589.1Total245111.9
Unassigned b11405.6Unassigned b11405.6
Overall20,518 Overall20,518
a Version 1: enrollment addresses were geocoded in 2012 for Iowa and in 2016 for North Carolina. Version 2: addresses were geocoded in 2019 for both states. b No coordinates were assigned during the geocoding process due to no match to any locator.
Table 2. Distance between Version 1 and Version 2 geocodes for addresses in Iowa and North Carolina by match status.
Table 2. Distance between Version 1 and Version 2 geocodes for addresses in Iowa and North Carolina by match status.
Version 1 a Match StatusVersion 2 a Match Status Distance (m) between Geocodes
NMean (SD)Min5%Median (IQR)95%Max
Iowa
InteractivePoint Address914498 (1839)114186 (55–355)117025,987
Street Address2791210 (2258)621284 (83–1129)634816,728
Point AddressPoint Address105157 (112)224145 (60–208)382579
Street Address8170 (217)6674 (51–234)640640
Street AddressPoint Address25,081389 (1214)026202 (88–326)93348,535
Street Address5648241 (1658)012 (2–45)73747,713
Total32,035372 (1338)02170 (53–293)93648,535
North Carolina
InteractivePoint Address80307 (892)0475 (19–173)14707318
Street Address203530 (5394)617153 (46–9038)14,20215,237
Point AddressPoint Address2922112 (137)01857 (36–130)3791451
Street Address12567 (1652)55106 (14–192)58095809
Street AddressPoint Address11,105255 (865)330121 (63–243)60035,568
Street Address2780196 (1228)000 (0–23)53222,202
Total16,919225 (892)0087 (39–202)55635,568
a Version 1: enrollment addresses were geocoded in 2012 for Iowa and in 2016 for North Carolina. Version 2: addresses were geocoded in 2019 for both states.
Table 3. Positional error (m) of and improvement between Version 1 and Version 2 geocodes compared to rooftop coordinates for Iowa participants by match status.
Table 3. Positional error (m) of and improvement between Version 1 and Version 2 geocodes compared to rooftop coordinates for Iowa participants by match status.
Rooftop Coordinate vs. Geocode Positional Error (m)
NMean (SD)Min5%Median (IQR)95%Max
Version 1 Geocodes a
Interactive130315 (464)423162 (63–338)11763809
Street Address3337386 (1074)326123 (60–287)122715,172
Total3467383 (1058)326124 (60–290)121815,172
Version 2 Geocodes a
Point Address3038102 (153)0664 (29–171)2476847
Street Address402417 (1506)91967 (37–176)126614,796
Total3440139 (543)0665 (30–171)26514,796
Improvement b3368245 (1021)−14,317−14741 (−2–168)107714,887
a Version 1: enrollment addresses were geocoded in 2012 for Iowa and in 2016 for North Carolina. Version 2: addresses were geocoded in 2019 for both states. b Improvement calculated as the difference between the positional error of the Version 1 and Version 2 geocodes and limited to those with geocodes of good match status in both efforts.
Table 4. Positional error (m) of Version 1 and Version 2 geocodes compared to Global Positioning System (GPS) coordinates for Iowa and North Carolina participants by geocode match status.
Table 4. Positional error (m) of Version 1 and Version 2 geocodes compared to Global Positioning System (GPS) coordinates for Iowa and North Carolina participants by geocode match status.
GPS vs. Geocode Positional Error (m)
NMean (SD)Min5%Median (IQR)95%Max
IowaVersion 1 Geocodes a
Interactive43249 (242)1222192 (79–325)7031174
Point Address3138 (35)9999147 (99–168)168168
Street Address922400 (1154)530149 (77–329)101215,609
Total968392 (1128)530150 (78–328)99815,609
Version 2 Geocodes a
Point Address866199 (546)317166 (59–227)46715,567
Street Address82720 (1734)1027193 (76–451)337611,884
Total948244 (742)317167 (62–236)50815,567
Improvement b934148 (905)−1765−3029 (−80–133)61512,102
North CarolinaVersion 1 Geocodes a
Interactive192-----
Point Address4397 (104)102461 (37–102)320550
Street Address222330 (1458)732131 (81–252)68219,324
Total266291 (1335)727117 (67–225)61619,324
Version 2 Geocodes a
Point Address227107 (166)31042 (23–122)4811147
Street Address31838 (3451)2427178 (71–325)93119,404
Total258195 (1213)31146 (25–150)51519,404
Improvement b25861 (224)−1002−20542 (−1–109)4141316
a Version 1: enrollment addresses were geocoded in 2012 for Iowa and in 2016 for North Carolina. Version 2: addresses were geocoded in 2019 for both states. b Improvement calculated as the difference between the positional error of the Version 1 and Version 2 geocodes and limited to those with geocodes of good match status in both efforts.
Table 5. Regression models predicting positional error of Version 2 geocodes and the improvement ratio in positional error between Version 1 and Version 2 geocodes.
Table 5. Regression models predicting positional error of Version 2 geocodes and the improvement ratio in positional error between Version 1 and Version 2 geocodes.
Positional Error of Version 2 GeocodesImprovement Ratio a
ModelEstimatep-ValueEstimatep-Value
Iowa Rooftop
Intercept158.1<0.0010.448<0.001
Non-rural location b−53.10.1310.8920.246
Pop. Density
(100 persons/km2) c
−4.780.0480.9860.025
Moran’s I of residuals0.0010.409
Iowa GPS
Intercept564.1<0.0010.8560.006
Non-rural location b−507.80.0370.7080.079
Pop. Density
(100 persons/km2) c
6.360.7101.0110.445
Moran’s I of residuals0.0040.140
North Carolina GPS
Intercept375.2<0.0010.504<0.001
Non-rural location b−281.60.6060.9550.906
Pop. Density
(100 persons/km2) c
−28.10.5961.0220.572
Moran’s I of residuals−0.0030.9440.0190.141
a Improvement ratio is calculated as the positional error of the Version 2 geocode divided by the positional error of the Version 1 geocode (ZVersion2/ZVersion1). A simultaneous autoregressive error model was used for Iowa Rooftop and Iowa GPS improvement models, as aspatial linear models showed significant spatial autocorrelation of the residuals based on the Moran’s I statistic (p < 0.001). b Non-rural location defined as the location being within a Census 2000 Incorporated Place. Estimate represents the change in positional error associated with a residence in a non-rural location. c Population density at the block level from the 2010 Census. Estimate represents the change in positional error associated with an increase in population density of 100 persons per km2 at the residence.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fisher, J.A.; Spaur, M.; Buller, I.D.; Flory, A.R.; Beane Freeman, L.E.; Hofmann, J.N.; Giangrande, M.; Jones, R.R.; Ward, M.H. Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study. Int. J. Environ. Res. Public Health 2021, 18, 1637. https://doi.org/10.3390/ijerph18041637

AMA Style

Fisher JA, Spaur M, Buller ID, Flory AR, Beane Freeman LE, Hofmann JN, Giangrande M, Jones RR, Ward MH. Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study. International Journal of Environmental Research and Public Health. 2021; 18(4):1637. https://doi.org/10.3390/ijerph18041637

Chicago/Turabian Style

Fisher, Jared A., Maya Spaur, Ian D. Buller, Abigail R. Flory, Laura E. Beane Freeman, Jonathan N. Hofmann, Michael Giangrande, Rena R. Jones, and Mary H. Ward. 2021. "Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study" International Journal of Environmental Research and Public Health 18, no. 4: 1637. https://doi.org/10.3390/ijerph18041637

APA Style

Fisher, J. A., Spaur, M., Buller, I. D., Flory, A. R., Beane Freeman, L. E., Hofmann, J. N., Giangrande, M., Jones, R. R., & Ward, M. H. (2021). Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study. International Journal of Environmental Research and Public Health, 18(4), 1637. https://doi.org/10.3390/ijerph18041637

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop