Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Population
2.2. Geocoding Process
2.3. Positional Accuracy: Rooftop Coordinates for a Subset of Iowa Participants
2.4. Positional Accuracy: Comparison to Home GPS Readings
2.5. Statistical Analysis
2.6. Spatial Analysis
3. Results
3.1. Geocoding Process
3.2. Positional Accuracy
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References

| Version 1 a Match Status | N | % | Version 2 a Match Status | N | % |
|---|---|---|---|---|---|
| Iowa | | | | | |
| Good Match Status | | | | | |
| Interactive | 1730 | 4.7 | --- | --- | --- |
| Point Address | 120 | 0.3 | Point Address | 26,528 | 72.1 |
| Street Address | 31,811 | 86.5 | Street Address | 6119 | 16.6 |
| Total | 33,661 | 91.5 | Total | 32,647 | 88.7 |
| Poor Match Status | | | | | |
| Street Centroid | 828 | 2.3 | Street Centroid | 895 | 2.4 |
| ZIP Centroid | 2269 | 6.2 | ZIP Centroid | 3216 | 8.7 |
| City Centroid | 24 | 0.1 | City Centroid | 24 | 0.1 |
| Total | 3121 | 8.5 | Total | 4135 | 11.2 |
| Unassigned b | 10 | 0.0 | Unassigned b | 10 | 0.0 |
| Overall | 36,792 | | Overall | 36,792 | |
| North Carolina | | | | | |
| Good Match Status | | | | | |
| Interactive | 281 | 1.4 | --- | --- | --- |
| Point Address | 3139 | 15.3 | Point Address | 14,112 | 68.8 |
| Street Address | 14,100 | 68.7 | Street Address | 2815 | 13.7 |
| Total | 17,520 | 85.4 | Total | 16,927 | 82.5 |
| Poor Match Status | | | | | |
| Street Centroid | 53 | 0.3 | Street Centroid | 220 | 1.1 |
| ZIP Centroid | 1804 | 8.8 | ZIP Centroid | 2230 | 10.9 |
| City Centroid | 1 | 0.0 | City Centroid | 1 | 0.0 |
| Total | 1858 | 9.1 | Total | 2451 | 11.9 |
| Unassigned b | 1140 | 5.6 | Unassigned b | 1140 | 5.6 |
| Overall | 20,518 | | Overall | 20,518 | |

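
The percentages in the match-status table follow directly from the counts and the overall number of addresses per state. As a minimal sketch (the dictionary layout and the helper name `match_rate` are illustrative choices, not part of the study), the good-match rates can be recomputed as:

```python
# Counts copied from the match-status table above.
counts = {
    "Iowa": {"overall": 36792, "good_v1": 33661, "good_v2": 32647},
    "North Carolina": {"overall": 20518, "good_v1": 17520, "good_v2": 16927},
}


def match_rate(matched: int, overall: int) -> float:
    """Return the share of addresses with a given match status, in percent."""
    return round(100.0 * matched / overall, 1)


# Map each state to its (Version 1, Version 2) good-match percentage.
rates = {
    state: (
        match_rate(c["good_v1"], c["overall"]),
        match_rate(c["good_v2"], c["overall"]),
    )
    for state, c in counts.items()
}

for state, (v1, v2) in rates.items():
    print(f"{state}: Version 1 {v1}% good matches, Version 2 {v2}% good matches")
```

The recomputed values agree with the tabulated "Total" percentages for the good match statuses in both states.
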

Distance (m) between Geocodes

| Version 1 a Match Status | Version 2 a Match Status | N | Mean (SD) | Min | 5% | Median (IQR) | 95% | Max |
|---|---|---|---|---|---|---|---|---|
| Iowa | | | | | | | | |
| Interactive | Point Address | 914 | 498 (1839) | 1 | 14 | 186 (55–355) | 1170 | 25,987 |
| Interactive | Street Address | 279 | 1210 (2258) | 6 | 21 | 284 (83–1129) | 6348 | 16,728 |
| Point Address | Point Address | 105 | 157 (112) | 2 | 24 | 145 (60–208) | 382 | 579 |
| Point Address | Street Address | 8 | 170 (217) | 6 | 6 | 74 (51–234) | 640 | 640 |
| Street Address | Point Address | 25,081 | 389 (1214) | 0 | 26 | 202 (88–326) | 933 | 48,535 |
| Street Address | Street Address | 5648 | 241 (1658) | 0 | 1 | 2 (2–45) | 737 | 47,713 |
| Total | | 32,035 | 372 (1338) | 0 | 2 | 170 (53–293) | 936 | 48,535 |
| North Carolina | | | | | | | | |
| Interactive | Point Address | 80 | 307 (892) | 0 | 4 | 75 (19–173) | 1470 | 7318 |
| Interactive | Street Address | 20 | 3530 (5394) | 6 | 17 | 153 (46–9038) | 14,202 | 15,237 |
| Point Address | Point Address | 2922 | 112 (137) | 0 | 18 | 57 (36–130) | 379 | 1451 |
| Point Address | Street Address | 12 | 567 (1652) | 5 | 5 | 106 (14–192) | 5809 | 5809 |
| Street Address | Point Address | 11,105 | 255 (865) | 3 | 30 | 121 (63–243) | 600 | 35,568 |
| Street Address | Street Address | 2780 | 196 (1228) | 0 | 0 | 0 (0–23) | 532 | 22,202 |
| Total | | 16,919 | 225 (892) | 0 | 0 | 87 (39–202) | 556 | 35,568 |


Positional Error (m)

| Rooftop Coordinate vs. Geocode | N | Mean (SD) | Min | 5% | Median (IQR) | 95% | Max |
|---|---|---|---|---|---|---|---|
| Version 1 Geocodes a | | | | | | | |
| Interactive | 130 | 315 (464) | 4 | 23 | 162 (63–338) | 1176 | 3809 |
| Street Address | 3337 | 386 (1074) | 3 | 26 | 123 (60–287) | 1227 | 15,172 |
| Total | 3467 | 383 (1058) | 3 | 26 | 124 (60–290) | 1218 | 15,172 |
| Version 2 Geocodes a | | | | | | | |
| Point Address | 3038 | 102 (153) | 0 | 6 | 64 (29–171) | 247 | 6847 |
| Street Address | 402 | 417 (1506) | 9 | 19 | 67 (37–176) | 1266 | 14,796 |
| Total | 3440 | 139 (543) | 0 | 6 | 65 (30–171) | 265 | 14,796 |
| Improvement b | 3368 | 245 (1021) | −14,317 | −147 | 41 (−2–168) | 1077 | 14,887 |


Positional Error (m)

| GPS vs. Geocode | N | Mean (SD) | Min | 5% | Median (IQR) | 95% | Max |
|---|---|---|---|---|---|---|---|
| Iowa | | | | | | | |
| Version 1 Geocodes a | | | | | | | |
| Interactive | 43 | 249 (242) | 12 | 22 | 192 (79–325) | 703 | 1174 |
| Point Address | 3 | 138 (35) | 99 | 99 | 147 (99–168) | 168 | 168 |
| Street Address | 922 | 400 (1154) | 5 | 30 | 149 (77–329) | 1012 | 15,609 |
| Total | 968 | 392 (1128) | 5 | 30 | 150 (78–328) | 998 | 15,609 |
| Version 2 Geocodes a | | | | | | | |
| Point Address | 866 | 199 (546) | 3 | 17 | 166 (59–227) | 467 | 15,567 |
| Street Address | 82 | 720 (1734) | 10 | 27 | 193 (76–451) | 3376 | 11,884 |
| Total | 948 | 244 (742) | 3 | 17 | 167 (62–236) | 508 | 15,567 |
| Improvement b | 934 | 148 (905) | −1765 | −302 | 9 (−80–133) | 615 | 12,102 |
| North Carolina | | | | | | | |
| Version 1 Geocodes a | | | | | | | |
| Interactive | 1 | 92 | - | - | - | - | - |
| Point Address | 43 | 97 (104) | 10 | 24 | 61 (37–102) | 320 | 550 |
| Street Address | 222 | 330 (1458) | 7 | 32 | 131 (81–252) | 682 | 19,324 |
| Total | 266 | 291 (1335) | 7 | 27 | 117 (67–225) | 616 | 19,324 |
| Version 2 Geocodes a | | | | | | | |
| Point Address | 227 | 107 (166) | 3 | 10 | 42 (23–122) | 481 | 1147 |
| Street Address | 31 | 838 (3451) | 24 | 27 | 178 (71–325) | 931 | 19,404 |
| Total | 258 | 195 (1213) | 3 | 11 | 46 (25–150) | 515 | 19,404 |
| Improvement b | 258 | 61 (224) | −1002 | −205 | 42 (−1–109) | 414 | 1316 |

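
The positional errors tabulated above are distances in metres between a geocode and a reference coordinate (rooftop or home GPS). The sketch below illustrates one way such a distance could be computed. It assumes latitude/longitude inputs and a great-circle (haversine) distance on a spherical Earth, which may differ from the distance calculation actually used in the study. The definition of "Improvement" as the Version 1 error minus the Version 2 error is likewise an assumption, since the table footnote is not reproduced here; under that reading, positive values mean the Version 2 geocode lies closer to the reference. The function names and example coordinates are hypothetical.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius; an assumption of this sketch


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def improvement_m(reference, geocode_v1, geocode_v2) -> float:
    """Assumed definition: Version 1 error minus Version 2 error, in metres."""
    err_v1 = haversine_m(*reference, *geocode_v1)
    err_v2 = haversine_m(*reference, *geocode_v2)
    return err_v1 - err_v2


# Hypothetical coordinates (lat, lon), not taken from the study data.
reference = (42.0000, -93.5000)   # e.g., a rooftop or GPS reading
geocode_v1 = (42.0020, -93.5000)  # earlier geocode, farther from the reference
geocode_v2 = (42.0005, -93.5000)  # later geocode, closer to the reference

gain = improvement_m(reference, geocode_v1, geocode_v2)
print(f"Improvement: {gain:.0f} m")
```
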

| Model | Positional Error of Version 2 Geocodes: Estimate | Positional Error of Version 2 Geocodes: p-Value | Improvement Ratio a: Estimate | Improvement Ratio a: p-Value |
|---|---|---|---|---|
| Iowa Rooftop | | | | |
| Intercept | 158.1 | <0.001 | 0.448 | <0.001 |
| Non-rural location b | −53.1 | 0.131 | 0.892 | 0.246 |
| Pop. Density (100 persons/km²) c | −4.78 | 0.048 | 0.986 | 0.025 |
| Moran’s I of residuals | 0.001 | 0.409 | | |
| Iowa GPS | | | | |
| Intercept | 564.1 | <0.001 | 0.856 | 0.006 |
| Non-rural location b | −507.8 | 0.037 | 0.708 | 0.079 |
| Pop. Density (100 persons/km²) c | 6.36 | 0.710 | 1.011 | 0.445 |
| Moran’s I of residuals | 0.004 | 0.140 | | |
| North Carolina GPS | | | | |
| Intercept | 375.2 | <0.001 | 0.504 | <0.001 |
| Non-rural location b | −281.6 | 0.606 | 0.955 | 0.906 |
| Pop. Density (100 persons/km²) c | −28.1 | 0.596 | 1.022 | 0.572 |
| Moran’s I of residuals | −0.003 | 0.944 | 0.019 | 0.141 |

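
The regression table reports Moran's I of the model residuals as a check for remaining spatial autocorrelation. For readers unfamiliar with the statistic, the following is a minimal sketch of the standard global Moran's I formula, I = (n / W) · Σᵢⱼ wᵢⱼ (xᵢ − x̄)(xⱼ − x̄) / Σᵢ (xᵢ − x̄)², where W is the sum of all weights. The spatial weights used in the study are not given here, so the binary adjacency matrix and the toy residuals below are purely illustrative, and the sketch does not cover how the reported p-values were obtained.

```python
def morans_i(values, weights):
    """Global Moran's I for a list of values and a square spatial weights matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    w_sum = sum(sum(row) for row in weights)
    # Weighted cross-products of deviations over all ordered pairs (i, j).
    cross = sum(
        weights[i][j] * dev[i] * dev[j]
        for i in range(n)
        for j in range(n)
    )
    sum_sq = sum(d * d for d in dev)
    return (n / w_sum) * (cross / sum_sq)


# Toy example: four sites along a line, each a neighbour of the adjacent site(s).
toy_residuals = [1.0, 2.0, 3.0, 4.0]
toy_weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

toy_i = morans_i(toy_residuals, toy_weights)
print(f"Moran's I = {toy_i:.3f}")  # 0.333 for this toy arrangement
```

Values near zero, such as those in the table, indicate little residual spatial autocorrelation, whereas the steadily increasing toy residuals produce a clearly positive value.
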
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Fisher, J.A.; Spaur, M.; Buller, I.D.; Flory, A.R.; Beane Freeman, L.E.; Hofmann, J.N.; Giangrande, M.; Jones, R.R.; Ward, M.H. Spatial Heterogeneity in Positional Errors: A Comparison of Two Residential Geocoding Efforts in the Agricultural Health Study. Int. J. Environ. Res. Public Health 2021, 18, 1637. https://doi.org/10.3390/ijerph18041637