Propensity Score Weighting with Mismeasured Covariates: An Application to Two Financial Literacy Interventions
Abstract
:1. Introduction
As evidenced in this quote, the lack of financial literacy among micro-entrepreneurs has real consequences. Lusardi and Mitchell (2014) discuss the wider impacts of a lack of financial literacy, such as lower participation in financial markets, poor investment decisions, susceptibility to financial scams, inadequate retirement planning, increased credit card and mortgage debt, etc. See Morgan and Trinh (2019) for a recent example.“Walk into a typical micro or small business in a developing country and spend a few minutes talking with the owner, and it often becomes clear that owners are not implementing many of the business practices that are standard in most small businesses in developed countries. Formal records are not kept, and household and business finances are combined. Marketing efforts are sporadic and rudimentary. Some inventory sits on shelves for years at a time, whereas more popular items are frequently out of stock. Few owners have financial targets or goals that they regularly monitor and act to achieve.”
2. Measurement Error in Covariates
3. Empirics
3.1. Potential Outcomes Framework
3.2. Strong Ignorability
3.3. Strong Ignorability with Measurement Error
3.4. Estimation
- (i)
- is an i.i.d. sample of.andare bounded away from zero, andandare bounded forover compact support.
- (ii)
- ,, andforare γ-times continuously differentiable with bounded and integrable derivatives for some positive integer γ.
- (iii)
- K is differentiable to order γ and satisfiesAlso,is compactly supported on, symmetric around zero, and bounded.
- (iv)
- andas.
3.5. Case of Unknown Measurement Error Distribution
3.6. Inference
4. Simulation
5. Application
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
Appendix A. Derivation of Equations
Appendix A.1. Derivation of (6) and (7)
Appendix A.2. Derivation of (5)
Appendix A.3. Derivation of (11) and (12)
Appendix B. Proof of Theorems
Appendix B.1. Proof of Theorem 1
Appendix B.2. Proof of Theorem 2
Appendix C. Lemmas
References
- Abadie, Alberto, and Matias D. Cattaneo. 2018. Econometric Methods for Program Evaluation. Annual Review of Economics 10: 465–503. [Google Scholar] [CrossRef] [Green Version]
- Battistin, Erich, and Andrew Chesher. 2014. Treatment Effect Estimation with Covariate Measurement Error. Journal of Econometrics 178: 707–15. [Google Scholar] [CrossRef] [Green Version]
- Berge, Lars Ivar Oppedal, Kjetil Bjorvatn, and Bertil Tungodden. 2015. Human and Financial Capital for Microenterprise Development: Evidence from a Field and Lab Experiment. Management Science 61: 707–22. [Google Scholar] [CrossRef] [Green Version]
- Bissantz, Nicolai, Lutz Dümbgen, Hajo Holzmann, and Axel Munk. 2007. Non-parametric confidence bands in deconvolution density estimation. Journal of the Royal Statistical Society, Series B 69: 483–506. [Google Scholar] [CrossRef] [Green Version]
- Bound, John, Charles Brown, and Nancy Mathiowetz. 2001. Measurement Error in Survey Data. In Handbook of Econometrics. Edited by James Heckman and Edward E. Leamer. Amsterdam: Elsevier, vol. 5, pp. 3705–43. [Google Scholar]
- Bruhn, Miriam, Dean Karlan, and Antoinette Schoar. 2018. The Impact of Consulting Services on Small and Medium Enterprises: Evidence from a Randomized Trial in Mexico. Journal of Political Economy 126: 635–87. [Google Scholar] [CrossRef] [Green Version]
- Carroll, Raymond J., and Peter Hall. 1988. Optimal rates of convergence for deconvolving a density. Journal of the American Statistical Association 83: 1184–86. [Google Scholar] [CrossRef]
- Chen, Xiaohong, Han Hong, and Denis Nekipelov. 2011. Nonlinear models of measurement errors. Journal of Economic Literature 49: 901–37. [Google Scholar] [CrossRef]
- Cochran, William G., and Donald B. Rubin. 1973. Controlling Bias in Observational Studies: A review. Sankhyā: The Indian Journal of Statistics, Series A (1961–2002) 35: 417–46. [Google Scholar]
- Delaigle, Aurore, and Irene Gijbels. 2004. Bootstrap bandwidth selection in kernel density estimation from a contaminated sample. Annals of the Institute of Statistical Mathematics 56: 19–47. [Google Scholar] [CrossRef]
- Delaigle, Aurore, and Alexander Meister. 2007. Nonparametric regression estimation in the heteroscedastic errors-in-variables problem. Journal of the American Statistical Association 102: 1416–26. [Google Scholar] [CrossRef]
- Delaigle, Aurore, Peter Hall, and Alexander Meister. 2008. On Deconvolution with Repeated Measurements. The Annals of Statistics 36: 665–85. [Google Scholar] [CrossRef]
- Delaigle, Aurore, Jianqing Fan, and Raymond J. Carroll. 2009. A design-adaptive local polynomial estimator for the errors-in-variables problem. Journal of the American Statistical Association 104: 348–59. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Delaigle, Aurore, Peter Hall, and Farshid Jamshidi. 2015. Confidence bands in non-parametric errors-in-variables regression. Journal of the Royal Statistical Society, Series B 77: 149–69. [Google Scholar] [CrossRef]
- Dong, Hao, Taisuke Otsu, and Luke N. Taylor. 2020a. Estimation of Varying Coefficient Models with Measurement Error. Department of Economics, Southern Methodist University: RePEc. Dallas. [Google Scholar]
- Dong, Hao, Taisuke Otsu, and Luke N. Taylor. 2020b. Average Derivative Estimation under Measurement Error. Econometric Theory. in press. [Google Scholar] [CrossRef]
- Drexler, Alejandro, Greg Fischer, and Antoinette Schoar. 2014. Keeping It Simple: Financial Literacy and Rules of Thumb. American Economic Journal: Applied Economics 6: 1–31. [Google Scholar] [CrossRef] [Green Version]
- Fan, Jianqing. 1991a. On the optimal rates of convergence for nonparametric deconvolution problems. The Annals of Statistics 19: 1257–72. [Google Scholar] [CrossRef]
- Fan, Jianqing. 1991b. Asymptotic normality for deconvolution kernel density estimators. Sankhyā: The Indian Journal of Statistics, Series A 53: 97–110. [Google Scholar]
- Fan, Jianqing, and Elias Masry. 1992. Multivariate regression estimation with errors-in-variables: Asymptotic normality for mixing processes. Journal of Multivariate Analysis 43: 237–71. [Google Scholar] [CrossRef] [Green Version]
- Fan, Jianqing, and Young K. Truong. 1993. Nonparametric regression with errors in variables. The Annals of Statistics 21: 1900–25. [Google Scholar] [CrossRef]
- Fan, Yanqin. 1995. Average derivative estimation with errors-in-variables. Journaltitle of Nonparametric Statistics 4: 395–407. [Google Scholar] [CrossRef]
- Fernandes, Daniel, John G. Lynch Jr., and Richard G. Netemeyer. 2014. Financial Literacy, Financial Education, and Downstream Financial Behaviors. Management Science 60: 1861–83. [Google Scholar] [CrossRef]
- Fisher, Ronald A. 1935. The logic of inductive inference. Journal of the Royal Statistical Society 98: 39–82. [Google Scholar] [CrossRef] [Green Version]
- Frisch, Ragnar. 1934. Statistical Confluence Analysis by Means of Complete Regression Systems. Oslo: University Institute for Economics. [Google Scholar]
- Heckman, James J., and Richard Robb Jr. 1985. Alternative Methods for Evaluating the Impact of Interventions: An overview. Journal of Econometrics 30: 239–67. [Google Scholar] [CrossRef]
- Hong, Hwanhee, David A. Aaby, Juned Siddique, and Elizabeth A.Stuart. 2019. Propensity Score–Based Estimators With Multiple Error-Prone Covariates. American Journal of Epidemiology 188: 222–30. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Horowitz, Joel L. 2009. Semiparametric and Nonparametric Methods in Econometrics. Berlin: Springer. [Google Scholar]
- Horvitz, Daniel G., and Donovan J. Thompson. 1952. A Generalization of Sampling Without Replacement from a Finite Universe. Journal of the American Statistical Association 47: 663–85. [Google Scholar] [CrossRef]
- Imbens, Guido W., and Jeffrey M. Wooldridge. 2009. Recent Developments in the Econometrics of Program Evaluation. Journal of Economic Literature 47: 5–86. [Google Scholar] [CrossRef] [Green Version]
- Jakubowski, Maciej. 2010. Latent Variables and Propensity Score Matching: A Simulation Study with Application to Data from the Programme for International Student Assessment in Poland. Empirical Economics 48: 1287–25. [Google Scholar] [CrossRef] [Green Version]
- Kato, Kengo, and Yuya Sasaki. 2018. Uniform confidence bands in deconvolution with unknown error distribution. Journal of Econometrics 207: 129–61. [Google Scholar] [CrossRef] [Green Version]
- Kato, Kengo, and Yuya Sasaki. 2019. Uniform confidence bands for nonparametric errors-in-variables regression. Journal of Econometrics 213: 516–55. [Google Scholar] [CrossRef]
- Koopmans, Tjalling. 1937. Linear Regression Analysis of Economic Time Series. Amsterdam: Econometric Institute, Harrlem-de Erwen F Bohn N.V. [Google Scholar]
- Li, Qi, and Jeff Racine. 2003. Nonparametric estimation of distributions with categorical and continuous data. Journal of Multivariate Analysis 86: 266–92. [Google Scholar] [CrossRef] [Green Version]
- Lounici, Karim, and Richard Nickl. 2011. Global uniform risk bounds for wavelet deconvolution estimators. The Annals of Statistics 39: 201–31. [Google Scholar] [CrossRef] [Green Version]
- Lusardi, Annamaria, and Olivia S. Mitchell. 2014. The Economic Importance of Financial Literacy. Journal of Economic Literature 52: 5–44. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- McCaffrey, Daniel F., J. R. Lockwood, and Claude M. Setodji. 2013. Inverse Probability Weighting with Error-Prone Covariates. Biometrika 100: 671–80. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- McKenzie, David, and Chrisopher Woodruff. 2013. What Are We Learning from Business Training and Entrepreneurship Evaluations around the Developing World? World Bank Research Observer 29: 48–82. [Google Scholar] [CrossRef]
- McMurry, Timothy L., and Dimitris N. Politis. 2004. Nonparametric Regression with Infinite Order Flat-Top Kernels. Journal of Nonparametric Statistics 16: 549–62. [Google Scholar] [CrossRef]
- Meister, Alexander. 2009. Deconvolution Problems in Nonparametric Statistics. Berlin: Springer. [Google Scholar]
- Morgan, Peter J., and Long Q. Trinh. 2019. Determinants and Impacts of Financial Literacy in Cambodia and Viet Nam. Journal of Risk and Financial Management 12: 19. [Google Scholar] [CrossRef] [Green Version]
- Neyman, Jerzy S. 1923. On the Application of Probability Theory to Agricultural Experiments. Essays on Principles. Section 9. Annals of Agricultural Sciences 10: 1–51. [Google Scholar]
- Reiersøl, Olav. 1950. Identifiability of a Linear Relation Between Variables Which Are Subject to Error. Econometrica 18: 375–89. [Google Scholar] [CrossRef]
- Rosenbaum, Paul R., and Donald B. Rubin. 1983. The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika 70: 41–55. [Google Scholar] [CrossRef]
- Rosenbaum, Paul R. 2010. Design of Observational Studies. Springer Series in Statistics. New York: Springer Publishing Company. [Google Scholar]
- Roy, A. D. 1951. Some Thoughts on the Distribution of Earnings. Oxford Economic Papers 3: 135–46. [Google Scholar] [CrossRef]
- Rubin, Donald B. 1974. Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies. Journal of Educational Psychology 66: 688–701. [Google Scholar] [CrossRef] [Green Version]
- Rubin, Donald B. 1986. Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations. Journal of Business & Economic Statistics 4: 87–94. [Google Scholar]
- Rudolph, Kara E., and Elizabeth A. Stuart. 2018. Using Sensitivity Analyses for Unobserved Confounding to Address Covariate Measurement Error in Propensity Score Methods. American Journal of Epidemiology 187: 604–13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Schennach, Susanne M. 2016. Recent advances in the measurement error literature. Annual Review of Economics 8: 314–77. [Google Scholar] [CrossRef] [Green Version]
- Schennach, Susanne M. 2019. Convolution without independence. Journal of Econometrics 211: 308–18. [Google Scholar] [CrossRef] [Green Version]
- Stefanski, Leonard A., and Raymond J. Carroll. 1990. Deconvolving kernel density estimators. Statistics 21: 169–84. [Google Scholar] [CrossRef]
- VanderWeele, Tyler J., and Onyebuchi A. Arah. 2011. Bias Formulas for Sensitivity Analysis of Unmeasured Confounding for General Outcomes, Treatments, and Confounders. Epidemiology 22: 42–52. [Google Scholar] [CrossRef] [Green Version]
- Van Es, Bert, Shota Gugushvili, and Peter Spreij. 2008. Deconvolution for an atomic distribution. Electronic Journal of Statistics 2: 265–97. [Google Scholar] [CrossRef] [Green Version]
- Webb-Vargas, Yenny, Kara E. Rudoplh, David Lenis, Peter Murkami, and Elizabeth A. Stuart. 2017. An Imputation-Based Solution to Using Mismeasured Covariates in Propensity Score Analysis. Statistical Methods in Medical Research 26: 1824–37. [Google Scholar] [CrossRef]
- Wooldridge, Jeffrey M. 2010. Econometric Analysis of Cross Section and Panel Data. Cambridge: MIT Press. [Google Scholar]
1. | |
2. | |
3. | As and are independent and identically distributed under Assumption 7, we have . As is symmetric around zero under Assumption 7, , which implies . Thus, is obtained by plugging in the sample analogue of . |
4. | Data are available at https://www.aeaweb.org/articles?id=10.1257/app.6.2.1. |
5. | Note, we do not analyze one outcome included in Drexler et al. (2014). Savings amount is excluded from our analysis as the first-stage propensity score ignoring measurement error did not converge. |
Known Error Distribution | ||||||||
Estimator | ||||||||
Error Type | OS | SS | OS | SS | ||||
Sample Size | 250 | 500 | 250 | 500 | 250 | 500 | 250 | 500 |
Bias | 0.056 | −0.217 | −0.316 | −0.292 | 0.131 | −0.226 | −0.322 | −0.297 |
SD | 0.371 | 0.104 | 0.126 | 0.096 | 0.559 | 0.109 | 0.128 | 0.098 |
RMSE | 0.375 | 0.241 | 0.340 | 0.307 | 0.575 | 0.251 | 0.346 | 0.313 |
Unknown Error Distribution | ||||||||
Estimator | ||||||||
Error Type | OS | SS | OS | SS | ||||
Sample Size | 250 | 500 | 250 | 500 | 250 | 500 | 250 | 500 |
Bias | 0.094 | −0.216 | −0.315 | −0.292 | 0.185 | −0.224 | −0.321 | −0.297 |
SD | 0.451 | 0.103 | 0.127 | 0.096 | 0.690 | 0.108 | 0.129 | 0.098 |
RMSE | 0.461 | 0.239 | 0.340 | 0.307 | 0.714 | 0.249 | 0.346 | 0.313 |
Known Error Distribution | ||||||||
Estimator | ||||||||
Error Type | OS | SS | OS | SS | ||||
Sample Size | 250 | 500 | 250 | 500 | 250 | 500 | 250 | 500 |
Bias | −0.082 | −0.121 | −0.170 | −0.152 | −0.086 | −0.138 | −0.178 | −0.160 |
SD | 0.273 | 0.103 | 0.125 | 0.096 | 0.400 | 0.107 | 0.127 | 0.098 |
RMSE | 0.285 | 0.159 | 0.211 | 0.180 | 0.409 | 0.175 | 0.219 | 0.188 |
Unknown Error Distribution | ||||||||
Estimator | ||||||||
Error Type | OS | SS | OS | SS | ||||
Sample Size | 250 | 500 | 250 | 500 | 250 | 500 | 250 | 500 |
Bias | −0.077 | −0.120 | −0.169 | −0.152 | −0.064 | −0.137 | −0.177 | −0.160 |
SD | 0.333 | 0.103 | 0.126 | 0.095 | 0.523 | 0.108 | 0.128 | 0.098 |
RMSE | 0.342 | 0.158 | 0.211 | 0.180 | 0.527 | 0.174 | 0.219 | 0.188 |
Known Error Distribution | ||||||||
Estimator | ||||||||
Error Type | OS | SS | OS | SS | ||||
Sample Size | 250 | 500 | 250 | 500 | 250 | 500 | 250 | 500 |
Bias | −0.023 | −0.159 | −0.240 | −0.218 | 0.002 | −0.165 | −0.244 | −0.221 |
SD | 0.287 | 0.102 | 0.124 | 0.095 | 0.416 | 0.107 | 0.126 | 0.097 |
RMSE | 0.288 | 0.189 | 0.270 | 0.237 | 0.417 | 0.197 | 0.275 | 0.242 |
Unknown Error Distribution | ||||||||
Estimator | ||||||||
Error Type | OS | SS | OS | SS | ||||
Sample Size | 250 | 500 | 250 | 500 | 250 | 500 | 250 | 500 |
Bias | −0.008 | −0.158 | −0.239 | −0.217 | 0.021 | −0.163 | −0.243 | −0.221 |
SD | 0.337 | 0.102 | 0.125 | 0.094 | 0.538 | 0.106 | 0.127 | 0.097 |
RMSE | 0.338 | 0.188 | 0.269 | 0.237 | 0.539 | 0.195 | 0.274 | 0.241 |
Dependent Variable | Standard Accounting | Rule-of-Thumb | ||||
---|---|---|---|---|---|---|
OLS | IPW | IPW-ME | OLS | IPW | IPW-ME | |
Business and Personal Financial Practices | ||||||
Separate Business and | 0.00 | 0.00 | −0.05 | 0.08 | 0.08 | 0.08 |
Personal Cash | (0.03) | (0.03) | {0.02} | (0.03) | (0.03) | {0.10} |
{{0.14}} | {{0.24}} | |||||
524 | 532 | |||||
Keep Accounting Records | 0.04 | 0.04 | 0.05 | 0.12 | 0.12 | 0.08 |
(0.05) | (0.05) | {0.10} | (0.03) | (0.03) | {0.09} | |
{{0.25}} | {{0.21}} | |||||
524 | 533 | |||||
Separate Business and | 0.04 | 0.04 | 0.00 | 0.12 | 0.12 | 0.11 |
Personal Accounting | (0.05) | (0.05) | {0.08} | (0.03) | (0.03) | {0.14} |
{{0.24}} | {{0.25}} | |||||
521 | 532 | |||||
Set Aside Cash for | 0.07 | 0.07 | 0.08 | 0.12 | 0.12 | 0.13 |
Business Purposes | (0.03) | (0.03) | {0.14} | (0.04) | (0.04) | {0.14} |
{{0.19}} | {{0.23}} | |||||
524 | 532 | |||||
Calculate Revenues | 0.01 | 0.01 | −0.03 | 0.06 | 0.06 | 0.07 |
Formally | (0.04) | (0.04) | {0.04} | (0.03) | (0.03) | {0.15} |
{{0.16}} | {{0.23}} | |||||
524 | 533 | |||||
Business Practices | 0.07 | 0.07 | −0.15 | 0.14 | 0.14 | 0.13 |
Index | (0.06) | (0.06) | {−0.17} | (0.04) | (0.04) | {0.18} |
{{−0.14}} | {{0.15}} | |||||
525 | 534 | |||||
Any Savings | 0.01 | 0.01 | −0.03 | 0.08 | 0.08 | 0.05 |
(0.05) | (0.05) | {0.01} | (0.04) | (0.04) | {0.04} | |
{{0.19}} | {{0.10}} | |||||
529 | 540 |
Dependent Variable | Standard Accounting | Rule-of-Thumb | ||||
---|---|---|---|---|---|---|
OLS | IPW | IPW-ME | OLS | IPW | IPW-ME | |
Objective Reporting Quality | ||||||
Any Reporting Errors | −0.04 | −0.04 | −0.09 | −0.09 | −0.09 | −0.15 |
(0.04) | (0.04) | {−0.07} | (0.03) | (0.03) | {−0.15} | |
{{−0.04}} | {{−0.21}} | |||||
496 | 508 | |||||
Raw Profit | 918 | 914 | 1123 | 1094 | 1086 | 857 |
Calculation Difference | (746) | (726) | {1158} | (551) | (538) | {925} |
(RD$), weekly | {{262}} | {{690}} | ||||
273 | 289 | |||||
Absolute Value Profit | −324 | −368 | −633 | −642 | −641 | −840 |
Calculation Difference | (643) | (622) | {−595} | (471) | (460) | {−803} |
(RD$), weekly | {{−98}} | {{−919}} | ||||
273 | 289 | |||||
Business Performance | ||||||
Total Number | 0.08 | 0.08 | 0.37 | −0.04 | −0.04 | 0.11 |
of Employees | (0.09) | (0.09) | {−0.02} | (0.09) | (0.09) | {−0.01} |
{{0.61}} | {{0.98}} | |||||
523 | 533 | |||||
Revenue Index | −0.02 | −0.02 | −0.02 | 0.10 | 0.10 | 0.04 |
(0.05) | (0.05) | {−0.03} | (0.05) | (0.05) | {0.04} | |
{{−0.09}} | {{0.03}} | |||||
511 | 518 | |||||
Sales, | −649 | −686 | −543 | 604 | 665 | 220 |
Average Week (RD$) | (810) | (791) | {−619} | (942) | (941) | {−480} |
{{1663}} | {{130}} | |||||
367 | 386 | |||||
Sales, | −672 | −696 | −386 | 1168 | 1111 | 389 |
Bad Week (RD$) | (513) | (497) | {−291} | (538) | (533) | {641} |
{{116}} | {{−35}} | |||||
359 | 373 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Dong, H.; Millimet, D.L. Propensity Score Weighting with Mismeasured Covariates: An Application to Two Financial Literacy Interventions. J. Risk Financial Manag. 2020, 13, 290. https://doi.org/10.3390/jrfm13110290
Dong H, Millimet DL. Propensity Score Weighting with Mismeasured Covariates: An Application to Two Financial Literacy Interventions. Journal of Risk and Financial Management. 2020; 13(11):290. https://doi.org/10.3390/jrfm13110290
Chicago/Turabian StyleDong, Hao, and Daniel L. Millimet. 2020. "Propensity Score Weighting with Mismeasured Covariates: An Application to Two Financial Literacy Interventions" Journal of Risk and Financial Management 13, no. 11: 290. https://doi.org/10.3390/jrfm13110290
APA StyleDong, H., & Millimet, D. L. (2020). Propensity Score Weighting with Mismeasured Covariates: An Application to Two Financial Literacy Interventions. Journal of Risk and Financial Management, 13(11), 290. https://doi.org/10.3390/jrfm13110290