Top Incomes and Inequality Measurement: A Comparative Analysis of Correction Methods Using the EU SILC Data
Abstract
:1. Introduction
2. Materials and Methods
2.1. Reweighting
2.2. Replacing
2.2.1. Pareto Distribution
2.2.2. Generalized Beta Type 2 Distribution
2.3. Corrected Gini for EU States and EU-Wide
3. Data
4. Results
4.1. Reweighting
4.2. Replacing
5. Discussion
Supplementary Materials
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Alvaredo, Facundo, and Thomas Piketty. 2014. Measuring top incomes and inequality in the Middle East: Data limitations and illustration with the case of Egypt. Working Paper 832. Giza, Egypt: Economic Research Forum. [Google Scholar]
- Alvaredo, Facundo, Lydia Assouad, and Thomas Piketty. 2017. Measuring Inequality in the Middle East 1990–2016: The World’s Most Unequal Region? WID. Available online: http://wid.world/document/alvaredoassouadpiketty-middleeast-widworldwp201715/ (accessed on 29 May 2018).
- An, Di, and Roderick J. A. Little. 2007. Multiple imputation: An alternative to top coding for statistical disclosure control. Journal of the Royal Statistical Society A 170: 923–40. [Google Scholar] [CrossRef]
- Atkinson, Anthony Barnes, and John Micklewright. 1983. On the reliability of income data in the family expenditure survey 1970–1977. Journal of the Royal Statistical Society Series A 146: 33–61. [Google Scholar] [CrossRef]
- Atkinson, Anthony B., Thomas Piketty, and Emmanuel Saez. 2011. Top incomes in the long run of history. Journal of Economic Literature 49: 3–71. [Google Scholar] [CrossRef]
- Bartels, Charlotte, and Maria Metzing. 2017. An Integrated Approach for Top-Corrected Ginis. IZA DP #10573. Available online: https://www.iza.org/publications/dp/10573/an-integrated-approach-for-top-corrected-ginis (accessed on 29 May 2018).
- Brachmann, Klaus, Andreas Stich, and Mark Trede. 1996. Evaluating parametric income distribution models. Allgemeines Statistisches Archiv 80: 285–98. [Google Scholar]
- Brzezinski, Michał. 2013. Parametric modelling of income distribution in Central and Eastern Europe. Central European Journal of Economic Modelling and Econometrics 3: 207–30. [Google Scholar]
- Burkhauser, Richard V., Shuaizhang Feng, and Jeff Larrimore. 2010. Improving imputations of top incomes in the public-use current population survey by using both cell-means and variances. Economic Letters 108: 69–72. [Google Scholar] [CrossRef]
- Burricand, Carine. 2013. Transition from survey data to registers in the French SILC survey. In The Use of Registers in the Context of EU-SILC: Challenges and Opportunities. Edited by Markus Jäntti, Veli-Matti Törmälehto and Eric Marlier. Luxembourg: European Union. [Google Scholar]
- Butler, Richard J., and James B. McDonald. 1989. Using incomplete moments to measure inequality. Journal of Econometrics 42: 109–19. [Google Scholar] [CrossRef]
- Cowell, Frank A., and Emmanuel Flachaire. 2007. Income distribution and inequality measurement: The problem of extreme values. Journal of Econometrics 141: 1044–72. [Google Scholar] [CrossRef] [Green Version]
- Cowell, Frank A., and Maria-Pia Victoria-Feser. 1996. Poverty measurement with contaminated data: A robust approach. European Economic Review 40: 1761–71. [Google Scholar] [CrossRef]
- Cowell, Frank, and Maria-Pia Victoria-Feser. 2007. Robust Lorenz curves: A semiparametric approach. Journal of Economic Inequality 5: 21–35. [Google Scholar] [CrossRef]
- Dagum, Camilo. 1980. The generation and distribution of income, the Lorenz curve and the Gini ratio. Economie Appliquee 33: 327–67. [Google Scholar]
- Davidson, Russell, and Emmanuel Flachaire. 2007. Asymptotic and bootstrap inference for inequality and poverty measures. Journal of Econometrics 141: 141–66. [Google Scholar] [CrossRef] [Green Version]
- Deaton, Angus. 2005. Measuring poverty in a growing world (or measuring growth in a poor world). Review of Economics and Statistics 87: 20–22. [Google Scholar] [CrossRef]
- European Commission. 2006. EU-SILC User Database Description. EU-SILC/BB D(2005). Luxembourg: European Commission. [Google Scholar]
- Frick, Joachim R., and Kristina Krell. 2010. Measuring Income in Household Panel Surveys for Germany: A Comparison of EU-SILC and SOEP. SOEP paper 265. Berlin, Germany: German Institute for Economic Research (DIW). [Google Scholar] [Green Version]
- Hlasny, Vladimir. 2016. Unit Nonresponse Bias to Inequality Measurement: Worldwide Analysis Using Luxembourg Income Study Database. LIS Technical Paper. Luxembourg City, Luxembourg: Luxembourg Income Study. [Google Scholar]
- Hlasny, Vladimir, and Vito Intini. 2015. Representativeness of Top Expenditures in Arab Region Household Surveys. UN-ESCWA/EDID Working Paper #11. Beirut, Lebanon: United Nations Beirut Economic and Social Commission for Western Asia. [Google Scholar]
- Hlasny, Vladimir, and Paolo Verme. 2015. Top Incomes and the Measurement of Inequality: A Comparative Analysis of Correction Methods Using Egyptian, EU and US Survey Data. ECINEQ Conference Paper 2015-145. Verona, Italy: Society for the Study of Economic Inequality. [Google Scholar]
- Hlasny, Vladimir, and Paolo Verme. 2017. The Impact of Top Incomes Biases on the Measurement of Inequality in the United States. ECINEQ Working Paper #2017-452. Verona, Italy: Society for the Study of Economic Inequality. [Google Scholar]
- Hlasny, Vladimir, and Paolo Verme. 2018. Top incomes and the measurement of inequality in Egypt. The World Bank Economic Review 32: 428–55. [Google Scholar]
- Jagielski, Maciej, and Ryszard Kutner. 2013. Modelling of income distribution in the European Union with the Fokker–Planck equation. Physica A: Statistical Mechanics and Its Applications 392: 2130–38. [Google Scholar] [CrossRef]
- Jäntti, Markus, Veli-Matti Törmälehto, and Eric Marlier, eds. 2013. The Use of Registers in the Context of EU–SILC: Challenges and Opportunities, Eurostat, European Commission. Luxembourg: Publications Office of the European Union, Available online: http://ec.europa.eu/eurostat/documents/3888793/5856365/KS-TC-13-004-EN.PDF (accessed on 29 May 2018).
- Jenkins, Stephen P. 2007. Inequality and the GB2 Income Distribution. IZA Discussion Papers 2831. Bonn, Germany: Institute for the Study of Labor (IZA). [Google Scholar] [Green Version]
- Jenkins, Stephen P. 2009a. Distributionally-sensitive inequality indices and the GB2 income distribution. Review of Income and Wealth 55: 392–98. [Google Scholar] [CrossRef]
- Jenkins, Stephen P. 2009b. GB2LFIT: Stata Module to Fit a GB2 Distribution to Unit Record Data. Colchester: Institute for Social and Economic Research, University of Essex. [Google Scholar]
- Jenkins, Stephen P. 2017. Pareto models, top incomes and recent trends in UK income inequality. Economica 84: 261–89. [Google Scholar] [CrossRef] [Green Version]
- Jenkins, Stephen P., and Philippe Van Kerm. 2015. Paretofit: Stata Module to Fit a Type 1 Pareto Distribution. April 2007. Available online: https://ideas.repec.org/c/boc/bocode/s456832.html (accessed on 29 May 2018).
- Jenkins, Stephen P., Richard V. Burkhauser, Shuaizhang Feng, and Jeff Larrimore. 2011. Measuring inequality using censored data: A multiple-imputation approach to estimation and inference. Journal of the Royal Statistical Society 174: 63–81. [Google Scholar] [CrossRef]
- Korinek, Anton, Johan A. Mistiaen, and Martin Ravallion. 2006. Survey nonresponse and the distribution of income. Journal of Economic Inequality 4: 33–55. [Google Scholar] [CrossRef]
- Korinek, Anton, Johan A. Mistiaen, and Martin Ravallion. 2007. An econometric method of correcting for unit nonresponse bias in surveys. Journal of Econometrics 136: 213–35. [Google Scholar] [CrossRef]
- Lakner, Christoph, and Branko Milanovic. 2013. Global Income Distribution from the Fall of the Berlin Wall to the Great Recession. World Bank Policy Research Working Paper Series #6719, Washington, DC, USA: World Bank. [Google Scholar]
- Litchfield, Julie A. 1999. Inequality: Methods and Tools. Article for World Bank’s Web Site on Inequality, Poverty, and Socio-Economic Performance. Available online: http://siteresources.worldbank.org/INTPGI/Resources/Inequality/litchfie.pdf (accessed on 29 May 2018).
- McDonald, James B. 1984. Some generalized functions for the size distribution of income. Econometrica 52: 647–63. [Google Scholar] [CrossRef]
- Mistiaen, Johan A., and Martin Ravallion. 2003. Survey Compliance and the Distribution of Income. Policy Research Working Paper #2956. Washington, DC, USA: The World Bank. [Google Scholar] [Green Version]
- Modarres, Reza, and Joseph L. Gastwirth. 2006. A cautionary note on estimating the standard error of the Gini index of inequality. Oxford Bulletin of Economics and Statistics 68: 385–90. [Google Scholar] [CrossRef]
- Pareto, Vifredo. 1896. La courbe de la repartition de la richesse, Ecrits sur la courbe de la repartition. de la richesse, (writings by Pareto collected by G. Busino, Librairie Droz, 1965), 1–15. Available online: https://www.cairn.info/ecrits-sur-la-courbe-de-la-repartition-de-la-riche--9782600040211.htm (accessed on 29 May 2018).
- Reiter, Jerome P. 2003. Inference for partially synthetic, public use microdata sets. Survey Methodology 29: 181–88. [Google Scholar]
- Singh, S. K., and Gary S. Maddala. 1976. A function for the size distribution of income. Econometrica 44: 963–70. [Google Scholar] [CrossRef]
- Törmälehto, V.-M. 2017. High Income and Affluence: Evidence from the European Union Statistics on Income and Living Conditions (EU-SILC). Statistical Working Papers. Luxembourg: Eurostat, Publications Office of the European Union. [Google Scholar]
1 | Similar methods include Lakner and Milanovic (2013) who combined corrections for unit non-response with corrections for measurement errors among top incomes, and calibrated the estimated Pareto distribution among top incomes using aggregate income information from national accounts data. Bartels and Metzing (2017) replaced the top one percent of incomes in the EU Statistics on Income and Living Conditions (SILC) surveys with Pareto estimates obtained using World Wealth and Income Database information. |
2 | Notable exception is that of income surveys based on national tax registers (Burricand 2013; Jäntti et al. 2013). |
3 | An illustration is in order. Suppose there are two income groups residing in two national regions. Region 1 has a higher share of the richer income group, and correspondingly a higher unit non-response rate, as the richer households are less likely to participate. As a result, mean income and income inequality index may or may not differ across the two regions. To correct the mean incomes and inequality indexes in each region as well as nationally, we wish to give more weight to each richer household until the sum of weights equals the underlying regional population, because behind each responding rich household there are more non-responding rich households. Equation (2) ‘blows up’ the weight of each responding household systematically, under the household-level behavioral rules specified in Equation (1), to fit the joint weighted mass of the responding households to the underlying regional population (Equation (3)). In one region the weighted mass of the responding households may exceed the underlying population, while in the other region it may fall short (because of the restrictions imposed in Equation (1)), but the nationwide sum of the weighted masses equals the underlying national population. |
4 | Exclusion of influential regions and EU member states was also tried, but is not discussed here, as it prevents the estimation of inequality for EU member states and EU at large. (These results are available on request.) |
5 | This decision also can be viewed as upholding the anonymity axiom that inequality measures be based only on the welfare aggregate itself, and independent of other household characteristics (Litchfield 1999). |
6 | Indeed, during GB2 estimation on the SILC with Eurostat sampling weights, the algorithm could not converge due to the bottommost income observations (2.50 Euro/year or less). This indicates atypical distribution of the bottommost incomes. Indeed, there are over 100 observations in the SILC with annual income less than 100 Euro, suggesting measurement errors. |
7 | Conversely, if all EU-wide incomes were used for estimation and replacement, this estimation and replacement would be largely done on the richest member states. Poorer states would then be represented with largely true incomes, while the richest states would be largely replaced, a dubious exercise. Moreover, while the Pareto law may hold for each EU member state, there is no guarantee that it would hold on incomes EU wide. |
8 | This analysis cannot be performed across multiple waves of SILC for several reasons: SILC was first collected only in 2004; Availability of countries has varied by wave; member states are not required to collect or publish sub-national non-response rates, and some statistical agencies have declined to compute them for the authors of this study citing lack of resources. |
9 | For more information on the SILC see: http://ec.europa.eu/eurostat/web/income-and-living-conditions. |
10 | Sampling weights in the SILC are distributed very widely, from essentially zero to 38,357.27 (mean 901.89, standard deviation 1050.31) in the 2011 round. This also suggests that comparing unweighted, SILC weighted, and our non-response probability weighted statistics may yield very different estimates. Moreover, sampling weights in the SILC are trimmed from below and from above to limit the extent to which individual observations can influence sample-wide statistics. To evaluate how much this trimming affects survey-wide results, we could compare results across alternative weighting schemes, or replace the trimmed weights with imputed values. |
11 | For Cyprus, Estonia, Germany, Iceland, Latvia, Lithuania, Luxembourg, Malta, Portugal, Slovenia and Switzerland, non-response rates are available by the degree of urbanization (db100 variable): dense, intermediate or thin level of population density. In 2009 for Slovakia and the UK, only nationwide non-response rates are available. |
12 | There are two editions of the EU-SILC survey produced by Eurostat. The Production Data Base (PDB) includes all available variables for responding and nonresponding households, while a Users Data Base (UDB) excludes nonresponding units and variables that could potentially allow identification of households. Related to our analysis, the PDB includes variables DB120, DB130 and DB135, defining responding and non-responding households, DB060-DB062, identifying primary sampling units, and DB075, separating the traditional non-response rate (households interviewed for the first time) from the attrition rate (households from the 2nd to the 4th interview). Unfortunately, the PDB is not shared with users for confidentiality reasons, so in this study we rely on the UDB datasets. |
13 | This includes 27 country indicators, with Hungary and Slovakia; Denmark and Norway; and Ireland and Island respectively sharing single indicators due to their empirical similarities, and The Netherlands serving as a baseline country. Alternatively, 12 regional indicators plus a baseline were considered, in agreement with geopolitical division of Europe and with empirical distribution of incomes, inequalities and non-response rates across countries. Refer to Table S16 in the online Supplementary Material. However, this less parameterized specification still produced inconsistent results due to the remaining systematic heterogeneity within the 13 European regions. |
14 | Note that applying the sampling weights to the distribution of incomes uncorrected for unit non-response reduces the Gini in the SILC by 5.7 percentage points. This happens because sampling weights in the SILC (correcting for various sampling issues including region-level non-response) and the estimated non-response correction weights are related negatively for most households. SILC sampling weights are higher among households with atypical incomes, and lower among households in the middle of the national income distributions. Hence, combination of the two sets of weights serves to dampen the effect of inflating the representation of atypical units with very low incomes. This dampening—which lowers the estimate of inequality—overshadows the double-correction for unit non-response among top-income households. |
15 | The high corrections of the Ginis in Belgium and Slovakia are not due to atypical distributions of incomes across national regions—Gini decomposition shows similar within- and between-region components (Table 2, two columns before the last column). Instead, it is due to exceptionally thin top-income distributions with rare extreme incomes. Tables S4–S9 show that the Pareto coefficients estimated among the highest quartile of incomes in Belgium (particularly from the 75–80th percentile to the 92–94th percentile) are the highest or among the highest of all EU member states. Pareto coefficients estimated for Slovakia are also above average, but not exceptionally high. These thin top ends of the income distribution suggest that the few observed extreme incomes, when reweighted, can have great influence on the measurement of inequality. This also explains the high standard errors on the Ginis. |
16 | The number of regions j selected for the estimation of Equation (3) determines the weight that the model attributes to within-region as opposed to between-regions information and this choice leads to significantly different estimations of the correction bias. Analyses using finer degrees of disaggregation have been found to typically yield lower corrections for unit non-response (Hlasny 2016; Hlasny and Verme 2017, 2018). In Table 1 and Table 2, however, the estimates come from a model on the entire set of 31 member states, using a fixed degree of disaggregation into 162 regions. |
17 | Analogous replacement was also done for the top 0.2, 0.5 and 0.7 percent of incomes. The effects of these replacements are smaller than those in Table 3, as they reflect the replacement of individual outlying observations. |
18 | The parametric Gini estimates among top incomes in Tables S4–S9 were calculated under smooth fitted Pareto curves rather than from any observations or fitted values per se. As a robustness check, we have re-estimated these Ginis by replacing top incomes with numbers drawn randomly from the corresponding Pareto distributions, and bootstrapping the exercise. These Ginis from random draws are very similar to the smooth-distribution Ginis in Tables S4–S9, but have slightly higher standard errors due to sampling errors. |
19 | To validate the procedure, we again compare the parametric and quasi-nonparametric Ginis in each state (refer to the previous footnote). Indeed, using random income draws from a generalized beta distribution produces a similar correction of the Gini as numerical inference of the Gini under a smooth distribution. |
20 | This analysis cannot be performed across multiple waves of SILC for several reasons: SILC was first collected only in 2004; Availability of countries has varied by wave; member states are not required to collect or publish sub-national non-response rates, and some statistical agencies have declined to compute them for the authors of this study citing lack of resources. |
Variable | Coefficient Estimate |
---|---|
Intercept | 12.377 (1.306) |
Log(income) | −1.047 (0.127) |
AT | −0.571 (0.156) |
BE | −1.386 (0.134) |
BG | −1.360 (0.414) |
CH | −0.112 (0.164) |
CY | 0.146 (0.311) |
CZ | −1.212 (0.227) |
DE | 0.042 (0.175) |
EE | −2.221 (0.232) |
EL | −1.611 (0.169) |
ES | −0.381 (0.187) |
FI | −0.248 (0.158) |
FR | −0.452 (0.145) |
HR | −3.035 (0.219) |
IE, IS | −0.794 (0.155) |
IT | −0.866 (0.133) |
LT | −1.790 (0.289) |
LU | −0.982 (0.144) |
LV | −2.249 (0.251) |
MT | −0.533 (0.294) |
DK, NO | −1.289 (0.135) |
PL | −1.583 (0.241) |
PT | −0.259 (0.348) |
RO | −0.869 (0.719) |
SE | −1.229 (0.133) |
SI | −1.284 (0.165) |
HU, SK | −1.330 (0.265) |
UK | −0.972 (0.141) |
Regions j | 31 member states |
Households i | 238,383 |
Uncorrected Gini | 45.10 (0.08) |
Gini using stat. agency weights | 38.91 (0.13) |
Gini corrected for unit non-response bias | 48.34 (0.84) |
Gini corrected for unit non-resp. bias, with sampling wts. | 42.61 (0.83) |
Unit non-response bias | 3.25 |
Bias (using sampling wts.) | 3.70 |
Member State | Sub-National Regions | Househds. | National Non-Response Rate (%) | Mean Equivalized Disposable Income (Euro) | State Gini, SILC Weighted Households | Pure within-Region Contrib. (%) | Pure between-Region Contrib. (%) | State Gini, SILC Weighted & Non-Response Corrected |
---|---|---|---|---|---|---|---|---|
Austria | 3 | 6183 | 22.6 | 23,713.37 | 27.59 (0.40) | 34.2 | 10.4 | 29.54 (0.75) |
Belgium | 3 | 5897 | 36.7 | 21,622.14 | 27.63 (0.91) | 39.5 | 10.7 | 47.61 (18.45) |
Bulgaria | 2 | 6548 | 7.5 | 3415.42 | 35.99 (0.58) | 49.3 | 14.5 | 37.88 (1.03) |
Croatia | 1 | 6403 | 43.3 | 5981.46 | 32.07 (0.36) | -- | -- | 32.81 (0.57) |
Cyprus | 3 | 3916 | 10.2 | 20,084.84 | 31.65 (1.02) | 44.3 | 16.8 | 36.41 (4.35) |
Czech Rep. | 8 | 8865 | 17.1 | 8402.77 | 25.91 (0.37) | 12.4 | 20.7 | 27.53 (0.57) |
Denmark | 1 | 5306 | 44.4 | 28,441.21 | 27.45 (0.55) | -- | -- | 31.00 (1.30) |
Estonia | 2 | 4980 | 26.0 | 6475.47 | 32.62 (0.55) | 54.4 | 12.3 | 34.15 (0.82) |
Finland | 4 | 9342 | 18.1 | 23,870.09 | 26.83 (0.37) | 24.7 | 20.3 | 29.71 (1.92) |
France | 21 | 11,348 | 18.0 | 24,027.78 | 30.84 (0.45) | 7.2 | 20.3 | 36.99 (1.72) |
Germany | 3 | 13,473 | 12.6 | 21,496.55 | 30.21 (0.33) | 41.0 | 7.1 | 32.41 (0.77) |
Greece | 4 | 5969 | 26.5 | 12,704.72 | 32.92 (0.57) | 27.6 | 17.0 | 35.67 (1.10) |
Hungary | 3 | 11,680 | 11.2 | 5146.29 | 26.86 (0.26) | 34.1 | 22.0 | 27.58 (0.31) |
Iceland | 2 | 3008 | 24.8 | 20,668.26 | 24.99 (0.64) | 53.8 | 6.0 | 28.00 (1.68) |
Ireland | 8 | 4333 | 19.6 | 39,831.65 | 32.92 (0.56) | 14.9 | 23.8 | 34.82 (1.10) |
Italy | 5 | 19,234 | 25.0 | 18,353.37 | 31.72 (0.29) | 21.6 | 23.7 | 35.56 (1.00) |
Latvia | 2 | 6549 | 18.9 | 5048.72 | 34.98 (0.39) | 49.0 | 17.0 | 36.46 (0.48) |
Lithuania | 2 | 5157 | 18.6 | 4588.81 | 33.02 (0.57) | 50.0 | 16.6 | 33.95 (0.65) |
Luxembourg | 3 | 5442 | 43.3 | 37,232.63 | 27.32 (0.47) | 35.5 | 12.7 | 29.42 (0.86) |
Malta | 2 | 4070 | 11.8 | 12,167.55 | 28.29 (0.44) | 81.5 | 1.9 | 28.95 (0.52) |
The Netherlands | 1 | 10,469 | 14.5 | 22,726.06 | 25.66 (0.34) | -- | -- | 27.01 (0.56) |
Norway | 1 | 4621 | 50.7 | 38,616.14 | 24.98 (0.59) | -- | -- | 29.39 (3.05) |
Poland | 6 | 12,861 | 14.9 | 5849.61 | 32.10 (0.39) | 17.5 | 10.1 | 34.32 (0.73) |
Portugal | 3 | 5740 | 7.9 | 10,462.34 | 35.07 (0.57) | 32.8 | 19.2 | 36.35 (0.72) |
Romania | 4 | 7614 | 3.3 | 2447.42 | 32.37 (0.39) | 25.0 | 13.3 | 32.58 (0.41) |
Slovakia | 4 | 5200 | 14.5 | 6983.48 | 27.30 (1.26) | 28.5 | 15.0 | 36.58 (9.42) |
Slovenia | 1 | 9246 | 23.8 | 12,714.07 | 25.84 (0.29) | -- | -- | 26.54 (0.38) |
Spain | 19 | 12,900 | 37.2 | 14,584.40 | 32.67 (0.26) | 6.7 | 23.6 | 33.03 (0.29) |
Sweden | 1 | 6694 | 36.5 | 23,727.45 | 25.76 (0.36) | 36.8 | 9.0 | 28.65 (2.52) |
Switzerland | 3 | 7502 | 24.0 | 39,327.92 | 30.28 (0.49) | 42.6 | 12.0 | 34.82 (1.60) |
UK | 37 | 8009 | 27.3 | 20,843.59 | 32.85 (0.57) | 3.1 | 24.5 | 39.32 (2.88) |
Wtd. Mean [EU wide] | 5.23 [162] | 7695 [238],[559] | 23.9 | 17,929.58 | 29.61 [38.91] | -- | -- | 32.99 [42.61] |
Correction of Extreme Observations | Sampling Correction | Sample Size k obs. Replaced | Gini | Bias in Original Gini (pc.pt.) |
---|---|---|---|---|
Estimation on top 15–hth percentile of incomes | ||||
Semi-param. estimation, h = 1% | Unweighted | = 33,380 k = 2400 | 44.92 (0.07) | −0.18 |
Eurostat weights | = 34,475 k = 2587 | 38.71 (0.13) | −0.20 | |
Semi-param. estimation, h = 5% i | Unweighted | = 23,841 k = 11,939 | 45.49 (0.11) | +0.39 |
Eurostat weights | = 24,517 k = 12,545 | 38.85 (0.14) | −0.06 | |
Semi-param. estimation, h = 6% i | Unweighted | = 21,463 k = 14,317 | 45.87 (0.16) | +0.77 |
Eurostat weights | = 21,994 k = 15,068 | 43.27 (11.01) | +4.36 | |
Estimation on top 20–hth percentile of incomes | ||||
Semi-param. estimation, h = 1% | Unweighted | = 45,295 k = 2400 | 44.98 (0.07) | −0.12 |
Eurostat weights | = 46,702 k = 2587 | 38.81 (0.13) | −0.10 | |
Semi-param. estimation, h = 5% | Unweighted | = 35,756 k = 11,939 | 45.72 (0.10) | +0.62 |
Eurostat weights | = 36,744 k = 12,545 | 39.66 (0.16) | +0.75 | |
Semi-param. estimation, h = 8% | Unweighted | = 23,860 k = 19,086 | 47.26 (0.18) | +2.16 |
Eurostat weights | = 29,302 k = 19,987 | 42.15 (0.47) | +3.24 | |
Estimation on top 25–hth percentile of incomes | ||||
Semi-param. estimation, h = 1% | Unweighted | = 57,218 k = 2400 | 45.04 (0.08) | −0.06 |
Eurostat weights | = 58,841 k = 2587 | 38.89 (0.14) | −0.02 | |
Semi-param. estimation, h = 5% | Unweighted | = 47,679 k = 11,939 | 46.09 (0.17) | +0.99 |
Eurostat weights | = 48,883 k = 12,545 | 40.12 (0.19) | +1.21 | |
Semi-param. estimation, h = 8% | Unweighted | = 40,532 k = 19,086 | 47.89 (0.20) | +2.79 |
Eurostat weights | = 41,441 k = 19,987 | 42.41 (0.46) | +3.50 |
Correction of Extreme Observations | Sampling Correction | Sample Size k obs. Replaced | Gini | Bias in Original Gini (pc.pt.) |
---|---|---|---|---|
Estimation on top 70–hth percentile of incomes | ||||
Semi-param. estimation, h = 1% | Unweighted | = 164,423 k = 2400 | 43.89 (0.05) | −1.21 |
Eurostat weights | = 167,932 k = 2587 | 37.64 (0.08) | −1.27 | |
Semi-param. estimation, h = 5% | Unweighted | = 154,944 k = 11,939 | 44.86 (0.08) | −0.24 |
Eurostat weights | = 158,093 k = 12,545 | 36.80 (0.09) | −2.11 | |
Semi-param. estimation, h = 10% | Unweighted | = 143,233 k = 14,317 | 44.73 (0.14) | −0.37 |
Eurostat weights | = 145,699 k = 15,068 | 35.57 (0.07) | −3.34 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Hlasny, V.; Verme, P. Top Incomes and Inequality Measurement: A Comparative Analysis of Correction Methods Using the EU SILC Data. Econometrics 2018, 6, 30. https://doi.org/10.3390/econometrics6020030
Hlasny V, Verme P. Top Incomes and Inequality Measurement: A Comparative Analysis of Correction Methods Using the EU SILC Data. Econometrics. 2018; 6(2):30. https://doi.org/10.3390/econometrics6020030
Chicago/Turabian StyleHlasny, Vladimir, and Paolo Verme. 2018. "Top Incomes and Inequality Measurement: A Comparative Analysis of Correction Methods Using the EU SILC Data" Econometrics 6, no. 2: 30. https://doi.org/10.3390/econometrics6020030
APA StyleHlasny, V., & Verme, P. (2018). Top Incomes and Inequality Measurement: A Comparative Analysis of Correction Methods Using the EU SILC Data. Econometrics, 6(2), 30. https://doi.org/10.3390/econometrics6020030