3.1. Theoretical Mechanism and Research Hypotheses
In light of the previous literature, this research proposes the theoretical mechanism for digital finance to raise residents’ total income and optimize income distribution (see
Figure 1). We present a detailed description of the theoretical mechanism and two major hypotheses in the following section.
As mentioned previously, it has been demonstrated that financial services enable residents to increase their production and sources of income, thus raising their total income [
9,
10,
40] and optimizing income distribution [
32,
41]. However, there are three pain points faced by traditional financial institutions when they offer services to users, especially long-tail users. The three pain points are illustrated on the left side of
Figure 1.
First, information asymmetry. It is challenging for long-tail users to provide standardized and qualified information necessary for financial risk analysis or even just to open bank accounts. Therefore, financial institutions have difficulty collecting reliable information from these users and identifying real demanders. Thus, financial services are no longer provided to these users in order to avoid the issues of adverse selection and moral hazard.
Second, geographical constrains. Users are required to travel to the branches of financial institutions in order to obtain traditional financial services. Due to difficulties in transmitting information as well as transportation and time expenses, traditional financial institutions are hesitant or not able to serve their users in remote areas.
Third, diseconomies of scale. For traditional financial services, the reliance on paper credentials, a large number of workers, and cash transactions are all necessary, which result in high costs. Thus, there will be no economies of scale for financial institutions if the cost of providing these services is prohibitively high, even if the demand for financial services remains high.
These three pain points make traditional financial institutions’ services less efficient and more expensive [
42], thereby undermining their capacity to improve residents’ income and income distribution.
Furthermore, the negative trickle-down effect and elite capture effect exacerbate these three pain points for traditional financial services. First, a negative trickle-down effect is often observed between urban and rural areas because the financial infrastructure in rural areas is far less developed than in urban areas [
43]. Hence, residents in urban areas (or economically developed areas) and large- and medium-sized enterprises obtain access to increasing financial resources, while the financial resources of rural residents and MSMEs are gradually encroached upon. Second, similar to the negative trickle-down effect, residents in rural areas are often affected by the elite capture effect. The elite capture effect refers to the phenomenon in which elite farmers with certain power and relationship advantages in rural areas have more chances to access financial resources such as agricultural loans [
44] and even tend to occupy the resources for themselves, which makes it more difficult for poor farmers to obtain financial services. Both the negative trickle-down effect and the elite capture effect lead to lower levels of financial accessibility, lower levels of financial efficiency, and higher costs for long-tail users. As a result, these long-tail users are not able to fulfill their own financial needs and improve their income.
Luckily, digital finance offers a solution to the three pain points. Three key characteristics of digital finance make it a valuable tool for addressing traditional financial institutions’ three pain points: high capacity for acquiring and processing information, instant cross-spatial information dissemination, and a low marginal cost effect. They are illustrated on the right side of
Figure 1.
First, digital finance technologies provide cost-effective and low-risk methods for acquiring large quantities of data [
45]. Consequently, financial institutions can acquire and process information efficiently and construct massive databases through algorithms [
35], thereby embedding massive information systems into financial services and enhancing the identification and selection of potential qualified users of financial services [
46]. Consequently, information asymmetry between users and financial institutions can be reduced, thus enhancing the financial efficiency of financial institutions as well as financial accessibility for users. Particularly, this feature can also mitigate the elite capture effect.
Second, digital finance has the ability of instant cross-spatial information dissemination [
47]. This can allow users to access a wide range of financial services online without visiting any branch; in other words, it can reduce geographical restrictions as well as regional and institution–user boundaries, thereby eliminating the negative trickle-down effect. Furthermore, it is worth noting that digital finance may have a spatial spillover effect when it comes to narrowing the geographical income gap (e.g., urban-rural income gap). Digital finance can eliminate geographical constraints, allowing financial institutions in one region to serve residents in its neighboring regions, which may directly contribute to narrowing income gaps within neighboring regions. In addition to the direct impact, the digital financial development in one region can also have an indirect impact on the income gaps within neighboring regions. The mechanism is that the digital financial development in one region first improves the digital financial development in its neighboring regions, and then the digital financial development in its neighboring regions will further narrow the income gaps within neighboring regions.
Third, due to the internet’s ubiquitous nature, digital finance has a low marginal cost effect, i.e., through the application of digital technologies, long-tail users can be provided with financial services at a reduced marginal cost [
48]. In this case, this group of users will also benefit from economies of scale. As a result of its extensive capabilities for acquiring and processing information, as well as its low marginal costs, digital finance can also be utilized by financial institutions as a means of integrating financial services into a variety of daily life scenarios. As a result, it may be possible to provide customized financial services tailored to the needs of particular individuals. Various financial services will be made available to consumers more frequently and efficiently in a more convenient manner, thereby improving residents’ income.
Accordingly, it is natural for us to find the research purpose, which is to verify the impact of digital finance on residents’ total income and income distribution, while considering its special effects. Thus, two major theoretical hypotheses are proposed as follows, including sub-hypotheses for each:
Hypothesis 1 (H1). Digital financial development has a positive impact on the increase in residents’ total income.
Hypothesis 1a (H1a). Digital financial development has a positive impact on the increase in urban residents’ total income.
Hypothesis 1b (H1b). Digital financial development has a positive impact on the increase in rural residents’ total income.
Hypothesis 2 (H2). Digital financial development has a positive impact on the optimization of residents’ income distribution.
Hypothesis 2a (H2a). Digital financial development has a total spatial effect on the optimization of residents’ income distribution.
Hypothesis 2b (H2b). Digital financial development has a spatial direct effect on the optimization of residents’ income distribution.
Hypothesis 2c (H2c). Digital financial development has a spatial spillover effect on the optimization of residents’ income distribution.
3.3. Econometric Model
To test Hypothesis 1 (H1), the study draws on previous studies [
50] to adopt a two-stage generalized method of moments (GMM) estimation approach with endogeneity treatment to construct an empirical model. This will enable us to learn the impact of digital financial development on residents’ total income. The GMM model exhibits several robust properties that contribute to its widespread use and reliable estimation results in empirical analysis. First, the GMM model does not require the assumption of homoscedastic errors, allowing it to effectively handle the presence of heteroscedasticity. Second, the GMM model can utilize appropriate instrumental variables to address endogeneity issues, leading to consistent estimators. Third, the GMM estimators maintain their robustness even in small sample sizes, in contrast to traditional least squares methods. Fourth, the GMM model does not necessitate the assumption of normally distributed or any other specific error term distributions, as long as the first-moment conditions are satisfied. Given the robustness of the two-stage GMM model, this study employs the two-stage GMM approach to conduct the regression analysis.
In order to deal with endogeneity issues, endogeneity treatment is considered in the GMM model by using instrument variables. The models are presented as follows:
where
i represents province,
t represents year. The dependent variables
and
denote urban and rural residents’ disposable income per capita, respectively, for province
i in year
t; the independent variable
represents the value of the Peking University Digital Financial Inclusion Index for province
i in year
t as a measure of regional digital financial development.
represents other control variables, as displayed in
Table 1 above;
represents the region fixed effect;
represents the random disturbance term. In addition, this study employs mobile phone ownership per capita as an instrumental variable to represent the digital financial development of a region. This variable is utilized in the regression analysis, maintaining consistency with the baseline model. The results of the exogeneity test, under-identification test, weak identification test, overidentification test, and endogeneity test for this instrumental variable can be found in
Section 4.1.
3.4. Spatial Econometric Model
This study tests Hypothesis 2 (H2) based on the spatial econometric model, which has three parts: measuring residents’ income distribution, spatial correlation, and spatial spillover effect. This will enable us to learn the impact of digital financial development on residents’ income distribution.
As mentioned above, the research uses the urban-rural income gap to represent residents’ income distribution. A large number of methods have been proposed to measure the urban-rural income gap, such as the Gini coefficient [
51,
52], the urban-rural income ratio [
53], and the Theil index [
54,
55]. Among them, Theil Index is adopted in this paper because, compared with other measures, it takes into account the urban-rural population ratio in different areas additionally. This can reduce the income gap underestimation, which is commonly caused when rural populations are much larger than urban populations (as is the case in China) [
54,
56]. The model is presented as below:
where
i represents province,
t represents year, and
j represents urban or rural area (1 for urban and 2 for rural).
represents the urban or rural residents’ total income for province
i in year
t;
represents the sum of the urban and rural residents’ total income for province
i in year
t;
indicates population for urban or rural area in province
i in year
t, and
represents the sum of the urban and rural population for in year
t. In general, a higher Theil index indicates a greater urban-rural income gap. The results of the Theil index for all Chinese provinces from 2011 to 2019 are presented in
Appendix A.
Spatial clustering refers to a typical spatial distribution in which regions neighboring each other show a high degree of similarity but differ significantly from those of other regions. For the purpose of detecting the spatial clustering of the digital financial development together with the urban-rural income gap, respectively, for 31 Chinese provinces, Moran’s
I index [
57] is applied to compute the spatial correlation of these two variables. There are two types of spatial correlation calculated by Moran’s
I index, i.e., global spatial correlation and local spatial correlation [
58,
59], whereas the former refers to the spatial correlation of all provinces as a whole and the latter refers to the spatial correlation of a specific province related to other neighboring provinces.
The global Moran’s
I index (
) is computed as the following equations:
where
i and
k represent different provinces,
n = 31;
represents the value of digital financial development or urban-rural income gap for province
i and
represents the value of digital financial development or urban-rural income gap for province
k;
represents the mean value of
;
represents the variance of
;
denotes spatial weight matrix, and it is constructed to test whether the variables are spatially correlated. Based on the fact that the sample of this research met the spatial continuity sample requirements, the neighboring weight matrix model is selected among all spatial weight matrix models [
60]. As a result, the provinces adjacent to each other are given a weight of 1, otherwise 0. In addition, although Hainan province is an island that is not adjacent to any other province, it is considered to be adjacent to Guangdong province in this paper. It is worth noting that global Moran’s
I range
, and its absolute value reflects the degree of spatial correlation (spatial clustering). Thus, an index value of 0 means that the spatial distribution of
and
is random, while an index value of 1 (or −1) indicates the highly positive (or negative) clustered spatial distribution of
and
.
The local Moran’s
I index (
) is computed as the following equations:
where the meanings of symbols are the same as those in Equations (4)–(7), while
denotes the relative value of variable
for province
i, and
denotes the relative value of variable
for the neighboring provinces of province
i. It is worth noting that although the value of local Moran’s
I indicates the degree of spatial correlation for province
i, scholars [
58,
61] pay more attention to
and
, especially their signs. It is because
and
can provide more detailed information than the single value of local Moran’s
I, and the combination of their signs can be used to observe four types of spatial correlation scenarios. This will be discussed in
Section 4.2.1.
In this paper, spatial effect refers to the spatial impact of digital finance on residents’ income distribution within the same or in different provinces. This is different from the spatial correlation illustrated above, which focuses on the spatial relationship of one variable itself. In that case, a spatial autoregression (SAR) model is constructed to calculate the spatial effect not only for the same province but also for its first-order and above adjacent provinces. The spatial autoregression (SAR) model excels at capturing spatial dependence, handling spatial heterogeneity, addressing endogeneity issues, quantifying spatial spillover effects, and accommodating different spatial scales, providing a robust framework for modeling spatial interactions, making it a powerful tool for spatial econometric analysis.
where
represents the coefficient of the dependent variable in the SAR model;
represents the total spatial effect;
represents the coefficient of the independent variable in the SAR model;
also represents the spatial weight matrix;
represents the region fixed effect;
represents the time fixed effect; and
represents the random disturbance term.
According to Lesage and Pace [
62], the total spatial effect in Equation (10) can be decomposed into two parts: spatial direct effect and spatial spillover effect. Spatial direct effect refers to the impact of one region’s independent variable on the dependent variables for the same province, while spatial spillover effect refers to the impact of one region’s independent variable on the dependent variables for its first-order and above adjacent regions. In Equation (10), the spatial direct effect is calculated by the mean value of all the elements on the main diagonal of the
matrix, while the spatial spillover effect is calculated by the mean value of all the elements on the non-main diagonal of the same matrix. Those are presented by the coefficients of the variables in the empirical results. The sum of the spatial direct effect and spatial spillover effect is equal to the total spatial effect.