1. Introduction
Governments worldwide are quite concerned about the distribution and unrestricted flow of data elements as a result of the emergence of the digital era and data-intensive research methodologies [
1]. In 2022, China proposed reasonably reducing the barrier for market participants to obtain data and improving their capacity to share data and make them universally accessible. Government data openness, based on land, resources, market supervision, livelihood development, and other administrative subjects, opens data to the community in order to achieve the value of the economic, political, and social three-dimensional expectations of a data set [
2]. Not only does it enhance resource efficiency, environmental protection, and sustainable development, but it also provides statistical support for attaining sustainable development goals. The utilization of land resources is crucial for both economic and social progress [
3]. As industrialization and urbanization continue to expand, there is a growing need for land, which is limited by both natural conditions and planning regulations [
4]. Enhancing the effectiveness of green space utilization in urban areas can address the limitations of land resources and reconcile the imbalance between supply and demand. Additionally, it can facilitate the simultaneous advancement of both high economic productivity and a healthy ecological environment [
5,
6]. Previous studies have clarified the impact of open public data on urban green innovation. Expanding on this, does open access to public government data have a similar effect on CGLUE? To what extent does it have an impact? Further, what are the key pathways that contribute to improved CGLUE? Are there any heterogeneous effects? These are all questions worthy of in-depth discussion and investigation. Therefore, this study focused on analyzing the correlation between government data openness and CGLUE and exploring the mechanism between these two variables. This work provides valuable theoretical insights and has practical significance for optimizing the allocation of data elements in China and encouraging the high-quality and sustainable development of land resources.
Previous studies pertaining to the subject of this paper have primarily focused on two areas: open government data and enhancing CGLUE. When studying OGD, scholars have mainly discussed the evaluation of government data openness and its effects [
7,
8]. First, when evaluating OGD, previous studies have mainly analyzed the security governance of public data openness within the framework of the digital economy and public access to the data factor market model [
9]. Secondly, in previous research on the effect of opening government data, scholars have mainly conducted in-depth discussions from macro and micro perspectives. The macro component has focused on the influence of open government data on regional green technology innovation, high-quality economic growth, urban entrepreneurial energy, and optimization of the business environment [
10,
11,
12,
13]. The effects of open government data on enterprise digital transformation, private enterprise performance, and enterprise innovation have been considered at a small scale [
14,
15,
16]. Furthermore, this has confirmed the beneficial impacts that the transmission of data in the form of open government data has on China’s economic and social progress. Studies in the field of enhancing CGLUE have revealed strong correlations between urban attributes, such as the economic development level, population density, industrial structure, urban scale, spatial structure, and the efficiency of green utilization in urban areas [
17,
18,
19]; some scholars have studied the effects of economic growth targets, digital infrastructure, energy utilization, environmental protection policies, and other types of socio-economic factors on CGLUE [
20,
21,
22,
23].
As for research on the relationship between OGD and CGLUE, there are not many direct research results on these two variables in the theoretical field at present. However, with the continuous improvement in the theoretical and empirical system of domestic research on the relationship between government data openness and CGLUE, some scholars have started to pay attention to the green effect of government data openness from an indirect perspective. In previous studies, public data have been widely used in scenarios such as residents’ environmental participation, urban traffic management, and green land utilization, generating considerable benefits [
24,
25,
26]. For example, in the field of resident participation, cities have launched apps based on public environmental data, through which residents can learn about air quality, noise levels, and other information about their environment [
27], which motivates them to participate in environmental protection activities. In the area of urban traffic management, cities use public transport data and real-time traffic monitoring to optimize bus routes and schedules and reduce traffic congestion [
28], thereby reducing carbon emissions. To improve efficiency and environmental friendliness in the area of land use, governments use GIS and public land data to facilitate scientific land-use planning and improve the green utilization of land [
29,
30]. They can also use public data to identify damaged ecological areas, formulate ecological restoration plans, monitor changes in forest cover, and promote large-scale afforestation projects. It can be seen that previous studies in China have analyzed the green effect of open government data from an indirect perspective, and there are not many direct studies on the effect of bringing an open government data platform online on the green land-use efficiency of urban land. This provided a new space for exploration when carrying out this study.
Because of this, this study examined the effects of OGD on CGLUE and the mechanism of the role of actual urgency using a quasi-natural experiment based on the government’s online data platform. The preservation and wise use of land resources have become crucial concerns for sustainable urban development as a result of the acceleration of urbanization. In order to realize the green and sustainable use of land resources, urban planning and land use can be improved by researching the effects of open public data on land use.
Compared with the existing literature, the marginal contributions of this paper are as follows: First, regarding the subject of this research, this study examined the impact of open public data on CGLUE, which expanded the existing literature on the effect of open public data and addressed gaps in the existing research content. Secondly, it theoretically clarified the effect of OGD on CGLUE, which not only deepened the research on expanding the green environmental benefits of open government data but also identified a new path for improving CGLUE. Thirdly, it provided policy insights and reference suggestions for the balanced development of the economy and the environment, guided by the national data policy of open government data.
3. Research Design
3.1. Data Source and Description
This study adopted panel data from 267 cities at the prefecture level and above in China from 2011 to 2020. Among them, data related to economic growth targets were extracted from government work reports at all levels, green patent data came from the patent search platform of the State Intellectual Property Office, and the rest of the raw data came from official documents or websites such as The China Urban Statistical Yearbook, The China Urban Construction Statistical Yearbook, statistical bulletins, the EPS database, etc. Missing values for individual cities were supplemented according to the average growth rate. It should be noted that because some cities in Tibet, Xinjiang, and other provinces had more missing data for key variables and the missing information could not be accurately restored via interpolation, this study excluded these cities to ensure the reliability of the research results.
3.2. Variable Selection
3.2.1. Explained Variable: Urban Green Land-Use Efficiency (CGLUE)
In this paper, in accordance with Tone [
44] and Tan et al. [
45], from the perspective of inputs and outputs, we use the SBM model under the assumption of constant returns to scale to measure the CGLUE, which can comprehensively consider the non-expected outputs and correct the slack variables so as to greatly improve the accuracy and practicability; the expression is as follows:
In Equation (1), is the CGLUE with a value range between (0, 1); , , are the quantities of inputs, desired, and undesired outputs; and their relaxation vectors are characterized by , respectively. A relaxation vector is a variable used in optimization problems to make the constraints more flexible. It allows the model to have some room for adjustment while satisfying strict constraints, making the solution more realistic and feasible. The relaxation vectors , and are used to adjust the input, desired output and non-desired output constraints so that answers are feasible in the model; are input–output variables; reflects decision unit weights; t indicates a time period or a specific decision-making unit (DMU), showing that the variable changes over time or is associated with different time units; k is used as an index to identify each DMU, while typically denotes the set of valid or reference decision units associated with the k’th decision unit.
The input indicators for measuring the CGLUE in this paper are selected from land, capital and labor factors, which are characterized by a built-up area, fixed capital stock, and the number of employees in the secondary and tertiary industries, respectively; the desired outputs take into account the economic, social and environmental benefits at the same time, which are measured by the real added value of the secondary and tertiary industries, the average wage of employees and the greening coverage of the built-up area, respectively. At last, non-desired outputs consider pollution emissions and are characterized by environmental pollution intensity; they are calculated using the entropy method to assign weights to industrial wastewater, soot and sulfur dioxide emissions after taking them into account. The details of each indicator are shown in
Table 1.
3.2.2. Explanatory Variable: Open Government Data (DID)
The local government promoted the online data platform of each prefecture-level city, which provided the ideal quasi-natural experimental scenario for the area in this study. This study set the model value logic as follows: if a city successfully went online with a public data platform during the sample period and the observation period included the platform’s opening year and after, the city’s DID took a value of 1, which indicated that it was affected by the policy; otherwise, the DID took a value of 0, which indicated that it was not affected by the policy. In this paper, the time when each prefecture-level city first went online with a public data platform was set as the time of policy implementation.
3.2.3. Control Variables
To ensure that the degree of urban growth had no bearing on this study’s findings, relevant factors were included at the geographical level.
Specifically, the following geographic-level control variables were included:
The level of economic development (GDP, the logarithm of gross domestic product per capita). Cities with higher levels of economic development typically invest more resources in land use, infrastructure development, and public services [
45]. This may lead to an increase in land-use efficiency or a shift to green use, so the inclusion of GDP per capita as a control variable helped to exclude its direct effect on CGLUE.
The level of financial development (FIN, selected to reflect the year-end deposit and loan balances of the financial institutions in a city as a share of GDP). The maturity of the financial market affects firms’ investment decisions. Cities with high levels of financial development attract more green investments and sustainable projects [
46], directly affecting CGLUE. Controlling for this variable helped to determine the impacts of other factors on CGLUE.
The Internet penetration rate (INTEGRATE, the logarithm of the number of people using the Internet). The Internet provides a platform for public participation in policy formulation and monitoring. The greater the number of people using the Internet, the greater the public’s interest and participation in government data openness [
47], which may lead to more effective government measures of land management and green utilization. Therefore, controlling for this variable facilitated better analysis of the independent impact of open government data on CGLUE.
The urbanization level (URBAN, selected to characterize the urban population as a proportion of the total population). Increasing urbanization levels are usually accompanied by significant changes in land-use patterns. As urbanization progresses, the development and use of land become more intensive and diverse [
48], which may affect the efficiency of green land use. Therefore, controlling for the level of urbanization helped us to more accurately assess the independent impact of GOV on land-use efficiency.
Government intervention (GOV, expressed using the share of general government budget expenditure in the GDP). The share of government budget expenditure reflects the government’s financial investment in public services, infrastructure development, and environmental protection [
49]. A higher proportion of budget expenditure may imply that the government has adequate financial support to promote green land use. Therefore, controlling for this variable helped us to more accurately assess the impact of government data openness on the efficiency of green land use.
The degree of industrialization (IND, the ratio of value added by secondary industry to the GDP of each city) was chosen to portray the level of industrialization development. The degree of industrialization directly affects land use and layout [
49]. Highly industrialized cities usually need large amounts of land for industrial facilities, transport infrastructure, etc., which may lead to decreases in land-use efficiency. Controlling for the degree of industrialization helped us to better understand the independent impact of open government data on green land-use efficiency.
Infrastructure (INF, selected to represent the road area per capita in cities). The road area per capita directly reflects the level of infrastructure development in a city. The construction and layout of roads affect land-use efficiency and functional zoning [
50], which in turn affect green use efficiency. Controlling for this variable helped us to better understand the impact of open data on land-use efficiency.
Patents for green inventions in cities (GPI, the logarithm of the number of green inventions patented in a city in a year). An increase in the number of green patents is usually combined with economic growth and can contribute to the formation of a green economy [
51]. Therefore, considering the number of green patents when discussing the impact of open government data on land-use efficiency helped us to understand the relationship between economic growth and environmental sustainability.
The strength of environmental regulation (ER, measured using the number of words in sentences containing environmental words for each city as a proportion of the total number of words in entire government work reports). The strength of environmental regulation directly affects decisions made by companies and individuals on land use. Stronger environmental regulation usually drives land use in a more sustainable direction [
44], thereby affecting the efficiency of green land use. Therefore, the impact of environmental regulation needed to be controlled when analyzing the impact of open government data on green land-use efficiency.
The energy consumption level (PEGY). (PEGY, with reference to the standard coal conversion coefficients for each energy source in the
China Energy Statistical Yearbook, the artificial, total natural gas supply, total liquefied petroleum gas supply and total electricity consumption of society as a whole was calculated for each prefecture-level city, converted to 10,000 tons of standard coal and summed up to give the energy consumption, which was measured by taking the logarithm of the number of tons of standard coal). The level of energy consumption directly affects the efficiency and manner of land use. A high energy consumption level may imply high-intensity development and utilization of land resources [
52], thereby affecting green land-use efficiency. Therefore, when analyzing the impact of open government data on the efficiency of green land use, controlling for the level of energy consumption helped isolate the impacts of other factors.
Descriptive statistics for the main variables addressed in this article are displayed in
Table 2.
3.3. Model Design
To investigate the influence of OGD on CGLUE, we built a benchmark regression model as follows:
In Equation (2), the explanatory variable represents the CGLUE of city i in year t. is the implementation of OGD by city i in year t. captures the impact of open government data on CGLUE. denotes the control variable. ε is the random error term of the model. In addition, the following basic treatments were performed: Initially, the modified standard errors, grouped at the city level in the regression, were used in all regression models. Second, in order to absorb fixed effects as much as possible, we controlled for both time () and city () fixed effects.
3.4. Benchmark Regression
The findings estimating the influence of OGD on CGLUE are shown in
Table 3. Specifically, column (1) displays the estimation results without fixation or the addition of control variables. The estimation results with the control variables gradually included, accounting for time and individual-level variations, are shown in columns (2) through (11). Column (11) is the basis for the follow-up analyses.
The accuracy of Hypothesis 1 is confirmed by the regression results in columns (1)–(11) in
Table 3, which show that all estimated DID coefficients passed the significance test at the 5% level. This suggests that the level of CGLUE is greatly increased by using OGD. Open government data can provide more information and references for urban planners and decision-makers, helping them to formulate land-use policies more scientifically. This fosters a certain degree of equilibrium between urban expansion and the preservation of natural resources, thereby encouraging the establishment of environmentally friendly policies [
53]. Simultaneously, the transparency of government data aids in comprehending the present state and requirements of land utilization, facilitating the efficient planning and allocation of land resources to attain the most advantageous distribution and utilization of land [
54].
3.5. Robustness Tests
3.5.1. Parallel Trend Test
Prior to using the DID model to ascertain the correlation between government data transparency and the efficiency of urban green land use, it was necessary to ensure that the experimental and control groups adhered to the premise of parallel trends. Thus, this research utilized the event study method to perform a parallel trend test. The
y-axis represents the impact of opening government data on CGLUE, while the
x-axis represents the year of policy implementation.
Figure 1 illustrates that the influence on CGLUE was negligible prior to the policy’s implementation but became a significant factor afterwards.
3.5.2. Placebo Test
This work employed an indirect placebo test, based on research conducted by other researchers, to mitigate the impact of unobservable variables on the research findings. This test was repeated 500 times for greater accuracy. Specifically, a pseudo-variable False_DID was constructed for each randomly assigned city. It was then used to replace the true DID and re-estimate the baseline model, and this step was iterated 500 times.
Figure 2 shows the distribution of T-values for the estimation based on the False_DID. In
Figure 2, it can be seen that the T-values of the 500 placebo tests were normally distributed and centered on 0. Only a few results were close to the true T-value (the red dashed line), which suggests that some unobserved macroscopic or time-varying factors did not affect the main results of this study.
3.5.3. Propensity Score Matching
Through the random selection of treatment and control groups, propensity score matching successfully mitigates the endogeneity problem resulting from selectivity bias. Additionally, propensity score matching can lessen the heterogeneity bias in treatment and control groups caused by other unobservable characteristics. Thus, propensity score matching was used in this study for robustness testing, and the particular model was built as follows:
We utilized k nearest zero matching with k = 4 to determine the propensity score, denoted as P. Based on the regression results presented in
Table 4, it is evident that the variable DID continued to have a significant impact on the variable CGLUE. In addition, we employed caliper and kernel matching to conduct additional robustness checks. Based on the regression results, our benchmark regression was still confirmed.
3.5.4. Excluding Interference from Other Policies
To eliminate the influence of other policies on the efficiency of green land use in urban areas, we gathered and organized relevant policies, such as Big Data Pilot Cities (DID1) and Smart Cities (DID2), that coincided with the time frame of our study. We then incorporated dummy variables for these policies into the basic regression analysis. The estimated coefficients of the Difference-in-Differences (DIDs) method in columns (1)–(3) of
Table 5 passed the significance test at the five percent level. These coefficients remained positive even after accounting for the effects of these policies, both individually and together. This indicates the strength and reliability of the findings.
3.6. Mechanism Analysis
The above empirical analyses verified the positive effect of OGD on CGLUE. In a theoretical analysis, this paper argues that government data openness promotes CGLUE by increasing urban green innovation capacity and public environmental concern. Urban green innovation capacity (INNOVATION) was measured using urban green patent applications, while public environmental concern (PEC) was measured using the public haze search index.
We confirmed the validity of this statement, and
Table 6 presents the outcomes of an examination of how OGD impacts CGLUE.
Table 6 displays the effect of OGD on green innovation capability. Column (1) represents the impact without the control variables, whereas column (2) represents the impact with the control variables. The analysis revealed that the estimated coefficient of the DID was statistically significant and positive, with a significance level of at least 10%. This suggests that the openness of government data has a positive impact on a city’s green innovation potential, supporting Hypothesis 2. According to Guo et al. [
55], the unrestricted movement of data components facilitates the swift implementation of scientific and technical advancements. With the support of data, new technologies and concepts can be verified faster in urban practice, thereby improving CGLUE. Meanwhile, the openness of data elements provides a common information platform for all parties in society. Experts from different fields can cooperate based on shared data resources and can jointly explore innovative paths for green urban land use [
56].
Columns (3) and (4) in
Table 6, respectively, indicate how public environmental concerns were affected by OGD with and without the control factors. It can be seen that the estimated coefficients passed at the 10% significance level, suggesting that open government data increase public environmental concern by promoting CGLUE. Therefore, Hypothesis 3 was verified. Public concern for the environment can stimulate demand for green technologies and innovations [
57]. OGD makes it easier for the public to understand urban environmental problems and resource utilization, triggering public demand for innovative technologies and methods to solve these issues [
58]. Due to this need, businesses and academic institutions have increased their funding for studying and developing green technologies, encouraging the growth of green innovation and enhancing CGLUE. At the same time, the public’s increased concern for the environment has prompted the government to develop environmental policies and plans that are more in line with the public’s needs [
23]. When formulating urban land-use policies, the government will give more consideration to the public’s demands and expectations and will listen to the public’s opinions and suggestions so as to formulate more scientific policies that are more in line with the concept of green development and improve CGLUE.
3.7. Heterogeneity Analysis
The preceding section primarily confirmed, from the standpoint of the entire sample, the promotional effect of government data availability on CGLUE. In order to explore potential areas for policy focus to further enhance CGLUE through data openness, this study delved deeply into the heterogeneity of government data openness affecting CGLUE based on the geographical characteristics of location, digital financial characteristics, and resource-based city characteristics.
3.7.1. Location Heterogeneity
The location characteristics (EAST) dummy variable created in this study had values of 1 for cities in the eastern area and 0 for cities in the central and western regions. The results are displayed in columns (1) and (2) of
Table 7, respectively. The coefficient of the DID had a more pronounced impact on the CGLUE in the eastern cities. This was because the eastern cities typically exhibit greater levels of economic growth and population density, with greater pressure on land resources [
59]. Open government data can help these cities to more effectively address the challenges of limited land resources and high environmental pressure and can guide these cities towards green and sustainable development. Meanwhile, in the eastern region of China, the national policy orientation towards green development and sustainable city building is clear [
60]. Government data openness is more likely to be supported by the government and society in such a policy context and is more likely to be incorporated into urban development planning and practice, thereby more significantly influencing improvements in CGLUE.
3.7.2. Heterogeneity of Digital Financial Characteristics
This study measured the level of digital financial development (DF) in cities using their digital financial inclusion knowledge. A value of 1 was assigned to scores above the median DF, and a value of 0 was assigned to those below it. The outcomes are displayed in columns (3) and (4) of
Table 7, respectively, and the DID coefficients were proven to be more significant for CGLUE with larger amounts of digital finance. This was because cities with higher levels of digital finance tend to prioritize the use and utilization of data and are more likely to embrace data-driven decision-making models in land planning, resource management, and environmental protection. Open government data provide these cities with more accurate data support, which can help decision-makers better analyze and assess land use and formulate more scientific and effective green development strategies [
61]. At the same time, cities with higher levels of digital finance tend to have more developed financial systems and capital markets and can more easily obtain financial support and investment. Government data openness can increase the transparency of the urban environment and land-use situation, enhance investor confidence in green projects [
62], and promote improvements in CGLUE.
3.7.3. Resource Endowment Heterogeneity
This work aimed to create dummy variables, referred to as RBCs, to represent urban resource endowment. These variables were constructed using a list of cities that are known for their resource-based economies. More precisely, RBCs were represented as 1 to indicate resource-based cities or 0 to indicate non-resource-based cities. The coefficients of the DID analysis had a greater impact in non-resource-based cities, as shown in columns (5) and (6) of
Table 7. This was because non-resource-based cities lack the advantages of natural resources and therefore rely more on innovation and development to drive economic growth and ensure sustainability. Open government data help these cities achieve sustainable development [
63]. This assistance helps guide cities in adopting environmentally friendly and sustainable approaches to land use. Additionally, the utilization of OGD has the potential to improve the reputation and level of transparency in cities that do not rely on natural resources [
64,
65]. This, in turn, can help attract skilled individuals and investments. Urban innovation and development, as well as green land use and sustainable development, can be stimulated by influxes of talent and cash.
5. Conclusions
This study empirically examined the impact of government data openness on CGLUE. It was found that government data openness significantly improved CGLUE. Further mechanism analyses showed that government data openness improved CGLUE by increasing public environmental concern and enhancing the urban green innovation capacity. Heterogeneity analyses showed that the effect of open government data on CGLUE was more obvious in eastern cities, cities with higher levels of digital finance development, and non-resource-based cities.
Based on the aforementioned findings, this report presents the following specific policy recommendations:
First, government departments should establish a unified open government data platform to ensure data standardization, easy access, and open sharing and to promote data interconnectivity among city departments. In addition, government departments at all levels should take the initiative to disclose relevant environmental and land-use data and encourage private institutions and individuals to use these data for green project development and innovation. On the other hand, a system for evaluating CGLUE should be established, and urban green projects should be evaluated and monitored on a regular basis so that policies and measures can be adjusted in a timely manner.
Secondly, education and publicity campaigns targeting environmental protection and green utilization should be carried out to raise public awareness and bring attention to environmental issues. In addition, a green innovation fund can be established to encourage enterprises, scientific research institutions, and others to develop green technological innovations and carry out projects. Collaboration among governmental departments, corporations, research institutes, and social organizations should be facilitated to collectively enhance CGLUE.
Finally, for eastern cities, the construction of open government data platforms should be strengthened to improve the comprehensiveness of the data and ensure they are updated in real time to facilitate urban management and meet citizens’ needs. For cities with high levels of digital finance, financial institutions should be pushed to increase their investments in and support for green projects, to establish green financial products and service systems, and to direct capital flows to environmental protection and sustainable areas. For non-resource-based cities, special green innovation funds should be set up to support green technology enterprises and projects, the application and dissemination of green technology should be promoted in urban land use, and green enterprises should be attracted by establishing green industrial parks and demonstration zones to generate a green industrial agglomeration effect and promote CGLUE.