1. Introduction
With international trade expansion and accelerated globalization, the movements of people and cargoes are frequent. In order to profit from efficient travelling, people have higher requirements for the speed and timeliness of transportation. Transportation infrastructures are of crucial importance to satisfy efficient travelling requirements [
1,
2]. Compared with that of water, rail, or road, the contribution of air transportation is becoming more and more necessary and important. Air transportation plays an irreplaceable role in the modern world [
3]. Air transportation is a necessary means for the fast and effective movements of people and cargoes over large distances, and it is critical to the functioning of countries and the world economy. Examining the structure and growth mechanisms of air transportation thoroughly is crucial [
3].
For the past several decades, the complex network theory has helped the research community to investigate various networks’ structures and properties. Many types of relationships in nature and human society can be abstracted as networks, in which the individuals are represented as vertices and the interactions as edges [
4]. The ability to analyze realistic networks has been improved, along with measurement techniques and the advancement of computational power. The analysis of air transportation networks within the complex network framework is one of the most important fields to have attracted adequate research interest [
5,
6]. Analyzing the complex network model of airports considering all flights among destination airports throughout a country or the world has been the subject of many complex network studies. For air transportation, the structure and properties of the air network play a vital role. Complex network theory is naturally a useful tool for practitioners and researchers to grasp the overall air network features and flow patterns, to identify the importance of individual airports, and so on [
4].
(i) The literature on complex network models of airline networks.
In complex network models of airline networks, the air network is modelled as a graph that comprises airports as vertices linked by flights. In the literature, the air network structure has been investigated from world-wide or national angles [
1,
4,
7,
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19].
The world-wide airline network (WAN) has been analyzed by its topology, as well as by the perspective of its dynamics. The majority of published studies have reached the consensus that the WAN exhibits scale-free and small-world properties [
7,
8,
9,
10,
11,
12]. The small-world property implies that the average topological distances between vertices increase very slowly (logarithmically or even more slowly), along with the number of vertices increasing. Research on the correlation between weight and topology indicated that the correlation follows a power-law [
8,
9]. To the centrality of airports, the most connected vertices on the WAN are not necessarily the most central, because of many factors, such as geographical-political constraints [
10,
11]. Besides, some weighted evolving network models were proposed to find the weighted features of the WAN [
9,
12]. The national (or regional) airline networks, such as those of China [
4,
13,
14,
15,
16,
17]), India [
8], the U.S. [
18,
19], and other countries [
20,
21], have also been extensively investigated. Although every nation or region has its own characteristics; e.g., the U.S. airline network is nearly mature [
18] while the airline networks are constantly growing in developing countries, it was found that the national airline networks have scale-free, small-world, disassortative mixing properties.
Moreover, airline networks are not static, since the departure and arrival time of flights varies considerably within a day or a week. Characterizing the temporal properties is thus of great importance in the understanding of evolution processes taking place in air networks. There are some studies focusing on the evolution of airline networks in different time-scale intervals, such as a year [
3,
4,
5,
19,
21,
22], a quarter [
18], or a week [
13]. Although some valuable insights have been provided in the literature by analyzing the structure of air networks at different time scales (e.g., annual, monthly, or daily scales), there is an increasing need to characterize the network temporal evolution at smaller scales [
23].
(ii) The air networks from smaller-scale perspectives.
The air networks may be analyzed from smaller-scale perspectives, and this is what we aim for in this paper. Since China’s aviation system has evolved with its own unique mechanism and structure, exploring the statistical and structure characteristics of China’s aviation system is of the utmost interest and importance for researchers [
17]. As a case study on the Chinese air passenger network (CAPN), this paper investigates the overall network features, the structure features of two types of subnets extracted by time-scale constraints and the periodicity of one type of subnets. The time-scale interval for extracting subnets is one day or every two-hours, which classifies the CAPN subnets into two types. The first type, which consists of seven subnets, is extracted by taking one day in a week as the time-scale interval. The second type is extracted by taking two hours as the time-scale interval. For every day in a week, the time interval from 6:00 a.m. to 10:00 p.m. is considered. With the two-hour interval, the second type consists of 56 subnets with the time range from 6:00 a.m. on Monday to 10:00 p.m. on Sunday. A set of indicators was used to characterize the temporal process. Moreover, although the topological structure of an air network obviously has an impact on the network’s performance, the capacity (i.e., available passenger-seats in this paper) flow through such structures is also an important indicator, since the capacity indicated by available passenger-seats induces profits and passengers’ travelling behavior.
(iii) The data and the CAPN
To model the CAPN, the data on each week’s domestic flights was collected in the latter half of the year 2016. The data was obtained from the Civil Aviation Administration of China and the official website of each airport. The flight data includes the flight number, aircraft type, departure airport, destination airport, departure time, arrival time, and the flight frequency in a week. The CAPN consists of domestic and international airports for handling regular passenger flights conducted by more than 40 domestic airlines. Only direct flights are considered and all international flights are excluded from our study. The data involves all cities with civil airports operating in mainland China. The location and name of each city are used to represent the corresponding airport. For some large cities with more than one airport, these airports are assumed to have the same location in the city and flight data is summed.
Based on more than 21 thousand items of domestic flights, the CAPN consists of 174 airports denoted by vertices, and 1787 airlines denoted by edges. The CAPN is denoted as (V, E), where the vertex set (V) represents the 174 airports, and the edge set (E) represents the 1787 direct airlines between airports. The CAPN is also represented by an adjacency matrix A and a weight matrix W that are symmetric, regardless of the direction. The element of A takes the value of 1 if there is any flight link between city and ; otherwise, takes the value of 0. The element of W represents the weights (the available passenger-seats in this paper) assigned to the respective edge. The aircraft type of every flight and the number of configured seats are known in advance.
(iv) Contributions and sections
The contributions of this paper include the following: First, from the non-weighted and the weighted perspectives, the structural features of the CAPN are provided, which offer an insight into the configuration mode of the air capacity for the whole CAPN; second, to find the temporal features, we analyze the CAPN from a more microcosmic perspective by extracting two types of subnets with the time-scale interval constraints of each day or every two-hours. For most indicators, a one-day cycle, two-day cycle, and/or three-day cycle can be found.
The rest of this paper is structured as follows. In
Section 2, we introduce several indicators that are capable of expressing the network type, the cluster, the assortative mixing, the centrality, and the rich-club. The non-parametric spectral destiny estimation with a window function is presented, so as to find the cycles in some indicators. In
Section 3, the structural features of the CAPN from the non-weighted and the weighted perspectives are described. The temporal features of two types of CAPN subnets are analyzed.
Section 3 also discusses the implicit periodicity mathematically through the use of the spectral destiny estimation method. The conclusions are drawn in
Section 4.
3. Results
3.1. The Structural Features of the CAPN
(i) In the CAPN, for an airport, is the number of directly-connected airports, is the average minimum number of airlines that one needs to take to get from an airport to another airport, and is the probability that two airports that are directly connected to the same airport are also directly connected to each other. These basic indicators can convey two common properties of real networks as the CAPN.
One is the small-world property, as indicated by a high clustering coefficient and a small average shortest path.
Table 1 compares the topological properties of the CAPN with those of other types of air transportation networks. The clustering coefficient (
C = 0.80) is slightly larger than those of other types of air transportation networks. The average shortest path length (d) of the CAPN is 1.98, which indicates that, on average, about two flight changes are required to connect most airport-pairs. The average shortest path length of the CAPN is similar to those of India [
1], Brazil [
21], and the U.S. [
18], but much less than that of the world [
11,
32]. It can be inferred that the CAPN evolves as a small-world topology, which means that it is possible to fly from one airport to another through a small number of intermediate airports. All of the network features presented in
Table 1 imply that the CAPN has properties that are similar to small-world characteristics.
The other is the scale-free property, which is characterized by a power-law
. The cumulative degree distribution ((>
k)) of the CAPN is analyzed manually and illustrated in
Figure 1, in which it can be seen that a two-regime power-law distribution is followed. It is confirmed that a small number of airports carry the majority of the airlines. This scale-free structure could be robust for a random failure, but vulnerable to a targeted or specific failure.
(ii) The weighted clustering coefficient of the CAPN is 0.85, which is larger than the clustering coefficient of the CAPN (i.e., 0.80). This statistical result reflects that the airports are connected closely, and that the large passenger capacities take part in the formation of clusters on that basis.
(iii)
Figure 2 shows the result that the higher the degrees that the vertices have, the lower the degrees that their neighbors have on the CAPN. This result implies that the CAPN tends to be disassortative. National hubs or large airports are required to provide connectivity to a large number of low-degree destinations. Concluded by
, the passenger capacities are assigned between high-degree vertices.
The weighted assortativity coefficient (i.e.,
) better describes the assortativity of networks with weighted edges. If
, a high-weighted edge tends to connect two similar degree vertices; otherwise, the degree of vertices connected by high-weighted edges may have large differences [
35]. The value of
is −0.53, and for
, it is −0.17 for the CAPN. This indicates that large passenger capacities exist between large airports on the CAPN when considering the results of
and
.
(iv) In
Table 2, we list the top 15 airports. The most central vertices on the CAPN are identified by the four centrality indicators. Generally speaking, the airports located in larger cities always have more central positions, such as Beijing, Shanghai, Guangzhou, Shenzhen, Chengdu, Xi’an, etc. These airports usually act as the national air-hubs. What is the most different is the ranking of the betweenness centrality. The betweenness centrality measures the importance of a vertex in connecting the others, and the airports with key positions between others have high rankings.
Besides the national air-hubs located in larger cities, some airports that may act as local air-hubs; for example, Urumqi, Hohhot, Kunming, Guiyang, Lhasa, etc., also have high rankings. In practice, regional airline networks are constructed in some borderline provinces of China, in which airports located in the provincial capitals are the local air-hubs and the local airlines constitute the edges. The regional networks make the local air-hubs hold important positions in the whole CAPN.
(v) In the CAPN, the first 35 vertices with degrees larger than 35 (i.e., ) are chosen as the rich-club. The rich-club coefficient is 0.93, which indicates that there are direct airlines between the majority of airports with large degrees. The weighted rich-club coefficient is 0.88, which means that close to 90% of large capacities exist between airports with large degrees. The behavior of the rich vertices in the CAPN implies that rich vertices are closely connected with each other, and that the passenger capacities among them are dominant.
3.2. The Temporal Features of the CAPN Subnets
We investigate the temporal features of the CAPN by certain subnets. To analyze the CAPN more microcosmically, the time-scale interval constraint for extracting subnets is per day or every two-hours, since the collected flight information includes some details. The first type of subnets, named SDD in this paper, was extracted by taking every day in a week as the time-scale interval. The SDD consists of seven networks; each of which is modelled by referring to the data on all flights on each day of a week. The second type of subnets, named SDT in this paper, was extracted by taking two hours as the time-scale interval. On every day in a week, the time from 6:00 a.m. to 10:00 p.m. was considered, while other time intervals with not enough flights are excluded. For an airport, any flights with departure times in the corresponding two-hour interval are considered. Within the two-hour interval, the SDT consists of 56 networks with a time range from 6:00 a.m. on Monday to 10:00 p.m. on Sunday.
3.2.1. The SDD Network
The number of vertices in every SDD network is almost the same as that in the CAPN, while the edges of every SDD network are obviously less than those of the CAPN. In official schedules, some flights between two airports are scheduled every two-days, which leads to different numbers of edges on each day of a week. The indicator results show that the features have no particular difference among the seven SDD networks, and between the seven SDD networks and the CAPN (see
Table 3). The SDD network has small-world, disassortative mixing, and rich-club properties. Some properties of subnets of different days have slight fluctuations. Probable reasons for this are summarized as follows. Firstly, there is usually more than one flight between two airports, and the two airports are still linked, although some flights have been eliminated by the time interval constraint. Secondly, some of the feeder airports and flights are filtered out by the time interval constraint for the reduction of vertices and edges, which occurs alternately on different days. The absence of some of airports and flights does not happen on the same day. On the whole, the numerical values of some indicators of the SDD networks remain relatively stable.
3.2.2. The SDT Network
SDT network properties may shed light on the air network performance and the changing rule more accurately and microcosmically in the course of all periods of every day. Corresponding to the eight two-hour intervals of each day in a week, the 56 SDT networks are numbered from 1 to 56. As is shown in
Figure 3, any one of the SDT networks is smaller than the CAPN in the numbers of included vertices and edges. The maximum number of edges of SDT networks is 716, which is even less than half of the CAPN edges. The scales of 56 SDT networks imply an obvious periodicity tendency. On each day, the departure time of the majority of flights is from 10:00 a.m. to 12:00 a.m., and most airports are involved at the time from 8:00 a.m. to 10:00 a.m., and from 12:00 a.m. to 2:00 p.m. However, in this situation, the peak numbers of the involved airports and flights do not appear in the same time interval. The possible reason for this is that more feeder airports that have few scheduled flights take part in the network from 10:00 a.m. to 12:00 a.m.; e.g., the remote airports in Yushu, Altay, and Korla, and tourist airports in Jingdezhen, Jiuzhaigou, and Dunhuang only have one or two flights from 10:00 a.m. to 12:00 a.m.
Including a smaller number of edges, the properties of SDT networks change a little, compared to the CAPN. The CAPN obviously has a small-world property. When it comes to SDT networks, the average clustering coefficient is only 0.49, and the average short path is 2.39, which manifests that the small-world property is weakened, together with the outcome that the average degree is 9.14 for the SDT networks, which is far less than that of the CAPN (i.e., 20.54). There is no sufficient direct connection between airports in SDT networks, which leads to difficult transportation. The property that the weighted clustering coefficients are always larger than the clustering coefficients is kept still, which can be found from
Figure 4. Including those limited by less edges, not so many clusters are formed in SDT networks. The edges with higher weight are much more likely to form clusters and large-capacity airlines still play an important role between airports such as Beijing, Shanghai, and Guangzhou, where passenger capacities on the airline always have the top rankings in all SDT networks.
The SDT also has the property that
is larger than
. An edge with a large passenger capacity will always want to link the vertices with large degrees. This is easy to understand, since more flights will be scheduled between the airports that are large enough, and provide transportation services for large cities. Besides, some flights from large airports will connect other airports at small scales to maintain the connectivity and flexibility of the whole air network. As a result, the SDT networks become disassortative.
decreases along with the degree increase, and so does the
. This indicates that the larger degree that a vertex has, the smaller the average degree that its neighbor vertices have, which is in accordance with how the disassortative networks perform. The disassortative property reflects the tendency that large degree airports link many small degree airports to enhance the total passenger demand in a cost-effective and infrastructure-efficient design [
18]. This tendency is described quantitatively by using the assortativity coefficient. The assortativity coefficient of the SDT networks is negative, and it seems periodical. The average assortativity coefficient is −0.42, and the average weighted assortativity coefficient is −0.20, which indicate the type of disassortative networks. The time interval from 6:00 a.m. to 8:00 a.m. is the most disassortative. It is speculated that large airports may operate earlier to connect other airports, while many airports do not have scheduled flights between each other from 6:00 a.m. to 8:00 a.m. The property that
is larger than
is maintained (see
Figure 5), conveying that in SDT networks, the large capacity is likely to exist between airports with similar scales. Large airports ensure the dominant position by controlling the flow.
For the rankings of centrality, all of the rankings of SDT networks show a similar structure that is in accordance with the result that the CAPN shows. There are two types of airports at the top of the rankings. The first are those located in large cities with relatively-well-developed economies and societies. The second are those that act as local hubs, providing powerful bridges. It is costly and unnecessary to make all airports directly connect. In local regions, there are some feeder airports whose flights are few and they only connect with each other and their local hub. Generally, these two types of airports are more significant in the extracted subnets. There is no obvious change in the rankings of the important vertices. The average rich-club coefficient of all SDT networks is 0.59, which is obviously smaller than that of the CAPN. It indicates that over half of the connections between most airports with large degrees are linked by direct flights. A periodic property is implied in the rich-club coefficient results. The rich-club coefficient is small at the beginning and at the end of a day, and it is large in the period from 8:00 a.m. to 10:00 a.m. (see
Figure 6). Though rich-club members are not closely connected, the weighted rich-club coefficient with an average value 0.87 is relatively large, and there is no particular fluctuation in the weighted rich-club coefficient results. The passenger capacity of the rich-club is dominant in SDT networks, and the important airports usually sustain high flow volumes.
3.3. The Periodicity of the CAPN Subnets
Several of the CAPN subnet features imply certain periodic properties, and specific conclusions are attained from an experiential perspective. It is worth exploring further whether the indicators of the CAPN subnets have implicit cycles. We employ here the method of the spectral destiny estimation to mathematically find the implicit periodicity.
Table 4 shows the estimation results and the test results for the SDT network indicators. At the confidence level of
, any
is not larger than
. It is concluded that the peaks shown in
Table 5 are not false, and that all of the implicit cycles are true.
The periodicity of the CAPN subnets is summarized as follows.
(i) For each indicator of SDT networks, there exists an obviously significant cycle that corresponds to the first peak in the sequence of estimated values. For most indicators, the first cycle is 8, which means that a cycle of one natural day is common. Considering the flight schedules (
Table 5), 59% of flights fly every day, and other flights may be absent over some days. The schedule mode of flights in a week indicates that most flights are scheduled with one day as the interval, which contributes to one day as the first cycle.
(ii) Besides the one-day cycle, there are another two types of cycles that can be found by the spectral destiny estimation method; i.e., 15 and 22, which correspond to about two and three days, respectively. Based on schedule modes such as “0101010” and “1010101” in flight schedules (as shown in
Table 5), a cycle of two days can also have an effect. Moreover, there is still quite a large proportion of flights that are scheduled by other schedule modes. The spectral density estimation computation reveals that a three-day cycle can be found.
(iii) Compared with the vertices, the edges of the SDT networks are more flexible and changeable. The number and locations of airports are fixed for a long time, while the schedule of flights is relatively flexible, which leads to differences between the SDT networks. The SDT network is the interaction by airports and flights, where the properties shown are affected by the interaction. Although three implicit cycles are found by the spectral destiny estimation method, the numbers of cycles implied by the sample data of different indicators are different. The indicators that involve vertices and passenger capacities on edges, such as the number of vertices, the average degree, and the weighted rich-club coefficient, generally have two cycles. The indicators that involve the quantity and status of connection between vertices, such as the number of edges and the assortativity coefficient, generally have three cycles.
The Chinese air passenger system presently plays a crucial role in people mobility in China. Finding the mechanism that drives air transportation dynamics is a necessary step to better controlling and optimizing the function of the air passenger system. The mechanism involves the periodicity, which reflects the passenger needs being guided by the fluctuant configuration of the air resources. Understanding the periodic mechanism is fundamental for better air network designs that benefit airlines and passengers.
4. Conclusions
The complex network theory has helped the research community to investigate various air networks’ structures and properties. As a case study, this paper aimed to analyze the Chinese air passenger network (CAPN) from holistic and more microcosmic perspectives. The topological structure of an air network obviously has impacts on the network’s performance, and the capacity (e.g., available passenger-seats) flow through the structure is also an important indicator, since the capacity induces profits and passengers’ travelling behaviors. The CAPN structure features from the non-weighted and the weighted perspectives are described. Two types of subnets extracted by time-scale constraints of one day or every two-hours are used to find the temporal features. The spectral destiny estimation method is used to find the implicit periodicity mathematically, since the CAPN subnet features seem to imply some periodic properties.
The results indicate that the CAPN has a small-world and scale-free structure. The clustering coefficient (i.e., 0.80) is relatively large. The average shortest path length is 1.98, which is similar to those of India, Brazil, and the U.S., but much less than that of the world. The cumulative degree distribution follows a two-regime power-law distribution, which confirms that a small number of airports carry the majority of the flights. The weighted clustering coefficient (i.e., 0.85) is larger than the clustering coefficient (i.e., 0.80), which reflects that the airports are closely connected, and the large capacities take part in the formation of clusters on that basis. The CAPN tends to be disassortative. National hubs or large airports are required to provide connectivity to a large number of low-degree destinations. Large capacities exist between large airports. Apart from the topological properties of whole networks, some important airports, including national air-hubs and local air-hubs, remarkably affect the CAPN. About 90% of large capacities exist between airports with large degrees. Rich vertices are closely connected with each other, and the passenger capacities among them are dominant.
The properties of CAPN subnets extracted by taking two hours as the time-scale interval (i.e., SDT networks) shed light on the air network performance and the changing rule more accurately and microcosmically. For SDT networks, the average clustering coefficient is 0.49, and the average short path is 2.39. Several of the CAPN subnet features imply some periodic property, and the method of the spectral destiny estimation is used to find the implicit periodicity mathematically. For most indicators of SDT networks, a one-day cycle, two-day cycle, and/or three-day cycle can be found. The periodicity mechanism reflects that passenger needs are guided by the fluctuant configuration of air resources. Understanding the periodic mechanism is fundamental to better designing the air network that benefits airlines and passengers.
Our work is based on the adjacency matrix and the weight matrix that represent the CAPN. The matrix data can be attained by emailing the corresponding author. This paper provides some valuable conclusions about the structural characteristics and temporal features of the CAPN, which serve as a baseline for other research. The analysis conclusions offer an insight into the configuration mode of air capacity in the whole CAPN at a macro and industrial level. For future study, the conclusions of this paper may be improved and calibrated by linking the models of firm and passenger behaviors.