1. Introduction
Infrastructure systems such as electrical power, water supply, and transportation are critical assets of our society because they play a fundamental role in delivering essential services to urban systems. With the development and expansion of modern cities, the increasing interdependencies and complexities of infrastructure systems pose new challenges for operations and security management because of their large-scale and nonlinear behavior. Resilience is thus an important attribute of infrastructure systems that are subject to the possibility of disruptions. The resilience of infrastructure systems generally concerns the robustness of the system functionality against disruptions and the efficiency of recovering to normal functions [
1], which are determined to a significant extent by the interdependencies among the infrastructure systems and their interactions [
2]. Given the increasing impact of man-made and natural disasters on infrastructure systems, it is imperative to assess and enhance their resilience. Plans must be made to restore infrastructure systems after disruptive events to reduce their negative influences on the society and economy [
3].
The quantitative metric of resilience has fundamental implications in the understanding of the mechanism of resilience [
4,
5]. Some methods for assessing or quantifying system resilience have been proposed. Bruneau et al. [
6] defined system resilience as the ability to resist the impact of disruptive events and maintain system performance over time. Resilience is quantified by integrating the functionality curve during disruptive events [
7]. Chang and Shinozuka [
8] calculated resilience as the probability that a system would satisfy both rapidity and robustness standards following a disruption. Cimellaro et al. [
9] stated that resilience should be qualified based on the analytical functions describing the system rapidity and robustness. Zobel [
10] developed a multi-dimensional resilience metric that incorporated the balance between the recovery speeds and initial losses. In the above studies, the system resilience metric is usually a static value for a specific event. To provide more information regarding the efficiency of infrastructure restoration during the recovery process, some works extended the above metrics in a number of ways. Henry and Ramirez [
11] developed a metric, whereby the system resilience is calculated as the time-dependent ratio of recovery to maximum functionality loss. This metric can be applied for identifying the important components in a system with the objective of prioritizing investments to protect the most influential components. Simonovic and Peck [
12] proposed a dynamic resilience (DR) metric based on the adaptive capacity of an infrastructure system and the level of system performance, which can be used to evaluate the effect of the restoration efforts at each time step during the restoration process.
Following a disaster, to reduce the negative impact on society and enhance the resilience of infrastructure systems, restoration activities are essential for recovering the performance of disrupted infrastructure systems [
13]. For a small-scale disruptive event, restoration efforts are separately performed by infrastructure managers without coordination across systems. However, following a large-scale disaster, multiple infrastructure systems may be severely disrupted simultaneously. For instance, following Hurricane Sandy in 2013, millions of people were left without electricity and communication networks, subways were closed because of flooding, and gasoline shortages lasted for weeks in some cities [
14]. To minimize the disastrous impacts, extra restoration resources are required to help expedite the recovery of the infrastructure systems. The restoration resources are provided by local or central governments in countries such as China, by the Public Assistance Scheme and Disaster Recovery Fund or by insurance companies in countries such as the USA (the responsibility of government sectors such as the Department of Homeland Security is coordinated for information-sharing at the state or national level in the case of emergencies) [
14]. In general, the resource budget is not unlimited, and an optimized restoration resource allocation strategy could increase the efficiency of the recovery process. Some studies have been conducted to address optimal resource allocation in post-disaster recovery scenarios of infrastructure systems. With the objective of minimizing the direct impacts from a disruption and the sum of the direct and indirect impacts, Mackenzie et al. [
15] proposed and compared two different decision models to determine optimal resource allocation to assist the recovery of disrupted infrastructure systems. To minimize the overall losses resulting from a disaster, Mackenzie et al. [
16] developed a static model and a dynamic decision model to facilitate the recovery of infrastructure systems. Through a vulnerability analysis, Zhang et al. [
17] developed a resource allocation model to quickly restore critical components in interdependent infrastructure systems after a disruptive event. With the objective of maximizing the resilience of interdependent infrastructure systems over time, Zhang et al. [
18] proposed a mixed integer programming model to determine a restoration resource allocation strategy for interdependent infrastructure systems.
Interdependencies among infrastructure systems should be taken into consideration in restoration resource allocation problems [
18]. The effects of interdependencies include the propagation of effects from one system to another [
19]. A disruptive event that directly impacts some infrastructure systems can trigger indirect impacts on other systems. Quantitative assessments have shown that indirect disaster impacts caused by infrastructure interdependencies may be more severe than the direct impacts [
4,
20]. Therefore, the recovery process of infrastructure systems following a disaster is also affected by the interdependencies among them. Some approaches have been developed to analyze the impact of infrastructure interdependencies [
21]. Network-based models and economic theory-based models have been used for this purpose. In network-based models, interdependent infrastructures are described as multilayer networks. The interdependencies among the systems are usually quantified at the component level [
22,
23]. In comparison, in economic-theory-based models such as the dynamic inoperability input–output model (DIIM), the impacts of interdependencies between different systems are analyzed at the system level [
24]. Infrastructure systems or subsystems are selected as the smallest analysis units [
25].
There is a problem in the aforementioned resource allocation studies that requires further investigation in order to be solved. The objectives of resource allocation remain unchanged during the recovery process of infrastructure systems. These objectives include maximizing system resilience, minimizing the total cost, and minimizing the recovery time. In practice, from the perspective of decision makers, the objective of resource allocation is dependent on emergency scenarios and is variable during the recovery process. Specifically, following a disaster, the performance of the disrupted infrastructure systems first decreases sharply; to enhance the resilience of these systems and meet the basic needs of users, the objective of restoration resource allocation is to aid in recovering the infrastructure systems’ performance to a specific level as quickly as possible (adaptive capacity of system resilience). At this stage, the minimization of the recovery time of the infrastructure systems takes priority in the decision-making process. Second, when the basic needs of the users have been satisfied, the goal of restoration resource allocation will change. Minimizing the overall losses during the following recovery process is more important (restorative capacity of system resilience). With a limited restoration resource budget, the allocation should be adjusted at this stage.
To this end, the objective of this study is to develop a two-stage restoration resource allocation model that can be applied to determine the optimal allocation at different recovery stages. The main contributions of the present research include (i) the use of the DR metric proposed by Simonovic and Peck [
12] to evaluate the effect of restoration resource efforts at each time step, which is critical for decision makers in determining the objectives of resource allocation at different stages during the recovery process. (ii) Combining the system recovery speed and system performance loss together, a two-stage optimization resource allocation model is developed. The first stage aims to recover infrastructure systems’ DR to quickly meet the basic needs of users, while the second stage aims at minimizing the overall losses during the recovery process after Stage I. The effects of infrastructure interdependencies on the resource allocation strategy, and on the resource allocation objective during the recovery process are incorporated in the model using the DIIM. (iii) Applying realistic infrastructure systems data obtained from the US Bureau of Economic Analysis (BEA), the utility of the proposed model in decision making is demonstrated by comparing it with other resource allocation strategies through a case study. The proposed approach can provide decision makers with more significant insight into the resource allocation effects at different recovery stages and assist in making informed decisions.
This paper is organized as follows.
Section 2 presents the development of a two-stage resource allocation model for enhancing the resilience of interdependent infrastructure systems.
Section 3 presents a numerical method for solving the proposed model.
Section 4 examines the utility of the model through a case study.
Section 5 discusses the results.
2. Model Formulation
2.1. Resilience Metric
The resilience metric is fundamentally important to the investigation of the effects of restoration resource allocation. To evaluate the effect of restoration resource efforts at each time step during the recovery process, the DR metric proposed by Simonovic and Peck [
12] is employed in this study.
Figure 1 illustrates the performance of an infrastructure system following a disruptive event, where
denotes the occurrence time of the disruptive event;
denotes the end of the recovery process of the system;
represents the expected system performance level, which is usually a constant; and
represents the real performance line under a specific restoration strategy. In
Figure 1, when the disruptive event occurs at
, the system performance drops immediately because of the direct disruption of the event and then decreases for a while owing to failure propagation within and across different infrastructure systems. In the absence of restoration activities, the system performance stays constant after the failure propagation. Under different restoration strategies, the recovery process of the system will be different. The area between the expected system performance level
and real performance line
represents the loss of system performance over the time horizon
under a restoration strategy. In the mathematical form, the loss in performance
can be obtained using Equation (1).
The DR is represented by Equation (2).
The value of
is between 0 and 1 and changes with time
during the recovery process. The resilience metric proposed by Bruneau et al. [
7], which has been widely used in the literature, is a static value for a specific event and represents the normalized performance loss of the system over the recovery time horizon. The DR
is estimated during the recovery process and changes with time. It is appropriate to evaluate the effect of restoration efforts at each time step during the recovery process.
As shown in Equation (2), the DR of an infrastructure system is dependent on the system performance loss
. Generally, the performance of different infrastructure systems is expressed in different units such as m
3 of water distribution volume or kW of power transmission capacity. Nevertheless, the DIIM [
25] provides an approach for expressing the performance loss of different infrastructure systems in the same form. The DIIM is focused on the inoperability of systems owing to perturbations resulting from a disruptive event, and the negative consequences are measured in economic losses and inoperability (i.e., percentage of ‘dysfunctionality’ relative to an ideal state). In Equation (1), the term
can be expressed as
where
represents the expected system performance level in monetary units and
represents the inoperability of the system, which quantifies the proportional extent to which an infrastructure is not functioning in an as-planned manner at
t.
For simplification, the occurrence time of the disruptive event is considered to be . Using Equations (2) and (3), the DR of an infrastructure system is expressed as shown in Equation (4).
2.2. Model of Infrastructure System Interdependence
Infrastructure systems are geographically and functionally interdependent [
26,
27]. An interdependence analysis is essential when investigating the performance change of infrastructure systems or when designing recovery plans [
23,
28]. In this study, to capture the infrastructure system interdependencies at the system level, the DIIM introduced by Haimes [
25] is applied to describe the recovery dynamics of infrastructure systems.
For
N types of systems, let vector
denote the performance inoperability of the systems at
t, which can be expressed in the following form.
where
is the initial inoperability of the infrastructure systems at
;
is a normalized interdependent matrix; matrix
, where
denotes the recovery rate of infrastructure
, measures the capability of the system to recover from the disruption and reach a desired performance level. According to the literature [
16,
18],
can be described as
where
denotes the basic restoration capacity of infrastructure
’s managers;
denotes the cost-effectiveness parameter, which describes the effectiveness of allocating resources to infrastructure
’s recovery rate; and
represents the restoration resource allocated to infrastructure
. The function described in Equation (6) is strictly increasing and marginally decreasing with respect to
.
2.3. Mathematical Model of Resource Allocation
Based on the previous descriptions and definitions, and while taking into consideration the decision makers’ point of view, this section formulates a two-stage programming model for restoration resource allocation to enhance the resilience of infrastructure systems. The division of the stages in the model is dependent on the DR of the infrastructure systems.
At Stage I, with a limited restoration resource budget, in order to meet the basic needs of the users quickly, the objective of resource allocation is to aid the DR of infrastructure systems to recover to the basic level as soon as possible. The model for resource allocation is expressed as follows:
s.t.Equation (7) is the objective function, which represents the minimization of the maximum value of elements in the set under different resource allocation strategies, where represents the time at which infrastructure ’s DR recovers to its basic level under a resource allocation strategy. Expressions (8)–(12) are the constraints. Constraint (8) is drawn from Equation (6), where is a diagonal matrix and represents the restoration capacities of the infrastructure systems. Infrastructure ’s restoration capacity is the sum of the basic restoration capacity of infrastructure ’s managers () and the restoration capacity acquired from the allocated restoration resources (). Constraint (9), drawn from Equation (5), expresses the inoperability of the infrastructure systems at time t with the consideration of the interdependencies among the systems. Constraint (10) ensures that the resources allocated to each infrastructure system are non-negative. Constraint (11) ensures that the restoration resources allocated to the infrastructure systems are no more than the resource budget . Constraint (12) is the formula for determining , where denotes the DR of infrastructure at time , and its expression is obtained from Equation (4). represents the basic level of infrastructure ’s DR, which meets the basic needs of users.
At Stage II, after the basic needs of the users have been satisfied, the objective of restoration resource allocation is to minimize the overall losses during the following recovery process of the infrastructure systems.
Expression (13) presents the overall losses during the recovery process of the infrastructure systems following Stage I. The first term represents the economic losses of the infrastructure systems and is obtained from the DIIM model. The second term represents the usage cost of the restoration resources, where is the time at which infrastructure ’s DR recovers to the expected level. represents the usage cost of the unit restoration resources in a time step. denotes the restoration resources allocated to infrastructure at Stage II. Constraint (14) ensures that the restoration resources allocated to each infrastructure system at Stage II are non-negative. denotes the adjustment of the restoration resources allocated to infrastructure at Stage II in comparison with that at Stage I. Constraint (15) ensures that the resources allocated to the infrastructure systems do not exceed the resource budget . Constraint (16) represents the restoration capacity matrix of the infrastructure systems at Stage II. Constraint (17) is the expression of the inoperability of systems at . is determined by the value of the objective function at Stage I and is the starting time of Stage II. Constraint (18) is the formula used to determine , which is the time at which infrastructure ’s DR recovers to its expected level .
3. Solution Method
In previous research, heuristic algorithms have been used to solve resource allocation problems [
29]. In this study, the genetic algorithm (GA) is applied to solve the two-stage programming model.
For the model of stage I, the procedure to search for the optimal solution to the resource allocation model (Equations (7)–(12)) is described by the following steps:
Step 1. Code design. Express each solution to the resource allocation in terms of a genotype vector
subject to the following constraints
Considering a variety of individuals, the genotypes of the initial individuals are randomly generated according to the above constraint conditions.
Step 2. Compute the fitness value of each genotype.
First, transform Equations (8), (9) and (12) into the discrete-time version (see Equations (20)–(22)).
in Equation (12) is represented by a function of
, which is obtained from Equation (4).
Second, compute the inoperability of the infrastructure systems at each time step. The elements of vector are calculated using the recursive formulas of Equations (20) and (21), with the initial inoperability vector . If meets the constraint expressed by Equation (22), then let .
Third, compute the fitness value
The fitness value of each genotype represents the time at which the DR of the infrastructure systems recovers to the basic level under a resource allocation strategy. For genotypes that do not meet constraints (10) and (11), we use a sufficiently large number as the penalty of the unavailable solutions.
Step 3. The methods of roulette, two-point crossover, and random mutation are selected as rules for the selection, crossover, and mutation, respectively. After the above procedure, we select the superior genotypes according to their fitness values in each generation. The stopping rule is the convergence of the optimal fitness value between the two generations. When the algorithm stops, the genotype corresponding to the minimal fitness value is the optimal solution for the resource allocation model of Stage I.
For the model of Stage II, step 1 and step 3 in the solution search procedure are similar to that for the model of Stage I. Each solution to the resource allocation at Stage II is denoted by a genotype. In step 2, as the fitness function of this stage is different, the procedure is as follows.
First, transform Equations (17) and (18) into discrete-time version, see Equations (24) and (25).
Second, compute the inoperability of the infrastructure systems at each time step after . The elements of vector for are calculated using the recursive formula in Equation (24) with the initial inoperability vector . If meets the constraint expressed in Equation (25), then let .
Third, compute the fitness value
The fitness value of each genotype represents the overall losses during the recovery process following Stage I under a resource allocation strategy.
When applying the above solution method, the number of individuals in each generation is set as ( is the number of infrastructure systems). The maximum generation is set as a number that ensures a maximal fitness function value in each generation converge and that does not fluctuate for more than five steps.
5. Discussion
With a limited resource budget, an optimized restoration resource allocation strategy can increase the efficiency of the recovery process of infrastructure systems and reduce the losses resulting from a disaster. Taking into consideration the government decision makers’ point of view, a two-stage restoration resource allocation model is proposed in this study. The objective of the first stage is to quickly recover the infrastructure systems’ DR to meet the basic needs of the users. The objective of the second stage is to minimize the overall losses during the recovery process following the first stage. The problem under study has several unique features over previous research: (i) the DR is applied to evaluate the effect of restoration resource efforts at each time step during the recovery process. Consequently, based on the evaluation, the stage division of resource allocation during the recovery process can be determined. (ii) The objectives of the two-stage resource allocation model are not unique during the recovery process of the infrastructure systems, which balances the recovery time and overall losses. A case study shows that the proposed model can assist decision makers in better understanding the resource allocation effects and making informed decisions following a disaster.
There are some limitations of the presented study. First, the results acquired are dependent on parameter assumptions in the proposed model, including the value of the basic and expected levels of DR. In reality, they are dependent on the characteristics of a specific infrastructure system and the needs of the users of an infrastructure service. For example, the basic level of DR for an electric power system is different from that of the water supply system; the basic level of DR for an infrastructure system serving large cities is probably higher than that serving small towns and villages. In this study, although we believe the parameter assumptions represent reality to some extent, their validity still needs to be explored using detailed data. Second, infrastructure systems are subjected to different types of disasters that could affect system performance differently. For example, earthquakes always disrupt all the infrastructure components located in the area simultaneously. Floods usually shock above-ground infrastructure components over the duration of disaster gradually. This study investigates the restoration resource allocation problem after the disaster. The type, duration and magnitude of disasters are not considered. Third, this manuscript proposes a method for resource allocation among interdependent infrastructure systems from the government’s perspective. This is very common in the case of large-scale disasters. However, in some disasters, especially small-scale disruptive events, restoration efforts of each infrastructure may be performed separately by infrastructure management organizations. Their objectives may differ from event to event. Future research will investigate how different objectives of all players in the recovery process can be considered in optimal allocation resources.