1. Introduction
Global efforts are being made to reduce greenhouse gas emissions due to rising energy demand and climate change issues. Correspondingly, the new paradigm of power generation utilizing renewable resources has emerged from existing fossil-fueled power generation. Out of all renewable energy resources, photovoltaic (PV) generation seems to be the most promising alternative to conventional fossil fuels as installation cost continues to decrease [
1]. In [
2], the photovoltaic power generation capacity of countries belonging to the International Energy Agency (IEA) has increased significantly from 0.3 to 35 GW over the past decade. These recent global trends are described in [
3], and various studies have been conducted to maximize the output of PV sources. Please refer to [
4,
5]. However, one of the important issues of PV generation is that the output characteristics of PV sources result in random behavior due to weather conditions such as the intermittent solar irradiance.
In order to ensure reliable operation and economic dispatch of the system, energy storage systems (ESSs) have been utilized [
6]. In [
7], it is essential to combine renewable energy sources with energy storage systems to maintain or improve the power supply stability and quality. ESSs play an important role, especially when they are coupled with PV generation, which is a desirable method to store surplus energy for securing future power supply and releasing it during power shortages. On that account, many studies have been conducted on ESS. In [
8], multiple types of irrigation scheduled programs that minimize the number of PV resources to be installed were studied and the effects of the variable costs linked to energy are presented. Ref. [
9] reviewed the battery models used in PV systems and verified the models with experimental data. Ref. [
10] conducted a series of economic efficiency studies comparing various PV systems combined with ESS. In the study, the rate of efficiency decrease over time was evaluated and the solar potential in Poland was estimated based on the metrological data. In [
11], to decide the most efficient alternative for a real urban Water Pressurized Network (WPN), the energy production in the PV panel and the energy required to supply water to the population were considered. The method introduced in the study can provide useful information to make decisions on whether it is better to employ batteries or water tanks for energy storage. For more information, please refer to the review papers [
12,
13,
14]. Despite the rich body of work addressing the capacity of an appropriate ESS with various criteria and operating strategies, few studies have considered the stochastic behavior of demand and power generation within a given day. Our research is motivated by the lack of results.
The analysis of the variability and uncertainty caused by the day and night cycle in solar photovoltaic generation is essential because they impact on the various performance measures, including the availability of the system. The system may fail to supply power to loads due to variabilities even though its average capacity is sufficient. Thus, the goal of this research is to develop stochastic models that resolve these issues. For this purpose, the Markov chain models will be introduced.
The most commonly used stochastic (probabilistic) model that addresses the capacity sizing problem of ESS is the Markov chain model [
15]. The advantage of the Markov chain model is addressed in [
16] as follows. “Compared to other existing battery models, stochastic models, particularly Markov chain processes, can model battery-powered systems as a whole. Moreover, the stochastic model can model burst aging processes and consider operational profiles and management, unlike in the case of other conventional models that can only illustrate battery behavior.” Thus, this model has been used as a statistical model to analyze the steady-state behavior of the ESS and has been the basis for simulation represented by the Markov chain Monte Carlo method. In [
17], a two-state Markov chain model is proposed to determine the storage size of a photovoltaic power generation system. The study focused on determining the storage level considering the variability in daily solar radiation and suggested that one or two days of storage would be adequate to meet an acceptable service level. In [
18,
19], one-dimensional Markov chain models were constructed to model energy states in the ESS. The performance measurements of both studies were the power supply availability. Calculating transition probabilities by using load data and supply data based on solar radiation, the analytical results were obtained and verified from Monte Carlo simulation. A similar approach is employed in [
15]. In this paper, discrete-time Markov chain model is proposed for the storage battery state of charge (SOC). The model is used to analyze the storage battery’s remaining capacity. The probability that the combination of solar power generation system and battery storage meets the load demand can be calculated.
Despite various studies using these models, few consider the diurnal and the seasonal variation of solar power and load. In [
20,
21,
22], the diurnal nature of solar power generation and load variations were studied. It is stated that the solar radiation power is daily modulated between zero and a maximum that depends on the latitude on earth and the season. Also, the daily energy demand during nighttime is lower than that of daytime. That is, the load demand and power supply of the systems behave randomly and vary over time. However, such variation is not reflected in the existing Markov chain models. In [
17,
18,
19], the state of Markov chains is defined to represent the level of the energy storage system (battery) and the transition probabilities are derived from the load demand and the power generation data. To calculate the transition probabilities, the probability distribution of the difference between power generation and load is derived. However, the existing models employ a single and time-independent distribution, so the diurnal and the seasonal variation is not considered in the models. All those variations are averaged to obtain the distribution; thus, they are intermingled in a single distribution. While the equilibrium probabilities of the Markov chains provide useful insight to understand the steady-state behavior of the battery level, it has limitations since some important measures, such as the probability that the system fails to meet the load demand, are caused by the abovementioned variations. The PV generation and load demand are fluctuating within a day, power supply availability of ESS per day does not represent availability at a specific time of day. Our research is motivated by such lack of results.
In this paper, two Markov chain models are developed to address the optimal allocation problem of ESS: Embedded Markov chain and Absorbing Markov chain models. The proposed Embedded Markov chain-based ESS model can calculate ESS SOC over time by considering diurnal variation and further calculate optimal PV-coupled ESS capacity. Note that the existing models has limitation to incorporate the fluctuation of load and PV generation within a day. To resolve this issue, two-dimensional Markov chain is proposed. Moreover, with the proposed Absorbing Markov chain model, the time duration until the PV-coupled ESS system fails to supply power to loads can be estimated. While there are various Markov chain models for dealing with ESS problems, no Absorbing Markov chain model has been employed. The model can be used to calculate the average time duration until the newly installed battery state becomes lower than a certain level. We applied our model to power consumption data of a university in Pohang, South Korea, from May 2018 to Feb 2020.
The contributions of our work are as follows. We:
Identified a Markovian property for battery state in a PV-coupled energy storage system.
By discretizing the load and power supply, obtained a two-dimensional discrete-time, time-homogeneous Markov chain with a single communicating class considering the diurnal variations of supply and demand. The distribution of battery SOC over time can be derived and optimal PV-coupled ESS capacity can be obtained from the equilibrium probabilities.
Developed a stochastic model to estimate the average length of time until the SOC of the ESS becomes lower than a certain level. The Absorbing Markov chain-based model is proposed.
The remainder of this paper is organized as follows. Parametric models are presented in
Section 2. In
Section 3, Embedded Markov chain and Absorbing Markov chain models are demonstrated. Finally, the discussion of this paper is concluded in
Section 4.
2. Mathematical Models
In this section, we describe our model. Then, we investigate the Markovian property and the recursive structure of the status of energy storage system to obtain the Markov chain model.
It is assumed that there are
PV sources consisting of the energy system. Let
and
denote the power rate generated by PV sources at time
and the load (demand) rate served by the energy system at time
, respectively. When
, the power generation of the energy system exceeds demand (load). In this case, it is assumed that the remaining power to meet the demand at time
,
, is stored in the storage system. In the case of
, the demand is greater than power generation at
, the shortage,
, is met from the energy stored in the storage system. If the stored energy is insufficient, we assume that the system fails to meet the demand. Let
and
denote the capacity of ESS and the amount of energy stored in the ESS at
, respectively. Note that
is a constant. Then, the battery status satisfies the following equations.
While the above differential equations represent the relationships between
,
and
, there are two difficulties in deriving the solution of the equations. First, it is difficult to obtain closed-form expressions for Equations (1) and (2). In some previous research such as [
23,
24], the differential equations with min–max operations were studied, but the scope of the study was limited. Generally, it has been known that it is difficult to obtain the analytic solution for the differential equation with min–max operations. Also, the function
is not differentiable at some point. Second, to obtain the solution for the battery state over time,
and
are required. However, it is difficult to express
and
in the form of continuous functions. The reason is related to the data collection process. Data on the load and power are collected at regular intervals through sensors installed in the system and stored in the database. Thus, the actual data obtained is not in the form of a continuous function, but a list of values measured at specific time intervals. While we can employ the regression analysis such as curve fitting to specify the continuous function that provides the best fit to the dataset, it is elusive to find the appropriate function considering differentiability, day and night cycle and insolation. Thus, it is more appropriate to employ the discrete time stochastic process to analyze the system.
Let
denote the time interval between two successive data measurements. For example, if
equals one hour, it means that the data on battery status are acquired and investigated every hour. Then, let
be defined as the epochs at which the state of energy storage system is examined
. Note that
Then, battery state at the k-th measurement time point is defined as
. Then, the battery state at the time of measurement can be demonstrated as follows:
where
and
are power generated by the PV sources and load during
, respectively. Note that
only depends on
,
and
. Thus,
is a discrete-time Markov process with stationary transition probabilities. While the system behavior can be described by Equation (3), the values of
and
are obtained in the form of continuous numbers, which leads to the loss of analytical tractability. It is difficult to obtain the equilibrium probabilities for a continuous-state Markov process. Thus, for analytical tractability, the continuous data are discretized to obtain the Markov chain model. We treat any number in
as
. In other words,
is the energy difference between two adjacent states in the Markov chain model, and
and
are assumed to be discrete probability distributions and the values of both distributions will be
hereafter. Then,
becomes a discrete-time Markov chain having states
where
is a maximum integer value satisfying
.
Next, the transition probabilities are considered to derive the equilibrium probabilities. Let
denote the random variable calculated by
. Note that the set of possible values of
is
. Then, Equation (3) is simplified as follows.
Because of day and night cycles and solar radiation quantity,
have a different distribution over time. For example, it is possible to assume an appropriate amount of insolation during the daytime, but
should be assumed because there is no insolation at night. In order to include such variations in the model, it is assumed that the probabilities of
vary with time. Thus, we assume
distributions (defined as
) according to the time zone, and the distribution of
in a specific time zone follows one of the distributions.
denote a function that maps time
to the distribution
. For example, suppose that
follows two distributions depending on whether it is night or day. We use
for the distribution of daytime and
for the distribution of nighttime. Suppose that we start our investigation at 1:00 a.m. (
k = 1) with
T = 1 h, and sunrise and sunset will be at 6:00 a.m. and 7:00 p.m., respectively. Then, for all
which satisfies 6 ≤ (
k mod 24) ≤ 19,
f(
k) = 1 where
a mod
b is the remainder of the Euclidean division of
a by
b. Otherwise,
f(
k) = 2. It means that the distribution of the difference between the power generation and load can be described by
between 6:00 and 19:00, and the distribution of nighttime can be described by
after sunset. Let the probability of the distribution
be defined as follows.
Then, stationary transition probabilities of the Markov chain can be considered by including the time index in the model. Let
M be the time length of a cycle and
gk is the remainder of the division of
k by the cycle length (
M) of our interest, except when the remainder is equal to 0. If the remainder is 0, we define that
gk equals
M.
G represents the set consisting of all g
k’s,
G: = {1, …,
M}. For example, if we are interested in the behavior of the battery state in a daily cycle with
T = 1 h, the length of a cycle is 24 (
M = 24), and the all-time indices for 1:00 a.m. are mapped to
gk = 1. Then, the process
,
is a discrete-time, time-homogeneous Markov chain with a single communicating class. The state space,
S, is as follows.
The transition probabilities are as follows.
where
For
m =
M,
Other transition probabilities are equal to 0.
If we arrange the states of our Markov chain in lexicographical order (i.e., {(1,0), (1,∆), …, (1,
C), (2,0), (2,∆), …, (
M,
C)}), we can write the transition matrix,
Q, of the Markov chain as follows:
where
Om,n is a matrix of
m rows and
n columns, and all elements are zero, and
Note that only the transition from states (
i,
j) to (
i+1,
k) occurs for states
and all
j and
k. In any states (
M,
j) for all
j, the transition can only occur to state (1,
k). This is because the first value in the state of our Markov chain represents time. Thus, our Markov chain is periodic.
Figure 1 provides the transition probability diagram of two examples of our system. The left figure of
Figure 1 shows the state space of the model expressed by dividing day into day and night. For convenience, we assume that the states (1,
j) and (2,
j) are the battery state of day and night, respectively. The transition probability is defined by the distribution of power generation-load occurring over the next 12 h and follows Equations (7)–(12). For example, suppose the battery state is 0 at the start of the day. If the difference between power generation and load during the next 12 h is Δ, the battery state at the beginning of the night becomes Δ, where the transition occurs from (1,0) to (2,Δ). Note that the probability of transition from day to night and transition from night to day can be defined differently. As such, through the two-dimensional Markov chain model including time index, it is possible to consider power supply and demand that appear differently over time. The figure on the right in
Figure 2 is the state space of the model expressed by dividing the day into hours. Transition probabilities are omitted from the figure (they follow Equations (7)–(12)).
In [
25], it is stated that, if the discrete time-homogeneous Markov chain is irreducible, then it is positive recurrent and periodic. There is a unique solution to the balance equations
and
where
is a transition probability matrix (abovementioned),
is a row vector representing the equilibrium probabilities. Each element
in
, is the equilibrium probability that the Markov chain is in state (
i,
j). For example,
represents the probability that the battery state is 0 at 1:00 a.m. if each hour is defined as a time index (
T = 1 h). As such, our Markov chain model can be used to predict the distribution of battery state at each time point. Also, the following performance measures can be considered for capacity design of battery and PV resources of the system.
In [
16], it is stated that “availabilities over 0.99999 or 5 nines are typically required for critical loads, such as conventional communication plants, whereas the U.S. grid availability is about 3 nines [
26].” While most PV-energy storage systems are designed to meet this goal, there is another issue of PV resource size and energy storage system combination should be used to achieve the goal with minimal cost. Let
and
denote the operating cost of one unit of PV module and one unit of energy storage system per unit time, respectively. Then, the total operating cost of the system consisting of
p units of PV modules and
s units of the storage system is
. Since the availability requirement should be satisfied, the problem at hand becomes finding the values of
p and
s that satisfy the requirement
at the lowest cost. Another measurement, the probability that the battery is full, can be used to check whether the storage system is oversized or not comparing to the capacity of PV modules. If the storage system is oversized,
decreases because there is less chance to fill the battery with the power produced by the PV module. Contrary, if the battery size is designed to be small, the power will fill the battery and the remaining power will be lost after the battery state reaches its capacity. In this case, the probability
has a large value.
Also, an Absorbing Markov chain technique can be employed to answer the following questions: (1) When a new battery is installed in the system, what is the average time it takes for the system’s battery state to go below a certain level? (2) If the system does not meet the load, when does the event occur?
By the definition in [
25], state
i is called an absorbing state when once the state is entered, it is impossible to leave. In other words, when a system state enters an absorbing state, it is stuck. A Markov chain that has at least one such state is called an Absorbing Markov chain.
In our Markov chain, we can define some states as absorbing states to answer the abovementioned questions. Let
and
denote sets of transient and absorbing states of the absorbing Markov chain, respectively. Suppose that we are interested in the average time until the battery runs out when a new battery is installed. Then,
If we rearrange the states of Markov Chain in the order of transient states and absorbing states, the transition probability matrix,
P, is
where
is a
-by-
matrix,
is a
-by-
matrix, and
is the
-by-
identity matrix. Note that
is the number of elements of set
A.
Figure 2 illustrates an example of an Absorbing Markov chain model (
T = 1 h). Note that no transition occurs from state (
i,0) for all
i. This is because the states are defined as absorbing states. The average time to run out of battery,
, can be calculated as follows:
where
is a length-
column vector whose entries are all 1. Note that
is a matrix consisting of a single column of
elements. Each element in
represents the average time until the battery is exhausted when starting the system in the corresponding state. For example, the element corresponding to state (7,
C) represents the average time until the battery is depleted when we install a new battery at 7:00 a.m. Thus, we can calculate the average time it takes to run out when a new battery is installed. Through a similar process, the absorbing state can be defined differently to calculate the time it takes for the battery to drop below a certain level.
3. Model Demonstration
The behavior of the proposed two models is simulated and the results are presented using MATLAB R2019b. The irradiance and load demand data sets used in the analysis are obtained at the Handong Global University Campus from March 2018 to February 2020, which were measured hourly. From the solar irradiance data, PV power generation is calculated as shown in Equation (27), considering the number (
), area (
), and efficiency
of PV modules:
Then, the net power of each hour,
, can be calculated from the
and
in Equation (28) and the probability distributions of
are shown in
Figure 3.
Most previous research studies have calculated the net power based on the daily basis [
17,
18]. However, it may not explain the randomness of solar power generation and load demand within a day, which can significantly affect the availability. In
Figure 3, hourly net power distributions are derived and the random characteristics within a day can be confirmed. As we can see in the figure, the net power distribution follows a different distribution every hour and this suggests that even though the average net power per day is sufficient, a power shortage may occur within a day. If the demand for power at a specific time exceeds the amount of power stored in battery + power generation at that time, a power shortage occurs. To consider those stochastic natures, this study assumes one-hour intervals for our Markov chain models. While the volatility may be better reflected in the model if we use a more specific unit of time (such as minutes or seconds), it can be a burden in data processing. For example, if the data interval is reduced to one minute, the data should be measured 60 times more often than the hourly based model, and the number of states in the Markov chain model will also increase. It increases the computational complexity to obtain the equilibrium probabilities [
25].
Depending on the statistical values such as mean and variance values,
Figure 3 shows that the net probability distribution can be classified into three groups: (1) 1:00 a.m. ~ 6:00 a.m. and 7:00 p.m. ~ 12:00 p.m., (2) 7:00 a.m. ~ 8:00 a.m. and 5:00 p.m. ~ 6:00 p.m., and (3) 9:00 a.m. ~ 4:00 p.m. Each group was divided into distributions of
, as previously mentioned in
Section 2, in order to perform the simulation based on the proposed mathematical model. In
, power difference between two adjacent states (Δ) is set with 125 kW, and each histogram is shown in
Figure 4 with the mean values of −1056 kWh, −344 kWh, and 7610 kWh, respectively. Considering the visual features and the statistical values of the net distribution, it can be interpreted that the daily net power cannot represent the hourly measured random characteristic.
Due to the nature of scheduled classes for each semester of the university, the campus power consumption data should be divided into semesters and break as well as weekdays and weekends. However, the average and variance values of the campus usage during the vacation and weekend shows only 9% differences with the average and variance of the entire power usage. Therefore, based on the statistical results, the whole data are used for obtaining enough sample data, regardless of vacation and weekend. When the proposed model is expanded to other applications, the prior data analysis can also be considered as an important factor influencing the results.
A. Performance of Embedded Markov Chain-Based Model
Since the model proposed in this paper is based on the power usage of the university campus, the optimal number of PV modules and ESS capacity is proposed based on the given target power supply availability of 3 nines.
Figure 5 indicates the limiting probabilities according to number of PV modules and ESS capacity. For this experiment, the simulation was performed using the manufacturer of Hanwha with a PV module’s rated output of
, size of
, and efficiency of 0.153.
In
Figure 5, as the number of PV modules increases or the ESS capacity increases, the limiting probability is decreased. Also, the nonlinear characteristic can be checked through the sudden change in the limiting probability at specific points. In order to calculate the number of PV modules and ESS capacity required under a given limiting probability condition, the correlation of the three variables (limiting probabilities, PV module counts, ESS capacity numbers) are shown in
Figure 6.
In
Figure 6, the four points represent the number of modules and ESS capacity when the target limiting probability is 0.001. When the ESS capacity increases, the number of modules has a great influence on the limiting probability. The values of each point are shown in
Table 1.
The optimal number of PV modules and ESS capacity can be calculated from their prices at
Figure 6. According to a report by Mirae Asset Daewoo Research 2020, “ESS connected to renewable energy” the ESS price is
$251/kWh and the cost of installing a PV system including PV modules is
$1.0/W. The results of calculating the price in the case of
Table 1 are summarized in
Table 2.
Therefore, Case 3 (ESS capacity: 25 MW, Number of Module: 67,510) can be proposed as the optimal point for satisfying 3-nine availability and having a minimum price.
B. Performance of Absorbing Markov Chain-Based Model.
When knowing the SOC of the ESS at a specific point, it is possible to predict a ESS’ lifetime until the battery is fully discharged based on the proposed absorbing Markov chain-based model. Considering the aging of the lithium-ion battery, it is considered as the time until the SOC of the battery remains more than 20% instead of complete discharge.
Here, we assume the initial battery SOC at 7:00 p.m. in four cases and estimate the time duration until the battery is discharged. The reason we assume the battery state at 7:00 p.m. as the initial state is that charging does not occur after 7:00 p.m. because the sun has set. As can be observed from the preceding data, no electricity is produced, only demand exists after sunset. Therefore, if the amount of power stored in the battery at sunset is insufficient, it may be difficult to meet the demand generated during the night. Using the absorption Markov chain model, it is possible to calculate the average time to discharge according to the state of the battery at sunset [
25]. This simulation condition is based on the previous embedded Markov chain results (limiting probability: 3 nines, ESS capacity: 25 MW, Number of Modules: 67,510).
The higher the SOC of the battery is, as shown in
Table 3, the more available the time. If the initial battery SOC is above 80%, there is no significant difference in available time. In addition, the initial battery SOC is important because overall availability varies significantly depending on the initial battery SOC.