1. Introduction
In recent years, the fast development of renewable energy sources provides a desired solution to the contradiction between energy and environment. The micro-grid appearing in this background is composed of distributed generation, energy storing devices, energy conversion device, loads and protection devices, etc. The micro-grid has characteristics such as like intermittency and volatility. So, the micro-grid connects to the utility grid, and the non-linear current output causes the frequency deviation of the utility grid and voltage fluctuation. With the increased proliferation of renewable energy such as solar and wind, micro-grid power generation brings more serious power quality problems towards the distribution systems [
1,
2].
Among all kinds of power quality improvement devices, the unified power quality conditioner (UPQC) has become major concern due to the perfect functionality for mitigation of voltage distortion and current perturbations [
3,
4,
5,
6]. The UPQC is composed of a series active power filters (SAPFs) and paralleled active power filters (PAPFs) [
7]. It not only can compensate reactive power and harmonic and negative sequence currents that are caused by non-linear loads or other electrical equipment, but can also compensate the voltage distorted at the source side [
8,
9,
10,
11]. After a rigorous literature survey was conducted, it can be summarized that the research on UPQCs is mainly focused on its topology, compensation instruction detection, compensation control strategy, etc. [
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25]. In regard to topology, a novel ten-switch topology for unified power quality conditioner was proposed in Reference [
12]. A transformer-less unified power quality conditioner with fast dynamic control was proposed in Reference [
14]. As for the power quality analysis, an output regulation-based unified power quality conditioner with Kalman filters was reported in Reference [
15]. A UPQC voltage sag detection based on the chaotic immune genetic algorithm was proposed in Reference [
16]. As for a control strategy, there is literature focused on control of UPQCs [
17,
18,
19,
20,
21,
22,
23,
24,
25]. A coordinated control strategy based on reactive power flow in a UPQC was proposed in Reference [
19]. Furthermore, the optimal design and control implementation of a UPQC based on variable phase angle control method was proposed in Reference [
22]. Improvement of unified power quality conditioner performance with an enhanced resonant control strategy was proposed in Reference [
24].
As for the problem of DC voltage control in a UPQC, in Reference [
25], a small signal stability model was developed based on the small signal stability and dynamic interaction problems caused by operation mode-switching in a UPQC. The stability and dynamic characteristics between an independent converter operating mode and combined series and parallel converter operating mode were analyzed. It was concluded that the control parameters of the DC bus voltage of the UPQC had a great influence on the small disturbance stability of the UPQC when switching operation mode, and in Reference [
26], the causes and the inevitability of the fluctuation of voltage on the DC side of UPQC was also discussed in detail. In Reference [
27], a proportional integral (PI) controller was used to maintain the voltage around the reference value for the DC-link side. Theparameters perturbation and load change simulation experiments were performed to verify the effectiveness, robustness, and reliability of the proposed control strategy. In Reference [
28], a novel control strategy was presented using a brain emotional learning-based intelligent controller, according to the structure and emotion processing mechanism of a brain emotional learning computational model, and a multi-objective control strategy was proposed for a UPQC device based on the proposed controller. In Reference [
29], since the DC-link voltage of the UPQC significantly deviates from its reference during a transient event, caused by load connection/disconnection or/and supply-side voltage sag/swell, an adaptive DC-link voltage controller was proposed which limits the DC link voltage deviation during transients and assures a negligible steady-state error. Therefore, the effectiveness of the proposed control schemes were demonstrated through simulations.
In the above reviewed literature, the PI control strategy was still the key solution for DC-link voltage control. On the condition of voltage sag, the DC-link bus of the UPQC plays a great role in the interchange of the energy of the series active power filter (SAPF) and parallel active power filter (PAPF). The PAPF compensates reactive power for load sides and extracts active power from the power grid to compensate DC-link bus voltage sag because of active power consumption in the SAPF. The energy coupling effect between the two filters decreases the stability of the DC-link bus, thus deteriorating UPQC performance. In addition, photovoltaic (PV) cell generation has the nature of intermittence and instability which can cause the DC-link bus voltage of UPQC to become unstable easily. Therefore, a new control scheme for the DC-link bus integrated with a PV charging module in the UPQC is proposed in this paper. To be specific, an optimization algorithm for a PI controller based on the chaos particle swarm optimization based on a multi-agent system (CPSO-MAS) is developed to overcome the issue of dynamic conditions involved in the micro-grid.
A conventional PI controller fails to perform well in the dynamic conditions of the micro-grid due to its poor ability to determine the appropriate values of proportional gain and integral gain. However, the proposed algorithm can optimize both parameters of the voltage loop PI controller. To further verify the effectiveness of the proposed scheme, a simulation and a scaled down laboratory prototype of the UPQC was developed. Through both simulation and experiment with the UPQC harmonic compensation, the compensated current waveform derived from the CPSO-MAS algorithm was proved fairly smooth and nearly sinusoidal towards various conditions in the micro-grid. Besides, the total harmonic distortion (THD) value based on the spectrum of the current waveform meets the national standard of power quality.
2. UPQC Mathematical Modeling
The PV micro-grid with the structure of three-phase three-wire presents a topology diagram as shown in
Figure 1. The UPQC was installed in the distribution system where the SAPF was placed at the source side and the PAPF was presented at the load side sharing the common DC-link, respectively [
30]. The common DC-link was connected by a capacitor. The PV array 1 was connected to the distribution line, and the PV array 2 was integrated to the common DC-link through the boost circuit for charging, and providing energy required for voltage compensation for the SAPF. Moreover, the PV array 2 can be also utilized to the critical load through the PAPF in the event of a power outage. The SAPF utilized to compensate the power supply voltage at the source side is interfaced to the grid through a transformer. The transformer can isolate the inverter from the power grid. The PAPF interfaced with the LC filter was connected to the non-linear load, and it was mainly used for compensation of harmonic current and reactive power.
The equivalent circuit can be presented with a two-phase static coordinate, and the
-axis equivalent circuit is shown in
Figure 2.
Where is the grid voltage and is the grid current. the is load current, is the load voltage, is the output voltage of the SAPF, and is the output voltage of the PAPF. denotes the sum of the line resistance and leakage resistance of the series transformer. denotes the sum of the inductance and leakage inductance of the series transformer. and are, respectively, the capacitance and inductance of the LC filter in the SAPF. and are, respectively, the capacitance and inductance of the LC filter in the PAPF. and are, respectively, the equivalent resistance in the SAPF and PAPF. and are, respectively, the inductive current in the SAPF and PAPF. and are, respectively, the input current to the distributed line from the SAPF and PAPF. is the voltage of the DC-link bus and is the capacitance interfaced to the DC-link bus.
As for the UPQC, if is stable, the SAPF will bypass, and the PAPF will improve the power factor in the distribution system by injecting reactive power. When drops, the SAPF will output the compensation voltage. is maintained in the normal range to make sensitive loads run stably. Then, the PAPF extracts active power from the grid and keeps the DC-link bus voltage stability. Let the equivalent parallel resistance of the DC-link bus loss be . Suppose that the load is a nonlinear load which includes and .
The PAPF of the UPQC mathematical model is presented in Equation (1) [
25]:
The SAPF of the UPQC mathematical model is presented as below:
The DC-link bus of UPQC mathematical model is presented in Equation [
25] (2):
As for the PAPF of the UPQC, the UPQC only compensates the harmonic and reactive current which is affected by loads, and the active current . The three-phase instantaneous reactive current cannot lead to the exchange energy with the utility grid. Therefore, the UPQC exchanges energy with the grid because of the active current . So, the conventional DC-link bus voltage of the UPQC is controlled by injecting active current from the grid.
The PV charging scheme is provided to keep the DC-link bus stable. Therefore, the DC-link bus can obtain energy from the PV cell. However, the drawback is that it is difficult to control it perfectly due to the dynamic conditions such as volatile temperature. So, it is necessary to exploit a better control strategy to ensure the voltage stability and the UPQC performance.
4. The Optimization Algorithm for a PI Controller of a DC-Link Bus Integrated PV Charging Module
4.1. Particle Swarm Optimization Algorithm
The particle swarm optimization (PSO) algorithm is an evolutionary computation based on swarm intelligence [
28]. The algorithm originates from the simulation of the clustering behavior of birds and fish. In the algorithm, every individual is thought to be a particle whose mass and volume is neglected. The individual has the property of speed and position, and another fitness value that determines the particle performance. Then these particles form a swarm. The particle changes its own action and makes progress to the optimal direction through the sharing of information and its own experience when it is searching.
According to the PSO algorithm, the particles are initialized to be a bunch of random particles firstly. Then they search for the optimal solution by iteration. In every iteration, they obtain the information that includes individual optimum
and global optimum
by sharing information. Herein,
is the optimal solution that the particle searches by itself.
is the optimal solution that the entire population searches. Then, the particles update their action according to the shared information. Consider a swarm with
particles stored in the D-dimension searching space. The position vector
and speed vector
of the
particle are given as [
33]:
The
particle updates the speed and position at the searching space according to Equation (14).
where
is iteration times and
is the particle space position of
iteration.
is the particle speed of the
iteration.
is the inertia constant.
,
is the study factor.
,
is the random value with the range of (0, 1).
A standard flowchart of the PSO is shown in
Figure 8.
Compared to genetic algorithms, PSO is easier to perform and faster to converge due to the presence of fewer parameters. Thus, a number of PSO-based algorithms have been developed to get better control performance [
34,
35,
36].
4.2. Particle Swarm Optimization Based on Multi-Agent System Algorithm
Particle swarm optimization based on a multi-agent system algorithm (PSO-MAS) is an improved algorithm characterized by the features of a PSO algorithm and multi-agent system [
33,
34]. Each particle can be viewed as an agent, so each particle not only updates the evolution principle of the PSO algorithm, but also participates in the competition and cooperation with its neighbor. Then, it exchanges information with global optimal particle and updates the current speed and position of particles in accordance with its own experience and the global optimum. Thereby, the particles change the strategy and improve the convergence speed and optimization ability of the swarm.
It is assumed that any agent in the swarm is
, which is equivalent to a particle in the PSO algorithm. Each particle corresponds to a fitness value determined by the optimization problem. As for the DC-link voltage loop PI controller, the agent fitness function is determined in Equation (15).
The agent employs to corresponding action with the environment and makes the fitness value optimal under the premise of satisfying the constraint conditions.
Every agent is randomly initialized in an environment with a total number of
grids. Each agent occupies a grid. The data in the grid represents the position information in the environment of agent. Each agent itself contains two data information of the speed and position of each particle in PSO.
is a positive integer. The total number of cells corresponds to the swarm size in a PSO. In addition, a local environment needs to be defined to obtain the basis for supporting the particle’s next action strategy. In the local environment model, the coordinates (x, y) are used to represent the physical position of the particle. Let
denote the current particle whose coordinates are (x, y), where,
, (
represents the global environment), the local environment of
is defined as
[
33]:
In the PSO-MAS algorithm, each agent has to compete and cooperate with
particles in the local environment. It is assumed that the current agent is
and
is obtained after competition and cooperation. So,
is the agent whose fitness value is minimum in its neighbor particles. If the fitness value of the current particle is less than the fitness value of the optimal neighbor particle, its performance is better. The position in the solution remains unchanged. Otherwise, search and update will be conducted according to the following equation to the approaching position [
34]:
In accordance with Equation (17), even if the current agent is replaced, it still retains its original useful information and fully absorbs the useful information of its neighbors. Therefore, the PSO-MAS algorithm not only enhances the evolutionary performance of the multi-agent system, but also increases the population diversity of the particle swarm which makes information update between agents faster and more efficient.
4.3. Chaos Local Search
Chaos is a complex and irregular behavior generated by non-linear systems. But the seemingly irregular chaotic movement has exquisite internal regularity. It has the characteristics of randomness, ergodicity, and sensitivity to initial conditions. The chaos local search (CLS) algorithm begins at a possible solution of a problem and searches for a solution which reaches a certain standard [
37]. The solution replaces the original solution and continues to iterate until the total number of iterations reaches the preset upper limit. In the process of calculation, the CLS retains a certain number of fine particles and shrinks the region dynamically according to the swarm’s best position. The many neighborhood points of the local optimal solution are produced in the iteration. These random particles replace the particles of poor performance in the shrinking region so that the search process has the ability to avoid falling into local minima.
4.4. The Optimization Algorithm for a PI Controller Based on CPSO-MAS
Due to the good performance of CPSO-MAS, this algorithm can be applied to regulate parameters of voltage PI loop in the photovoltaic charging module, as shown in
Figure 6. The DC-link bus reference voltage
was set to be 600 V. The calculated error between
and
is the input to the PI controller. The optimization algorithm steps for CPSO-MAS-based PI controller are as follows:
(1) Initialization and parameter setting. The grid environment based on the multi-agent system is constructed, and each agent is equipped with eight neighboring particles, which constitutes a local environment of interactive communication together. In this multi-agent environment, the particle swarm number was set to 100, maximum allowable iteration number was set to 100, and inertia weight was set to 0.63. The learning factor was set to 1.28. The particle’s position and velocity in the solution space was initialized. Parameters and were set with the range of (15, 50) and (0.15, 0.86), respectively.
(2) Objective function evaluation. The objective function is given as [
33]:
,
,
,
are weights, and
,
,
is the output of the controlled voltage
.
denotes
as shown in
Figure 6, i.e., the output of the PI controller. Thereby, a set of PI control parameters (
,
) is obtained to minimize the value of the system objective function
based on CPSO-MAS so that the generated current
will perform satisfactorily for UPQC towards various conditions related the distorted voltage and current.
(3) Fitness recalculation after particle agent competition and cooperation. Each particle agent competes and cooperates with eights neighbors and updates the fitness value. According to the action strategy formula [
33]:
where,
.
If
, it is a high-quality particle, otherwise it is a low-quality particle. If agent
is the high-quality particle, its position in the solution space will remain unchanged. On the contrary, agent
will update the position in the solution according the following formula:
where, rand (−1, 1) is random number within the interval of (−1, 1). If
, then
.
,
is the lower and upper limit of the feasible solution space of the optimization problem.
(4) Agent particle updating in the solution space. The position and speed of each agent particle is updated in the solution space. According to PSO iteration formula [
33]:
where, set
,
.
(5) The fitness value calculation is based on CLS. Twenty percent of the particles with the best performance in the group is selected to conduct chaotic local search. The individual extreme value of each particle and all values in the group are updated. If the algorithm meets the end condition of optimization, the search stops and the optimal solution is obtained and parameters (, ) are obtained. Otherwise, the algorithm turns to step (6).
(6) Fitness recalculation of each particle. The search region is shrunk according to the following formula [
32]:
where
is the
dimension variable of the current individual extreme value. The remaining 80% of the particles in the population are generated in the contracted space, and the algorithm turns to step (2).
After that, is the output that is optimized by CPSO-MAS in the voltage PI controller. The with capacitance current generates a PWM signal which drivers IGBT to output the DC-link bus voltage .
5. UPQC Harmonic Detection and Control Strategy
In order to validate the effectiveness of the DC-link bus voltage control strategy, the harmonic elimination in the UPQC is taken as an example to illustrate the influence of the DC-link bus voltage control to the UPQC performance. The schematic of the harmonic detection based on
in the PAPF of the UPQC is shown in
Figure 9.
Where
,
,
is load current,
ua is the grid voltage of phase
which provides voltage phase signal via phase locked loop (PLL).
and
are calculated by
C32 transformation according to
,
,
.
ip and
iq are obtained with
transformation according to
,
. Next,
and
, i.e., the fundamental positive sequence component of
and
, are gained through a low pass filter (LPF). Next, the three-phase fundamental current
,
ibf and
are obtained by calculation of
and
C23, respectively. Thereby, the compensation current signal instructions of UPQC, i.e.,
,
, and
are obtained according to
Figure 10.
where
Furthermore, in
Figure 10, the DC-link bus voltage
and reference voltage
forms a voltage loop by the PI controller in order to exchange energy between the PAPF and grid side when the PV charging module cannot provide enough energy. Then the obtained harmonic compensation instruction
,
,
are obtained after subtraction between the instruction calculation current
,
,
and
,
,
. The PWM signals generated by harmonic compensation instruction drive the IGBT module to compensate the non-linear loads.