Next Article in Journal
PSO Optimized Active Disturbance Rejection Control for Aircraft Anti-Skid Braking System
Next Article in Special Issue
Optimized Score Level Fusion for Multi-Instance Finger Vein Recognition
Previous Article in Journal
Binary Horse Optimization Algorithm for Feature Selection
Previous Article in Special Issue
Neuroevolution for Parameter Adaptation in Differential Evolution
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Open-Loop Control of Discrete Deterministic Systems by Application of the Perch School Metaheuristic Optimization Algorithm

by
Andrei V. Panteleev
* and
Anna A. Kolessa
Department of Mathematics and Cybernetics, Moscow Aviation Institute (National Research University), 4, Volokolamskoe Shosse, 125993 Moscow, Russia
*
Author to whom correspondence should be addressed.
Algorithms 2022, 15(5), 157; https://doi.org/10.3390/a15050157
Submission received: 7 April 2022 / Revised: 28 April 2022 / Accepted: 5 May 2022 / Published: 7 May 2022
(This article belongs to the Special Issue Metaheuristic Algorithms and Applications)

Abstract

:
A new hybrid metaheuristic method for optimizing the objective function on a parallelepiped set of admissible solutions is proposed. It mimics the behavior of a school of river perch when looking for food. The algorithm uses the ideas of several methods: a frog-leaping method, migration algorithms, a cuckoo algorithm and a path-relinking procedure. As an application, a wide class of problems of finding the optimal control of deterministic discrete dynamical systems with a nonseparable performance criterion is chosen. For this class of optimization problems, it is difficult to apply the discrete maximum principle and its generalizations as a necessary optimality condition and the Bellman equation as a sufficient optimality condition. The desire to extend the class of problems to be solved to control problems of trajectory bundles and stochastic problems leads to the need to use not only classical adaptive random search procedures, but also new approaches combining the ideas of migration algorithms and swarm intelligence methods. The efficiency of this method is demonstrated and an analysis is performed by solving several optimal deterministic discrete control problems: two nonseparable problems (Luus–Tassone and LiHaimes) and five classic linear systems control problems with known exact solutions.

1. Introduction

Global optimization methods are widely used in engineering calculations when designing complex technical systems, problems of information processing, decision making and optimal control, machine learning, etc. [1,2,3,4,5,6,7]. As a rule, the application of the necessary optimality conditions for finding the optimal open-loop control of discrete deterministic systems is associated with the solution of a boundary value problem for a system of difference equations, the computational complexity of which greatly increases with the state vector dimension. The use of quasi-gradient optimization methods often leads to obtaining locally optimal solutions. The use of direct optimization methods, such as the Luus–Jaacola method, makes it possible to obtain a result with the desired accuracy with an acceptable number of runs of the algorithm. Objective functions can be multi-extremal, nondifferentiable, difficult to calculate, or generally described by a black box model. Therefore, the application of necessary and sufficient optimality conditions, as well as quasi-gradient methods of the first and second orders, could be difficult. Among the zero-order methods, in which the search direction is determined only by information about the value of the objective function, two groups can be distinguished: stochastic metaheuristic algorithms [8,9,10,11,12,13,14,15,16] and deterministic methods of mathematical programming [17,18,19]. The former have gained popularity among applied mathematicians and engineers because they use various original heuristics driven by higher-level algorithms. Proof of convergence for these algorithms is usually complex and, thus, hard to provide, but they are able to obtain solutions of sufficiently good quality in an acceptable time. The second group of methods is popular among mathematicians, since, under certain conditions, it can guarantee convergence to the global extremum. In the first group, it is conditionally possible to single out four well-known areas of research.
The first one includes evolutionary methods, in which the search process is associated with the evolution of a solutions population. These include Genetic Algorithms, Differential Evolution, Artificial Immune Systems, Covariance Matrix Adaptation Evolution Strategy, Scatter Search, Variable Mesh Optimization, Memetic Algorithms, Weed Colonization, Cuckoo Search, Self-Organizing Migrating Algorithms, etc.
The second area is represented by Swarm Intelligence algorithms: Particle Swarm Optimization, Artificial Bee Colony, Ant Colony Optimization, Bat-Inspired Algorithm, Cat Swarm Optimization, Firefly Algorithm, Grey Wolf Optimizer, Whale Optimization Algorithm, Bacterial Foraging Optimization, Fish School Search, Glowworm Swarm Optimization, Shuffled Frog-Leaping Algorithm, Krill Herd, Imperialist Competitive Algorithm, Stochastic Diffusion Search, Human Group Optimization Algorithm, etc. In this area, new original algorithms are being created and considered [20]. In the methods of this group, members of the populations exchange information during the search process and use information about the whole population leaders, local leaders among neighbors, and their own best results.
The third area is represented by physics-based algorithms: Central Force Optimization, Simulated Annealing, Adaptive Simulated Annealing, Harmony Search, Big Bang–Big Crunch, Spiral Dynamics Algorithm, Fireworks Algorithm, Grenade Explosion Method, Intelligent Water Drops Algorithm, Electromagnetism-like Mechanism, etc.
The fourth area is represented by multistart-based algorithms: Greedy Randomized Adaptive Search, Tabu Search, etc. In the four listed areas, a group of bio-inspired algorithms can be distinguished due to their nature-inspired interpretations [2,16].
This article is devoted to the development of the swarm intelligence group of methods based on observing the hunting of a school of river perches and hybridization mechanisms. Given the validity of the free lunch theorem [21], the problem of designing new optimization methods aimed at solving a certain class of applied problems remains relevant. The problem of creating new metaheuristic algorithms is important, since there is no universal optimization algorithm suitable for solving all problems of finding a global extremum. Therefore, almost constantly, new search procedures are proposed.
The aim of this work is to develop a new metaheuristic optimization algorithm that can be effective for solving a wide class of problems in optimal control theory. The metaheuristic algorithm proposed in this article can be applied to solve a wide class of problems of parametric optimization of continuous and discrete, deterministic and stochastic control systems in the presence of uncertainty in setting the initial conditions, parameters of the object model and the measuring system. In particular, the method should allow for solving both classical and nonclassical nonseparable optimization problems for nonlinear discrete deterministic dynamical control systems [1]. In addition to classical problems, problems of model predictive control theory could be solved [22].
A method for optimizing the objective function on a parallelepiped set of admissible solutions is proposed herein. It mimics the behavior of a school of river perch when looking for food. The method is hybrid because it uses the ideas of migration algorithms [23], methods that imitate the behavior of cuckoos [24] and frogs [25], and path-relinking procedures [26]. The advantage of the proposed optimization algorithm lies in the ability of feasible solutions to leave the area of attraction of local extremum points. This is achieved both through the main technique, which implements the movement of perches inside the perch cauldron, and through the high-level management of the use of well-established heuristic techniques such as Levy flights to explore distant areas of the set of feasible solutions, regular mixing of schools in order to exchange information, and path-relinking procedures.
The efficiency of this method is demonstrated and an analysis is performed by solving optimal deterministic discrete control problems: two nonseparable problems (Luus–Tassone and Li–Haimes) and five classic linear systems control problems with known exact solutions.

2. Materials and Methods

2.1. Open-Loop Control Problem

Let us consider a nonlinear dynamical system described by a difference equation
x ( t + 1 ) = f ( t , x ( t ) , u ( t ) ) ,
where t represents discrete time, t T = [ 0 , 1 , , N 1 ] ; the number of stages N is given; x is the state vector, x R n ;   u is the control vector, u U ( t ) R q ;   U ( t ) is the control values feasible set which, for each t , comprises the direct product of segments [ a i ( t ) , b i ( t ) ] , i = 1 , 2 , , q ; and f ( t , x , u ) = ( f 1 ( t , x , u ) , , f n ( t , x , u ) ) T is a continuous vector function.
The initial state vector is given:
x ( 0 ) = x 0 .
It is assumed that only the discrete time information t is used in the control law, so open-loop control is considered.
The set of allowable processes D ( 0 , x 0 ) is a set of pairs d = ( x ( ) , u ( ) ) , where x ( ) = { x 0 , x ( 1 ) , , x ( N ) } is a trajectory, u ( ) = { u ( 0 ) , u ( 1 ) , , u ( N 1 ) } is an admissible control, and u ( t ) U ( t ) , satisfying the state Equation (1) and initial condition (2).
The cost functional on the set D ( 0 , x 0 ) is defined as
I ( d ) = t = 0 N 1 f 0 ( t , x ( t ) , u ( t ) ) + F ( x ( N ) ) ,
or
I ( d ) = F ( x ( 0 ) , , x ( N ) ; u ( 0 ) , , u ( N 1 ) ) ,
where f 0 ( t , x , u ) , F ( x ) , F ( x ( ) , u ( ) ) are given continuous functions.
It is required to find a pair d * = ( x * ( ) , u * ( ) ) D ( 0 , x 0 ) that minimizes the cost functional, i.e.,
I ( d * ) = min d D ( 0 , x 0 ) I ( d ) .
The required trajectory x * ( ) = { x 0 , x * ( 1 ) , , x * ( N ) } and control u * ( ) = { u * ( 0 ) , u * ( 1 ) , , u * ( N 1 ) } (elements of a pair d * ) are the optimal trajectory and optimal control, correspondingly.
To solve the problem (5), an algorithm that imitates the behavior of a perch school is proposed. This algorithm belongs to the class of nature-inspired metaheuristic swarm intelligence algorithms [9,11,16]. This problem was also solved using the Sparrow Colony Optimization Method, with results close to those received in this paper [27].

2.2. Bio-Inspired Metaheuristic Optimization Method

The problem of finding the global extremum of the objective function f ( x ) = f ( x 1 , , x n ) on the set of feasible solutions of the form D = [ a 1 , b 1 ] × × [ a n , b n ] is considered.
The Perch School Search method (PSS) imitates the behavior of a European river perch school. Medium-sized perches make schools of 5–12 individuals in order to find food. The smallest perches make schools of approximately 100 individuals. Generally, they take the fish-prey into a circle (perch boiler) and hold it there. Perches attack prey at the boiler edge, moving to its center. To find new food sources, perches use a migration mechanism. Usually, bigger perches swim at the depths, in holes and deep water, hunting by themselves. Rarely, they also make schools, like small ones do, in order to fight with other predatory fish (pikes and sanders). The river perch uses a very aggressive hunting model: it actively pursues the prey, sometimes jumping out even on the surface of the water. The biggest perches are solitary and independent predators. The reason for this is that the big perches do not need to hunt collectively. These perches can hunt by themselves on any size fish available. In the case of feed places, lacking a regular habitat, perch begin to move and search for places full of small fish and other food.
Finite sets I = { x j = ( x 1 j , x 2 j , , x n j ) T , j = 1 , 2 , , N P } D of possible solutions, named populations, are used for solving the problem of finding a global constrained minimum of an objective function, where x j is a perch-individual (potential feasible solution) with number j , and N P is the population size.
In the beginning of the process, the method creates a population by the uniform distribution law on a feasible solution set, D , illustrated in Figure 1a. All solutions are ordered by increasing objective function value. After that, the created population is divided into several schools. The schools form a sequence: The best solution is placed in the first school, the next is placed in the second, etc. The M - th is placed in the M - th school, and the ( M + 1 ) - th is then placed in the first school, and so on. This process is shown in Figure 1b. The described process corresponds to the information exchange among population members for an efficient approach to the global extremum.
In each school, the leader (the best possible solution in the school) is determined by the value of the objective function. Every school makes a perch boil in which there is a hunt, illustrated in Figure 1c. At that time, every perch in the school moves to its leader, exploring the boil edge. During this movement, perches remember their best position. As a result, new leaders’ and members’ positions for each school are found. Among the school leaders, the best and the worst leaders are found.
The perch boiler in the school with the best leader is realized, so perches of this school search the entire area in depth. As a result, the new absolute leader is found.
After that, among all schools, the school with the worst leader is chosen. This leader then moves into another area of the feasible solution set. For that, the school leader’s movement is realized by a Lévy distribution with verification of belonging to the feasible solution set. The uniform distribution law on the parallelepiped set generates the remaining members of the school. A doubled distance from the leader to the nearest edge of set D describes the size of the parallelepiped set. The received school realizes a perch boil and a new leader is found.
The rest of the schools swim in the direction of the current absolute leader of the whole feasible set. At this time, each local school leader moves in a straight line to the global leader. Other members of the school move parallel to the global leader. In this movement, all perches remember their best position and stay there.
In the end of the migration of all schools, the absolute leader is put into the Pool set. Then, the division of the entire population into schools begins again. Therefore, another new global iteration starts until it reaches a given number.
At the final step, leaders in the Pool set (identified at each global iteration of the algorithm) interact with each other. After a given number of times in the Pool set, three perches are chosen, and a path-relinking operation [26] is realized. As a result, this set is replenished by one more solution.
After the end of the path-relinking operation among the Pool set elements, the best answer is found, which is considered to be an approximate solution of the optimization problem.
The proposed method corresponds to hybrid algorithms because it contains ideas described in Shuffled Frog-Leaping Optimization (division into schools) [28], Cuckoo Search (Lévy flights) [29], Self-Organizing Migrating Algorithm (movement to the leader) [23] and the path-relinking algorithm [26] (searching in the Pool set).
Below is a detailed description of the algorithm.
Step 1. Set the parameters of the method:
  • Controlling parameter N S t e p , which specifies number of steps before the end of the movement;
  • Number of schools in the population, M ;
  • Number of perches in a school, s ;
  • Number of perches in the population, N P = s M ;
  • Stop parameter I t e r max , which specifies the maximum number of iterations;
  • Levy distribution parameter λ ;
  • Step size α ;
  • Maximum number of path-relinking operations, P R max ;
  • Number of steps in the path-relinking operation, Δ p r .
Step 2. Creation of the initial perch population.
Step 2.1. Create population I = { x j = ( x 1 j , x 2 j , , x n j ) T , j = 1 , 2 , , N P } D of N P solutions (perches) with randomly generated coordinates x i from the segment [ a i , b i ] using a uniform distribution:
x i j = a i + r a n d i [ 0 ,   1 ] ( b i a i ) , i = 1 , , n ; j = 1 , , N P
where r a n d i [ 0 ,   1 ] is the uniform distribution law on the segment [ 0 ; 1 ] .
Step 2.2. For each solution (perch) in the population, calculate the value of the objective function.
Set the value: i t e r = 0 (global iteration count).
Step 3. Separation of the whole population into schools.
Step 3.1. Arrange the solutions in the population in ascending order of objective function values.
Step 3.2. Make M -many schools of s perches each: The best solution (with the smallest value of the objective function) is put into the first school, the next is put into the second, etc. The M - th is put into the M - th school, and the M + 1 - th is put into the first school, etc. As the result, there are M -many schools of s perches each, so N P = M s . The first perch placed in a school is its leader x l o c , m , m = 1 , , M . The leader of the first school is also the leader of the whole population (the global leader): x l o c , 1 = x g l o b .
Step 4. Realization of the perch boiler in each school.
Step 4.1. For each school m = 1 , , M , do the following.
Move each perch to the school’s leader near its border:
x j , m , k = x j , m + k ( x l o c , m x j , m ) N s t e p   , k = 0 , 1 , , [ σ N s t e p ] ; j = 1 , , s ;
where x j , m is the initial position of perch j in school m ; x j , m , k is a position of this perch during movement; x l o c , m is a position of the school leader numbered m ; [ ] is the integer part of a number; and boiler parameter σ [ 0 , 1 ; 0 , 5 ] is generated via the uniform distribution law at each iteration for each school independently.
After all steps, find the best step for each perch (the step at which the value of the objective function was the smallest); the perch must be in this best position x j , m , n e w :
x j , m , n e w = argmin k = 0 , 1 , , [ σ N s t e p ] f ( x j , m , k ) , j = 1 , , s .
Step 4.2. Choose a new leader in each school: x l o c , m , n e w , m = 1 , , M .
Step 4.3. Arrange the schools in the population in ascending order of objective function values of its leaders. The global leader is in the first school: x l o c , 1 = x g l o b ; local leaders are in other schools: x l o c , m , m = 2 , , M ; and the maximum value of the objective function among leaders is in the school numbered M .
Step 5. School swimming with the global leader.
Step 5.1. Move each perch toward the global leader, moving along the line connecting them (approaching and then receding in the same direction):
x j , 1 , k = x j , 1 + k ( x g l o b x j , 1 ) N s t e p   , k = 0 , 1 , , [ σ 1 N s t e p ] ; j = 1 , , s ;
where boiler parameter σ 1 [ 1 ; 1 , 5 ] is generated via the uniform distribution law at each iteration.
Step 5.2. After all steps, find the best step for each perch (the step at which the value of the objective function was the smallest); the perch must be in this best position x j , 1 , n e w :
x j , 1 , n e w = argmin k = 0 , 1 , , [ σ 1 N s t e p ] f ( x j , 1 , k ) , j = 1 , , s .
Step 5.3. Choose a new leader in the school, x l o c , 1 , n e w = x g l o b , n e w .
Step 6. School swimming with the worst leader.
Step 6.1. Leader swimming: A new position for the leader is randomly generated via the Levy distribution:
x i l o c , M , n e w = x i l o c , M + α i t e r L e v y i ( λ ) , i = 1 , , n ,
where x i l o c , M is the position coordinates of the school’s leader in the current iteration, α is the step size, and λ ( 1 ; 3 ] . To generate a random variable according to the Levy distribution, the following is required:
For each coordinate x i = L e v y i ( λ ) , generate a number R i , i = 1 , , n via the uniform distribution law on the set [ ε ; b i a i ] , where ε = 10 7 is the distinguishability constant, and do the following:
  • Generate numbers θ i = R i 2 π and L i = ( R i + ε ) 1 λ , i = 1 , , n , where λ is a distribution parameter;
  • Calculate values of coordinates using the formulas below:
    x i ( λ ) = L i sin θ i ,   i = 1 , , [ n 2 ] ;   x i ( λ ) = L i cos θ i ,   i = [ n 2 ] + 1 , , n
If the obtained value of the coordinate x i does not belong to the set of feasible solutions, that is, x i [ a i ; b i ] , then repeat the generation process for the coordinate x i .
Step 6.2. Generate new positions of school members x j , M via the uniform distribution on a parallelepiped formed by the direct product of segments [ x i l o c , M x ^ i , x i l o c , M + x ^ i ] , where x ^ i = min { ( x i l o c , M a i ) , ( b i x i l o c , M ) } .
Step 6.3. Make a perch boiler in the obtained school.
Move each perch to a school’s leader near its border:
x j , M , k = x j , M + k ( x l o c , M , n e w x j , M ) N s t e p   , k = 0 , 1 , , [ σ 3 N s t e p ] ; j = 1 , , s ;
where boiler parameter σ 3 [ 0 , 1 ; 0 , 5 ] is generated via the uniform distribution law at each iteration.
After all steps, find the best step for each perch (the step at which the value of the objective function was the smallest); the perch must be in this best position x j , M , n e w :
x j , M , n e w = argmin k = 0 , 1 , , [ σ 3 N s t e p ] f ( x j , M , k ) , j = 1 , , s .
Step 6.4. Choose a new leader for the school, x l o c , M .
Step 7. Swimming of the remaining schools.
For all schools where m { 2 , , M 1 } , do the following.
Step 7.1. Move the school’s leader to the current absolute (best) leader:
x l o c , m , k = x l o c , m + k ( x g l o b , n e w x l o c , m ) N s t e p   , k = 0 , 1 , , [ σ 2 N s t e p ] ;
where boiler parameter σ 2 [ 0 , 6 ; 0 , 8 ] is generated via the uniform distribution law at each iteration for each school, independently.
Step 7.2. Make the movement of other school members parallel to that of the absolute leader:
x j , m , k = x j , m + k ( x g l o b , n e w x l o c , m ) N s t e p   , k = 0 , 1 , , [ σ 2 N s t e p ] ; j = 1 , , s .
After all steps, find the best step for each perch (the step at which the value of the objective function was the smallest); the perch must be in this best position x j , m , n e w :
x j , m , n e w = argmin k = 0 , 1 , , [ σ 2 N s t e p ] f ( x j , m , k ) , j = 1 , , s .
Step 7.3. Every school member must take their best position reached in the swimming process. Choose a new school leader with position x l o c , m , m { 2 , , M 1 } .
Step 8. Finding the new global leader of the population among the schools’ leaders.
Choose the best solution among the solutions, corresponding to the leaders, and put it into the set P o o l .
Step 9. Checking the stop conditions.
If i t e r = I t e r max , go to Step 10. Otherwise, set i t e r = i t e r + 1 and go to Step 3.
Step 10. Intensive searching in the set P o o l .
Step 10.1. Set the value p r = 1 .
Step 10.2. Choose three different random solutions x p o o l p , x p o o l q , x p o o l r in P o o l .
Step 10.3. Find the solution
x p o o l p q = a r g min j = 1 , , Δ p r 1 f ( x p o o l p + j ( x p o o l q x p o o l p ) / Δ p r )
Step 10.4. Add the solution
x n e w = a r g min j = 1 , , Δ p r 1 f ( x p o o l p q + j ( x p o o l r x p o o l p q ) / Δ p r )
into P o o l . Set the value p r = p r + 1 .
Step 10.5. If p r > P R max , go to Step 11. Otherwise, go to Step 10.2.
Step 11. Choosing the best solution. Among the solutions in P o o l , find the best one, which is considered to be an approximate solution of the optimization problem.
During the creation and testing of this algorithm, the following recommendations were developed for choosing parameters.
Parameter selections within the following ranges provide better results: controlling parameter N S t e p [ 50 ; 100 ] ; number of schools in the population M [ 3 ; 10 ] ; number of perches in a school s [ 15 ; 100 ] ; stop parameter I t e r max [ 50 ; 4000 ] ; maximum number of path-relinking operations P R max [ 10 ; 20 ] ; number of steps in path-relinking operation Δ p r [ 5 ; 10 ] ; Levy distribution parameter λ ( 1 ; 3 ] ; step size α [ 0.2 ; 0.6 ] .
The proposed algorithm can be applied in the theory of optimal control using spectral and pseudospectral methods for representing trajectories and control laws in the form of expansions in basic systems with indeterminate coefficients.

3. Results

3.1. Example 1. The One-Dimensional Optimal Control Problem with an Exact Solution

The behavior of the control object is described by the difference equation:
x ( t + 1 ) = x ( t ) + u ( t ) ,
where x R ; t = 0 , 1 , , N 1 ; u R .
The cost functional on the feasible set of processes is:
I = 1 2 t = 0 N 1 γ t u 2 ( t ) + x ( N ) , γ > 1 .
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Analytical solution: u * ( t ) = γ t ; x * ( t ) = x 0 + ( 1 γ k ) γ 1 ; min I = x 0 + 1 γ N 2 ( γ 1 ) .
Initial condition: x ( 0 ) = 0 ; control constraint: 2 10 4 u 2 10 4 . Number of stages: N = 50 . The exact value of the functional is min I = 581 . 954264398477 .
The obtained approximate value of the functional was I * = 581 . 954264313551 , with a runtime of 5.36 min. We used C# (7.2) with NET Framework (4.7.2), Windows Forms to implement this algorithm and solve the optimal control problems. A 3.3 GHz Intel Core i7-2860QM was used to solve this problem and the others below. Table 1 shows the parameters of the method for solving this problem. Figure 2 shows the obtained control and corresponding trajectory.
To establish the relationship between the parameters and the effectiveness of the method, we solved the problem with other parameters of the method, which are shown in Table 2. The initial condition, control constraint, and number of stages were the same. Figure 3 shows the obtained control and corresponding trajectory.
The obtained approximate value of the functional was I * = 569 . 064908725205 , and the runtime was 0.31 min. In this case, the obtained approximate value of the functional was worse than that before because of the different parameters.
Below is a solution of the same problem from Example 1, but the number of stages was N = 80 ; this is shown in order to demonstrate the growth of time for finding the solution. The parameters of the method are shown in Table 1. Figure 4 shows the obtained control and corresponding trajectory.
The exact value of the functional is min I = 10237 . 001072927329 . The obtained approximate value of the functional was I * = 10237 . 0010729268 , and the runtime was 09:31 min.
Example 1 demonstrates the possibility of solving optimal control problems for economic models with a quality criterion that includes discounting. A similar problem occurs in machine learning problems such as deep reinforcement learning.

3.2. Example 2. Luus–Tassone Nonseparable Control Problem

The behavior of the control object is described by the system of difference equations:
x 1 ( t + 1 ) = x 1 ( t ) 1 + 0 , 01 u 1 ( t ) ( 3 + u 2 ( t ) ) ; x 2 ( t + 1 ) = x 2 ( t ) + u 1 ( t ) x 1 ( t + 1 ) 1 + u 1 ( t ) ( 1 + u 2 ( t ) ) ; x 3 ( t + 1 ) = x 3 ( t ) 1 + 0 , 01 u 2 ( t ) ( 1 + u 3 ( t ) ) ,
where x R 3 , t = 0 , , N 1 , and u R 3 .
The cost functional on the feasible set of processes is:
I = x 1 2 ( N ) + x 2 2 ( N ) + x 3 2 ( N ) + + [ ( k = 1 N x 1 2 ( k 1 ) + x 2 2 ( k 1 ) + 2 u 3 2 ( k 1 ) ) ( k = 1 N x 3 2 ( k 1 ) + 2 u 1 2 ( k 1 ) + 2 u 2 2 ( k 1 ) ) ] 1 2 .
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Initial state vector x ( 0 ) = ( 2 ; 5 ; 7 ) T ; control constraints: 0 u 1 ( t ) 4 ;   0 u 2 ( t ) 4 ; 0 u 3 ( t ) 0.5 . Number of stages: N = 20 .
The best known value of the cost functional obtained for this problem is min I = 209.26937 [1].
The obtained approximate value of the functional was I * = 209 . 429533683522 , and the runtime was 49.92 s. Table 3 shows the parameters of the method for solving this problem. Figure 5 and Figure 6 show the obtained control and corresponding trajectory.

3.3. Example 3. Li–Haimes Nonseparable Control Problem

The behavior of the control object is described by difference equations:
x ( 1 ) = x ( 0 ) u ( 0 ) ; x ( 2 ) = ( 1 + u ( 1 ) ) x ( 1 ) ; x ( 3 ) = x ( 2 ) + u ( 2 ) ,
where x R , t = 0 , 1 , 2 , and u R .
The cost functional on the feasible set of processes is:
I = [ x 2 ( 0 ) + x 2 ( 1 ) + ( 2 x 2 ( 2 ) + x 2 ( 3 ) ) exp ( x 2 ( 1 ) ) ] × [ 50 + u 2 ( 0 ) + ( u 2 ( 1 ) + u 2 ( 2 ) ) exp ( u 2 ( 0 ) ) ] 1 / 2 .
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Initial condition: x ( 0 ) = 15 ; control constraint: 1 u ( t ) 1 , t = 0 , 1 , 2 . Number of stages: N = 3 . The best known value of the cost functional obtained for this nonseparable control problem is min I = 1596.4796778 [1].
The obtained approximate value of the functional was I * = 1596.47967783389 , with a runtime of 00.19 s. Table 4 shows the parameters of the method for solving this problem. Figure 7 shows the obtained control and corresponding trajectory; the numerical solution of the problem is presented in Table 5.
Examples 2 and 3 are devoted to solving a class of optimal control problems with nonseparable cost functionals, for which it is difficult to apply necessary and sufficient optimality conditions for discrete deterministic dynamical systems.

3.4. Example 4. The Two-Dimensional Lagrange Optimal Control Problem with an Exact Solution

The behavior of the control object is described by a system of difference equations:
x 1 ( t + 1 ) = x 1 ( t ) + u ( t ) ; x 2 ( t + 1 ) = 2 x 1 ( t ) + x 2 ( t ) ,
where x R 2 , t = 0 , 1 , and u R .
The cost functional on the feasible set of processes is:
I = t = 0 1 [ x 1 2 ( t ) + x 2 2 ( t ) + u 2 ( t ) ]
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Initial state vector x ( 0 ) = ( 2 ; 1 ) T ; control constraint: 10 5 u 10 5 . Number of stages: N = 2 . The exact solution could be found as u * = { 1 ; 0 } ; x 1 * ( ) = { 2 ; 1 ; 1 } ; x 2 * ( ) = { 1 ; 5 ; 7 } . The exact value of the cost functional is min I = 32 .
The obtained approximate value of the functional was I * = 32 , and the runtime was 01.58 s. Table 6 shows the parameters of the method for solving this problem. Figure 8 shows the obtained control and corresponding trajectory.

3.5. Example 5. The Two-Dimensional Meyer Optimal Control Problem with an Exact Solution

The behavior of the control object is described by a system of difference equations:
x 1 ( t + 1 ) = x 2 ( t ) ; x 2 ( t + 1 ) = x 1 ( t ) u ( t ) ,
where x R 2 , t = 0 , 1 , and u R .
The cost functional on the feasible set of processes is:
I = x 1 2 ( 2 ) + x 2 2 ( 2 )
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Initial state vector x ( 0 ) = ( 2 ; 3 ) T ; control constraint: 1 u 1 . Number of stages: N = 2 . The exact value of the cost functional is min I = 5 .
The obtained approximate value of the functional was I * = 5 . 00000000000002 , with a runtime of 00.25 s. Table 7 shows the parameters of the method for solving this problem. Figure 9 shows the obtained control and corresponding trajectory; the numerical solution of the problem is presented in Table 8.
Examples 4 and 5 demonstrate the application of the proposed method to a class of linear-quadratic control problems.

3.6. Example 6. The Two-Dimensional Bolza Optimal Control Problem with an Exact Solution

The behavior of the control object is described by a system of difference equations:
x 1 ( t + 1 ) = x 2 ( t ) ; x 2 ( t + 1 ) = 2 x 2 ( t ) x 1 ( t ) + 1 N 2 u ( t ) ,
where x R 2 , t = 0 , 1 , , N 1 , and u R .
The cost functional on the feasible set of processes is:
I = x 1 ( N ) + 1 2 N t = 0 N 1 u 2 ( t )
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Analytical solution: u * ( t ) = N t 1 N ; min I = 1 3 + 3 N 1 6 N 2 + 1 2 N 3 t = 0 N 1 t 2 .
Initial state vector x ( 0 ) = ( 0 ; 0 ) T ; control constraint: 0 u 100 . Number of stages: N = 10 . The exact value of the cost functional is min I = 0.1425 .
The obtained approximate value of the functional was I * = 0.142499999999796 , with a runtime of 07.39 s. Table 9 shows the parameters of the method for solving this problem. Figure 10 and Figure 11 show the obtained control and corresponding trajectory.

3.7. Example 7. The Two-Dimensional Meyer Optimal Control Problem with an Exact Solution

The behavior of the control object is described by a system of difference equations:
x 1 ( t + 1 ) = x 1 ( t ) + 2 u ( t ) ; x 2 ( t + 1 ) = x 1 2 ( t ) + x 1 ( t ) + u 2 ( t ) ,
where x R 2 , t = 0 , 1 , and u R .
The cost functional on the feasible set of processes is: I = x 2 ( 2 ) .
The problem is to find the minimum value of the cost functional and optimal process ( x * ( ) , u * ( ) ) .
Initial state vector x ( 0 ) = ( 3 ; 0 ) T ; control constraint: 5 u 5 . Number of stages: N = 2 . The exact value of the cost functional is min I = 19 . Note that the discrete maximum principle does not hold for this example.
This problem has two solutions: x 1 1 * ( ) = { 3 ; 1 ; 9 } , x 2 1 * ( ) = { 0 ; 5 ; 19 } , u 1 * ( ) = { 2 ; 5 } and x 1 2 * ( ) = { 3 ; 1 ; 11 } , x 2 2 * ( ) = { 0 ; 5 ; 19 } , u 2 * ( ) = { 2 ; 5 } ; the algorithm finds each of them.
The obtained approximate values of the functional were I 1 * = 18 . 9999999999354 , with a runtime of 00.30, and I 2 * = 18 . 9999999999939 , with a runtime of 00.31 s. Table 10 shows the parameters of the method for solving this problem. Figure 12 and Figure 13 show the obtained control and corresponding trajectory (for the first and second solutions, correspondingly); the numerical solutions of the problem are presented in Table 11 and Table 12.

4. Conclusions

In this paper, we proposed a new bio-inspired metaheuristic optimization algorithm, Perch School Search, for finding the solution to open-loop control problems for discrete deterministic dynamical systems.
The proposed algorithm combines the ideas of several well-known metaheuristic algorithms, providing an appropriate-quality solution for many practical optimization tasks.
The efficiency was shown in Examples 1–7, where the values of the cost functional obtained via the algorithm were equal or close to the analytical solution or to the best known results.
The direction of further development of this work is the application of the proposed global optimization metaheuristic algorithm to the problems of optimal control of bundles of trajectories of nonlinear deterministic discrete dynamical systems, discrete stochastic systems, and systems of joint estimation and control.

Author Contributions

Conceptualization, A.V.P.; methodology, A.V.P. and A.A.K.; software, A.A.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Source code at https://github.com/AeraConscientia/N-dimensionalPerchOptimizer (accessed on 21 March 2022).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Luus, R. Iterative Dynamic Programming, 1st ed.; Chapman & Hall/CRC: London, UK, 2000. [Google Scholar]
  2. Yang, X.S.; Chien, S.F.; Ting, T.O. Bio-Inspired Computation and Optimization, 1st ed.; Morgan Kaufmann: New York, NY, USA, 2015. [Google Scholar]
  3. Goldberg, D. Genetic Algorithms in Search, Optimization and Machine Learning, 1st ed.; Addison-Wesley Publishing Company: Boston, MA, USA, 1989. [Google Scholar]
  4. Michalewicz, Z.; Fogel, D. How to Solve It: Modern Heuristics, 2nd ed.; Springer: New York, NY, USA, 2004. [Google Scholar]
  5. Davtyan, L.G.; Panteleev, A.V. Method of Parametric Optimization of Nonlinear Continuous Systems of Joint Estimation and Control. J. Comput. Syst. Sci. Int. 2019, 58, 360–373. [Google Scholar] [CrossRef]
  6. Panteleev, A.V.; Lobanov, A.V. Application of Mini-Batch Metaheuristic Algorithms in Problems of Optimization of Deterministic Systems with Incomplete Information about the State Vector. Algorithms 2021, 14, 332. [Google Scholar] [CrossRef]
  7. Panteleev, A.V.; Pis’mennaya, V.A. Application of a Memetic Algorithm for the Optimal Control of Bunches of Trajectories of Nonlinear Deterministic Systems with Incomplete Feedback. J. Comput. Syst. Sci. Int. 2018, 57, 25–36. [Google Scholar] [CrossRef]
  8. Beheshti, Z.; Shamsuddin, S.M. A review of population-based meta-heuristic algorithms. Int. J. Adv. Soft Comput. Appl. 2013, 5, 1–35. [Google Scholar]
  9. Brownlee, J. Clever Algorithms: Nature-Inspired Programming Recipes, 1st ed.; LuLu.com: Raleigh, NC, USA, 2011. [Google Scholar]
  10. Chambers, D.L. Practical Handbook of Genetic Algorithms, Applications, 2nd ed.; Chapman & Hall/CRC: London, UK, 2001. [Google Scholar]
  11. Floudas, C.; Pardalos, P. Encyclopedia of Optimization, 2nd ed.; Springer: New York, NY, USA, 2009. [Google Scholar]
  12. Floudas, C.A.; Pardalos, P.M.; Adjiman, C.S.; Esposito, W.R.; Gümüş, Z.H.; Harding, S.T.; Klepeis, J.L.; Meyer, C.A.; Schweiger, C.A. Handbook of Test Problems in Local and Global Optimization, 1st ed.; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1999. [Google Scholar]
  13. Gendreau, M. Handbook of Metaheuristics, 2nd ed.; Springer: New York, NY, USA, 2010. [Google Scholar]
  14. Glover, F.W.; Kochenberger, G.A. (Eds.) Handbook of Metaheuristics; Kluwer Academic Publishers: Boston, MA, USA, 2003. [Google Scholar]
  15. Neri, F.; Cotta, C.; Moscato, P. Handbook of Memetic Algorithms, 1st ed.; Springer: Berlin, Germany, 2012. [Google Scholar]
  16. Yang, X.S. Nature-Inspired Metaheuristic Algorithms, 2nd ed.; Luniver Press: Frome, UK, 2010. [Google Scholar]
  17. Sergeyev, Y.D.; Kvasov, D.E.; Mukhametzhanov, M.S. On the efficiency of nature-inspired metaheuristics in expensive global optimization with limited budget. Sci. Rep. 2018, 8, 453. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Sergeyev, Y.D.; Kvasov, D.E. Deterministic Global Optimization: An Introduction to the Diagonal Approach, 1st ed.; Springer: New York, NY, USA, 2017. [Google Scholar]
  19. Pinter, J.D. Global Optimization in Action (Continuous and Lipschitz Optimization: Algorithms, Implementations and Applications), 1st ed.; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1996. [Google Scholar]
  20. Dragoi, E.N.; Dafinescu, V. Review of Metaheuristics Inspired from the Animal Kingdom. Mathematics 2021, 9, 2335. [Google Scholar] [CrossRef]
  21. Wolpert, D.H.; Macready, W.G. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1997, 1, 67–82. [Google Scholar] [CrossRef] [Green Version]
  22. Schwenzer, M.; Ay, M.; Bergs, T.; Abel, D. Review on model predictive control: An engineering perspective. Int. J. Adv. Manuf. Technol. 2021, 117, 1327–1349. [Google Scholar] [CrossRef]
  23. Davendra, D.; Zelinka, I. Self-Organizing Migrating Algorithm. Methodology and Implementation. Studies in Computational Intelligence, 1st ed.; Springer: Cham, Switzerland, 2016; Volume 626. [Google Scholar]
  24. Yang, X.S.; Deb, S. Cuckoo search via Levy flights. In Proceedings of the World Congress on Nature and Biologically Inspired Computing, Coimbatore, India, 9–11 December 2009; pp. 210–214. [Google Scholar]
  25. Eusuff, M.M.; Lansey, K.E. Optimization of water distribution network design using the shuffled frog leaping algorithm. J. Water Resour. Plan. Manag. 2003, 129, 210–225. [Google Scholar] [CrossRef]
  26. Glover, F.; Marti, R.; Laguna, M. Fundamentals of scatter search and path relinking. Control. Cybern. 2000, 39, 653–684. [Google Scholar]
  27. Panteleev, A.; Stanovskaya, Y. Application of the Sparrow Colony Optimization Method in the Optimal Open Loop Control Problems for Discrete Deterministic Dynamical Systems. In SMART Automatics and Energy, 1st ed.; Solovev, D.B., Kyriakopoulos, G.L., Terziev, V., Eds.; Springer Singapore: Singapore, Singapore, 2022; Volume 272, pp. 599–609. [Google Scholar]
  28. Eusuff, M.; Lansey, K.; Pasha, F. Shuffled frog-leaping optimization algorithm: Memetic meta-heuristic for discrete optimization. Eng. Optimiz. 2007, 38, 129–154. [Google Scholar] [CrossRef]
  29. Yang, X.; Deb, S. Engineering optimization by cuckoo search. Int. J. Math. Model. Numer. Optim. 2010, 1, 330–343. [Google Scholar]
Figure 1. Hunting behavior of the river perch; (a) creation a population by the uniform distribution law on a feasible solution set, (b) separation of the whole population into schools, (c) realization of the perch boiler in each school.
Figure 1. Hunting behavior of the river perch; (a) creation a population by the uniform distribution law on a feasible solution set, (b) separation of the whole population into schools, (c) realization of the perch boiler in each school.
Algorithms 15 00157 g001
Figure 2. Obtained optimal control and corresponding optimal trajectory in Example 1.
Figure 2. Obtained optimal control and corresponding optimal trajectory in Example 1.
Algorithms 15 00157 g002
Figure 3. Obtained optimal control and corresponding optimal trajectory in Example 1.
Figure 3. Obtained optimal control and corresponding optimal trajectory in Example 1.
Algorithms 15 00157 g003
Figure 4. Obtained optimal control and corresponding optimal trajectory in Example 1.
Figure 4. Obtained optimal control and corresponding optimal trajectory in Example 1.
Algorithms 15 00157 g004
Figure 5. Obtained optimal control in Example 2.
Figure 5. Obtained optimal control in Example 2.
Algorithms 15 00157 g005
Figure 6. Obtained optimal trajectory in Example 2.
Figure 6. Obtained optimal trajectory in Example 2.
Algorithms 15 00157 g006
Figure 7. Obtained optimal control and corresponding optimal trajectory in Example 3.
Figure 7. Obtained optimal control and corresponding optimal trajectory in Example 3.
Algorithms 15 00157 g007
Figure 8. Obtained optimal control and corresponding optimal trajectory in Example 4.
Figure 8. Obtained optimal control and corresponding optimal trajectory in Example 4.
Algorithms 15 00157 g008
Figure 9. Obtained optimal control and corresponding optimal trajectory in Example 5.
Figure 9. Obtained optimal control and corresponding optimal trajectory in Example 5.
Algorithms 15 00157 g009
Figure 10. Obtained optimal control and corresponding optimal trajectory in Example 6.
Figure 10. Obtained optimal control and corresponding optimal trajectory in Example 6.
Algorithms 15 00157 g010
Figure 11. Control discrepancy at each moment of discrete time.
Figure 11. Control discrepancy at each moment of discrete time.
Algorithms 15 00157 g011
Figure 12. Obtained optimal control and corresponding optimal trajectories for the first solution to Example 7.
Figure 12. Obtained optimal control and corresponding optimal trajectories for the first solution to Example 7.
Algorithms 15 00157 g012
Figure 13. Obtained optimal control and corresponding optimal trajectories for the second solution to Example 7.
Figure 13. Obtained optimal control and corresponding optimal trajectories for the second solution to Example 7.
Algorithms 15 00157 g013
Table 1. Parameters of PSS for solving the problem in Example 1.
Table 1. Parameters of PSS for solving the problem in Example 1.
N S t e p M s I t e r max λ α P R max Δ p r
5065040001.40.2209
Table 2. Parameters of PSS for solving the problem in Example 1.
Table 2. Parameters of PSS for solving the problem in Example 1.
N S t e p M s I t e r max λ α P R max Δ p r
5062510001.40.2209
Table 3. Parameters of PSS for solving the problem in Example 2.
Table 3. Parameters of PSS for solving the problem in Example 2.
N S t e p M s I t e r max λ α P R max Δ p r
1003509001.40.3157
Table 4. Parameters of PSS for solving the problem in Example 3.
Table 4. Parameters of PSS for solving the problem in Example 3.
N S t e p M s I t e r max λ α P R max Δ p r
1004206530.6155
Table 5. Obtained optimal control and optimal trajectory (numerically).
Table 5. Obtained optimal control and optimal trajectory (numerically).
t u x
0 0.42716 15.00000
1 0.09897 0.31450
2 0.08238 0.28337
3 0.20099
Table 6. Parameters of PSS for solving the problem in Example 4.
Table 6. Parameters of PSS for solving the problem in Example 4.
N S t e p M s I t e r max λ α P R max Δ p r
100410011030.6155
Table 7. Parameters of PSS for solving the problem in Example 5.
Table 7. Parameters of PSS for solving the problem in Example 5.
N S t e p M s I t e r max λ α P R max Δ p r
1004151201.50.4105
Table 8. Obtained optimal control and optimal trajectory (numerically).
Table 8. Obtained optimal control and optimal trajectory (numerically).
t u x 1 x 2
0 1 2 3
1 1 3 1
2 1 2
Table 9. Parameters of PSS for solving the problem in Example 6.
Table 9. Parameters of PSS for solving the problem in Example 6.
N S t e p M s I t e r max λ α P R max Δ p r
1003307501.70.6105
Table 10. Parameters of PSS for solving the problem in Example 7.
Table 10. Parameters of PSS for solving the problem in Example 7.
N S t e p M s I t e r max λ α P R max Δ p r
100620751.30.2105
Table 11. Obtained optimal control and optimal trajectories for the first solution (numerically).
Table 11. Obtained optimal control and optimal trajectories for the first solution (numerically).
t u x 1 x 2
0 2 3 0
1 5 0.99999 5.00002
2 9.00001 19
Table 12. Obtained optimal control and optimal trajectories for the second solution (numerically).
Table 12. Obtained optimal control and optimal trajectories for the second solution (numerically).
t u x 1 x 2
0 2 3 0
1 5 1 5
2 11 19
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Panteleev, A.V.; Kolessa, A.A. Optimal Open-Loop Control of Discrete Deterministic Systems by Application of the Perch School Metaheuristic Optimization Algorithm. Algorithms 2022, 15, 157. https://doi.org/10.3390/a15050157

AMA Style

Panteleev AV, Kolessa AA. Optimal Open-Loop Control of Discrete Deterministic Systems by Application of the Perch School Metaheuristic Optimization Algorithm. Algorithms. 2022; 15(5):157. https://doi.org/10.3390/a15050157

Chicago/Turabian Style

Panteleev, Andrei V., and Anna A. Kolessa. 2022. "Optimal Open-Loop Control of Discrete Deterministic Systems by Application of the Perch School Metaheuristic Optimization Algorithm" Algorithms 15, no. 5: 157. https://doi.org/10.3390/a15050157

APA Style

Panteleev, A. V., & Kolessa, A. A. (2022). Optimal Open-Loop Control of Discrete Deterministic Systems by Application of the Perch School Metaheuristic Optimization Algorithm. Algorithms, 15(5), 157. https://doi.org/10.3390/a15050157

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop