1. Introduction
Problems surrounding the lack of control resources (when the dimensions of a control vector are less than the dimensions of the state space of a physically-controlled system) have widespread applications in practice. Oscillatory systems, such as electric circuits [
1], mechanical systems [
2,
3], quantum oscillators [
4,
5], systems involving physical nature [
6,
7], and controlled with one external force, are instances of such systems. Using the oscillatory system, it is possible to investigate periodic solutions and the stability of the model of two coupled oscillators in the optics of chiral molecules [
6], as well as consider the behaviors of biologically complex mechanisms [
7]. Increasing the number of oscillators in these models to sufficiently large
N makes it possible to study (analytically and numerically) the macro-parameters and properties of similar systems [
8,
9].
Thus, the Timoshenko–Ehrenfest beam motion control, which is modeled with a system of
N oscillators that perform forced oscillations under the actions of scalar and limited control forces, was considered in work [
2]. The time-optimal problem for a platform with attached oscillators was investigated in [
3]. The maximum variation of the oscillation energy of the oscillator system for a given time was examined in [
10]. Due to the difficulties in finding an analytical solution for the control problems, it is of interest to asymptotically synthesize the optimal control of a system of an arbitrary number of linear oscillators (with general limited control), as obtained in [
11].
Optimal control problems were investigated using the Pontryagin maximum principle, which was formulated during the seminars on the theory of oscillations and automatic control by L.S. Pontryagin and M.A. Aizerman in the 1950s, where the problem of optimal performance was considered [
12]. For systems considered in [
1,
5], it is often necessary for one of the subsystems to get into the required position as quickly as possible, while the others should remain at rest at the terminal moment. Thus, for a single oscillator, V.G. Boltyansky obtained the synthesis of optimal control in the time-optimal problem [
13]. The optimal control problems for the systems of many oscillators with various criteria have already been studied by F.L. Chernousko in [
14], where a significant complexity of obtaining an analytical solution was noted, and the problem involving two oscillators was solved for a very special case of their frequency ratio.
In this paper, we review the time-optimal control problem for two non-synchronous oscillators controlled by limited scalar impacts with initial zero-phase coordinates but different terminal ones, which are features of the problem statement that considerably complicate it.
One of the key aspects of such control problems is the study of the accessibility and controllability of the oscillatory system [
15]. It can be shown with the methods of geometric control theory [
16], namely: the Sussman–Djurdjevic and the Poincare theorems, as well as with the results of the classical optimal control theory, the La Salle–Conti theorem [
17].
This article is structured as follows. In
Section 2, the optimal control problem is formulated. In
Section 3, the optimal control law is presented using the Pontryagin maximum principle.
Section 4 examines the necessary and sufficient optimality conditions for different classes of control switchings.
Section 5 discusses the continuity of the problem criterion on the control parameter.
Section 6 illustrates the obtained results, in particular, the mathematical modeling of the set of reachability of the first oscillator, the criterion of the problem, and the time intervals of the switching controls. In the conclusion, we suggest directions for further research.
2. Optimal Control Problem Statement
A system consisting of two non-synchronous oscillators, i.e., having different natural frequencies
, was considered. The control
impacts the momenta
of both oscillators, but not their coordinates
. The motion of this linear control system is described with the following system of equations:
The system (
1) can be rewritten in matrix form
, where
The number of non-zero components of the control vector
B is less than the dimensionality of the system; moreover, the control is limited by a value
, which is a parameter of the problem:
At the initial moment, the phase states of both oscillators coincide with the origins of the phase coordinate planes, i.e., their coordinates and momenta are equal to zero. At the terminal moment, the phase coordinates of the second oscillator become zero again, while the first oscillator moves to the state
, which are also parameters of the problem. The explicit forms of the initial and terminal conditions are
where
is a terminal moment and
T defines a transpose operation.
The total time of the system’s movement from the initial state to the terminal one is chosen as the criterion of the problem and needs to be minimized:
The proofs of global controllability and strong accessibility of the system (
1) with the condition (
2) are given in [
15,
18].
3. Solution of the Optimal Control Problem
To investigate and solve problems (
1)–(
5), the Pontryagin maximum principle (PMP) is used. The maximum principles in the canonical coordinates consist of the following statements:
The Hamiltonian for the optimal control problem is expressed as
where
are adjoint variables and
is constant.
A Hamiltonian system, including motion equations and an adjoint system of equations:
Transversality conditions are absent because the optimal problem statement involves fixed endpoints.
The Hamiltonian (
8) is linear with respect to
and, thus, the optimal control is given by the bang–bang control:
for
and
for
, which can be written as
where
is a switching function. The adjoint system can be rewritten in the following form
From the system (
11)
and
can be found:
where
is a constant vector. Combining the expressions (
9) and (
12), an explicit form of the optimal control law can be obtained
This function is uniquely defined and cannot be equal to zero on the whole interval, perhaps with the exception of the isolated points, which lead to the absence of special control modes.
The relay form of the optimal control obtained in this Section allows explicitly writing solutions of the motion Equation (
1) taking into account how many times the control (
13) will change the sign and obtain the necessary extremum conditions for any number of such switchings.
4. The Necessary Extremum Conditions
For the relay control (
13), two values are defined: the moment of switching
,
, when the control changes signs and the duration of control intervals
. Here, a further designation
, which means “
”, is used for convenience. The optimal control
with
switchings and
K control intervals is illustrated in
Figure 1:
Definition 1. The term “-switching control class” defines an optimal solution where the optimal control has exactly switchings.
The solution to the system of motion Equation (
1) with boundary conditions (
3) and (
4) has a general form
The structure of optimal control (
13) allows to write a solution of the system that takes into account not only the number of switchings
, but also the sign of the control at the first interval.
where an additional parameter
k equal to 0 or 1 corresponds to the initial control
and
, respectively.
Remark 1. A system (
15)
consisting of four equations can be used to study the three-switching control class, where four control intervals need to be determined. The two-switching control class occurs as degeneration of other classes. For example, when the outermost interval is zeroed in on a three-switching control class, or the internal interval is zeroed in on a four-switching control class.
Lemma 1. In the two-switching control class, the functional dependencies for the duration of the control intervals , , are valid To investigate control classes with more than three switchings, the necessary extremum conditions are introduced.
Theorem 1 (Necessary Extremum Conditions)
. Any solution to the problem (1)–(5) in the class of piecewise constant controls (13) satisfies the system (15) and equalities: Proof. The switching function (
10) that determines the optimal control (
13) becomes zero at moments
, when control switchings occur:
For a more compact notation, Equation (
17) can be rewritten in matrix form
where
Equation (
18) form the system consisting of
equations
The condition of the existence of the non-zero vector
C in (
19) is equivalent to
equalities
In other words, (
20) are the non-degeneracy conditions of vector
C, defining the optimal control (
13). □
Remark 2. In the case when , the condition of the existence of non-zero vector C has the form which is equivalent to the following equation If , there are (number of permutations) of the non-zero conditions for the vector C Then, in order to find , it is required to use (15) together with equations from the set (
23),
so that all the moments of are contained there. A set that satisfies these conditions is (
20),
for example. Remark 3. Theorem 1 allows finding all of the required switching moments that define vector C as the kernel of the corresponding linear mapping in (19). This implies another necessary condition of the extremum, which is formulated in the form of the following lemma.
Lemma 2. The optimal control with vector C obtained from (19) contains exactly zeros and, according to Definition 1, corresponds to the -switching control class. In the next section, a theorem considering the continuity of the problem criterion from the parameter is formulated and proved.
5. Continuity of
To find sufficient conditions for the continuity of a function
, the implicit function theorem is applied. Let us write out the Jacobian of a system consisting of equations of motion (
15) and additional conditions (
16) obtained by Theorem 1 on the necessary extremum conditions.
The equations of the joint system are denoted by
, where
relate to the equations of motion (
15), and
are responsible for the non-degeneracy conditions (
20). The Jacobian
corresponds to the
-switching control class and consists of partial derivatives of
K functions with
K variables.
Let us transform the above determinant, leading the Jacobian to a simpler form. The determinant of the system will not change if the previous column is subtracted from each column, except for the first.
After multiplying the first and third rows of the resulting matrix by −1 and then replacing the sign for each even column, the transformed Jacobian has the following form
The Jacobian, according to Equation (
25), for the three-switching control class is written as follows
The expression (
26) has an explicit form
Expressions (
21) and (
26) can be compared, or rather their explicit forms (
22) and (
27), respectively. It turns out that with an index shift by one at
, the functional parts of these expressions are equal. Expression (
26) includes four control intervals in the three-switching class and Expression (
21) includes five intervals in the four-switching class, where the first control interval becomes zero at the boundary value
, and that is how the transition into the three-switching class occurs.
This result is derived from several interrelated observations. The first four lines of the Jacobian (
25), ordered by cos, sin, and
,
, which obviously do not change the sign of the determinants, can be written as
and then converted to the form
The matrix
includes the matrix
where the notation
is introduced. For the three-switching control class
, the equality is fulfilled
The last equality follows from the next lemma.
Lemma 3. If a square matrix A of size is composed of M blocks of matrices of size , such that , and a square matrix B of size is made up of M blocks of the product of matrices , where are square matrices of size , then Proof. Matrix
B can be written as the product of matrices
, where matrix
A has the form
and the auxiliary matrix
containing
has the form
Due to the multiplicativity of the determinant and the properties of the block–diagonal matrix, the determinant of matrix
B has the form
□
According to Lemma 3, the corresponding matrices for the three-switching control class are equal
Moreover,
, so the equality (
31) is established.
One can see that for the three-switching control class, the condition
is possible only if the necessary conditions of the extremum (
23) and the equality
for for the four-switching control class are fulfilled simultaneously.
The Jacobian
(
25) can be written via the determinants of the square matrices of size
, composed of
of the matrix
.
The expression (
35) is a decomposition of (
25) by the last
rows, while the first four rows are divided into
corresponding minors.
For further narration, it is necessary to write down the implicit function theorem [
19] for the case of the following system of equations.
which is solved with respect to the variables
. The following system of functional relations (locally equivalent to system (
36)) is studied.
The following designations are introduced:
System (
36) is denoted with
and the mapping (
37) – with
.
A square matrix is reversible only when its determinant is non-zero.
Theorem 2 (The implicit function theorem as in [
19])
. If a mapping , defined in the neighborhood U of a point is such that the following requirements are met,
− reversible matrix,
then there exists a -dimensional interval where and a mapping , such that for any point the following condition holds Now, it is possible to prove the continuity, not only of but also of the criterion by the implicit function theorem applied to our problem.
Lemma 4. For continuity of the functions for -switching control class, the following condition is required Proof. The implicit function theorem 2 is applied for the considered problem. Let
, then
where, as before,
refer to the equations of the systems (
15) and (
20). For the sake of the reader’s comprehension, the arguments of the functions are omitted.
The first condition of Theorem 2 is satisfied if the parameter
is non-zero. For the solution of the optimal control problem, taking into account the term of Theorem 1, the second condition of Theorem 2 is fulfilled. The reversibility of
requires the condition
which is equivalent to
. All conditions of the implicit function theorem are satisfied, so there is a
-dimensional interval
and continuous mapping
f on this interval.
Consequently, the durations of the control intervals are continuous functions of . Since , then is a continuous function of too. □
Lemma 5 (Sufficient conditions of continuity of
)
. is continuous if the non-zero conditions of (23) are met in the -switching class. Proof. We investigate the Jacobian
, which we will rewrite using the formula (
35) in a form
where the total sum is divided into two parts. The set of indexes
always contains a null time element that corresponds to
.
corresponds to all other sets without a null time element. The fulfillment of the conditions (
23) leads to zeroing of the second sum in (
41), due to the fact that each summand is zero. Then, it holds
Due to the linear dependence of any set consisting of four different
by the condition (
23), any vector
except
can be written as a linear combination of
. Then (
42) can be expressed as follows
where
combines both the coefficients
and the coefficients of expansion
by
.
Further, two cases are possible, namely , when is continuous by Lemma 4, or . In this case, if , then are linearly dependent. Moreover, if , then due to and the presence in the first column of only the first and third elements other than zero, it follows that the first and third rows match. Which again entails a linear dependence of . This means that the necessary conditions of the extremum include corresponding to and increasing the number of control switchings by one, which changes the total number of control switchings from to K. Thus, the conditions of the lemma for the number of control switchings are violated, which means that . By Lemma 4, the function in the -switching control class is continuous. □
Corollary 1. In a given control switching class, is a right-continuous function.
Remark 4. It can be concluded from entry (
25)
that due to the absence of in the explicit form , the sum of in (
41)
always equals zero. Corollary 2. Fulfillment of Equation (
23)
in the -switching control class under the condition leads to . Lemma 6. is a continuous function for .
Proof. Let two adjacent control classes consist of P and Q switchings, respectively. Indeed, due to Lemma 5 and Corollary 1 in a given class of control switchings, is continuous. Let us denote the value of the criterion in each of the classes with and at , where the upper index indicates the number of control switchings, and the lower index indicates the optimal value of control switchings. Next, for , let us denote the value of the criterion in the P-switching control class with their optimal number—Q for , and vice versa , the value of the criterion in the Q-switching control class with their optimal number is P and . Then obviously the inequalities are fulfilled, and . Again, let us use Lemma 5 and Corollary 1. So and are continuous on . Then . Thus, taking into account the above inequalities, it follows that . The statement of the lemma follows from the union of all intervals of control switchings. □
Continuity of
means that
is continuous in the phase plane of the first oscillator on terminal positions
in each direction
. Then, applying the equations of characteristics (
7) according to [
20], where
exist, we obtain
The adjoint variables are bounded, including at , so the variation of the criterion will be quite small under small variations of the terminal positions .
6. Modeling
To illustrate the aforesaid theoretical results, numerical modeling for various configurations of the (
1)–(
5) problem were presented. All numerical experiments were conducted via Python scripts, which were developed specifically for this problem. In particular, the SciPy library was used to find solutions to the systems of nonlinear equations, and Matplotlib was used to visualize the obtained results.
Firstly, in
Section 6.1, the motion of the system to state
(with optimal control
(
13) for control classes with different switchings) was described. In these examples, the dependencies of the optimal number of control switchings, the criterion of the problem
, and others on parameter
, were investigated. In
Section 6.2, a range of terminal states of the first oscillator was considered, but now with a fixed value of
. In this case, all the values were considered to be functions of the parameters
.
6.1. Fixed Terminal Position of the First Oscillator
The problem (
1)–(
5) is considered with the following parameters chosen without loss of generality:
Control classes with three, four, and five switchings
are considered. For a range of values
in each control class, the solutions that satisfied Theorem 1 and minimized the criterion (
5) were found.
An important step is checking that the resulting system solutions form, according to Remark 3, the control law with the number of switchings that are initially set. In other words, Lemma 2 must be fulfilled. For example, an attempt to find a solution for a three-switching control class for
results in switching function
containing more than three switchings that do not satisfy Lemma 2.
Figure 2 shows such an example for the parameter
. The points obtained by Theorem 1 are highlighted in red. An extra zero of function
is marked with a red cross. The constant vector
, calculated according to Remark 3, defines the switching function
, which is pictured with a blue line.
Lemma 2 is not satisfied in this case.
However, Lemma 2 will be fulfilled for
, as illustrated in
Figure 3. Moreover, the optimal control law
defined by
is shown.
Vector
C in this case:
The phase portraits of the first and second oscillators satisfy the given boundary conditions (
44) and are presented in
Figure 4.
The dependence of the criterion
on the constraint
is shown in
Figure 5. The blue, red, and black curves show solutions for three-, four-, and five-switching control classes, which are defined by the boundaries of each control class.
Figure 5 shows that
is a continuous function according to Lemma 6.
Figure 6 illustrates the dependencies of the control interval durations from
. The implicit function theorem shows that the durations of control intervals are continuous on
inside each class, which can be observed in the graph. In area I, which corresponds to the five-switching class, the duration of the first control interval becomes zero when the boundary of the selected class
is reached. Region II corresponds to the four-switching class, where the second interval of I becomes the first interval, and it also becomes zero when it reaches the boundary
. The third interval from region I becomes the second interval in II and the first interval in the three-switching region (III). If one of the internal intervals of the four-switching class is nullified, then the transition to the two-switching class takes place.
Figure 7 shows the dependencies of the Jacobian for different switching control classes on
.
As one can see from the graph, when the class boundaries are reached, the Jacobians become equal to zero. Moreover, the Jacobian of the four-switching class is zero when passing to the two-switching class at one point. Lemma 1 is valid for the two-switching control class. Two-, three-, and four-switching control classes for different finite states of the first oscillator () and are presented next. As before, it was checked that the obtained solutions for each final state form, according to Remark 3, the control law with the appropriate number of switchings.
6.2. The Phase Plane of the First Oscillator for Different Control Classes
Different control classes are shown in
Figure 8. The red and green areas correspond to the three- and four-switching control classes with initial controls
, respectively. The reflections of these regions relative to the origins lead to blue and orange regions for the three- and four-switching control classes on the initial intervals
, respectively. The three-switching control classes are separated from each other by parametric curves obtained by Lemma 1. This is due to the fact that nullifying the lateral control interval in the three-switching control class also leads to the two-switching control class (orange and red curves). Zeroing the internal control interval in the ’four-switchings’ results in curves (blue and green) corresponding to the two-switching control classes at different initial controls. The yellow and gray boundary curves were obtained by moving from the four-switching class to the three-switching class by zeroing out the last control interval, with the control at the initial interval coinciding with the initial controls in the regions on either side of the boundary. The white and purple curves correspond to the zeroing of the first control interval in the four-switching class, which means the transition to the three-switching. The simultaneous nullifying of the side interval of the control constancy in the four-switching class corresponds to the blue points where the curves obtained under the conditions
or
intersect.
Figure 9 shows the dependence
of the criterion on the phase coordinates of the first oscillator. Three- and four-switching control classes are highlighted with red and green points. The colors of all curves correspond to the description in
Figure 8.