Steering a Swarm of Large-Scale Underactuated Mechanical Systems Using a Generalized Coordinates Transformation

Salamat, Babak; Elsbacher, Gerhard

doi:10.3390/aerospace9110702


by Babak Salamat ^* and Gerhard Elsbacher

AI Aided Aeronautical Engineering and Product Development, AImotion, Technische Hochschule Ingolstadt, 85049 Ingolstadt, Germany

^* Author to whom correspondence should be addressed.

Aerospace 2022, 9(11), 702; https://doi.org/10.3390/aerospace9110702

Submission received: 9 September 2022 / Revised: 1 November 2022 / Accepted: 7 November 2022 / Published: 9 November 2022


Abstract:

Steering large-scale particle or robot systems is challenging because of their high dimensionality. We use a centralized stochastic approach that allows for optimal control at the cost of a central element instead of a decentralized approach. Previous works are often restricted to the assumption of fully actuated robots. Here we propose an approach for underactuated robots that allows for energy-efficient control of the robot system. We consider a simple task of gathering the robots (minimizing positional variance) and steering them towards a goal point within a bounded area without obstacles. We make two main contributions. First, we present a generalized coordinate transformation for underactuated robots, whose physical properties should be considered. We choose Euler-Lagrange systems that describe a large class of robot systems. Second, we propose an optimal control mechanism with the prime objective of energy efficiency. We show the feasibility of our approach in robot simulations.

Keywords:

aerospace; generalized multi-coordinates transformation; network control systems; underactuated Euler–Lagrange systems

1. Introduction

Steering large-scale multi-agent particle systems is challenging because such distributed systems of loosely coupled robots have many degrees of freedom [1]. The published approaches on this subject can roughly be separated into two complementary classes: (A) centralized approaches that assume complete information and focus on precision and efficiency [2] and (B) decentralized approaches that assume only partial observability and focus on simple reactive and behavior-based control [3]. While both concepts are generally justified, the centralized approach may be almost unavoidable for certain tasks. Here, we investigate a control problem in robot swarms with minimal hardware [4,5]. For a simple robot, such as the Kilobot with its minimal sensor equipment [6], certain tasks may be infeasible with a decentralized approach. The advantage of simple hardware is, in turn, that many robots can be built to form a large-scale formation with high redundancy. The automatic control problem can be thought of as macroscopic or stochastic control of a cloud of robots described by a distribution [4,7].

The global input can be, for example, the mean position of all robots and their variance. The output is a global control law that is broadcast to all robots or that acts as a moment of force on each robot. The variance may be calculated from the robot positions [5], a requirement that could be relaxed in a different approach. One option is to exploit the environment by gathering robots at flat obstacles until minimum variance is achieved [8]. The control alternates between measuring robot positions and possibly longer periods without measurements, during which it relies on the dynamical model of each robot with added Brownian noise on positions, velocities, and accelerations. This is generally related to mean-field models of multi-robot systems [9]. Specifically, the concept of microscopically Brownian particles and the resulting macroscopic evolution of a swarm described by a distribution relates directly to known modeling approaches in swarm robotics based on Langevin and Fokker-Planck equations [10].

We propose an optimal energy-efficient control mechanism that minimizes positional variance and steers the robot system’s mean position to a target position. In particular, our work starts by showing that it is possible to obtain mathematically a mapping such that underactuated robot systems take a partial form. Without such a mapping, the complexity of the dynamics (coupling through the inertia matrix) makes a direct controller design difficult. Another challenge is the fact that the control input matrix

G (q)

is time-variant. However, in [3,11,12], the authors assumed the input matrix to be in the form of

G = {[I_{m} 0_{s}]}^{⊤}

. This assumption on the input matrix G can be applied only to simple robot structures. In this paper, we cover the case where

G (q) = {[G_{u} (q) G_{a} (q)]}^{⊤}

has a general form. Therefore, we relax this assumption on the input matrix G, in contrast to [3,11,12]. Finding a transformation that brings the robot systems into a partial form is not straightforward. Nonetheless, we propose a new generalized coordinate transformation framework to decouple the system. This allows the development of an optimal control mechanism with the prime objective of energy efficiency. In control theory, several techniques exist to design energy-efficient control laws [13]. Among them, the state-dependent Riccati equation (SDRE) approach [14] does not cancel nonlinear terms, which is advantageous because canceling such nonlinearities would significantly increase the control signals [15]. Furthermore, SDRE brings the system into a state-dependent coefficient (SDC) form that is useful for immediate stability analysis. We then show that our control design provides set point tracking (stabilization) with semi-global properties. Our proof is based on the Lyapunov stability criterion [16].

Recently in [2], the author introduced a centralized control of underactuated nonidentical Euler–Lagrange systems. The methodology is valid based on the assumption of the accessibility of the information of the whole network. In this paper, we develop a novel centralized control of underactuated large-scale multi-agent systems using only the mean position of all agents and their variance. The experiment highlights three advantages: (i) It resolves the limitation of the existing control strategy by introducing a novel two-step methodology to control the swarm, (ii) it increases the performance by exploiting the torque generated for the orientation of particles and providing smoother trajectories, and (iii) it proposes a base performance comparison with the actuated holonomic swarm of particles [5,17].

2. Euler-Lagrange Dynamics

We consider an underactuated robot system with dynamics described by the well-known Euler-Lagrange (EL) equations of motion

\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + \nabla V (q) = G (q) u, \end{matrix}

(1)

where

q \in R^{n}

are the configuration variables,

u \in R^{m}

are the control signals,

M (q) > 0

is the generalized inertia matrix,

C (q, \dot{q})

represents the Coriolis and centrifugal forces,

V (q)

is the system’s potential energy, and

G (q)

is the input matrix. First, we make an assumption characterizing the class of generalized coordinate transformation T that we use here.

Assumption 1.

There exists an invertible mapping

Φ : R^{n} \to R^{n}

, such that its Jacobian

\nabla_{q} Φ (q) = T^{- 1} (q)

(2)

is invertible for all q.

Lemma 1.

Consider a mapping

Φ : R^{n} \to R^{n}

that satisfies Assumption 1 and define the generalized coordinate transformation as follows

\tilde{q} = Φ (q) .

(3)

Then, the EL dynamics (1) can be written in the new coordinates as follows

\begin{matrix} \tilde{M} (\tilde{q}) \ddot{\tilde{q}} + \tilde{C} (\tilde{q}, \dot{\tilde{q}}) \dot{\tilde{q}} + \nabla \tilde{V} (\tilde{q}) = \tilde{G} (\tilde{q}) u, \end{matrix}

(4)

where

\begin{matrix} \dot{\tilde{q}} & : = & T^{- 1} (q) \dot{q} \end{matrix}

(5)

\begin{matrix} \tilde{M} (\tilde{q}) & : = & T^{⊤} (q) M (q) T (q) |_{q = Φ^{- 1} (\tilde{q})} \end{matrix}

(6)

\begin{matrix} \tilde{V} (\tilde{q}) & : = & V (q) |_{q = Φ^{- 1} (\tilde{q})} \end{matrix}

(7)

\begin{matrix} \tilde{G} (\tilde{q}) & : = & T^{⊤} (q) G (q) |_{q = Φ^{- 1} (\tilde{q})} \end{matrix}

(8)

and

\tilde{C} (\tilde{q}, \dot{\tilde{q}}) \dot{\tilde{q}}

are the Coriolis and centrifugal forces associated with the mass matrix

\tilde{M} (\tilde{q})

that we can compute by

\tilde{C} (\tilde{q}, \dot{\tilde{q}}) \dot{\tilde{q}} = [\nabla_{\tilde{q}} [\tilde{M} (\tilde{q}) \dot{\tilde{q}}] - \frac{1}{2} \nabla_{\tilde{q}}^{⊤} [\tilde{M} (\tilde{q}) \dot{\tilde{q}}]] \dot{\tilde{q}} .

(9)

The Lagrangian in the new generalized coordinates is

\tilde{L} (\tilde{q}, \dot{\tilde{q}}) = \frac{1}{2} {\dot{\tilde{q}}}^{⊤} \tilde{M} (\tilde{q}) \dot{\tilde{q}} - \tilde{V} (\tilde{q}) .

(10)

Proof.

The proof follows from the coordinate invariance property of the EL equations (or from straightforward calculation computing the derivative of the coordinate transformation and using the original dynamics). □

Remark 1.

Notice that the matrix

T (.)

can be used to shape the form of the mass matrix

M (.)

in the new generalized coordinates. However, we only consider invertible matrices

T (.)

that satisfy the integrability Assumption 1. That is, given an invertible matrix

T (.)

, we assume there exists an invertible mapping

Φ : R^{n} \to R^{n}

that satisfies

\dot{Φ} (q) = T^{- 1} (q) \dot{q} .

Therefore, the generalized coordinate transformation (3) is well-defined.

We consider now mechanical systems (1) with an input matrix of the general form

\begin{matrix} G (q) = [\begin{matrix} G_{u} (q) \\ G_{a} (q) \end{matrix}], \end{matrix}

(11)

where rank

G (q) = m < n

, and

G_{a} (q)

is an invertible

m \times m

matrix.

G_{u} (q)

and

G_{a} (q)

are the underactuated and actuated components of

G (q)

, respectively. The EL dynamics (1) is coupled when

G_{u} (q) ≢ 0

. Furthermore, to simplify the notation, we partition the generalized coordinates and velocity as

q = col (q_{u}, q_{a})

,

\dot{q} = col ({\dot{q}}_{u}, {\dot{q}}_{a})

with

q_{a}, {\dot{q}}_{a} \in R^{m}

and

q_{u}, {\dot{q}}_{u} \in R^{s}

, and partition the inertia and Coriolis matrices as

\begin{matrix} M (q) = [\begin{matrix} m_{u u} (q) & m_{a u}^{⊤} (q) \\ m_{a u} (q) & m_{a a} (q) \end{matrix}], \end{matrix}

\begin{matrix} C (q, \dot{q}) = [\begin{matrix} c_{u u} (q) & c_{u a} (q) \\ c_{a u} (q) & c_{a a} (q) \end{matrix}], \end{matrix}

where

m_{a a} : R^{n} \to R^{m \times m}

,

m_{a u} : R^{n} \to R^{m \times s}

,

m_{u u} : R^{n} \to R^{s \times s}

,

c_{a a} : R^{n} \times R^{n} \to R^{m \times m}

,

c_{a u} : R^{n} \times R^{n} \to R^{m \times s}

,

c_{u a} : R^{n} \times R^{n} \to R^{s \times m}

,

c_{u u} : R^{n} \times R^{n} \to R^{s \times s}

. Next, we impose several assumptions to show particular forms of the EL dynamics (1) under generalized coordinate transformations.

Assumption 2.

There exists a function

Φ_{a} : R^{m} \to R^{s}

, such that

{\dot{Φ}}_{a} (q_{a}) = m_{u u}^{- 1} m_{a u}^{⊤} {\dot{q}}_{a} .

(12)

Assumption 3.

The inertia matrix depends only on the actuated variables

q_{a}

, i.e.,

M (q) = M (q_{a})

.

Assumption 4.

The sub-block matrix

m_{u u}

of the inertia matrix is constant.

Assumption 5.

The potential energy can be written as

V (q) = V_{a} (q_{a}) + V_{u} (q_{u}) .

Proposition 1.

The dynamics of the system (1), under Assumption 2 and using the generalised coordinates

\tilde{q} = col (q_{1}, q_{2}) = Φ (q)

, can be written as follows

\begin{array}{l} m_{u u} {\ddot{q}}_{1} + [\nabla_{q_{1}} (m_{u u} {\dot{q}}_{1}) - \frac{1}{2} \nabla_{q_{1}}^{⊤} (m_{u u} {\dot{q}}_{1})] {\dot{q}}_{1} \\ + [\nabla_{q_{2}} (m_{u u} {\dot{q}}_{1}) - \frac{1}{2} \nabla_{q_{1}}^{⊤} (m_{a a}^{s} {\dot{q}}_{2})] {\dot{q}}_{2} + \nabla_{q_{1}} V (q) = G_{u} (q) u, \end{array}

(13)

\begin{array}{l} m_{a a}^{s} {\ddot{q}}_{2} + [\nabla_{q_{1}} (m_{a a}^{s} {\dot{q}}_{2}) - \frac{1}{2} \nabla_{q_{2}}^{⊤} (m_{u u} {\dot{q}}_{1})] {\dot{q}}_{1} \\ + [\nabla_{q_{2}} (m_{a a}^{s} {\dot{q}}_{2}) - \frac{1}{2} \nabla_{q_{2}}^{⊤} (m_{a a}^{s} {\dot{q}}_{2})] {\dot{q}}_{2} + \nabla_{q_{2}} V (q) = [G_{a} (q) - m_{a u} m_{u u}^{- 1} G_{u} (q)] u, \end{array}

(14)

where

\begin{matrix} [\begin{matrix} q_{1} \\ q_{2} \end{matrix}] & = & [\begin{matrix} q_{u} + Φ_{a} (q_{a}) \\ q_{a} \end{matrix}] \end{matrix}

(15)

\begin{matrix} m_{a a}^{s} (q) & = & m_{a a} (q) - m_{a u} (q) m_{u u}^{- 1} (q) m_{a u}^{⊤} (q) |_{q = Φ^{- 1} (q)}, \end{matrix}

(16)

\begin{matrix} m_{u u} (q) & = & m_{u u} (q) |_{q = Φ^{- 1} (q)}, \end{matrix}

(17)

\begin{matrix} m_{a u} (q) & = & m_{a u} (q) |_{q = Φ^{- 1} (q)} . \end{matrix}

(18)

Proof.

First notice that, under Assumption 2, the coordinate transformation (15) satisfies Assumption 1 with

T (q) = [\begin{matrix} I_{s} & - m_{u u}^{- 1} m_{a u}^{⊤} \\ 0_{m \times s} & I_{m} \end{matrix}] .

(19)

Then, from Lemma 1 we obtain that the dynamics can be written in the form (4) with

\begin{matrix} [\begin{matrix} {\dot{q}}_{1} \\ {\dot{q}}_{2} \end{matrix}] = [\begin{matrix} I_{s} & m_{u u}^{- 1} m_{a u}^{⊤} \\ 0_{m \times s} & I_{m} \end{matrix}] [\begin{matrix} {\dot{q}}_{u} \\ {\dot{q}}_{a} \end{matrix}] \end{matrix}

(20)

and Lagrangian

L (q, \dot{q}) = \frac{1}{2} [\begin{matrix} {\dot{q}}_{1}^{⊤} & {\dot{q}}_{2}^{⊤} \end{matrix}] [\begin{matrix} m_{u u} & 0_{s \times m} \\ 0_{m \times s} & m_{a a}^{s} \end{matrix}] [\begin{matrix} {\dot{q}}_{1} \\ {\dot{q}}_{2} \end{matrix}] - V (q) .

(21)

The dynamics (13) and (14) follow, after some simple calculations, from the EL formula using the Lagrangian (21). □
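The block-diagonal inertia in the Lagrangian (21) can be checked by hand. The following worked step is a sketch that multiplies out the product in (6) with T from (19), using only the symmetry of the inertia matrix (so that the inverse of m_{uu} is symmetric as well):

```latex
T^{\top} M T
= \begin{bmatrix} I_{s} & 0 \\ -m_{au} m_{uu}^{-1} & I_{m} \end{bmatrix}
  \begin{bmatrix} m_{uu} & m_{au}^{\top} \\ m_{au} & m_{aa} \end{bmatrix}
  \begin{bmatrix} I_{s} & -m_{uu}^{-1} m_{au}^{\top} \\ 0 & I_{m} \end{bmatrix}
= \begin{bmatrix} m_{uu} & 0 \\ 0 & m_{aa} - m_{au} m_{uu}^{-1} m_{au}^{\top} \end{bmatrix}
```

The lower-right block is the Schur complement defined in (16), and the vanishing off-diagonal blocks are what removes the inertial coupling between the two coordinate groups.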

Corollary 1.

The system (1) satisfying Assumptions 1–4 can be written in the EL form as follows

\begin{matrix} m_{u u} {\ddot{q}}_{1} + \nabla_{q_{1}} V (q_{1}, q_{a}) = G_{u} (q) u, \end{matrix}

(22)

\begin{matrix} m_{a a}^{s} {\ddot{q}}_{a} + [\nabla_{q_{a}} [m_{a a}^{s} (q_{a}) {\dot{q}}_{a}] - \frac{1}{2} \nabla_{q_{a}}^{⊤} [m_{a a}^{s} (q_{a}) {\dot{q}}_{a}]] {\dot{q}}_{a} + \nabla_{q_{a}} V (q_{1}, q_{a}) = [G_{a} (q) - m_{a u} m_{u u}^{- 1} G_{u} (q)] u, \end{matrix}

(23)

with

m_{a a}^{s} (q_{a}) = m_{a a} (q_{a}) - m_{a u} (q_{a}) m_{u u}^{- 1} m_{a u}^{⊤} (q_{a})

. In addition, if Assumption 5 also holds, then the EL dynamics can be written as follows

\begin{matrix} m_{u u} {\ddot{q}}_{1} + \nabla_{q_{u}} V_{u} |_{q_{u} = q_{1} - Φ_{a} (q_{a})} = G_{u} (q) u, \end{matrix}

(24)

\begin{matrix} m_{a a}^{s} {\ddot{q}}_{a} + [\nabla_{q_{a}} [m_{a a}^{s} {\dot{q}}_{a}] - \frac{1}{2} \nabla_{q_{a}}^{⊤} [m_{a a}^{s} {\dot{q}}_{a}]] {\dot{q}}_{a} + \nabla_{q_{a}} V_{a} - m_{a u} m_{u u}^{- 1} \nabla_{q_{u}} V_{u} |_{q_{u} = q_{1} - Φ_{a} (q_{a})} = [G_{a} (q) - m_{a u} m_{u u}^{- 1} G_{u} (q)] u . \end{matrix}

(25)

Proof.

The proof follows from Proposition 1 and Assumptions 1–4 by setting in (13) and (14) the following conditions:

q_{1} = q_{u} + Φ_{a} (q_{a})

,

q_{2} = q_{a}

,

m_{u u}

is a constant matrix, and

m_{a a}^{s} (q) = m_{a a}^{s} (q_{a})

. The second part follows from the fact that, under Assumption 5, the potential function is

V (q) = V_{a} (q_{a}) + V_{u} (q_{1} - Φ_{a} (q_{a}))

. □

Remark 2.

Notice that the system in the partial linear form (24) and (25) has been used to design a PID passivity-based controller in [18]. In that work, an outer partial feedback linearization (PFL) control is used to obtain the desired form, which compromises the robustness of the closed loop. However, this PFL control can be avoided by using a generalized change of coordinates as shown in Corollary 1.

The generalized coordinate transformation in Proposition 1 is also useful (as it will be shown in the next section) for the underactuated swarm of particles.

3. Underactuated Robot System

We consider an underactuated robot system with masses

m_{1}

,

m_{2}

, and

m_{3}

, as shown in Figure 1, which are rigidly fastened to a massless shaft and are free to move in the 2D plane. We now set up the equations of motion of the holonomic robot using the convenient coordinates

q = {[q_{1}, q_{2}, q_{3}]}^{⊤} = {[x_{1}, y_{1}, θ]}^{⊤}

. An external force

f_{1}

is applied to

m_{1}

in the direction of

- x_{1}

and

y_{1}

respectively, and

f_{3}

to

m_{3}

in the direction of

- x_{3}

and

y_{3}

respectively. To simplify the notation, we assume that all particle masses are the same (i.e.,

m_{i} = m

for

i = 1, \dots, 3

). Applying Lagrange’s equations, it immediately follows that

\begin{matrix} L = \frac{1}{2} {\dot{q}}^{⊤} [\begin{matrix} 3 m & 0 & - 3 L m sin (θ) \\ 0 & 3 m & 3 L m cos (θ) \\ - 3 L m sin (θ) & 3 L m cos (θ) & 5 L^{2} m \end{matrix}] \dot{q} \end{matrix}

(26)

where

(x_{1}, y_{1})

is positioned at the center of the first mass particle, L is the distance between each mass, and

θ

is the inclination angle (see Figure 1). The equations of motion can be written in compact form as

\begin{matrix} M (q) \ddot{q} + C (q, \dot{q}) \dot{q} = G (q) u, \end{matrix}

(27)

where

M (q)

is the generalized inertia matrix

\begin{matrix} M (q) = [\begin{matrix} 3 m & 0 & - 3 L m sin (θ) \\ 0 & 3 m & 3 L m cos (θ) \\ - 3 L m sin (θ) & 3 L m cos (θ) & 5 L^{2} m \end{matrix}] . \end{matrix}

(28)

C (q, \dot{q})

is the Coriolis matrix

\begin{matrix} C (q, \dot{q}) = \\ [\begin{matrix} 0 & 0 & - 3 L \dot{θ} m cos (θ) \\ 0 & 0 & - 3 L \dot{θ} m sin (θ) \\ \frac{3 L \dot{θ} m cos (θ)}{2} & \frac{3 L \dot{θ} m sin (θ)}{2} & h_{s} \end{matrix}] \end{matrix}

(29)

with

\begin{matrix} h_{s} = - \frac{3 L m ({\dot{x}}_{1} cos (θ) + {\dot{y}}_{1} sin (θ))}{2} . \end{matrix}

(30)

Therefore, the elements of the inertia matrix for the holonomic robot are given by

\begin{matrix} m_{u u} & = & 3 m \\ m_{a u}^{⊤} & = & [\begin{matrix} 0 & - 3 L m sin (θ) \end{matrix}] \\ m_{a a} & = & [\begin{matrix} 3 m & 3 L m cos (θ) \\ 3 L m cos (θ) & 5 L^{2} m \end{matrix}] . \end{matrix}

(31)
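As a quick numerical sanity check of the partition in (31), the sketch below builds M(q) from (28), forms T(q) as in (19), and verifies that the transformed inertia is block diagonal with blocks m_{uu} and the Schur complement of (16). The function names and the values m = 1, L = 1, θ = 0.7 are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def inertia_matrix(theta, m=1.0, L=1.0):
    """Generalized inertia matrix M(q) of Eq. (28); m and L are assumed values."""
    s, c = np.sin(theta), np.cos(theta)
    return np.array([
        [3 * m, 0.0, -3 * L * m * s],
        [0.0, 3 * m, 3 * L * m * c],
        [-3 * L * m * s, 3 * L * m * c, 5 * L**2 * m],
    ])

def decouple(M, s=1):
    """Return T of Eq. (19), the product T^T M T, and the Schur complement of Eq. (16).

    s is the number of unactuated coordinates (s = 1 for the robot of Section 3).
    """
    n = M.shape[0]
    m_uu, m_au, m_aa = M[:s, :s], M[s:, :s], M[s:, s:]
    T = np.eye(n)
    T[:s, s:] = -np.linalg.solve(m_uu, m_au.T)          # -m_uu^{-1} m_au^T
    m_aa_s = m_aa - m_au @ np.linalg.solve(m_uu, m_au.T)
    return T, T.T @ M @ T, m_aa_s

M = inertia_matrix(theta=0.7)
T, M_new, m_aa_s = decouple(M)
print(np.round(M_new, 6))  # off-diagonal blocks should vanish up to round-off
```

The same `decouple` helper works for any symmetric positive definite inertia matrix whose unactuated coordinates are listed first, which matches the ordering q = col(q_u, q_a) used in Section 2.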

The virtual work is given by

\begin{matrix} δ W = - (f_{1} + f_{3}) sin (θ) δ x_{1} + (f_{1} + f_{3}) cos (θ) δ y_{1} + 2 L f_{3} δ θ . \end{matrix}

(32)

The derivation of (32) is given in Appendix A. Without loss of generality,

G (q)

can be written as

G (q) = [\begin{matrix} - sin (θ) & 0 \\ cos (θ) & 0 \\ 0 & 1 \end{matrix}],

(33)

with

u = {[u_{1}, u_{2}]}^{⊤} = {[f_{1} + f_{3}, 2 L f_{3}]}^{⊤}

. Therefore,

G_{u} (q) = [\begin{matrix} - sin (θ) & 0 \end{matrix}]

and

G_{a} (q) = [\begin{matrix} cos (θ) & 0 \\ 0 & 1 \end{matrix}]

. Note that

G_{a} (q)

is an invertible

2 \times 2

matrix whenever

cos (θ) \neq 0

.

Mechanical Properties of the Underactuated Robot

The robot as defined by (27) has several fundamental properties, which can be used to facilitate the design of an automatic control mechanism.

P.1: $M (q)$ is a positive definite matrix.
P.2: The inertia matrix depends only on the actuated variables $q_{a}$ , i.e., $M (q) = M (q_{a})$ .
P.3: The sub-block matrix $m_{u u}$ of the inertia matrix is constant.
P.4: From (28) and (29), and by using (30), we get

$\begin{matrix} \dot{M} (q) - 2 C (q, \dot{q}) = \\ [\begin{matrix} 0 & 0 & \frac{9 L \dot{θ} m cos (θ)}{2} \\ 0 & 0 & \frac{9 L \dot{θ} m sin (θ)}{2} \\ - \frac{9 L \dot{θ} m cos (θ)}{2} & - \frac{9 L \dot{θ} m sin (θ)}{2} & 0 \end{matrix}], \end{matrix}$

(34)

which is a skew-symmetric matrix.
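Property P.1 can be verified directly from (28). The following worked check is a sketch using Sylvester's criterion (all leading principal minors positive), assuming m > 0 and L > 0:

```latex
\Delta_{1} = 3m > 0, \qquad
\Delta_{2} = \det \begin{bmatrix} 3m & 0 \\ 0 & 3m \end{bmatrix} = 9 m^{2} > 0, \\
\Delta_{3} = \det M(q)
= 3m \left( 15 L^{2} m^{2} - 9 L^{2} m^{2} \cos^{2}\theta \right) - 27 L^{2} m^{3} \sin^{2}\theta
= 18 L^{2} m^{3} > 0 .
```

The determinant does not depend on θ, so M(q) is positive definite for every configuration.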

The system has three degrees of freedom and only two actuators; hence, it is an underactuated mechanical system. Nonlinearities arise because the generalized inertia matrix has configuration-dependent off-diagonal terms and the input matrix is highly coupled. Because of the missing actuator, this system cannot be fully linearized using exact feedback linearization. However, it is still possible to apply PFL to the system, such that the translational dynamics

q_{1} = x_{1}

and

q_{2} = y_{1}

become a double integrator. As already mentioned, PFL compromises the robustness of the closed loop. However, the PFL can be avoided by using the proposed transformation, as shown in Corollary 1. Given the properties P.1–P.4, we apply the generalized coordinate transformations based on Proposition 1 to decouple the system.

Proposition 2.

Considering the holonomic robot in (27), the dynamical system model can be rewritten as

\begin{matrix} {\ddot{x}}_{1} = f_{x_{1}} (θ, \dot{θ}) - u_{1} L sin (θ), \end{matrix}

(35)

\begin{matrix} {\ddot{y}}_{1} = f_{y_{1}} (θ, \dot{θ}) + \frac{5 cos (θ)}{6 m} u_{1} - \frac{cos (θ)}{2 m} u_{2}, \end{matrix}

(36)

\begin{matrix} \ddot{θ} = - \frac{1}{2 m} u_{1} + \frac{1}{2 m} u_{2} . \end{matrix}

(37)

Proof.

By applying Proposition 1 the result follows. □

In the next section, we address questions related to the automatic control of a particle swarm that minimizes energy by applying the transformed underactuated model in (35)–(37). We prove that the mean of configuration variables is controllable and provide conditions under which the variance is also controllable.

4. Control Design for a Swarm

In this section, we present an automatic controller for a swarm of particles that minimizes energy. We show that it only relies on the first two moments of the swarm configuration variables, i.e., the position and the orientation angle distribution. The main objective of our automatic control approach is to act on forces optimally so that particles can reach the desired target position

q^{*} = {[x^{*}, y^{*}, θ^{*}]}^{⊤}

with the stable Euler angle (

lim_{t \to \infty} θ = 0

).

4.1. Swarm Dynamical System Model

By defining

x = (x_{1}, {\dot{x}}_{1}, y_{1}, {\dot{y}}_{1}, θ, \dot{θ})

, the dynamics of (35)–(37) can be written as

\begin{matrix} \dot{x} = A (x) x + B (x) u . \end{matrix}

(38)

The elements of

A (x)

and

B (x)

are

\begin{matrix} [\begin{matrix} {\dot{x}}_{1} \\ {\ddot{x}}_{1} \\ {\dot{y}}_{1} \\ {\ddot{y}}_{1} \\ \dot{θ} \\ \ddot{θ} \end{matrix}] = [\begin{matrix} 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 3 L m sin (θ) \dot{θ} (- \dot{θ} + 1) + 5 L & 3 L m cos (θ) (2 \dot{θ} - 1) \\ 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & L {\dot{θ}}^{2} cos (θ) + 3 sin (θ) & 2 L \dot{θ} sin (θ) \\ 0 & 0 & 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}] [\begin{matrix} x_{1} \\ {\dot{x}}_{1} \\ y_{1} \\ {\dot{y}}_{1} \\ θ \\ \dot{θ} \end{matrix}] \\ + [\begin{matrix} 0 & 0 \\ - L sin (θ) & 0 \\ 0 & 0 \\ \frac{5 L cos (θ)}{6 m} & - \frac{cos (θ)}{2 m} \\ 0 & 0 \\ - \frac{1}{2 m} & \frac{1}{2 m} \end{matrix}] [\begin{matrix} u_{1} \\ u_{2} \end{matrix}] \end{matrix}

(39)

The system is nonlinear, since matrices

A (x)

and

B (x)

both depend on the current state variables. Firstly, we analyze the number of controllable states as given by the following definition.

Definition 1.

The states in (38) are controllable if the pair

{A (x), B (x)}

is point-wise controllable. This can be observed by the rank of the controllability matrix

\begin{matrix} C = [\begin{matrix} B (x) & A (x) B (x) & \dots & A^{5} (x) B (x) \end{matrix}] . \end{matrix}

(40)

For the system in (38), the matrix

C

has full rank (i.e., rank

(C) = 6

). Therefore, by Definition 1, all states are controllable. Previous work has shown that the mean and variance of many simple fully actuated particles are controllable [5,7]. Next, we show how to stabilize the nonlinear underactuated particles with a state-feedback controller designed via state-dependent Riccati equation (SDRE) control [14]. Motivated by (38) and defining

\bar{x}

as the mean states of N particles, we can write the dynamical system model of the swarm as

\begin{matrix} \dot{\bar{x}} = A (\bar{x}) \bar{x} + B (\bar{x}) u . \end{matrix}

(41)

Interestingly, analyzing the controllability of the swarm dynamics results in the same form as in (38), hence, the mean states are controllable.
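The point-wise rank test of Definition 1 is easy to reproduce numerically. The sketch below transcribes the entries of A(x) and B(x) from (39), stacks the controllability matrix (40), and evaluates its rank at one state. The function names, the parameter values m = 1 and L = 1, and the choice of the origin as evaluation point are assumptions made for illustration.

```python
import numpy as np

def A_matrix(x, m=1.0, L=1.0):
    """State-dependent matrix A(x) with entries transcribed from Eq. (39)."""
    th, thd = x[4], x[5]
    A = np.zeros((6, 6))
    A[0, 1] = A[2, 3] = A[4, 5] = 1.0
    A[1, 4] = 3 * L * m * np.sin(th) * thd * (-thd + 1) + 5 * L
    A[1, 5] = 3 * L * m * np.cos(th) * (2 * thd - 1)
    A[3, 4] = L * thd**2 * np.cos(th) + 3 * np.sin(th)
    A[3, 5] = 2 * L * thd * np.sin(th)
    return A

def B_matrix(x, m=1.0, L=1.0):
    """State-dependent input matrix B(x) with entries transcribed from Eq. (39)."""
    th = x[4]
    B = np.zeros((6, 2))
    B[1, 0] = -L * np.sin(th)
    B[3, 0] = 5 * L * np.cos(th) / (6 * m)
    B[3, 1] = -np.cos(th) / (2 * m)
    B[5, 0] = -1 / (2 * m)
    B[5, 1] = 1 / (2 * m)
    return B

def controllability_rank(A, B):
    """Rank of [B, AB, ..., A^(n-1) B] as in Eq. (40)."""
    n = A.shape[0]
    blocks = [B]
    for _ in range(n - 1):
        blocks.append(A @ blocks[-1])
    return np.linalg.matrix_rank(np.hstack(blocks))

x0 = np.zeros(6)  # assumed evaluation point: swarm at rest at the origin
rank_at_origin = controllability_rank(A_matrix(x0), B_matrix(x0))
print(rank_at_origin)
```

Because the test is point-wise, it has to be repeated along the states of interest; a single evaluation only certifies controllability of the frozen pair at that state.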

4.2. Control Law

Our objective is to find minimum energy inputs that steer the swarm to a given target state defined on

t \in [t_{0}, t_{f}) = [0, \infty)

. To do so, consider now the following cost functional

\begin{matrix} J = \frac{1}{2} \int_{0}^{\infty} ({\bar{x}}^{⊤} Q (\bar{x}) \bar{x} + u^{⊤} R (\bar{x}) u) d t, \end{matrix}

(42)

with respect to the state

\bar{x}

and control input u subject to the nonlinear dynamical system model constraint

\begin{matrix} \dot{\bar{x}} = A (\bar{x}) \bar{x} + B (\bar{x}) u, \end{matrix}

where

Q (\bar{x}) \geq 0

penalizes the state, and

R (\bar{x}) > 0

penalizes the control effort for all

\bar{x}

. We aim for a nonlinear state-feedback controller u that stabilizes solutions of problem (41) and (42).

Remark 3.

Cloutier [14] obtains the nonlinear feedback controller via SDRE. Our interest is to provide an alternative interpretation of solving the problem (41) and (42) via Pontryagin’s minimum principle [13].

From (41) and (42), the Hamiltonian can be written as

\begin{matrix} H = & \frac{1}{2} [{\bar{x}}^{⊤} Q (\bar{x}) \bar{x} + u^{⊤} R (\bar{x}) u] \\ + p^{⊤} (t) [A (\bar{x}) \bar{x} + B (\bar{x}) u], \end{matrix}

(43)

where

p (t)

is the adjoint vector. The necessary condition is derived by differentiating (43) with respect to u which yields

\begin{matrix} \nabla_{u} H = R (\bar{x}) u + B^{⊤} (\bar{x}) p = 0 . \end{matrix}

(44)

We obtain the nonlinear feedback controller

\begin{matrix} u = - R^{- 1} (\bar{x}) B^{⊤} (\bar{x}) p . \end{matrix}

(45)

Now, we define

p ≜ P (\bar{x}) \bar{x}

, where the matrix

P (\bar{x})

can be obtained by solving the algebraic Riccati equation

\begin{matrix} A^{⊤} (\bar{x}) P + P A (\bar{x}) - P B (\bar{x}) R^{- 1} (\bar{x}) B^{⊤} (\bar{x}) P \\ + Q (\bar{x}) = 0 . \end{matrix}

(46)

By that we fulfill the second optimality condition

\begin{matrix} \dot{p} = - \nabla_{\bar{x}} H (\bar{x}, t, p) . \end{matrix}

(47)

Therefore, as long as the two conditions in (44) and (47) hold, it is always possible to construct a nonlinear feedback controller that solves the problem (41) and (42). The closed-loop solution for this feedback controller is at least a local optimum and possibly the global optimum.
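To make the control law (45) with the Riccati equation (46) concrete, the following sketch evaluates the feedback at a single state. The paper does not state which Riccati solver was used; the Hamiltonian eigenvector method below is a swapped-in, NumPy-only choice, and the names `solve_care` and `sdre_control` as well as the double-integrator example are hypothetical illustrations rather than the authors' implementation.

```python
import numpy as np

def solve_care(A, B, Q, R):
    """Stabilizing solution P of A^T P + P A - P B R^{-1} B^T P + Q = 0.

    Uses the stable invariant subspace of the Hamiltonian matrix
    (an assumed, textbook method; not taken from the paper).
    """
    n = A.shape[0]
    Rinv = np.linalg.inv(R)
    H = np.block([[A, -B @ Rinv @ B.T], [-Q, -A.T]])
    w, V = np.linalg.eig(H)
    stable = V[:, w.real < 0]            # eigenvectors of the n stable eigenvalues
    X1, X2 = stable[:n, :], stable[n:, :]
    P = np.real(X2 @ np.linalg.inv(X1))
    return 0.5 * (P + P.T)               # symmetrize against round-off

def sdre_control(x, A_of_x, B_of_x, Q_of_x, R):
    """Feedback u = -R^{-1} B^T(x) P(x) x, cf. Eq. (45) with p = P x."""
    A, B, Q = A_of_x(x), B_of_x(x), Q_of_x(x)
    P = solve_care(A, B, Q, R)
    return -np.linalg.solve(R, B.T @ P @ x)

# Illustrative check on a double integrator (not the swarm model)
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q, R = np.eye(2), np.eye(1)
P = solve_care(A, B, Q, R)
u = sdre_control(np.array([1.0, 0.0]), lambda x: A, lambda x: B, lambda x: Q, R)
```

In an SDRE loop the three callables would return A, B, and Q evaluated at the current mean state, so the Riccati equation is re-solved at every sampling instant.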

4.3. Stability Analysis

Theorem 1.

Consider the dynamical system model (41), with the feedback controller (45). Assume in addition that for a constant input weighting matrix

R > 0

, the state weighting matrix

Q (\bar{x}) > 0

can be chosen, such that

\dot{P} (\bar{x}) < 0

for all

\bar{x}

, where

P (\bar{x})

is the solution of (46). Then the zero equilibrium of the closed-loop system is semi-globally stable.

Proof.

Consider the Lyapunov function candidate

\begin{matrix} L (\bar{x}) = {\bar{x}}^{⊤} P (\bar{x}) \bar{x}, \end{matrix}

(48)

the time derivative of which, along the trajectories of the closed-loop dynamical system, is such that

\begin{matrix} \dot{L} (\bar{x}) = {\dot{\bar{x}}}^{⊤} P (\bar{x}) \bar{x} + {\bar{x}}^{⊤} P (\bar{x}) \dot{\bar{x}} + {\bar{x}}^{⊤} \dot{P} (\bar{x}) \bar{x} \\ = {\bar{x}}^{⊤} [\dot{P} (\bar{x}) - Q (\bar{x}) \\ - P (\bar{x}) B (\bar{x}) R^{- 1} B^{⊤} (\bar{x}) P (\bar{x})] \bar{x}, \end{matrix}

(49)

where

P (\bar{x}) B (\bar{x}) R^{- 1} B^{⊤} (\bar{x}) P (\bar{x}) \geq 0

. In addition, the assumed selection of

Q (\bar{x})

yields

\dot{P} (\bar{x}) < 0

and therefore

\dot{L} (\bar{x}) < 0

for all nonzero mean states, which proves the claim. □
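The step from the first to the second line of (49) can be spelled out as follows. This worked derivation is a sketch that substitutes the closed-loop dynamics obtained from (41) and (45) with p = P x̄ and then uses the Riccati equation (46); the arguments of A, B, P, and Q are dropped for brevity.

```latex
\dot{\bar{x}} = \left( A - B R^{-1} B^{\top} P \right) \bar{x}
\;\Longrightarrow\;
\dot{L} = \bar{x}^{\top} \left[ A^{\top} P + P A - 2 P B R^{-1} B^{\top} P + \dot{P} \right] \bar{x}, \\
A^{\top} P + P A = P B R^{-1} B^{\top} P - Q
\;\Longrightarrow\;
\dot{L} = \bar{x}^{\top} \left[ \dot{P} - Q - P B R^{-1} B^{\top} P \right] \bar{x} .
```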

4.4. Controlling Mean and Variance

The means and the variances

σ_{x_{1}}^{2}

and

σ_{y_{1}}^{2}

of the positions of N underactuated particles are

\begin{matrix} {\bar{x}}_{1} = \frac{1}{N} \sum_{i = 1}^{N} x_{1 i}, σ_{x_{1}}^{2} = \frac{1}{N} \sum_{i = 1}^{N} {(x_{1 i} - {\bar{x}}_{1})}^{2}, \end{matrix}

(50)

\begin{matrix} {\bar{y}}_{1} = \frac{1}{N} \sum_{i = 1}^{N} y_{1 i}, σ_{y_{1}}^{2} = \frac{1}{N} \sum_{i = 1}^{N} {(y_{1 i} - {\bar{y}}_{1})}^{2} . \end{matrix}

(51)
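The moments in (50) and (51) are the only swarm statistics the controller needs. A minimal sketch of their computation is given below; the function name `swarm_moments` and the sample positions are hypothetical, and the variance uses the 1/N normalization of (50) and (51).

```python
import numpy as np

def swarm_moments(positions):
    """Mean and variance of particle positions, cf. Eqs. (50) and (51).

    positions: array-like of shape (N, 2) with columns x_1i and y_1i.
    """
    positions = np.asarray(positions, dtype=float)
    mean = positions.mean(axis=0)
    var = ((positions - mean) ** 2).mean(axis=0)   # 1/N normalization
    return mean, var

# Illustrative positions of N = 3 particles (assumed values)
mean, var = swarm_moments([[0.0, 1.0], [2.0, 1.0], [4.0, 4.0]])
print(mean, var)
```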

The objective now is to control both the mean and the variance so that the swarm approaches a target position with minimum variance. To this end, we select the hysteresis-based strategy of [5,19]. The automatic controller regulates the mean states of N underactuated particles with radius r but switches to minimizing the variance if the variance exceeds the threshold

σ_{max} = 2.5 r + σ_{optimal}^{2} (n, r)

and until

σ_{min} = 15 r + σ_{optimal}^{2} (n, r)

is reached [5]. The idea of using such values comes from Graham and Sloane [8]. They proved that the minimum variance to collect N 2D circles with radius r is

0.55 N r^{2}

. In total, our proposed methodology consists of (1) applying the generalized coordinate transformation shown in Algorithm 1 and (2) proposing and analyzing the control mechanism that regulates the mean and variance of the swarm of underactuated particles, shown in Algorithm 2.

Algorithm 1 Generalised Coordinate Transformation for Underactuated Particle (27).

begin procedure
Step 1: Partition the generalized coordinates and velocity as q = col (q_{u}, q_{a}) and \dot{q} = col ({\dot{q}}_{u}, {\dot{q}}_{a}).
Step 2: Construct the invertible mapping

\dot{Φ} (q) = T^{- 1} (q) \dot{q},

with

T (q) = [\begin{matrix} I_{s} & - m_{u u}^{- 1} m_{a u}^{⊤} \\ 0_{m \times s} & I_{m} \end{matrix}] .

Step 3: Apply Proposition 1.
end procedure

Algorithm 2 Hysteresis-based Mean and Variance Automatic Control

Require: Knowledge of underactuated particle swarm mean

\bar{x}

, variance

{[σ_{x_{1}}^{2} σ_{y_{1}}^{2}]}^{⊤}

, the boundary of the search space

\{x_{min}, x_{max}, y_{min}, y_{max}\}

, and the desired mean state

q^{*} = {[x^{*}, y^{*}, θ^{*}]}^{⊤}

.

x_{goal} \leftarrow x^{*}, y_{goal} \leftarrow y^{*}
loop
  if σ_{x_{1}}^{2} > σ_{max} then
    x_{goal} \leftarrow x_{min}
  else
    x_{goal} \leftarrow x^{*}
  end
  if σ_{y_{1}}^{2} > σ_{max} then
    y_{goal} \leftarrow y_{min}
  else
    y_{goal} \leftarrow y^{*}
  end
  Apply the automatic control law (45) to regulate the underactuated swarm to the desired state q^{*} = {[x^{*}, y^{*}, θ^{*}]}^{⊤}
end loop
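The goal-selection branch of Algorithm 2 can be sketched in a few lines. The function below follows the algorithm as printed, which switches on the upper threshold only; a full hysteresis that also uses the lower threshold would need to remember the current mode between iterations. The name `select_goal`, the argument layout, and the numbers in the usage line are hypothetical.

```python
def select_goal(var_x, var_y, target, bounds, sigma_max):
    """One pass of the goal selection in Algorithm 2.

    target: (x_star, y_star), the desired mean position.
    bounds: (x_min, x_max, y_min, y_max), the boundary of the search space.
    If the variance along an axis exceeds sigma_max, the goal on that axis
    is moved to the lower boundary so that the particles gather there.
    """
    x_star, y_star = target
    x_min, _, y_min, _ = bounds
    x_goal = x_min if var_x > sigma_max else x_star
    y_goal = y_min if var_y > sigma_max else y_star
    return x_goal, y_goal

# Illustrative call with assumed numbers: only the x variance is too large
goal = select_goal(5.0, 0.5, target=(1.0, 2.0),
                   bounds=(-3.0, 3.0, -4.0, 4.0), sigma_max=1.0)
print(goal)
```

The returned goal would then replace the position entries of the target state before the control law (45) is evaluated.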

4.5. Fully-Actuated vs. Underactuated Particle Swarm

We now consider a small swarm of

N = 4

particles to showcase the performance of the proposed control law and highlight the advantage of the underactuated particle swarm over the fully-actuated swarm [5]. The sampling time is set to

0.01

s and the physical parameters are given in Table 1. The control gain matrices

Q (\bar{x})

and

R

are based on the assumptions of Theorem 1 and we get

\begin{matrix} Q (\bar{x}) = (1 + 0.01 {({\bar{x}}_{5} - 0)}^{2} + 0.01 {({\bar{x}}_{6} - 0)}^{2}) \cdot \\ 0.001 \, diag (260, 1, 260, 1, 160, 100), R = [\begin{matrix} 50 & 0 \\ 0 & 50 \end{matrix}] . \end{matrix}

We compare it to the approach of Shahrokhi et al. [5]. Their control gains for the PD controller are

K_{p_{x}} = 0.04

,

K_{p_{y}} = 0.03

,

K_{d_{x}} = 0.03

, and

K_{d_{y}} = 0.04

. Figure 2 compares our approach and the PD controller [5] for the obtained trajectories of mean, variance, and the control inputs. Even though the settling times seem satisfactory for both approaches, the trajectory and the control inputs allow us to discriminate between the two approaches. The control inputs obtained through our approach are significantly smaller resulting in less energy consumption. Also, note that there are no sudden peaks in the control inputs. The fully-actuated approach consumes

1.0347

of energy compared to the under-actuated one that consumes

0.4639

. This is an energy reduction of approximately

55 %

.

Both approaches minimize mean and variance. However, in the underactuated case, we stabilize the mean Euler angle

\bar{θ}

with only two global control inputs. Hence, we reasonably balance the tradeoff between control complexity and system performance.

5. Multi-Robot Simulations and Discussion

We also show the result for a swarm of

N = 8

robots to visualize our results in an accessible way. Time is discretized and the control signal is scaled by

δ t = 0.01

. The underactuated robots and arena boundaries are simulated as physical entities. Each underactuated robot has a random initial pose and the swarm’s mean position has a randomly generated target pose. The nonlinear controller described in Algorithm 2 steers the robots from a starting position to a target position (equilibrium point of the swarm) with a stable Euler angle. Figure 3 shows four screenshots during a representative simulation run. This result shows how the properties of the underactuated robot system (e.g., torque and inertia) are exploited to regulate the mean, to minimize the variance, and to steer the swarm to the target with the right pose.

Looking ahead, details such as actuators and engines will need further abstraction so that the method becomes a building-block tool assembled from various components. The automatic control mechanism currently has several limitations. At this stage of the method development, non-holonomic constraints are not considered. Furthermore, tracking only the boundary of a ‘cloud of robots’ and their center of gravity has yet to be addressed; possibly, the particle density could also be measured instead of each individual robot.

6. Conclusions

We have proposed a centralized automatic stochastic control approach for large-scale systems of underactuated robots based on a generalized change of coordinates. We transform underactuated robot systems into a partial form that can be used for control design. At the cost of centrally tracking all robots, we gain the benefit of optimal energy-efficient control in the task of minimizing positional variance and moving the robot system’s mean to a goal position. The requirement of tracking all robots is unlikely to scale arbitrarily. A future extension of our method could hence be to track only the boundary of a cloud of robots and their center of gravity. The particle density could possibly also be measured instead of each individual robot. There is no immediate way of transferring our method to a decentralized approach, which makes it complementary to behavior-based approaches from swarm robotics that show increased robustness without a central element. The pros and cons of centralized and decentralized approaches therefore need to be weighed carefully by the designer for a given use case. In future work, we plan to test and study our approach on real robots with different physical characteristics, such as the Kilobot and other robots with larger masses. An extension of the method to a manipulation scenario [7] also seems particularly relevant.

Author Contributions

B.S. developed the generalized coordinates transformation under feasibility constraints, obtained the numerical results, and prepared the paper. G.E. conceived the overall concept and supervised the development of the framework, results, and paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivation of the Variation of the Work δW

In this appendix, the variation of the work (the virtual work) is derived. Let us consider the positions of the mass particles

\begin{matrix} x & = & x_{1} + L cos (θ), \\ y & = & y_{1} + L sin (θ), \\ x_{3} & = & x_{1} + 2 L cos (θ), \\ y_{3} & = & y_{1} + 2 L sin (θ) . \end{matrix}

Now taking variations

\begin{matrix} [\begin{matrix} - f_{1} sin (θ) \\ f_{1} cos (θ) \end{matrix}] δ [\begin{matrix} x_{1} \\ y_{1} \end{matrix}] = [\begin{matrix} - f_{1} sin (θ) δ x_{1} \\ f_{1} cos (θ) δ y_{1} \end{matrix}], \end{matrix}

(A1)

and

\begin{matrix} [\begin{matrix} - f_{3} sin (θ) \\ f_{3} cos (θ) \end{matrix}] δ [\begin{matrix} x_{3} \\ y_{3} \end{matrix}] = [\begin{matrix} - f_{3} sin (θ) (δ x_{1} - 2 L sin (θ) δ θ) \\ f_{3} cos (θ) (δ y_{1} + 2 L cos (θ) δ θ) \end{matrix}] . \end{matrix}

Collecting terms, we have

\begin{matrix} δ W & = & [\begin{matrix} - f_{1} sin (θ) δ x_{1} - f_{3} sin (θ) δ x_{1} + f_{3} {sin}^{2} (θ) 2 L δ θ \\ f_{1} cos (θ) δ y_{1} + f_{3} cos (θ) δ y_{1} + f_{3} {cos}^{2} (θ) 2 L δ θ \end{matrix}] \\ & = & [\begin{matrix} - (f_{1} + f_{3}) sin (θ) δ x_{1} \\ + (f_{1} + f_{3}) cos (θ) δ y_{1} \\ + 2 L f_{3} δ θ \end{matrix}] . \end{matrix}

(A2)
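
The collection of terms can be checked numerically. The sketch below assumes that the stacked rows in the brackets are summed to obtain the scalar virtual work and that the forces f_1 and f_3 act normal to the shaft at the first and third mass; the function names and the sample values are illustrative only.

```python
import math

def virtual_work_direct(f1, f3, theta, L, dx1, dy1, dtheta):
    """Sum F_i . delta r_i for the forces at masses 1 and 3."""
    direction = (-math.sin(theta), math.cos(theta))  # normal to the shaft
    # Variations of (x3, y3) = (x1 + 2 L cos(theta), y1 + 2 L sin(theta)).
    dx3 = dx1 - 2 * L * math.sin(theta) * dtheta
    dy3 = dy1 + 2 * L * math.cos(theta) * dtheta
    work_1 = f1 * (direction[0] * dx1 + direction[1] * dy1)
    work_3 = f3 * (direction[0] * dx3 + direction[1] * dy3)
    return work_1 + work_3

def virtual_work_collected(f1, f3, theta, L, dx1, dy1, dtheta):
    """Closed form after collecting terms, using sin^2 + cos^2 = 1."""
    return (-(f1 + f3) * math.sin(theta) * dx1
            + (f1 + f3) * math.cos(theta) * dy1
            + 2 * L * f3 * dtheta)

args = dict(f1=0.3, f3=0.7, theta=0.4, L=0.02, dx1=1e-3, dy1=-2e-3, dtheta=5e-3)
difference = virtual_work_direct(**args) - virtual_work_collected(**args)
```

The two expressions agree up to floating-point rounding because the θ-dependent parts of the torque term combine through the identity sin²(θ) + cos²(θ) = 1, leaving 2 L f_3 δθ.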

References

1. Olfati-Saber, R.; Fax, J.A.; Murray, R.M. Consensus and Cooperation in Networked Multi-Agent Systems. Proc. IEEE 2007, 95, 215–233.
2. Salamat, B.; Elsbacher, G. Centralized Control in Networks of Underactuated Nonidentical Euler–Lagrange Systems Using a Generalised Multicoordinates Transformation. IEEE Access 2022, 10, 58311–58319.
3. Nuño, E.; Sarras, I.; Basañez, L. Consensus in Networks of Nonidentical Euler–Lagrange Systems Using P+d Controllers. IEEE Trans. Robot. 2013, 29, 1503–1508.
4. Becker, A.; Habibi, G.; Werfel, J.; Rubenstein, M.; McLurkin, J. Massive uniform manipulation: Controlling large populations of simple robots with a common input signal. In Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, 3–7 November 2013; pp. 520–527.
5. Shahrokhi, S.; Becker, A.T. Stochastic swarm control with global inputs. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, 28 September–3 October 2015; pp. 421–427.
6. Rubenstein, M.; Ahler, C.; Nagpal, R. Kilobot: A low cost scalable robot system for collective behaviors. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA 2012), Saint Paul, MN, USA, 14–18 May 2012; pp. 3293–3298.
7. Shahrokhi, S.; Lin, L.; Ertel, C.; Wan, M.; Becker, A.T. Steering a Swarm of Particles Using Global Inputs and Swarm Statistics. IEEE Trans. Robot. 2018, 34, 207–219.
8. Graham, R.L.; Sloane, N.J.A. Penny-packing and two-dimensional codes. Discret. Comput. Geom. 1990, 5, 1–11.
9. Elamvazhuthi, K.; Berman, S. Mean-field models in swarm robotics: A survey. Bioinspir. Biomim. 2019, 15, 015001.
10. Prorok, A.; Correll, N.; Martinoli, A. Multi-level spatial modeling for stochastic distributed robotic systems. Int. J. Robot. Res. 2011, 30, 574–589.
11. Ortega, R.; Spong, M.; Gomez-Estern, F.; Blankenstein, G. Stabilization of a class of underactuated mechanical systems via interconnection and damping assignment. IEEE Trans. Autom. Control 2002, 47, 1218–1233.
12. Acosta, J.; Ortega, R.; Astolfi, A.; Mahindrakar, A. Interconnection and damping assignment passivity-based control of mechanical systems with underactuation degree one. IEEE Trans. Autom. Control 2005, 50, 1936–1955.
13. Kirk, D.; Kirk, D.; Kreider, D. Optimal Control Theory: An Introduction; Prentice-Hall: Hoboken, NJ, USA, 1970.
14. Cloutier, J. State-dependent Riccati equation techniques: An overview. In Proceedings of the 1997 American Control Conference (Cat. No.97CH36041), Albuquerque, NM, USA, 4–6 June 1997; Volume 2, pp. 932–936.
15. Freeman, R.; Kokotovic, P. Optimal nonlinear controllers for feedback linearizable systems. In Proceedings of the 1995 American Control Conference—ACC’95, Seattle, WA, USA, 21–23 June 1995; Volume 4, pp. 2722–2726.
16. Isidori, A. Nonlinear Control Systems; Springer: Berlin/Heidelberg, Germany, 1995; Volume 3.
17. Shahrokhi, S.; Lin, L.; Becker, A.T. Planar Orientation Control and Torque Maximization Using a Swarm With Global Inputs. IEEE Trans. Autom. Sci. Eng. 2019, 16, 1980–1987.
18. Salamat, B.; Tonello, A.M. A Swash Mass Pendulum with Passivity-Based Control. IEEE Robot. Autom. Lett. 2021, 6, 199–206.
19. Kloetzer, M.; Belta, C. Temporal Logic Planning and Control of Robotic Swarms by Hierarchical Abstractions. IEEE Trans. Robot. 2007, 23, 320–330.

Figure 1. Underactuated robot as a system of particles.

Figure 2. Simulation of control laws. (a) Automatic control of the mean and variance with 4 particles in the search space (black dashed line): underactuated particle swarm under control Algorithms 1 and 2 compared with the fully-actuated approach [5]. (b) In the simulation, increased Brownian noise results in a more rapid increase of variance.

Figure 3. Different stages of $N = 8$ underactuated robots using our nonlinear controller (Equation (45)). (a) Initial condition. (b) Hysteresis-based control following Algorithm 2. (c) Minimizing mean and variance of robot positions utilizing the arena boundary. (d) Regulating mean state and stabilizing the Euler angle $θ$.

Table 1. Parameters of the simulated underactuated swarm.

Parameters	Symbol	Value	Unit
mass	m	$0.01$	kg
shaft	L	$0.02$	m

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

