Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning

Hmede, Rodayna; Chapelle, Frédéric; Lapusta, Yuri; Ramón, Juan Antonio Corrales

doi:10.3390/act13120479

Open AccessArticle

Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning

by

Rodayna Hmede

^1,2,*,

Frédéric Chapelle

²,

Yuri Lapusta

² and

Juan Antonio Corrales Ramón

^3,*

¹

Institute for Radiation Protection and Nuclear Safety (IRSN), National Centre for Scientific Research (CNRS), Cadarache, 13115 Saint-Paul-lez-Durance, France

²

Clermont Auvergne INP, CNRS, Institut Pascal, Université Clermont Auvergne, 63000 Clermont-Ferrand, France

³

Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS), Universidade de Santiago de Compostela, 15782 Santiago de Compostela, Spain

^*

Authors to whom correspondence should be addressed.

Actuators 2024, 13(12), 479; https://doi.org/10.3390/act13120479

Submission received: 29 October 2024 / Revised: 24 November 2024 / Accepted: 25 November 2024 / Published: 27 November 2024

(This article belongs to the Special Issue Innovative Actuators Based on Shape Memory Alloys)

Download

Browse Figures

Versions Notes

Abstract

:

The antagonistic system of two shape memory alloy wires is a great inspiration for the robotics field where it is applied as a linear actuator due to its shape memory effect. However, its control is still a challenge due to its hysteresis behavior. For that reason, a new controller is proposed in this paper for the displacement of the system’s effector. It is based on a Long Short-Term Memory neural network model. The aim is achieved by combining temperature-deformation data from an analytical model with voltage-temperature-deformation data from real experiments. Hence, these datasets are studied to overcome the nonlinearity obstacle of this system in order to be able to integrate it into robotic applications.

Keywords:

shape memory alloys; antagonistic system; neural networks; modelling and controlling

1. Introduction

Robotic applications are in need of smart actuator systems capable of fine integration and robustness under various circumstances such as stress. In this context, shape memory alloys (SMAs) are one of the active materials that has been largely studied in this applicative field [1,2,3]. It is considered a smart actuator since it can be trained to deform on its own [4]. This means that it can memorize its original form although it is cooled, heated, stretched, and bent. For that, SMAs have high achievements as sensors and actuators in specific that are investigated in various materials and devices fields in general and smart materials and robotics in specific. Theoretically, an SMA is specified by its solid crystal molecular property. It has two main solid phases: martensite and austenite. Each type of SMA has specific crystallization characteristics (starting and final transformation temperatures) [5]. This nature provides the SMA with two main mechanical properties: shape memory effect and super elasticity. The shape memory effect is characterized by its specific strain–temperature behavior. This is when the SMA is capable of recovering its initial shape. Super elasticity or pseudo elasticity [6] is presented by the stress–strain behavior of the SMA. It can have several shapes: wires, bars, springs, rings, or porous. The antagonistic system (AS), composed of two SMAs, is a challenging system because of its high hysteresis behavior. It is a smart actuator used in various fields, especially systems, biotechnology, automotive, etc. It has a main application in the robotic field as an artificial finger or manipulator actuation [7,8]. It is made up of two identical SMA wires connected at one edge (the midpoint of the whole system) and fixed on the outer edges. The AS has a great studied history among the literature as a linear or rotary actuator. Several publications considered the deformation control of AS SMA using traditional numerical methods such as the finite element method [9] and numerical models [10] and less traditional approaches such as neural networks [11,12]. However, they are based on an approximate linear piecewise relation, identified through this study, of applied temperature (external stimuli) and the system deformation (response). Therefore, the gap exists in control of the fully non-linear behavior of the actuator system with high accuracy.

The neural network (NN) efficiency is proven in the literature [11], especially in the material sciences field. In the review paper [1], the NNs that were used to model SMAs are reviewed. The authors concluded that each study case (shape, role, and properties) requires a specific AI tool as the specific NN model for the active system.

Moreover, AS was modelled using an NN approach in ref. [13], where the stress–strain behavior of the system was studied using a specific model of long–short-term memory (LSTM) NN with regression layers. By that, the super elastic system’s characteristic is modeled using the LSTM NN. We examined in [13] multiple neural network models for capturing the nonlinear behavior of AS SMAs, concluding that the LSTM model was the most effective. Furthermore, recent advancement [14] has integrated LSTM to study the dynamic response of complex nonlinear systems as tensegrity structures to external stimuli, offering a more accurate prediction of its deformed shape. By combining traditional regression techniques with LSTM networks, they demonstrated enhanced predictive accuracy and robustness. Moreover, in ref. [12], the authors approved the LSTM advantages to model nonlinear behaviors and they provided an advanced model for complex nonlinear ones. Hence, the potential of integrating LSTM is scored for enhanced control of nonlinear systems, as it allows for greater adaptability and more reliable performance in real-world applications, such as robotics and structural monitoring. LSTM NN as a recurrent neural network (RNN) showed its potential through a diversity of studies especially in nonlinear modelling and controlling [15]. The LSTM functionality relies on the network’s ability to learn long-term dependencies between time steps in time series and sequence data. It effectively predicts long patterns of unidentified lengths. In this work, the LSTM neural network (NN) is employed with a regression layer to successfully model the antagonistic SMA actuator. However, modeling or controlling its deformation depending on the variation of its temperature has not been studied using AI. Hence, this article’s aim is to model and control the strain temperature behavior of the AS actuator using NN. The system is heated using applied voltage, and the system’s position is represented by the position of its middle point. The strain–temperature relation has been studied analytically and experimentally to build the proper AI ground truth. This latter is used to learn and test the NN’s method that best challenges the AS’s hysteresis behavior.

2. Methodology

The displacement of the midpoint of the AS defines the temperature behavior of the system upon heating the two wires progressively. This behavior will be tested using three models: numerical simulation of an analytical model, experimental model, and AI model. The analytical model is discussed to prove its linear limitation. The experimental model is studied to build the ground truth of the NN model. The NN model will control the system’s temperature to maintain the actuator’s displacement.

The overall test to study the system’s shape memory effect (strain–temperature behavior) is noted in S1. It is performed as follows:

Initial state: placement of the two wires in an austenitic state by heating them.
Step 0: pre-stretch the two wires to transform them into partially oriented martensite.
Step 1: heat the wire SMA1 by applying the voltage.
Step 2: return the wire SMA1 to room temperature by turning off the applied voltage.
Step 3: heat the wire SMA2 by applying the voltage.
Step 4: return the wire SMA2 to room temperature by turning off the applied voltage.

In the following, we call a “cycle” for a “step 1 to step 4” block, and we call a sequence of cycles for the overall test where there are repetitions of the cycle with different applied heating temperatures. Therefore, the cycle is repeated by increasing the voltage sequence to increase the wires’ heating temperature.

3. Numerical Simulation Model

The numerical simulation’s objective is to understand the theoretical temperature-displacement behavior of the SMA AS system. This relation is presented graphically for each SMA separately in Figure 1 following the piecewise linear assumption. The Young’s modulus evolution during the martensitic transformation is then approximated by a linear function of the Young’s modulus of austenite (EA) and martensite (EM), depending on the percentage of austenite transformed in martensite (X). However, in our case, we have the hypothesis of constant Young’s modulus and constant ambient temperature (T_amb).

Figure 1 schematizes the following steps’ descriptions with their reference points (O, O’, A, B, C, D, and E).

E_SMA = X x E_M + (1 − X) E_A.

(1)

3.1. Static Variables

Force equilibrium:

F1 = F2 → ΔF1 = ΔF2

(2)

Stress increment in the SMA wires:

Δσ₁ = ΔF₁/A₁

(3)

Δσ₂ = ΔF₂/A₂

(4)

3.2. Kinematic Variables

Preservation of system total length:

L₁ = L₂ → ΔL₁ = ΔL₂

(5)

Strain increment of the SMA wires:

Δε₁ = ΔL₁/L₁

(6)

Δε₂ = ΔL₂/L₂

(7)

3.3. Behavioral Law

The behaviour law of the SMA depends on its molecular phase. If the SMA is completely austenitic, its behaviour is defined by

Δσ = E_A × Δε

(8)

On the contrary, if the SMA is stretched and transformed from austenite to martensite, its behaviour is defined by

Δσ = α × Δε

(9)

Step 0: Pre-initialization of the system

The two wires are initially austenitic and stretched, so their transformation strain is 3%. The stress–strain curves of each SMA wire are shifted following the equations

σ₁^O’ = σ_cr

(10)

σ₂^O’ = σ_cr

(11)

ε₁^O’ = σ₁^1/E_A

(12)

ε₂^O’ = σ₂^1/E_A

(13)

ε₁^A = 3%

(14)

ε₂^A = 3%

(15)

σ₁^A = σ₁^O’ + α(ε₁^A − ε₁^O’)

(16)

σ₂^A = σ₂^O’ + α(ε₂^A − ε₂^O’)

(17)

Step 1: Heating of SMA wire 1

SMA1 wire will tend to recover its initial length (memorized shape) and exert force on the other wire. SMA2 will then be stretched by following the Equation (9) law, continuing its martensite transformation according to the slope α. Let us call k+ the slope defining the thermo-mechanical behaviour of SMA wire 1 being heated. Using Equation (5), k+ can be formulated as follows:

k+ = Δσ1/Δε1 = −α

(18)

The evolution of the stress–strain of SMA wire 1 when it is fully austenite is defined by point B0, situated at the following intersection:

The equation line representing the austenitic elasticity of SMA1.

σ₁^B0 = E_A × ε₁^B0

(19)
The equation line of slope -α, passing from point A, as follows:

(σ₁^B0 − σ₁^A)/(ε₁^B0 − ε₁^A) = −α

(20)

It gives

ϵ_{1}^{B 0} = \frac{σ_{1}^{A} + α ϵ_{1}^{A}}{α + E_{A}}

(21)

The evolution of the stress–strain curve of SMA wire 2 gives

ε₂^B0 = ε₂^A − ε₁^B0 + ε₁^A

(22)

Moreover, the behaviour of SMA wire 2 gives

σ₂^B0 = σ₂^A + α(ε₂^B0 − ε₂^A)

(23)

The slope of transition from starting austenite at A’ to final austenite at A”, which can be defined by β’ using Equation (24), we calculate Equations (25) and (26).

σ₁^A = βT + T₀

(24)

We can calculate the following:

T₁^A’ = (−A_S + σ₁^A)/β

(25)

T₁^A” = (−A_F + σ₁^B0)/β

(26)

So, the slope of the transition is as follows:

β’= (σ₁^A − σ₁^B0)/(T₁^A’ − T₁^A’’)

(27)

The evolution of the stress–strain of SMA wire 1 when the temperature T between T₁^A’ and T₁^A’’ is defined by point B:

σ₁^A = β’(T₁^B − T₁^A’) + σ₁^A

(28)

B is on AB₀ of slope

- α

in the stress–strain diagram, so

ε₁^B = (−σ₁^B + σ₁^A)/α + ε₁^A

(29)

For the SMA2, according to the symmetry concept: σ₁^B = σ₂^B.

B is on AB₀ of slope +α in the stress–strain diagram, so

ε₂^B = (σ₂^B − σ₂^A)/α + ε₂^A

(30)

Step 2: Cooling of SMA wire 1

Cooling down the SMA1 will decrease the stress in the two wires. SMA1 returns to an austenite/martensite state, so it must undergo unloading to return to the corresponding line. Let us define k⁻ as the slope that defines the thermo-mechanical behaviour of SMA wire 1. k⁻ is the inverse of the charge/discharge law:

K⁻ = Δσ₁/Δε₁ = − E

(31)

The evolution of the stress–strain curve of SMA1 is defined by point C, situated at the following intersection:

The equation line

σ₁^C = α ε₁^C + b

(32)

representing the martensitic transformation plateau, with

b = σ₁^A − α × ε₁^A

The equation of slope k⁻ = −E passing by point B, which is as follows:

(σ₁^C − σ₁^B)/(ε₁^C − ε₁^B) = −E

(33)

That gives

ε₁^C = (b − σ₁^B – E × ε₁^B )/(−E − α)

(34)

and σ₁^C is calculated using Equation (9).

Concerning SMA wire 2, point C is defined by knowing Equation (5) which gives

ε₂^C = ε₂^B − Δε₁

(35)

σ₂^C = σ₂^B + E(ε₂^C − ε₂^B)

(36)

Step 3: Heating of SMA wire 2

Now the same procedure of the SMA1 heating takes place for SMA2. First, we need the complete transformation of the wire to austenite to find the transition slope β” where SMA2 can be retracted so that SMA wire 1 will be stretched. SMA wire 1 starts its martensite transformation, and its behaviour is defined by Equation (9). As presented before, the slope defining the thermo-mechanical behavior of a heated SMA is k⁺ = −α when the other SMA wire is transforming from austenite to martensite.

The evolution of the stress–strain curve of SMA wire 2 is defined by point D₀, situated at the following intersection:

The equation line

σ₂^D0 = E_A × ε₂^D0

(37)
The equation line of slope -α passing by point C:

(σ₂^D0 − σ₂^C )/( ε₂^D0 − ε₂^C ) = −α

(38)

which gives

ε₂^D0 = (σ₂^C + α ε₂^C)/(α + E_A)

(39)

σ₂^D0 = σ₂^C − α(ε₂^D0 − ε₂^C)

(40)

Concerning SMA2, the evolution of the stress–strain curve is defined by Equation (5), which gives

ε₂^D0 = ε₂^C − ε₂^D0 + ε₂^C (Δε₂)

(41)

Knowing the point of the behavior of SMA1 that is determined by its martensitic transformation and defined by Equation (9), which gives

σ₁^D0 = σ₁^C + α(ε₁^D0 − ε₁^C)

(42)

To calculate the slope of transition from starting austenite at C’ to final austenite at C”, which is defined by β’’, we know that σ = βT + T₀, so we can calculate the following:

T₂^C’ = (−A_S + σ₂^C)/β

(43)

T₂^C” = (−A_F + σ₂^D0)/β

(44)

So, the slope of the transition is as follows:

β” = (σ₂^C − σ₂^D0)/( T₂^C’ − T₂^C’’)

(45)

The evolution of the stress–strain of SMA wire 1 when the temperature T between T₂^C’ and T₂^C’’ is defined by point D:

σ₂^D = β” (T₂^D − T₂^C’) + σ₂^C

(46)

D is on CD₀ of slope −α in the stress–strain diagram, so

ε₂^D = (−σ₂^D + σ₂^C)/α + ε₂^C

(47)

For the SMA1, according to the symmetry concept: σ₁^D = σ₂^D.

D is on CD₀ of slope +α in the stress–strain diagram, so

ε₁^D = (σ₁^D − σ₁^C)/α + ε₁^C

(48)

Step 4: Cooling of SMA2

The mechanical behaviour of SMA wire 1 is defined by Equation (9). As previously seen, the thermo-mechanical behaviour of a cooled SMA wire, returning to a mixed martensite/austenite phase, is defined by the slope k⁻ = −E. Point E then defines the evolution of the stress–strain curve of SMA wire 2, situated at the following intersection:

The equation line

σ₂^E = α ε₂^E + b

(49)
The equation line of slope k⁻ = −E passing by point D, that is

(σ₂^E − σ₂^D)/(ε₂^E − ε₂^D) = −E

(50)

which gives:

ε₂^E = (b − σ₂^D – E × ε₂^D)/(−E − α)

(51)

and σ₂ is calculated using Equation (9), which gives:

σ₂^E = σ₂^D − E (ε₂^E− ε₂^D)

(52)

Concerning SMA wire 1, we use the evolution of the strain, which gives:

ε₁^E = ε₁^D − Δε₂

(53)

σ₁^E = σ₁^D + E(ε₁^E − ε₁^D)

(54)

To have explicit knowledge of our SMA, several numerical tests of various heating temperature ranges are studied. The activation temperature (T) depends on the thermal properties of the SMA wire: Ms, Mf, As, and Af. The T values are obtained empirically using MATLAB R2024b, which are between 70 °C and 95 °C. Moreover, the numerical simulation algorithm’s output is the displacement value of the midpoint at each heating step for each wire upon the sequence of cycles where T is increasing progressively. Figure 2a,b presents the results upon applying T on the SMA1 and the SMA2 wires, respectively. As the stress–temperature behaviour of SMA is the shape memory effect, the presence of hysteresis characteristics prevents the precision of these piecewise linear relationships, leading to the analytical model’s limitations and proving the need for AI modelling. We obtain a strictly linear displacement–temperature behavior at step 1 (heating of the SMA1 wire), while it is almost linear at step 3 (heating of the SMA2 wire). At T equals 70 °C, the displacements are 1.3 mm and 2.6 mm in step 1 and step 3, respectively. At T equals 95 °C, the displacements are 13.8 mm and 26.4 mm in step 1 and step 3, respectively. This can be explained well by the shape memory effect of the SMA: at the initialization step of stretching 3% of each wire, the midpoint does not move, having two equal and opposite displacement directions. Upon heating the SMA1 wire, it contracts and causes elongation of the SMA2 wire and an SMA1 state transformation to austenite. Hence, the midpoint is displaced in the negative direction by (−Δl). Upon cooling the SMA1 wire, it remains in its form with light release displacement. Upon heating the SMA2 wire, it contracts and causes an elongation of the SMA1 wire, so the midpoint returns to its initial position by moving (Δl). Then, as SMA2 contracts, the midpoint keeps moving in the positive direction with second (−Δl) as the SMA2 wire’s state transforms to pure austenite. Upon cooling the SMA2 wire, it remains in its form with light displacement. The schematic of this explanation is presented in Figure 3.

4. Experimental Model

The aim of this section is to develop the heating–cooling experiment S1, described previously in the methodology (Section 2) for a two-wire AS based on the single wire presented in Section 4.1. The principle’s schematisation is given in Figure 3. The objective is to measure the position of the midpoint (the connection point between the two wires) and the temperature passing through the wires during the system operation. For that purpose, a physical AS was built and instrumented (see Section 4.2). The experimental results are presented in Section 4.3. Notice that the experimental process respects the same strategy explained in the numerical simulation model part (Section 3) upon the six cycles.

4.1. Initialization and Identification of the SMA Parameters

Experimentally, the used SMA wire is Flexinol of 0.006 inches diameter of a low-temperature (LT) model, which means the SMA is martensite formed at lower temperatures, provided by RobotShop. Flexinol is made up of nickel–titanium (NiTi), enabling this “muscle wire” contraction ability upon electricity application. The single wire length is 30 mm. The wire’s parameters are presented in Table 1.

4.2. Activation and Preparation of the Experimental Setup

The experimental setup is presented in Figure 4, and its CAD design is presented in Figure 5. Sliding of the SMA wires at their two ends in the clamping system is one of the major concerns for the designers of SMA wire-based mechanisms. In our system, the wire is thin (diameter of 0.006 inches), so two adequate clamper tubes at the two ends tighten the SMA wire to achieve accurate actuation strongly. The two SMA wires are connected mechanically in series and electrically in parallel with a switch in the antagonistic system (Figure 6). It is essential to note the electrical isolating significance: (1) between the wires and the metal support, and (2) between the connected wires at the middle point to prevent the electricity passage between them and between the support and the wires, respectively.

Therefore, non-conductor clamper tubes (the copper cylinders in Figure 5) are used for holding the thin wire properly and for electrical isolation. An essential pre-operative step for using SMA in our experiment is provided: Before activating the SMA wires, they are pre-stretched by 3% using the screw connections between the cylinders and the system. Each turn of the pre-stretching screw strains the wire by 1 mm. Therefore, the initialization procedure in this experiment can be considered as pre-stretching each wire by 3% simultaneously.

The setup of S1 consists of the following:

A displacement sensor measures the displacement of the system midpoint.
Two temperature sensors record the temperature of the two wires.
An electric relay switches the current between the two wires.
A switch transistor switches on and then off the current on the actuated wire.
A power supply provides the voltage.
An Arduino setup controls the electric circuit and runs the sensors.
The measured parameters are as follows:
The voltage at the terminals of the wire and the current intensity.
The displacement of the midpoint.
The temperature of the wires.

According to the data sheet (Table 1), the wire needs 2 s to heat up and cool at a low activation temperature. This means each step takes 2 s.

The controlling of the electric circuit is performed by using the Arduino board. The schematic in Figure 6 describes the electric circuit and Arduino connections. The wires are connected electrically in parallel and mechanically in series. This means each wire has an independent electric circuit that Arduino controls to switch the voltage on and off regularly. The aim is to maintain the voltage value passing through the wires. The setup needs a main electric relay in the primary circuit to switch the electricity between the two parallel circuits and a switching transistor to continuously turn the electricity on and off for each circuit. The relay selects the first wire to activate and passes the current through, then it sends it to the transistor that plays the role of a switch to heat the wire by applying the voltage through and then cooling it by blocking the current. Secondly, the relay closes the circuit for SMA1 and opens it for SMA2, and the transistor plays the same role in the SMA2 circuit. Arduino controls the relay and transistor work with each step’s opening and closing time. In addition, Arduino is used to activate and collect the sensors’ data.

4.3. Experimental Results

The temperature–voltage relation was studied at the beginning of the experiments by applying a voltage value on the SMA wire and reading its temperature using the thermal sensor. These experimentally obtained data are presented in Table 2 and Figure 7. This experiment proved the proportional relation between the SMA wire temperature and the applied voltage. Then, upon each step, we connected a temperature sensor to the wires to collect the temperature, a displacement sensor to the midpoint of the two wires to obtain the position or displacement of this point, and a voltmeter to measure the applied voltage at each wire. We figured out the collected data of the midpoint displacement and the wire temperature as a function of the applied voltage. As a result, the temperature-displacement hysteresis behavior of the antagonistic system is obtained. The experimentally collected data are voltage (V), temperature (°C), and displacement (mm). Furthermore, a sequence of cycles of S1 is studied to figure out the displacement–temperature behaviour of the AS SMA system. The experiment starts at time zero, and the zero reference position (the initial position before beginning the experiment) is in the system’s middle as the two wires are of equal length. The sensors return the midpoint displacement (Δl) and the wire temperature T at step 1 (where the SMA1 wire is heated) and step 3 (where the SMA2 is heated). Hence, upon each cycle, we have the midpoint’s actual position, the midpoint displacement, and the temperature of the heated wire (Table 3). Experimentally, the displacement–temperature behaviour of the midpoint of the AS SMA system has a proportional nonlinear relation. It assures the weakness of the analytical model. Therefore, these strictly non-linear graphs will be the learning base. Therefore, we can study the system using AI by this ground truth of the temperature–displacement behaviour.

5. Artificial Intelligence Model

5.1. Ground Truth

The ground truth of the AI study in this work is the experimental data of the displacement–temperature behaviour developed previously using the real system. The experimentally collected data of the AS SMA system are temperature (°C) and displacement (mm). This means, the dataset consists of six cycles of two positions (P1, P3) as a function of temperature. They are presented in Table 3 and Figure 8. The aim is to analyse the evolution of the position of the bonding point between the two SMA wires caused by the voltage applied on the terminals during heating/cooling cycles at room temperature. The displacement–temperature dataset is modelled by the neural network model which consists of the LSTM NN and a regression layer. It is discussed in [13] where the stress–strain behavior is modelled. Moreover, controlling the system depends on predicting the heating temperature needed to reach a precisely targeted position by the midpoint, also called the targeted system position. The learning base consists of the initial position and the target position of the system as inputs and the heating temperature for the two wires as an output. It is time steps data since it is made up of four steps, and each step requires a specific time duration of heating or cooling of the wires. Focusing on the final position of the overall design, we consider step 1 and step 3: the initial position of step 1 (P0) and the final position of step 3. Moreover, considering the intermediate displacement of the system caused by step 1, we take into account P1, which is the final position of step 1 for each cycle.

During our analysis, we checked several metrics such as loss function and R2. We have chosen RMSE as the primary evaluation metric because it effectively quantifies prediction accuracy by penalising larger errors more heavily, making it suitable for the context of our study. Hence, for evaluation, the NN tests depend on the square root of the mean squared error (RMSE) between the predicted and actual values.

5.2. LSTM Modelling of a Sequence of Cycles

The LSTM NN consists of 4 layers: a sequence input layer with 1 feature (variable), a long short-term memory (LSTM) layer with 800 hidden units, a fully connected layer with 1 output, and a regression layer for continuous value prediction. The parameters used in this LSTM setup then include 1 input (the temperature value array), 1 output (the displacement value array), and 800 hidden units determining the amount of information remembered between the states. The LSTM layers update their internal cell and hidden states using the hyperbolic tangent function (tanh) as the state activation function and the hard sigmoid (σ) function as the gate activation function. The fully connected and regression layers play significant roles in the network. Precisely, the regression layer models the relationship between the displacement and the temperature. The network is trained using the ADAM optimizer with a learning rate of 0.005, which decreases by a factor of 0.2 every 125 epochs according to a piecewise schedule. The training process is configured to run for a maximum of 800 epochs, with a gradient threshold of 1 to prevent gradient explosion. The number of hidden units in the LSTM model was determined through empirical testing and optimisation. We evaluated several configurations of hidden units (ranging from 50 to 900) to identify the optimal setup. The selected number of 800 hidden units provided the best trade-off between model accuracy (measured using RMSE) and computational cost. Increasing the number beyond 800 did not result in significant accuracy improvement but increased the training time and risk of overfitting. This choice ensures that the model’s performance is robust while maintaining computational efficiency.

The performance is monitored through training–progress plots, and the model is evaluated using the mean squared error metric. For robust analysis, the dataset is split into 70% for training and 30% for testing of the total experimental dataset.

Hence, this model obtained an excellent modelling of the system behaviour with a low RMSE value. Figure 9a presents the modelling of the displacements of the antagonistic system midpoint as a function of the temperature at step 1 upon a sequence of cycles obtained experimentally. Its RMSE value is

7.2372 \times 10^{- 4}

mm. Figure 9b presents the modelling of the displacements as a function of the temperature at step 3 upon a sequence of cycles obtained experimentally. Its RMSE value is

2.4045 \times 10^{- 4}

mm. Note that the heating temperatures at step 1 and step 3 are the same.

5.3. LSTM Controlling of the Midpoint’s Position

For controlling, the data underwent standardisation prior to training to improve the accuracy and efficiency of the model used. This process scaled both the input and output data to have a standard deviation of 1. Standardisation helps ensure that each feature contributes equally to the model’s learning process, preventing features with larger scales from dominating the model’s training. Moreover, this preprocessing step aids in faster convergence and more stable optimisation during the training of LSTM. Following training, the standardised predictions are transformed back to their original scale, enabling the evaluation of the model’s performance using the key metrics Root Mean Squared Error (RMSE).

Concerning repeatability and reliability, measurements taken to ensure the reliability of our experimental data include (1) careful calibration of equipment—following the activation method of the two SMAs by 3% strain of their initial length, (2) consistent testing conditions using force and heat sensors, and (3) controlled SMA cycling protocols, up to six cycles, upon each position measurement to mitigate fatigue effects.

The NN is designed using recordings of T responding to the initial and targeted system positions. The inputs are three value units: P0 for the initial position, P1 for the intermediate position (caused by step 1), and P2 for the final position (caused by step 3) for a given timestep. The LSTM NN model that has the best training uses 800 hidden units as the LSTM layer’s size with regression and fully connected layers. Consequently, the LSTM NN is computed to control T responding to P1 and P2 respecting the cycle steps, with an RMSE of 2.79 × 10^–4 mm. Figure 10 presents the tested and trained system position–temperature behaviour.

6. Conclusions

In conclusion, the shape memory effect of the antagonistic actuator is studied theoretically and presented numerically using MATLAB. Then the cyclic movement of the system is produced experimentally and measured by the temperature and displacement sensors to extract the AI model’s ground truth. The latter is constituted by the initial, intermediate, and final positions for several consecutive cycles. The final position occurs at the end of heating the SMA2 wire (step 3), the intermediate one occurs at the end of heating the SMA1 wire (step 1), and the initial one appears at the beginning of the cycle. The potential of the LSTM NN model for modelling the shape memory effect and controlling the system’s position has been proven through this work. Moreover, the training parameters have an optimisation role in achieving less RMSE value for the controlling procedure. The number of layers, the maximum epoch number, the activation and training functions, and the number of neurons can summarise these characteristics. From a perspective, as SMA is a challenge in a controlling process, this study can inspire the active material field, especially since the studied system plays an actuator role. Moreover, this study could involve further developments in addition to the cyclic movement. This work could also be extended to control the electric cyclic loop with non-supervision.

Moreover, studying the ambient temperature effect requires further experimental preparations. Therefore, as a perspective, expanding the dataset to include variations in ambient temperature could enhance the model’s robustness. This would improve applicability to a broader range of conditions, ensuring a more reliable and generalisable performance.

We can discuss the non-supervision principle as Reinforcement Learning (RL), which is also a widespread approach in robotic control. This combination could be illustrated as a robotic application for medical tasks, where the global behavior of the smart actuator and the encompassing structure must be addressed.

Author Contributions

Conceptualization, R.H. and F.C.; methodology, R.H., F.C. and J.A.C.R.; validation, R.H., J.A.C.R. and F.C.; software, R.H.; formal analysis, Y.L., J.A.C.R. and F.C.; investigation, R.H., F.C., J.A.C.R. and Y.L.; resources, J.A.C.R. and F.C.; data curation, R.H.; writing—original draft preparation, R.H.; writing—review and editing, F.C., J.A.C.R. and Y.L.; visualization, R.H.; supervision, F.C., J.A.C.R. and Y.L.; project administration, F.C.; funding acquisition, F.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the ANR project AIM (ANR-20-THIA-0001) and the European Union under the European Regional Development Fund (ERDF). It also used materials from the ANR MANIMAT project (ANR-20-CE33-0005). JACR was funded by the Spanish Ministry of Universities through a ’Beatriz Galindo’ fellowship (Ref. BG20/00143), by the Interreg VI-B SUDOE Programme through the research project ROBOTA-SUDOE (Ref. S1/1.1/P0125) and by the European Union (European Regional Development Fund—ERDF).

Data Availability Statement

The data presented in this study are available upon request from the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of the data; in the writing of the manuscript; or in the decision to publish the results.

References

Hmede, R.; Chapelle, F.; Lapusta, Y. Review of Neural Network Modeling of Shape Memory Alloys. Sensors 2022, 22, 5610. [Google Scholar] [CrossRef] [PubMed]
Kheirikhah, M.M.; Rabiee, S.; Edalat, M.E. A Review of Shape Memory Alloy Actuators in Robotics. In RoboCup 2010: Robot Soccer World Cup XIV; Ruiz-del-Solar, J., Chown, E., Plöger, P.G., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 206–217. [Google Scholar] [CrossRef]
Duerig, T.W.; Melton, K.N.; Stöckel, D. Engineering Aspects of Shape Memory Alloys; Butterworth-Heinemann: Oxford, UK, 2013. [Google Scholar]
Aïssa, B.; Memon, N.K.; Ali, A.; Khraisheh, M.K. Recent Progress in the Growth and Applications of Graphene as a Smart Material: A Review. Front. Mater. 2015, 2, 58. [Google Scholar] [CrossRef]
Guo, L.; Chen, F.; Chen, S.; Huang, Y.; Zhang, J.; Wang, C.; Yang, S. The improvement of the shape memory effect of Cu-13.5Al–4Ni high-temperature shape memory alloys through Cr-, Mo-, or V-alloying. J. Sci. Adv. Mater. Devices 2023, 8, 100532. [Google Scholar] [CrossRef]
Jahanbazi Asl, F.; Kadkhodaei, M.; Karimzadeh, F. The effects of shape-setting on transformation temperatures of pseudoelastic shape memory alloy springs. J. Sci. Adv. Mater. Devices 2019, 4, 568–576. [Google Scholar] [CrossRef]
Ko, J.; Jun, M.B.; Gilardi, G.; Haslam, E.; Park, E.J. Fuzzy PWM-PID control of cocontracting antagonistic shape memory alloy muscle pairs in an artificial finger. Mechatronics 2011, 21, 1190–1202. [Google Scholar] [CrossRef]
Dilibal, S.; Engeberg, E.D. Finger-like manipulator driven by antagonistic nickel-titanium shape memory alloy actuators. In Proceedings of the 2015 International Conference on Advanced Robotics (ICAR), Istanbul, Turkey, 27–31 July 2015; pp. 152–157. [Google Scholar] [CrossRef]
Boufayed, R.; Chapelle, F.; Destrebecq, J.F.; Balandraud, X. Finite element analysis of a prestressed mechanism with multi-antagonistic and hysteretic SMA actuation. Meccanica 2020, 55, 1007–1024. [Google Scholar] [CrossRef]
Auricchio, F.; Bonetti, E.; Scalet, G.; Ubertini, F. Theoretical and numerical modeling of shape memory alloys accounting for multiple phase transformations and martensite reorientation. Int. J. Plast. 2014, 59, 30–54. [Google Scholar] [CrossRef]
Imoisili, P.E.; Makhatha, M.E.; Jen, T.-C. Artificial Intelligence prediction and optimization of the mechanical strength of modified Natural Fibre/MWCNT polymer nanocomposite. J. Sci. Adv. Mater. Devices 2024, 9, 100705. [Google Scholar] [CrossRef]
Gonzalez, J.; Yu, W. Non-linear system modeling using LSTM neural networks. IFAC-Pap. 2018, 51, 485–489. [Google Scholar] [CrossRef]
Hmede, R.; Chapelle, F.; Lapusta, Y. Modeling the butterfly behavior of SMA actuators using neural networks. Comptes Rendus. Mécanique 2022, 350, 143–157. [Google Scholar] [CrossRef]
Mao, Z.; Kobayashi, R.; Nabae, H.; Suzumori, K. Multimodal Strain Sensing System for Shape Recognition of Tensegrity Structures by Combining Traditional Regression and Deep Learning Approaches. IEEE Robot. Autom. Lett. 2024, 9, 10050–10056. [Google Scholar] [CrossRef]
Yu, Y.; Si, X.; Hu, C.; Zhang, J. A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Stress temperature and stress–strain behaviours upon initialization and activation of the first cycle. (a) Graphs of points (O, O’, A, A’, and B) are for pre-stretching and heating of SMA1 where the grey points (A” and B0) refer to the full austenite transformation; (b) graphs of points (B, B’, and C) are for cooling of SMA1; (c) graphs of points (C, C’, and D) are for heating of SMA2; (d) graphs of points (D, D’, and E) are for cooling of SMA2. The red curves are for SMA1, and the blue curves are for SMA2.

Figure 2. The simulated displacements of the antagonistic system midpoint as a function of the heating temperature upon each cycle of S1 are produced numerically using MATLAB: (a) upon heating SMA1, (b) upon heating SMA2.

Figure 3. Description of the state and form of the AS SMA system: (a) upon the initialization procedure of 3% pre-stretching (∆l₀) and (b) upon a cycle of test S1.

Figure 4. The innovative system consists of two antagonistic SMA wires (SMA1 and SMA2), four clamper tubes to hold the thin ends of the SMA wires, and two pre-stretching screws that strain the wires by screwing and release them by unscrewing.

Figure 5. CAD design of the test bench using CATIA software CATIA V5-6R2018.

Figure 6. The scheme of the electric circuit of the AS SMA system with the Arduino connections, where GND and D5 are the ground (negative) and the 5V (positive) pins of Arduino, respectively, and where S, D, and G of the MOSFET switch transistor are the source, drain, and gate pins, respectively.

Figure 7. The variation of the temperature of the midpoint of the AS SMA system upon applying the voltage for 2 s.

Figure 8. The experimental displacements of the antagonistic system midpoint as a function of the heating temperature upon a sequence of cycles: (a) upon heating SMA1, (b) upon heating SMA2.

Figure 9. The modelled displacements of the antagonistic system midpoint as a function of the wires’ heating temperature upon a sequence of cycles using the LSTM NN model: (a) upon heating SMA1, (b) upon heating SMA2.

Figure 10. The controlled temperature responds to the AS system’s (a) intermediate position (P1) and (b) intermediate position (P3).

Table 1. The supplied data sheet gives the properties of the used shape memory alloy wire.

Characteristic	Value
Off-Time	2 s
Resistance	55 Ω/m
Recommended current at room temperature	410 mA
Recommended pull force	321 g
Recommended deformation	3–5%
Martensite start temperature, Ms	52 °C
Martensite finish temperature, Mf	42 °C
Austenite start temperature, As	68 °C
Austenite finish temperature, Af	78 °C

Table 2. The variation of the temperature of the midpoint of the AS SMA system upon applying the voltage for 2 s.

Voltage (V)	3	6	8	10	13	16	20	22	24
Temperature (°C)	32	42	56	58	89	98	117	120	126

Table 3. The experimentally collected data of the AS SMA system during the four cycles.

Cycle	Step 1					Step 3
Cycle	t (s)	T	P0	P1	$Δ l$	t (s)	$T$	P2	P3	Δl
1	2	31.7	0	−2	−2	6	32	−2	+2	4
2	10	38	+2	−1	−3	14	41	−1	+5	6
3	18	58.7	5	1	−4	22	58.1	+1	+9	8
4	26	75.7	9	4.66	−4.3	30	70.8	4.66	+13.6	9
5	34	84.2	13.6	8.66	−5	38	88	8.66	+18.6	10
6	42	92	18.6	12.6	−6	46	93	12.6	+24.6	12

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hmede, R.; Chapelle, F.; Lapusta, Y.; Ramón, J.A.C. Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning. Actuators 2024, 13, 479. https://doi.org/10.3390/act13120479

AMA Style

Hmede R, Chapelle F, Lapusta Y, Ramón JAC. Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning. Actuators. 2024; 13(12):479. https://doi.org/10.3390/act13120479

Chicago/Turabian Style

Hmede, Rodayna, Frédéric Chapelle, Yuri Lapusta, and Juan Antonio Corrales Ramón. 2024. "Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning" Actuators 13, no. 12: 479. https://doi.org/10.3390/act13120479

APA Style

Hmede, R., Chapelle, F., Lapusta, Y., & Ramón, J. A. C. (2024). Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning. Actuators, 13(12), 479. https://doi.org/10.3390/act13120479

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Controlling the Deformation of the Antagonistic Shape Memory Alloy System by LSTM Deep Learning

Abstract

1. Introduction

2. Methodology

3. Numerical Simulation Model

3.1. Static Variables

3.2. Kinematic Variables

3.3. Behavioral Law

4. Experimental Model

4.1. Initialization and Identification of the SMA Parameters

4.2. Activation and Preparation of the Experimental Setup

4.3. Experimental Results

5. Artificial Intelligence Model

5.1. Ground Truth

5.2. LSTM Modelling of a Sequence of Cycles

5.3. LSTM Controlling of the Midpoint’s Position

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI