Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows

Meng, Zhenzhu; Hu, Yating; Ancey, Christophe

doi:10.3390/w12020600

Open AccessFeature PaperArticle

Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows

by

Zhenzhu Meng

^1,*

,

Yating Hu

^1,2 and

Christophe Ancey

¹

ENAC/IIC/LHE, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland

²

College of Water Conservancy and Hydropower Engineering, Hohai University, Nanjing 210098, China

^*

Author to whom correspondence should be addressed.

Water 2020, 12(2), 600; https://doi.org/10.3390/w12020600

Submission received: 22 December 2019 / Revised: 9 February 2020 / Accepted: 19 February 2020 / Published: 22 February 2020

(This article belongs to the Special Issue Non-Newtonian Fluids in Environmental Hydraulics: Modelling and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

When colossal gravity-driven mass flows enter a body of water, they may generate waves which can have destructive consequences on coastal areas. A number of empirical equations in the form of power functions of several dimensionless groups have been developed to predict wave characteristics. However, in some complex cases (for instance, when the mass striking the water is made up of varied slide materials), fitting an empirical equation with a fixed form to the experimental data may be problematic. In contrast to previous empirical equations that specified the mathematical operators in advance, we developed a purely data-driven approach which relies on datasets and does not need any assumptions about functional form or physical constraints. Experiments were carried out using Carbopol Ultrez 10 (a viscoplastic polymeric gel) and polymer–water balls. We selected an artificial neural network model as an example of a data-driven approach to predicting wave characteristics. We first validated the model by comparing it with best-fit empirical equations. Then, we applied the proposed model to two scenarios which run into difficulty when modeled using those empirical equations: (i) predicting wave features from subaerial landslide parameters at their initial stage (with the mass beginning to move down the slope) rather than from the parameters at impact; and (ii) predicting waves generated by different slide materials, specifically, viscoplastic slides, granular slides, and viscoplastic–granular mixtures. The method proposed here can easily be updated when new parameters or constraints are introduced into the model.

Keywords:

viscoplastic slide; granular slide; landslide-generated waves; data-driven approach; artificial neural network approach; empirical equation

1. Introduction

When colossal gravity-driven mass flows enter a body of water, such as a sea, a lake, or a reservoir, they sometimes generate large waves. These events are particularly relevant in coastal areas and mountainous countries. Such waves occurred, for example, in Lituya Bay in 1958 [1] and in Vajont, Italy, in 1963 [2]. Predicting the characteristics of waves induced by subaerial landslides is of great importance for risk management in coastal areas [3].

Researchers have conducted experiments using physical models that try to reproduce the physical processes of impulse waves generated by subaerial landslides. They have simplified water geometry by using 2D flumes or 3D basins and idealized the sliding masses as rigid blocks [4,5,6,7,8], granular solids [9,10,11,12,13,14], or viscoplastic fluids [15,16]. Based on reliable experimental data, a number of empirical or semi-empirical equations have been established, either by combining regression techniques with dimensional analysis [11,17,18,19] or by a scaling analysis of governing equations [20,21]. Most equations to date have expressed wave characteristics as power functions of several slide parameters on impact, and some have occasionally involved an additive term [22].

One significant issue has emerged from previous research: on many occasions, empirical equations have fit well with their own experimental data, but they then exhibited large deviations from the datasets obtained by other teams, especially when different slide materials were involved [10,15,16,23]. The performances of the different equations on a given dataset remain uncertain. This uncertainty reflects the limitations of empirical equations with a given functional form. Heller and Spinneken (2013) developed generic empirical equations for blocks of various shapes [24]. They also discussed the data discrepancies between using blocks and granular slides. Actually, none of the existing empirical equations can account for all range of materials used in experiments. Applying empirical equations may be difficult when, for instance, the slide material involves different components. A typical example has been Tang et al. (2018), who conducted experiments using blocks, granular slides and mixture of block and granular slides [25]. Taking the viscoplastic–granular mixture as an example, the representative parameters of these two materials are the yield stress and grain diameter, respectively. Due to the current lack of understanding about how these two materials affect the underlying physics of the slide–water interaction, integrating these two parameters into one equation might be problematic if we have presumed a functional form for that equation in advance.

Another key issue is that all the existing empirical equations express wave characteristics from the parameters relating to the sliding masses on impact; none use the parameters related to the initial stage (i.e., when the mass is still on the slope and starts moving). Putting the emphasis of the parameters on impact makes it easier to control the variables and to provide a quantitative analysis; however, for engineering applications, there is a need to predict wave characteristics before the sliding has occurred. For example, in May 2009, a slight slope failure occurred on the Guopu bank of the Laxiwa reservoir, in China. Based on monitoring data, a faulted rock mass with an approximate volume of

3 \times 10^{7}

m

^{3}

showed signs of general displacement [26]. Although there is a very small probability, should the mass drop into the reservoir, it would generate large waves which may well destroy the nearby arch dam ([27]). In this situation, estimating the characteristics of the potential waves from information on the potential landslide (which is still at rest on the slope) is more than warranted. To study the various physical processes from the initial impact to wave propagation, Heller et al. (2009) took a holistic approach based on a theoretical analysis and semi-empirical equations [17]. For more complex landslide materials, providing physical constraints on the mathematical operators of prediction equations formulation of empirical equations becomes more challenging.

Using an approach that did not assume the functional form of the equation in advance and relied strictly on the data alone, would be preferable for dealing with both of the above issues. To overcome the limitations of empirical equations, the present study presents a data-driven method, known as an artificial neural network (ANN) method, which has been successfully employed in other fields to cope with complicated parameters in experimental data processing and to develop highly accurate predictive models [28,29,30,31,32,33]. In contrast to empirical equations, in which mathematical dependence was fixed in advance, the ANN method provides an approach in which both the explanatory and explained variables in the data ultimately define their internal relationship without any prior assumptions about the equation’s functional form or physical constraints. Moreover, the model can be easily calibrated when new data or parameters become available, which makes it powerful in solving complex problems [34]. Panizzo et al. (2005) compared the ANN method and empirical equations on a simple case (that is, predicting wave characteristics from solid block parameters on impact). The ANN method’s predictive capacities were slightly better than those of empirical equations [35]. To the best of our knowledge, no data-driven method has been used to deal with field data. The key advantage of data-driven methods, namely, their high adaptivity to solving complex problems and dealing with complex parameters, was not further investigated.

Using the ANN method, we (i) estimate the wave characteristics from the parameters of a subaerial mass at the initial stage, when it is at rest and starts moving down the slope, and (ii) predict the wave characteristics generated by different slide mass materials (specifically, viscoplastic slides, granular slides, and mixtures of them), all within one model. For each application, we refined the inputs, outputs and network structures of the model.

2. Experiments

2.1. Physical Model

Figure 1 illustrates a physical model of a mass flow moving down a slope and intruding into a body of water. The whole process can be divided into three stages: in stage I, the slide is at rest, in the container box, and then starts moving; in stage II it moves down the slope and reaches the shoreline; in stage III, it enters the body of water and generates waves. We consider a slope with an inclination of

θ

entering a horizontal flume filled with water. The still-water depth is denoted by

h_{0}

, and the water density is denoted by

ρ_{w}

. We defined two coordinate systems. The first coordinate system (x, y) is defined with its origin located at the shoreline, with the x-axis proceeding out across the water, stream-wise, and the y-axis pointing directly upward. The second coordinate system (s, l) is defined with the l-axis being along the slope and the s-axis being perpendicular to the slope. A slide mass, with a volume of

V_{I}

and density of

ρ_{s}

, is released at a distance

l_{s}

from the shoreline. The slide’s initial shape is idealized as a rectangle with a height of

s_{0}

and length of

l_{0}

. When the sliding mass moves down the slope, its thickness

s (l, t)

and depth average velocity

v_{s} (l, t)

vary as a function of l and t, respectively. The volume of the immersed slide is denoted by

V_{s}

. The free water surface

η (x, t)

depends on the horizontal coordinate x and time t. The wave created by the incursion of the sliding mass is evaluated quantitatively by its height h and amplitude a. The gravity acceleration is denoted by g.

2.2. Experimental Method

Experiments were conducted in a two-dimensional flume at the Swiss Federal Institute of Technology Lausanne (see Figure 2). The experimental facility was devised to mimic snow avalanches penetrating mountain lakes (for further information see [21]). The scale factor between the real world and this facility was approximately 100. The flume consisted of two parts. The first part was a 1.5 m long and 0.12 m wide chute, and it could be tilted at an angle

θ

ranging from

30^{\circ}

to

50^{\circ}

. Its bottom was lined with sandpaper to provide consistent basal friction and its side walls were made of PVC. The second part was a water-filled, transparent glass flume, 2.5 m long, 0.4 m deep, and 0.12 m wide. The slide mass material was initially contained in a box located at the chute entrance, closed off by a 0.4 m high and 0.12 m wide locked gate. The gate was pneumatically activated and could be opened in less than 0.1 s to release the material from the box. The distance from the gate to the shoreline could be varied from 0.5 m to 1.0 m. Once the slide mass material was released, it accelerated energetically, under gravity, and reached velocities as high as 2.5 m/s. Each experiment’s initial settings, including slide mass volume

V_{i}

, initial slide length

l_{0}

, initial slide height

s_{0}

, slope length

l_{s}

, still-water depth

h_{0}

, and slope angle

θ

, were recorded before the slide mass material was released. Because of its reduced dimensions, the set-up was also subject to scale effects due to surface tension and viscosity which could have affected wave propagation when the still water depth

h_{0} <

0.2 m and wave period

T <

0.35 s [36]. As

h_{0} =

0.2 m and 0.38 s

< T <

2.24 s in our experiments, we think such scale effects were not significant.

We selected Carbopol Ultrez 10 viscoplastic material to mimic cohesive landslides, whose rheological behavior can be described using the Herschel–Bulkley model:

τ = τ_{c} + K {\dot{γ}}^{n}

(1)

where

τ_{c}

is the yield stress,

\dot{γ}

is the shear rate, K is the slide mass consistency, and n is a power-law index that reflects shear thinning (or shear thickening when n > 1). The rheological measurements of Carbopol were conducted using a Bohlin Gemini rheometer equipped with striated parallel plates (40 mm diameter; 1 mm gap size). The values of

τ_{c}

, K and n in the Herschel–Bulkley equation were fitted to the rheological measurements. Table 1 shows how the rheological parameters of Carbopol depend on its concentration C and the proportion of NaOH to Ultrez 10 in the composite. See [37] for the Carbopol Ultrez 10 preparation procedure.

We used polymer–water balls to represent granular avalanches. These were produced by soaking dry, water-absorbent beads in water for 4–5 h. Both Carbopol and the polymer–water balls have a density very close to that of water (1000 kg·m

^{- 3}

), which is also similar to that of the ice (910 kg·m

^{3}

) mobilized in snow or ice avalanches. Taking advantage of the similar densities of Carbopol and polymer–water balls, we were able to investigate how mixtures of cohesive and granular materials generated waves without having to consider the effects of the densities of the varying proportions of each material in the mixtures. Due to the difficulties in finding materials with matching higher densities, the question of how density and mixture proportions interact during wave formation could not be investigated in the current study.

A high-speed camera was placed in front of the shoreline, with its optical axis perpendicular to the sidewall. The camera collected images at a frequency of 200 frames per second, acquiring 600 × 800-pixel images, corresponding to an observation window of 48 × 64 cm

^{2}

. We used a 0.2 × 0.4 m

^{2}

mesh grid to calibrate the raw images and determine the size conversion factor. For each image, we measured (a) the free-water surface when the leading wave reached its maximum height, which helped to deduce the wave amplitudes

a_{m}

and

h_{m}

, (b) the velocity

v_{s}

and thickness s of the sliding mass upon impact, and (c) the volume of the underwater part of the sliding mass

V_{s}

.

3. The Artificial Neural Network Method

The ANN method is inspired by how the human brain processes information, and it is constructed from interconnected processing elements called neurons [38] (see Figure 3). ANNs are receiving ever greater attention because of their ability to express complex functions in a flexible form. A typical ANN model consists of three main parts: learning rules, network architecture, and an activation function. The network structure is formed of several layers: one input layer, one output layer, and one or several hidden layers, with each layer containing several neurons. Each of the neurons in a layer is connected to neurons of the adjacent layers via coefficients called weightings.

From a mathematical perspective, the principle of neural networks involves the composition of non-linear functions. Starting with a linear model, considering a dataset z and a vector of inputs x, a linear model for the output

\hat{z} (x)

can be constructed considering

\hat{z} (x) = W x + β

, where the weighting matrix W and the bias vector

β

are obtained by solving an optimization problem that minimizes the overall difference between z and

\hat{z}

. This process is called model training. Such a simple model may lack the flexibility to represent complex functional mapping and, therefore, intermediate variables (layers) y are introduced:

y = σ (W^{(1)} x + β^{(1)})

and

z = W^{(2)} y + β^{(2)}

, where

σ

is a user-specified activation function, like the hyperbolic tangent. The composition of several intermediate layers results in a neural network capable of efficiently representing arbitrarily complex function forms.

In this study, we selected a one-hidden-layer network, as an example, and adopted a back-propagation algorithm to train the network. The algorithm programming was developed using Matlab. Establishing an ANN model consists of three steps: (i) preparing the required data for training the network; (ii) evaluating neural networks with different structures and choosing the optimal one; and (iii) testing the neural network’s performance using data which have not been used previously for training the network.

The back-propagation artificial neural network algorithm (BP-ANN) consists of two paths: the feed-forwards and the feed-backwards paths. The feed-forwards path is expressed by Equations (2) and (3).

y_{i} = F (X_{j}) = F (W_{o j} + \sum_{i = 1}^{I} W_{i j} x_{i})

(2)

Z_{k} = F (Y_{k}) = F (W_{o k} + \sum_{j = 1}^{J} W_{j k} y_{i})

(3)

where

x_{i}

,

y_{j}

, and

Z_{k}

represent the input, hidden, and output layers, respectively,

W_{o j}

and

W_{o k}

are the bias weights for setting the threshold values,

X_{j}

and

Y_{k}

temporarily represent computing results before using the activation function, and F is the activation function applied in the hidden and output layers. For the activation function, we chose the sigmoid function, which ranges between 0 and 1 (see Equation (4)). The activation function is defined on each layer’s neurons and is applied to the sum of the weighted inputs and to each neuron’s bias to generate the neuron output.

F (a) = \frac{e^{a}}{e^{a} + 1} (a = X_{j}, Y_{k})

(4)

Equation (5) displays the residual function for residual back-propagation training.

E = \frac{1}{2} \sum_{k = 1}^{K} e_{k}^{2} = \frac{1}{2} \sum_{k = 1}^{K} {(t_{k} - z_{k})}^{2}

(5)

where

t_{k}

is the predefined target value and

e_{k}

is the residual of each output node. E is the residual between the expected and actual output values. We used a gradient-descent strategy to adjust the weightings, aiming to obtain a minimum E. Equations (6)–(9) express the weightings between the hidden and output layers.

\frac{\partial E}{\partial w_{j k}} = - e_{k} \frac{\partial F (Y_{k})}{Y_{k}} y_{j} = - δ_{k} y_{j}

(6)

and hence

δ_{k} = e_{k} F^{'} (Y_{k}) = (t_{k} - z_{k}) F^{'} (Y_{k})

(7)

Therefore, the weighting adjustments in the hidden and output link

Δ w_{j k}

can be expressed by Equation (8).

Δ w_{j k} = η \times y_{j} \times δ_{k}

(8)

where

η

is the learning rate ranging between 0 and 1. With a lower learning rate, the network model will take longer time to converge. Conversely, a higher learning rate may lead to a widely oscillating network. In addition, maintaining a consistent learning rate across the model is preferable. The new weighting

w_{j k}

is updated by Equation (9), where r is the number of iterations.

w_{j k} (r + 1) = w_{j k} (r) + Δ w_{j k} (r)

(9)

Similarly, the error gradient in the links between the input and hidden layers can be derived from the partial derivative with respect to

w_{i j}

.

\frac{\partial E}{\partial w_{i j}} = (\sum_{k = 1}^{K} \frac{\partial E}{\partial z_{k}} \frac{\partial z}{\partial Y_{k}} \frac{Y_{k}}{y_{j}}) \times \frac{\partial y_{i}}{\partial X_{j}} \times \frac{\partial X_{j}}{\partial w_{i j}} = - Δ j x_{i}

(10)

where

Δ j = F^{'} (X_{j}) \sum_{k = 1}^{K} δ_{k} w_{j k}

(11)

The new weighting dominates the link between the input layer and hidden layer,

δ w_{i j}

, can be updated as:

δ w_{i j} = η \times x_{i} \times δ_{j}

(12)

w_{i j} (r + 1) = w_{i j} (r) + δ w_{i j} (r)

(13)

All the input data were normalized in the range between 0 and 1 using the following equation:

Y = \frac{X - X_{m i n}}{X_{m a x} - X_{m i n}}

(14)

where X is the raw data and Y is the normalized data. The initial parameter settings are shown in Table 2.

4. Results

In Section 4.1, we validate the ANN method by comparing its prediction accuracy against empirical equations, using the experimental data generated by the viscoplastic flow. In Section 4.2, we predict the wave characteristics from the slide mass features at rest and as it started moving (stage I in Figure 1). In Section 4.3, we develop an ANN model which aims to cope with the parameters of a landslide with complex properties, specifically, a mixture of cohesive and granular slide mass materials.

Each model’s performance was evaluated by its coefficient of determination (

R^{2}

), mean square error (MSE), and its sum of squares due to error (SSE), which are expressed as follows:

R^{2} = 1 - \sum_{i = 1}^{ϵ} (\frac{{(y_{p, i} - y_{o, i})}^{2}}{{(y_{p, i} - {\bar{y}}_{o})}^{2}})

(15)

MSE = \sqrt{\frac{\sum_{i = 1}^{ϵ} {(y_{p, i} - y_{o, i})}^{2}}{ϵ}}

(16)

SSE = \sum_{i = 1}^{ϵ} (y_{o, i} - y_{p, i})

(17)

where

ϵ

is the number of series of experimental data,

y_{p, i}

and

y_{o, i}

are the predicted and observed data, respectively, and

{\bar{y}}_{o}

is the average of observed data.

4.1. Model Validation

Most commonly used empirical equations to predict waves generated by landslides involve the following dimensional parameters:

η (x, t) = η (h_{0}, s, v_{s}, g, V_{s}, θ, t, ρ_{w}, ρ_{s})

(18)

Based on a dimensional analysis or a scale analysis, the scaled wave characteristics can be expressed as a function of several dimensionless groups:

X_{n} = δ \prod_{i = 1}^{N} Π_{i}^{β_{i}}

(19)

where X represents the scaled wave characteristics (e.g., the scaled maximum wave amplitude, wave height, wave length, wave period);

Π_{i}

indicates the explanatory variables selected, where N is the number of explanatory variables.

The predicting equations developed by Zitti et al. [21] were the best fit with our experimental data (see Equation (20)).

X_{1, 2} = δ Π_{1}^{β_{1}} Π_{2}^{β_{2}} Π_{3}^{β_{3}}

(20)

where

X_{1, 2} = H_{m}, A_{m}

, and

Π_{1} = \frac{v_{s}}{\sqrt{g h_{0}}}

is the slide mass Froude number,

Π_{2} = \frac{s}{h_{0}}

is the scaled slide mass thickness, and

Π_{3} = \frac{ρ_{s} V_{s}}{ρ_{w} B h_{0}^{2}}

is the scaled impacted slide mass, where B is the width of the flume.

The coefficients of explanatory variables

δ

and

β_{1, 2, 3}

were acquired by fitting the experimental data based on a linear regression technique. The empirical equations of

A_{m}

and

H_{m}

for the present study were:

A_{m} = 1.2973 Π_{1}^{0.6170} Π_{2}^{0.1626} Π_{3}^{0.6406}

(21)

H_{m} = 1.4368 Π_{1}^{0.9700} Π_{2}^{0.0768} Π_{3}^{0.6076}

(22)

Using the same database and explanatory variables as Equation (21), we modeled the experimental data using our ANN method. Thus, the three neurons in the input layer and the two neurons in the output layer were:

Three inputs: $Π_{1}$ , $Π_{2}$ , and $Π_{3}$
Two outputs: $A_{m}$ and $H_{m}$

Of the 291 samples of Carbopol mass slides in the experimental database, 80% (233 samples) were selected as training data for model construction and 20% (58 samples) were saved as test data for model validation, providing an independent measure of ANN performance after training. Samples for each group were selected randomly.

We used a basic three-layer network structure, namely, one input layer, one hidden layer, and one output layer. To select the optimal number of neurons in the hidden layer, we set a random number of neurons and ran the program, determining their performance by

R^{2}

. Each run was repeated five times and

R^{2}

was calculated by eliminating the maximum and minimum coefficients of determination and averaging the results of the remaining three tests. As shown in Figure 4, the

R^{2}

of both

H_{m}

and

A_{m}

reached their maximum values when the hidden layer contained six neurons. Thus, the optimum network for the present study was a three–six–two structure (input–hidden–output).

Model training was constrained by the following indicators: the maximum epoch number was initially set to 100; the objective MSE was set to

1 \times 10^{- 4}

; the minimum gradient was set to

1 \times 10^{- 5}

; and the maximum number of validation fails, which represents the number of successive iterations that the validation performance fails to decrease, was initially set to six. Training would stop once one of the indicators mentioned above reached its initial value; for instance, in the present study, training stopped when the number of validation fails reached 6. Figure 5 illustrates the evolution of these indicators (i.e., gradient, validation fails, and MSE) at each epoch until the training is stopped.

In Figure 5c, the MSEs of the training data and the test data were counted separately. The curves of the evolution of the MSE for these three data series were very close, indicating the model’s high level of adaptability. The best validation performance was an MSE = 0.00025337 at epoch 43, and the training terminated at epoch 48 as the number of validation fails reached six. The gradient = 0.0011736 at epoch 48. Figure 6 displays a histogram of the residuals between the predicted

A_{m}

and the observed

A_{m}

. The probability density of the residuals approximately follows a Gaussian distribution.

Figure 7 displays the observed

A_{m}

and

H_{m}

versus the predicted data modeled using the ANN model and the empirical equations. The

R^{2}

of

A_{m}

and

H_{m}

of the test data in the ANN model were 0.9682 and 0.9479, respectively; the

R^{2}

of

A_{m}

and

H_{m}

of the test data predicted by the empirical equations were 0.9214 and 0.9062, respectively. The ANN model outperformed the best-fitting empirical equation. In addition, the

R^{2}

of

A_{m}

was always slightly higher than that of

H_{m}

, in both models, which may result from measurement errors in the experiments which have been defined in our previous publications [15,16].

4.2. Prediction of Wave Characteristics from Initial Slide Parameters

Previously, empirical or semi-empirical equations determined wave characteristics from the mass slide features on impact (illustrated as stage II in Figure 1), and most equations were established in the form of the power-law equations of several dimensionless groups (see Equation (20)). When we predict the wave characteristics from the slide features at stage I, it is difficult to provide physical constraints on the mathematical structure of predictive equations because of the complex physical mechanisms involved in the whole process. In this case, assuming a functional form for the prediction equation in advance might be problematic. Therefore, a data-driven approach that relies strictly on the data rather than on a fixed form equation is preferable, and the ANN method thus fits this requirement. The process involves the following parameters:

η (x, t) = η (τ_{c}, K, n, l_{0}, s_{0}, l_{s}, h_{0}, θ, ρ_{w}, ρ_{s}, t, g)

(23)

The slide mass’s rheological parameters include

τ_{c}

, K, and n. Although they have little effect on the slide mass–water interaction and wave formation [16], they have great effects on the slide mass flowing down the slope. The Pearson correlation coefficients between each pair of these three parameters were all above 0.9 (see Table 3), indicating that all three parameters correlated highly. We therefore selected the yield stress

τ_{c}

, namely the stress at which the material starts yielding, to represent the rheological parameters.

Figure 8 provides a first insight into how the wave characteristics depend on the rheological properties of the slide mass and on its parameters at the initial stage. It shows experimental data with the yield stress set at

τ_{c}

= 41 Pa, 62 Pa, and 80 Pa. Overall, the maximum wave amplitude

a_{m}

increased with rising yield stress

τ_{c}

and initial slide mass

m_{I}

, and decreased with slope length

l_{s}

.

ϵ = \frac{l_{*}}{h_{*}}

and

ς = \frac{s_{*}}{h_{*}}

are aspect ratios for the l-axis to the y-axis, and for the s-axis to the y-axis, respectively. The natural choice for defining the typical scale introduced by these ratios was to take the dimensions of the reservoir:

l_{*} = l_{0}

,

h_{*} = h_{0}

, and

s_{*} = s_{0}

. The Bingham number can be expressed as

B i = \frac{τ_{c}}{K (v_{*} / s_{*})}

, which is a dimensionless yield stress (relative to the viscous forces). We assumed that the viscoplastic flow reached a near-equilibrium regime, where viscous forces balanced gravity acceleration, and the velocity scale was then

v_{*} = {(ρ_{s} g sin θ / K)}^{1 / n} s_{*}^{1 + 1 / n}

. The Bingham number then became

B i = \frac{τ_{c}}{ρ_{s} g s_{0} sin θ}

(see [40] for further information).

The dimensions involved in Equation (23) are length [L], mass [M], and time [T]. We chose three scaling parameters: water density

ρ_{w}

, still-water depth

h_{0}

, and gravitational acceleration g [19]. Thus, the dimensionless form can be expressed as:

η^{'} = \frac{η (x, t)}{h_{0}} = η^{'} (\frac{τ_{c}}{ρ g s_{0} sin θ}, \frac{l_{0}}{h_{0}}, \frac{s_{0}}{h_{0}}, \frac{l_{s}}{l_{0}}, θ, \frac{ρ_{s}}{ρ_{w}})

(24)

where

η^{'}

is the scaled free-water surface elevation. As in Section 4.1, we selected the scaled maximum wave amplitude

A_{m}

and height

H_{m}

to represent the water surface elevation. As the slide mass density

ρ_{s}

and water density

ρ_{w}

were constant throughout our experiments,

\frac{ρ_{s}}{ρ_{w}}

can be eliminated. There were therefore five neurons in the input layer and two neurons in the output layer:

five inputs: $B i$ , $ϵ$ , $ς$ , $\frac{l_{s}}{l_{0}}$ , and $θ$
two outputs: $A_{m}$ and $H_{m}$

The modeling method used was the same as in Section 4.1. First, based on the optimal number of hidden neurons determined, a five–ten–two network structure was developed; then, the experimental data were divided into training data and test data; finally, the ANN model was trained using the training data and validated using the test data. The

R^{2}

, MSE, and SSE of

A_{m}

were 0.8983, 0.00089, and 0.2591, respectively. The

R^{2}

, MSE, and SSE of

H_{m}

were 0.8497, 0.00295, and 0.8483, respectively. Because

R^{2} > 0.8

, the present model is validated. Yet compared with the scenario that predicted wave characteristics from the slide mass parameters on impact, the prediction accuracy of the ANN method in the present scenario was lower. The more complicated the physical process is, the more information could be lost in prediction.

4.3. Waves Generated by Viscoplastic–Granular Mixtures

Most studies have mimicked landslides in the real world by using a single slide mass material, including granular slides, viscoplastic materials, or solid blocks. However, many landslides in the natural world are mixtures of granular and viscoplastic materials. In the present study, we conducted experiments using mixtures of polymer–water balls and Carbopol, with the percentage of Carbopol in volume varying symmetrically (0%, 20%, 50%, 80% and 100%). Figure 9 shows raw images, captured by a high-speed camera, of Carbopol, polymer–water balls, and mixtures of them, entering the body of water. These represented landslides with different degrees of cohesion.

As shown in Figure 10, larger waves are generated with higher proportions of Carbopol in the mixture, which implies that the slide mass material’s composition influenced wave generation. Here, to provide identical criteria for all slide mass materials, we quantified the slide mass properties using a universal dimensionless group named the Impulse product parameter P, which was proposed by [12]:

P = Π_{1} Π_{2}^{1 / 2} Π_{3}^{1 / 4} cos {(6 / 7 θ)}^{1 / 2}

(25)

where

Π_{1}

,

Π_{2}

, and

Π_{3}

denote the same parameters as in Equation (20).

One issue which should be noted is that the properties of granular slides are usually represented by their grain diameters, whereas the rheological behavior of viscoplastic materials is commonly described using yield stress. It is difficult to integrate these two parameters into one equation in the form of a power-law equation. To overcome this limitation and provide a compatible model for these parameters, we applied the ANN method so as to avoid assuming the functional form of a prediction equation. Here, we predicted the wave characteristics from the mixture’s parameters on impact.

As highlighted above, the dimensionless parameters in modeling experiments with a single material commonly involve the slide Froude number

Π_{1}

, relative slide mass

Π_{2}

, and the relative slide thickness

Π_{3}

. To quantify the properties of mixed viscoplastic and granular slides, we introduced the following dimensionless groups: the Bingham number Bi

= \frac{τ_{c}}{ρ_{s} g s_{0} sin θ}

, which represents the rheological properties of a cohesive material; the scaled diameter of the granular slide mass

D_{s} = \frac{d_{g}}{h_{0}}

, where

d_{g}

is the diameter of a granular particle; the volume ratio of the viscoplastic material in the mixture

R_{V} = \frac{V_{s}}{V_{g} + V_{s}}

, where

V_{s}

is the volume of the viscoplastic slide mass and

V_{g}

is the volume of the granular slides; and the density ratio between the two materials

R_{ρ} = \frac{ρ_{s}}{ρ_{g}}

, which is a constant in the present study.

Hence, the input layer contained six neurons

{Π_{1}

,

Π_{2}

,

Π_{3}

, Bi,

D_{s}

, and

R_{V}}

, and the output layer contained again

{A_{m}

and

H_{m}}

. Using the same method presented in Section 4.1, the number of hidden neurons was determined, and the network’s optimum structure was six–eight–two. The

R^{2}

, MSE, and SSE of

A_{m}

were 0.9325, 0.0072, and 0.2172, respectively. The

R^{2}

, MSE, and SSE of

H_{m}

were 0.9173, 0.00178, and 0.6154, respectively. As

R^{2}

of both

A_{m}

and

H_{m}

were greater than 0.8, the model can be considered as valid. The predicted

A_{m}

and

H_{m}

are illustrated against the experimental data in Figure 11.

5. Discussion

5.1. Model Adaptability

In Section 4.2 and Section 4.3, we presented two applications which were difficult to model using empirical equations with a fixed functional form:

One application was predicting wave characteristics from slide mass features at the initial stage I. When doing this, it is difficult to provide physical constraints on the mathematical structure of predictive equations because of the complex physical mechanisms involved in the whole process. In this case, assuming a functional form for the predictive equation in advance might be problematic.
Another application was predicting waves generated by viscoplastic–granular mixtures. The properties of granular slides are usually represented by their grain diameters, whereas the rheological behaviors of viscoplastic materials are commonly described using yield stress. It is difficult to integrate these two parameters into one equation in the form of a power-law equation.

Both these scenarios can easily be adapted using the ANN method’s high prediction accuracy (see Table 4). This clearly demonstrates the advantage of using a purely data-driven method in terms of model adaptability (and this is not limited to an ANN method). In contrast to equations with fixed formulae, the ANN method has no external constraints, making it a scalable open system. In addition, it has the ability to self-update and is highly adaptable when new parameters become available or fresh constraints appear (they are not limited to the two scenarios presented in this study). With more informative, richer datasets, stronger correlations can be built from the input layer to the output layer.

5.2. Prediction Accuracy

Table 4 displays the coefficient of determination

R^{2}

, mean square error (MSE), and sum of squares due to error (SSE) values for each of the models presented in Section 4. The following features are worth noting:

Compared with the empirical equations based on regression techniques, the ANN model gives more precise predictions. Using the same explanatory variables, the coefficient of determination $R^{2}$ improved from 0.9214 to 0.9682 for $A_{m}$ , and from 0.9062 to 0.9479 for $H_{m}$ . Of course, the improvement in prediction accuracy is not large.
The prediction precision for $A_{m}$ was greater than for $H_{m}$ in predictions made with empirical equations and with the ANN models. This may be because the experimental measurement errors of wave heights $h_{m}$ were larger than those for wave amplitudes $a_{m}$ . Prediction precision not only depends on the prediction performance of the model selected, but it also relies on experimental accuracy.
The predictions of wave features from the parameters at impact were better than the predictions from the parameters at the initial stage. Also, prediction precision decreased when the dataset involved combinations of different slide mass materials. Thus, prediction precision decreased as experimental complexity increased and more parameters were involved.

5.3. Multicollinearity

Multicollinearity is a phenomenon where one explanatory variable in a multiple regression model can be linearly predicted from the others with a substantial degree of accuracy. This may lead to the problem that the multiple regression’s coefficient estimates change erratically in response to small changes in the model. The natural logarithmic form of empirical equation (Equation (20)) can be written as:

ln X = ln δ + α ln Π_{1} + β ln Π_{2} + γ ln Π_{3}

(26)

The coefficients

ln δ

,

α

,

β

, and

γ

were estimated using the least squares (linear regression) method based on experimental data. As length [L] was scaled by the still-water depth

h_{0}

,

h_{0}

appears in the three aggregated parameters

Π_{1}

,

Π_{2}

,and

Π_{3}

, and specifically, they are correlated with

h_{0}^{- 1 / 2}

,

h_{0}^{- 1}

, and

h_{0}^{- 2}

, respectively. The high correlations among explanatory variables may result in multicollinearity during the linear regression. However, to date, none of the studies using empirical equations has discussed multicollinearity.

To estimate the correlations between each pair of explanatory variables, we calculated their Pearson correlation coefficients r. As illustrated in Figure 12, the Pearson correlation coefficient r between

Π_{1}

and

Π_{2}

is relatively high (

r =

0.69), however, it is still under the upper limit of 0.8. Furthermore, to determine how influential the water depth

h_{0}

was in wave generation, we determined the sensitivity of the maximum wave amplitude

a_{m}

to a ±20% change in each of the following parameters (taken in isolation from the others): slide volume on impact

V_{s}

, slide velocity on impact

v_{s}

, slide thickness s and still water depth

h_{0}

. We obtained similar results to those obtained by [17]: the

a_{m}

variations due to changes in these parameters were smaller than 20%, and

a_{m}

was more sensitive to

v_{s}

and

V_{s}

rather than

h_{0}

. We may therefore consider that the multicollinearity lies within an acceptable range.

5.4. Limitations

The present study explored the possibility of extracting models purely from data, however, data-driven models may suffer from a lack of interpretability, e.g., the difficulty in explaining causal relationships between the data, the discrepancy, and the corresponding prediction. The use of deep learning strategies and vast amounts of data in the inference process exacerbate this issue. In addition, when ANN produces a solution, it does not give any clue as to why and how. This reduces trust in the network relevance because of the lack of visual links between outputs, inputs and neurons.

6. Conclusions

This study applied an artificial neural network (ANN) method—one of the most commonly used machine learning methods—to predict the characteristics of waves generated by gravity-driven slide masses. Laboratory experiments were conducted using a viscoplastic material (Carbopol), a granular material (polymer–water balls), and mixtures of them. After validating the ANN model by comparing its prediction accuracy with that of empirical equations, we applied the model to two scenarios: (i) predicting wave characteristics from the parameters of landslides initially at rest on the slope and (ii) integrating the parameters of different categories of slide mass material into one model, i.e., a Bingham number for the viscoplastic material and the grain diameter for the granular material. For each scenario, the inputs, outputs and network structures of the ANN model were refined. In the first scenario, the

R^{2}

for the scaled maximum wave height

H_{m}

and scaled maximum wave amplitude

A_{m}

were 0.8983 and 0.8497, respectively, and in the second scenario, the

R^{2}

for

H_{m}

and

A_{m}

were 0.9325 and 0.9173, respectively. As a purely data-driven method, this ANN method was easy to adapt when new parameters were included or fresh constraints occurred.

Author Contributions

Conceptualization, C.A. and Z.M.; experiments, Z.M. and Y.H.; methodology, Z.M. and Y.H.; validation, Z.M.; formal analysis, Z.M.; writing–original draft preparation, Z.M.; writing–review and editing, Z.M. and C.A.; supervision, C.A.; project administration, C.A.; funding acquisition, C.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the EPFLs Civil Engineering Department and the Swiss National Science Foundation (Grant No. 200021 146271/1 for a project called Physics of Basal Entrainment). Z.M. acknowledges the support of the China Scholarship Council (Grant No. 201506710074).

Acknowledgments

Part of the preliminary tests were conducted by EPFL student Jeremy Bussat. We are grateful to our colleagues Bob de Graffenried for his help.

Conflicts of Interest

The authors declare no conflict of interest.

References

Fritz, H.M.; Mohammed, F.; Yoo, J. Lituya Bay landslide impact generated mega-tsunami 50th Anniversary. Pure Appl. Geophys. 2009, 166, 153–175. [Google Scholar] [CrossRef]
Muller, L. The rock slide in the Vajont Valley. Rock Mech. Eng. Geol. 1964, 2, 148–212. [Google Scholar]
Fuchs, H.; Pfister, M.; Boes, R.M.; Perzlmaier, S.; Reindl, R. Impulse waves due to avalanche impact into Kuhtai reservoir. Wasserwirtschaft 2011, 101, 54–60. [Google Scholar] [CrossRef]
Wiegel, R.L.; Noda, E.K.; Kuba, E.M.; Gee, D.M.; Tornberg, G.F. Water waves generated by landslides in reservoirs. J. Waterw. Harb. Coast. Eng. Div. 1970, 96, 307–333. [Google Scholar]
Liu, P.F.; Wu, T.; Raichlen, F.; Synolakis, C.E.; Borrero, J.C. Runup and rundown generated by three-dimensional sliding masses. J. Fluid Mech. 2005, 536, 107–144. [Google Scholar] [CrossRef] [Green Version]
Heller, V.; Moalemi, M.; Kinnear, R.D.; Adams, R.A. Geometrical effects on landslide-generated tsunamis. J. Waterw. Port Coast. Ocean Eng. 2011, 138, 286–298. [Google Scholar] [CrossRef]
Heller, V.; Spinneken, J. On the effect of the water body geometry on landslide–tsunamis: Physical insight from laboratory tests and 2D to 3D wave parameter transformation. Coast. Eng. 2015, 104, 113–134. [Google Scholar] [CrossRef]
Heller, V.; Bruggemann, M.; Spinneken, J.; Rogers, B.D. Composite modelling of subaerial landslide–tsunamis in different water body geometries and novel insight into slide and wave kinematics. Coast. Eng. 2016, 109, 20–41. [Google Scholar] [CrossRef]
Fritz, H.M. Initial Phase of Landslide Generated Impulse Waves. Ph.D. Thesis, ETH Zurich, Zurich, Switzerland, 2002. [Google Scholar]
Zweifel, A. Impulswellen: Effekte der Rutschdichte und der Wassertiefe. Ph.D. Thesis, ETH Zurich, Zurich, Switzerland, 2004. [Google Scholar]
Heller, V. Landslide Generated Impulse Waves: Prediction of Near Field Characteristics. Ph.D. Thesis, ETH Zurich, Zurich, Switzerland, 2007. [Google Scholar]
Heller, V.; Hager, W.H. Impulse product parameter in landslide generated impulse waves. J. Waterw. Port Coast. Ocean Eng. 2010, 136, 145–155. [Google Scholar] [CrossRef]
Miller, G.S.; Take, A.; Mulligan, R.P.; McDougall, S. Tsunamis generated by long and thin granular landslides in a large flume. J. Geophys. Res. Ocean. 2017, 122, 653–668. [Google Scholar] [CrossRef]
Bullard, G.; Mulligan, R.; Carreira, A.; Take, W. Experimental analysis of tsunamis generated by the impact of landslides with high mobility. Coast. Eng. 2019, 152, 103538. [Google Scholar] [CrossRef]
Meng, Z. Experimental study on impulse waves generated by a viscoplastic material at laboratory scale. Landslides 2018, 15, 1173–1182. [Google Scholar] [CrossRef]
Meng, Z.; Ancey, C. The effects of slide cohesion on impulse-wave formation. Exp. Fluids 2019, 60, 151. [Google Scholar] [CrossRef]
Heller, V.; Hager, W.H.; Minor, H.E. Landslide Generated Impulse Waves in Reservoirs: Basics And Computation; ETH Zurich: Zurich, Switzerland, 2009. [Google Scholar]
Mohammed, F.; Fritz, H.M. Physical modeling of tsunamis generated by three-dimensional deformable granular landslides. J. Geophys. Res. Ocean. 2012, 117, 20160052. [Google Scholar] [CrossRef] [Green Version]
Zitti, G.; Ancey, C.; Postacchini, M.; Brocchini, M. Impulse waves generated by snow avalanches falling into lakes. In Proceedings of the 36th IAHR World Congress, IAHR, The Hague, The Netherlands, 28 June–3 July 2015. [Google Scholar]
Walder, J.S.; Watts, P.; Sorensen, O.E.; Janssen, K. Tsunamis generated by subaerial mass flows. J. Geophys. Res. Solid Earth 2003, 108. [Google Scholar] [CrossRef]
Zitti, G.; Ancey, C.; Postacchini, M.; Brocchini, M. Impulse waves generated by snow avalanches: Momentum and energy transfer to a water body. J. Geophys. Res. Earth Surf. 2016, 121, 2399–2423. [Google Scholar] [CrossRef]
Kamphuis, J.; Bowering, R. Impulse waves generated by landslides. Coast. Eng. 1970, 575–588. [Google Scholar] [CrossRef]
Lindstrøm, E.K. Waves generated by subaerial slides with various porosities. Coast. Eng. 2016, 116, 170–179. [Google Scholar] [CrossRef]
Heller, V.; Spinneken, J. Improved landslide-tsunami prediction: Effects of block model parameters and slide model. J. Geophys. Res. Ocean. 2013, 118, 1489–1507. [Google Scholar] [CrossRef] [Green Version]
Tang, G.; Lu, L.; Teng, Y.; Zhang, Z.; Xie, Z. Impulse waves generated by subaerial landslides of combined block mass and granular material. Coast. Eng. 2018, 141, 68–85. [Google Scholar] [CrossRef]
Su, H.; Li, J.; Cao, J.; Wen, Z. Macro-comprehensive evaluation method of high rock slope stability in hydropower projects. Stoch. Environ. Res. Risk Assess. 2014, 28, 213–224. [Google Scholar] [CrossRef]
Liu, Y.; Wang, X.; Wu, Z.; He, Z.; Yang, Q. Simulation of landslide-induced surges and analysis of impact on dam based on stability evaluation of reservoir bank slope. Landslides 2018, 15, 2031–2045. [Google Scholar] [CrossRef]
Abraham, A. Artificial neural networks. In Handbook of Measuring System Design; Wiley: London, UK, 2005. [Google Scholar]
Yegnanarayana, B. Artificial Neural Networks; PHI Learning Pvt. Ltd.: Delhi, India, 2009. [Google Scholar]
Kim, D.H.; Park, W.S. Neural network for design and reliability analysis of rubble mound breakwaters. Ocean Eng. 2005, 32, 1332–1349. [Google Scholar] [CrossRef]
Lee, A.; Geem, Z.W.; Suh, K.D. Determination of optimal initial weights of an artificial neural network by using the harmony search algorithm: Application to breakwater armor stones. Appl. Sci. 2016, 6, 164. [Google Scholar] [CrossRef] [Green Version]
Armaghani, D.J.; Mohamad, E.T.; Hajihassani, M.; Abad, S.A.N.K.; Marto, A.; Moghaddam, M. Evaluation and prediction of flyrock resulting from blasting operations using empirical and computational methods. Eng. Comput. 2016, 32, 109–121. [Google Scholar] [CrossRef]
Gedik, N. Least squares support vector mechanics to predict the stability number of rubble-mound breakwaters. Water 2018, 10, 1452. [Google Scholar] [CrossRef] [Green Version]
Dou, J.; Yamagishi, H.; Pourghasemi, H.R.; Yunus, A.P.; Song, X.; Xu, Y.; Zhu, Z. An integrated artificial neural network model for the landslide susceptibility assessment of Osado Island, Japan. Nat. Hazards 2015, 78, 1749–1776. [Google Scholar] [CrossRef]
Panizzo, A.; De Girolamo, P.; Petaccia, A. Forecasting impulse waves generated by subaerial landslides. J. Geophys. Res. Ocean. 2005, 110. [Google Scholar] [CrossRef]
Heller, V.; Hager, W.H.; Minor, H.E. Scale effects in subaerial landslide generated impulse waves. Exp. Fluids 2008, 44, 691–703. [Google Scholar] [CrossRef]
Cochard, S. Measurements of Time-Dependent Free-Surface Viscoplastic Flows Down Steep Slopes. Ph.D. Thesis, EPFL Lausanne, Lausanne, Switzerland, 2007. [Google Scholar]
Liu, J.; Chang, H.; Hsu, T.; Ruan, X. Prediction of the flow stress of high-speed steel during hot deformation using a BP artificial neural network. J. Mater. Process. Technol. 2000, 103, 200–205. [Google Scholar] [CrossRef]
Suzuki, K. Artificial Neural Networks-Architectures and Applications; IntechOpen Limited: London, UK, 2013. [Google Scholar]
Ancey, C.; Cochard, S. The dam-break problem for Herschel–Bulkley viscoplastic fluids down steep flumes. J. Non-Newton. Fluid Mech. 2009, 158, 18–35. [Google Scholar] [CrossRef]

Figure 1. Two dimensional physical model of a landslide generating wave: (a) the slide material is at rest and then starts moving (stage I), (b) the slide material moves down the slope and reaches the shoreline (stage II), and (c) the slide material intrudes into the body of water and generates waves (stage III).

Figure 2. The experimental facility.

Figure 3. A biological neuron in comparison to an artificial neural network: (a) human neuron; (b) artificial neuron; (c) biological synapse; and (d) ANN synapses [39].

Figure 4. Variation of

R^{2}

versus the number of neurons in the hidden layer.

Figure 4. Variation of

R^{2}

versus the number of neurons in the hidden layer.

Figure 5. Variations in (a) the gradient, (b) the number of validation fails, and (c) MSE, against epochs.

Figure 6. Error histogram of

A_{m}

with 20 bins. The red part denotes test data and the grey part denotes training data.

Figure 6. Error histogram of

A_{m}

with 20 bins. The red part denotes test data and the grey part denotes training data.

Figure 7. Q-Q plot of observed and predicted (a)

A_{m}

and (b)

H_{m}

, for the empirical equations and the ANN model. Training data and test data in the ANN model are displayed separately.

Figure 7. Q-Q plot of observed and predicted (a)

A_{m}

and (b)

H_{m}

, for the empirical equations and the ANN model. Training data and test data in the ANN model are displayed separately.

Figure 8. Variations in wave amplitude

a_{m}

against

m_{I} l_{s}^{- 1}

, with the water depth

h_{0}

= 0.2 m and slope angle

θ

= 45

^{°}

.

Figure 8. Variations in wave amplitude

a_{m}

against

m_{I} l_{s}^{- 1}

, with the water depth

h_{0}

= 0.2 m and slope angle

θ

= 45

^{°}

.

Figure 9. Raw images of landslides intruding into a body of water, as recorded by a high-speed camera: (a) Carbopol, (b) mixture of 50% Carbopol and 50% polymer–water balls, and (c) polymer–water balls.

Figure 10. Effects of slide mass material composition on the scaled maximum wave amplitude

A_{m}

.

Figure 10. Effects of slide mass material composition on the scaled maximum wave amplitude

A_{m}

.

Figure 11. Predicted (a)

A_{m}

and (b)

H_{m}

with a six–eight–two ANN model versus experimental data. Training data and test data in the ANN model are displayed separately.

Figure 11. Predicted (a)

A_{m}

and (b)

H_{m}

with a six–eight–two ANN model versus experimental data. Training data and test data in the ANN model are displayed separately.

Figure 12. Correlation matrix of explanatory variables

Π_{1}

,

Π_{2}

, and

Π_{3}

in Equation (20).

Figure 12. Correlation matrix of explanatory variables

Π_{1}

,

Π_{2}

, and

Π_{3}

in Equation (20).

Table 1. Rheological characteristics of the Carbopol used in the present study.

C [%]	Ultrez 10 [g]	NaOH [g]	H $_{2}$ O [L]	$τ_{c}$ [Pa]	K [Pa · s $^{n}$ ]	n [-]
1.5	45	18.0	30	38	10.3	0.289
1.6	50	20.7	30	43	12.3	0.293
1.7	53	22.0	30	49	14.4	0.295
1.8	55	22.8	30	53	16.2	0.315
1.9	58	24.0	30	55	17.1	0.321
2.0	60	24.9	30	58	18.9	0.330
2.2	65	26.9	30	60	19.8	0.333
2.3	68	28.2	30	65	23.2	0.339
2.4	70	29.0	30	68	24.6	0.348
2.5	75	31.0	30	74	29.1	0.364
2.7	80	33.2	30	78	32.1	0.388
2.8	85	35.0	30	80	35.8	0.390
3.0	90	37.3	30	85	42.1	0.392

Table 2. Initial settings for the parameters in the ANN model.

Parameters	Initial Setting
Initial weightings	0.2–0.5
Learning rate	0.1
Maximum number of epochs	200
Objective mean square error	0.00001
Training function	traingdx
Momentum parameters	0.9
Activation function	Sigmoid function

Table 3. The Pearson correlation coefficients between

τ_{c}

, K, and n.

Table 3. The Pearson correlation coefficients between

τ_{c}

, K, and n.

	$τ_{c}$	K	n
$τ_{c}$	1	0.9739	0.9604
K	0.9739	1	0.9633
n	0.9604	0.9633	1

Table 4. The

R^{2}

, MSE, and SSE values of the models described.

Table 4. The

R^{2}

, MSE, and SSE values of the models described.

	Empirical Equations		ANN Model (3–6–2) *		ANN Model (5–10–2) **		ANN Model (6–8–2) ***
	$A_{m}$	$H_{m}$	$A_{m}$	$H_{m}$	$A_{m}$	$H_{m}$	$A_{m}$	$H_{m}$
$R^{2}$	0.9214	0.9062	0.9682	0.9479	0.8983	0.8497	0.9325	0.9173
MSE	0.00081	0.00197	0.00025	0.00107	0.00089	0.00295	0.00072	0.00178
SSE	0.2571	0.6266	0.0865	0.3088	0.2591	0.8483	0.2172	0.6154

* Wave characteristics were predicted from dimensionless parameters on impact (see Section 4.1). ** Wave characteristics were deduced from the slide’s initial parameters (see Section 4.2). *** Waves generated by viscoplastic-granular mixtures (see Section 4.3).

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Meng, Z.; Hu, Y.; Ancey, C. Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows. Water 2020, 12, 600. https://doi.org/10.3390/w12020600

AMA Style

Meng Z, Hu Y, Ancey C. Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows. Water. 2020; 12(2):600. https://doi.org/10.3390/w12020600

Chicago/Turabian Style

Meng, Zhenzhu, Yating Hu, and Christophe Ancey. 2020. "Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows" Water 12, no. 2: 600. https://doi.org/10.3390/w12020600

APA Style

Meng, Z., Hu, Y., & Ancey, C. (2020). Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows. Water, 12(2), 600. https://doi.org/10.3390/w12020600

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Using a Data Driven Approach to Predict Waves Generated by Gravity Driven Mass Flows

Abstract

1. Introduction

2. Experiments

2.1. Physical Model

2.2. Experimental Method

3. The Artificial Neural Network Method

4. Results

4.1. Model Validation

4.2. Prediction of Wave Characteristics from Initial Slide Parameters

4.3. Waves Generated by Viscoplastic–Granular Mixtures

5. Discussion

5.1. Model Adaptability

5.2. Prediction Accuracy

5.3. Multicollinearity

5.4. Limitations

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI