1. Introduction
As the propagation medium of seismic waves and the ground of building structures, a local site will significantly affect the ground motion. Previous investigations have recorded relatively serious earthquake damages in a basin, which is a special site, and its influence on ground motion has been studied. The basin effect includes the selective amplification of long-period ground motion by the soft soil layer and the abnormal amplification of the basin edge [
1,
2]. These special amplifying characteristics of ground motion are some of the representative parameters to measure the ground motion amplification of the basin, as well as the amplification of the peak of the ground acceleration. Therefore, it is of great importance to predict the peak amplification and location of the ground motion acceleration in the basin to evaluate the safety of many high-rise buildings located in basins in inner cities.
The influence analysis of sedimentary basins on ground motion is theoretically a problem of two-dimensional (2D) or three-dimensional (3D) dynamic response analysis. More efficient and flexible numerical simulation methods are mostly adopted, and detailed site data are needed to establish a more accurate 3D computational model. With the development of computing technology, numerical methods have overcome the challenge of computational accuracy and stability, and have achieved great development and wide application. In terms of computing software, general commercial finite element software is widely used in the field of research and design currently, including ANSYS, ABAQUS, FLAC, LS-DYNA, ADINA, Marc, etc. [
3], all of which have perfect modeling, solving, and post-processing functions. In view of the physical characteristics of basins, many scholars have improved the level of existing special analysis methods from many aspects and perspectives, and have developed a series of special analysis software, such as CESAR-LCPC [
4] and AHNSE [
5]. Based on the current analysis techniques, complex 2D and 3D computational models reflecting as many factors as possible have been established to reflect the detailed characteristics of the sedimentary basin geological structure and focal information, etc., and different numerical methods are used to reproduce the ground motion characteristics. Riga et al. [
6] conducted extensive numerical analysis on the linear viscoelastic response of homogeneous sedimentary basins and explored the potential additional influence of sediment heterogeneity and nonlinearity on the weighting coefficient through a finite element model developed in ABAQUS. Bazyar et al. [
7] used the scaled boundary finite element method to directly deal with irregular phenomena in the time domain, and discuss the scattering and amplification of seismic waves under topographic and geological conditions. Meza-Fajardo et al. [
8] reconstructed the engineering geological model of the alluvial valley and evaluated the local response of the ground motion in terms of surface propagation, amplification function, and cumulative kinetic energy by using a 2D finite element numerical simulation method.
The basin effect is often caused by the combined action of many factors, such as subsurface media and irregular site geometries. The 2D characteristics of the variation of heterogeneous media in the basin have been proven to change the nature of seismic wave propagation from deep underground to the surface. The results show that for sedimentary basins with significant amplification characteristics, it is necessary to reflect the joint influence of the spatial geometric characteristics and the physical characteristics of the sedimentary soil layer. Therefore, on the basis of qualitative research, how to quantitatively analyze the amplification effect of different factors on the basin, how to reasonably evaluate the basin effect, and how to scientifically predict the ground motion are the key problems to be solved in engineering applications. However, due to the complexity of the structure and soil of real basins, only approximate or simplified models can be used, so it is necessary to study the laws of the amplification mechanisms of different types of basins, to establish the relationship between the parameters and the amplification characteristics, and to improve the existing methods, taking into account the site’s effect. In order to analyze the significance of different factors on the basin amplification effect, based on the reasonably simplified standard basin model, the influence of each factor on the ground motion is analyzed by changing the parameters of the numerical simulation method. Moczo et al. [
9], based on the numerical modeling of sedimentary basins with different dimensions, studied in detail the relationship between the ground motion response of surface stations and their own geometry, the quality factor of the soil layer medium, and the impedance ratio by changing the depth-to-width ratio of the model. Liang et al. [
10] popularized the indirect boundary integral equation method to solve the scattering of plane waves and SV waves by a mountainous terrain in a half-space and found that the scattering characteristics of plane waves in a mountainous terrain were greatly related to the incident angle, frequency, and characteristics of the mountainous medium. The comparison of different simplified analysis models provides a basis for describing a site’s response reliably using as few parameters as possible. Anquez et al. [
11] established a basin model of the lower Vale valley in France and studied the influence of geometric model simplification on wave propagation by using the discontinuous Galerkin finite element method. Raptakis et al. [
12,
13] calculated the seismic response of a series of 2D Greek Volvi basin models with different levels of detail in order to capture the nature of their seismic response. It is a representative research idea to use a simplified model that reflects the characteristics of ground motion amplification, rather than using a complex real sedimentation basin model.
The influence of the site effect on ground motion can be roughly estimated using ground motion prediction equations, which are primarily based on the observation data obtained from existing stations. These equations extract relevant parameters and reflect the influence of the site effect through simple regression. However, as one of the typical local complex sites, the small- and medium-sized basins are characterized by the undulating shape of the bedrock at the bottom, soil properties, lateral inhomogeneity, and surface topography, which are all factors leading to abnormal ground motion [
14], and have complex seismic wave propagation characteristics. Because the observed data are not rich enough and the relative scale is small, the effect of the regression prediction can hardly reflect the local change characteristics of the ground motion. How to reasonably reflect the influence of local site effect in the ground motion prediction of the basin is a problem that needs to be solved to ensure the project’s safety. In recent years, with the popularity of artificial intelligence, machine learning has been increasingly applied to the field of civil engineering due to its ability to directly acquire information from data, without assuming the obedience law or internal relationship between data or parameters in advance. For example, in the area of automatic diagnosis of concrete arch beam damage, a method based on optimized deep stacked self-encoders and multi-sensor fusion is proposed, which is important in terms of reducing labor costs and risks [
15]. An approach based on an improved bird flocking algorithm and 2D convolutional neural networks is used to assess the torsional capacity of reinforced concrete beams. Accurate and fast torsional capacity assessment is achieved through optimal feature selection and model training [
16]. In the area of earthquake engineering, SunyulKim et al. [
17] develop a ground motion amplification model based on three machine learning techniques (i.e., random forest, gradient boosting, and artificial neural network) using the record database of KiK-net site in Japan, and propose that machine learning is superior to the regression method. The data-driven method based on numerical simulation is widely used in machine learning to address the limitation of the lack of real observational data. Karimi et al. [
14] adopt the fuzzy logic method and flexible interference rules to eliminate the uncertainty caused by the deviation of local data and established the peak ground acceleration prediction model of the Iranian plateau, which can also be improved and updated according to the newly added data. Mir et al. [
18] used a machine learning-based ensemble model to predict radon time series anomalies in earthquake likelihood. However, the traditional artificial neural network also has some shortcomings, such as slow convergence speed, local extreme value, low quantization accuracy, and data over-fitting. In addition, with the continuous expansion of the data set, the low computing speed of the traditional neural network similar to backpropagation neural network can no longer meet the computing needs. CNN has attracted wide attention due to its powerful feature learning and recognition ability. It is not a simple memory superposition but a comprehensive evaluation of training data, which has a certain degree of fault tolerance and has important application value. Based on this, this paper aims to establish an accurate and efficient model for predicting basin amplification characteristics using CNN.
Considering that the one-dimensional (1D) basin model will underestimate the amplification value, and the computational cost of the 3D model is high and time-consuming, we used the simplified standard basin model and the commercial finite element software ABAQUS (version 6.14) to establish a series of 2D basin models, and relied on its PYTHON (version 2.7) interface for secondary development to realize batch modeling to quickly build the CNN sample database. In order to study the site effect, the key parameters affecting the basin effect were taken as the CNN input, and the basin amplification characteristics represented by the peak value of the amplification coefficient and the hazard location were taken as the CNN output to establish the prediction model. To improve the accuracy of prediction, this paper built a CNN model combined with a Bayesian optimization algorithm, analyzed the sensitivity of input parameters, optimized the input parameters to build the optimal CNN model, and used the CNN model to identify the amplification characteristics of basins. Finally, a reasonable surface amplification prediction model for local sedimentary basins was established, which achieved the purpose of rapid surface amplification prediction and provided a rapid and effective prediction method for the next step of seismic hazard assessment and seismic design.