1. Introduction
Destructive seismic events have frequently occurred in the Loess Plateau region of China, causing significant casualties and property damage [
1]. The loess, widely distributed across the region, has unique soil dynamics characteristics, leading to notable regional variations in seismic activity and damage distribution. Therefore, studying the ground motion attenuation behavior in this specific soil layer can significantly improve earthquake disaster prevention and mitigation efforts. With the continuous improvement of China’s National Strong Motion Observation Network, many strong motion records have been recently accumulated from the Loess Plateau region, providing reliable data for establishing prediction equations for ground motion parameters applicable to this area. Building upon existing research, this study uses the latest strong motion records to develop prediction equations for horizontal ground motion parameters in the Loess Plateau region.
The study of ground motion attenuation relationships requires a substantial amount of strong motion records. With advancements in observation technology and the widespread deployment of instruments, the global collection of strong motion records has significantly increased. This progress has provided robust support for research on ground motion attenuation relationships [
2].
In the early stages, the lack of sufficient strong motion records in the Loess Plateau region made it impossible to directly establish ground motion parameter prediction equations through statistical methods. Instead, the prediction of peak ground motion in this region relied on the conversion method proposed by Hu. This method established attenuation relationships for peak ground motion on bedrock [
3,
4], which were then used to predict ground motion parameters at engineering sites.
Research on attenuation relationships of peak ground motion parameters for the Loess Plateau began in the 1980s. Ding et al. divided historical seismic intensity data into loess and bedrock regions [
5]. By comparing data from the western United States, they derived bedrock ground motion attenuation relationships using the intensity-distance method. They suggested adjusting loess ground motion parameters to bedrock values by a factor of 0.6 or amplifying bedrock parameters by a factor of 1.67 to reflect soil amplification effects.
Some researchers used administrative boundaries to conduct regional studies based on seismic intensity maps. For instance, Wen analyzed seismic intensity data from the Shanxi seismic belt and established intensity attenuation relationships for the region [
6]. Zhou et al. performed regression analysis on intensity data from 20 earthquakes of magnitude 5 or higher in Gansu Province, deriving attenuation relationships specific to the province [
7]. Fan et al. established separate intensity attenuation relationships for the northern Loess Plateau, the southern Qinba Mountains, and the Guanzhong Plain in Shaanxi Province. Using data from the western United States, they applied a mapping conversion method to derive bedrock attenuation relationships for these regions [
8].
Yang and Luo analyzed 1221 strong motion records collected from soil sites in the loess regions of Shaanxi, Gansu, Qinghai, and Ningxia during the Wenchuan earthquake. Using direct fitting methods, they derived horizontal and vertical peak acceleration attenuation relationships for the Loess Plateau [
9]. Subsequently, other researchers [
10,
11,
12] conducted extensive studies in the Loess Plateau and other loess regions to address engineering demands or specific research needs.
Existing studies show that ground motion attenuation in loess-covered areas is slower than in nearby non-loess areas. This difference becomes more pronounced for larger earthquakes and aligns with observed macro-seismic damage patterns.
This study collects 4010 strong motion records from 134 seismic events in the Loess Plateau region, which are analyzed and screened, ultimately constructing the dataset required for the prediction equations. Each strong motion record is processed with baseline correction and high-pass filtering and the peak parameters and epicentral distances are extracted. Subsequently, two attenuation models are developed for nonlinear data fitting, assisting the development of prediction equations for horizontal ground motion parameters at the soil layers, including peak ground acceleration (PGA), effective peak acceleration (EPA), and peak ground velocity (PGV). These equations can estimate the ground motion parameters with magnitudes ranging from
MS 3.0 to
MS 6.5 and epicentral distances between 0 and 100 km. PGA refers to the maximum absolute amplitude of the ground motion acceleration time history, measured in cm/s
2 or gal. It characterizes the intensity of the inertial effects of ground motion and reflects its high-frequency characteristics. EPA is the peak acceleration with an averaged significance based on the response spectrum. EPA is formulated as follows:
where
Sa represents the smoothed average value of the acceleration response spectrum with a damping ratio of 5% in the frequency range of 2 to 10 Hz.
Sa is then divided by 2.5 to obtain the EPA [
13]. This study introduces the EPA parameter and analyzes its attenuation relationship to investigate further the impact of ground motion attenuation on structural damage in the Loess Plateau region.
PGV refers to the maximum absolute value of the ground motion velocity time history, measured in cm/s. It is related to the kinetic energy of particle vibrations and is a significant physical quantity for measuring ground motion energy. The velocity peak reflects the intensity of the medium-frequency components of the ground motion. Studies, such as blasting experiments, have revealed that PGV correlates well with the degree of structural damage.
Given the importance, intuitiveness, and clear physical significance of the peak parameters, PGA, EPA, and PGV, this study adopts a magnitude grouping approach. Specifically, it uses two-step fitting methods to establish the prediction equations for PGA, EPA, and PGV in the Loess Plateau region. Based on these equations, the corresponding attenuation curves and fitting residual distribution plots are generated, and the fitting performance of the two attenuation models is analyzed. Finally, based on the analysis results, a suggested selection scheme for the attenuation relationships of peak parameters in the Loess Plateau region is proposed.
2. Methods and Data
The strong motion data used in this work originate from the Strong Motion Observation Data Center of the Institute of Engineering Mechanics, China Earthquake Administration. In December 2023, the dataset was updated to include the most recent seismic event, i.e., the 6.2 magnitude earthquake on 18 December 2023, in Jishishan, Gansu.
2.1. Scope of the Strong Motion Record Dataset in the Loess Plateau Region
The study area primarily encompasses the Loess Plateau region in China, with selected seismic events located between longitudes 95 °E and 115 °E and latitudes 30 °N and 45 °N, covering seven provinces and autonomous regions, including Gansu, Shanxi, Shaanxi, Qinghai, Hebei, Inner Mongolia Autonomous Region, and the Ningxia Hui Autonomous Region.
2.2. Seismic Events in the Loess Plateau Region
This study involves 134 seismic events in the Loess Plateau region of China, including mainshocks and aftershocks. The magnitude of these events spans from
MS 2.4 to
MS 7.1, from 2001 to 2023.
Figure 1 illustrates the distribution of the epicenters and stations, highlighting that the seismic events are mainly concentrated in the Qilian Mountains, Helan Mountains, and Taihang Mountains, all located within the Fenzhou, Yinchuan–Hetao, Longmenshan, and Liupan-Mountains–Qilian-Mountains seismic belts.
Table 1 reports the magnitude distribution of the seismic events, where seismic events with magnitudes between
MS 4.0 and
MS 4.9 account for the highest proportion, reaching 42.5%. The second-highest proportion involves magnitudes between
MS 3.0 and
MS 3.9, at 30.6%.
2.3. Distribution of Strong Motion Records in the Loess Plateau Region
A total of 4010 strong motion records are collected, including records in the east-west, north-south, and vertical directions. Among these, 2678 are horizontal and 1332 are vertical strong motion records.
Table 2 lists the distribution of the strong motion records based on the epicentral distance, with distances between 30 km and 100 km accounting for 48.9% while records closer to the epicenter are relatively few. Only 99 records are within 5 km of the epicenter, accounting for approximately 2.5%.
Figure 2 depicts the epicentral distance distribution of the strong motion records collected in the Loess Plateau region, revealing that the epicentral distance range broadens as the magnitude increases. Most strong motion records are concentrated within an epicentral distance of 200 km, accounting for approximately 91.2% of the total records.
2.4. Relationship Between Magnitude and the Farthest Triggering Distance of Strong Motion Instruments
Equation (2) expresses the relationship between magnitude (
MS) and the farthest triggering distance (
R) of the instruments capturing strong motion in the Loess Plateau region. This parabolic relationship indicates that the farthest epicentral distance where strong motion stations are triggered increases as the magnitude increases. Notably:
where
R (unit: km) represents the farthest triggering distance and
MS (unitless) is the surface wave magnitude. This relationship is derived from strong motion observations in the Loess Plateau region. It applies to
MS values ranging from 2.4 to 7.1, with corresponding epicentral distances between 0 and 400 km.
2.5. Correlation Analysis of PGA and PGV in the Loess Plateau Region
The relationship between PGA (unit: gal) and PGV (unit: cm/s) in the Loess Plateau region is expressed as follows:
This equation represents a statistical relationship between PGA and PGV rather than a physical conversion. Equation (3) applies to the Loess Plateau region of China for earthquakes with magnitudes (MS) ranging from 3.0 to 6.5 and epicentral distances between 0 and 200 km.
Figure 3 depicts the distribution of PGA and PGV concerning the epicentral distance in the region while
Figure 4 illustrates the fitted relationship between PGA and PGV. In
Figure 4, PGV is plotted on the horizontal axis (unit: cm/s) and PGA on the vertical axis (unit: gal, equivalent to cm/s
2). The regression line in
Figure 4, based on Equation (3), reveals a linear relationship between PGA and PGV, with the slope of 18.067 representing the rate at which PGA increases with PGV and the intercept of 6.023 indicating the baseline PGA when PGV is negligible. This relationship allows straightforward estimation of one parameter from the other, providing practical insights into seismic ground motion in the Loess Plateau region.
2.6. Data Processing of Strong Motion Records
The acceleration records used are digital strong motion records with a sensor frequency response range of 0–80 Hz, a 200-sample-per-second sampling rate, and a pre-event storage capability of 10 or 20 seconds. Processing is required due to potential errors caused by the raw records’ baseline drift, background noise, and sensor tilt. Low-frequency noise can cause baseline drift, which introduces significant errors when computing velocity or displacement through integration. Therefore, baseline correction must be performed before conducting the attenuation fitting analysis.
The error-handling methods include high-pass filtering to eliminate background noise, the Iwan correction method to address sensor hysteresis [
14], and linear fitting to correct tilt errors in near-fault records [
15]. Data processing is conducted using the relevant filtering, correction, spectral analysis, and calibration programs developed by our research team. In this study, the two horizontal component records from a single station’s set of recordings are treated as two individual records, collectively called horizontal records.
2.7. Data Screening of Strong Motion Records
This study applies the following principles for screening strong motion records:
- (1)
Directional classification: Separate horizontal and vertical direction datasets were established. However, this study analyzes only the horizontal dataset;
- (2)
Amplitude screening: Due to the minimal impact of records with low PGA on engineering seismic resistance outcomes, this study excludes data where PGA is less than 5 gal to reduce the interference from low-amplitude data;
- (3)
Epicentral distance and station screening: This research focuses on attenuating moderate and minor earthquake strong motion records in the Loess Plateau region. Records with epicentral distances exceeding 100 km and those from non-surface stations are excluded;
- (4)
Site type screening: Considering the limited number and uneven distribution of data from bedrock stations, which makes it challenging to statistically derive reliable predictive models for bedrock sites, records classified under the site type "rock" are excluded;
- (5)
Anomalous data screening: Potential anomalous records are identified by plotting the distribution of PGA against magnitude and epicentral distance. Further waveform checks are performed using SeismoSignal software version 5.1.0 and records with obvious “spikes” or “cut-offs” are removed;
- (6)
Magnitude screening: These records are excluded due to the insufficient number and uneven distribution of strong motion records with magnitudes less than MS 3.0. Additionally, records with magnitudes greater than MS 6.7 are excluded to focus on moderate and minor earthquakes.
2.8. Attenuation Research Dataset
After screening the data based on the abovementioned steps, the dataset used to construct prediction equations for ground motion parameters of moderate and minor magnitudes in loess sites includes 1149 horizontal records. This dataset is suitable for fitting the horizontal attenuation relationships for magnitudes ranging from MS 3.0 to MS 6.5 and epicentral distances from 0 to 100 km. It is used to establish various prediction equations for ground motion parameters at the surface of loess soil layers.
It should be noted that the surface wave magnitude (
MS, unitless) is primarily used as the magnitude scale, owing to its wide adoption in China. Thus, most of the magnitude information in the original record headers is surface wave magnitude, with a minor portion being local magnitude (
ML, unitless). Local magnitude is converted based on the conversion formula provided in the "Earthquake Work Handbook" (China Earthquake Disaster Prevention Center, 1990) [
16]:
In the attenuation relationship research, selecting the distance parameter is critical. Commonly used distance parameters include epicentral distance, hypocentral distance, fault distance, and fault projection distance, among which epicentral distance and fault distance are more widely applied. This study primarily examines the attenuation relationships of earthquakes up to a magnitude of 6.5. Since most records in the dataset do not contain information about the seismic fault, this article uses the epicentral distance (R) as the distance parameter. Epicentral distance is the horizontal distance between the station and the epicenter, which can be calculated using the geographic coordinates of the station and the epicenter.
3. Attenuation Models and Regression Algorithms
Ground motion attenuation models, or the mathematical expressions of ground motion attenuation relationships, describe the variation of ground motion with distance, source depth, and other influencing factors. Since the source mechanism influences ground motion, the propagation medium and site conditions must be included in the model.
3.1. Attenuation Models Used in This Study
This study develops two types of ground motion parameter attenuation models. Model I is the attenuation model used in the “Seismic Ground Motion Parameters Zonation Map of China” (GB18306–2015) [
17], expressed as follows:
where E
M represents the products of E and
M. This notation is introduced for concise representation.
Preliminary analysis indicates a significant correlation between the attenuation rate of ground motion parameters and magnitude. To more accurately describe this relationship, this study references the related research findings [
18,
19] and improves the attenuation model. In the improved model, the attenuation coefficient term is a linear function of magnitude, forming Model II, proposed in this article, expressed as follows:
Model II is introduced as part of the improved model and is described by Equation (6). Where y represents the ground motion parameter, such as PGA or PGV; M denotes the surface wave magnitude;
R is the epicentral distance, measured in kilometers (km); A, B, C, D, E, F, and G are regression coefficients; and
ξ is a random variable with a mean of 0 and a standard deviation of
σ.
3.2. Regression Algorithms Used in This Study
After determining the functional form of the ground motion attenuation relationship, the least squares method is used to regress the seismic record data to obtain the coefficients within the model, constituting a nonlinear multivariate regression analysis problem. The data quality and the chosen regression method directly impact the results’ accuracy. Indeed, the uneven distribution of observational data in magnitude, distance, seismic events, and regions mandates addressing these biases by adding weights and employing a two-step regression method. This study adopts the two-step regression method proposed by Joyner et al. in 1981 [
20], which effectively decouples the correlation between magnitude and distance [
21].
The first step of the two-step regression rewrites the attenuation relationship (Equation (5)) as Equation (7), where the pseudo-variable Ei is a binary variable, i.e., if it is the i-th earthquake, E = 1; otherwise, E = 0. At the same time, bi and R0i represent the influence of the i-th earthquake. In the first step of regression, C is treated as a constant and the values for Hi and R0i within each magnitude band are obtained using the minimum variance principle within the estimated range of C, followed by the second regression step. The second step substitutes the values of R0i and Hi (i = 1, 2, ..., m) obtained from the first step into Equations (8) and (9) for calculation. The regression analysis provides the coefficients A, B, D, and E.
This method effectively decouples the impacts of magnitude and distance. Specifically, the first regression step does not involve magnitude and only analyzes the effect of distance, avoiding the interference of magnitude errors on the distance regression. The second regression step does not consider distance and only examines the impact of magnitude. This strategy ensures that each seismic event is given equal weight in the magnitude regression analysis, avoiding bias due to the abundance of observational data from an earthquake event.
6. Residual Analysis of Horizontal Peak Parameter Attenuation Relationships
In this section, the residuals refer to the logarithmic differences (Base 10) between the observed and predicted values from the strong motion records, which are the logarithms of the observed values minus the logarithms of the predicted values. The observed values are from the attenuation research dataset while the predicted values are calculated based on the peak prediction equations presented in this paper.
Figure 20 depicts the distribution of PGA fitting residuals based on epicentral distance and
Figure 21 illustrates the distribution of PGA fitting residuals based on magnitude.
The left chart in
Figure 20 displays the distribution of PGA residuals for Model I, for a residual range of −1.30 to 2.08 and an average of all residuals of 0.196. The right chart displays the PGA residuals for Model II, with a residual range of −1.21 to 1.02 and an average of all residuals of −0.0657. Comparing the two charts reveals that the fitting residuals for Model II are smaller than those for Model I, indicating that Model II provides a better fit than Model I.
In
Figure 21, the left and right charts represent the distribution of residuals for Model I and Model II, with pentagrams marking the arithmetic mean of residuals for different magnitude brackets. For Model I, the average residuals for each magnitude bracket are as follows:
MS 3.2 at 0.44,
MS 4.2 at 0.50,
MS 4.5 at 0.26,
MS 5.0 at 0.12,
MS 5.5 at 0.09,
MS 6.2 at 0.13, and
MS 6.3 at 0.04. For Model II, the corresponding values are
MS 3.2 at 0.12,
MS 4.2 at −0.04,
MS 4.5 at -0.15,
MS 5.0 at −0.30,
MS 5.5 at −0.22,
MS 6.2 at 0.09, and
MS 6.3 at −0.09.
Hence, there are differences in Model I and Model II performance under various magnitude conditions. Model I underestimates the PGA values for magnitudes less than MS 4.5 and Model II slightly overestimates the PGA values for magnitudes between MS 4.5 and MS 5.5. For magnitudes less than MS 4.5, the predictions from Model II are more accurate, whereas, for magnitudes between MS 4.5 and MS 5.5, the predictions from Model I are more favorable. When the magnitude exceeds MS 5.5, the prediction accuracy of the two models is comparable.
Figure 22 displays the distribution of fitting residuals for EPA, where the left chart refers to Model I, ranging from −1.28 to 2.17 with an average of 0.23, and the right chart refers to Model II, ranging from −1.24 to 1.09 with an average of -0.08. Comparing the two charts indicates that Model II has smaller residuals and a superior fitting result than Model I.
Figure 23 presents the EPA residuals for different magnitude brackets, where the left chart refers to Model I and the right for Model II, with pentagrams indicating the arithmetic mean of residuals for each magnitude. The trend in the distribution of the EPA residuals is roughly consistent with that of PGA. The average residuals for Model I are
MS 3.2 at 0.46,
MS 4.2 at 0.57,
MS 4.5 at 0.32,
MS 5.0 at 0.20,
MS 5.5 at 0.15,
MS 6.2 at 0.10, and
MS 6.3 at 0.06; for Model II, they are
MS 3.2 at 0.15,
MS 4.2 at −0.03,
MS 4.5 at −0.14,
MS 5.0 at −0.28,
MS 5.5 at −0.22,
MS 6.2 at 0.01, and
MS 6.3 at −0.12. Thus, Model I tends to overpredict at lower magnitudes while Model II’s predictions are more accurate, particularly for magnitudes below
MS 4.5.
Figure 24 illustrates the distribution of fitting residuals for PGV based on epicentral distance. The left chart displays the PGV residual distribution for Model I, ranging from −1.41 to 1.27, with an average value of −0.01, and the right diagram shows the PGV residual distribution for Model II, ranging from −1.28 to 1.00, with an average value of −0.08. The distributions reveal that Model II’s fitting residuals are slightly lower than those of Model I, indicating that Model II’s PGV fitting performance is marginally better than Model I’s. Overall, whether for PGA, EPA, or PGV, Model II’s predictive performance exceeds that of Model I, especially under conditions of low magnitude and large epicentral distance.
Figure 25 displays the distribution of PGV fitting residuals based on magnitude. The chart reveals that the range of PGV residuals varies across different magnitudes and the average residual values differ per magnitude bracket. For Model I, the average residuals are
MS 3.2 at 0.10,
MS 4.2 at 0.19,
MS 4.5 at 0.06,
MS 5.0 at 0.01,
MS 5.5 at −0.04,
MS 6.2 at −0.09, and
MS 6.3 at −0.12. For Model II, the corresponding average residuals are
MS 3.2 at 0.05,
MS 4.2 at −0.05,
MS 4.5 at −0.12,
MS 5.0 at −0.17,
MS 5.5 at −0.16,
MS 6.2 at 0.09, and
MS 6.3 at −0.09.
Overall, PGV fitting residuals are slightly smaller compared to PGA and EPA. For magnitudes less than MS 5.0, Model I’s PGV predictions are somewhat lower than the observed values. For magnitudes greater than MS 5.0, Model I’s predictions are slightly overestimated. Additionally, within the magnitude range of MS 4.0 to MS 6.5, Model II’s PGV predictions tend to slightly overestimate compared to the observed values. This suggests that different models have varying predictive accuracies across different magnitude ranges, necessitating the selection of appropriate models based on specific magnitude ranges to enhance prediction accuracy.