1. Introduction
Natural behavior of the hydrological system is changing nonlinearly, influenced by climate change, and it does not always show chaotic characteristics. Coastal areas, where more than 50% of the world’s population inhabit, are particularly more vulnerable to storms, typhoons, and floods than inland areas [
1]. Tidal river areas where freshwater stream meets the ocean are influenced by the tide that cause higher flood levels and greater damages. From this point of view, predicting tidal river water level is important in flood disaster management. The water level of a tidal river is composed of individual hydrologic components such as tide, wave, and river stage. When it rains, the river stage fluctuates due to the basin runoff discharge, which affects the water level of the tidal river. Therefore, it is necessary to analyze how the time series of each component and rainfall occurrence have affected the water-level time series of the tidal river. To do this, we may need to separate hydrological components which include tidal level, wave height and river stage from water level of a tidal river [
2].
Various analysis techniques, which are mostly linear models, have been developed for the predictions of rainfall and runoff which are important for managing water resources and for disaster prevention efficiently. However, a hydrological time series with nonlinear characteristics requires nonlinear models for forecasting and analysis [
3,
4,
5,
6,
7,
8]. In particular, techniques such as the false nearest neighbor algorithm, the Lyapunov exponent method, and the method based on the correlation dimension are used for the chaos analysis to identify the nonlinear dynamics characteristics of hydrological time series [
9,
10,
11,
12,
13,
14].
Sangoyomi et al. (1996) performed chaos analysis using correlation dimension, nearest neighbor dimension, and false neighbor dimension to understand the seasonal variation patterns of the Great Salt Lake (GSL) biweekly volume series [
15]. Then they obtained the correlation dimension of 3.4, suggesting there are four factors affecting the GSL volume series. On the other hand, Kim et al. (2001) used a close returns plot (CRP) to analyze the chaotic characteristics of waste water flow in Fort Collins, Colorado, and the GSL volume series in Utah, USA [
16]. Sivakumar (2002) assessed the reliability of the correlation dimension of the short-term hydrological time series using phase-space reconstruction and an artificial neural network, arguing that the accuracy of the correlation dimension does not depend on the length of the time series but on its ability to rationally show dynamic changes in the system [
17]. Salas et al. (2005) suggested that the increased rainfall duration interferes with the identification of chaos characteristics, finding that the river with the stronger basin storage contribution departs significantly from the behavior of a chaotic system [
18]. Because streamflow result from a complex transformation of precipitation that involves accumulating and routing excess rainfall throughout the basin and adding surface and groundwater flows, the result may be that streamflow at the outlet of the basin depart from low dimensional chaotic behavior. This shows that hydrological processes, such as rainfall, storage, and runoff, affect each other and have chaotic characteristics.
Khatibi et al. (2012) used five nonlinear dynamic methods using the runoff time series observed at the Sohutluhan hydrological station in Turkey, and suggested the existence of low-dimensional chaos [
11]. Meanwhile, Kim et al. (2015a) analyzed the chaotic behavior of hydro-meteorological processes in the Great Salt Lake and Bear River Basin, USA, by applying wavelet transform and correlation dimension to the precipitation, air temperature, discharge, and storage volume. They found out that the chaotic characteristic of the storage volume which is output is likely a byproduct of the chaotic behavior of the reservoir system itself, rather than that of the input data. In addition, Kim et al. (2015b) analyzed the time series characteristics related to the drought length (DL) using nonlinear dynamic techniques, such as BDS statistic and a close returns test (CRT) [
19]. The utility and effectiveness of the methods were first demonstrated on synthetic time series and then tested on real hydrologic time series. They suggested that the daily inflow of the Soyang River in South Korea is different from the DL data in the characteristics. The different characteristics of the original real series and their DLs may be due to the noises from many sources and therefore, noise cancellation may be important for obtaining original property of the time series. On the other hand, Rajagopalan et al. (2019) extracted the dominant quasi-periodic components through the wavelet analysis of the flow rate at the Lees Ferry station in the Colorado River and quantified the trajectory branching in the phase space over time using local Lyapunov exponents [
20]. As a result, the flow rate of the Colorado River was revealed to be highly predictable.
Previous studies have mostly used the runoff data of the rivers inland that are influenced by the rainfall-runoff process. However, tidal rivers are influenced by many other factors, such as tidal level, wave height, river water level, and rainfall, which make their properties nonlinear. Tidal rivers are typically affected by backwater, a zone of spatially decelerating flow that is transitional between the stream flow of upstream and the tidal flow of downstream in the river, and it has a complex characteristic. Since water level and runoff of tidal rivers vary with the influence of the tide, the backwater effect during high tides can cause more flooding damages [
2]. For example, in 2016, the damages by the typhoon ‘Chaba’ in Ulsan city area of South Korea were aggravated by the tide. During the typhoon ‘Jebi’ in 2018, the maximum tide level has been observed, and the effect of tidal currents did not drain the runoff into the river, causing great damage to the Kansai Airport in Japan [
21]. In order to reduce flood damage by tide effect, many studies have been analyzing tides and waves. Siek and Solomatine (2010) predicted storm surges using a nonlinear chaos model and showed that the chaotic characteristics were caused by the tidal level and tidal wave’s complex effects [
2]. In addition, Zounemat-Kermani and Kisi (2016) identified the chaotic characteristics of a coastal area’s dynamic system using significant wave height, wave period, and wave direction [
6]. They suggested that the wind direction has a low correlation dimension of 4 or less, while the significant wave height and the mean wave period have a high correlation dimension of 5 to 8 in their chaotic characteristics. However, these studies conducted chaos characterization only for storm surges or single hydrological factors, implying a limitation in applying the method to tidal rivers affected by various hydrological components.
As mentioned above, many studies have been analyzed to characterize the nonlinear characteristics of rainfall and runoff. Understanding the dynamics of the rainfall and runoff process is one of the most important problems in hydrology for the management of water resources. However, it is very difficult to understand the dynamics of the water level in tidal river because the tidal river is affected not only by rainfall, but also by ocean components such as tide and wave. None of the studies could provide information considering the possibility of the existence of chaotic behavior by decomposing the components affecting the water level in the tidal river. The purpose of this study is to attempt to address this issue. Therefore, this study separates the hydrological components (tide, wave, rainfall-runoff (river stage by rainfall), and noise levels) using daily water level data of a tidal river observed in Ulsan station. In identifying the relationship between the separated components and the original water-level time series, the study quantifies the information provided by the four components in the water-level time series based on the information entropy theory. In addition, the phase-space reconstruction for the time series of each component was conducted to examine the characteristics based on the attractor. The correlation dimension for the time series of each component is analyzed for chaos characterization and CRP is used to suggest the periodicity as well as its chaotic characteristics.
2. Study Area and Methodologies
This study separated a water-level time series of tidal river into tide, wave, rainfall-runoff (or river stage), and noise level components as proposed by Lee et al. (2018) [
2]. Additionally, we analyzed using each component time series for searching for chaotic characteristics. The methodologies for a water level decomposition and for the chaos analysis were described in this section. The flowchart and analysis method of this study is shown in
Figure 1. First, the water level time series was decomposed into tide, wave, rainfall-runoff (or river stage), and noise components through wavelet analysis, curve fitting, and the filtering technique (see Lee et al., 2018) [
2].
Furthermore, the information entropy theory was used to quantify the amount of information provided by four components to the water level time series to determine the relationships between the separated components and the original water level time series. Based on the estimated amount of information, the nonlinear dynamics and chaos analysis were applied to observe the characteristics for each time series of component of the water level time series. The methods we used to analyze the chaotic characteristics of the time series include a metric method called correlation dimension and a topological method called a close returns plot (CRP) [
22,
23].
2.1. Study Area
The study area is Taehwa river basin in Korea, and the water level time series measured at Ulsan station in the basin is obtained. Taehwa River basin is located in the south-eastern region of the Korean peninsula. The basin area is 646 km
2 and average slope of the basin is 0.2770. The length of river is 47.54 km and the shape factor is 0.72, which shows a characteristic that peak runoff can be occurred in a short time. The west side of the basin is mountain area, so the river flows from west to east then meets the open sea in the estuary. The estuary is subject both to marine influences such as tides, waves, and influx of saline water and to riverine influences such as flows of freshwater and sediment. There are two Ulsan stations (
Figure 2). One is for observing the sea surface level and the other is for observing water level of tidal river. Because the Ulsan sea surface level station is at the downstream end of the station for water level, the study area is deemed suitable for applying and verifying the proposed methodology.
2.2. Study Area Water Level and Sea Surface Level Characteristics
The water level and sea-level time series from Ulsan stations are obtained at a daily scale (data size of 2891) from 1 January 2011 to 31 November 2018. The unit of data is daily data, and daily data have been observed stably without missing data since 2011. The water level time series shows the typical characteristics of a tidal river with high variation in low water level, and is also affected by rainfall (
Figure 3a). Meanwhile, the sea level time series have low variation characteristics and are influenced by tides and waves.
2.3. Decomposition and Information Quantification of the Time Series
2.3.1. Decomposition of the Time Series
A tidal river is located in a coastal area and influenced by backflow caused by the mixture of tide and stream. Therefore, the water level and the runoff fluctuate with the tidal impact, and flood damages are aggravated by the backwater effect during high tide. This makes the characteristic of the tidal river water level understand the fluctuation behavior of rainfall, tides, and waves. Consequently, this study decomposed the water level time series as proposed by Lee et al. (2018) [
2], where wavelet analysis and curve fitting are used to separate the water level data into the “tide” and “wave” component and the filtering method is used to separate the rest of the components, “rainfall-runoff or river stage” and “noise”.
2.3.2. Information Quantification
Entropy theory is used to quantify the time series effect of the separated components. Shannon and Weaver (1949) defined the limit entropy as Equation (1) [
24]:
where
is the occurrence probability of
(
), and
is the limit entropy indicating the information amount of
X. When the variable
is the information having a relation with the variable
, the uncertainty of
is likely reduced. Based on this theory, the conditional entropy
can be estimated, as shown in Equation (2), and indicate whether a random variable
helps to predict the value of another random variable
.
where
is a joint probability when
and
.
is a conditional probability of
X at a given
Y. Information transferred between
X and
Y can be explained as Equation (3):
The information sent from
X to
Y is defined as
in Equation (4):
where
is the limit entropy of the single variable
Y and
is the trans-information between
X and
Y.
2.4. Phase Space Reconstruction
2.4.1. Method of Delays
In reality, natural phenomena can be characterized as deterministic, stochastic, or chaotic, and the reconstruction of the phase space and the attractor could be used to identify characteristics of the time series. Packard et al. (1980) and Takens (1981) suggested the method of delays for the phase-space reconstruction of the time series [
25,
26]. A scalar time series
is embedded into m-dimensional space by constructing the vectors as Equation (5):
where
is the delay time and
is the embedding dimension, both of which must be chosen appropriately.
2.4.2. Estimation of Delay Time
There are some methods widely used to determine an appropriate delay time: (1) autocorrelation function (ACF) [
27], (2) average mutual information [
28], (3) correlation integral [
29], and (4) the C-C method [
30]. Although ACF is the general method for obtaining the delay time, its high dependence on linearity is not applicable for the nonlinear system. Meanwhile, the average mutual information method can estimate the delay time regardless of linearity or nonlinearity, but has a limitation in that it requires a large amount of data. Ultimately, this study used ACF and average mutual information to estimate the delay time for the phase space reconstruction of each component and then the attractors can be obtained for each time series. Choosing the proper delay time and method of delays can be found in [
23,
30].
2.5. Correlation Dimension
The correlation dimension is used for the quantitative analysis of the typical geometry and the localized distribution of attractors in a phase space. In obtaining the correlation dimension, Gassberger and Procaccia (1983) proposed a method by estimation of the quantity called correlation integral [
31]. This value is the number of phase vector points calculated for the radius, which is located with certain scaling, and the correlation integral [C(r)], which can be obtained as Equation (6):
where
,
.
is the Heaviside step function where zero is applied to real numbers less than zero, and one is applied to real numbers greater than zero.
N is for the number of all coordinate points in one-dimensional space,
for the number of state vector points in m-dimensional space, and state vector points are expressed by
and
in the phase space. When the value of the radius r is given, it can be converted to the logarithmic scale to obtain
, which is called the correlation dimension, as shown in Equation (7):
The correlation dimension can be defined as the variance of correlation integral from the change in the scale. If the hydrological time series is stochastic, the correlation dimension continues to increase as the embedding dimension m increases. However, if it is chaotic, it is characterized as converging into a certain value, regardless of an increase in the embedding dimension .
2.6. Close Returns Plot
A close returns plot (CRP) is used for analyzing the chaotic characteristics of a time series in this study. The chaos system is characterized by complex and strange attractors on a phase space with the repeated occurrence of a periodic orbit. The analysis is based on the rule that, after a random time, the chaos system passes through the neighborhood point again [
16,
32]. For this,
, indicated by the difference of the points in the time series, is used as in Equation (8):
If the system is close to the state
t =
j at
t = 1 in the phase space, then the necessary condition is
, and
could be drawn diagonally. In addition, the same diagonal line is drawn repeatedly at
i = j. The figure is drawn as in Equation (9):
where
is the radius at an arbitrary point, and
and
are the maximum and minimum values of the time series, respectively. The range of
was suggested in [
16].
5. Summary and Discussions
This study reconstructed the phase space, analyzed the correlation dimension, and a CRP to identify the characteristics of the water level time series of the tidal river. Because the water level time series of a tidal river shows complex hydrological characteristics affected by tides, waves, and other factors, this study decomposed the water level time series into four components to examine each of their hydrological characteristics. In quantifying the effects of the decomposed components on the water level time series, entropy information theory was used. As a result, the tide component was identified to have the most influence on the original Ulsan water level time series by 48%, followed by the wave component at 24%, the rainfall-runoff component at 17%, and the noise component at 11%. This indicates the possibility of the tide component with a periodic fluctuation, but not exact periodic, as the major factor of water level change that leads to chaotic characteristics.
Furthermore, the delay time was calculated using the ACF and average mutual information to examine characteristics of each component. For data such as the tide component and the original time series of the water level, the same delay time was calculated. However, the data with random characteristics, such as the rainfall-runoff component, had different delay times due to the difference in the linearity and nonlinearity. The calculated delay times were used to draw the attractors on two-dimensional phase space. As a result, the water level time series, tide component, and wave component had the trajectory within a certain boundary. However, the noise component presented a random trajectory.
In determining whether each component has stochastic or chaotic characteristics, the correlation dimension was estimated. With the correlation dimension of 1.30, the tide component was influenced by two variables (mid-period and long period). As shown in
Figure 9, only the tide component had a chaotic characteristic as it converged into a certain value, and the other components showed stochastic characteristics as they diverged. These results are significant in determining whether a linear model or a nonlinear model should be used to predict tidal river water levels with complex hydrological characteristics.
Last, this study analyzed the chaotic characteristics of each component using CRP to understand the recurring trajectory in a graph. When regular horizontal line segments repeatedly appear in the CRP, it can be assumed as the periodic data that the corresponding trajectory is continuously visited. On the other hand, the irregular horizontal line segments indicate chaotic characteristics, while random distribution without lines implies stochastic characteristics. As a result, the original water level time series showed aperiodic characteristics caused by tidal effects during no rainfall (
Figure 11a). However, when rainfall occurs, it had oblique line segments indicating stochastic characteristics. Therefore, this time series might be mixed with chaotic and stochastic characteristics, which can be called stochastic-chaos. For the tide component, irregular horizontal line segments appear similar to those of the aperiodic function and so it indicated chaotic characteristics by the uncertainty of natural phenomena. For the other components, the segments are oblique lines, indicating their stochastic characteristics.
Table 4 shows the characteristics of each of the components. The water level time series were influenced by various hydrological factors and showed stochastic characteristics in the correlation dimension, but the characteristics of aperiodic-stochastic chaos in CRP. Salarieh et al. (2009) have suggested that the behaviors related to stochastic chaos characteristics can be identified using the white noise in the deterministic chaos system [
37,
38,
39].
To confirm this behavior, the time series of Gaussian noise showing stochastic characteristics and the Hénon map showing chaotic characteristics were generated and combined as shown in
Figure 13. Then, the correlation dimension is estimated, and CRP is drawn for investigating chaotic characteristics of the combined time series. As we can see in
Figure 13, the time series shows stochastic property in the correlation dimension but stochastic chaos characteristics in CRP.
Figure 13c is showing the stochastic phase on the left side of the yellow line, but the chaotic phase on the right side of the line. Therefore, we can confirm the stochastic chaos characteristics for the combined time series, and the water level time series also shows the same stochastic chaos characteristics. In other words, the water level time series showed chaotic characteristics when there was no rainfall but presented stochastic characteristics when rainfall occurred (
Figure 11a). As for the tide component, which affects the water level time series by 48 h, it shows aperiodic chaotic characteristics, while the other components were characterized as aperiodic stochastic data.
6. Conclusions
This study conducted chaos characterization analysis using the tidal river water level time series of the Ulsan station in Taehwa river basin, Korea. For the analysis, the data from January 2011 to 31 October 2018 were collected as shown in
Figure 3. We considered the effects of increasing water level on tidal rivers which were caused by various hydrological factors such as tides, waves, and rainfall. We identified the components that influence tidal river water-levels and then analyzed the chaotic characteristics for each of them, namely (1) the tide component, (2) the wave component, (3) the rainfall-runoff component, and (4) the noise component. As a result of the water level decomposition, tide component was found to have the most influence on the water level time series. In addition, chaos analysis was performed for each component, which revealed that the tide component has chaotic characteristics while the other components have stochastic characteristics. Additionally, water level time series showed stochastic chaos characteristics in its CRP, and we showed that the combined time series of Gaussian and the Hénon map series also has stochastic chaos properties. This means that the chaotic tide component and stochastic rainfall shock are combined and make the stochastic chaos time series.
Although the results from this study could not allow one to conclude with a definitive answer regarding the existence of chaotic dynamics of the rainfall-runoff process, the study indicates that such an existence should not be excluded. It is necessary, therefore, to continue the investigation to confirm the existence of a chaotic component in the rainfall-runoff process. One notable point is that the stochastic chaos behavior discovered through the combined time series using Gaussian and the Hénon map is similar to that of the water level time series, which suggests a new approach to define the nonlinear behavior. This finding may be helpful for nonlinear modeling for predicting the tidal river water level time series, which can be separated as tide and rainfall components.