1. Introduction
Heart disorders are the leading cause of mortality worldwide and research studies on the numerous cardiovascular diseases have long been a major topic of interest. One of the most pressing study topics in cardiovascular disease is acute myocardial infarction (AMI) and the worldwide mortality it causes. AMI is a major cause of hospital admissions and deaths worldwide [
1,
2]. It is well-known that risk factors such as stress, smoking, obesity, comorbidities, and unhealthy lifestyle are accountable for AMI pathogenesis. However, there is growing evidence showing the existence of other, more complex factors which are also responsible for AMI, such as ambient temperature fluctuations, humidity, and atmospheric pressure [
3,
4,
5,
6,
7,
8,
9].
Low ambient temperature is one of the essential factors for the onset of AMI for medium-latitude populations and is more critical than physical exertion or psychological stress. Increased mortality due to AMI in association with fluctuations in ambient temperature have been reported around the world [
10,
11,
12]. An alternative study claims that a decrease in the ambient temperature by 10 degrees Celsius increases the risk of AMI by 10 percent [
13]. The last observation is also confirmed in [
14].
The importance of atmospheric pressure on cardiovascular events has been highlighted in [
15]. A 10-bar decrease or increase in pressure below or above 1016 mbar is associated with 12 and 11 percent increased incidences, respectively. Ozheredov et al. reported in [
16] that increasing atmospheric pressure positively correlates with arterial blood pressure, indicating the influence of atmospheric pressure on AMI.
Studies from China found that cold weather, in combination with air pollution, has the strongest effect on cardiovascular mortality [
17]. Consistent with this, air and water pollution are considered factors severely triggering AMI in addition to strokes, cardiac arrhythmias, and pulmonary diseases [
17].
It is well-known that environmental factors such as daylight (and circadian rhythms in general) have a pronounced effect on human mental and physical condition [
18]. The disruption of circadian rhythms has a negative effect on morbidity and mortality due to heart diseases. AMI has a significant correlation to the circadian rhythm [
19]. The relationship between the circadian periodicity and AMI has been repeatedly proven in several studies [
20,
21].
It was found that GSs may significantly increase platelet aggregation, blood coagulation, and its viscosity in addition to decreased blood flow in small and medium vessels [
21]. It is also reported that solar activity and its wind-forming geomagnetic storms (GSs) may have effects on human health [
22,
23]. In contrast, the increased LMF activity in higher frequency ranges yields a positive correlation with ischemic cardiac events. The impact of the local earth magnetic field (LMF) on cardiovascular events has also been reported in the scientific literature during recent years. For instance, it is reported that increased LMF activity at various frequencies can have effects on individuals, for example, increased LMF activity in low-frequency ranges is related with an increased occurrence of acute cardiac arrhythmias [
23].
It is shown in [
24] that an increased LMF does have a negative influence on the sensitivity of baroreceptors, which are involved in the self-organization of heart rhythm variability (HRV). It is shown in [
25] that the diastolic blood pressure reduces at least 2 mmHg, and the capillary blood flow increases at least 17 percent, when patients are placed into rooms isolated from LMF. The study in [
26] also implies an assumption that LMF may increase the blood vessel wall tone which is a negative factor for AMI. Data gathered during recent years encouraged us to perform this epidemiological analysis evaluating the causal links between changes in LMF and hospital admissions due to AMI in Lithuania from August 2014 till September 2019.
While analyzing the cardiovascular diseases associated with the LMF, it is imperative to recall the Earth’s natural rhythm. The frequency of the earth’s rhythm is 7.83 (Hz), known as the first “Schumann Resonance” with a (day/night) fluctuation of approximately ±0.5 Hz. The higher frequencies include 14, 20, 26, 33, 39, and 45 (Hz), which all overlap with human brain rhythms, alpha (8–12 Hz), beta (12–30 Hz), and gamma (30–100 Hz) [
27,
28]. In our study, we have analyzed the Schumann frequencies in the same ranges and used the letter S to indicate that we are referring to the Schumann frequencies (SDelta, δ (0–3.5 Hz), STheta, θ (3.5–7 Hz), SAlpha, α (7–15 Hz), SBeta, β (15–32 Hz) and SGamma, γ (32–65 Hz)) for the data.
As expounded upon in the preceding paragraphs, the correlation between LMF and AMI is pronounced [
29]. In this study, it is hypothesized that there exists a long-term correlation between the LMF and AMI. The purpose of the study is to analyze the data gathered from two Lithuanian hospitals and the public health institute to determine the relationships between the LMF and the number of patients registered in hospitals with AMI. Furthermore, the work is unique in that the existence of the association between the AMI and LMF is observed across the lengthy period of the observation window, which would otherwise be overlooked when considering small observation windows. It could be possible that not all of the frequency ranges of the Schumann resonances contribute to defining the relationship between AMI and LMF. Therefore, finding the Schumann frequency ranges that tend to cause the AMI is also an important part of this study. The research methods are based on extensive statistical analysis and the data are analyzed by the help of mathematical and statistical tools.
3. Results
3.1. Description of the Statistical Analysis
Let us begin with presenting a brief overview of the input and output variables used in this study. The input variables are identified as
(where n varies from 1 to 5) and the output variable is denoted as Y. In
Table 1, the variable
is the input parameter which describes the daily average of the Shumann resonances SDelta, δ (0–3.5 Hz), STheta, θ (3.5–7 Hz), SAlpha, α (7–15 Hz), SBeta, β (15–32 Hz) and SGamma, γ (32–65 Hz). The variable Y is the output parameter and represents the total number of patients hospitalized per day (see
Table 1 and
Table 2). In
Figure 1,
Figure 2,
Figure 3,
Figure 4 and
Figure 5, the x-axis shows the timeframe during which the recordings were made, which is from 2014 to 2019. The y-axis shows the daily averages of Schumann resonances observed for the local magnetic field. In
Figure 6, the total number of patients hospitalized each day is shown.
3.2. The Governing Model Equations and Explanations
The main objective of this study is to identify the relationship between
(output) and
, and
(input) variables. To accomplish this task, statistical techniques are employed. The linear regression model is used to analyze the relationship between the input and the output variables. To begin with, the relationship between the output variable
and the input
can be expressed as:
where
is the residual of approximation;
.
Let us denote the observable values of
as
, where
is the number of days available in the dataset. The considered linear regression model then reads:
where
is the
-th observation of variable
(
);
is the
-th residual (
. The model can be expressed in matrix form:
where
,
,
,
.
The vector of model parameters
are determined by the method of least squares:
Following this, the regression equation reads:
3.3. Correlations between Input Variables , and
The model of the multi-parameter linear regression is valid when variables
, and
do not correlate in pairs. This condition is validated by computing Pearson correlation coefficients between the variables:
where
. The computational results are presented in
Table 3.
From
Table 3, it can be observed that variables
are linked by a strong positive linear relationship. However, relationships between
and
are weak. According to the hypothesis test, the correlation value between the variables is equal to zero and they are rejected with a significance level (
p-value) < 0.05. Therefore, the Pearson correlation coefficients are statistically significant. The three regression equations listed in
Table 4 describe the relationship between the input variables.
The null hypothesis about the coefficients being equal to zero is rejected with a significance level (
p-value) of <0.05. Therefore, the alternative hypothesis that the model coefficients do not equal zero is accepted. The hypotheses about the equality of model coefficients with zero are dropped with a significance level (
p-value) of <0.05. Thus, the model coefficients statistically significantly differ from zero. Therefore, some of the variables
,
must be dropped from the regression equation because those variables are multi-colinear, and those correlations are strong. After dropping the variables
, and
, the regression equations reads.
3.4. The Time-Average Model
It appears that the instantaneous relationship between variables Y and and is insignificant. The linear regression model (Equation (7)) yields insignificant coefficients, and the coefficient of determination is very small. However, it would be strange to expect to observe an instantaneous causal relationship between the local magnetic field and the number of hospitalized patients with myocardial infarction. Otherwise, one could expect to observe a sharp increase in the number of myocardial infarctions at the day of the increased geomagnetic activity. However, such relationships have not been reported (our study also does not reveal instantaneous relationships between the local magnetic field and the number of infarctions).
In this work, the influence of the averaged magnetic field on the individuals experiencing the heart attack becomes evident over long observation windows [
32]. One could consider the well-known effect of nonlinear chaotic synchronization. For example, the magnitude of the diffusive coupling between two chaotic oscillators can be below the noise level—but chaotic synchronization can be detected between two (or more) coupled nonlinear oscillators after long transients. A paradigmatic example of such coupled systems is the synchronization of Huygens pendulum clocks discovered more than 300 years ago [
33], but still studied in scientific literature today [
34,
35].
Keeping such small possible interactions in mind, we transform the governing regression equation model (Equation (7)) into a time-average regression model. The instantaneous (24-h based values) are replaced by their overlapping moving averages. The length of the observation window used for the averaging L is set to 90, 120, and 180 days.
For example, the least squares method for the regression model (Equation (7)) yields the following estimates of the model parameters at L = 180:
The following hypothesis is tested with the significance level
:
Additionally, hypotheses about the equality of individual coefficients with zero are tested separately:
In all cases, the main hypotheses
are rejected (
p-value < 0.05). Therefore, the coefficients of the regression equation
differ significantly from zero. The results of hypothesis testing and parameter estimates are given in
Table 5.
In
Table 5, the
value is
and
value is 0.7437. The F-statistics are 2474 and 1702 degrees of freedom, and the
p-value is less than
.
3.5. The Derived Model and Statistical Estimates
The derived regression equation with 180-day moving averaging interval reads:
The variation intervals of variable is and is accordingly. The suitability of the model and special variants of this model are discussed in this subsection.
The variance inflation factor (VIF) is computed for each variable. VIF = 1.0734 < 4 for both variables
and
. Therefore, those variables are not multi-colinear (see
Table 6).
In
Table 6, the 0-multicollinearity criterion marks that the quantities are not multicollinear. Multicollinearity is a statistical phenomenon that occurs when two or more independent variables in a regression model are highly correlated with each other. In other words, multicollinearity indicates a strong linear relationship among the predictor variables. It is possible that the pairwise correlations are small, and yet a linear dependence exists among three or even more variables, for example, if X3 = 3X1 + 2X2 + error.
If VIF < 4 for some variable (in this case, the criterion acquires a value/mark of 0), it means that the variable is not correlated or weakly correlated with other variables, so it is appropriate to include this variable in the regression model.
The determination coefficient of the time-averaged model (
) is
. Therefore, the moving averages of
and
well describe the variation in the moving average of
(see also
Figure 7).
The limitation of the derived model is that the residual errors are not distributed according to the normal distribution. However, this distribution is close to normal. Therefore, it is likely that the problem would vanish if the number of observations were enlarged.
The time-average model at
reads:
The time-average model at
reads:
In both cases, the coefficient at does not differ significantly from zero. Other coefficients do differ significantly from zero (). The determination coefficient at is 0.6. The determination coefficient at is 0.66. Those results prove that the best linear regression model is derived at .
The time-average model at is derived by dropping the variables , and . The model can be constructed by dropping other sets of multi-colinear variables.
The time-average model with dropped variables
at
reads.
and the determination coefficient of the model is 0.72.
Dropping variables
yields the time-average model (
).
However, variable becomes statistically insignificant in this model. The determination coefficient of the model is 0.75.
Finally, dropping
yields the time-average model (
).
The determination coefficient of this model is 0.66.
4. Discussion
As mentioned before, the purpose of the study is to find the long-term relationship between the LMF and the AMI. To achieve the objective, the patients are investigated by the variations of Schumann resonance namely (SDelta (0–3.5 Hz), STheta (3.5–7 Hz), SAlpha (7–15 Hz), SBeta (15–32 Hz) and SGamma (32–65 Hz). The Schumann resonances as well as the number of AMI patients registered in each of the hospitals are recorded each day. The output (Y) represents the total number of patients hospitalized in Lithuania per day with the diagnosis of AMI. The input variable
,
and
represents the daily averages of the Shuman resonance (
) for the LMF measured by the Lithuanian magnetometer. First, the linear regression model equation is posed in (Equation (1)) to show the relationship between the input and output variables. The observable values of the output variable
reads
, so (Equation (1)) translates to (Equation (2)). For convenience, the matrix form of the same is also presented in (Equations (3)–(5)). The proposed model of linear regression is based upon multi-parameters (due to having different Schumann resonances and number of patients registered each day). Therefore, the validity of the multiparameter model in linear regression is proved by computing the Pearson correlation coefficients between the input variables
and
. The typical condition for adhering to the multi-parameter linear regression is “the variables should not correlate in pairs”. Theoretically, this means finding out Schumann resonance which have a weak relationship in pairs. The results of
Table 3 indicate that the variables
and
are weak in their relationship under the significance level of 0.05. This step provides a lead in formulating the equations that describe the relation between variables (
), (
,
) and (
,
). However, the hypothesis about the equality of model coefficients to zero is dropped, with a significance level 0.05 (
p-value < 0.05). Some of the variables,
,
, must be dropped from the regression equation because these variables are multi-colinear, and correlations are strong. After dropping the variables
, and
, the regression equations reads (Equation (7)). However, (Equation (7)) produces results which are based upon the data which are observed for short term scales, or in other words; the linear regression model presented in (Equation (7)) yields statistically insignificant coefficients, and the coefficient of determination is very small.
This raises the question: Does the linear regression model in (Equation (7)) provide enough information to determine the relationship between AMI and LMF? To answer the question, it can be stated that (Equation (7)) happens to represent the instantaneous causal relationship between the LMF and the number of hospitalized patients with AMI. This demonstrates that a rise in geomagnetic activity will likewise increase the number of individuals experiencing heart attacks, which is not possible.
As a result, the paucity of instantaneous correlations leads to the necessity to investigate the long-term relationship between AMI and LMF. Therefore, we transformed the governing equations to a time-averaged regression model, which is the essence of the study. The averages of the relevant Schumann resonances SBeta (15–32 Hz) and SGamma (32–65 Hz) influence the average number of patients.
To shed light upon the various causes of AMI, it is reasonable to ponder whether the LMF is the only cause of AMI. Again, the subject is vast and must be investigated from other perspectives. AMI can perhaps also be due to trauma, excess demand on the heart, etc. Additionally, the origin and the description of the pathomechanisms of this relationship are out of the scope of this study. Detailed investigations of causes, origins, and mechanisms involved remain a definite aim of future research. The study’s main objective is to prove the statistical hypothesis on the existence of long-term relationships between LMF and the number of hospitalized patients with the symptoms of AMI.
6. Limitations
This research study comes with limitations similar to any other scientific work. The first constraint is that only data from individuals who have been diagnosed with AMI and admitted to the hospital were collected. It is probable that there might be cases who have developed the symptoms of AMI but have not been admitted to the hospitals. In other words, off-the-record data are not taken into account.
Another limitation of the study is pertinent to the derived model of the study. It is observed that with a derived model, the residual errors are not distributed according to the normal distribution, although they might be close to the normal. Therefore, it is likely that the problem would vanish if the number of observations were enlarged.