2.1. Data Collection
The survey on the residents’ perceptions of the impact of noise emissions from cargo terminals on their health status and living standards was conducted during the last quarter of 2022. For this research, a questionnaire was created, and the data collection method was divided into the provision of interviews and an online survey disseminated across the relevant and available platforms. The questionnaire was structured in three parts: general data on participants, knowledge of noise pollution, and residents’ perceptions of noise impacts from cargo terminals. Additionally, the final part was intended for suggestions for resolving the noise issues that originated from the cargo terminals in the Port of Split. This initial campaign resulted in 148 completed questionnaires. Due to constraints that occurred during its implementation, the authors repeated the survey in January 2023 and collected an additional 54 responses. A total of 202 participants took part in the survey. It represented approximately 5% of the total exposed population in the examined area. This percentage can be deemed as representative, particularly due to the mainly reluctant attitude of inhabitants toward active participation in the survey. The valorization of individual criteria by residents was performed without former knowledge and experience from an acoustics and noise effects standpoint. The sole information on the noise intensity from specific sources (
Table A1 from
Appendix A) was presented before the questionnaire distribution (
Appendix A) or at the start of the interview.
To determine the total number of residents in the local population exposed to excessive noise levels, the authors implemented an on-site field survey, which combined the field investigation in the form of interviews and periodical measurements with low-cost devices during the preparation phase of this research. These measurements were performed periodically, once a week throughout each month (July–September), predominately during the evening-night periods, mainly due to the absence of background noise that facilitated the measurements. Additionally, the measurement site varied between the two examined districts and analyzed streets. The activity was performed by using low-cost equipment: a digital sound level meter with a measuring range of 30 to 13 dB (A), accuracy of ± 3.5 dB and measuring range resolution 0.1 dB. The rough estimations were already published in the previous paper [
14] and based on two separate inhabited areas surrounding the cargo terminals, the northern (Vranjic peninsula) and southern ones (city of Split). The northern area falls within the jurisdiction of Solin town, while the other is under the authority of the city of Split [
14]. The measurements revealed occasional, momentarily excessive noise levels. Based on the data collected, this noise type was evaluated as irregular. Furthermore, the activity hat caused the noise to exceed threshold values was cargo handling of various cargo types. The results for the estimated number of local inhabitants exposed to excessive noise levels in the two areas surrounding the cargo terminals are presented in
Table 1.
Table 1.
Estimated number of local inhabitants exposed to noise from cargo terminals in the Port of Split [
14].
Table 1.
Estimated number of local inhabitants exposed to noise from cargo terminals in the Port of Split [
14].
| Total Population | Percentage of Exposed Population | Total Exposed People |
---|
Split city district—Brda | 6188 * | 60% | 3713 |
Solin city district—Vranjic | 1066 ** | 70% | 747 |
The process of identification and categorization of noise sources in the Port of Split according to the relevant criteria was already carried out in the previous research phase and published in the work of Vukić et al. [
14]. The main conclusions and data from the previous work were an important prerequisite and basis for conducting the current study.
2.2. Methodology and Data Analysis
Several methods were applied to process the data retrieved from the conducted survey. The descriptive statistics were used in the principal part of the questionnaire analysis and set criteria as a pre-phase of data processing. It was relevant to draw initial conclusions and perform valorization of individual parameters used for the selected parametric data analysis application and statistical tools. The authors used Pareto analysis, linear regression, and two-way ANOVA with the Tukey post hoc test for the data processing of individual criteria.
The Pareto analysis is commonly used to illustrate the distribution frequency of descriptive data classified in categories, usually provided in a chart form, showing which ones principally affect the examined problem in descending order (from left to right). In addition, the accumulated percentage of frequencies is shown by a line [
30]. As one of the main premises in the use of Pareto graphs related to the identification of organizational deficiencies, this theory can be applied for various purposes [
31]. The function of Pareto analysis is prioritization of the parameters having the most influence on the selected problem compared to the remaining factors. The final objective is the improvement of opportunities for identification [
32]. Basically, the Pareto principle is based on the premise that 80% of implications are a function of 20% of causes, nominated as the vital few [
33], or that the majority of problems (80%) can be resolved by 20% of effort [
34]. The advantages of the Pareto chart are thoroughly explained in Joiner Associates Staff [
35,
36], and its application as a statistical tool to perform case studies in Realyvásquez-Vargas et al. [
30]. The Pareto analysis in this study was utilized to determine the location of the noise source from the cargo terminals and identify the most influential port activities generating a nuisance while exceeding the noise threshold levels.
Due to the specific needs and objectives of the research, the authors applied regression analysis for the selected data. Generally, regression analysis is a statistical method that analyzes correlation and mutual relationships between dependent and independent variables. Linear regression models a linear relationship between a dependent variable and one or more independent variables [
37]. Determination of the relationship between the independent and dependent variables results in a defined point in the coordinate system. The set of such items determines the correlations of direction, shape, and strength. The linear expression explains the relationship between dependent and independent variables and can be expressed as:
where
y is the dependent variable (output value),
x is the independent variable (input value), and parameter
b is the weight of the independent variable (regression coefficient). The expression
bx is a variable part, being changed by the regression coefficient, while
a is a fixed part.
Regression analysis is also nominated as the method of least squares, and the most significant value is the coefficient of determination or representativeness (
R2), which indicates the strength of the correlation. The coefficient of determination (2) was retrieved from Stat Trek [
38] and expressed as follows:
where
N is the number of observations used to fit the model,
xi is the
x value for observation
i,
is the mean
x value,
yi is the
y value for observation
i,
is the mean
y value,
σx is the standard deviation of
x, and
σy is the standard deviation of
y.
Generally, the Chaddock scale is used in statistics to qualitatively assess the set criteria evaluating the density of mutual connections [
39], with the aim of determining the degree of correlation [
40]. The Chaddock scale is used for interpretation of the results, where the value
R2 = 1 indicates a complete, functional, or deterministic correlation, while
R2 < 1 defines some degree of statistical or stochastic correlation. A positive correlation between variables occurs when the increase in one variable is accompanied by the increase in another one, while a negative correlation can be defined as the disproportionate change in the values of two variables where one variable decreases and another one increases. The interpretation of correlation analysis results according to the Chaddock scale is indicated in Borovskaya et al. [
41]. The authors applied a linear regression model in this research to examine the dependence between variables of the distances of selected residences (streets) toward the noise sources and noise intensity from the cargo port, assessed by residents for three different times of the day (day, evening, and night). The reference point of the noise source was set to berth 2 of the cargo terminal, representing the mean distance between the two districts examined and the most frequent berthing place of cargo ships (
Figure 1).
It was to determine whether the results of the estimated noise intensity correlated with the distance, thus whether the assessed noise decreases with the distance from the noise source, and vice versa.
The ANOVA test represents a statistical analysis tool used to evaluate whether two or more datasets have been statistically significant by examining the differences in averages using variance [
42]. In addition, ANOVA is a statistical test to perform hypothesis testing among other alternatives, where the independent variable is categorical and the dependent variable scalar, especially if the scale data distribution is parametric [
43]. The presumptions of the ANOVA test are summarized in [
44]. The application of the ANOVA test procedure starts with the hypothesis that all factor groups have equal mean values. The total sum of squares (TSS) represents the variability in the dataset, which is further divided into two components, the variability (sum of squares) between groups and variability (sum of squares) within the groups. The F-value, determined as the ratio of factor effects to error level, is defined for every available factor. The
p-value outlines the probability of obtaining the F-value when the factor is not significant and quantifies the risk of incorrectly rejecting the null hypothesis and categorizing an evident effect when it is. The statistical significance threshold for the p-value is commonly set at 5% [
45]. The ANOVA is generally used to test if the means of three or more independent groups of continuous data vary significantly concerning a single factor (one-way ANOVA) or two factors (two-way ANOVA). The additional function of the ANOVA is the ability to test the interaction factor, so it examines whether the effects of one factor on the response variable depend on the level of a second factor [
46]. For the selected part of this comprehensive research, the authors used two-way ANOVA because more than one independent variable was tested. As the ANOVA test compares the means between groups, it fails to identify which particular pairs of means are significant. As for this deficiency, the ANOVA test is usually applied with multiple comparison techniques, while the most frequently used is the Tukey honestly significant difference (HSD) test. The Tukey test is a post hoc test, meaning the variable comparison is applied after the data have been collected [
47]. Two-way ANOVA with Tukey post hoc was used in this paper as a control measure of the selected independent and dependent data. The parametric data analysis was employed to determine whether there was a significant difference between place of residence and level of education, selected as independent variables, and the noise intensity at three different times of the day (day, evening, and night), as a dependent variable. These parameters were selected due to the respondents belonging to different districts examined and the possibility of different results among groups concerning the education status of respondents, primarily for the lack of prior knowledge in determining the noise levels. The purpose was to examine whether the parameters of the place of residence and education level individually and commonly impacted the assessed noise intensity for three different times of the day (day, evening, and night). The significance level was set to 0.05. For this analysis, three null hypotheses were assumed, as follows:
H0—The respondents’ place of residence has no significant effect on the assessed noise intensity (day, night, evening).
H0—The respondents’ level of education has no significant effect on the assessed noise intensity (day, night, evening).
H0—The interaction between the place of residence and level of education has no significant effect on the assessed noise intensity (day, night, evening).
Finally, the authors used several tools for data calculation: Microsoft Excel for the Pareto and regression analysis, and IBM® SPSS® Statistics for Windows, Version 21.0 for the two-way ANOVA with Tukey post hoc test.