1. Introduction
In recent decades, tourism has become one of the most important and significant sectors in the economies of many countries. In fact, the sector accounts for 10–12% of the world’s Gross Domestic Product (GDP) and approximately 14% of total employment [
1]. Even though tourism can sustain high levels of employment, the sector is a source of environmental impacts with consequent public health concerns [
2]. One of the most significant impacts of tourism is the generation of municipal waste, which increases as the seasonal population of tourists rises [
3,
4]. The heterogeneous nature of hospitality waste poses a significant risk to water and air quality, and is generally liable to cause various health hazards if not properly managed.
From a sustainability perspective, one approach to the reduction of the threatening environmental and health impacts from hospitality generated wastes is the conversion to useful value-added or alternative products. In this regard, several governments have launched policies to promote the conversion of municipal waste to green precursors or products, many with a specific focus on green fertilizers, bioelectricity, biofuels or bioadsorbents [
5,
6,
7,
8,
9].
However, to manage and reutilize hospitality sector waste (HSW) in a sustainable way, accurate prediction of HSW generation rate and composition is important [
10,
11,
12]. A failure to make accurate HSW predictions and assessments could lead to several widespread problems in waste management systems and the environment, including irrelevant policies, increased environmental impacts as well as inadequate or overestimated capacity of disposal infrastructures. Inefficient disposal or waste management infrastructure can cause serious impacts on health [
1,
4]. Specifically, improperly managed, designed and operated recycling/incineration plants cause air pollution or spread of disease. For instance, hotel kitchen waste ferments after a short time, creating conditions favourable to the growth and survival of microbial pathogens and resulting in the spread of infectious diseases. Also, spent cooking oil is a common hospitality waste; note that unattended spent cooking oil attracts flies, vermin and rats, which could create a health hazard and pest control problem.
Cyprus, politically partitioned into two main parts (south and north), is a major tourist destination in the Mediterranean region. Comparatively speaking, recent years have seen tourism growing at a faster rate in North Cyprus (formally the Turkish Republic of North Cyprus (TRNC)). Meanwhile, in TRNC, the available statistical information regarding waste generation from hospitality industry demonstrates a lack of sufficient reliable data per hospitality facility; hence, it is difficult to develop accurate forecasting systems.
According to the TRNC Hoteliers Association, there was nearly an 83% and 68% bed occupancy rate in the peak and lean seasons, respectively, in 2014–2016. The increasing inflow of tourists in the first quarter of 2017 indicated that the occupancy rate is expected to increase by 6–8% in the peak season of 2017, subsequently leading to more HSW. Of concern is the lack of studies that quantify the magnitude of waste generated in the accommodation sector of TRNC and the subsequent effect of this problem on the environment.
To mitigate the impact of HSW on the ecosystem, we need reliable data concerning HSW generation. Meanwhile, the process of predicting HSW generation is challenging and often intensified by uncontrollable parameters [
10,
13]. In recent years, various conventional, regression, non-algorithm and descriptive statistical methods of forecasting municipal solid waste (MSW) generation have been reported [
13,
14,
15,
16].
However, there are limited data concerning the forecasting of HSW generation in the peak and lean seasons as well as an optimal prediction model for this purpose. Hence, this paper tries to contribute to filling the mentioned gaps in the HSW generation rates, specifically in TRNC. The outcome of this research is expected to help policymakers and accommodation sector owners to initiate sustainable waste management practices.
In this study, multiple linear regression (MLR), central composite design (CCD) and artificial neural network (ANN) models were applied in predicting the rate of hospitality sector waste generation. Among these methods, MLR is widely applied to forecasting waste generation due to its simple algorithm and well-developed statistical theory [
15]. However, MLR can neither adapt to new situations nor learn from new data; its precision is poor when imprecise data are utilised and it rarely considers all factors affecting waste generation [
12,
17,
18].
CCD under response surface methodology is a combination of a statistical and mathematical technique for empirical modelling of complex problems in which the response of interest is influenced by several independent variables. CCD considers the interaction effects between the operational parameters to produce high prediction accuracy on complex nonlinear systems [
19]. To the best of the authors’ knowledge, there are no reported data on the application of CCD to forecasting waste generation. ANN is a brain neuron-inspired data-driven technique that can directly learn linear and nonlinear relationships between variables from a set of data compared to the conventional forecasting techniques [
20,
21,
22].
The strengths and weaknesses of the proposed models were elucidated and an optimal prediction model was established based on conformity with the actual dataset and sensitivity analyses. To date, most studies in this field have focused specifically on the prediction of the total municipal solid waste (MSW) generation rate without considering the interactive effects of the influencing factors (viz., waste management practices, nationality of tourists, nature of waste generated and actual sources of waste in the hospitality facility) to manage HSW sustainably. This paper is written under the belief that the prediction of the amount of HSW produced will be helpful in the stages of transportation, storage, disposal and reutilization and, thus contribute to a sustainable tourism management.
2. Research Methodology
2.1. Research Area and Dataset
Given that the purpose of this research is to predict waste generation rates in the accommodation sector and explore the effects of variables contributing to the waste generation rates, a quantitative approach was employed. Three districts, Nicosia, Famagusta and Girne, were selected to assess the waste generation in the accommodation sectors of TRNC according to the concentrated tourism activities in these districts.
A total of 22 accommodation options, including non-starred guesthouses and large, medium and small hotels, were investigated in this study. Seventy-five percent of these facilities are situated in Girne (a tourism hub), 18% in Nicosia (the capital city) and 7% in Famagusta (a port and student city). The investigated facilities were composed of 36%, 30%, 27% and 7% small, large and medium hotels and guesthouses, respectively. The tourism activities in TRNC remain active seasonally, with the fewest activities taking place in winter (the lean season) and most taking place in summer (the peak season).
A pilot study was conducted to minimise ambiguity in the sampling questions. Also, prior to data collection, the management of the accommodations were assured of confidentiality to minimise the social desirability bias and ensure the accuracy and credibility of the sample data. The data from daily waste generated were collected randomly over a specified period of each month of the lean and peak tourism seasons. We calculated the average daily and yearly generation rate per room and sub-units of the accommodation.
2.2. Model Development and Description of the Input Parameters
The MLR, CCD and ANN as linear, quadratic and non-algorithmic models, respectively, were used to predict the waste generation rates in the hospitality sector of TRNC. To train and test the models, a 3-fold cross-validation procedure was employed to avoid any possible desirability bias. Hence, the average results of three different simulations were compared with the actual data and reported herein.
Among the different parameters that affect the generation rate of hospitality waste, five independent parameters were selected as the most effective ones, including the nationality of tourists visiting the investigated facilities, the nature of waste management practices in each facility, the type of waste generated, the seasonal flow and the type of the accommodation. These parameters were encoded as presented in
Table 1.
2.3. Multiple Linear Regression Analysis
The multiple linear regression (MLR) as a predictive analysis, attempts to explain the relationship between a dependent variable and two or more explanatory variables. The MLR model for predicting the HSW generation can be described as follows:
The predicted value of HSW generated is represented by the dependent variable y, x1, …, xn represent the five independent variables in this study, and β0, …, βn denote the impact of each independent variable on the response variable.
2.4. Principle of Central Composite Design
Central composite design (CCD) is an efficient approach for modelling complex problems in which the responses are influenced by various independent variables. Hence, we can minimise time consumption and reduce experimental complexities [
14]. Herein, the SigmaXL software (Ver 7.0, Ontario, Canada) was employed to generate 5-level-5-factors CCD) matrix. Five independent variables, viz., nationality (A), accommodation type (B), season (C), type of waste (D), and waste management practice (E), were selected based on pilot studies and literature reports to assess their effects on the waste generation rates (WGR).
The independent variables were coded into two levels, low (−1) and high (+1), and the axial points are coded as (+α) and (−α). The total number of experimental data runs generated from the CCD is 44, obtained according to Equation (2):
where
N is the total number of runs required,
x is the number of variables and
xr is the repeated runs.
The range of the chosen independent variables, with actual and coded levels, is presented in
Table 2, where only the most influential runs were selected out of 44 experimental runs. The factorial design comprises 32 full factorials, 10 axial points and two repeated runs, which resulted in an orthogonal distribution of 44 experiments. The experiments were run randomly to minimise errors due to the systematic trends in the factors. A quadratic polynomial regression model was recognised to evaluate and quantify the influence of the variables on the responses obtained from the experiments: The data obtained from the experimental design were utilised to generate a polynomial equation that was analysed to quantify the influence of the variables on the waste generation rates (%).
The results thereafter were subjected to analysis of variance (ANOVA). The ANOVA was applied to evaluate and model the relationship between the response variable (waste generation rates (WGR (%)) and the independent variables, also to test the significance and the adequacy of the model. The efficiency of the quadratic polynomial model was articulated based on coefficients of determination (R2), predicted R2 and adjusted R2. The statistical significance of the model was verified with Fisher variation ratio (F-value), the probability value (Prob > F) with 95% confidence level and adequate precision.
2.5. Principle of Artificial Neural Network Model
In the late 1990s, the ANN methodology was introduced to tourism forecasting [
21]. ANN is a bio-inspired computational processing system akin to the vast network of brain neurons [
7]. Lately, research activities in forecasting with ANN have indicated that it can be a promising substitute for conventional linear methods. ANN is highly attractive due to its remarkable characteristics, pertinent particularly to noise and fault tolerance, high parallelism, learning and generalisation capabilities, and nonlinearity [
19,
20,
21,
22,
23].
The typical ANN architecture is organised in three distinct layers (input, one or more hidden, and an output) containing nodes that are interconnected by weighted synapses. The network structure changes based on the input and output information that flows through it. The independent problem variables are represented in the input layer nodes; the nodes in the hidden layer add an internal representation of non-linear data to the network and the output layer of the ANN is the solution to the problem [
21,
24].
The relationship between the output (O) and the inputs (I1, I2, I3…., Ip) is represented mathematically as follows [
25]:
where
Ox (
x = 1, 2, 3, 4, …) is the output variable;
wj and
Wji (
j = 1, 2, 3, …,
n;
i = 0, 1, 2, 3, …,
m) are connection weights;
m and
n represent the number of input and hidden nodes, respectively. The
f corresponds to the sigmoidal activation function;
bx,i and
B0j represent the bias terms associated with each input, output and hidden layer nodes, respectively.
In this study, MATLAB R2017a software (MathWorks, Inc., Natick, MA, USA) was utilised to predict the waste generate rates (WGR) of various classes of accommodation sectors in TRNC. A multilayer ANN architecture was utilized and bias neurons were added to each layer to avoid network collapse. The connecting weights were randomly chosen and changed through the training procedure to obtain the minimised mean squared error (MSE). The developed ANN architecture was utilised to investigate the association between inputs and output (waste generation rates), as depicted in
Figure 1.
2.6. Model Performance Evaluation
To evaluate the prediction performance of the models, four statistical indices were applied; the hybrid fractional error function (HYBRID), standard error of prediction (SEP), mean absolute error (MAE) and correlation coefficient (
R2) values were derived using the following equations:
where
n is the number of observations,
wo is the observed values of rate of waste generation for type
t,
p is the number of independent parameters,
wo’ is the average of HSW generation and
wp is the predicted value of HSW generation for type
t.
R2 measures the closeness of the observed data to the predicted data, MAE is a statistical quantity that measures how close predictions are to the eventual outcomes, and SEP is a measure of the accuracy of the predictions. The smaller the value of the error indices for a specified model, the higher the prediction performance of the model [
20,
21].
4. Conclusions
The accommodation sector is an essential component of the tourism and travel business. It is worth mentioning that increases in hospitality sector operations result in increased quantities of municipal waste, constituting ecosystem damage and a significant increase in the environmental footprint. To curtail the ugly face of tourism activities, precise prediction of the quantity of hospitality waste generated is required to enable the development of an integrated waste management and reutilization system. Note that inaccurate prediction of hospitality waste generated may result in a negative impact on the environment.
For the first time, this study has shown that municipal waste from hospitality facilities can be forecasted by considering measurable and effective parameters via an artificial neural network-inspired forecasting model. The hospitality waste generation rates were analysed based on three categories: recyclable, general waste and food residue. ANN, CCD and MLR were employed to predict the average HSW generation rate using nationality, type of waste, season, accommodation type, and type of waste management practices as predictors. These predictors were selected based on the correlation test and Cronbach’s alpha of 0.93. The results showed that 4159.9 kg (recyclable: 58.5%, general waste: 23.6% and food residue: 17.9%) and 2063.4 kg (recyclable: 33.6%, general waste: 18.5% and food residue: 47.9%) of waste were generated during the peak and lean season from the 22 hospitality facilities investigated, respectively.
Importantly, the use of the ANN model to predict the average HSW generation rate led to reliable results and the difference between the observed and predicted values was not statistically significant. However, the MLR model demonstrated lower prediction accuracy compared to CCD. It was found that Turkish tourists generated more waste (19.16% WGR/day) in large hotels compared with the British (15.1% WGR/day), and Asians generated the least average waste (20.96%) in all the facilities investigated. The findings of this study imply the need for further research to investigate the possible sources of the waste and factors limiting hotels from managing the waste effectively. In conclusion, the results herein are promising and would be useful in establishing a sustainable waste management plans.