1. Introduction
Water demand forecasting is an important issue in the field of water management worldwide [
1]. Water scarcity is a growing threat to humankind and researchers have made many efforts, proposing solutions such as water treatment [
2,
3], water desalination [
4], and optimization of water management systems [
5] to compensate for the scarcity. Showers are common water consumption behaviors. In China, university students often take showers in shared shower rooms and pay water fees using their campus cards. It is necessary for shared shower room managers to forecast the bath water demand (BWD) using parameters such as the day of the week, hours of opening, weather, and holidays. Therefore, short-term BWD forecasting is key for efficient campus water management systems. Good operational and strategic decisions can help improve water distribution performance [
3,
6]; however, traditional time-series forecasting methods highlight the role of time without considering the effects of external factors, including meteorological factors, such as temperature, wind, and precipitation, and socioeconomic factors, such as population characteristics. Using machine learning (ML) to extract useful information from data provided by smart campuses, smart cities, retail, and industrial industries has recently gained popularity [
7,
8,
9]. Businesses use ML to simplify mundane tasks, gain a competitive edge in the market, and increase earnings. The smart campus is a developed and competitive campus environment that integrates work, study, and living. It is built on the Internet of Things (IoT) [
10]. Smart campuses have embraced the IoT to automate numerous scenarios, and the data created by these devices is uploaded (reported) to a cloud server for flow management and data analysis over the Internet [
11,
12].
In this study, our analysis was based on data collected from the shared shower rooms of Capital Normal University (CNU), Beijing, China. Similarly to many campuses in North China, the use of shared shower rooms open to students is more frequent in the afternoons and evenings (especially 3:00–10:00 p.m.) at CNU. Therefore, we selected autoregressive integrated moving average (ARIMA), ARIMA with exogenous input (ARIMAX), random forests (RF), long short-term memory (LSTM), and neural basis expansion analysis for interpretable time series forecasting (N-BEATS) to build short-term BWD forecasting models and applied these to shower buildings at CNU. Furthermore, we investigated the performance of demand forecasting in female and male shower areas. To the best of our knowledge, this is the first empirical study to use a forecasting technique to develop and evaluate the performance of machine learning techniques for the forecasting of BWD in shared shower rooms, considering the majority of articles [
13,
14,
15] that conduct similar analyses often refer to total water demand. In addition, our findings in this study imply that there is a potential for energy savings and the decision making for bath water management may be used to promote energy savings in the future based on accurate BWD forecasts.
2. Literature Review
Accurate water demand forecasting helps ensure the security, stability, and economic operation of smart campuses. It is also advantageous in planning reasonable maintenance arrangements. Many factors can directly or indirectly influence BWD, including the variables of weather such as rainfall, temperature, and air quality, as well as other factors such as class schedules, weekends, and national holidays. Climate variables, in particular, have been frequently used as inputs to multivariate statistical models and machine learning approaches for modeling and predicting urban water time series [
16,
17,
18,
19].
According to Koo et al. [
20], there are no universal methods for water demand forecasting, and forecasting time periods can be categorized as short term (minutes, hourly, daily, or weekly) [
20,
21], medium term (yearly, up to 24 months) [
22], or long term (2-years, 10-years) [
23,
24]. Short-term BWD forecasting is beneficial for operational and managerial decision making, which can decrease overall energy consumption and increase the bath water quality (especially temperature). ML techniques such as LSTM [
25], support vector machine [
26] and random forest [
21] have been widely employed to forecast short-term urban water demand. Accurate and dependable BWD forecasting is critical for bath water management systems and will aid in numerous elements of short-term planning and decision making (e.g., the planning of water boiling and pumping). This requires an accurate and dependable mechanism for BWD forecasting.
Forecasting water demand is a burgeoning subject of research. Numerous researchers have used both traditional statistical models and machine learning approaches to forecast water demand. In recent years, increasingly advanced ML techniques and toolkits have been created to address forecasting issues [
27]. In the 2000s, shared showers in universities in North China began to automate their bath water supply using bath water management systems. The control logic of the early bath water management systems was rather simple: they recorded the time of shower room attendance but did not limit bath water consumption, resulting in inefficient energy and water resource consumption. However, the recent approaches employed by universities, such as IoT devices and smart campus cards [
11], are more accurate and smarter in controlling shower behaviors. Thus, with the accumulation of bath water consumption data, the operation of the system can be optimized using forecasting techniques. Usually, the operators of shared shower rooms make BWD forecasts for the next day based on their experience and plan bath water boiling and flow control actions accordingly, which has always failed to make accurate forecasts. Accurate short-term BWD forecasting can help to minimize heating and pumping costs while also increasing consumer satisfaction. Numerous studies have suggested forecasting short-term water demand using classic statistics, machine learning, and deep neural network (DNN) models.
Linear models were among the first to be widely employed in forecasting water demand. According to Do et al. [
28], an online demand multiplier particle filter can be used to forecast real-time water demand. The ARIMA model was used by Kofinas et al. [
29] to forecast water demand in cities, with good performance forecasting the monthly average urban water demand. Wong et al. [
30] forecasted Hong Kong’s daily water consumption using a correlation analysis of meteorological data and calendar impacts. For water demand forecasting, Hutton and Kapelan [
31] found that using the repeated Bayesian likelihood model improved forecasting accuracy. Quevedo et al. [
32] assessed the effects of seasonal ARIMA (SARIMA) and exponential smoothing models that took calendar effects into account and showed that they were superior in forecasting water demand when temporal and daily periods were considered. Furthermore, Patcha et al. [
33] demonstrated that the ARIMAX model with dew point depression and average temperature input plays an important role in forecasting long-term water consumption rates in Las Vegas.
Candelieri et al. [
34] applied a support vector machine (SVM) to forecast water demand and achieved high generalization ability and efficiency. Brentan et al. [
35] forecasted water demand using a mix of SVM and adaptive Fourier series and obtained better results than SVM alone. Moreover, ML models, RF models [
36], and extreme learning machine models [
37] were seen to be more beneficial than statistical models.
However, studies demonstrate that linear regression methods, when compared to nonlinear regression models, have certain shortcomings in water demand forecasting due to the complexity and nonlinear realities of water demand [
32,
37,
38]. Recent research has showed a strong interest in using neural network models to solve time series forecasting challenges. Neural networks are composed of many layers of computing units (neurons) that are connected by connections between the neurons in a layer [
39]. A neuron in a network transforms data by performing the following computations: multiplying an initial value by a weight, adding the result to additional input values, adjusting the resulting number for the neuron’s bias, and lastly normalizing the output using an activation function [
40]. After all connections are examined, the bias is a neuron-specific number that has an impact on the neuron’s value, and the activation function ensures that values are passed on within a configurable, predicted range. This procedure is repeated until the final output layer is capable of providing regression scores or predictions. All neurons in a particular layer provide an output; their weights are not identical to those in the following neuron layer. This implies that if a neuron on a layer detects a certain pattern, the whole image may suffer and the neuron may be partially or fully muted. A large weight indicates that the input is significant, whereas a lower weight indicates that it should be ignored. As a result, neural networks should be treated as complex systems that reveal complex behavior; rather than the neurons themselves, it is the interactions between the neurons that enable the network to learn.
To develop short-term forecasts of water consumption, Vijai et al. [
41] evaluated DNN models with machine-learning techniques. Xenochristou et al. [
42] compared forecasts of daily water consumption using a stacked model and a DNN model. Koo et al. [
20] analyzed the performance evaluations of LSTM and ARIMA with those of ML models for distinguishing water usage in Korea, and found that the former performed better. In their study, Kuehnert et al. [
25] explored the usefulness of LSTM models for water demand forecasting and showed that LSTM models outperformed the system in operation by a large margin. The cutting-edge N-BEATS model has demonstrated outstanding performance on large-scale time-series challenges. Boris et al. [
22], for example, utilized N-BEATS to forecast mid-term electrical usage and showed that it outperformed statistical and machine learning approaches.
Previous studies [
15,
29,
43,
44,
45,
46,
47,
48] on water demand forecasting have mainly focused on total water consumption in urban or residential areas, given that a high proportion of bath water is hot water, and conservation of bath water has energy and greenhouse gas conservation benefits [
49]. While several researchers have addressed and built water demand forecasting models, there is a dearth of study on the performance evaluation of short-term BWD forecasting in shared shower rooms.
5. Conclusions
This study explored the potential of the ARIMA, ARIMAX, RF, LSTM, and N-BEATS models for producing improved BWD forecasts in shared shower rooms for improving management efficiency, reducing energy and operating costs, and increasing student satisfaction. Calendar information, meteorological variables, and the number of students who took lunch were utilized as covariates to enhance the models’ accuracy. The following are the conclusions of BWD forecasting with machine learning models:
- (1)
All models achieved good forecasting performance on daily total BWD in terms of accuracy. The management level of shared shower rooms is improved with accurate BWD forecasting results. Hence, the cost of heating and pumping bath water can be reduced. Furthermore, there is a large potential for energy savings as a consequence of accurate BWD forecasting in advance.
- (2)
DNN models outperformed statistical models for daily and hourly BWD forecasting, whereas the LSTM models outperformed other models for high-resolution forecasting tasks.
- (3)
In the event of a malfunction or sensor failure, missing data can be created using machine learning models with little resources and time utilizing historical data.
- (4)
ML techniques can make campuses smarter, such as by forecasting canteen attendance and network flow consumption.
In summary, ML models can be applied to develop forecasting systems for smart campuses. In the future, we will work to fetch more external factors that affect BWD and obtain more BWD data to obtain improved forecasting performance.