Next Article in Journal
On Fejér Type Inequalities via (p,q)-Calculus
Next Article in Special Issue
Preserving the Shape of Functions by Applying Multidimensional Schoenberg-Type Operators
Previous Article in Journal
New Geometric Constants in Banach Spaces Related to the Inscribed Equilateral Triangles of Unit Balls
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Simulation and Analysis of Indoor Air Quality in Florida Using Time Series Regression (TSR) and Artificial Neural Networks (ANN) Models

1
UrbSys (Urban Building Energy, Sensing, Controls, Big Data Analysis, and Visualization) Laboratory, M.E. Rinker, Sr. School of Construction Management, University of Florida, Gainesville, FL 32603, USA
2
School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an 710049, China
*
Author to whom correspondence should be addressed.
Symmetry 2021, 13(6), 952; https://doi.org/10.3390/sym13060952
Submission received: 6 May 2021 / Revised: 14 May 2021 / Accepted: 26 May 2021 / Published: 27 May 2021

Abstract

:
Exposures to air pollutants have been associated with various acute respiratory diseases and detrimental human health. Analysis and further interpretation of air pollutant patterns are correspondingly important as monitoring them. In the present study, the 24-h and four-month indoor and outdoor PM2.5, PM10, NO2, relative humidity, and temperature were measured simultaneously for a laboratory in Gainesville city, Florida. The indoor PM2.5, PM10, and NO2 concentrations were predicted using multiple linear regression (MLR), time series regression (TSR), and artificial neural networks (ANN) models. The modeling conducted in this study aims to perform a cross comparison study between these models in a symmetric environment. The value of root-mean-square error was improved by 18.33% in comparison with the MLR model. In addition, the value of the coefficient of determination was improved by 24.68%. The ANN model had the best performance and could predict the target air pollutants at 10-min intervals of the studied building with 90% accuracy levels. The TSR model showed slightly better performance compared to the MLR model. These results can be accordingly referred for studies analyzing indoor air quality in similar building types and climate zones.

1. Introduction

Indoor air quality (IAQ) deterioration has been linked to the increased health risk of sick building syndrome (SBS) and building-related illness (BRI) [1,2]. People spend most of their daily lives in indoor environments, even though indoor air pollutant concentrations are on average three to five times higher than outdoors [3,4,5]. The target air pollutants selected in this study were particulate matter (PM2.5, PM10) and nitrogen dioxide (NO2). They are the most common indoor air pollutants defined in both the United States Environmental Protection Agency (US EPA) and the World Health Organization (WHO) standards [6,7,8]. Fine particle matter is mainly generated from incomplete combustion, vehicle emissions, and dust [1,6,9]. The most common source of reddish-brown gas (NO2) is the combustion of fossil fuel [1,6,10,11]. Effective analysis and prediction of indoor air pollutant patterns are correspondingly critical as monitoring them.
In recent years, laser-based semiconductor techniques and low-cost air pollution monitoring techniques have rapidly surged [12,13,14]. There has been growing interest in determining the relationships between indoor and outdoor air quality using real-time measured data. Various prediction models have been developed to predict the IAQ based on environmental parameters. For example, Hassanvand et al. [15] and Chithra et al. [16] developed linear regression models to study and forecast the mass concentrations of indoor and outdoor particulate matter (PM10, PM2.5, and PM1) in a residential house and a school building, respectively. Mohammadyan et al. [17] applied multiple regression models using indoor and outdoor environmental data (fine particles, temperature, and relative humidity). The model performed moderately predictive for the measured indoor pollutants. Jafta et al. [18] developed leave-one-out cross-validation-based multivariate models using PM2.5 concentrations, meteorological information, and household human activities as inputs, while Shupler et al. [19] added housing material, season as one of the inputs. Kim et al. [20] used principal component analysis (PCA)-based linear models to predict indoor particulate matter (PM) by performing dimension reduction. Tong et al. [21] reported that the linear mixed-effects regression model showed moderately predictive for indoor PM2.5, and the fitted model resulted that outdoor PM2.5, relative humidity, and indoor PM2.5 were positively associated. Yuchi et al. [22] and Lin et al. [23] reported that random forest-based machine learning predictive models and multiple linear regression models showed similar performance when predicting levels of certain indoor pollutants. Previous literature review reveals that most studies predicting indoor air quality focused on outdoor concentrations and human behaviors. Although most of the measured data are ordered by time, and many observations showed potential time delay between outdoor pollutants leak and the indoor concentrations [24,25,26,27]. However, to our knowledge, no study of relationships between indoor and outdoor air quality has attempted to include time-delay effects within the variables in regression analysis. Besides, artificial neural networks (ANN) have gained attention by a few researchers in recent years for indoor air quality forecasting [28,29,30,31,32,33]. Park et al. [25], Challoner et al. [34], and Saad et al. [35] reported that ANN-predicted indoor pollutant concentrations showed better performance than other regression models. Conversely, Liu et al. [31] developed nonlinear regression models that predicted IAQ levels showed higher performance over other prediction models (partial least squares, backpropagation ANN, and least-squares).
This study performed numerical simulations with monitored indoor and outdoor environmental data, coupled with the multiple linear regression model, the time series regression model, and the ANN-based model, to determine the relationships between indoor and outdoor air quality. Furthermore, these models are expected to predict the indoor air pollutant concentrations in a laboratory building in Florida. The objective of this paper is to perform a cross comparison study between multiple linear regression (MLR), time series regression (TSR), and non-linear ANN models. The hypothesis of this study is that the real-time monitored air quality data can fit better under non-linear models such as ANN compared to linear prediction models. The outline of this paper is as follows. In Section 2, the sampling protocols and methods of TSR and ANN are introduced. In Section 3, results of a cross-comparison of the aforementioned models are presented. The final section addresses the conclusion of this article.

2. Materials and Methods

2.1. Sampling Site and Sampling Protocol

The measurement of the test building was conducted between 17 May 2020 (Sunday) and 18 September 2020 (Friday) in Gainesville city, Florida, United States. The building chosen for air quality monitoring is mechanically conditioned throughout the year, and the windows are closed to meet the American Society of Heating, Refrigerating, and Air-Conditioning Engineers (ASHRAE) Standard 62.1-2019 [36]. The laboratory building mainly functions as an office space with a 321 sq. ft floor area coupled with a single ducted air handling unit, which can provide a PM2.5 removal efficiency of 25% [37]. The standardized US EPA protocol for characterizing IAQ in large office buildings is followed to reduce measurement uncertainty [12,38]. A four-month indoor and outdoor air quality measurement was carried out with 10 min sampling intervals for 24 h continuously. Two US EPA (Air Quality Sensor Performance Evaluation Center) qualified monitors (Air Quality Egg, Version-2018) were used to measure the concentration of indoor and outdoor PM2.5, PM10, and NO2, as well as relative humidity and temperature (RHT) [11,39,40]. The basic specifications of the monitor are shown in Table 1. The indoor monitor was placed about 3.6 feet above the floor, 1.5 feet away from an interior wall. Outdoor measurement was conducted simultaneously with indoors. A weatherproofed monitor was set up 5 feet above the surface of the deck outside the room (Figure 1).

2.2. Multiple Linear Regression Model

The monitored data are extracted and subjected to descriptive statistics such as mean, standard deviation, indoor/outdoor ratio (I/O), and variance using Anaconda Python and Jupyter Notebooks [41]. Multiple linear regressions were performed to assess and predict the relationship between indoor air pollutants (PM2.5, PM10, and NO2) and other measured environmental parameters. A multiple linear regression model can be expressed as Equation (1) [22,42,43]:
Y i n = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + + β p 1 x p 1 + ε
where Y i n is the dependent variable, β = ( β 0 ,   β 1 , ,   β p 1 ) are regression coefficients, x = x 1 ,   x 2 ,   x p 1 are the independent variables, and ε is the random error term. The performance of developed MLR models was examined by calculating the coefficient of determination (R2), and root means square error (RMSE). The performance indicators were calculated as Equations (2) and (3) [44,45,46]:
R 2 = i n y ^ i y ¯ i 2 i n y i y ¯ 2
RMSE = i n y ^ i y i 2 n
where i is the prediction variable at present while y ^ i is the predicted value at each time, y i is the measured NO2 (PM2.5 or PM10) concentration, y ¯ is the average value of measured NO2 (PM2.5 or PM10) concentrations.

2.3. Time Series Regression Model

To predict the dependency between recorded variables when considering time-lag and time-accumulation effects, the TSR models were developed using MATLAB R2020a [47,48]. The autoregressive (AR) approaches were implemented for forecasting based on the same datasets used in the MLR models [49,50]. In an AR method, each variable is a linear function of the past values of itself and other previously recorded values [50,51,52]. The inputs of the model were the indoor and outdoor levels of temperature (°F), relative humidity (%), PM2.5 (µg/m3), PM10 (µg/m3), and NO2 (ppb). The indoor PM2.5, PM10, or NO2 of the laboratory building was considered as the output. The input variables were then slowly increased by adding the target pollutant of previous minutes (10 min intervals) to find the best-fit model (See Figure 2). To evaluate the performance of the TSR models and avoid the biased estimation, we computed RMSE between simulated and real-time measured data [29,44].
From Figure 2a,b, the best results were found by the implementation of the past 250 min observed value of indoor PM2.5 and indoor PM10, respectively. Figure 2c shows the best results were found by the implementation of the past 470 min observed value of indoor NO2. The following are the obtained equations for the TSR model [49,50]:
PM 2.5 _ in = k = 0 K a k PM 2.5 _ out t k + bT out t + cH out t + dPM 10 _ out t + eNO 2 _ out t +   fT in t + gH in t + hPM 10 _ in t + iNO 2 _ in t +   j
PM 10 _ in = k = 0 K a k PM 10 _ out t k + bT out t + cH out t + dPM 2.5 _ out t + eNO 2 _ out t + fT in t + gH in t + hPM 2.5 _ in t + iNO 2 _ in t + j
NO 2 _ in = k = 0 K a k NO 2 _ out t k + bT out t + cH out t + dPM 2.5 _ out t + ePM 10 _ out t + fT in t + gH in t + hPM 2.5 _ in t + iPM 10 _ in t + j
where a , b , c , d , e , f , g , h ,   i ,   and   j are the coefficient. k is the number of time intervals (10 min). T out t   and   T in t   are the outdoor and indoor temperature measured at time t, respectively. H out t   and   H in t   represent the outdoor and indoor relative humidity at time t. PM 2.5 _ out t ,   PM 10 _ out t ,   PM 2.5 _ in t ,   and   PM 10 _ in t   are the measured values of outdoor and indoor PM2.5 and PM10 at time t. NO 2 _ out t and   NO 2 _ in t represent the outdoor and indoor calibration of NO 2   (pbb) at time t.

2.4. Artificial Neural Networks Model

Artificial neural networks have been widely employed to investigate complex ambient-air processes. Many studies have predicted the concentrations of indoor air pollutants with ANN [33,35,46]. Multi-layer perceptron (MLP) network is one of the major types of ANN architectures [53]. Usually, the MLP network is consists of input, output, and multi-layers (including a hidden layer) [54]. Each layer is composed of several interconnected non-linear processing components called neurons or nodes [29,32]. The selected neurons were connected to other neurons in adjacent layers through adaptive synaptic coefficients [29,54]. In this study, two-layer feed-forward neural network models were developed using the neural network toolbox of MATLAB R2020a [48]. The same dataset used in the MLR and TSR approach, 18,144 time-series data for each input neuron used to train the ANN models. The logistic sigmoid transfer function and linear activation function were used to avoid the fitting problem between input and output datasets. The input dataset was normalized using min–max normalization. The ANN models were trained with the Levenberg–Marquardt (LM) based back-propagation (BP) algorithm [44,52,55]. The LM algorithm is one of the most effective algorithms which accelerates the convergence rate of the ANN with multilayer perceptron (MLP) architectures and reserves computational resources [35,46]. Studies have found the optimal number of hidden neurons to obtain the best network performance. In this study, the number of hidden layers was computed to be 10 from several iterations [52,54,55,56,57]. As shown in Figure 3, the input layer contains nine neurons of indoor and outdoor air quality parameters. The measured indoor concentrations of PM2.5, PM10, and NO2 were considered as the desired target of the ANN models. For each training, the database was randomized into three groups: 70% training, 15% validation, and 15% testing [32,54]. The optimum network architecture of the ANN models was developed with 10 hidden layers [52,56]. The performance of developed ANN models was examined by calculating the coefficient of determination (R2) and root means square error (RMSE) [44,46].

3. Results and Discussion

3.1. Measured Environmental Parameters

The descriptive statistics of measured indoor and outdoor environmental data are presented in Table 2 and Figure 4. The mean indoor and outdoor temperatures were 70.7 °F and 81.5 °F, respectively, while outdoor relative humidity had a slightly higher mean and a larger standard deviation than indoors. The average mean of indoor PM2.5, PM10, and NO2 was significantly lower indoors than outdoors. It shows that the indoor concentration of PM2.5 range was 0 μg/m3 to 38.5 μg/m3, indoor PM10 was 0 μg/m3 to 68 μg/m3, and NO2 was 72.3 ppb to 83.9 ppb. Figure 5 shows the plots of the four-month-long continuous records based on 10 min intervals. Most indoor PM2.5 and PM10 data met the 24 h safe levels in ASHRAE 62.1-2019 standard [36]. The indoor concentrations of both particulate matters followed a similar diurnal pattern as the outdoor concentrations. It can be seen from Figure 4c. The indoor NO2 data show a significantly stable pattern than the outdoors across the sampling period. The potential reasons for this trend might be due to most gaseous pollutants have infiltrated and a lack of indoor NO2 sources [1,12,58]. However, the majority of indoor NO2 data lie above the annual requirements of ASHRAE 62.1 standard (53 ppb) [11,36].

3.2. Correlation between Indoor and Outdoor Data

Simple linear regression and the nonlinear Spearman’s rank correlation were performed to extract the direct correlation between indoor data and their corresponding outdoor data. The dataset was also used to investigate the symmetry among indoor and outdoor time-series concentrations. The input dataset was normalized using standardized normalization. Figure 6a,b show the linear correlation and symmetric distributions between indoor and outdoor particulate matters (PM2.5 and PM10) with very coefficient of determination (R2_PM2.5 = 0.26; R2_PM10 = 0.24, p < 0.01). However, statistically moderate correlation (R_PM2.5 = 0.53; R_PM10 = 0.52) were found in the Spearman’s rank results. From Figure 7, Spearman’s coefficient shows a significant correlation coefficient (R_NO2 = 0.87) between indoor and outdoor NO2. Conversely, a very low correlation was found between indoor and outdoor NO2 in the linear regression analysis (Figure 6c). There were statistically significant mismatches between the non-linear Spearman’s coefficients and the linear correlation coefficients. The results also indicated that the univariate linear regression method might not accurately predict the indoor concentrations of PM2.5, PM10, and NO2. Since these results are not compelling, multivariate analysis and further validation should be considered for the IAQ prediction model.

3.3. Cross-Comparison of the MLR, TSR, and ANN Models in the Prediction of the Indoor PM2.5 and PM10

The developed TSR and ANN models were used to simulate the 10 min average indoor particulate matter concentrations (PM2.5 and PM10) of a laboratory building near downtown Gainesville city, Florida. To better evaluate the prediction performance of each model, their output results were compared to the results of conventional multiple linear regression (MLR). Figure 8 shows the original and simulated indoor PM2.5 by the time-series and developed models for the period of 17 May 2020 to 18 September 2020. It can be observed that the MLR, TSR, and ANN models can successfully predict the indoor PM2.5 concentrations and their trend. The adjusted p-values which are less than 0.001 were validated by using F-test. From the simulated data, the R2 values of the models (MLR, TSR, ANN) are above 0.9 (p < 0.001), and their MRSE values are less than 0.1 (p < 0.001) (Table 3). TSR and MLR had the same R2 values, but the RMSE value of TSR was 1.31% less than that of MLR. Additionally, the R2 value of the ANN model was found to be 0.08% higher than the MLR model, and the RMSE of ANN was lower than the MLR by 5.37%. Similar performance was observed in the case of indoor PM10 (Figure 9). MLR (R2 = 0.9985, p < 0.001), TSR (R2 = 0.9986, p < 0.001), ANN (R2 = 0.9995, p < 0.001) all successfully predicted indoor PM10 concentrations. The RMSE of the TSR model was lower than the MLR by 1.02%. The R2 value of the ANN model was 0.09% higher than the MLR, and the RMSE of ANN was 13.66% lower than the MLR. In summary, the ANN models had the best performance among all the models to predict indoor particulate matters (PM2.5 and PM10), and the TSR models had better performance than the MLR. A random forest-based ANN in a residential environment studying indoor PM2.5 concentrations found with R2 value and cross-validation R2 values of 0.93 and 0.65, respectively [59]. Park et al. [25] also conducted an ANN study in which the average value of indoor PM10 concentration had an average value of 0.62 and mean RMSE of 33.04 in Seoul metropolitan subway stations. These studies had different environmental conditions which might have caused alternate results compared to our study. Besides, several studies have proved that factors, such as building materials, indoor environment conditions, human activities, and others, may contribute to the indoor particulate matter concentrations [57,60,61,62,63,64].

3.4. Cross-Comparison of the MLR, TSR, and ANN Models in Prediction of the Indoor NO2

The predicted results of the developed models for indoor NO2 concentrations are illustrated in Figure 10 and Table 4. The adjusted p-values which are less than 0.001 were validated by using F-test. The validation of each developed model using 15% of the dataset showed that the R2 (p < 0.001) value of models were 0.7230 (MLR), 0.7233 (TSR), and 0.9014 (ANN), while the RMSEs (p < 0.001) were 3.8861, 3.8558, and 3.1737, respectively, indicating that the models successfully predicted indoor NO2 concentrations. The reason RMSE is higher in NO2 prediction models than particulate matter models can be ascribed to the high spatial variation for NO2 concentrations in the outdoor environment. However, again the ANN model shows the best performance in predicting the indoor NO2 levels comparing to the TSR and MLR. The R2 value (p < 0.001) of the ANN model was found to be 24.68% higher than the MLR model, and the RMSE (p < 0.001) of ANN was lower than the MLR by 18.33%. The TSR model gave slightly better performance compared to the MLR model. A study in an office room in a commercial building in Dublin showed that the prediction ability of R2 was 0.899 through ANN model compared to 0.11 through linear regression model [34].

4. Conclusions

The study measured the time-series data of concentrations of air pollutants (PM2.5, PM10, and NO2), as well as temperature and relative humidity for both indoor and outdoor of a research lab, for a duration of four months in the midst of the COVID-19 pandemic (Supplementary Materials). The real-time measured data were used to develop predictive models to analyze the relationship between pollutant concentrations along with its symmetry. The hypothesis was verified using the abovementioned predictive models. In addition, the results showed that there was a complex non-linear relationship between indoor and outdoor environmental parameters. Indoor air quality (PM2.5, PM10, and NO2) can be accurately estimated by the MLR, TSR, and ANN models. The comparison results showed that the ANN model had the best performance and could predict the PM2.5 and PM10 at 10 min intervals of the studied building with 90% accuracy levels. The TSR model had a similar performance with the MLR model when considering the time delay between indoor and corresponding outdoor air pollutants. In case of indoor PM2.5 and PM10 prediction, the R2 values under the MLR models were more than 0.95 (p < 0.01) and the RMSE values were less than 0.1 (p < 0.01). In comparison, ANN model prediction performance was better than the TSR model resulting in an increase in R2 value by 0.09% and decrease in RMSE value by 13.66%. In the case of indoor NO2 prediction, the mean R2 value under the MLR model was more than 0.7 (p < 0.01) and the RMSE value was less than 4. In comparison, ANN model prediction performance was better than the TSR model resulting in an increase in R2 value by 24.68% (p < 0.01) and decrease in RMSE value by 18.33%. The results in this study indicated that ANN should be predominantly used compared to linear regression models for future research involving time-symmetric environmental parameters. This study can be accordingly referred by building managers to make dynamic adjustments to indoor air quality in similar building types and climate zones. However, the accuracy of the prediction may be influenced by other uncertain parameters such as building type and indoor human activities. The building parameters can be included in the ANN model analyses for better understanding of the relationships between indoor and outdoor air quality.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/sym13060952/s1.

Author Contributions

Conceptualization, H.Z. and X.Y.; Data collection and analysis, H.Z.; Methodology, H.Z. and X.Y.; Validation, X.Y.; Writing—original draft, H.Z.; writing—review and editing, R.S.; supervision, R.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The data presented in this study are available in Supplementary Materials.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ahmed Abdul-Wahab, S.A.; En, S.C.F.; Elkamel, A.; Ahmadi, L.; Yetilmezsoy, K. A review of standards and guidelines set by international bodies for the parameters of indoor air quality. Atmos. Pollut. Res. 2015, 6, 751–767. [Google Scholar] [CrossRef]
  2. Saeki, Y.; Kadonosono, K.; Uchio, E. Clinical and allergological analysis of ocular manifestations of sick building syndrome. Clin. Ophthalmol. 2017, 11, 517–522. [Google Scholar] [CrossRef] [Green Version]
  3. Raysoni, A.U.; Stock, T.H.; Sarnat, J.A.; Chavez, M.C.; Sarnat, S.E.; Montoya, T.; Holguin, F.; Li, W.-W. Evaluation of VOC concentrations in indoor and outdoor microenvironments at near-road schools. Environ. Pollut. 2017, 231, 681–693. [Google Scholar] [CrossRef]
  4. Verriele, M.; Schoemaecker, C.; Hanoune, B.; Leclerc, N.; Germain, S.; Gaudion, V.; Locoge, N. The MERMAID study: Indoor and outdoor average pollutant concentrations in 10 low-energy school buildings in France. Indoor Air 2016, 26, 702–713. [Google Scholar] [CrossRef] [PubMed]
  5. WHO. World Health Statistics 2020: Monitoring Health for the SDGs, Sustainable Development Goals. Available online: https://www.who.int/publications/i/item/9789240005105 (accessed on 27 February 2021).
  6. NAAQS. Primary National Ambient Air Quality Standards (NAAQS) for Nitrogen Dioxide. Available online: https://www.epa.gov/no2-pollution/primary-national-ambient-air-quality-standards-naaqs-nitrogen-dioxide (accessed on 26 July 2019).
  7. US-EPA. Report on the Environment, What Are the Trends in Indoor Air Quality and their Effects on Human Health? Available online: https://www.epa.gov/report-environment/indoor-air-quality (accessed on 3 November 2020).
  8. WHO. Ambient Air Pollution Attributable Death Rate (per 100 000 Population). Available online: https://www.who.int/data/gho/data/indicators/indicator-details/GHO/ambient-air-pollution-attributable-death-rate-(per-100-000-population) (accessed on 26 February 2021).
  9. Song, P.; Wanga, L.; Hui, Y.; Li, R. PM2.5 Concentrations Indoors and Outdoors in Heavy Air Pollution Days in Winter. Procedia Eng. 2015, 121, 1902–1906. [Google Scholar] [CrossRef] [Green Version]
  10. Challoner, A.; Gill, L. Indoor/outdoor air pollution relationships in ten commercial buildings: PM2.5 and NO2. Build. Environ. 2014, 80, 159–173. [Google Scholar] [CrossRef]
  11. Zhang, H.; Srinivasan, R. A systematic review of air quality sensors, guidelines, and measurement studies for indoor air quality management. Sustainability 2020, 12, 9045. [Google Scholar] [CrossRef]
  12. Zhang, H.; Srinivasan, R.; Ganesan, V. Low cost, multi-pollutant sensing system using raspberry pi for indoor air quality monitoring. Sustainability 2021, 13, 370. [Google Scholar] [CrossRef]
  13. Yi, W.Y.; Lo, K.M.; Mak, T.; Leung, K.S.; Leung, Y.; Meng, M.L. A Survey of Wireless Sensor Network Based Air Pollution Monitoring Systems. Sensors 2015, 15, 31392–31427. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Castell, N.; Dauge, F.R.; Schneider, P.; Vogt, M.; Lerner, U.; Fishbain, B.; Broday, D.; Bartonova, A. Can commercial low-cost sensor platforms contribute to air quality monitoring and exposure estimates? Environ. Int. 2017, 99, 293–302. [Google Scholar] [CrossRef] [PubMed]
  15. Hassanvand, M.S.; Naddafi, K.; Faridi, S.; Arhami, M.; Nabizadeh, R.; Sowlat, M.H.; Pourpak, Z.; Rastkari, N.; Momeniha, F.; Kashani, H.; et al. Indoor/outdoor relationships of PM10, PM2.5, and PM1 mass concentrations and their water-soluble ions in a retirement home and a school dormitory. Atmos. Environ. 2014, 82, 375–382. [Google Scholar] [CrossRef]
  16. Chithra, V.S.; Shiva Nagendra, S.M. Impact of outdoor meteorology on indoor PM10, PM2.5 and PM1 concentrations in a naturally ventilated classroom. Urban Clim. 2014, 10, 77–91. [Google Scholar] [CrossRef]
  17. Mohammadyan, M.; Alizadeh-Larimi, A.; Etemadinejad, S.; Latif, M.T.; Heibati, B.; Yetilmezsoy, K.; Abdul-Wahab, S.A.; Dadvand, P. Particulate air pollution at schools: Indoor-outdoor relationship and determinants of indoor concentrations. Aerosol Air Qual. Res. 2017, 17, 857–864. [Google Scholar] [CrossRef]
  18. Jafta, N.; Barregard, L.; Jeena, P.M.; Naidoo, R.N. Indoor air quality of low and middle income urban households in Durban, South Africa. Environ. Res. 2017, 156, 47–56. [Google Scholar] [CrossRef] [PubMed]
  19. Shupler, M.; Godwin, W.; Frostad, J.; Gustafson, P.; Arku, R.E.; Brauer, M. Global estimation of exposure to fine particulate matter (PM2.5) from household air pollution. Environ. Int. 2018, 120, 354–363. [Google Scholar] [CrossRef]
  20. Kim, M.; Sankararao, B.; Kang, O.; Kim, J.; Yoo, C. Monitoring and prediction of indoor air quality (IAQ) in subway or metro systems using season dependent models. Energy Build. 2012, 46, 48–55. [Google Scholar] [CrossRef]
  21. Tong, X.; Ho, J.M.W.; Li, Z.; Lui, K.H.; Kwok, T.C.Y.; Tsoi, K.K.F.; Ho, K.F. Prediction model for air particulate matter levels in the households of elderly individuals in Hong Kong. Sci. Total Environ. 2020, 717, 135323. [Google Scholar] [CrossRef] [PubMed]
  22. Yuchi, W.; Gombojav, E.; Boldbaatar, B.; Galsuren, J.; Enkhmaa, S.; Beejin, B.; Naidan, G.; Ochir, C.; Legtseg, B.; Byambaa, T.; et al. Evaluation of random forest regression and multiple linear regression for predicting indoor fine particulate matter concentrations in a highly polluted city. Environ. Pollut. 2019, 245, 746–753. [Google Scholar] [CrossRef] [PubMed]
  23. Lin, B.; Huangfu, Y.; Lima, N.; Jobson, B.; Kirk, M.; O’Keeffe, P.; Pressley, S.N.; Walden, V.; Lamb, B.; Cook, D.J. Analyzing the relationship between human behavior and indoor air quality. J. Sens. Actuator Netw. 2017, 6, 13. [Google Scholar] [CrossRef] [Green Version]
  24. Chao, C.Y.; Wong, K.K. Residential indoor PM10 and PM2.5 in Hong Kong and the elemental composition. Atmos. Environ. 2002, 36, 265–277. [Google Scholar] [CrossRef]
  25. Park, S.; Kim, M.; Kim, M.; Namgung, H.G.; Kim, K.T.; Cho, K.H.; Kwon, S.B. Predicting PM10 concentration in Seoul metropolitan subway stations using artificial neural network (ANN). J. Hazard. Mater. 2018, 341, 75–82. [Google Scholar] [CrossRef]
  26. Goyal, R.; Khare, M. Indoor-outdoor concentrations of RSPM in classroom of a naturally ventilated school building near an urban traffic roadway. Atmos. Environ. 2009, 43, 6026–6038. [Google Scholar] [CrossRef]
  27. Deng, G.; Li, Z.; Wang, Z.; Gao, J.; Xu, Z.; Li, J.; Wang, Z. Indoor/outdoor relationship of PM2.5 concentration in typical buildings with and without air cleaning in Beijing. Indoor Built Environ. 2015, 26, 60–68. [Google Scholar] [CrossRef]
  28. Spinelle, L.; Gerboles, M.; Villani, M.G.; Aleixandre, M.; Bonavitacola, F. Field calibration of a cluster of low-cost commercially available sensors for air quality monitoring. Part B: NO, CO and CO2. Sens. Actuators B Chem. 2017, 238, 706–715. [Google Scholar] [CrossRef]
  29. Maleki, H.; Sorooshian, A.; Goudarzi, G.; Baboli, Z.; Tahmasebi Birgani, Y.; Rahmati, M. Air pollution prediction by using an artificial neural network model. Clean Technol. Environ. Policy 2019, 21, 1341–1352. [Google Scholar] [CrossRef] [PubMed]
  30. De Vito, S.; Piga, M.; Martinotto, L.; Di Francia, G. CO, NO2 and NOx urban pollution monitoring with on-field calibrated electronic nose by automatic bayesian regularization. Sens. Actuators B Chem. 2009, 143, 182–191. [Google Scholar] [CrossRef]
  31. Liu, H.; Yang, C.; Huang, M.; Wang, D.; Yoo, C.K. Modeling of subway indoor air quality using Gaussian process regression. J. Hazard. Mater. 2018, 359, 266–273. [Google Scholar] [CrossRef]
  32. Rahman, N.H.A.; Lee, M.H.; Suhartono; Latif, M.T. Artificial neural networks and fuzzy time series forecasting: An application to air quality. Qual. Quant. 2015, 49, 2633–2647. [Google Scholar] [CrossRef]
  33. Oh, H.J.; Kim, J. Monitoring air quality and estimation of personal exposure to particulate matter using an indoor model and artificial neural network. Sustainability 2020, 12, 3794. [Google Scholar] [CrossRef]
  34. Challoner, A.; Pilla, F.; Gill, L. Prediction of indoor air exposure from outdoor air quality using an artificial neural network model for inner city commercial buildings. Int. J. Environ. Res. Public Health 2015, 12, 15233–15253. [Google Scholar] [CrossRef] [PubMed]
  35. Mad Saad, S.; Andrew, A.M.; Shakaff, A.Y.M.; Mohd Saad, A.R.; Kamarudin, A.M.Y.; Zakaria, A. Classifying sources influencing indoor air quality (IAQ) using artificial neural network (ANN). Sensors 2015, 15, 11665–11684. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. ANSI/ASHRAE. ANSI/ASHRAE Standard 62.1-2019,Ventilation for Acceptable Indoor Air Quality; American Society of Heating, Refrigerating and Air-Conditioning Engineers: Atlanta, GA, USA, 2019. [Google Scholar]
  37. MERV. What is a MERV Rating? Indoor Air Quality (IAQ). Available online: https://www.epa.gov/indoor-air-quality-iaq/what-merv-rating-1 (accessed on 5 June 2020).
  38. EPA. A Standardized EPA Protocol for Characterizing Indoor Air Quality in Large Office Buildings; EPA: Washinton, DC, USA, 2016. [Google Scholar]
  39. AQ-SPEC. Air Quality Egg Evaluation Summary; AQ-SPEC: Diamond Bar, CA, USA, 2018. [Google Scholar]
  40. AQ-SPEC. Laboratory Evaluation Air Quality Egg 2018 Model; AQ-SPEC: Diamond Bar, CA, USA, 2018. [Google Scholar]
  41. Mendez, K.M.; Pritchard, L.; Reinke, S.N.; Broadhurst, D.I. Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing. Metabolomics 2019, 15, 1–16. [Google Scholar] [CrossRef] [Green Version]
  42. Donnelly, A.; Misstear, B.; Broderick, B. Real time air quality forecasting using integrated parametric and non-parametric regression techniques. Atmos. Environ. 2015, 103, 53–65. [Google Scholar] [CrossRef]
  43. Uyanık, G.K.; Güler, N. A Study on Multiple Linear Regression Analysis. Procedia-Soc. Behav. Sci. 2013, 106, 234–240. [Google Scholar] [CrossRef] [Green Version]
  44. Ul-Saufie, A.Z.; Yahaya, A.S.; Ramli, N.A.; Rosaida, N.; Hamid, H.A. Future daily PM10 concentrations prediction by combining regression models and feedforward backpropagation models with principle component analysis (PCA). Atmos. Environ. 2013, 77, 621–630. [Google Scholar] [CrossRef]
  45. Munir, S.; Mayfield, M.; Coca, D.; Jubb, S.A.; Osammor, O. Analysing the performance of low-cost air quality sensors, their drivers, relative benefits and calibration in cities—A case study in Sheffield. Environ. Monit. Assess. 2019, 191. [Google Scholar] [CrossRef] [Green Version]
  46. Shams, S.R.; Jahani, A.; Kalantary, S.; Moeinaddini, M.; Khorasani, N. The evaluation on artificial neural networks (ANN) and multiple linear regressions (MLR) models for predicting SO2 concentration. Urban Clim. 2021, 37, 100837. [Google Scholar] [CrossRef]
  47. Kugiumtzis, D.; Tsimpiris, A. Measures of analysis of time series (MATS): A MATLAB toolkit for computation of multiple measures on time series data bases. J. Stat. Softw. 2010, 33, 1–30. [Google Scholar] [CrossRef] [Green Version]
  48. MathWorks MathWorks-Matlab, R2020a at a Glance. Available online: https://www.mathworks.com/products/new_products/release2020a.html (accessed on 5 April 2020).
  49. Corduas, M.; Piccolo, D. Time series clustering and classification by the autoregressive metric. Comput. Stat. Data Anal. 2008, 52, 1860–1872. [Google Scholar] [CrossRef]
  50. Binkowski, M.; Marti, G.; Donnat, P. Autoregressive convolutional neural networks for asynchronous time series. 35th. Int. Conf. Mach. Learn. 2018, 2, 933–945. [Google Scholar]
  51. Spengler, J.D.; Ferris, B.G.; Dockery, D.W.; Speizer, F.E. Sulfur Dioxide and Nitrogen Dioxide Levels Inside and Outside Homes and the Implications on Health Effects Research. Environ. Sci. Technol. 1979, 13, 1276–1280. [Google Scholar] [CrossRef]
  52. Ashtiani, A.; Mirzaei, P.A.; Haghighat, F. Indoor thermal condition in urban heat island: Comparison of the artificial neural network and regression methods prediction. Energy Build. 2014, 76, 597–604. [Google Scholar] [CrossRef]
  53. Abdullah, S.; Ismail, M.; Ahmed, A.N.; Abdullah, A.M. Forecasting particulate matter concentration using linear and non-linear approaches for air quality decision support. Atmosphere 2019, 10, 667. [Google Scholar] [CrossRef] [Green Version]
  54. Dai, X.; Liu, J.; Zhang, X.; Chen, W. An artificial neural network model using outdoor environmental parameters and residential building characteristics for predicting the nighttime natural ventilation effect. Build. Environ. 2019, 159, 106139. [Google Scholar] [CrossRef]
  55. Mendell, M.J.; Fisk, W.J.; Kreiss, K.; Levin, H.; Alexander, D.; Cain, W.S.; Girman, J.R.; Hines, C.J.; Jensen, P.A.; Milton, D.K.; et al. Improving the health of workers in indoor environments: Priority research needs for a National Occupational Research Agenda. Am. J. Public Health 2002, 92, 1430–1440. [Google Scholar] [CrossRef] [PubMed]
  56. Xie, H.; Ma, F.; Bai, Q. Prediction of indoor air quality using artificial neural networks. 5th. Int. Conf. Nat. Comput. 2009, 2, 414–418. [Google Scholar] [CrossRef]
  57. Li, D.; Qin, H.; Cheng, X.; Wu, F. Some notes on split Newton iterative algorithm. Math. Methods Appl. Sci. 2016, 39, 1935–1944. [Google Scholar] [CrossRef]
  58. Gvozdić, V.; Kovač-Andrić, E.; Brana, J. Influence of Meteorological Factors NO2, SO2, CO and PM10 on the Concentration of O3 in the Urban Atmosphere of Eastern Croatia. Environ. Model. Assess. 2011, 16, 491–501. [Google Scholar] [CrossRef]
  59. Li, Z.; Tong, X.; Ho, J.M.W.; Kwok, T.C.; Dong, G.; Ho, K.F.; Yim, S.H.L. A practical framework for predicting residential indoor PM2.5 concentration using land-use regression and machine learning methods. Chemosphere 2021, 265, 129140. [Google Scholar] [CrossRef]
  60. Johra, H.; Heiselberg, P. Influence of internal thermal mass on the indoor thermal dynamics and integration of phase change materials in furniture for building energy storage: A review. Renew. Sustain. Energy Rev. 2017, 69, 19–32. [Google Scholar] [CrossRef]
  61. Švajlenka, J.; Kozlovská, M.; Pošiváková, T. Assessment and biomonitoring indoor environment of buildings. Int. J. Environ. health Res. 2017, 27, 427–439. [Google Scholar] [CrossRef] [PubMed]
  62. Zhang, H.; Srinivasan, R. A Biplot-Based PCA Approach to Study the Relations between Indoor and Outdoor Air Pollutants Using Case Study Buildings. Buildings 2021, 11, 218. [Google Scholar] [CrossRef]
  63. Johra, H.; Heiselberg, P.; Le Dréau, J. Influence of envelope, structural thermal mass and indoor content on the building heating energy flexibility. Energy Build. 2019, 183, 325–339. [Google Scholar] [CrossRef]
  64. Švajlenka, J.; Kozlovská, M.; Pošiváková, T. Biomonitoring the indoor environment of agricultural buildings. Ann. Agric. Environ. Med. 2018, 25, 292–295. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Floor plan of the sampling building.
Figure 1. Floor plan of the sampling building.
Symmetry 13 00952 g001
Figure 2. RMSE of tested models (RMSE = y-axis, data measurements—‘k’ = x-axis): (a) Indoor_PM2.5, (b) Indoor_PM10, (c) Indoor_NO2.
Figure 2. RMSE of tested models (RMSE = y-axis, data measurements—‘k’ = x-axis): (a) Indoor_PM2.5, (b) Indoor_PM10, (c) Indoor_NO2.
Symmetry 13 00952 g002
Figure 3. A schematic structure of a two-layer feed-forward network used in this study.
Figure 3. A schematic structure of a two-layer feed-forward network used in this study.
Symmetry 13 00952 g003
Figure 4. Box plots of air pollutant concentrations: (a) PM2.5, (b) PM10, (c) NO2.
Figure 4. Box plots of air pollutant concentrations: (a) PM2.5, (b) PM10, (c) NO2.
Symmetry 13 00952 g004
Figure 5. Four-month evaluation of indoor and outdoor air quality measured at 10 min intervals (2016 observational points for two weeks equating to 18144): (a) PM2.5, (b) PM10, (c) NO2.
Figure 5. Four-month evaluation of indoor and outdoor air quality measured at 10 min intervals (2016 observational points for two weeks equating to 18144): (a) PM2.5, (b) PM10, (c) NO2.
Symmetry 13 00952 g005aSymmetry 13 00952 g005b
Figure 6. Comparison between indoor and outdoor data in using linear regression: (a) PM2.5, (b) PM10, (c) NO2.
Figure 6. Comparison between indoor and outdoor data in using linear regression: (a) PM2.5, (b) PM10, (c) NO2.
Symmetry 13 00952 g006
Figure 7. Spearman correlation heat map for indoor and outdoor environmental parameters.
Figure 7. Spearman correlation heat map for indoor and outdoor environmental parameters.
Symmetry 13 00952 g007
Figure 8. Simulated indoor PM2.5 concentrations at 10 min intervals (2016 observational points for two weeks equating to 18144).
Figure 8. Simulated indoor PM2.5 concentrations at 10 min intervals (2016 observational points for two weeks equating to 18144).
Symmetry 13 00952 g008
Figure 9. Simulated indoor PM10 concentrations at 10 min intervals (2016 observational points for two weeks equating to 18144).
Figure 9. Simulated indoor PM10 concentrations at 10 min intervals (2016 observational points for two weeks equating to 18144).
Symmetry 13 00952 g009
Figure 10. Simulated indoor NO2 concentrations at 10-min intervals (2016 observational points for two weeks equating to 18144).
Figure 10. Simulated indoor NO2 concentrations at 10-min intervals (2016 observational points for two weeks equating to 18144).
Symmetry 13 00952 g010
Table 1. Specifications of the selected sensor modules.
Table 1. Specifications of the selected sensor modules.
Measured
Parameter
ProductManufacturerMeasurement
Tolerance
Measuring
Range
PM2.5, PM10PMS5003Plantower±10% or 100~500 μg/m3;0~500 µg/m3, ≥1000 µg/m3
NO23SP_NO2_5FP SPEC±5% or 10 ppb0~5 ppm
CO3SP_CO_1000 SPEC±2%0 to 1000 ppm
RHTDHT22Aosong Electronic±0.5 °C and ±1%−40 °F to 176 °F, 0% to 100%
Table 2. Descriptive statistics of indoor environmental parameters.
Table 2. Descriptive statistics of indoor environmental parameters.
Indoor Outdoor I/O Ratio
Environmental
Parameters
Average ± SDMinMaxMedianAverage ± SDMinMaxMedianMean
PM2.5 (µg/m3)2.29 ± 2.27038.51.707.20 ± 4.35068.06.400.33
PM10 (µg/m3)2.40 ± 2.37045.01.808.06 ± 4.74077.67.200.31
NO2 (ppb)60.9 ± 7.3530.713361.9132 ± 68.708761411.34
Temperature (°F)75.0 ± 1.2272.383.974.881.5 ± 7.7065.010679.71.03
Relative Humidity (%)70.7 ± 3.9050.982.171.571.1 ± 14.127.890.375.40.92
Table 3. Summary of the prediction models for indoor PM2.5 and PM10.
Table 3. Summary of the prediction models for indoor PM2.5 and PM10.
Indoor PM2.5Indoor PM10
R2RMSEChangesR2RMSEChanges
R2RMSER2RMSE
MLR0.99860.0862--0.99850.0906--
TSR0.99860.08510−1.31%0.99860.08970−1.02%
ANN0.99940.08160.08%−5.37%0.99950.07820.09%−13.66%
Table 4. Summary of the prediction models for indoor NO2.
Table 4. Summary of the prediction models for indoor NO2.
Indoor NO2
R2RMSEChanges
R2RMSE
MLR0.72303.8861--
TSR0.72333.85580.05%−0.78%
ANN0.90143.173724.68%−18.33%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, H.; Srinivasan, R.; Yang, X. Simulation and Analysis of Indoor Air Quality in Florida Using Time Series Regression (TSR) and Artificial Neural Networks (ANN) Models. Symmetry 2021, 13, 952. https://doi.org/10.3390/sym13060952

AMA Style

Zhang H, Srinivasan R, Yang X. Simulation and Analysis of Indoor Air Quality in Florida Using Time Series Regression (TSR) and Artificial Neural Networks (ANN) Models. Symmetry. 2021; 13(6):952. https://doi.org/10.3390/sym13060952

Chicago/Turabian Style

Zhang, He, Ravi Srinivasan, and Xu Yang. 2021. "Simulation and Analysis of Indoor Air Quality in Florida Using Time Series Regression (TSR) and Artificial Neural Networks (ANN) Models" Symmetry 13, no. 6: 952. https://doi.org/10.3390/sym13060952

APA Style

Zhang, H., Srinivasan, R., & Yang, X. (2021). Simulation and Analysis of Indoor Air Quality in Florida Using Time Series Regression (TSR) and Artificial Neural Networks (ANN) Models. Symmetry, 13(6), 952. https://doi.org/10.3390/sym13060952

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop