Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems

Di Loreto, Samantha; Serpilli, Fabio; Lori, Valter; Di Perna, Costanzo

doi:10.3390/en16062719

Open AccessArticle

Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems

by

Samantha Di Loreto

^*,†

,

Fabio Serpilli

^†

,

Valter Lori

^†

and

Costanzo Di Perna

^†

Department of Industrial Engineering and Mathematical Sciences, Università Politecnica delle Marche, Brecce Bianche 12, 60131 Ancona, Italy

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Energies 2023, 16(6), 2719; https://doi.org/10.3390/en16062719

Submission received: 16 February 2023 / Revised: 6 March 2023 / Accepted: 12 March 2023 / Published: 14 March 2023

(This article belongs to the Special Issue Applications of Building Energy Performance Simulation)

Download

Browse Figures

Versions Notes

Abstract

:

Heating ventilation and air conditioning (HVAC) systems represent one of the main noise sources inside classrooms. This explain why HVAC systems require careful design, competent installation and balancing, and regular maintenance. Many factors influence the classroom acoustical design, such as air handlers or fans, the velocity of air inside the classroom, as well as the size and acoustical treatment of ducts, returns, and diffusers. Acoustic parameters, including background-noise levels, reverberation time, and intelligibility, were analyzed in 17 classrooms at the Università Politecnica in the Marche region. The study of intelligibility was performed by measuring the objective parameters in situ and using prediction methods to determine the intelligibility score. The relationship between speech intelligibility measurements and speech intelligibility calculation has been studied. The relationship between the STI values with the background-noise levels and the reverberation time was also studied. This research shows that a comparison between predictive methods and measurement methods results in speech intelligibility for classrooms of different sizes with and without HVAC systems. The current method of calculating the voice transmission index (STI), proposed by national and international standards, has been used to determine speech intelligibility scores in classrooms. The results show that the calculation tool has computational robustness allowing its use in preliminary evaluations of speech intelligibility, design of the optimal type of school buildings, and sound amplification systems in classrooms that comply with Italian regulations.

Keywords:

intelligibility; acoustic measurements; acoustic comfort; speech transmission index

1. Introduction

Today, numerous studies about the effects of IEQ’s performance in the internal environment have been developed.

In fact, over the years, various systems have been developed for the evaluation of the internal comfort of buildings with particular regard to schools and recreational environments.

In [1], the aim is to provide an overview of the IEQ parameters often measured in schools, their measurement methods, and the main conclusions of these measurements. Most of the studies considered in this review measured the parameters of temperature, humidity,

{C 0}_{2}

and volatile organic compounds (VOCs) in schools. Other parameters considered include the concentration of suspended particulates, the air speed, the presence of mold, and the quality of lighting and acoustic comfort.

In [2], the aim is to identify the key factors influencing the quality of the indoor environment and to develop a sustainability assessment system to improve the performance of residential buildings. The assessment model has been developed based on a review of the existing literature regarding factors that influence the quality of the indoor environment and based on the authors’ practical experiences. The model was then validated through interviews with residents of sample houses in the Biobio region (Cile).

Another study utilized a set of innovative techniques, such as virtual reality (VR) and augmented reality (AR), to visualize the occupants’ comfort in real-time.

In [3], a new approach to integrating occupant feedback and the probabilistic occupant comfort model into BIM was proposed. It also featured innovative techniques to facilitate BIM as a more effective platform for visualization to guide decision-makers in addressing building operational problems focused on occupant comfort.

The theme of acoustic comfort (ambient noise, sound insulation, reverberation time, speech intelligibility) in primary and secondary school classrooms, as well as in university classrooms, has been the focus of several studies all around the world [4].

High noise levels in classrooms can have a negative impact on students’ learning and well-being. Studies have shown that high noise levels can cause students to tire early, negatively affect cognitive abilities such as attention, memory, and reading, and make it more difficult to understand the lesson content [5].

Prolonged exposure to high noise levels can also cause stress and anxiety and lead to negative health effects such as hearing loss, cardiovascular disease, and sleep disturbances. Children are more sensitive to noise than adults and can be negatively affected by high noise levels at lower levels than adults [6].

Adequate acoustic comfort is particularly important in schools, as poor acoustic conditions can negatively impact the learning and well-being of students and teachers.

The DM 11/01/2017 [7] establishes specific criteria for assessing the indoor acoustic quality of public buildings, including reference values for sound pressure level, reverberation time, and background-noise level.

The values specified in the DM 11/01/2017 [7] are based on international standards, which provide guidelines for measuring and assessing the acoustic environment in buildings. The goal of these regulations is to ensure that public buildings provide a comfortable and safe environment for those who use them by controlling noise pollution. These laws are established to keep people safe and healthy and to guarantee good learning and working conditions.

The reference values specified in the DM 11/01/2017 [7] are evaluated in compliance with the national standards UNI 11532-1 [8] and UNI 11532-2 [9]. These two standards provide guidelines for the assessment of indoor acoustic quality in terms of the sound pressure level, reverberation time, and background-noise level.

UNI 11532-1 [8] specifies methods for measuring and assessing the acoustic environment in buildings, while UNI 11532-2 [9] sets out the reference values and acceptance criteria for various acoustic parameters, such as sound pressure level, reverberation time, and background-noise level.

The standard UNI EN ISO 9921 [10] specifies the requirements for the performance of speech communication and recommends the level of speech communication quality required for conveying comprehensive messages in several case studies.

In [11], it is highlighted that the acoustic conditions of a learning environment, such as the level of background noise and the amount of reverberation, can have a significant impact on a child’s ability to attend to and remember information during primary school. This is because the perception of speech sounds is still developing in young children, and background noise and reverberation can make it more difficult for them to distinguish and understand speech. This can negatively impact their comprehension and retention of the material being taught in the classroom.

Several studies [12,13,14] have shown that children perform better in classrooms with better acoustic conditions, improved speech intelligibility, and reduced background noise.

In [15], many measurements of the intelligibility of speech were made to calculate the disturbance produced by different amounts of vocal force. The results of this case study show less than a 5% deterioration in intelligibility over the range, from a moderately low voice to a very loud voice (from 55 to 78 dB in a free field at one m from the lips).

Other studies [16,17,18,19,20] have shown that speech intelligibility is influenced by the reverberation time (RT) and signal-to-noise ratios (SNR).

In [21], speech intelligibility tests were carried out in 12 university classrooms in Korea; the test results indicate that young adult listeners at university have a mean score of 95% correct at a signal-to-noise ratio (SNR) value of +3 dBA, which is a considerably lower SNR value than for the younger students in elementary schools. As a result, the development of effective objective indicators of quality and/or intelligibility is of particular interest; the measured parameters include reverberation time, early decay times, energy ratios, and STI values. The STI is a physical metric related to the intelligibility of speech degraded by additive noise and reverberation [22].

Scientists nowadays consider the STI to be the parameter that best reflects the intelligibility of speech (in a sound transmission system) [23]. Consequently, the STI measure correlates well with subjective intelligibility scores for stimuli distorted by linear filtering, reverberation, and additive noise. The experiments in the literature evaluate the effectiveness of the prevision method in predicting speech intelligibility.

In [24], the potential binaural effect of reducing reflection and reverberation was studied. These conditions create a reduction in intelligibility because echoes and strong discrete reflections, arriving late, lead directly to a wrong assessment when using the STI.

Similarly, in [25], the STI approach was revisited, and a variation was proposed that processes the modulation envelope in short time segments, requiring only an assumption of quasi-stationarity (rather than the stationary assumption of STI) of the modulation signal.

Based on the tests in [26], the corresponding relation between the STI and speech intelligibility in large spaces was modified, and a new rating threshold for the STI was also proposed.

In Makito et al. [27], the aim is to evaluate the speech intelligibility of the different types of sound sources and the reverberation time in the classrooms and to compare the results. The study attempted to find ways to improve the design of public address systems for better speech intelligibility in the classrooms. It attempted also to identify best practices for the design of sound systems in classrooms evaluating the differences in the speech intelligibility scores between different types of sound sources.

The main objectives of this work are summarized as follows:

To study and compare the acoustical performance of several university classrooms based on different geometrical configurations;
To determine how well various prediction methods, such as computer simulations and mathematical models, can predict the speech intelligibility of a given space based on the measurement results;
To find correlation factors between prediction methods and measurement methods as high as possible, indicating that the prediction methods provide results that are highly correlated with actual speech intelligibility measurements.

This study used the existing calculation method of the speech transmission index (STI) via the indirect method described in IEC 60268-16 [28] to model speech intelligibility in the classrooms and then compare the results of the model with actual measurement data to determine the correlation between the two.

Section 2 describes the methods used, including a brief description of the places and the acoustic parameters studied. Section 3 presents a discussion of the results. Finally, Section 4 provides the conclusions of the study.

2. Materials and Methods

This section introduces the descriptors that characterize the internal acoustic quality of school environments and the related standards and procedures. Then, it illustrates the characteristics of the evaluated classrooms and the measuring equipment. The method of calculating the voice transmission index is explained in detail.

2.1. Reference Value for RT, STI, and C50

Reverberation Time T20 and T30 are the values of the reverberation time estimated by the slope of the Schroeder backward-integrated decay, respectively, in the [dB] ranges: [−5, 163-25] for T20 and [−5, −35] for T30.

According to standard 11532-2 [9], the optimal reverberation time,

T_{ott}

, which corresponds to a conventional room occupation of 80%, with the exception of category A5 as per the categories shown in Table 1, is determined according to the specific room activity as a function of the room volume, according to the formulas shown in Table 2.

The categories of the environment, in relation to the destined use, are reported in Table 1.

The reference values for the optimal reverberation times for A1–A4 categories are reported in Table 2.

Reference values for

C_{50}

;

C_{50}

is the ratio, in dB, between the “useful energy” received in the first 50 ms of the impulse response to the energy received in subsequent instances. The term “energy” represents the square of the instantaneous values of the pressure impulse response.

C_{50}

is defined in the ISO 3382-1 [29] through Equation (1) as follows.

C_{50} = 10 log \frac{\int_{0}^{50 m s} p^{2} (t) d t}{\int_{50 m s}^{\infty} p^{2} (t) d t} d B

(1)

The

C_{50}

descriptor can be used as an alternative to the STI for categories A1, A2, A3, and A4 for environments smaller than 250

m^{3}

.

For environments with V ≥ 250

m^{3}

, only the STI is applicable. The reference values for

C_{50}

are shown in Table 3.

The

C_{50}

values in each single measurement position are obtained as the arithmetic average of the measured values in the octave bands 500–1000–2000 Hz.

Reference values for STI; The STI aims to objectively quantify speech intelligibility at a specific location in one environment when speech is produced through a normalized signal at another specific location in the same environment.

Speech intelligibility is measured using a metric called the speech transmission index (STI), which ranges from 0 to 1 with higher values indicating better speech intelligibility. It is measured by comparing the speech signal with the background noise in a room.

STI values from above 0.6 to 0.7 are considered good and provide intelligible speech. It is worth noting that values above 0.5 are still considered acceptable but lower than optimal, and values under 0.5 are considered poor and problematic for effective communication.

Proper acoustical design of a classroom, utilizing sound-absorbing materials, and proper sound isolation techniques, can help improve speech intelligibility by reducing excessive noise levels and controlling reverberation.

The speech transmission index (STI) is based on the measurement of the modulation transfer function (MTF).

The MTF is a measure of how well a system (such as a room or a communications system) transmits the modulations, or variations, of a signal. The modulation transfer function (MTF) is a measure of the reduction in the amplitude of a sinusoidal input signal as it passes through a system. The MTF is the ratio of the amplitude of the sinusoidal output signal to the amplitude of the input signal.

In the case of the STI, the MTF is used to measure the reduction in amplitude of a speech signal as it passes through a room and is represented by a ratio between the speech signal and the background noise. This ratio is then used to calculate the STI value which ranges from 0 to 1 with higher values indicating better speech intelligibility.

For each modulation frequency, the MTF is determined by the ratio between the modulation index of the signal at the listener,

m_{0}

, and the modulation index of the test signal,

m_{i}

. A family of MTF curves is determined, in which each curve is relative to each octave band of speech emission and is defined by the values that the modulation index reduction factor m assumes for each modulation frequency present in the envelope of a natural speech signal.

For the STI measurement, 7 octave bands, from 125 Hz to 8 kHz, and 14 modulation frequencies, between 0.63 Hz and 12.5 Hz at octave intervals of one-third, are considered. The 98 (7 × 14) m-values are finally summarized in a single index, the STI, varying between 0 and 1, which represents the effect of the transmission system on intelligibility. The STI quantifies the combined effect of background-noise interference and reverberation on speech intelligibility reduction, with or without sound amplification systems.

Figure 1 illustrates the MTF for the octave band centered on 250 Hz for two simple transmission systems: one with exponential reverberation (case A) only and the other with only interfering noise (case B).

The UNI EN ISO 9921 standard [10] establishes a relationship between the STI value and their subjective assessment in terms of the intelligibility for a normally hearing user. The values are shown in Table 4:

Another classification of speech intelligibility is provided by BS EN 60268–16 [28]; the standard defines qualification intervals for the levels of the STI obtained, as shown in Figure 2. The typical STI requirements for dedicated applications are also provided in Figure 3.

There are two measurement methods for STI: the direct and indirect methods. The direct method uses modulated (speech-like) test signals to directly measure the modulation transfer function. Typically modified pink noise with modulation frequencies was used. In this case, the measurement signal is either applied as an electric input to the system or through a “human speaker” loudspeaker to a microphone. The indirect method uses impulse response and forward energy integration (Schroeder integral) to derive the modulation transfer function. The STI can be measured at the same time as other room acoustic parameters. This means that speech intelligibility will normally be measured using an omnidirectional speaker.

2.2. Room Descriptions and Measurements

The university building selected for the measurement campaign is located in a suburban area of Ancona. The acoustics of the building are affected by the surrounding environment.

In suburban areas, the level of background noise is generally lower than in urban areas, but it can still be significant, and sound reflections from nearby buildings and surfaces can impact the acoustics of the building.

In addition, the classrooms are located at the rear of the building in relation to the access road. The external SPL during the daytime period is between 45 and 55 dB (A).

The measurement campaign was carried out in different classrooms with and without the HVAC system to understand the acoustic performance of the building.

Table 5 summarizes the main characteristics of the lecture rooms: type of school (school number and grade), name of the room, year of construction, location characteristics, and main finishing materials.

The list of geometric dimensions of each classroom is provided in Table 6.

Figure 4 shows some of the classrooms at the University of Ancona and the measurement positions, as required by UNI 11532-2 [9].

The measurements in each classroom were completed for four measurement points, chosen in compliance with the UNI 11532 standard. Three positions were selected along the imaginary line traced on the longitudinal axis of the classroom between the sound source and the back of the classroom, and a position was selected as representative of the most unfavorable listening condition (due to background noise, distance from the speaker, etc.). The STI measurements were derived from the impulse response measures and background-noise measures with the indirect methodology proposed by BS EN 60268–16 [28]. Table 7 shows the results of measurements for the RT₃₀ from 125 kHz to 8 KHz, the STI for each measurement point, and the STI mean value and intelligibility rating according to [28].

2.3. STI Prediction Using Indirect Method

Prediction of the STI of a sound system may be based on the MTF matrix that is calculated from the predicted room acoustic and electro-acoustic parameters and from the measured or estimated background-noise levels for each octave band contributing to the STI version chosen. The STI measure uses artificial signals (e.g., sinewave-modulated signals) as probe signals to assess the reduction in the signal modulation in several frequency bands and for a range of modulation frequencies (0.6–12.5 Hz). As requested in the reference standard, the speech spectrum at 1 meter in front of the mouth of a male speaker with the ambient noise spectrum is reported in Table H.1 of UNI EN ISO 9921:2004. Table 8 and Table 9 were concatenated.

The STI was calculated based on the modulation transfer function (MTF), and the calculations used the method of Houtgast and Steeneken (1973) [30].

In (UNI, 2020) for the calculation of the STI in classrooms without amplification systems and with volumes > 250

m^{3}

, an emission signal at 1m on the axis to the source equal to 70 dB is required.

Therefore, to calculate the predictive STI, the reference signal of the speech was increased by 10 dB.

The modulation transfer function of the transmission path may be quantified by comparing the ratio of the modulation depth at the output and input of the test signal, and it is written as Equation (2):

m (f_{m}) = \frac{| \int_{0}^{\infty} h {(t)}^{2} e^{- j 2 π f_{m} t} d t |}{\int_{0}^{\infty} h {(t)}^{2} d t} {[1 + 10^{- \frac{SNR}{10}}]}^{- 1}

(2)

where:

$m (f_{m})$ is the modulation transfer function of the transmission channel;
$h (t)$ is the impulse response of the transmission channel;
SNR is the signal-to-noise ratio in dB.

Considering a diffuse reverberant field, the impulse response is written as Equation (3):

h (t) = \frac{Q}{r^{2}} δ (t) + \frac{13.8 Q}{r_{c}^{2} T} e^{- \frac{13.8 t}{T}}

(3)

where:

Q is the directivity factor for the sound source (talker);
r is the talker-to-listener distance;
T is the reverberation time of the room space.

The reverberation time was calculated with the method described in UNI EN 12354-6 [31], starting from the acoustic absorption of the room.

3. Results

This section presents the experimental results. First, it presents the STI values of the classrooms described in Section 2.2 according to the method described in Section 2.3. The result of the correlation between the measured and predicted STI results is then shown.

3.1. Evaluation of Speech Intelligibility through the Predictive Method

The model is based on our experience with predictions for rooms, such as rooms in dwellings and offices, and common spaces in buildings, such as stairwells, corridors, and rooms containing machinery and technical equipment. It is not intended to be used for very large or irregularly-shaped spaces, such as concert halls, theaters, and factories.

The impulse response of the classroom was calculated in the four different positions of the room.

In Figure 5, the graph of the simulated reverberation time vs. the measured reverberation time for each classroom for a conventional occupation of the environment equal to 80% is reported.

The calculation was made the same for all of the classes previously listed.

For categories A1–A4, if the measurement is performed in a furnished but unoccupied environment, the measured values must be corrected with Equation (4) to compare them with the reference limits.

R T_{i n o c c} = \frac{R T_{o c c}}{[1 - R T_{o c c} * \frac{Δ A_{p e r s}}{0.16 V}]}

(4)

where:

${R T}_{o c c}$ is the optimal reverberation time for the room occupied at 80% in seconds;
${R T}_{i n o c c}$ is the optimal reverberation time when the room is not occupied (measurement result), in seconds;
V is the volume of the room in cubic meters;
$Δ A_{p e r s}$ is the equivalent additional surface area of the acoustic absorption of people in square meters.

A constant MTF over the modulation frequencies indicates that speech intelligibility is mainly determined by background noise. A continuously decreasing MTF indicates an important influence of the reverberation, and an MTF that decreases first and then increases again indicates the presence of an echo.

The STI index can finally be obtained by using the weighted average method for the modulation transmission index on the considered octave bands as follows in Equation (5):

S T I = \sum_{k = 1}^{7} (α_{k} * {MTI}_{k}) - \sum_{k = 1}^{6} β_{k} * {({MTI}_{k} * {MTI}_{k + 1})}^{\frac{1}{2}}

(5)

The STI weighting factors (

α

) and redundancy factors (

β

) for male and female speech are shown in Table 10 as a function of the octave bands.

For example, the calculation of the AT1 classroom is given. The same procedure has been applied to all other classrooms under examination.

Figure 6 shows the result of the simulation of the modulation transfer function in the seven octave bands calculated for P1:

Table 11 shows the relationship between

α

,

β

, and MTI

α β

to determine the STI for the AT1 classroom.

The same calculation was carried out for all of the positions, and the STI simulation results are shown in Table 12.

From the comparison between the results of the STI obtained between the measured and simulated values (see Table 13), it can be seen that the difference is very low.

This demonstrates that the predictive model turns out to be very effective at ensuring a good internal quality of the classrooms during the design phase.

3.2. Correlations between Measurments and Predictive Methods

Considering the results of the simulations and according to the background literature, a statistical analysis for the case study was carried out.

The purpose of the correlation study is to highlight an interdependent link between the statistical variables. The linear correlation is measured by Bravais–Pearson [32], as seen in Equation (6).

r = \frac{\sum_{i = 1}^{n} (x_{i} - \bar{x}) (y_{i} - \bar{y})}{\sqrt{\sum_{i = 1}^{n} {(x_{i} - \bar{x})}^{2}} \sqrt{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}}

(6)

A linear regression model is the relation between a dependent, or response, variable y and one or more independent, or predictor, variables

x (1)

, …,

x (n)

. A simple linear regression considers only one independent variable using the relation in Equation (7):

Y_{i} = β_{0} + β_{1} X_{i} + ϵ_{i}

(7)

where

β_{0}

is the y-intercept,

β_{1}

is the slope (or regression coefficient), and

ϵ_{i}

is the error term. It begins with a set of n observed values of x and y given by Equation (8):

(x_{1}, y_{1}), (x_{2}, y_{2}), \dots, (x_{n}, y_{n})

(8)

Using the simple linear regression relation, these values form a system of linear equations. It is possible to represent these equations in matrix form as:

[\begin{matrix} (\begin{matrix} Y_{1} \\ Y_{2} \\ ⋮ \\ Y_{n} \end{matrix}) = (\begin{matrix} 1 & x_{1} \\ 1 & x_{2} \\ ⋮ & ⋮ \\ 1 & x_{n} \end{matrix}) (\begin{matrix} β_{0} \\ β_{1} \end{matrix}) \end{matrix}]

[\begin{matrix} Y = (\begin{matrix} Y_{1} \\ Y_{2} \\ ⋮ \\ Y_{n} \end{matrix}), X = (\begin{matrix} 1 & x_{1} \\ 1 & x_{2} \\ ⋮ & ⋮ \\ 1 & x_{n} \end{matrix}), B = (\begin{matrix} β_{0} \\ β_{1} \end{matrix}) \end{matrix}]

Then, the Equation (7) can be expressed more concisely, as explained in Equation (9):

Y = X * B

(9)

A polynomial regression analysis was used to identify the most appropriate correlation between the measurement and predictive method, as shown in Figure 7.

The proposed correlation model between the measurements of the STI versus the simulations of the STI is based on a polynomial function, according to the following Equation (10):

y = a * x^{3} + b * x^{2} + c * x + d

(10)

where y is the response variable, and a, b, c, and d represent partial correlation coefficients (coefficients with a 95% confidence bound).

a = 16.53 (−13.76, 46.83);
b = −25.71 (−67.04, 15.62);
c = 13.11 (−4.937, 31.17);
d = −1.701 (−4.186, 0.7837).

In Table 14, the results of the correlation displayed in Figure 7 are reported.

This can be attributed to the particular geometry of those classrooms, which makes the acoustic quality more sensitive to reverberation conditions.

Moreover, the only classrooms that have an unacceptable “poor” intelligibility are the classrooms S1, S2, and S3, in which controlled mechanical ventilation systems have been installed.

VMC sources worsen the internal acoustic quality, especially in terms of speech intelligibility and ambient noise levels, with negative consequences for both the teacher and the student.

Overall, the predictive model proved to be an effective tool for improving the internal quality of classrooms during the design phase.

It can be applied to analyze the acoustic quality of classrooms in different conditions, with different geometries, orientations, and materials, and can be used to quickly and accurately assess the acoustic intelligibility of school classrooms.

4. Conclusions

This paper systematically provides the flow of the STI indirect test method specified in BS EN 60268-16 and introduces in detail the calculation formula involved in the indirect method, with reference to Schroeder’s frequency analysis and, therefore, to the limits of the validity of the sound equations of classical theory, which are associated with the simulation of the room. The aim is to find correlation factors between the prediction and measurement methods as high as possible, indicating that the prediction methods provide results that are highly correlated with actual speech intelligibility measurements.

The proposed approach utilizes first the measurements of the the acoustical performance and the speech intelligibility of several university classrooms based on different geometrical configurations according to national and international standards. Second, speech intelligibility was determinedby the prediction methods indicated in the ANNEX L of the BS EN 60268-16 standard. Then, the correlation between the prediction methods and the measurements in situ.

This study highlighted that the one of the major problems when developing this type of prediction is represented by the error generated by a low signal-to-noise ratio. Therefore, the choice of the speech spectrum, as well as the residual noise setting, represents an important choice in order for overestimation errors of the STI not to be incurred.

Although the standard is clear in recommending standard spectra, a possible solution could be to simulate the environment impulse response using a commercial room acoustic software and enter, during the input phase, an environmental noise that could be representative of the acoustic scene of the room.

However, measurement methods provide actual data on the acoustic characteristics of a room, which can be used to validate the predictions made by predictive methods.

Additionally, an HVAC system can have a significant effect on speech intelligibility in educational room, either positively, for example by providing improved ventilation, or negatively, for example, by introducing additional noise into the room, depending on the HVAC system’s design, installation, and maintenance.

The installation of acoustic panels and other sound absorbing materials can be recommended in order to improve the acoustic quality of the classrooms where VMC systems are installed.

The results show that the calculation tool has computational robustness that allows its use in preliminary evaluations of language intelligibility, design o fthe optimal type of school buildings, and sound amplification systems in classrooms that comply with Italian regulations.

Author Contributions

Conceptualization, S.D.L.; methodology, S.D.L.; software, S.D.L.; validation, S.D.L., F.S., and V.L.; formal analysis, S.D.L., F.S., and V.L.; investigation, S.D.L., F.S., and V.L.; resources, S.D.L., F.S., and V.L.; data curation, S.D.L., F.S., and V.L.; writing—original draft preparation, S.D.L., F.S., and V.L.; writing—review and editing, S.D.L., F.S., V.L., and C.D.P.; visualization, S.D.L., F.S., V.L., and C.D.P.; supervision, S.D.L., F.S., V.L., and C.D.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the necessARIA project.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

This study did not report any data.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

STI	Speech transmission index
IS	Intelligibility score
MTF	Modulation transfer function
SPL	Sound pressure level
${STI}_{m}$	Speech transmission index (measured)
${STI}_{p}$	Speech transmission index (predicted)

References

Tran, M.T.; Wei, W.; Dassonville, C.; Martinsons, C.; Ducruet, P.; Mandin, C.; Héquet, V.; Wargocki, P. Review of Parameters Measured to Characterize Classrooms’ Indoor Environmental Quality. Buildings 2021, 13, 433. [Google Scholar] [CrossRef]
Quesada-Molina, F.; Astudillo-Cordero, S. Indoor Environmental Quality Assessment Model (IEQ) for Houses. Sustainability 2023, 15, 1276. [Google Scholar] [CrossRef]
Alavi, H.; Forcada, N.; Bortolini, R.; Edwards, D.J. Enhancing occupants’ comfort through BIM-based probabilistic approach. Autom. Constr. 2021, 123, 103528. [Google Scholar] [CrossRef]
Sala, E.; Viljanen, V. Improvement of acoustic conditions for speech communication in classrooms. Appl. Acoust. 1995, 45, 81–91. [Google Scholar] [CrossRef]
Zannin, P.H.T.; Petri, D.; Zwirtes, Z. Evaluation of the acoustic performance of classrooms in public schools. Appl. Acoust. 2018, 70, 626–635. [Google Scholar] [CrossRef]
Winn, M.B.; Teece, K.H. Listening Effort Is Not the Same as Speech Intelligibility Score. Trends Hear. 2021, 25, 23312165211027688. [Google Scholar] [CrossRef]
Ministerial Decree 11 October 2017; Criteri Ambientali Minimi per l’Affidamento di Servizi di Progettazione e Lavori per la Nuova Costruzione, Ristrutturazione e Manutenzione di Edifici Pubblici. Ministero dell’ambiente e della tutela del territorio e del mare: Rome, Italy, 2017.
Standard UNI 11532-1:2018; Aratteristiche Acustiche Interne di Ambienti Confinati—Metodi di Progettazione e Tecniche di Valutazione—Parte 1: Requisiti Generali. Organismo nazionale Italiano di normazione: Rome, Italy, 2018.
Standard UNI 11532-2:2020; Internal Acoustical Characteristics of Confined Spaces—Design Methods and Evaluation Techniques—Part 2: Educational Sector. Organismo nazionale Italiano di normazione: Rome, Italy, 2020.
Standard UNI EN ISO 9921:2004; Assessments of Speech Communication. International Organization for Standardization: Geneva, Switzerland, 2004.
Puglisi, G.E.; Warzybok, A.; Astolfi, A.; Kollmeier, B. Effect of reverberation and noise type on speech intelligibility in real complex acoustic scenarios. J. Build. Environ. 2021, 204, 108137. [Google Scholar] [CrossRef]
Mogas-Recalde, J.; Palau, R.; Márquez, M. How Classroom Acoustics Influence Students and Teachers: A Systematic Literature Review. J. Technol. Sci. Educ. 2021, 11, 245–259. [Google Scholar] [CrossRef]
Wang, S.; Zhang, D. tudent-centred teaching, deep learning and self-reported ability improvement in higher education: Evidence from Mainland China. Innov. Educ. Teach. Int. 2019, 56, 581–593. [Google Scholar] [CrossRef]
Puglisi, G.E.; Warzybok, A.; Astolfi, A.; Kollmeier, B. Effect of competitive acoustic environments on speech intelligibility. J. Technol. Sci. Educ. 2021, 2069, 01216. [Google Scholar] [CrossRef]
Pickett, J.M. Effects of Vocal Force on the Intelligibility of Speech Sounds. J. Acoust. Soc. Am. 2005, 28, 1956. [Google Scholar] [CrossRef]
Bradley, J.S.; Reich, R.; Norcross, S.G. On the combined effects of signal-to-noise ratio and room acoustics on speech intelligibility. J. Acoust. Soc. Am. 1999, 106, 1820. [Google Scholar] [CrossRef] [PubMed]
Yang, D.; Mak, C.M. An investigation of speech intelligibility for second language students in classrooms. Appl. Acoust. 2018, 134, 54–149. [Google Scholar] [CrossRef]
Yang, W.Y.; Bradley, J.S. Effects of room acoustics on the intelligibility of speech in classrooms for young children. J. Acoust. Soc. Am. 2009, 125, 922–933. [Google Scholar] [CrossRef] [Green Version]
Rennies, J.; Best, V.; Roverud, E.; Kidd, G. Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort. Trends Hear. 2019, 23, 2331216519854597. [Google Scholar] [CrossRef]
Galbrun, L.; Kitapci, K. Accuracy of speech transmission index predictions based on the reverberation time and signal-to-noise ratio. Appl. Acoust. 2014, 81, 1–14. [Google Scholar] [CrossRef]
Choi, Y.J. The intelligibility of speech in university classrooms during lectures. Appl. Acoust. 2020, 162, 107211. [Google Scholar] [CrossRef]
Goldsworthy, R.L.; Greenberg, J.E. Analysis of speech-based speech transmission index methods with implications for nonlinear operations. J. Acoust. Soc. Am. 2004, 116, 3679. [Google Scholar] [CrossRef]
Steeneken, H.; Houtgast, T. A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria. J. Acoust. Soc. Am. 1980, 77, 318–326. [Google Scholar] [CrossRef]
Peters, R. (Ed.) Uncertainty in Acoustics: Measurement, Prediction and Assessment, 1st ed.; CRC Press: Boca Raton, FL, USA, 2020. [Google Scholar] [CrossRef]
Schwerin, B.; Paliwal, K. An improved speech transmission index for intelligibility prediction. Speech Commun. 2014, 65, 9–19. [Google Scholar] [CrossRef]
Hongshan, L.; Ma, H.; Kang, J.; Wanga, C. The speech intelligibility and applicability of the speech transmission index in large spaces. Appl. Acoust. 2020, 167, 107400. [Google Scholar] [CrossRef]
Kawata, M.; Tsuruta-Hamamura, M.; Hasegawa, H. Assessment of speech transmission index and reverberation time in standardized English as a foreign language test rooms. Appl. Acoust. 2023, 202, 109093. [Google Scholar] [CrossRef]
Standard BS EN IEC 60268-16:2020; Sound system Equipment Objective Rating of Speech Intelligibility by Speech Transmission Index. IEC–International Electrotechnical Commission: London, UK, 2020.
Standard ISO 3382-1:2009; Acoustics—Measurement of Room Acoustic Parameters—Part 1: Performance Spaces. International Organization for Standardization: Geneva, Switzerland, 2009.
Houtgast, T.; Steeneken, H.J.M. The Modulation Transfer Function in Room Acoustics as a Predictor of Speech Intelligibility. Acta Acust. United Acust. 1973, 28, 66–73. [Google Scholar] [CrossRef]
Standard EN ISO 12354-6:2006; Building Acoustics—Estimation of Acoustic Performance of Buildings from the Performance of Elements—Part 6: Sound Absorption in Enclosed Spaces. International Organization for Standardization: Geneva, Switzerland, 2006.
Walker, D.A. The Pearson Product-Moment Correlation Coefficient and Adjustment Indices: The Fisher Approximate Unbiased Estimator and the Olkin-Pratt Adjustment (SPSS). J. Mod. Appl. Stat. Methods 2017, 48, 540–546. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Theoretical expression of MTF according to BS EN 60268–16.

Figure 2. Qualification intervals for STI levels.

Figure 3. Value for STI qualification bands and typical applications.

Figure 4. Plan and measurement positions of some classrooms.

Figure 5. Reverberation time value in the octave bands between 125 Hz and 4000 Hz simulated (empty room) and measured for each classroom.

Figure 6. Modulation transmission ratio in the 7 octave band of AT1 classroom.

Figure 7. Results of polynomial regression between measurements of STI and simulations of STI. Several noteworthy results were: correlation 0.80; covariance 0.32; probability 0.005.

Table 1. Categories of the environment in relation to the destined use according to UNI 11532-2:2020.

CATEGORY	Activities in the Environment	Methods of Intervention
A1	Music	Objective achieved with integrated design of geometry, furniture, residual noise control
A2	Spoken/conference	Objective achieved with integrated design of geometry, furniture, residual noise control
A3	Lesson/communication as speech and lecture	Objective achieved with integrated design of geometry, furniture, residual noise control
A4	Special classroom lecture/communication	Objective achieved with integrated design of geometry, furniture, residual noise control
A5	Sport	Objective achieved with integrated design of geometry, furniture, residual noise control
A6	Areas and spaces not intended for learning and libraries	Objective achieved with sound absorption and residual noise control

Table 2. Categories of the occupied environment in relation to the destined use according to UNI 11532-2:2020.

CATEGORY	Occupied Environment 80%
A1	${RT}_{ott}$ = (0.45 logV + 0.07) − $30 m^{3} \leq V < 1000 m^{3}$
A2	${RT}_{ott}$ = (0.37 logV − 0.14) − $50 m^{3} \leq V < 5000 m^{3}$
A3	${RT}_{ott}$ = (0.32 logV − 0.17) − $30 m^{3} \leq V < 5000 m^{3}$
A4	${RT}_{ott}$ = (0.26 logV − 0.14) − $30 m^{3} \leq V < 500 m^{3}$

Table 3. Reference values for C₅₀.

	<250 $m^{3}$
Without amplification system or with system off	>=2 dB

Table 4. Relation between STI and speech intelligibility according to UNI EN ISO 9921:2004, Table F.1.

Intelligibility Rating	Sentence Score %	STI
Excellent	100	>0.75
Good	100	0.60 to 0.75
Fair	100	0.45 to 0.60
Poor	70 to 100	0.30 to 0.45
Bad	<70	<0.30

Table 5. List of classrooms indicating name of the room, year of construction, location, main finishing materials, and mechanical ventilation system.

Name of Room	Location Characteristics	Floor	Walls	Ceiling	VMC System
140/1	Urban outskirts	Artificial stone tiles, PVC	Plasterboard	Suspended ceiling	-
140/2	Urban outskirts	Artificial stone tiles, PVC	Plasterboard	Suspended ceiling	-
140/3	Urban outskirts	Artificial stone tiles, PVC	Plasterboard	Suspended ceiling	-
155/D1	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
155/D2	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
155/D3	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
155/D4	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
160/1	Urban outskirts	Artificial stone tiles, PVC	Plasterboard	Suspended ceiling	-
160/2	Urban outskirts	Artificial stone tiles, PVC	Plasterboard	Suspended ceiling	-
AT1	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
AT2	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
AT3	Urban outskirts	PVC	Plasterboard	Suspended ceiling	-
EN1	Urban outskirts	Artificial stone tiles	Plasterboard	Suspended ceiling	-
EN3	Urban outskirts	Artificial stone tiles	Plasterboard	Suspended ceiling	-
S1	Urban outskirts	Artificial stone tiles	Gypsum board	Non-suspended ceiling	yes
S2	Urban outskirts	Artificial stone tiles	Gypsum board	Non-suspended ceiling	yes
S3	Urban outskirts	Artificial stone tiles	Gypsum board	Non-suspended ceiling	yes

Table 6. List of classrooms indicating name of the room and geometric dimensions.

Name of Room	Length [m]	Width [m]	Height [m]	Area [ $m^{2}$ ]	Volume [ $m^{3}$ ]
140/1	18.2	9.0	3.4	163.8	556.9
140/2	11.9	8.8	3.9	105.2	410.3
140/3	15.3	8.9	3.9	136.5	537.6
155/D1	12.3	8.9	3.4	109.4	371.9
155/D2	12.3	8.9	3.4	109.4	371.9
155/D3	12.3	8.9	3.4	109.4	371.9
155/D4	12.3	8.9	3.4	109.4	371.9
160/1	10.2	8.9	3.4	90.9	309.1
160/2	10.4	8.9	3.4	93.2	317.0
AT1	13.4	9.3	3.0	125.2	375.7
AT2	13.8	9.3	3.0	128.6	385.9
AT3	17.9	6.3	3.0	113.1	339.4
EN1	11.3	8.9	3.4	100.7	342.3
EN3	11.2	6.4	3.4	71.4	242.7
S1	7.5	7.4	3.0	55.5	166.5
S2	10.4	10.6	3.5	110.2	385.8
S3	14.3	16.2	3.0	231.7	695.0

Table 7. Measured RT₃₀, average RT₃₀, STI, and average STI and intelligibility rating for each classrooms.

	${RT}_{30}$ [s]							STI
	125 Hz	250 Hz	500 Hz	1 kHz	2 kHz	4 kHz	8 kHz	avg	P1	P2	P3	P4	avg	Rating
140/1	0.90	0.93	0.76	0.71	0.60	0.75	0.72	0.77	0.57	0.52	0.46	0.43	0.50	fair
140/2	0.51	0.86	0.97	0.72	0.50	0.77	0.57	0.70	0.52	0.40	0.37	0.34	0.41	poor
140/3	0.55	0.57	0.50	0.60	0.69	0.93	0.83	0.67	0.60	0.58	0.35	0.33	0.47	fair
155/D1	0.55	0.93	0.87	0.88	0.79	0.37	0.44	0.69	0.63	0.57	0.54	0.52	0.57	fair
155/D2	0.53	0.88	0.87	0.86	0.75	0.32	0.37	0.65	0.62	0.57	0.54	0.52	0.56	fair
155/D3	0.61	1.02	0.93	0.89	0.78	0.37	0.42	0.72	0.55	0.51	0.53	0.49	0.52	fair
155/D4	0.61	0.86	0.88	0.87	0.75	0.35	0.41	0.68	0.60	0.55	0.54	0.52	0.55	fair
160/1	0.67	1.19	0.81	0.81	0.71	0.65	0.61	0.78	0.59	0.42	0.51	0.39	0.48	fair
160/2	0.62	0.76	0.99	0.99	0.87	0.75	0.61	0.80	0.58	0.50	0.45	0.40	0.48	fair
AT1	0.42	0.85	0.74	0.72	0.82	0.95	0.26	0.68	0.73	0.61	0.59	0.45	0.60	fair
AT2	0.55	0.35	0.45	0.74	0.78	0.82	0.21	0.56	0.70	0.60	0.60	0.56	0.62	good
AT3	0.49	0.57	0.48	0.60	0.88	0.98	0.30	0.61	0.82	0.62	0.54	0.50	0.62	good
EN1	0.72	1.16	0.85	0.78	0.64	0.71	0.41	0.75	0.61	0.56	0.55	0.41	0.53	fair
EN3	0.71	0.99	0.88	0.77	0.63	0.68	0.38	0.72	0.56	0.52	0.49	0.43	0.50	fair
S1	2.40	1.80	1.70	1.93	1.94	1.66	1.24	1.81	0.29	0.25	0.28	0.29	0.28	bad
S2	2.30	1.90	1.72	1.94	1.87	1.66	1.30	1.81	0.28	0.26	0.28	0.27	0.27	bad
S3	2.26	1.72	1.75	1.98	1.97	1.68	1.25	1.80	0.25	0.28	0.29	0.30	0.28	bad

Table 8. Speech spectrum at 1m in front of the mouth of a male speaker according to UNI EN ISO 9921:2004, Table H.2.

Octave Band [Hz]	125	250	500	1000	2000	4000	8000
SPL@1m (dB)	62.9	62.9	59.2	53.2	47.2	41.2	35.2

Table 9. Ambient noise spectrum according to UNI EN ISO 9921:2004, Table H.1.

Octave Band [Hz]	125	250	500	1000	2000	4000	8000
SPL@1m (dB)	41	43	50	53.2	47	42	39

Table 10. MTI octave band weighting factors according to BS EN 60268-16.

Octave Band	125 Hz	250 Hz	500 Hz	1000 Hz	2000 Hz	4000 Hz	8000 Hz
$α_{m a l e}$	0.085	0.127	0.230	0.233	0.309	0.224	0.173
$β_{m a l e}$	0.085	0.078	0.065	0.011	0.047	0.095	-
$α_{f e m a l e}$	-	0.117	0.223	0.216	0.328	0.25	0.194
$β_{f e m a l e}$	-	0.099	0.066	0.062	0.025	0.076	-

Table 11. Results of the calculation of STI for P1 for AT1 classroom.

Octave Band [Hz]	125	250	500	1000	2000	4000	8000
$α_{k, m a l e}$	0.085	0.127	0.230	0.233	0.309	0.224	0.173
${MTI}_{k} * α_{k, w}$	0.054	0.090	0.163	0.157	0.207	0.118	0.071
$β_{k, m a l e}$	0.085	0.078	0.065	0.011	0.047	0.095	0.000
${MTI}_{k} * β_{k, w}$	0.054	0.055	0.046	0.007	0.032	0.005	0.000
$\sum α_{k} * M T I_{k}$	0.860
$\sum β_{k} * M T I_{k}$	0.244
STI (P1)	0.62

Table 12. Results of the calculation of the STI for P1, P2, P3, P4 and the STI mean for the AT1 classroom.

STI (P1)	STI (P2)	STI (P3)	STI (P4)	STI (avg)
0.62	0.55	0.53	0.53	0.55
STI mean with measurement uncertainty			Speech quality in accordance with CEI EN 60268-16
0.53			Fair

Table 13. Comparison between the results of the STI measured in each classroom and the STI calculated according to the predictive method described in IEC 60268-16.

Classroom	${STI}_{avg}$ -Measured	Intelligibility Rating According to ISO 9921	${STI}_{avg}$ -Calculated	Intelligibility Rating According to ISO 9921
140/1	0.50	fair	0.46	fair
140/2	0.41	poor	0.51	fair
140/3	0.47	fair	0.46	fair
155/d1	0.57	fair	0.50	fair
155/d2	0.56	fair	0.49	fair
155/d3	0.52	fair	0.49	fair
155/d4	0.55	fair	0.49	fair
160/1	0.48	fair	0.55	fair
160/2	0.48	fair	0.56	fair
AT1	0.60	fair	0.49	fair
AT2	0.62	good	0.49	fair
AT3	0.62	good	0.47	fair
EN2	0.53	fair	0.52	fair
EN3	0.50	fair	0.39	poor
S1	0.28	poor	0.30	poor
S2	0.27	poor	0.30	poor
S3	0.28	poor	0.29	bad

Table 14. Results of the correlation between measurement and predicted STI.

SSE	R-Square	Adjusted R-Square	RMSE
0.022	0.7993	0.7458	0.0043

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Di Loreto, S.; Serpilli, F.; Lori, V.; Di Perna, C. Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems. Energies 2023, 16, 2719. https://doi.org/10.3390/en16062719

AMA Style

Di Loreto S, Serpilli F, Lori V, Di Perna C. Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems. Energies. 2023; 16(6):2719. https://doi.org/10.3390/en16062719

Chicago/Turabian Style

Di Loreto, Samantha, Fabio Serpilli, Valter Lori, and Costanzo Di Perna. 2023. "Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems" Energies 16, no. 6: 2719. https://doi.org/10.3390/en16062719

APA Style

Di Loreto, S., Serpilli, F., Lori, V., & Di Perna, C. (2023). Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems. Energies, 16(6), 2719. https://doi.org/10.3390/en16062719

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Comparison between Predictive and Measurement Methods of Speech Intelligibility for Educational Rooms of Different Sizes with and without HVAC Systems

Abstract

1. Introduction

2. Materials and Methods

2.1. Reference Value for RT, STI, and C50

2.2. Room Descriptions and Measurements

2.3. STI Prediction Using Indirect Method

3. Results

3.1. Evaluation of Speech Intelligibility through the Predictive Method

3.2. Correlations between Measurments and Predictive Methods

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI