1. Introduction
Immersive virtual environments or IVEs are a set of hardware and software applications that allow users to immerse themselves in artificially constructed virtual environments and interact with their contents in real-time. IVEs provide users the impression of being inside and living in a virtual environment [
1]. As such, IVEs have attracted significant research attention concerning their potential in replicating real-world experiences. Many IVE-based studies on building and urban design and operation have been reported, including daylight simulation [
2,
3,
4], energy studies [
5], sound and acoustic studies [
6,
7], and space design and wayfinding studies [
8]. These studies utilized IVEs in experiments to collect human-based data. Specifically, some studies used IVEs to investigate the thermal state of participants, i.e., their thermal sensation, thermal comfort, and thermal acceptability (e.g., [
9,
10,
11,
12,
13]). The human thermal states play a significant role in studying building performance in terms of energy consumption [
14]. Depending on the thermal states, the occupants may interact with the building components such as changing the thermostat settings or opening/closing windows, leading to changes in building energy use and impacting overall building performance [
15]. Thus, an improved understanding of the human thermal states during the design phase of a building is essential for the building to meet its energy performance expectation during the operational stage. However, collecting thermal state data during design is challenging as it requires an existing environment that could closely replicate the desired design. Constructing such a design may not always be practical as it is time-consuming and expensive. IVEs, on the other hand, might be utilized as a tool to quickly and cost-effectively mimic designs. In addition, IVEs provide engineers or researchers the capability to immerse humans in different scenarios and examine their reactions to controlled environmental changes [
12,
16]. Thus, IVEs may become a new way of studying human–building interactions and thermal states in built environments that do not exist yet [
11,
17]. However, IVE-based experiments are often limited by a range of factors [
18], one of which is how to handle the potential influence of the outdoor temperature variations, such as seasonal temperature variations, on the thermal state of participants.
The impact of outdoor temperature variations on human physiology and comfort has drawn significant research interest in the past. For example, studies have shown seasonal effects on human physiological responses such as sweat rate, rectal temperature, and metabolic rate when participants were exposed to identical experimental scenarios in different seasons (e.g., [
19,
20]). Given the connection between physiological responses and thermal sensation and comfort [
21,
22], several studies (e.g., [
23,
24,
25,
26,
27]) have concluded that differences exist in occupant’s thermal sensation between different outdoor temperature conditions (i.e., summer and winter). Such findings align with previous studies that show a close link between thermal states and seasons or variations of outdoor temperatures (e.g., [
28,
29]). Although this line of research is still ongoing and different observations were reported (e.g., [
30]), all studies pointed out the effect of outdoor temperature variations on human physiology. However, the results mainly differ on the sensitivity of physiological measures to outdoor temperature variations and the extent to which outdoor temperature variations influence the thermal states.
In our review of existing literature, there is no study discussing the effect of outdoor temperature variations on the results of thermal-related IVE experiments. Conversely, IVE experiments may happen at any time of the year. Thus, we conducted an initial investigation into the effect of two contrasting outdoor temperature conditions (i.e., cold and warm outdoor temperature conditions) on the thermal state of participants in IVE experiments and compared the results with the baseline in situ experiments. Three hypotheses are proposed across four outdoor temperature comparisons in which the participant’s physiological and thermal state responses were used as indicators to quantity their thermal experience. The hypotheses and the outdoor temperature comparisons are elaborated in detail in
Section 3.
3. Research Objectives and Hypotheses
The objective of this study is to investigate the impact of outdoor temperature variations on the thermal states of participants in IVE experiments. To achieve the objective, four comparisons are selected, where the in situ setting is considered a baseline.
Comparison 1 (C1): Cooling in warm outdoor conditions for IVE experiments (match) vs. cooling in warm outdoor conditions for in situ experiments (match);
Comparison 2 (C2): Heating in cold outdoor conditions for IVE experiments (match) and heating in cold outdoor conditions for in situ experiments (match);
Comparison 3 (C3): Cooling in warm outdoor conditions for in situ experiments (match) vs. cooling in cold outdoor conditions for IVE experiments (mismatch);
Comparison 4 (C4): Heating in cold outdoor conditions for in situ experiments (match) vs. heating in warm outdoor conditions for IVE experiments (mismatch).
The selection is based on the assumption that cooling typically happens in warm outdoor conditions, such as during summer, and heating happens more in cold outdoor conditions, such as during winter. These two are called matching conditions. Therefore, comparisons C1 and C2 address two matching conditions, under which results from IVE experiments are compared with those from the baseline, i.e., in situ experiments. On the other hand, IVE experiments can be done in mismatching conditions, such as cooling in cold outdoor conditions and heating in warm outdoor conditions. The mismatch conditions represent the possibility, and the flexibility, of IVE experiments to be conducted in conditions that the outdoor temperature may influence. Comparisons C3 and C4 address two such mismatch conditions, under which IVE experiments are conducted, such as cooling in cold outdoor conditions (C3) and heating in warm outdoor conditions (C4). Comparison C3 happens when an IVE experiment on cooling is conducted in cold outdoor conditions (e.g., winter). For example, an experiment is conducted to determine an ideal indoor cooling temperature set-point for a west-orientation office during summer. Instead of doing the experiment in summer, an IVE experiment may be conducted on a cold winter day. Comparison C4 is about conducting an IVE experiment on heating on a hot summer day. For example, an experiment may be about determining the thermal state of participants using a new personalized heating system when the outdoor temperature is below freezing. Instead of doing such an experiment in winter, an IVE experiment may be conducted on a hot summer day. In the two comparisons C3 and C4, the baseline is matching in situ experiments. Therefore, the comparisons will provide insight into the impact of outdoor temperature variations on IVE experiments. Although results from C1 and C2 cannot directly reflect the effects of outdoor temperature variations, as they only explore the effectiveness of IVE experiments for heating and cooling sequences in the same outdoor temperature condition, they provide support and background information to understand and interpret results from C3 and C4, which study the efficacy of IVE experiments when outdoor temperature variations exist.
Results of IVE experiments are compared with those of in situ experiments using three metrics, the perceived temperature, the thermal state votes, and the overall skin temperature of participants. The first metric assesses the measured indoor temperature (called the control temperature distribution in this study) around a participant when a thermal state vote is recorded. The indoor temperature around participants is measured by the same temperature sensor used in this study [
13]. The second metric measures the participant’s thermal state votes at a temperature range, otherwise known as the thermal state vote distribution in this study. Finally, since the human body controls skin temperature to balance heat transfer to the surrounding environment [
50,
51,
53,
67,
68,
69], the third metric is the overall skin temperature of participants. Consequently, we proposed the following hypotheses:
The control temperature in IVE experiments does not differ significantly from in situ experiments at each level of a thermal state scale in each of the four comparisons.
where,
denotes the control temperature at a specific level of a thermal state scale (i.e., sensation, comfort, or acceptability) in the in situ experiments of a particular comparison condition, and
denotes the control temperature at a specific level of a thermal state scale in the IVE experiments of a particular comparison condition.
The thermal state vote in IVE experiments does not differ significantly from in situ experiments at a specific indoor temperature range in each of the four comparisons.
where,
denotes the thermal state vote at a specific indoor temperature range (i.e., cool, neutral, and warm) in the in situ experiments of a particular comparison condition, and
denotes the thermal state vote at a specific indoor temperature range in the IVE experiments of a particular comparison condition.
The overall skin temperature in IVE experiments does not differ significantly from in situ experiments at a specific indoor temperature range in each of the four comparisons.
where,
denotes the mean of the overall skin temperature on the body at a specific indoor temperature range (i.e., cool, neutral, and warm) in the in situ experiments of a certain comparison condition, and
denotes the mean of the overall skin temperature on the body at a specific indoor temperature range in the IVE experiments of a certain comparison condition.
The cool, neutral, and warm indoor temperature ranges were selected based on the ASHRAE psychrometric chart [
70], where cool temperature range falls below 22 °C, neutral temperature range falls between 22 °C and 27 °C, and warm temperature range falls above 27 °C in both cold and warm outdoor conditions. Consequently, the neutral temperature range represents the comfort zone in both outdoor conditions, whereas cool and warm temperature ranges represent the thermal conditions outside the comfort zone.
4. Research Methods
4.1. Recruitment
The Institutional Review Board (IRB) of the university approved this study prior to the recruitment. Participants in this research included undergraduate students, graduate students, and staff members. Participants were recruited using flyers and word of mouth. Incentives such as payments ($10 per hour) were provided to the participants to participate in this study.
4.2. Climate Chamber
A climate chamber (shown in
Figure 1) situated on the university’s campus was used for IVE and in situ experiments. The chamber is a refurbished office space that includes a testing space, control and resting space, and a mechanical room that is also used as storage. The walls and ceiling of the testing space are constructed with a wood joist framework, and plywood decking is used on top. The insides of the walls are lined with moisture-resistant gypsum boards. The capacity and performance of the chamber were tested before conducting the experiments. The chamber can simulate a wide variety of climatic conditions, including temperatures as low as 60 °F/15 °C and as high as 90 °F/32 °C, with relative humidity levels ranging from 40% to 90% RH. A thermostat and a humidity sensor linked to the chamber’s HVAC system control the room temperature and humidity, which can be set and adjusted using the chamber control software provided by Metasys
®. In addition, the chamber’s HVAC system also maintains and monitors the CO
2 level.
4.3. Data Collection Tools
4.3.1. Indoor Temperature
The indoor temperature surrounding the participants was measured using Vernier surface temperature sensors (description in
Table 1). Three of these sensors were mounted at ASHRAE’s suggested heights of 4/0.1, 24/0.6, and 43/1.1 inches/meters from the floor [
45]. Among them, the sensor placed at the height of 24/0.6 inches/meters was chosen as the closest-participant air temperature (i.e., the control temperature) (
Figure 2) [
71]. Temperature data was sent at one-second intervals and recorded using Logger Pro 13. Furthermore, outdoor air temperature data were recorded just before the start of each experimental session. The source of such data is the Integrated Surface Hourly Database at the NOAA’s National Climate Data Center (NCDC) [
72].
4.3.2. Overall Skin Temperature
The same surface temperature sensors (shown in
Table 1) were used to collect the overall skin temperature, and the data were captured using the same Logger Pro 13 application at one-second intervals. The sensors were placed at eight body locations of participants for sampling their skin temperatures, i.e., forehead, neck, chest, upper back, posterior forearm, hand, anterior calf, and foot. The weighting factors of the mean skin temperature equations and the sensitivity of the body parts [
50,
52,
73,
74,
75,
76] were used to choose these local body regions.
The overall skin temperature (OST) was then calculated using the following formula:
where,
is the chest skin temperature,
is the upper back skin temperature,
is the forearm skin temperature,
is the hand skin temperature,
is the calf skin temperature and
is the foot skin temperature.
This formula was derived based on the 17-point mean skin temperature formula [
77]. The following steps explain the derivation method:
Out of 17 skin locations in the reference formula, the weighting factors of six locations i.e., chest, upper back, forearm, hand, calf, and foot, were selected.
The results from a previous study [
71] showed that the temperature on the forehead was significantly different between IVE and in situ experiments because of the use of the HMD device. Thus, to avoid the HMD’s impact on the overall skin temperature results, the final derived equation did not include the forehead skin temperature to ensure comparability between IVE and in situ experiments.
The final derived equation did not include the neck skin temperature because the reference formula did not consider the neck.
The adjusted weighting factors shown in equation 1 were then obtained by dividing each of the original six weighting factors by the total sum of those same six weighting factors.
4.3.3. Surveys
In this study, several questionnaire data were collected using the Qualtrics online software:
Demographics: Age, gender, education level, and employment status were collected only on the first study visit.
General information: Information such as food and beverage intake, cigarette smoking within the past hour before the experiment, alcohol intake, and intense physical activity within the past 12 h before the experiment was collected before each experimental trial. This information was used for the pre-screening of the participants.
Thermal states: The participant’s thermal sensation was measured using the ASHRAE Standard 55 Thermal Comfort descriptive seven-point Likert scale [
45,
46]. The general thermal comfort and acceptability were measured using six-point Likert scales.
Table 2 shows the thermal state scales.
The general thermal acceptability scale was then grouped into two categories by combining the negative scales into the “Not Acceptable” group and the positive scales into the “Acceptable” group.
- 4.
Igroup Presence Questionnaire (IPQ): The participant’s virtual experience, particularly presence in IVE, was collected using the IPQ questionnaire [
60]. As mentioned earlier, four sub-measures are included in this 13-question survey, and they are general presence (one question), spatial presence (four questions), involvement (four questions), and experienced realism (four questions). A five-point scale (strongly disagree (1) to strongly agree (5)) is used to assess all four sub-measures. Each sub-measure score is calculated by summing the answers to the individual sub-measure questions and then translating the total into a percent. As a result, the score varies from 20 to 100 for each sub-measure, respectively. A high score suggests that the individual has a higher presence in IVE, and the lower score indicates a low presence in IVE.
- 5.
Simulator Sickness Questionnaire (SSQ): The participant’s motion sickness or cybersickness while using IVE was collected using the SSQ questionnaire [
62]. As mentioned earlier, three sub-measures are included in this 16-question survey, and they are nausea (seven questions), oculomotor (seven questions), and disorientation (seven questions). The three sub-measures are measured using a four-point scale (none (0) to severe (3)). Each sub-measure score is determined as follows: Nausea is determined as [X] × 9.54, where [X] is the sum of all the responses to nausea; oculomotor is determined as [Y] × 7.58, where [Y] is the sum of all the responses to oculomotor; disorientation is determined as [Z] × 13.92, where [Z] is the sum all the responses to disorientation. Finally, the score for Total Cybersickness is determined as [X] + [Y] + [Z] × 3.74. A high score in each sub-measure implies that the individual is less at ease in IVE.
4.4. Immersive Virtual Environment
An HTC Vive head-mounted display device (HMD) was used to deliver immersive virtual environments. Autodesk 3ds Max was used to create the chamber’s 3D model. As illustrated in
Figure 3, the model was then imported into Unreal Engine 4 with the material textures and lightmaps. In an IVE, users view the virtual world of the chamber interior using a head-mounted display (HMD), while the climate chamber provides space heating and cooling. This study did not include any interactions with the virtual environment.
4.5. Experimental Procedure
4.5.1. Pre-Experiment Session
The participants were instructed to wear a previously specified piece of clothing (clo of 0.5-0.6) to the lab in all sessions after signing their consent forms, which included light slacks and a light long-sleeved shirt or a T-shirt. After arriving at the chamber, the participants sat in the control/resting space (temperature set at 75 °F/23.8 °C and humidity set at 55%) where a pre-experiment general survey was administered to screen them for alcohol or cigarette intake, as well as for any strenuous physical activities at least 12 h before each experimental session. If they were found to be having any of the above, they were excluded from the experiments. Their demographics and other personal information were also recorded. This pre-experiment session, including resting time, took about 10 min. The session was also intended to help participants acclimatize to the chamber’s indoor temperature to mitigate any influence from their prior thermal state. Afterward, participants were instructed to get into the testing space, sit in a chair, and have the skin temperature sensors affixed to their bodies.
4.5.2. Experimental Session
This study used the same experimental approach as the previous one [
71]. Data were collected in two outdoor temperature conditions, one in the cold condition (December–March) and the other in warm (May–September). The experimental design procedure is shown in
Figure 4. Each participant partook in two experimental settings (i.e., IVE and in situ) conducted in cold or warm conditions. Each experimental setting included a heating or a cooling sequence. The experimental sessions of IVE and in situ were carried out consecutively on the very day. In contrast, the experimental sessions of the same participants using the heating or the cooling sequence were at least two weeks apart. The heating sequence involved three temperature steps, which were 65 °F/18.3 °C, 75 °F/23.8 °C, and 85 °F/29.4 °C. The cooling sequence also involved three temperature steps in reverse, which were 85 °F/29.4 °C, 75 °F/23.8 °C, and 65 °F/18.3 °C. For instance, in a heating sequence, the temperature is first set at 65 °F/18.3 °C, and then the indoor temperature is monitored until it stabilizes at that set-point. After the temperature becomes stable, the thermal states of participant’s are recorded, and the temperature is changed from 65 °F/18.3 °C to 75 °F/23.8 °C. Later, when the temperature becomes stable at 75 °F/23.8 °C, the thermal states are again recorded, and the temperature is adjusted to 85 °F/29.4 °C, and the same data collection procedure is followed. On the other hand, an exact but reverse order is observed in the cooling sequence where the temperature is initially set at 85 °F/29.4 °C, then decreased to 75 °F/23.8 °C, and then to 65 °F/18.3 °C after collecting thermal state data at each of those step temperatures. Overall, four experimental sessions were conducted in both cold and warm conditions, and these were (1) in situ experiments involving heating sequence, (2) IVE experiments involving heating sequence, (3) in situ experiments involving cooling sequence, and (4) IVE experiments involving cooling sequence.
In order to counterbalance and reduce the order effect, participants were allocated randomly to each of the experimental sessions. The heating sequence was followed by cooling for 50 percent of participants, while the cooling sequence was followed by heating for the other 50 percent. In a similar fashion, 50 percent of the participants did the in situ experiment first, followed by the IVE experiment. In addition, between every IVE and in situ experimental session, there was a 10-minute break. The sensors were detached from the participant’s bodies so that they could move outside to the resting space and get acclimatized to the comfort temperature (75 °F23.8 °C). This was done to mitigate the influence of their thermal states on the next experimental session. The skin temperatures were constantly measured at one-second intervals from the beginning through the completion of each experimental session. The participants’ thermal state responses were recorded after the indoor temperature stabilized around each step temperature. An identical data collection approach was used for both IVE and in situ experiments. During the IVE experimental session, the participants used the HTC device to view the virtual environment. Before each IVE trial, there was a familiarization step where the participants explored the virtual scene for a few minutes to calm their excitement and anxiety. The participants’ tasks were restricted to sedentary levels of physical activity, such as seated at rest. The surface skin temperature sensors were directly taped on the skin. The indoor relative humidity was kept at 55% during the tests, while CO2 levels were kept below 1000 parts per million.
4.5.3. Post-Experiment Session
After completing an experimental session, the participants completed the post-IVE experiment surveys inside the chamber. These surveys consisted of the IPQ and SSQ questionnaires to measure the participant’s sense of presence and cybersickness in IVEs, respectively. No post-experiment surveys were conducted after in situ experiments.
4.6. Data Preparation and Cleaning
After data collection, the average of indoor control temperature (i.e., data from sensor at 24/0.6 inches/meters) and skin temperature were computed by considering the data from the exact starting time when the indoor temperature had reached the desired level (i.e., steadied at target temperature) to the completion of the thermal state questionnaires. All data were synchronized using the universal time stamp. For analyzing the control temperature at each level of a thermal state scale, the data were grouped according to the number of scales of a specific thermal state level (e.g., seven groups for sensation, two groups for acceptability, and six groups for comfort). For analyzing thermal state vote responses and overall skin temperatures, the data were grouped into three categories, i.e., cool, neutral, and warm based on comfort zone and outside comfort zone indoor temperature ranges shown in the ASHRAE psychrometric chart. According to this chart, for both cold and warm outdoor conditions, the indoor comfort zone falls between 22 °C and 27 °C and outside comfort zone falls below 22 °C and above 27 °C. As a result, in this study, the cool category (outside comfort zone) includes all the observations below 22 °C, and the warm category (outside comfort zone) includes all the observations above 27 °C. The neutral category (comfort zone) includes all the observations between 22 °C and 27 °C.
Since C3 and C4 compare cold and warm outdoor temperature conditions, data cleaning was necessary to ensure contrasting outdoor temperature conditions. The local outdoor air temperature typically varies from 6.1 °C/43 °F to 32.7 °C/91 °F annually, and the hottest periods generally run from mid-June to mid-September, while December through mid-February are, on average, the coldest months in any year [
78]. Therefore, the data collected from December to March were considered cold conditions, and data collected from May to September were considered warm conditions. Due to the limitations of the experiment location’s outdoor temperature conditions, such as long, hot summers and brief, moderate winters [
72], the outdoor temperatures were identical during some of the experiments conducted in the December–March period and May–September period. Therefore, it was necessary to clean the data to ensure that each participant’s cold and warm condition data were collected in two contrasting outdoor temperature conditions. As such, this study assumed an upper-bound outdoor temperature of 70 °F/21.1 °C for the cold condition and a lower-bound outdoor temperature of 80 °F/26.6 °C for the warm condition. These thresholds were chosen based on references to the local average temperature in March and May. Therefore, if the corresponding outdoor temperature during experiment sessions in cold conditions was greater than 70 °F/21.1 °C, that particular data was removed. Likewise, if the corresponding outdoor temperature during experiment sessions in warm conditions was less than 80 °F/26.6 °C, then that particular data was removed. It is to be noted that the data cleaning was not performed for comparisons C1 and C2 because, in these comparisons, the in situ and IVE data were compared within the respective seasonal period. After cleaning, 42 observations (out of 270) were removed from comparison C3, and 69 observations (out of 305) were removed from comparison C4.
Appendix A Table A1 demonstrates that, after cleaning, the mean outdoor temperature between the warm and cold conditions differed by at least 27 °F/15 °C.
Furthermore, after cleaning the data in comparisons C3 and C4, the mean control temperature at the three indoor temperature ranges (i.e., cool, neutral, and warm) was compared in all four comparisons to ensure that the mean control temperature in the IVE experiments does not differ significantly from the in situ experiments (e.g., the cooling sequence in the warm outdoor condition/cool indoor temperature range in in situ vs. the cooling sequence in the warm outdoor condition/cool indoor temperature range in IVE). A two-tailed pairwise T-test was used for comparison C1, and an independent sample T-test was used for comparisons C2, C3, and C4. The tests revealed that all the p-values were non-significant (p > 0.05) in comparisons C1 and C2, indicating that under the same outdoor temperature conditions, the control temperature of the IVE experiments was comparable with in situ experiments. However, there were two cases, C3 at the cool indoor temperature range and C4 at the neutral indoor temperature range, where the p-values were less than 0.05. The tests suggest that the control temperature of the IVE experiments differed significantly from that of the in situ experiments in those two cases.
4.7. Statistical Analysis and Sample Size
To analyze the mean control temperature at each level of a thermal state scale (i.e., first hypothesis), two-tailed pairwise T-tests were used for comparison C1 because of paired data, and independent sample T-tests were used for comparisons C2, C3, and C4 because of unpaired data. At the same time, to analyze the overall skin temperature at the three indoor temperature ranges (i.e., third hypothesis), two-tailed pairwise T-tests for comparison C1 and independent sample T-tests for comparisons C2, C3, and C4 were used. On the other hand, to analyze the thermal state vote responses at the three indoor temperature ranges (i.e., second hypothesis), two-tailed Wilcoxon Signed Rank tests for comparison C1 and Rank Sum tests for comparisons C2, C3, and C4 were used. The statistical tests were performed using the statistical software package RStudio [
79] with a significance level set at α < 0.05.
Seventy-two people were recruited for this study, including thirty-seven male and thirty-five female participants. The sample size supports a commonly acceptable statistical power of 0.8 [
80], with the following differences in the sample means between the IVE and in situ experiments: 1 scale point for average thermal state vote, 1 °C for overall skin temperature, and 1.7 °C for control temperature. Out of seventy-two people, 50% (n = 36) were White, 25% (n = 18) were Asian, and 25% (n = 18) were from other ethnicities. The mean and standard deviation of the participants’ ages were 24.01 and 5.83, respectively. When classified according to BMI, 65.2% of participants were normal (n = 47), 16.6% (n = 12) were obese, 22.2% (n = 16) were overweight, and 2.7% (n = 2) were underweight.
6. Discussions and Limitations
The analysis suggests that, at the same level of participants’ thermal sensation, acceptability, and comfort vote, the perceived indoor temperature is comparable between IVE and in situ experiments when they are performed in the same outdoor temperature conditions (i.e., C1 and C2) as well as in variable outdoor temperature conditions (i.e., C3 and C4) (
Table A2). Similar results have also been observed when the perceived indoor temperature of IVE and in situ experiments were compared using the overall distribution of participants’ thermal sensation, acceptability, and comfort vote across all four comparisons (C1, C2, C3, and C4) (
Table A3). In other words, these results show that the outdoor temperature variations, specifically in comparisons C3 and C4, did not impact the participants’ perceived indoor temperature at each thermal state level in IVE experiments when compared with the baseline in situ experiments. The experimental procedure can create comparable indoor temperature conditions regardless of the outdoor temperature conditions (e.g., C3 and C4), and such comparability is not affected by the difference in the nature of comparisons, i.e., C1 and C2 versus C3 and C4.
Using ASHRAE’s indoor temperature ranges (i.e., cool, neutral, and warm) to organize and compare thermal state votes has produced mixed results (
Table A4). On the one hand, the results of thermal sensation votes were comparable between IVE and in situ experiments at all three temperature ranges when those experiments were performed in the same outdoor temperature conditions (i.e., C1 and C2) as well as in variable outdoor temperature conditions (i.e., C3 and C4). This result suggests that the outdoor temperature variations, specifically in comparisons C3 and C4, did not impact the participants’ thermal sensation votes in IVE experiments when compared with the baseline in situ experiments. On the other hand, when the IVE and in situ experiments were conducted in the same outdoor temperature conditions, i.e., C1 (cooling in warm outdoor conditions) and C2 (heating in cold outdoor conditions), the thermal acceptability votes were comparable at all three temperature ranges in only C2, and the thermal comfort votes were comparable at all three temperature ranges in both C1 and C2. Furthermore, in C4, the thermal acceptability and comfort votes in the IVE experiments conducted in heating in warm outdoor conditions were comparable to the votes of the in situ experiments conducted in heating in cold outdoor conditions at all three temperature ranges. In other words, the outdoor temperature variations did not impact the participants’ thermal acceptability and comfort votes in IVE experiments. However, discrepancies in the thermal comfort and acceptability votes were observed in only a few cases (four out of twenty-four cases) in both C1 (same outdoor temperature condition) and C3 (different outdoor temperature condition). These discrepancies may not be simply attributed to control temperature differences. For example, even though the control temperature in C3 (the cool indoor temperature range) and C4 (the neutral indoor temperature range) is significantly different between IVE and in situ experiments, thermal comfort, and acceptability votes were different only in C3, not in C4, implying that the control temperature difference should not be the sole factor influencing thermal state votes.
Analysis of the overall skin temperature reveals that it is comparable between IVE and in situ experiments when they are performed in the same outdoor temperature conditions (i.e., C1 and C2) and variable outdoor temperature conditions, but only in C4 at all three indoor temperature ranges (
Table A5). These results suggest that the outdoor temperature variations, specifically in comparison C4 (i.e., cooling in warm outdoor conditions for in situ experiments vs. cooling in cold outdoor conditions for IVE experiments), did not affect the participant’s overall skin temperature. However, in the variable outdoor temperature conditions of comparison C3 (i.e., cooling in warm outdoor conditions for in situ experiments vs. cooling in cold outdoor conditions for IVE experiments), the overall skin temperatures were different between IVE and in situ experiments at all three indoor temperature ranges. These results suggest that the outdoor temperature variations in comparison C3 may have impacted participants’ overall skin temperature. Further investigation shows significant mean skin temperature differences at three locations (forearm, hand, and foot) at all three temperature ranges in C3. Therefore, these individual differences may have influenced overall mean skin temperature differences in C3. Contrastingly, in C4, significant differences were only observed in the upper back, forearm, and hand skin temperatures at a neutral indoor temperature range. Nonetheless, these individual differences did not influence the overall mean skin temperature results in C4. These observations suggest that more studies are needed on the relationship between local and the overall skin temperatures and the choice of the overall skin temperature analysis method. It is also worth noting that, in general, irrespective of IVE and in situ experiments, higher local and overall skin temperatures were observed in the warm outdoor condition (
Table A6), suggesting the potential impact of outdoor temperature on the skin temperatures. This finding is supported by previous non-IVE studies conducted in a climate chamber, which reported higher mean skin [
83] and finger skin temperatures [
84] in summer than in winter. Thus, it seems that the overall skin temperature is more sensitive to outdoor temperature than thermal state votes.
The presence and cybersickness scores were comparable to other reported studies, except for the oculomotor score (
Table A7). The scores suggest that the virtual environment is overall adequate to support this study. The oculomotor metric measures the “eyestrain, difficulty to focus, blurred vision, and headache” [
62]. In this study, since the visual aspect was not the focus and participants’ activities did not involve visual tasks, the impact of the oculomotor metric seems negligible. On the other hand, it is not clear if metrics of cybersickness are associated with physiological responses, such as skin temperatures, because cybersickness is a controlled variable. So, it is unknown if any of the local skin temperature differences between IVE and in situ experiments can be explained by cybersickness.
Despite these significant findings, the results may have been influenced by a few limitations. First, the experiments were performed in a humid subtropical climate, with long, hot summers and short, moderate winters [
72]. Therefore, the results are limited to such outdoor temperature conditions. Future IVE studies can be performed to validate the findings in different climatic regions. Second, only skin temperatures were used as a physiological response to the thermal conditions. The results show that skin temperatures are more sensitive to outdoor temperature variations. The finding is helpful for future experimenters to choose metrics and mitigate the influence, but the study was not focused on understanding the pathway from outdoor temperature, skin temperature, and thermal states. As thermal states involve psychological factors, separate analyses are needed to understand the pathway. In addition, apart from skin temperatures, other physiological indicators of thermal states, such as heart rate, may be considered in future research. Third, the influence of visual stimuli such as simulating summer and winter scenes (e.g., snow) in IVE experiments may generate different responses [
40,
85]. So, the relationship between the visual stimuli, outdoor temperature, and physiological responses in both IVE and in situ experiments needs to be investigated further. Fourth, this study focused on investigating the effect of outdoor temperature variations on the participants’ thermal states as a whole in IVE experiments and did not investigate the individual characteristics such as gender, age, and ethnicity. Thus, future IVE studies may consider the combined effect of outdoor temperature variations and the individual characteristics on thermal states. Fifth, there is evidence that the radiation from interior surfaces, such as walls, affects human physiology and thermal states [
86,
87]. Therefore, future IVE studies may consider investigating the effect of surface radiation on human physiology and thermal states under different outdoor temperature conditions. Lastly, the focus of this study is limited to a thermal state analysis. The impact of outdoor temperature conditions on adaptive behavior in IVE experience, in comparisons such as C3 and C4, is not explored.