1. Introduction
In recent years, sensor technology has significantly improved and wearable devices have become increasingly popular, allowing easily registering subjects’ physiological responses during their daily activities [
1,
2]. Physiological signals are successfully used to measure arousal [
3,
4,
5]. Arousal is an uncontrolled human reaction, related to attention and cognitive alertness, activated by stimuli that require high psycho-physical engagement, and thus activated in particular during cognitive tasks and stressful conditions. Although it has been proven that, sometimes, stress can have a positive effect on a person by improving his/her alertness state or his/her ability to react [
6], it has also been proven that a high and continuous level of stress or cognitive load can affect the physical and mental well-being of the subject. Illnesses such as depression, anxiety, and sleep disorders are, indeed, often due to excessive stress or workload [
7].
In view of its importance, the automatic recognition of stress and excessive cognitive load has recently become an object of study, even in different application areas. For instance, systems able to recognize emotion and, above all, stress experienced by subjects can be used in working or academic environments [
8] in order to monitor and identify the emotional state of employees or students. In this regard, it has been proven that a high level of stress or cognitive load due to excessive workload can increase the level of fatigue, decrease the subject’s working capability, and consequently, bring physical and mental illness, which can lead to workplace absence [
9]. Similarly, automatic stress recognition systems can be used in the context of vehicle driving for the detection of excessive mental fatigue states, which can reduce a person’s driving skills [
10]. Finally, algorithms of stress detection can also be used in recreational areas for the development of systems able to modify their parameters based on the user’s emotional state. Concrete examples concern music-retrieval systems able to interact with a user to suggest a music playlist using both external inputs and his/her physiological signals [
11] or video games in which some internal game parameters, such as the difficulty, are set based on the player’s emotions and stress level [
12]. Moreover, systems able to recognize the subject’s emotional state can also be involved in the medical area. For instance, emotion recognition systems can be used to monitor the health state of convalescent patients [
6] or to help elderly subjects during their daily activities [
13,
14]. In all of these contexts, the development of systems able to recognize, interpret, and simulate human affect can be seen as a necessary step to make technologies user friendly and able to interact actively with people.
Several works in the literature have focused their attention on the analysis of physiological signals as an honest indicator of humans’ emotion and mood [
3,
4]. In particular, the heart rate of a person is positively correlated with the perceived level of arousal and stress [
15], thus resulting in being promising in the recognition of this specific emotional state [
16]. Several stressors have been adopted in the literature to elicit arousal and mental stress [
17]. Among them, mental arithmetic calculus or the Stroop test has been applied as a stressor in numerous experiments to evoke a high level of cognitive load in the participants [
18]. In many of these studies, the use of physiological signals related to heartbeat has allowed positive performance both in statistical [
19,
20] and classification analysis [
21,
22,
23].
Despite the progress in sensor technology and the relative simplicity of acquiring physiological signals from the human body, there are still some critical issues that must be addressed to fully exploit the potential that the analysis of physiological signals can offer. Although it is currently easier to acquire data, which can overcome the low cardinality of datasets, the application of machine learning techniques is still limited by inter-subject heterogeneity. Even in the same resting condition, without external stimuli, physiological signals appear to be significantly subject-dependent.
In this work, the reduction of inter-subject heterogeneity was faced in the case of the heartbeat, mainly detected through Photoplethysmography (PPG) [
24]. The PPG signal is one of the most-used signals to measure arousal [
5,
25] and, consequently, to detect stress and evaluate the experienced cognitive load [
20,
26]. The PPG signal of each subject appears different, both in terms of amplitude and beat frequency. Regarding the amplitude, differences can be due to the subjects’ skin characteristics or to different sensor adherence during the acquisition phase. Concerning the diversity in terms of heartbeat, according to the American Heart Association, the heartbeat frequency of a resting adult can vary in the range between 60 and 100 beats per minute, and it depends on many different factors, both personal (such as age, sex, ethnicity, sports ability, diet, illnesses, prescribed medications, etc.) and environmental (humidity, temperature, etc.) [
27]. Normalization procedures based on data rescaling are often used [
28] to overcome amplitude variability within subjects. The main strategies adopted involve rescaling to the range
[
29], normalizing by dividing by the maximum value of the signal [
30], and applying the Z-score [
31]. None of these methods, however, take into account the effective differences in the subjects’ heartbeat, which are not only related to amplitude, but also to frequency.
The aim of this work was to solve this inter-subject variability, proposing a novel personalized PPG normalization based on the heartbeat of the subject in a resting state condition.
To validate the normalization procedure presented here, PPG data belonging to two different datasets were considered: the Cognitive Load and Affective Walkability in Different Age Subjects(CLAWDAS) dataset, partially introduced in [
32,
33,
34], and the Cognitive Load, Affect and Stress recognition (CLAS) dataset, available in [
35].
The proposed normalization was analyzed considering a binary classification task to discriminate cognitive load in the relaxed state and compared with normalization strategies adopted in the literature.
The paper is organized as follows. In
Section 3, the two considered datasets are described, while the preliminary signal preprocessing strategies applied to each of them are reported in
Section 4. The novel personalized PPG normalization strategy, based on the subject’s resting state heartbeat, is presented in
Section 5. Extracted features, classification strategies, and the adopted cross-validation approach are reported in
Section 6. The comparison of the classification performances on the two datasets, obtained with different normalization strategies, is then reported and analyzed in
Section 7. Finally, the conclusions are drawn in the last section.
2. State-of-the-Art
In the last decade, the number of publications and citations related to heartbeat signals collected by photopletysmography has widely increased [
36]. In particular, the growing popularity of cheaper and non-invasive wearable devices has allowed extending the analysis of such signals from health applications to other areas of interest such as entertainment [
25,
37], driving monitoring [
38,
39], or emotional analysis [
40,
41]. In many of these applications, the pipeline used to pre-process the PPG signals is similar and includes noise and motion artifact removal, as well as signal de-trending [
42]. In [
43], the PPG signals were pre-processed using a two-step procedure: first, each signal was filtered using a three-order low-pass Bessel filter and a notch filter to remove basic noise and baseline shift, then a direct comparison between PPG and ECG signals was performed to detect and eliminate the motion artifacts. Similarly, in [
44], a bandpass filter was used to remove power line interference and motion artifacts and de-trend the signals, while a visual analysis was performed to remove the high peaks of noise. Other PPG denoising strategies, including methods based on empirical mode decomposition or wavelet decomposition, were summarized in [
45]. In several literature works, the denoising and de-trending steps were followed by an amplitude normalization step performed to take into account subjects’ heterogeneity. The amplitude of the PPG signals varies from person to person, and it is both influenced by subjective and environmental factors [
45,
46].
Many strategies of data rescaling have been proposed in the literature to normalize the signals of each person in a similar range of values [
47]. A min–max normalization was applied to the filtered signal of each participant to rescale the heart rate data in the range
in [
29,
48,
49]. Similarly, in [
5], the amplitude of each subject’s PPG signal was rescaled in the range
using a min–max normalization strategy followed by the multiplication by a constant factor
. Another normalization strategy usually applied in the literature is the Z-normalization or Z-score normalization. In this method, the amplitude of each subject’s signal is standardized using the formula
, where
and
are, respectively, the mean and variance of the analyzed PPG signal. This method was applied in [
50] as the last step of the pre-processing phase, while in [
51], it was applied as an intermediate operation between the denoising step and the motion artifact removal step. Finally, in [
19,
52], an amplitude normalization based on the subject’s baseline was considered using the formula
, where
is the PPG signal,
is the mean baseline value, and
is the maximum range of each subject.
While the amplitude normalization is usually considered in the analysis of PPG signals, the definition of a normalization strategy that considers the differences in the subjects’ heartbeat from a frequency point of view is still an open area of research. In [
5], the PPG signals of each subject were segmented into single-peak frames, and each frame was used as a separate signal in the analysis. This method allows solving the issue of subject’s heartbeat heterogeneity, but restricts the features involved in the analysis to only peak morphological characteristics. In the following sections, our novel personalized PPG normalization based on the heartbeat of the subject in a resting state condition is introduced and described. In particular, a cognitive load classification task is considered in the analysis. We selected this task because it is widely analyzed in the literature, with positive results in terms of accuracy, making the validation of our proposal easier.
3. Dataset Description
Several datasets exist in the literature that acquire multimodal physiological data in the field of emotion recognition, among them DEAP [
53], MAHNOB-HCI [
54], EMDB [
55], AMIGOS [
56], ASCERTAIN [
57], CASE [
58], CLAS [
35], and CLAWDAS [
32,
33]. Three main criteria have been applied for the selection of the two datasets, CLAS and CLAWDAS, considered here: the presence of cognitive load tasks, the use of wearable devices for physiological signals’ acquisition, and the presence of a baseline in the resting state condition. Both selected datasets fit these criteria, thus resulting in being suitable for the analysis proposed. In particular, CLAWDAS was selected as the proprietary dataset, while CLAS was chosen for the high number of participants involved, as well as for the use of the same wearable devices adopted in CLAWDAS. Below, the two datasets are described in detail.
In the CLAS dataset, the physiological signals of 60 healthy volunteers (mostly students between 20 and 27 years old, 17 women) were acquired while they were performing interactive or perceptive tasks. In particular, the interactive tasks were introduced to evaluate the level of concentration and the cognitive capacity of different individuals by solving Math Problems, Logic Problems, and Stroop Tests. In the perceptive tasks, different emotions were elicited in the participants by images and video selected from the DEAP dataset [
53]. During the whole experiment, three types of physiological signals were simultaneously recorded by means of Shimmer sensors [
34] Electrocardiography (ECG), Plethysmography (PPG), and Electrodermal Activity (EDA). The signals were acquired with a sampling rate of 256 Hz and a resolution of 16 bits per sample. In addiction, for each subject, 3D accelerometer data and metadata were also collected.
For the purpose of this work, only PPG signals collected during the interactive tasks were considered. For each participant, this phase of the experiment was characterized by the following steps:
1 min of Baseline (BL) in the resting state condition;
3 min of Math Problems in which the participant solves different simple Mathematical Problems (MPs) in a limited interval of time;
30 s of the Neutral State (NS), in which neutral audio-visual stimuli are displayed;
3 min of Stroop Tests (STs), where the user is expected to correctly match the color of the text with the meaning of the word, having a strict time constraint for each assignment;
30 s of the Neutral State (NS), in which neutral audio-visual stimuli are displayed;
5 min of Logic Problems (LPs) consisting of several simple logical problems often used during IQ tests;
30 s of the Neutral State (NS), in which neutral audio-visual stimuli are displayed.
Moreover, the 3 Neutral State repetitions of 30 s for each subject in the picture test of the perceptive session were also considered. For further details of this dataset, please refer to [
35].
CLAWDAS is a dataset collected in a controlled laboratory environment at the Research Center for Advanced Science and Technology (RCAST) at The University of Tokyo. The experiments performed at RCAST were focused on finding differences in physiological responses related to different ages of the subjects, involved in several tasks, from cognitive to listening and walking ones.
The experiments involved two different groups of subjects: a population of 16 Japanese young adults with average age = 24.7 years old (4 women) and a population of 20 Japanese elderly people with average age = 65.15 years old (10 women). During the whole experiment, the heartbeat of each participant was collected though Photopletysmography (PPG) using the Shimmer3 GSR+ Unit [
34] with a sampling frequency of 128 Hz. In addition to PPG, the Electrodermal Activity (EDA) of each subject was acquired using the same sensor. The Shimmer3 GSR+ Units are non-invasive and completely painless sensors that could be easily worn by the participants, as shown in
Figure 1.
CLAWDAS data acquired during cognitive and listening tasks were considered here. The experimental protocol was composed of the following steps:
3 min of questionnaires to collect the personal details and the current emotional state of each participant using STAI Questionnaires;
1 min of Baseline (BL) in the resting state condition;
6 min of Reading (R) and Comprehension (C) tasks composed by two repetitions (trials) of 2 min of R followed by 1 min of self-assessment and C questions;
1 min of Baseline (BL) in the resting state condition;
A 15 min sequence composed of six repetitions of the following two tasks:
2 min of Audio Listening (AL); in this task, relaxation was induced by natural sounds (
Figure 2, right);
30 s of cognitive load, induced by mental Math Calculations (MCs) that involve sums, subtractions, and multiplications (
Figure 2, left).
Each repetition has a different audio track and Math Calculation.
1 min of Baseline (BL) in the resting state condition
The experimental protocol was reviewed and approved by the Research Ethics Committee at The University of Tokyo, Japan (Nos. 19-283 and 19-376). The CLAWDAS dataset was partitioned into two distinct subsets, according to the age of the participants: CLAWDAS Young, which included all the signals acquired from young adults, and CLAWDAS Elderly, which grouped all the signals collected from the elderly. In the following analysis, the two groups were considered separately.
Table 1 reports the number of instances for each task in the CLAS and CLAWDAS datasets, keeping distinct the two subsets of CLAWDAS related to subjects’ age (Young and Elderly).
4. Signal Preprocessing: Denoising and Amplitude Normalization
In this section, the preprocessing operations applied to raw PPG data are detailed and differentiated for CLAS and CLAWDAS, respectively.
4.1. Denoising Strategies
The raw PPG signals are usually corrupted by noise and motion artifacts, which can undermine their interpretation and use [
59]. In the CLAS dataset, the signals had already been preprocessed by the authors during the acquisition phase [
35], and thus, no further denoising procedure was applied.
Concerning the CLAWDAS, the PPG raw signals of each subject were preprocessed by a multiresolution wavelet denoising strategy, as suggested by [
59,
60]. The signal was divided into frequency sub-bands using the Stationary Wavelet Transform (SWT) [
61] with the Fejer–Korovkin mother wavelet [
62] and four levels of decomposition. A soft thresholding was applied to the detail coefficients of each sub-band. The universal threshold calculated by the formula
was adopted, where
is the length of the
j-th wavelet coefficient and
k is the sub-band [
63]. The SWT was implemented with the à trous algorithm [
64]. A preliminary operation of replicate padding was applied to the signal in order to obtain a length divisible by
[
61], with
level.
4.2. Amplitude Normalization
In order to normalize the signals with respect to the amplitude, a Z-score operation, defined by the formula , was applied to the PPG recordings after the denoising procedure.
In CLAWDAS, the amplitude normalization, (AmpN), as well as the denoising preprocessing were applied to the signal of each subject, before splitting it into the different experimental trials.
A similar procedure was also applied to the CLAS dataset signals. However, in this case, the authors already split the data into single trials, according to their experimental protocol (see
Section 3), with no preliminary amplitude normalization. Thus, in order to apply a similar procedure to both datasets, the segmented trials of each subject were concatenated to re-build the original acquired signal. Then, the Z-score amplitude normalization was applied to each subject signal. Finally, the amplitude normalized signals were split back into the trials, related to single tasks, using the markers properly defined during the previous phase of concatenation.
5. Personalized PPG Normalization Based on Subject Resting State Heartbeat
The American Heart Association has underlined that the heart rate frequency of an adult in a resting state can vary in the range between 60 and 100 beats per minute. This inter-subject variability depends on many different factors, both personal and environmental. In the case of the CLAS and CLAWDAS datasets, the subjects’ heartbeat range of the baseline recordings belongs to the one reported by the literature, as depicted in
Figure 3, where the average heartbeat distribution for each dataset is plotted.
In order to remove this inter-subject variability, the core idea of our normalization procedure is to map PPG signals, defined in the Discrete Time Domain (DTD), into a new Subject Normalized discrete Domain (SND), applying a mapping procedure based on the resting state heartbeat frequency. In this SND, all the subjects have the same resting state heart frequency. For each subject, a subject-based resampling frequency can be calculated, so that for all the subjects, the heart frequency of the resting state in the SND is equal, despite the original subject-peculiar frequency in the Continuous Time Domain (CTD).
Then, the PPG data acquired during all the experimental tasks were also mapped into this new domain, applying the calculated subject-based resampling frequency and obtaining subject normalized PPG signals that can be considered for population-based analysis.
Defining
as the sampling frequency of the PPG signal, given by the acquisition device, and
as the heartbeat frequency in the resting state condition in the CTD, the corresponding normalized heartbeat frequency
in the DTD is:
We now define the subject normalized heartbeat frequency of the resting state in the SND as , where SNsample stands for Subject Normalized sample, which is the independent variable of the SND.
The subject-based resampling frequency that permits mapping the DTD PPG signal into the SND one is defined as
and can be calculated as follows:
As our goal was to obtain a domain where the inter-subject variability is discounted, the
for all subjects’ baseline should be the same. Once this constant value is chosen, the
resampling frequency for each subject can be calculated from Equation (
2). Then, all the PPG data of the same subject can be resampled accordingly and mapped into the SND, making the SN data reliable for population-based analysis. In
Table 2, the notation introduced is summarized for the sake of clarity. Taking this into account, Equations (
1) and (
2) can be rewritten as follows:
The resting state heartbeat in the SND () can be arbitrarily chosen, only paying attention to possible aliasing effects. In our calculations, we set , which corresponds to one beat on 128 SNsamples in the SND.
To analyze the effect of our normalization proposal, let us consider some numerical examples. In the case of a sampling frequency of
Hz, as in the case of the CLAWDAS dataset, we can observe from Equation (
3) that, in the case of a subject with a baseline heartbeat frequency of
, corresponding to
, the subject-based resampling frequency is
, meaning that there are no differences between the signal in the DTD and in the SND. Note that
is generally considered as the minimum value for normal people. For heartbeat frequencies higher than
, the mapping from the DTD to the SND implies an over-sampling, while for lower frequencies, the consequent under-sampling does not introduce aliasing, as 128 samples are guaranteed between two consecutive peaks.
In
Figure 4, PPG signals corresponding to the first baseline in the CLAWDAS Elderly dataset of Subjects 11 and 18, respectively, are considered. In the first row, the signals in the DTD are reported, showing the difference between the two subjects’ heartbeat frequencies: for Subject 11,
, corresponding to
, while for Subject 18,
, corresponding to about
. In the second row, the same two signals resampled in the SND are shown. Note that we assumed in defining our procedure that the heartbeat during a resting state is a stationary and periodic signal; however, this is not the case in real life, justifying not having
strictly equal to
for both subjects in the SND.
In the case of multiple baseline signals, the heartbeat frequency is evaluated as the average of the heartbeat frequency of all of them. This procedure was applied, for example, during the normalization of the CLAWDAS signals. In this case, in fact, three different baseline signals were acquired from each subject.
The pseudocode that allows applying the personalized normalization to the PPG signals of a generic subject “s” is reported in Algorithm 1. The first two rows initialize the counter variable “i” to 0 and set the SND resting state heartbeat frequency (FSNB) to the value arbitrarily chosen. In Rows [3–11], the resting state heartbeat of subject “s” is computed. Using this value, the subject-based resampling frequency “fsnc” is evaluated in Row 12. Finally, in Rows [13–15], all the signals collected from subject “s” are resampled using the new sampling frequency “fsnc”.
Algorithm 1: Pseudocode of the personalized PPG normalization applied to the signals of subject “s”. |
- 1:
- 2:
resting state heartbeat frequency in subject normalized domain - 3:
foreach baseline signal “bl” collected from subject “s”do - 4:
compute the heartbeat frequency of “bl” as beats/second - 5:
- 6:
end for - 7:
ifsubject “s” has more than one “bl” signalthen - 8:
mean of the value in fbtrial - 9:
else - 10:
the only element in fbtrail - 11:
end if - 12:
multiply fb for FSNB - 13:
foreach signal “t” acquired from subject “s”do - 14:
resample “t” using the new subject normalized sampling frequency fsnc - 15:
end for
|
6. Classification Setting
A binary classification task is herein proposed on CLAS, CLAWDAS Young, and CLAWDAS Elderly, to evaluate the performance of the personalized PPG normalization presented in this work. In particular, a cognitive load recognition task was considered in the analysis. We selected this task of classification because it is widely analyzed in the physiological signals’ literature with also positive performance [
21,
22]. Two classes were thus considered: the class corresponding to signals collected during high cognitive load tasks: High CL, and the class related to low cognitive load tasks: Low CL. The tasks used as representative of each class changed according to the dataset considered.
In particular, according to [
35], in the CLAS dataset, the PPG data collected during the three cognitive tasks (MP, ST, and LP) were labeled as High CL, while the NS data were labeled as Low CL. In order to make the two classes balanced, MP, ST, and LP signals were split into two non-overlapped segments of equal length, reaching a cardinality of 360 instances for both classes.
In CLAWDAS Young and CLAWDAS Elderly, the data collected during the MC task were selected for the High CL class, while the data collected during the AL task were chosen as representatives of the Low CL class. Thereby, the two classes were equally balanced, with 96 instances each for CLAWDAS Young and 120 instances each for CLAWDAS Elderly. In this study, the Math Calculation task was preferred over the Reading and Comprehension tasks as the High CL class because it is able to elicit high mental stress according to what has been defined in the literature [
17].
In all the performed analysis, seven handcrafted features were extracted as characteristics useful to describe the PPG signals:
Minima, Maxima, Mean, and Standard Deviation of the signal;
Peak Rate, which represents the mean number of peaks;
Inter-Beat Interval (IBI), which represents the mean distance between two peaks in a row;
Root Mean Square of Successive Distance (RMSSD), which represents the variance of the distance between two consecutive peaks [
65].
The last three features were evaluated in the discrete domain and reported with respect to samples. For the sake of clarity, it is recalled that the meaning of samples changes according to the type of normalization strategy adopted. In particular, samples refer to subject normalized samples when the features are evaluated on signals with personalized normalization based on resting state heartbeat, while this term refers to discrete time samples in all the other cases. All the features so evaluated were also standardized by applying the z-score before being used as the input to the different classifiers.
Using the the seven features introduced above, three binary classification experiments were performed for each of the two datasets considered, comparing the following three normalization strategies:
AmpN: Amplitude Normalization, as described in
Section 4.2;
SubjFeatN: amplitude normalization followed by a Subject Feature Normalization. This feature normalization was performed with respect to the subject baseline on the Peak Rate, IBI, and RMSSD features as follows:
where
;
represents the feature value before the normalization;
is the new normalized value; and
is the mean value of the
feature evaluated on the subject resting state;
PersFreqN: amplitude normalization followed by the Personalized Normalization based on the resting state heartbeat, described in
Section 5.
The pre-processing and feature extraction operations applied to each signal and the three considered normalization strategies are summarized in
Figure 5.
For each analysis, four different classification models were tested: a Classification and Regression Tree (CART) with Gini’s diversity index as the criterion of splitting and 100 as the max number of decision splits, and three Support Vector Machines (SVMs) with different kernels: Linear (SVM Linear), Gaussian (SVM Gaussian), and polynomial Cubic (SVM Cubic). In particular, for the Gaussian kernel SVM, the kernel scale was set to 3.3 in order to consider a medium Gaussian SVM.
A Leave One Subject Out (LOSO) cross-validation [
66] was applied to evaluate the performance of the trained classifiers. At each iteration, the data used to train the classifier consisted of the signals collected from all the subjects, except one, whose instances were instead used to test the performance of the model. An overall confusion matrix was finally generated, joining the single confusion matrices resulting from each iteration. From this confusion matrix, several well-known evaluation metrics were extracted. In particular, we selected accuracy to evaluate the general performance of the classifier and the single class F1-score [
67] to assess, instead, the goodness of the classification model in recognizing the single classes. The classification settings described above are summarized in
Table 3.
7. Results and Discussion
The three different types of normalization, AmpN, SubjFeatN, and PersFreqN, are compared in this section, considering the high class and low class cognitive load binary classification task for the two chosen datasets. The performance corresponding to each of the classification settings described in
Table 3 are reported in
Table 4,
Table 5 and
Table 6, for CLAS, CLAWDAS Young, and CLAWDAS Elderly, respectively. In particular, the results of the different normalization strategies are reported in terms of accuracy and single-class F1-score generated using the LOSO cross-validation approach.
Note that, despite the considered classifier, SVM Linear, SVM Cubic, SVM Gauss, and CART, in all the experiments carried out, the normalization strategy herein proposed, PersFreqN, outperformed the other two normalization procedures, for all the datasets. This observation is further supported by a visual comparison of the performance of the classifiers reported in terms of accuracy in the bar plot of
Figure 6, varying the normalization strategy and the involved dataset. In particular, PersFreqN achieved the best accuracy for all the dataset and type of classifier, with the sole exception of the CART classifier for the CLAWDAS Young dataset, where the best performance was achieved by the SubjFeatN strategy. On the other hand, the AmpN normalization was in general the worst approach.
Comparing the datasets, the best performance was observed on the CLAS dataset. In this case, the proposed PersFreqN allowed reaching an accuracy of 81% adopting the SVM classifier with the polynomial cubic kernel. This result significantly outperformed the accuracy of 74% reached using AmpN and the 73% obtained using SubjFeatN.
For the CLAWDAS datasets, the highest accuracies achieved were 79% for CLAWDAS Young and 80% for CLAWDAS Elderly, both obtained with the proposed PersFreqN and SVM with a linear kernel. These values show a significant improvement with respect to the other normalization strategies AmpN and SubjFeatN, which were always lower than 66%.
Another consideration regards the classifier that allowed reaching the best results. In general, from the three
Table 4,
Table 5 and
Table 6, it emerged that the highest accuracy values were usually achieved by the SVM classifiers with the linear or cubic kernel, whereas the lowest ones were generally obtained by the CART classifier.
Finally, a last consideration should be made on the SubjFeatN normalization strategy. In general, this normalization produced a performance higher than that of AmpN, even if it appeared less effective compared to the proposed PersFreqN strategy. These results confirmed that a normalization strategy that takes into account not only amplitude normalization, but also subject’s characteristics should be adopted to remove inter-subject variability.
As a final remark, all the adopted classification settings were able to classify with comparable performance both classes, as indicated by the values of the single-class F1-score in all the tables. However, the introduction of the proposed PersFreqN seemed to produce even more balanced classification results.
8. Conclusions
While considering physiological data, signal normalization not only with respect to amplitude, but also with respect to personal characteristics is mandatory to perform subject-based analysis, especially if machine learning techniques should be applied. Personalized normalization on PPG data, both at the feature level, SubjFeatN, and with respect to heartbeat frequency, PersFreqN, introduced an increase in the classification performance, considering different classification models and datasets. In particular, the personalized PPG normalization based on subject heartbeat herein proposed, PersFreqN, outperformed the other strategies and permitted significantly reducing inter-subject heterogeneity. Moreover, the proposed normalization could also be useful for intra-subject analysis, especially when comparing the physiological responses of the same subject, on different days or even at different moments of the day: it is well known, in fact, that the physiological responses not only depend on external stimuli, but also on physical and internal conditions, which can significantly vary for the same subject with respect to time.
Author Contributions
F.G.: conceptualization, methodology, formal analysis, investigation, data curation, writing—original draft preparation; A.G.: software, validation, formal analysis, investigation, data curation, writing—original draft preparation, visualization; M.G.: investigation, writing—review and editing; S.B.: project leader, supervision. All authors have read and agreed to the published version of the manuscript.
Funding
This research was partially supported by the FONDAZIONE CARIPLO “LONGEVICITY-Social Inclusion for the Elderly through Walkability” (Ref. 2017-0938) and by the Japan Society for the Promotion of Science (Ref. L19513).
Institutional Review Board Statement
The experimental protocol for CLAWDAS dataset was reviewed and approved by the Research Ethics Committee at The University of Tokyo, Japan (Nos. 19-283 and 19-376).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
The data presented in this study are available on request from the corresponding author.
Acknowledgments
We want to give our thanks to Katsuhiro Nishinari and his staff, in particular Kenichiro Shimura and Daichi Yanagisawa, for their indispensable support during the experiment held at RCAST—The University of Tokyo.
Conflicts of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
PPG | Photopletysmography |
CLAS dataset | Cognitive Load, Affect and Stress Recognition dataset |
CLAWDAS dataset | Cognitive Load and Affective Walkability in Different Age Subjects dataset |
BL | Baseline |
MP | Math Problem |
ST | Stroop Test |
LP | Logic Problem |
NS | Neutral State |
MC | Math Calculation |
R | Reading |
C | Comprehension |
SWT | Stationary Wavelet Transform |
CTD | Continuous Time Domain |
DTD | Discrete Time Domain |
SND | Subject Normalized discrete Domain |
High CL | High Cognitive Load |
Low CL | Low Cognitive Load |
IBI | Inter-Beat Interval |
RMSSD | Root Mean Square of Successive Distance |
SVM | Support Vector Machine |
SVM Linear | Support Vector Machine with Linear kernel |
SVM Cubic | Support Vector Machine with polynomial Cubic kernel |
SVM Gauss | Support Vector Machine with Gaussian kernel |
CART | Classification and Regression Tree |
LOSO cross-validation | Leave One Subject Out Cross-Validation |
AmpN | Amplitude Normalization |
SubjFeatN | Amplitude normalization followed by a Subject Feature Normalization |
PersFreqN | Personalized Normalization based on baseline heartbeat Frequency |
References
- Can, Y.S.; Arnrich, B.; Ersoy, C. Stress detection in daily life scenarios using smart phones and wearable sensors: A survey. J. Biomed. Inform. 2019, 92, 103139. [Google Scholar] [CrossRef] [PubMed]
- Han, H.J.; Labbaf, S.; Borelli, J.L.; Dutt, N.; Rahmani, A.M. Objective stress monitoring based on wearable sensors in everyday settings. J. Med. Eng. Technol. 2020, 44, 177–189. [Google Scholar] [CrossRef] [PubMed]
- Shu, L.; Xie, J.; Yang, M.; Li, Z.; Li, Z.; Liao, D.; Xu, X.; Yang, X. A review of emotion recognition using physiological signals. Sensors 2018, 18, 2074. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Alskafi, F.A.; Khandoker, A.H.; Jelinek, H.F. A Comparative Study of Arousal and Valence Dimensional Variations for Emotion Recognition Using Peripheral Physiological Signals Acquired from Wearable Sensors. In Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Guadalajara, Mexico, 1–5 November 2021; pp. 1104–1107. [Google Scholar]
- Lee, M.S.; Lee, Y.K.; Pae, D.S.; Lim, M.T.; Kim, D.W.; Kang, T.K. Fast emotion recognition based on single pulse PPG signal with convolutional neural network. Appl. Sci. 2019, 9, 3355. [Google Scholar] [CrossRef] [Green Version]
- Choi, J.; Ahmed, B.; Gutierrez-Osuna, R. Development and evaluation of an ambulatory stress monitor based on wearable sensors. IEEE Trans. Inf. Technol. Biomed. 2011, 16, 279–286. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Umematsu, T.; Sano, A.; Taylor, S.; Picard, R.W. Improving students’ daily life stress forecasting using lstm neural networks. In Proceedings of the 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), Chicago, IL, USA, 19–22 May 2019; pp. 1–4. [Google Scholar]
- Durán-Acevedo, C.M.; Carrillo-Gómez, J.K.; Albarracín-Rojas, C.A. Electronic Devices for Stress Detection in Academic Contexts during Confinement Because of the COVID-19 Pandemic. Electronics 2021, 10, 301. [Google Scholar] [CrossRef]
- Setz, C.; Arnrich, B.; Schumm, J.; La Marca, R.; Tröster, G.; Ehlert, U. Discriminating stress from cognitive load using a wearable EDA device. IEEE Trans. Inf. Technol. Biomed. 2009, 14, 410–417. [Google Scholar] [CrossRef]
- Saeed, A.; Trajanovski, S.; Van Keulen, M.; Van Erp, J. Deep physiological arousal detection in a driving simulator using wearable sensors. In Proceedings of the 2017 IEEE International Conference on Data Mining Workshops (ICDMW), New Orleans, LA, USA, 18–21 November 2017; pp. 486–493. [Google Scholar]
- Bandini, S.; Gasparini, F.; Giltri, M. Personalized music experience for the wellbeing of elderly people. In International Conference on Internet Science; Springer: Cham, Switzerland, 2019; pp. 335–340. [Google Scholar]
- Vachiratamporn, V.; Moriyama, K.; Fukui, K.i.; Numao, M. An implementation of affective adaptation in survival horror games. In Proceedings of the 2014 IEEE Conference on Computational Intelligence and Games, Dortmund, Germany, 26–29 October 2014; pp. 1–8. [Google Scholar]
- Fonseca, D.; da Silva, K.C.N.; Rosa, R.L.; Rodríguez, D.Z. Monitoring and Classification of Emotions in Elderly People. In Proceedings of the 2019 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), Split, Croatia, 19–21 September 2019; pp. 1–6. [Google Scholar]
- Bartolomé-Tomás, A.; Sánchez-Reolid, R.; Latorre, A.F.S.J.M.; Fernández-Caballero, A. Arousal Detection in Elderly People from Electrodermal Activity Using Musical Stimuli. Sensors 2020, 20, 4788. [Google Scholar] [CrossRef]
- Yates, H.; Chamberlain, B.; Norman, G.; Hsu, W.H. Arousal detection for biometric data in built environments using machine learning. In Proceedings of the IJCAI 2017 Workshop on Artificial Intelligence in Affective Computing, Melbourne, Australia, 19–25 August 2017; pp. 58–72. [Google Scholar]
- Ghosh, A.; Danieli, M.; Riccardi, G. Annotation and prediction of stress and workload from physiological and inertial signals. In Proceedings of the 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Milan, Italy, 25–29 August 2015; pp. 1621–1624. [Google Scholar]
- Karthikeyan, P.; Murugappan, M.; Yaacob, S. A review on stress inducement stimuli for assessing human stress using physiological signals. In Proceedings of the 2011 IEEE 7th International Colloquium on Signal Processing and its Applications, Penang, Malaysia, 4–6 March 2011; pp. 420–425. [Google Scholar]
- Nourbakhsh, N.; Wang, Y.; Chen, F.; Calvo, R.A. Using galvanic skin response for cognitive load measurement in arithmetic and reading tasks. In Proceedings of the 24th Australian Computer-Human Interaction Conference, Melbourne, Victoria, Australia, 26–30 November 2012; pp. 420–423. [Google Scholar]
- Gokay, R.; Masazade, E.; Aydin, C.; Erol-Barkana, D. Emotional state and cognitive load analysis using features from BVP and SC sensors. In Proceedings of the 2015 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI), San Diego, CA, USA, 14–16 September 2015; pp. 178–183. [Google Scholar]
- Xuan, Q.; Wu, J.; Shen, J.; Ji, X.; Lyu, Y.; Zhang, Y. Assessing cognitive load in adolescent and adult students using photoplethysmogram morphometrics. Cogn. Neurodyn. 2020, 14, 709–721. [Google Scholar] [CrossRef]
- Chandra, V.; Priyarup, A.; Sethia, D. Comparative Study of Physiological Signals from Empatica E4 Wristband for Stress Classification. In International Conference on Advances in Computing and Data Sciences; Springer: Cham, Switzerland, 2021; pp. 218–229. [Google Scholar]
- Beh, W.K.; Wu, Y.H.; Wu, A.Y.A. Robust PPG-based Mental Workload Assessment System using Wearable Devices. IEEE J. Biomed. Health Inform. 2021. [Google Scholar] [CrossRef]
- Li, F.; Xu, P.; Zheng, S.; Chen, W.; Yan, Y.; Lu, S.; Liu, Z. Photoplethysmography based psychological stress detection with pulse rate variability feature differences and elastic net. Int. J. Distrib. Sens. Netw. 2018, 14, 1550147718803298. [Google Scholar] [CrossRef]
- Castaneda, D.; Esparza, A.; Ghamari, M.; Soltanpur, C.; Nazeran, H. A review on wearable photoplethysmography sensors and their potential future applications in health care. Int. J. Biosens. Bioelectron. 2018, 4, 195. [Google Scholar] [PubMed] [Green Version]
- Ayata, D.; Yaslan, Y.; Kamasak, M.E. Emotion based music recommendation system using wearable physiological sensors. IEEE Trans. Consum. Electron. 2018, 64, 196–203. [Google Scholar] [CrossRef]
- Kalra, P.; Sharma, V. Mental stress assessment using PPG signal a deep neural network approach. IETE J. Res. 2020, 1–7. [Google Scholar] [CrossRef]
- Avram, R.; Tison, G.H.; Aschbacher, K.; Kuhar, P.; Vittinghoff, E.; Butzner, M.; Runge, R.; Wu, N.; Pletcher, M.J.; Marcus, G.M.; et al. Real-world heart rate norms in the Health eHeart study. NPJ Digit. Med. 2019, 2, 58. [Google Scholar] [CrossRef]
- Goh, C.H.; Tan, L.K.; Lovell, N.H.; Ng, S.C.; Tan, M.P.; Lim, E. Robust PPG motion artifact detection using a 1-D convolution neural network. Comput. Methods Programs Biomed. 2020, 196, 105596. [Google Scholar] [CrossRef]
- Lee, S.W.; Woo, D.K.; Son, Y.K.; Mah, P.S. Wearable Bio-Signal (PPG)-Based Personal Authentication Method Using Random Forest and Period Setting Considering the Feature of PPG Signals. JCP 2019, 14, 283–294. [Google Scholar] [CrossRef]
- Nemati, S.; Ghassemi, M.M.; Ambai, V.; Isakadze, N.; Levantsevych, O.; Shah, A.; Clifford, G.D. Monitoring and detecting atrial fibrillation using wearable technology. In Proceedings of the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Orlando, FL, USA, 16–20 August 2016; pp. 3394–3397. [Google Scholar]
- Costadopoulos, N.; Islam, M.Z.; Tien, D. Using Z-score to Extract Human Readable Logic Rules from Physiological Data. In Proceedings of the 2019 11th International Conference on Knowledge and Systems Engineering (KSE), Da Nang City, Vietnam, 24–26 October 2019; pp. 1–6. [Google Scholar]
- Gasparini, F.; Grossi, A.; Nishinari, K.; Bandini, S. Age-Related Walkability Assessment: A Preliminary Study Based on the EMG. In AIxIA 2020—Advances in Artificial Intelligence; Baldoni, M., Bandini, S., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 423–438. [Google Scholar]
- Gasparini, F.; Grossi, A.; Bandini, S. A Deep Learning Approach to Recognize Cognitive Load Using PPG Signals. In Proceedings of the 14th PErvasive Technologies Related to Assistive Environments Conference, Corfu, Greece, 29 June–2 July 2021; Association for Computing Machinery: New York, NY, USA, 2021; pp. 489–495. [Google Scholar] [CrossRef]
- Shimmer Discovery in Motion. 2008. Available online: www.shimmersensing.com (accessed on 4 April 2022).
- Markova, V.; Ganchev, T.; Kalinkov, K. Clas: A database for cognitive load, affect and stress recognition. In Proceedings of the 2019 International Conference on Biomedical Innovations and Applications (BIA), Varna, Bulgaria, 8–9 November 2019; pp. 1–4. [Google Scholar]
- Almarshad, M.A.; Islam, M.S.; Al-Ahmadi, S.; BaHammam, A.S. Diagnostic Features and Potential Applications of PPG Signal in Healthcare: A Systematic Review. Healthcare 2022, 10, 547. [Google Scholar] [CrossRef]
- Liu, R.; Hu, X. A Multimodal Music Recommendation System with Listeners’ Personality and Physiological Signals. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, Virtual Event, China, 1–5 August 2020; pp. 357–360. [Google Scholar]
- Koh, D.W.; Lee, S.G. An evaluation method of safe driving for senior adults using ECG signals. Sensors 2019, 19, 2828. [Google Scholar] [CrossRef] [Green Version]
- Meteier, Q.; Capallera, M.; Ruffieux, S.; Angelini, L.; Abou Khaled, O.; Mugellini, E.; Widmer, M.; Sonderegger, A. Classification of drivers’ workload using physiological signals in conditional automation. Front. Psychol. 2021, 12, 268. [Google Scholar] [CrossRef]
- Zhang, T.; El Ali, A.; Wang, C.; Hanjalic, A.; Cesar, P. Corrnet: Fine-grained emotion recognition for video watching using wearable physiological sensors. Sensors 2020, 21, 52. [Google Scholar] [CrossRef] [PubMed]
- Dzedzickis, A.; Kaklauskas, A.; Bucinskas, V. Human emotion recognition: Review of sensors and methods. Sensors 2020, 20, 592. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Labati, R.D.; Piuri, V.; Rundo, F.; Scotti, F. Photoplethysmographic Biometrics: A Comprehensive Survey. Pattern Recognit. Lett. 2022, 156, 119–125. [Google Scholar] [CrossRef]
- Wang, M.; Huang, C.; Chen, H.; Ye, S. Preprocessing PPG and ECG signals to estimate blood pressure based on noninvasive wearable device. In Proceedings of the 3rd International Conference on Engineering Technology and Application (ICETA 2016), Kyoto, Japan, 28–29 April 2016; pp. 1103–1106. [Google Scholar] [CrossRef] [Green Version]
- Ahmed, S.; Bhuiyan, T.A.; Nii, M. PPG Signal Morphology-Based Method for Distinguishing Stress and Non-Stress Conditions. J. Adv. Comput. Intell. Intell. Inform. 2022, 26, 58–66. [Google Scholar] [CrossRef]
- Park, J.; Seok, H.S.; Kim, S.S.; Shin, H. Photoplethysmogram Analysis and Applications: An Integrative Review. Front. Physiol. 2022, 2511. [Google Scholar] [CrossRef]
- Yousef, Q.; Reaz, M.; Ali, M.A.M. The analysis of PPG morphology: Investigating the effects of aging on arterial compliance. Meas. Sci. Rev. 2012, 12, 266. [Google Scholar] [CrossRef] [Green Version]
- Golgouneh, A.; Tarvirdizadeh, B. Fabrication of a portable device for stress monitoring using wearable sensors and soft computing algorithms. Neural Comput. Appl. 2020, 32, 7515–7537. [Google Scholar] [CrossRef]
- Di Lascio, E.; Gashi, S.; Santini, S. Laughter recognition using non-invasive wearable devices. In Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare, Trento, Italy, 20–23 May 2019; pp. 262–271. [Google Scholar]
- Pinge, A.; Bandyopadhyay, S.; Ghosh, S.; Sen, S. A Comparative Study between ECG-based and PPG-based Heart Rate Monitors for Stress Detection. In Proceedings of the 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS), Bangalore, India, 4–8 January 2022; pp. 84–89. [Google Scholar]
- Zhou, J.; Arshad, S.Z.; Luo, S.; Yu, K.; Berkovsky, S.; Chen, F. Indexing cognitive load using blood volume pulse features. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems, Denver, CO, USA, 6–11 May 2017; pp. 2269–2275. [Google Scholar]
- Temko, A. Accurate heart rate monitoring during physical exercises using PPG. IEEE Trans. Biomed. Eng. 2017, 64, 2016–2024. [Google Scholar] [CrossRef] [Green Version]
- Swangnetr, M.; Kaber, D.B. Emotional state classification in patient–robot interaction using wavelet analysis and statistics-based feature selection. IEEE Trans.-Hum.-Mach. Syst. 2012, 43, 63–75. [Google Scholar] [CrossRef]
- Koelstra, S.; Muhl, C.; Soleymani, M.; Lee, J.S.; Yazdani, A.; Ebrahimi, T.; Pun, T.; Nijholt, A.; Patras, I. Deap: A database for emotion analysis; using physiological signals. IEEE Trans. Affect. Comput. 2011, 3, 18–31. [Google Scholar] [CrossRef] [Green Version]
- Soleymani, M.; Lichtenauer, J.; Pun, T.; Pantic, M. A multimodal database for affect recognition and implicit tagging. IEEE Trans. Affect. Comput. 2011, 3, 42–55. [Google Scholar] [CrossRef] [Green Version]
- Carvalho, S.; Leite, J.; Galdo-Álvarez, S.; Gonçalves, O.F. The emotional movie database (EMDB): A self-report and psychophysiological study. Appl. Psychophysiol. Biofeedback 2012, 37, 279–294. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Miranda-Correa, J.A.; Abadi, M.K.; Sebe, N.; Patras, I. Amigos: A dataset for affect, personality and mood research on individuals and groups. IEEE Trans. Affect. Comput. 2018, 12, 479–493. [Google Scholar] [CrossRef] [Green Version]
- Subramanian, R.; Wache, J.; Abadi, M.K.; Vieriu, R.L.; Winkler, S.; Sebe, N. ASCERTAIN: Emotion and personality recognition using commercial sensors. IEEE Trans. Affect. Comput. 2016, 9, 147–160. [Google Scholar] [CrossRef]
- Sharma, K.; Castellini, C.; Stulp, F.; Van den Broek, E.L. Continuous, real-time emotion annotation: A novel joystick-based analysis framework. IEEE Trans. Affect. Comput. 2017, 11, 78–84. [Google Scholar] [CrossRef] [Green Version]
- Biswas, A.; Roy, M.S.; Gupta, R. Motion artifact reduction from finger photoplethysmogram using discrete wavelet transform. In Recent Trends in Signal and Image Processing; Springer: Singapore, 2019; pp. 89–98. [Google Scholar]
- Raghuram, M.; Madhav, K.V.; Krishna, E.H.; Reddy, K.A. On the performance of wavelets in reducing motion artifacts from photoplethysmographic signals. In Proceedings of the 2010 4th International Conference on Bioinformatics and Biomedical Engineering, Chengdu, China, 18–20 June 2010; pp. 1–4. [Google Scholar]
- Nason, G.P.; Silverman, B.W. The stationary wavelet transform and some statistical applications. In Wavelets and Statistics; Springer: New York, NY, USA, 1995; pp. 281–299. [Google Scholar]
- Li, W. Wavelets for electrocardiogram: Overview and taxonomy. IEEE Access 2018, 7, 25627–25649. [Google Scholar] [CrossRef]
- Donoho, D.L.; Johnstone, J.M. Ideal spatial adaptation by wavelet shrinkage. Biometrika 1994, 81, 425–455. [Google Scholar] [CrossRef]
- Holschneider, M.; Kronland-Martinet, R.; Morlet, J.; Tchamitchian, P. A real-time algorithm for signal analysis with the help of the wavelet transform. In Wavelets; Springer: Berlin/Heidelberg, Germnay, 1990; pp. 286–297. [Google Scholar]
- Stein, P.K.; Bosner, M.S.; Kleiger, R.E.; Conger, B.M. Heart rate variability: A measure of cardiac autonomic tone. Am. Heart J. 1994, 127, 1376–1381. [Google Scholar] [CrossRef]
- Schmidt, P.; Reiss, A.; Dürichen, R.; Laerhoven, K.V. Wearable-based affect recognition—A review. Sensors 2019, 19, 4079. [Google Scholar] [CrossRef] [Green Version]
- Bishop, C.M. Pattern Recognition and Machine Learning; Springer: New York, NY, USA, 2006. [Google Scholar]
Figure 1.
Sensors used to collect physiological data in the CLAWDAS dataset.
Figure 1.
Sensors used to collect physiological data in the CLAWDAS dataset.
Figure 2.
Example of signal acquisition in the CLAWDAS dataset: (left) math calculation; (right) relaxing audio listening.
Figure 2.
Example of signal acquisition in the CLAWDAS dataset: (left) math calculation; (right) relaxing audio listening.
Figure 3.
Distributions of the average heartbeat of the resting state PPG signals in CLAS (left), CLAWDAS Young (middle), and CLAWDAS Elderly (right) datasets.
Figure 3.
Distributions of the average heartbeat of the resting state PPG signals in CLAS (left), CLAWDAS Young (middle), and CLAWDAS Elderly (right) datasets.
Figure 4.
PPG data in resting state conditions for Subjects 11 (left) and 18 (right) are herein reported, in the original discrete time domain (DTD) and in the Subject Normalized Domain (SND) (top and bottom rows, respectively). Before the proposed normalization (top row), the subjects have different heartbeat frequencies, while, in the SND, they are more similar.
Figure 4.
PPG data in resting state conditions for Subjects 11 (left) and 18 (right) are herein reported, in the original discrete time domain (DTD) and in the Subject Normalized Domain (SND) (top and bottom rows, respectively). Before the proposed normalization (top row), the subjects have different heartbeat frequencies, while, in the SND, they are more similar.
Figure 5.
Pre-processing and feature extraction pipeline applied to the signals of the two datasets. The operations performed change according to the experiment considered: Amplitude Normalization (AmpN), amplitude normalization followed by a Subject Feature Normalization (SubjFeatN), and amplitude normalization followed by the Personalized Normalization based on resting state heartbeat (PersFreqN).
Figure 5.
Pre-processing and feature extraction pipeline applied to the signals of the two datasets. The operations performed change according to the experiment considered: Amplitude Normalization (AmpN), amplitude normalization followed by a Subject Feature Normalization (SubjFeatN), and amplitude normalization followed by the Personalized Normalization based on resting state heartbeat (PersFreqN).
Figure 6.
Bar plot comparison of the accuracy obtained using different classifiers and normalization procedures. Three datasets are considered: CLAS, CLAWDAS Young, and CLAWDAS Elderly.
Figure 6.
Bar plot comparison of the accuracy obtained using different classifiers and normalization procedures. Three datasets are considered: CLAS, CLAWDAS Young, and CLAWDAS Elderly.
Table 1.
Number of instances for each task in the CLAS dataset (first 5 columns) and in the CLAWDAS dataset (last 5 columns), distinguished into CLAWDAS Young and CLAWDAS Elderly. BL = Baseline, MP = Math Problem, ST = Stroop Test, LP = Logic Problem, NS = Neutral State, MC = Math Calculation, R = Reading, C = Comprehension.
Table 1.
Number of instances for each task in the CLAS dataset (first 5 columns) and in the CLAWDAS dataset (last 5 columns), distinguished into CLAWDAS Young and CLAWDAS Elderly. BL = Baseline, MP = Math Problem, ST = Stroop Test, LP = Logic Problem, NS = Neutral State, MC = Math Calculation, R = Reading, C = Comprehension.
| Num Subj. | BL | MP | ST | LP | NS | BL | MC | AL | R | C |
---|
CLAS | 60 | 60 | 60 | 60 | 60 | 360 | - | - | - | - | - |
CLAWDAS Young | 16 | - | - | - | - | - | 46 | 96 | 96 | 32 | 32 |
CLAWDAS Elderly | 20 | - | - | - | - | - | 60 | 120 | 120 | 40 | 40 |
Table 2.
Correspondences between the three domains.
Table 2.
Correspondences between the three domains.
Domain | Continuous Time | Discrete Time | Subject Normalized |
---|
Acronym | CTD | DTD | SND |
Heartbeat | [] | [] | [] |
Resting state Heartbeat | [] | [] | [] |
Sampling frequency | – | [] | [] |
Table 3.
Summary of the classification settings.
Table 3.
Summary of the classification settings.
Types of Normalization Considered | AmpN, SubjFeatN, PersFreqN |
---|
Features Used | Maximum, Minimum, Mean, Standard Deviation, Peak Rate, IBI, RMSSD |
Classifier Involved | SVM Linear, SVM Cubic, SVM Gauss, and CART |
Performance Evaluation Method | LOSO | | | |
Evaluation Metrics | Accuracy, single-class F1-score |
Dataset | CL High Class | CL Low Class |
Task | Num. of signals | Task | Num. of signals |
CLAS | Math, Stroop, and Logic Test | 360 | Neutral State | 360 |
CLAWDAS Young | Math Calculation | 96 | Audio Listening | 96 |
CLAWDAS Elderly | Math Calculation | 120 | Audio Listening | 120 |
Table 4.
Performance comparison on the CLAS dataset, varying the normalization strategies (columns) and classification models (rows). Two performance metrics are evaluated using a LOSO validation strategy: Accuracy (Acc) and single-class F1-score. The best performances reached for each type of normalization are underlined, while the highest accuracy value of all is highlighted in bold.
Table 4.
Performance comparison on the CLAS dataset, varying the normalization strategies (columns) and classification models (rows). Two performance metrics are evaluated using a LOSO validation strategy: Accuracy (Acc) and single-class F1-score. The best performances reached for each type of normalization are underlined, while the highest accuracy value of all is highlighted in bold.
| AmpN | SubjFeatN | PersFreqN |
---|
Classifier | | High CL | Low CL | | High CL | Low CL | | High CL | Low CL |
| Acc | F1-Score | F1-Score | Acc | F1-Score | F1-Score | Acc | F1-Score | F1-Score |
SVM Linear | 66% | 0.65 | 0.67 | 67% | 0.68 | 0.67 | 76% | 0.77 | 0.76 |
SVM Cubic | 74% | 0.74 | 0.73 | 73% | 0.74 | 0.72 | 81% | 0.81 | 0.81 |
SVM Gauss | 72% | 0.72 | 0.71 | 73% | 0.72 | 0.73 | 78% | 0.78 | 0.77 |
CART | 72% | 0.71 | 0.73 | 66% | 0.65 | 0.66 | 73% | 0.72 | 0.73 |
Table 5.
Performance comparison on the CLAWDAS Young dataset, varying the normalization strategies (columns) and classification models (rows). Two performance metrics are evaluated using a LOSO validation strategy: Accuracy (Acc) and single-class F1-score. The best performances reached for each type of normalization are underlined, while the the highest accuracy value of all is highlighted in bold.
Table 5.
Performance comparison on the CLAWDAS Young dataset, varying the normalization strategies (columns) and classification models (rows). Two performance metrics are evaluated using a LOSO validation strategy: Accuracy (Acc) and single-class F1-score. The best performances reached for each type of normalization are underlined, while the the highest accuracy value of all is highlighted in bold.
| AmpN | SubjFeatN | PersFreqN |
---|
Classifier | | High CL | Low CL | | High CL | Low CL | | High CL | Low CL |
| Acc | F1-Score | F1-Score | Acc | F1-Score | F1-Score | Acc | F1-Score | F1-Score |
SVM Linear | 66% | 0.62 | 0.69 | 68% | 0.63 | 0.72 | 79% | 0.77 | 0.80 |
SVM Cubic | 66% | 0.66 | 0.66 | 68% | 0.67 | 0.68 | 72% | 0.72 | 0.72 |
SVM Gauss | 63% | 0.59 | 0.66 | 66% | 0.61 | 0.70 | 76% | 0.75 | 0.76 |
CART | 56% | 0.59 | 0.51 | 68% | 0.67 | 0.69 | 64% | 0.63 | 0.64 |
Table 6.
Performance comparison on CLAWDAS Elderly, varying the normalization strategies (columns) and classification models (rows). Two performance metrics are evaluated using a LOSO validation strategy: Accuracy (Acc) and single-class F1-score. The best performances reached for each type of normalization are underlined, while the the highest accuracy value of all is highlighted in bold.
Table 6.
Performance comparison on CLAWDAS Elderly, varying the normalization strategies (columns) and classification models (rows). Two performance metrics are evaluated using a LOSO validation strategy: Accuracy (Acc) and single-class F1-score. The best performances reached for each type of normalization are underlined, while the the highest accuracy value of all is highlighted in bold.
| AmpN | SubjFeatN | PersFreqN |
---|
Classifier | | High CL | Low CL | | High CL | Low CL | | High CL | Low CL |
| Acc | F1-Score | F1-Score | Acc | F1-Score | F1-Score | Acc | F1-Score | F1-Score |
SVM Linear | 59% | 0.58 | 0.60 | 69% | 0.65 | 0.72 | 80% | 0.80 | 0.81 |
SVM Cubic | 59% | 0.55 | 0.62 | 64% | 0.61 | 0.66 | 72% | 0.71 | 0.73 |
SVM Gauss | 63% | 0.61 | 0.64 | 75% | 0.74 | 0.76 | 78% | 0.78 | 0.78 |
CART | 54% | 0.51 | 0.57 | 68% | 0.68 | 0.67 | 75% | 0.75 | 0.74 |
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).