1. Introduction
Diabetes mellitus, colloquially known as diabetes, has been estimated to affect 450 million people in the global population [
1]. This condition is characterized by abnormal levels of blood sugar. The National Diabetes Data Group (NDDG) identifies three main types of diabetes: type 1, type 2, and gestational diabetes [
2]. Type 1 diabetes, formerly known as juvenile diabetes, is an autoimmune disorder due to the pancreas’s inability to produce enough, or any, insulin. An individual who has this condition must undergo daily insulin therapy via insulin injections or an insulin pump. If insulin levels are too low, the result will be that the blood glucose levels (BGLs) will be too high, which is known as hyperglycemia. If too much insulin is administered, this will cause the BGL to be too low, which is known as hypoglycemia. Type 2 diabetes, also known as adult-onset diabetes and the most prevalent of the three, frequently causes hyperglycemia due to insulin resistance. In other words, the body builds a tolerance to insulin and can no longer adequately process it [
2]. The third type of diabetes, gestational diabetes, occurs when hormones from the placenta reduce the production of insulin in the mother’s body. Although this condition is usually not chronic, it can affect both the mother and baby’s health. Aside from these three types of diabetes mentioned, the NDDG also states several “impaired glucose intolerance” disorders can result in symptomatic and asymptomatic individuals.
Diabetes may result in the development of secondary complications that can be life-threatening, such as cardiovascular disease and renal failure. Other less severe secondary complications related to diabetes include nerve damage, ketosis, and various skin conditions. All the above complications dramatically affect the quality of life of patients who have diabetes. Accurately being able to measure blood glucose is an essential step in the healthcare of diabetes patients.
Traditional glucometers fall within a family of devices that utilize enzymatic reactions to produce electrical signals readable using the meter. These enzymatic reactions are glucose oxidase, glucose dehydrogenase, and hexokinase. The hexokinase method is the de facto gold standard for its high specificity [
3]. Practically, this home-monitoring (e.g., self-monitoring) approach, which requires a drop of blood, leads to poor monitoring habits, resulting in few measurements per day (i.e., three to seven samples per day), and providing a brief glimpse of blood glucose. Several methods have been proposed to improve the sampling rate of traditional self-monitoring techniques and overcome their inconveniences.
Some of these methodologies have even developed as commercial devices. For example, the Glucose Oxidase Needle (GON) approach is also an enzyme-based sensor like the traditional glucometers in which users must insert an enzyme-doped platinum needle into their subcutaneous tissue to estimate their blood glucose [
4] from the interstitial fluid (ISF). The main difference between the traditional glucometers and GON-based devices is that the sampling rate has been increased by measuring the blood glucose automatically every 15 min [
4]. In addition, users have access to their data via a smartphone application. This approach is considered minimally invasive since the platinum needle is small and the depth at which it is placed is shallow. Therefore, these GON-based devices are currently considered continuous glucose monitoring (CGM) devices. The Dexcom G5 and Abbot’s Freestyle Libre II devices are two commercial products based on this GON approach. The sensors’ lifespan in these commercial devices is limited, needing to be replaced every two weeks. The need to continually purchase these sensors makes the accessibility of these commercial devices difficult for users from low-income households.
In 2020, Huang et al. reviewed devices based on electrical impedance spectroscopy (EIS), which relies on implementing capacitance sensing via interdigital sensors, to provide a more accessible approach with reduced e-waste [
5]. Those interdigital sensors (also known as fringing field sensors) measure the equivalent impedance of the subcutaneous tissue at frequencies higher than 200 kHz when electromagnetic fields cross the human skin. Although the impedance has been shown to be well correlated with the amount of glucose concentration in the ISF of the skin [
6], the accuracy of those devices is highly sensitive to the magnitude of the electromagnetic field. Caduff’s research group, who was an early adopter of this EIS approach, developed the commercial PENDRA product [
6] which utilized capacitive sensors to estimate the blood glucose from the measured impedance of human skin. Despite the great potential of this device to become the first commercial noninvasive continuous monitor (NICGM) device, it ultimately failed in the market due to its poor performance in real-world settings (i.e., it only successfully estimated the BGL of two-thirds of users). Moreover, the PENDRA device required a complex calibration procedure that had to be performed by a team of healthcare professionals [
7]. To increase the applicability of the PENDRA device, in 2015, Caduff’s research team proposed the integration of multiple impedance spectroscopy sensors into a single device [
8]. Another similar approach to the EIS-based devices are the microwave planar resonant glucose sensors [
9,
10,
11]. These devices, classified into frequency-based, Qu-based, insertion/return loss-based, and phase-based sensors, rely on measuring the dielectric properties of the skin using resonant methods. Although these sensors are under further development to improve sensitivity, there is potential promise by integrating active circuitry and machine learning techniques.
Another noninvasive approach to estimating blood glucose is metabolic heat conformation (MHC). This technique takes advantage of the fact that most of the heat generated by the human body is a result of the cellular process that converts glucose into energy [
12]. This heat is then dispersed into the surrounding environment in the form of convection, evaporation, and radiation. Within the MHC technique, blood glucose is estimated based on ambient information (i.e., temperature and humidity), hemoglobin concentration, oxyhemoglobin concentration, and blood flow rate. These latter features can be measured from the fingertips of users’ hands using an optical multiwavelength spectrometer [
12]. The main drawbacks of this approach are its high sensitivity to environmental factors and the poor sampling rate due to current implementations do not allow continuous measurements. Nonetheless, it is important to highlight the fact that optical approaches have garnered a great deal of attention in estimating blood glucose. Most of these optical approaches are focused on infrared (IR) [
5], mid-infrared (mid-IR) [
13,
14], and near-infrared (near-IR) [
15] spectroscopy. The tradeoff between these three spectroscopic techniques is the penetration depth versus the glucose absorption.
To finalize the discussion of current noninvasive methods to estimate blood glucose, we would like to describe a multimodal approach implemented in the GlucoTrack device [
16]. This device integrates three different techniques (i.e., ultrasonic, electromagnetic, and thermal) for the noninvasive estimation of the BGL from the user’s earlobe. The ultrasonic sensor measures the speed that an acoustic wave travels through the user’s earlobe. The electromagnetic sensor provides an equivalent measurement of the skin impedance, like the EIS approach. Finally, the thermal approach applies a known amount of energy for a predetermined period and obtains the heat transfer characteristics of the tissue, like the MHC approach. Although none of these techniques directly measure the BGL, a strong correlation has been shown to exist between the individual measurement characteristics and the BGL in the earlobes’ tissue [
16]. Whereas the procedure for users to use the GlucoTrack device is quite simple (i.e., simple attachment of the sensor clip onto their earlobe), this device still does not provide continuous measurements due to the impracticality of wearing this earlobe clip during the day, hampering its commercial adoption.
In this work, we investigate a synergistic approach to accurately predict BGLs by combining noninvasive biometrics measurements with machine learning (ML) algorithms. The proposed multimodal approach integrates optical, electromagnetic, and thermal techniques to measure up to 14 features. The investigated noninvasive features include heart rate (HR), skin temperature (sTEM), heat flux (HF), electrodermal activity (EDA, also known as galvanic skin response, i.e., GSR), pulse oximetry (SpO2), systolic (SYS) and diastolic (DIAS) blood pressures, ambient temperature (aTEM), and ambient humidity (aHUM). We chose these features because they have been shown to perform well in predicting BGLs or have a moderate correlation with blood sugar, and their respective sensors can be implemented in a smartwatch-like wearable device [
16,
17], enabling continuous measurements during the day and a rapid commercial adoption. This pilot study aims to test different combinations of noninvasive features and ML algorithms and quantify their performance in predicting an individual’s blood glucose. We used two datasets to evaluate the combination of noninvasive features and ML algorithms to estimate accurate blood glucose values. The selected machine learning algorithms for this study fall under the category of regression algorithms. Among the different regression algorithms, we tested the performance of linear regression (LR), Support Vector Regression (SVR), K Nearest Neighbors Regression (KNN), Decision Trees Regression (DTR), bagging trees regression (BTR), Random Forest Regression (RFR), Gaussian process regression (GPR), and Multi-layer Perceptron Regression (MLP). All these models are popularly used in the data science community.
4. Discussion and Conclusions
This study explored the feasibility of using multiple noninvasive sensors to predict blood glucose concentrations. Firstly, we investigated our approach by using the already-existing OhioT1DM dataset. The original dataset was preprocessed to select only the features that fit into a wearable device (e.g., heart rate, GSR, skin ambient, and ambient temperature). Nine different regression algorithms, including models that can account for both linear and nonlinear relationships between the target glucose concentration and the features, were tested. The high RMSE value (>45 mg/dL) and low
R2 coefficient provided by the best-trained model for Subjects 559 and 563 from the OhioT1DM dataset show that the investigated features (e.g., heart rate, GSR, skin ambient, and ambient temperature) are not sufficient for the accurate prediction of glucose level. Note that 1702 instances from a total of 13,008 were predicted outside the clinically acceptable range (regions C–E in the Clarke Error Grid) for the subject with the lowest variance in the glucose concentration and the highest number of instances (e.g., subject 563). To improve the performance of the regression model, we also tested the performance of the models using additional engineered features. Among the multiple engineered features, we included the product of the heart rate and GSR (correlation coefficient = 0.26 and 0.20 for subjects 559 and 563,
Figure 3), the product of the GSR and skin temperature (correlation coefficient = −0.14, and −0.12 for subjects 559 and 563), and the triple product between GSR, skin temperature, and the heart rate. Nonetheless, the best-trained model using engineered features (not shown) did not enable predictions of glucose concentrations within the A and B regions in
Figure 4.
Therefore, a new dataset including additional features was required to predict glucose concentrations. This new dataset (e.g., the UofM dataset) contains nine additional features compared to the Ohiot1DM dataset: SpO2, motion, ambient humidity, ventral and dorsal moisture, heat flux, blood volume pulse, and inter-beat interval. Adding these features into the regression algorithms enabled training models that follow the target glucose concentrations. The performance of the best-trained models was significantly increased (
R2 ≈ 0.71–0.72 for the UofM dataset versus
R2 ≈ 0.11–0.16 for the OhioT1DM dataset). This improvement is related to the fact that the feature in the UofM dataset presents a much higher correlation with the target glucose concentration (
Figure 5), enhancing the model’s predictive power. Overfitting was combatted using cross-validation and ensemble tree models since they insert some randomness into the training process, and therefore are less prone to overfitting. Although the best model for Subject 2 is a Gaussian process regression algorithm, the performance of a bagged trees ensemble regression algorithm for Subject 2 is almost identical (RMSE of the bagged tree ensemble is 13.91 mg/dL versus 13.72 mg/dL for the Gaussian process model). This means that the bagged tree model generates predicted glucose values within the clinical range (e.g., A and B regions in the Clarke Error Grid) for both UofM subjects.
Figure 5 shows the additional correlation between the input features. For example, the systolic and diastolic blood pressure seems to be moderately and highly correlated to the heart rate, EDA, heat flux, and skin temperature. Considering the correlation between the blood volume pressures and the other features, we investigated the model performance without using the systolic and diastolic blood pressures as input features. The values of the RMSE are 10.192 mg/dL and 21.8522 mg/dL for Subjects 1 and 2, respectively. The RMSE value was increased by a factor of 1.21× and 1.59× for Subjects 1 and 2, respectively. Therefore, removing the blood pressure features leads to a less predictive model. Based on this reduction, the best approach may be to create engineered features taking advantage of the strong correlation between the features in
Figure 5.
Table 6 and
Table 7 report the performance of the best-trained model for Subjects 1 and 2 from the UofM dataset, respectively, when we removed features systematically. Comparing the last two rows in
Table 6 and
Table 7, the performance of the regression algorithm seems quite invariant to SpO2 and motion features. Although the removal of blood pressure and skin moisture (third and fourth rows in
Table 6 and
Table 7) penalize the accuracy of the predicted blood glucose, the
R2 value between the target and predicted glucose values is still highly correlated (e.g.,
R2 equal to 0.49–0.56 for subject 1 and 0.64–0.66 for subject 2). The removal of features such as IBI, BVP, and HF (second row in
Table 6 and
Table 7) predicts blood glucose values that fall within region C, causing the patient to make wrong actions to manage his/her condition. Finally, in the first row in
Table 6 and
Table 7, we selected the same features to the OhioT1DM dataset (e.g., heart rate, EDA, and skin temperature). Even though the range of our glucose values is highly reduced (e.g., blood glucose level ranges from 68 to 172 mg/dL), this result proves that these four features do not provide significant predictive power to estimate blood glucose levels. Note that the prediction power of a truncated version of the Ohio dataset within the healthy range (e.g., 60–200 mg/dL) is still null and poor for Subjects 559 and 563 (e.g.,
R2 equal to 0.06 for subject 559 and 0.13 for subject 563). This result shows that these features are not sufficient to predict accurate values of blood glucose, even though the truncated version of the Ohio dataset contains approximately 10,000 instances per subject. Future work should investigate feature engineering and extraction to produce more meaningful features while decreasing the input features in our model. Principal component analysis or similar dimensionality reduction techniques could also be used to transform the data of some features into a lower-dimensional space, thereby reducing the feature to sample ratio while preserving the information. For developing commercial wrist-worn devices, it is mandatory to identify the most critical features that the device must incorporate.
Future work will also be focused on investigating more sophisticated methods to preprocess the dataset. For example, in this study, the dataset was generated by simply removing outliers, averaging the values over the same time signature, and selecting the values that match with the timestamp of the target blood glucose. We will also consider extracting additional features from current measurements to improve the predictive model in future work. For example, we could use the MATLAB-based KARDIA software package to measure phasic cardiac responses and time- and frequency-domain heart rate variability using the IBI data generated by the Empatica E4 wristband [
20]. Note that blood glucose levels have been well correlated to heart rate variability using photoplethysmography [
19,
21,
22]. In addition, we could investigate the extraction of better features by decomposing the skin conductance (i.e., EDA) into its tonic and phase components using the MATLAB-based software package Ledalab [
40]. These two packages are distributed free of charge.
Future work should also be focused on investigating a global model (as opposed to user-specific or local), including specific user information as input features. This would allow a global model to discriminate based on key biometrics (such as BMI, age, race, and gender) to increase the accuracy of blood glucose level predictions for individuals.
In conclusion, for the first time, to our knowledge, this study demonstrates the use of noninvasive sensors implemented in a wrist-based device to predict blood sugar concentrations noninvasively within the healthy range, which may be significant for athletes and military personnel who require continuous monitoring of their blood glucose. Our results should be viewed as a proof-of-concept study, still requiring a larger dataset, including a broad range of individuals (different ages, races, and health conditions). Furthermore, this multimodal approach must be tested to accurately estimate both hypo- and hyper-glycemic events. Even though accurate and continuous blood glucose monitoring is important for general health management, this technology would have the greatest impact within the diabetic community. Future datasets must include individuals with diabetes. We believe that this is the first step before a clinical research study to determine the key features to be implemented in a glucose-based smartwatch.