1. Introduction
Biometric authentication, a longstanding method for verifying user identity, employs various anatomical features such as palms, irises, fingerprints, veins, and facial structures [
1]. The human hand, in particular, possesses multiple attributes suitable for biometric authentication. Among these, palm prints and geometrics serve as the primary visible characteristics, while palm vein patterns function as an invisible feature. Similar to fingerprints, palm prints are distinct for each individual and derived from unique skin patterns. As an additional advantage, both low-resolution imaging and cost-effective capture devices can be utilized for palm print analysis [
2]. The geometric features of the hand encompass all visible characteristics, including finger shape and joints as well as palm shape and size. These attributes are typically combined with other biometric measures to enhance authentication accuracy [
3]. Palm vein patterns, known for their high precision in authentication, act as a distinct unit and are resistant to spoofing. These patterns are identified through the analysis of specific regions in palm infrared images [
4]. However, authentication methods employing geometrics and palm prints may be vulnerable to compromise, while those utilizing palm vein patterns necessitate supplementary infrared equipment. Despite the absence of tampering and equipment concerns, conventional biometric authentication methods involving palm features remain susceptible to spoofing attacks [
5,
6]. Thus, robust anti-spoofing techniques are essential in combination with forgery and alteration resilience, as well as equipment accessibility. Recently, facial recognition, an alternative biometric authentication approach, has employed remote photoplethysmography (rPPG) technology to bolster anti-spoofing measures, demonstrating high accuracy [
7]. rPPG extracts heart rate information from skin color differences observed through camera imaging, eliminating the need for additional wearable equipment by utilizing an RGB camera alone [
8]. Moreover, as rPPG relies on heartbeat signals, the risk of damage is minimal. This presents the possibility of applying heart rate information derived from rPPG to palm print authentication methods. Consequently, this study examines the use of palm rPPG signals in an RGB environment, demonstrating that palm rPPG signals offer a spoof-resistant authentication solution and proposing a novel approach for palm biometric authentication.
Figure 1 illustrates the method proposed in this study.
2. Related Works
Previous research on spoofing detection for hand biometrics related to the proposed method can be categorized into two categories. As shown in
Table 1, these include methods that use human physical characteristics and methods based on image quality.
Hong Chen et al., 2005 utilized plaster and paper cards to fabricate artificial hand silhouettes, which were employed in experiments conducted with the HandKey II system. Specifically, five artificial hands with geometric features were utilized for testing [
5]. However, it is important to note that the reliability of their findings was limited due to the utilization of a small dataset. In contrast, our research aims to enhance the reliability of the model’s performance by employing a larger dataset consisting of 138 samples of real palm data and 124 samples of artificial palm data. Furthermore, our study overcomes potential limitations associated with the geometric approach, where performance may be compromised when encountering swollen hands or obscured hand regions. By focusing on bio-signals extracted from the hand rather than relying solely on hand geometry, our approach enables the detection of spoofing attempts independent of hand shape.
Haixia Wang et al., 2023 proposed a novel dual-wavelength synchronization acquisition system tailored for palm biometrics. The study demonstrated the system’s capability to accurately extract SpO2 and pulse rate from palm fingerprint and palm vein images. Moreover, it was established that the integration of SpO2 and pulse rate significantly enhances the anti-counterfeiting effectiveness of palm biometrics. To investigate the system’s performance, artificial palm prints and artificial palm veins were fabricated by utilizing diverse materials. Objects were classified as genuine when the SpO2 readings fell within the range of 70% to 100%, and similarly, objects were categorized as genuine when the pulse rate ranged between 40 and 200. The PPG signal was processed using a residual network, and a classification approach was employed, with the train and test data split in a ratio of 4:1 to enable cross-validation. Leveraging the dynamic function, a three-layer anti-spoofing strategy was devised, ensuring the preservation of palm biometric recognition capabilities while achieving robust anti-spoofing functionality without necessitating additional hardware [
9]. In contrast with this study’s approach, where SpO2 and pulse rate were employed, our research focuses solely on the rPPG signal for spoofing detection. This simplification of the algorithm yielded comparable accuracy results. Furthermore, palm vein image acquisition devices, which operate in the NIR spectrum, can be costly. In contrast, our study presents the advantage of conveniently detecting spoofing without the need for a separate NIR device, as it effectively utilizes an RGB camera.
The method presented in this study exhibits robustness against various sources of noise, including artifacts worn on the hand, due to its independence from the hand’s geometric characteristics. Moreover, by solely utilizing the features extracted from the rPPG signal within the palm’s region of interest (ROI), our technique offers simplicity compared to complex bio-signal-based anti-spoofing methods while demonstrating comparable or superior performance.
Vivek Kanhangad et al., 2013 introduced a methodology aimed at safeguarding a palm-print-based biometric system against spoofing attacks by utilizing human hand photos. The proposed approach employed a local texture pattern analysis to extract palm print features. Specifically, a local binary pattern (LBP) was utilized to train a classifier responsible for determining the authenticity of an input hand image, distinguishing between real and fake palms. Support vector machines (SVMs) served as the classification model. The study employed a dataset consisting of 611 samples from 100 subjects, achieving an impressive accuracy rate of 97.35% [
10]. While LBP has exhibited limitations in scenarios involving uniform brightness or susceptibility to noise and lighting variations, its usage has declined in recent times. However, our proposed methodology overcomes these challenges by leveraging bio-signals instead of relying solely on hand feature extraction, thereby offering a viable solution to address these issues.
Vivek Kanhangad et al., 2015 introduced a novel approach for the detection of display- and print-based spoofing attacks targeting palm print authentication systems. The study specifically focused on two distinct categories of sensor-level attacks, namely print-based and display-based attacks. The analysis of acquired hand images was conducted to estimate surface reflectance, and the feature set was constructed using first- and higher-order statistical features derived from the distribution of pixel intensities and sub-band wavelet coefficients. A trained binary classifier leveraged the identification information to discern between genuine and fake hand images. The study utilized a dataset consisting of 1300 samples obtained from 230 subjects. The results demonstrated a spoof acceptance rate of 79.8% when presented with a counterfeit digital or printed copy, and the proposed approach consistently achieved an average 10-fold cross-validation classification accuracy of above 99% [
11]. It is noteworthy that while the extraction of surface reflectance served as the basis for feature extraction in the referenced paper, our research diverges in the utilization of rPPG bio-signals extracted from the hand. If a bio-signal is used, spoofing can be detected by being less sensitive to external factors.
Asish Bera et al., 2021 introduced a presentation attack detection (PAD) method that employs visual quality evaluation to mitigate illicit attempts on hand biometric systems. In their study, the hand images of 255 subjects were genuine samples, and counterfeit images were obtained by capturing images using a Canon EOS 700D camera for each authentic sample. Additionally, artificial, fake images were created by introducing Gaussian blur and noise to the original images. A threshold-based gradient size similarity quality metric was proposed, taking into account the intensity variation between adjacent pixels, to differentiate real hands from fake hands. Classification experiments were conducted, utilizing k-nearest neighbors, random forests, support vector machine classifiers, and deep convolutional neural networks. Notably, an average classification error of 1.5% was achieved using the k-nearest neighbors and random forest classifiers [
12]. Since this paper uses artificially created noise from real hand data for fake hands, if spoofing is attempted in a method other than noise, such as using a printed hand photo, fake data may not be identified. In our case, this problem does not occur because the extracted bio-signal is used.
Xiaoming Li et al., 2015 proposed a novel approach leveraging binarized statistical image features (BSIFs) and image quality evaluation. The evaluation of image quality revealed that the re-captured images exhibited blurry and low-detail characteristics, thus making palm print a suitable feature due to its ability to provide more textural information compared to the original image features. Data collection was carried out using iPhone 5 and iPhone 5s devices. The BSIFs calculated binary codes for each pixel using a filter, and the filter’s base vector was learned from natural images through an independent component analysis. An SVM was employed for the learning process [
13]. It should be noted that, in this case, the utilization of features indicating that fake hand data possess lower levels of detail than real hand data makes it challenging to determine high-resolution fake hand data. However, our study addresses this limitation by capturing both fake hand images and real hand images using the same device. Consequently, it becomes feasible to detect spoofing in images of comparable quality.
P. Pravallika et al., 2016 investigated the applicability of their approach to iris, face, and palm print modalities. Their study focused on liveness detection and demonstrated that biometric security can be enhanced through image quality evaluation and the fusion of diverse biometric characteristics. To discriminate between real and fake samples, LDA, QDA, and SVM classifiers were employed. Image quality evaluation was performed using FR-IAQ (21IQMs) and NIR-IQA (41QMs). The highest accuracy was achieved using an SVM with an FGR at 9.2%, an FFR at 10.1%, and an HTER at 9.65% [
14]. Since the quality of the fake image is assumed to be different from the quality of the real image, spoofing detection may not work well if the quality is similar.
Mina Farmanbar et al., 2017 introduced a novel approach that combines texture-based methods with image quality evaluation metrics to mitigate spoofing attacks on face and palm prints. The texture-based methods employed in the study included LBP and HOG descriptors. For the generation of fake data, printed paper was utilized, and a dataset was collected from 50 individuals. Seven image quality features were employed, namely one-peak signal-to-noise ratio, structural similarity, mean squared error, normalized cross-correlation, maximum difference, normalized absolute error, and mean difference [
15]. It is worth noting that this method operates under the assumption that spoofed images exhibit distinct quality differences compared to genuine images, which can affect the performance depending on the quantity and quality of the image data used for spoofing. In contrast, our proposed approach leverages the rPPG signal extracted from the hand, which is a biological signal that is inherently difficult to manipulate, thus offering a higher level of security.
In our study, we focused on performing spoofing detection solely based on the rPPG signal, which results in reduced computational complexity compared to methods utilizing multiple image quality characteristics. By relying on the rPPG signal, our approach demonstrates consistent performance irrespective of variations in skin condition or skin type, making it suitable for daily usage scenarios.
4. Results
The rPPG signal, frequency-converted and extracted with video lengths of 3, 5, and 7 s, was classified using an SVM model. The results using features without PCA and the precision, recall, and F1-score values extracted with PCA of 15 or fewer components are summarized in
Table 2. Without PCA, the accuracies were 97.16% for a 3 s video, 98.4% for a 5 s video, and 97.28% for a 7 s video. The lowest accuracy was observed for the 3 s video, while the highest was achieved with the 5 s video.
SVM kernels employ the radial basis function (rbf) kernel. Concerning the parameter configuration, when utilizing GridSearchCV, the regularization parameter (C) was set to 10.0, and the kernel coefficient (gamma) was established as 1000.0. Even in cases where PCA was not applied, the values of C and gamma remained consistent at 10.0 and 1000.0, respectively. A receiver operating characteristic (ROC) curve was employed to visualize the obtained results [
21]. The ROC curve illustrates the diagnostic ability for specific decision criteria in binary classification situations, with larger area under curve (AUC) values indicating enhanced classification accuracy.
Figure 11 displays the curves for each experimental result. However, to prevent spoofing, it is essential to avoid confusing fake palm signals with real ones. A comparison between the confusion matrix results when the false positive (FP) value is 0 and the confusion matrix results from the previous experiment is illustrated in the subsequent figure. In
Figure 12, the number 1 denotes a real palm signal, and 0 represents a fake palm signal. The 5 s video, which had the highest accuracy at 97.2%, experienced a 1.2% reduction in accuracy. The 3 s video recorded 95.6% accuracy, and the 7 s video achieved 95.9% accuracy, indicating an accuracy drop of 1 to 3%.
5. Discussion
Recently, anti-spoofing techniques utilizing rPPG signals have been extensively researched. The rPPG signal can be measured from the skin surface of images, and in vision-based biometric methods that involve skin areas like the face or hands, synchronized rPPG features with heartbeats are not observed in cases of non-live biometric information.
In prior face anti-spoofing research, a method for face anti-spoofing based on a 9 s video was proposed [
22]. In previous studies, using rPPG signals extracted from the face and employing deep learning models achieved 99% accuracy. This outperforms the method proposed for the hand area in this study. However, for anti-spoofing research using rPPG, it is important to shorten the measurement time for ease of use and to minimize motion noise during measurements while ensuring accuracy in various environments. In this study, a video length of 5 s was proposed as the length that could generate optimal accuracy. Additionally, by confirming the feasibility of using a 3 s video, we significantly reduced the measurement time compared to previous research. Furthermore, while previous research found that face anti-spoofing may cause overfitting issues due to learned features from limited datasets, we defined handcrafted features in the frequency domain of rPPG signals and utilized an SVM model to make decisions in the multidimensional space for classifiers. This ensured the interpretability and scalability of the results. The dataset used in this study was obtained in an uncontrolled environment using various smartphone camera devices without specifying a particular camera model. This effort represents an attempt to capture videos in wild environments, and it is expected that there will be minimal performance degradation when used in real-world scenarios.
However, this dataset does not consider various skin tones. Although there is an argument that palm skin color shows little variation across races, the impact of our study focusing on hands rather than faces could be less significant, yet empirical validation through diverse racial datasets is necessary. Furthermore, the results were derived by only using an SVM model in the result generation method. The possibility of higher accuracy was hindered by not using additional classification models. In future research, we plan to explore anti-spoofing approaches using rPPG in infrared images and hand recognition in RGB environments using infrared images.
6. Conclusions
In this study, we propose a palm spoofing detection method utilizing the remote photoplethysmography (rPPG) signal of the palm as a potential means of biometric authentication. During palm rPPG extraction, valid authentication time is analyzed through video length adjustment and frequency conversion. We define video lengths of 3 s, 5 s, and 7 s, and at each length, the frequency data of real and fake palm rPPG signals are classified using an SVM model. In the SVM model, the power value obtained by dividing the heartbeat frequency into 30 equal intervals and the subsequent value are input as PCA components and compared. This results in accuracies of 97.73% for 3 s, 97.76% for 5 s, and 99.09% for 7 s using PCA. Without using PCA, accuracies of 97.16% for 3 s, 98.4% for 5 s, and 97.28% for 7 s are achieved. Our findings indicate that the 3 s video length may not allow for the extraction of valid features, owing to its short duration. Consequently, we suggest a 5 s video, which yields the highest accuracy of 98.4%, as the optimal length. In spoofing detection, false positives (FPs) that identify fake video signals as genuine are critical. For a 5 s video with an FP value of 0, the accuracy is calculated at 97.2%. Accuracies for other durations also exhibit decreases of about 2%. Additionally, we compared the results with and without PCA. For the 3 s and 5 s results, it was observed that the performance determined as the principal component value was reduced in proceeding with PCA. This suggests that most of the features in the extracted heart rate band have significant meaning. On the other hand, as the signal extraction video length increases, performance improves when extracting results using PCA. This means that the longer the length of the image, the more meaningless information is included in the 30 features. The findings demonstrate that the palm rPPG signal serves as a spoofing-resistant authentication method in an RGB environment, offering a novel palm authentication technique. This study presents a method for preventing hand image spoofing in an RGB setting.