2.1. Alternating Electric-Field Detection Principle
The electric-field detecting system consists mainly of an emitting portion and a sensing portion. The emitting portion includes an excitation source and an emitting electrode. The excitation source applies a sinusoidal alternating voltage to the emitter electrode, and the quantity of charge on the electrode changes with time, thereby forming an alternating electric field in space. Normally, the frequency f of the alternating voltage is between 10 and 150 kHz. It is chosen outside the frequency range of the radio to avoid being affected by radio signals. The electromagnetic wave wavelength λ=c/f generated by the charge on the electrode exceeds 2 km, and the detection distance of the system does not exceed 1 m. The emitting electrode is centimeter-sized, much smaller than the electromagnetic wavelength λ. Therefore, in the detection range of the system, the time-varying electromagnetic field can be regarded as quasi-static electrical near field at each moment. The electrostatic field equation can be used to solve the distribution of the electric field and potential in space. To improve the dynamic response characteristics of the system, the system carrier operates at a frequency between 40 and 100 kHz, and the performance of the system with a carrier frequency of 40 kHz is tested.
The sensing portion includes sensing electrodes and signal-processing circuit. The sensing electrode generates induced current under the alternating electric field. The current is coupled to the circuit in the form of a voltage through a snubber capacitor. The signal-processing circuit amplifies and filters the input voltage, and then transmits it to the host computer through the data acquisition system. When the hand enters the detection area of the system, the charge on the hand is redistributed under the action of the electric field, causing disturbance to the original electric field. The distortion of the electric field is fed back to the signal receiving end of the detector, from which the system obtains the hand information and analyzes its position and posture.
The equivalent circuit model of the electric-field detection system is shown in
Figure 1.
RX1 and
RX2 are the sensing electrode, and the remaining electrodes are omitted from the model.
TX is the emitting electrode.
GND is the system ground.
VTX is the excitation source. The capacitance between the emitting electrode and the system ground is
CTXG, the capacitance between the sensing electrodes and the emitting electrode are
CRX1TX and
CRX2TX, respectively. Since the sensing electrodes are in the same plane, the capacitance between them is negligible. After the hand enters the electric field, the capacitance formed with the sensing electrodes are
CRX1H and
CRX2H, and the capacitance formed with the system ground is
CHG. The change in the position of the hand causes a change in the capacitance between the hand and the sensing electrodes, and the induced charge
QRX of the electrode changes accordingly, thereby causing a change in the voltages
φRX1 and
φRX2 on the sensing electrodes. The detection system recognizes the gesture based on the difference in voltage changes of the electrodes.
When the hand does not enter the detection area, the charge density
σTX can be calculated according to the potential
VTX of the emitting electrode, and the electric field intensity
ETXRX0 near the sensing electrode can be known, thus the induced voltage
φTXRX0 can be expressed by the equation below.
When the hand enters the electric field, the electric-field intensity
ETXH near the hand can be known, according to Gauss’s law, the quantity of induced charge formed on the hand is
Qh, which can be expressed as below.
Since the curvature changes greatly on the hand, the distribution of charge is extremely uneven, so we divide it into several curved surfaces with different charge densities, with area of
Si, charge density of
σi, and the distance from the sensing electrode of
Ri (
i = 1, 2, …, n). The induced charge on the hand causes change to the spatial electric field, and the electric-field intensity
ERX on the sensing electrode is deduced from above equation.
The sensing electrode potential
φRX can be calculated by integrating
ERX. Calculate the change ∆
φ of the potential of different sensing electrodes after the hand enters the electric field as follows.
Because the shape of the human hand is irregular and cannot be solved analytically, the variation of the potential on the sensing electrode under different gestures is analyzed by simulation.
2.2. Design and Implementation of Electric-Field Detection System
The electric-field detection system is mainly composed of four parts, namely the sensing electrode, the insulating material, the emitting electrode, and the system ground. According to the function of each part, a reasonable system structure is designed, in which the system ground must be placed at the bottom of the model to shield the interference source below, ensuring that the sensing electrode is only affected by the emitting electrode and the hand above. The insulating material is above the system ground to isolate the system ground from other electrodes. The emitting electrode and the sensing electrode are above the insulating layer, which can be placed in the same layer or separately on the upper and lower layers. When the electrodes are in the same layer, the electric field formed by the emitting electrode in the space has less effect on the sensing electrode. The initial potential value of the sensing electrode is low. After the hand enters the detection range, the change in the potential of the sensing electrode will be small. By placing the sensing electrode above the emitting electrode, the change in the potential of the sensing electrode is significantly improved. Therefore, the hierarchical distribution of each part of the detection system is the sensing electrode layer, the insulating layer, the emitting electrode layer, the insulating layer, and the system ground from the top to the bottom, with a total of five layers.
To improve the sensitivity of the system, it is necessary reasonably design the thickness of the insulating layer, increase the coupling degree between the hand and the sensing electrode, and reduce the coupling degree between the emitting electrode and the sensing electrode. When the system detection panel area is close to that of the hand, the area of the sensing electrode should be increased as much as possible to increase the capacitance
CRX1H and
CRX2H between the hand and the sensing electrode. Under the premise of meeting the portability requirements of the system, the distance between the sensing electrode and the emitting electrode is appropriately increased to reduce the capacitances
CRX1TX and
CRX2TX between them. In addition, the system ground and the emitting electrode should maintain a low degree of coupling, so as to reduce the effect of the system ground on the sensing electrode in the one hand, and to improve the driving capability of the excitation source on the other hand. Based on above analysis, the distance between the sensing electrode and the emitting electrode is determined to be 10 mm, and the distance between the emitting electrode and the ground is also 10 mm. On this basis, the model of the detection system is shown in
Figure 2.
3D modeling software SolidWorks is used to establish different gesture models. All these models are imported into Maxwell, which is an electromagnetic field simulation software [
26]. A total of four different gestures are drawn, including stretched hand, W-shaped gesture, V-shaped gesture, and fist, as shown in
Figure 3.
According to the size of the ordinary human hand, the area of the sensing electrode of the detection system is determined to be 20 cm*12 cm, which is slightly larger than that of the hand. The hand can be divided into three areas: fingers, palm, and wrist. The fingers can be divided into three parts to detect whether the fingers of the hand stretch. Correspondingly, the palm is also divided into three parts to detect whether the fingers of the hand bend. The human wrist is very flexible. The wrist is divided into three parts to measure the pitch, yaw, and roll of its, as shown in
Figure 4a. Therefore, the detection system is set with nine sensing electrodes, and the sizes of the sensing electrodes are also different according to the proportion of the parts occupying the hand.
Create panel area matrix, as shown in
Figure 4b.
The size of No1, No3, No7, No9 area is 6 cm × 3 cm, the size of No2, No8 area is 6 cm × 2 cm, the size of the No4, No6 area is 4 cm × 3 cm, and the size of No5 area is 4 cm × 2 cm.
After the simulation model is established, the materials, solution domain, excitation source, and boundary conditions of each part of the system are set. The material of the sensing electrode, the emitting electrode, and the system ground is set to be copper, and the insulating material is set to be FR4 (a common material for Printed Circuit Board). The material properties of the human hand are set, including the relative dielectric constant of 77 (slightly less than that of water). The effective detection area of the system is within a sphere centered on it with a radius of 50 cm, so the solution domain is set to be a sphere with a radius of 5 m, and the material of this area is set to be air. The excitation source is set on the emitting electrode, and the voltage amplitudes are 0 V, 0.8 V, 1.5 V, 2.5 V, 3.3 V respectively, and the solution domain boundary and the system ground are set to be 0 V. The simulation calculation is performed to simulate the change in potential on the sensing electrodes under the alternating voltage. The potential changes on the sensing electrodes when the hand is 1 mm, 20 mm, 40 mm, 60 mm, 80 mm, 100 mm away from the detection electrode for the four kinds of gestures are obtained.
Figure 5 shows the inductive charge density on the hand when the 4 gestures are 1 mm away from the sensing electrodes under the emitting electrode voltage of 3.3 V. Since the emitting electrode is loaded with a positive voltage, the hand is negatively charged. The colors in the figure are from red to purple, indicating that the negative charge density gradually increases. The charge density of the blue and purple parts exceeds −3 × 10
−9 C/m
2, and the charge density of the yellow and green parts is −6 × 10
−10 ~ −2.5 × 10
−10 C/m
2, the orange and red portion has very little charge, and its effect is basically negligible. Except that the distribution of charge for stretched hand is relatively uniform, for the other three gestures charge is relatively concentrated. Since the difference in distributions of charge for each gesture is obvious, it can be inferred that there is a difference in the potential change on the sensing electrodes.
Spline interpolation is performed on the discrete sequence composed of the potential values of the sensing electrodes under different emitting electrode voltages. The variation of the sensing electrode potential in one cycle of the alternating excitation source is simulated, and the effects of the four gestures on the potential of the sensing electrodes are analyzed.
Figure 6a shows a simulation model diagram of a stretched hand, and
Figure 6b shows the relative change curve of the potential for stretched hand. The left thumb is closest to the detection panel, so the overall potential of the sensing electrodes in the palm region changes greatly, and the potential of the electrode in
No6 region changes the most, exceeding 0.1 V. At the same time, due to the full extension of the fingers, the potential changes of the electrodes in the upper layer are relatively uniform.
Making a W-shaped gesture, the thumb and the index finger are closest to the detection panel, and the hand charge is concentrated here, so the potential change of the No6 region electrode is much larger than those of the remaining electrodes, which is almost 0.5 V. The potential of the No5 region electrode is relatively large, but is only 0.2 V, and the electrode potential changes in other regions are less than 0.1 V. Making a V-shaped gesture, the curved little finger, ring finger, and thumb are closest to the detection panel, so the potential of the palm and the wrist area changes greatly. The average potential change is more than 0.3 V. When a fist is above the sensing electrodes, charge is concentrated on the curved fingers, the potential change of the electrode in the finger area is extremely small, the potential change of the electrode in the palm area is the largest, and the potential change of the electrode in the No6 region exceeds 0.35 V.
When the distance between the hand and the sensing electrodes is 1 mm, the difference in the potentials of all electrodes under different gestures is obvious, and the degree of discrimination is high. However, the magnitude of the potential change is limited to a maximum of 0.5 V. As the distance between the hand and the electrode increases, the amplitude of the potential change further decreases, and the feature extraction difficulty increases. Taking the W-shaped gesture as an example, the maximum potential change of the sensing electrodes is calculated when the distance between the hand and the electrode is 1 mm, 20 mm, 40 mm, 60 mm, 80 mm, and 100 mm, respectively. When the distance reaches 40 mm, the maximum electrode potential change is less than 30 mV, and even less than 10 mV for some electrodes. According to the operator’s habits, the distance between the hand and the electrode is usually within 150 mm. Therefore, the detection system not only needs to improve the structure, but also design the signal-processing circuit to adjust the potential of the sensing electrodes, in order to increase the detection distance of the system, so that it can meet the demand of practical applications.
The structure of the detection system is shown in
Figure 7. The emitting electrode
TX2 and the sensing electrode
RX1′ with the same structure as above are designed with the system
GND as symmetry plane. Due to the shielding effect of
GND, the upper and lower electrodes do not affect each other so that could be regarded as two independent systems. The excitation source is applied to the two emitting electrodes
TX1 and
TX2, and the potential values on the sensing electrodes
RX1 and
RX1′ are basically the same. After the hand enters the electric-field detection range, the potential on the sensing electrode
RX1′ changes under its effect, and the potential of the sensing electrode
RX1 does not change, so the potential difference between
RX1 and
RX1′ changes.
φRX1 and
φRX1′ are input to the differential amplifier circuit for filtering. Finally, the voltage signal is transmitted by the data acquisition system to the host computer.
The electric-field detection system is shown in
Figure 8. The sensing electrode layer is made of a flexible printed-circuit board (FPC), the insulating layer is made of a 10 mm thick epoxy resin plate (FR4), and the emitting electrode and system
GND are made of brass film. To increase the detection range of the system and reduce the radio wave interference in the space, an alternating signal with amplitude of 5 V and frequency of 40 kHz is applied to the emitting electrode. According to the simulation result, the gain of the differential amplifying circuit is adjusted to be 30. To reduce the effect of 50 Hz power frequency signal in space on the signal, a fourth-order Bessel band-pass filter circuit is designed with a center frequency of 40 kHz and a bandwidth of 1 kHz. The acquisition system selects the USB bus-based data acquisition card, with the range of ± 10 V, and the sampling frequency of 1 MS/s. It mainly consists of the power supply, signal source, oscilloscope, detection electrode, signal-processing circuit, and data acquisition system.
Since the sensing electrodes of the detection system have different sizes, the gain of the signal-processing circuit needs to be fine-tuned to ensure that the sensitivities of all electrodes are the same. The metal ball is suspended above the center of the detection panel, and the height of the ball is adjusted from 10 mm to 150 mm at an interval of 10 mm to collect the potential change of the sensing electrodes at different distances. The gain of the surrounding electrode signal-processing circuit is adjusted with reference to the change in the center sensing electrode potential.
The electric-field detection system is tested. The stretched right hand of the operator lifts flat above the detection panel, and the wrist is 80 mm away from the panel. The potential difference curves of the system sensing electrodes are shown in
Figure 9. It can be seen that the potential difference of the sensing electrodes in the palm region is relatively large, and the potential of the electrode in the
No4 region is the largest, which is close to 1.5 V. Since the right thumb is closest to the detection panel, the overall potential difference of the [
No1,
No4,
No7] region electrodes is larger than that of regions [
No3,
No6,
No9]. The distribution of charge on the stretched hand is relatively scattered, so the change in the potential difference of the region electrodes is relatively the same, basically about 1 V.
The measurement and simulation results under the four gestures are compared to know that it has high similarity, which verifies the effectiveness of the detection system and signal-processing circuit. The conditioning of the sensing electrodes potential signal by the signal-processing circuit not only increases the detection distance of the system, but also improves the discrimination of different gesture features.
2.3. Gesture-Recognition Method
Gestures are characterized by diversity and difference, and the gestures made by operators are different. The electric-field detection system needs to extract the gesture features from different hand states. We set several common hand states. State 1 is for the hand keeping flat above the electrode, state 2 is for the hand lifting, state 3 is for the hand sagging, state 4 is for the hand side spinning, state 5 is for the hand side shifting. The detection system can recognize 4 gestures in these 5 states: stretched hand, fist, V-shaped gesture, and W-shaped gesture. The distance between the wrist and the panel is taken as the standard distance. After many tests, the comfort distance for operator is 60 mm ~ 90 mm. In this range, the operator can freely change the hand state. This paper mainly studies the characteristics under different hand states and postures under the comfortable distance.
1600 sets of different gesture data of 4 operators in 5 states acquired. We collect 80 sets of data for each gesture of each operator in each state, and randomly select half of them as training set samples. Therefore, 800 sets of data are used to perform parameter training of the recognition algorithm, and the remaining data is used to verify the accuracy of the identification method.
In different hand states, there is a significant difference in the relative distance between the hand and the sensing electrodes of the system so that the potential of each sensing electrode is obviously different. In comparison, different hand postures have less influence on the potential of the sensing electrodes. Therefore, gesture recognition is performed in two steps, first determining the state of the hand, and then distinguishing the gesture.
The detection system divides the space into 9 parts by the sensing electrodes. The potential matrix
V defining the composition of the sensing electrodes is,
Therefore, the total potential sum of the sensing electrodes is VE = ∑ei (i = 1, 2, …, 9), the standard deviation of the sensing electrodes is σ. The matrix is a basic attribute of a gesture, which could be combined and transformed to form important features of the recognition gesture. First, some elements of the potential matrix are combined to analyze the change of the potential in the partial areas, including a potential matrix row vector [x1, x2, x3], a column vector [y1, y2, y3], upper right region z1 composed of e2 and y3, and lower right region z2 composed of e8 and y1. Calculating the sum of the elements in each area could be expressed as [Vx1, Vx2, Vx3], [Vy1, Vy2, Vy3], [Vz1, Vz2].
In view of the difference in gestures, even if one person repeats the same gesture, there will be different, so the potential changes of the sensing electrodes are different. However, the proportional relationship of the electrode potentials is determined, so it is normalized.
The sum of normalized potentials of the regions [
x1,
x2,
x3], [
y1,
y2,
y3] and [
z1,
z2] are [
dx1,
dx2,
dx3], [
dy1,
dy2,
dy3] and [
dz1,
dz2] respectively. According to the test data, the potential distributions of all regions indifferent hand states are different.
VE decreases significantly when the hand lifts, and increases significantly when the hand drops. When the hand spins to the right, the normalized potential and [
dy1,
dy2,
dy3] increase step by step. When the hand moves to the left,
dz1 increases and
dz2 decreases. Through the statistics of the test data, the characteristics of the specific hand state are extracted, as shown in
Table 1.
The gesture characteristics under different hand states are different. In general, the stretched hand means stretching all fingers, therefore, the standard deviation σ is relatively small. The fingers of the fist are all bent, so the difference between
dx1 and
dx2 is large. Making the V-shaped gesture means that three curved fingers are closer to the electrodes, so
VE is larger. Making the W-shaped gesture means that the index finger and thumb are closer to
y1, so the difference between
dy1 and
dy3 is larger. The remaining features are determined based on different hand states. The hand posture characteristics of each hand state are shown in
Table 2.
Based on the hand state characteristics, the decision tree model is constructed using the C4.5 algorithm [
27]. First, the five hand states are marked as
xi (
i = 1, 2, …, 5), and the probability of occurrence in the data set is
p(
xi). Entropy of data set
S before splitting could be expressed below.
Calculate the sum of the entropies of all data sets divided by feature
T,
Where m represents the number of data sets after splitting, and calculates the difference in entropy before and after splitting, i.e., the information gain.
The split information of the data sets divided by feature
T is,
Information gain rate could be deduced from above equations.
The C4.5 algorithm measures each feature by the information gain rate. The greater the information gain rate, the higher the purity of the data set after splitting.
According to the formula above, the information gain rate of each feature of the first splitting point is calculated to be [1, 0.6837, 0.7624, 1], and it can be seen that feature 1 and 4 are the optimal features for the initial splitting. On the whole, the information gain rate of the hand features is generally higher, and the feature differentiation is high.
For the new data set generated after splitting, the information gain rate of different features is calculated again to obtain the optimal splitting feature. Splitting is implemented through constant iterative operations until all features are traversed. After determining the state of the hand, the decision tree model is established according to the hand posture feature to perform gesture recognition.