1. Introduction
Wireless sensors are widely used in transportation systems to ensure the system security, lower the fuel consumption and increase system efficiency etc. [
1,
2,
3]. Cargo shipments security is a critical challenge for shippers, every year cargo theft costs billions of dollars. A Transportation Security Sensor Network (TSSN) architecture is developed to realize the vision of trusted corridors [
3]. In TSSN the cargo security is monitored with active and battery-powered container seals (sensors) to report security seal events timely. Any unauthenticated attempts to unlock the container will be reported to the operations center through mobile network and internet. A security Seal Interrogation Transceiver (SIT) is designed to communicate with the container seals over a wireless network, which should be secured with secret-key, or else an adversary could interfere the wireless communication link to disable the alert system.
The security issue in TSSN is similar of that in Wireless Sensor Networks (WSNs). There are two main challenges on the secure communication: 1. the low cost of wireless nodes leads to severe resource constraints such as limited battery power, memory and low computation capability; 2. the open nature of the wireless link makes it easy to be eavesdropped. The normally used public cryptography approaches are not suitable for TSSN because sensor nodes (container seals) are source constrained devices that cannot afford for public key cryptography. Therefore, symmetric key-based schemes are widely used in WSNs, because of the advantages of low cost in power comsumption, time execution and code size [
4,
5]. The main challenge for WSNs to implement symmetric key-based scheme is the secret-key distribution in the network.
Physical layer key generation schemes could offer a solution of the issue [
6]. The main advantage of physical layer key generation scheme is that the key is directly generated in physical layer, and there is not any key distribution process. In an ideal situation the eavesdropper could not get any information about the key.
The theoretical aspects of secrecy extraction from correlated random source have been firstly studied by [
6,
7]. It is shown that correlated observations of random sources could be used to distill secret-keys by discussing over a public channel, while the information rate leaked to the eavesdropper can be arbitrarily low. The supremum of achievable secret-key rate is called secret-key capacity. In recent years, significant interests in developing practical approaches to generate secret-key between multiple users have been attracted [
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32,
33,
34]. It is shown in [
9,
11] that there is a trade-off between the secret-key rate and the public communication rate in the key agreement protocols.
One of the main issues of these schemes is how to find proper random sources for secret-key generation. Such sources should create correlated randomness between the legitimate users, would have high level of randomness, and should be difficult for the eavesdropper to observe.
In this paper we propose a method to create artificial correlated random sources for wireless sensors to generate secret-key in a cooperative TSSN. The random sources are created by multiple nodes in the system, when the cooperative helpers send independent symbols simultaneously, different channel vectors result in different receiving signals, which prevents the eavesdropper from getting a copy of the legitimate users’ signals. Since the random source is artificially generated, even when CSI (Channel State Information) of the wireless channels changes slowly, high secret-key rate can still be achievable.
In the proposed scheme, the helpers have no idea of the receiving signals of the users. When the eavesdropper is equipped with multiple antennas or there are multiple eavesdroppers, it is possible for them to get what the helpers send. However, since the eavesdroppers have no idea about the legitimate nodes’ CSI, they still cannot get what the legitimate users get. It is proven that, from computational complexity security point of view, the proposed scheme is secure with enough helpers even when the antenna number of the eavesdropper is unlimited.
In [
8], the random source is also artificial signals, secret-key is generated by opportunistic transmission over the quasi-static fading channel by sending signals when the channel condition of the legitimate users are better than the eavesdropper’s. However, this approach is based on certain assumptions that are hard to be realized in practice, while the proposal in this paper does not have. In [
27,
33], authors have investigated the impact of cooperative relay nodes on the secret-key generation, these algorithms are based on the wireless channel reciprocity of the users and relays. Since the achievable secret-key rate scales linearly with the number of relays, when there are large number of available relays, better system performance could be achieved. While the proposed scheme in this paper can generate secret-key artificially, our proposal is more suitable for the system with relatively small number of relays. The scheme of [
34] is based on the knowledge of the eavesdropper and the communication capacity of the nodes is unlimited, that is not practical in a real system.
The organization of the paper is as follows.
Section 1 introduces the proposed scheme of the system.
Section 2 presents secret-key rate analysis.
Section 3 studies the system security with different thread models. Numerical result is presented in
Section 4. Finally, we conclude the paper in
Section 5.
2. Proposed Scheme
The system is shown in
Figure 1, which is a wireless sensor network equipped in a rail-borne cargo. There are two types of nodes in the network. One is Cargo Sensor (CS), which is placed on the cargo container. Normally CS is battery-powered and designed to monitor security seal events or the status of the cargo. The other type of node in the network is Monitor Center (MC), which could be installed in the cab of the locomotive. CS reports sensor data to MC or receives commands from MC through secure wireless link, which is encrypted by symmetric cryptographic scheme. In this paper, we propose a novel key agreement or key distribution scheme in TSSN.
The proposed scheme is shown in
Figure 2 and
Figure 3. MC tries to update the secret-key of one of the CSs which is marked as A (the red container in
Figure 3). All the communication parties can communicate with each other through a public channel. Other CSs can help MC and A to achieve the goal, we call them helpers. Note that helpers are not fixed, A could also play as a helper when MC wants to update the key of another CS.
Here we assume the number of the helpers is . A passive eavesdropper is located at somewhere trying to crack the secret-key, he can access the signals from all the communication partners. The system is a narrow band system, the wireless channels are assumed to be block fading with the coherent time . Note that all the nodes in the system are equipped with single antenna.
The railway radio propagation environment is significantly impacted by the railway structures, such as cuttings, viaducts, and tunnels etc., so the wireless channel’s propagating characteristics changes while the train traveling from one site to another. Since the system is a narrow band system, the channel can be modeled as a complex random variable. The complex random channel gain is considered to keep constant within the coherent time . Then the nodes in the system have to measure and renew the channel gain when the transmitting time is longer than .
There are three stages in the proposed scheme.
In Stage 1 (ST1) within , all the helpers synchronously send random symbols to A to create stochastic signals at A. There are two sub-stages in ST1: the first is the backward signal transmission sub-stage (ST1-B) from partner A to the helpers. In this sub-stage, User A sends channel estimation sequence to the helpers, then they can estimate the channel coefficients of the links from A to helpers. Due to the radio propagation reciprocity, the helpers then know the channel gains of the links from themselves to A. On the other hand, the eavesdropper can also get the channel estimation sequence from A, then he can estimate the channel gain between himself and A. Because user A and helpers do not send any messages about the channel information, the eavesdropper cannot get any information about the helpers’ links directly.
The second sub-stage in ST1 is forward signal transmission sub-stage (ST1-F) from the helpers to user A. All the helpers individually send
K random symbols to user A, then the symbols that user A receives are
where
denotes the complex channel gains between the helpers and A,
denotes complex zero-mean Gaussian random symbols sent by the helpers which are independent of each other, and
denotes the receiving noise of user A. Here we assume that
, which is a reasonable assumption for a practical system. Then the transmitting power of the helpers is
. In the proposed scheme, we set all the transmitting power to be the same, that is
.
During Stage 2 (ST2), all the helpers repeat the
K symbols sent in the first stage multiplied with weight factors
. There are also two sub-stages in this stage. The first is the backward sub-stage (ST2-B) for channel estimation. In this sub-stage, MC send channel estimation sequence to the helpers to estimate the complex channel gain. In addition, the eavesdropper also can only get the channel information between MC and himself. Next is the forward sub-stage (ST2-F) for the helpers to repeat the
K symbols, then the symbols that MC receives are
where
denotes the complex channel gains between the helpers and MC, and
is the noise of MC. We also assume that
. To create correlated sources for the partners, we set
where
is power factor to adjust the total transmitting power in the second stage. Note that
is only determined by the helper’s own CSI and a global factor
, no other global information should be shared between the helpers. Then the receiving symbols of MC are
Then , when signal power is high enough, the symbols received by MC and A are almost the same.
Note that if is a small low value, which means the link between MC and helper i is very weak. Then the transmit power of helper i during ST2-F has to be very high, and possibly exceeds the maximum transmit power of the helper. If the helper repeat the random symbols only with maximum power, MC’s receiving signals will not be the same as A’s without considering noises. There are three solutions: One is that MC can use antenna with high antenna gain, for example, using directional antenna, then usually will be lower . The other solution is that helpers can adjust the global power factor to make sure all the helper’s transmit power in ST2 is lower than the upper limit. However, this solution has the risk to leak some information of the legitimate user’s channel information. The last one is that the helper i is mute in ST2-F when the transmit power should be higher than the maximum power of himself. In an additional stage, helper i broadcast his signals sent in ST1, and tells every one that he is mute in ST2. Then A can remove what the helper i sends in his receiving signals with the check-sum information from MC in ST3. This additional stage will not influence the model and analysis in the paper.
In Stage 2, we assume perfect channel estimation of
and
, which is not true in a practical system. When considering channel estimation errors, the receiving signals of MC will be
where
and
are the channel estimation errors, and
It is easy to know that
which means
can be considered as additional Gaussian noise. Then channel estimation errors will result in an additional noise to the receiving signals of MC, and in turn decrease the performance of the system. In this paper, we mainly focus on the upper bound of the system performance, the further discussion of the model in a real system is left for future work.
During the above two stages, the eavesdropper gets the signals as
where
denotes the complex channel gains between the helpers and the eavesdropper,
and
are the noises of the eavesdropper in the two stages respectively.
Note that all the noise terms , , and are zero-mean white independent complex Gaussian random variables with the variance .
After stages of transmission, there should be a stage of hand-shaking, which is Stage 3 (ST3). During ST3, A and MC exchange information to distill a common secret-key. In this paper, we concentrate on the class of key agreement protocols in which only A sends messages to MC. A computes a secret-key and sending message from x, then sends to MC over the public channel, the total length of these sequences is . MC then computes the key from y and . A secret-key rate is achievable if for any and all sufficiently large number , there is a secret-key agreement such that
Then secret-key capacity is defined as the supremum of secret-key rates achievable for the model.
Since the length of is relatively long, the transmission of may be divided into several frames, which depends on the coherent time . Here we do not show any details on ST3, for the information about the link between A and B is public, and does not affect the key generation process in this paper.
In general, a closed-form expression of the secret-key capacity is still an open problem. Nevertheless, in [
6,
7], upper bound and lower bound of the secret-key capacity are shown. Since we focus on the design of the correlated sources for secret-key generation in this paper, a tight upper bound is good enough for demonstrating the effectiveness of the proposal.
4. Threat Model Analysis
The above analysis on secret-key rate is based on the information about the eavesdropper, we can only get a theoretical upper bound of system performance. Actually, in a real system, we have no idea of the potential eavesdropper, so the information-theoretic security is not available. In this section, we discuss the practical challenges on the system without assumptions on the eavesdropper.
4.1. Passive Attacks
Since a passive eavesdropper does not send any signal, it is hard to estimate how much information is leaked to such an attacker. Different stage of the process has different level of the possibility to leak the confidential messages.
ST1-B and ST2-B: During these sub-stages, the partner A and MC send channel probing signals to the helpers, then they can estimate the channel gains of themselves. For a passive eavesdropper, he can also get the channel probing signals, then the channels state information of his own is known by the eavesdropper. However, the legitimate system links is still unknown by the eavesdropper. In a poor multipath scattering environment, the eavesdropper may have a strong correlation in measurements of the wireless channels. However, since there are many helpers in the system, the eavesdropper could not have correlation with all the helpers. So when the helper number is large enough, the leaking information to the eavesdropper is limited.
ST1-F and ST2-F: In these sub-stages, all the helpers send signals simultaneously to partner A. The eavesdropper can also get the signals. However, the two signals are different linear combination of . When the node number N goes to infinite, the eavesdropper’s signals is independent of what node A and MC receive. More node number can help the system to achieve higher security level.
When the eavesdropper is placed very near A or MC, or else the whole system works in a poor multipath scattering environment, the eavesdropper’s channels might have a strong correlation with the legitimate users. In these cases, the phase of the complex channel coefficients is usually still independent of each others, while the modulo of the channel coefficients or the RSS (received signal strength) are correlated to each others. The proposed scheme is even very sensitive of the phases, which is demonstrated in
Figure 4.
In
Figure 4, the worst and best cases are demonstrated. The best situation is that the eavesdropper’s channels are independent of the legitimate users’. The worst case is that the eavesdropper’s channels are highly related with the legitimate users’, even the extreme case is
, that is the eavesdropper’s channels have the same modulo of A’s. We consider the average correlation coefficient of
x and
to show how the helper number improve the system security. The average correlation coefficient is defined as
It is shown in
Figure 4 that even the channels are highly correlated, the signals intercepted by the passive eavesdropper is almost independent of the legitimate receiver’s when the helper number is large enough, which means the eavesdropper almost could not get any useful information from the receiving signals.
ST3: ST3 is usually an error correction process. Due to error of channel measuring and noise, the extracted bits at A and MC sides are usually not identical. During this stage, parity bit information may be transmitted openly to correct errors. The eavesdropper can get the messages in this stage. In order to eliminating the eavesdropper’s partial information about the key, there will be a privacy amplification process [
6]. In the privacy amplification phase, both legitimate parts compress the information to their “real entropy”. However, in a practical system, the information leaked to the eavesdropper is hard to estimated, then the “real entropy” is hard to be decided. A possible solution is to get average or maximum leaking information by experiments.
Multi-Antenna attack: The most threatening passive attack is multi-antenna attack, that is, the eavesdropper is equipped with multiple antennas to intercept the signals. In this case, the eavesdropper performs like a MIMO (Multiple-In-Multiple-Out) system . In an ideal situation, when the antenna number of the eavesdropper is unlimited, the eavesdropper could know exactly the signals from every transmitting antenna. Then the receiving signals of the eavesdropper can be considered as . It is easy to know that , then the secret-key rate of the system is down to zero, that is, the system cannot get information theory secrecy any more. However we will prove in a following subsection that the system is still secure with enough number of helpers.
4.2. Active Attacks
In [
36], the author classifies the now existing active attacks into three types: disruptive jamming attack, manipulative jamming and channel manipulation attack.
Disruptive jamming: The purpose of disruptive jamming attacks is to minimize the key generation rate between legitimate users. The jamming signals can be injects in every stage of the proposed scheme. Most harmful behavior is to disrupt the channel probing process, without accurate channel estimation, the secret-key rate of the system will be dramatically deduced. A possible solution of this issue is proposed in [
36], random probing signals is used to hide the channel state information, which is also suitable of the scheme of this paper.
Manipulative jamming: In [
37], a manipulative attack is proposed (Man-In-The-Middle Attack) to control the channel measurements at legitimate users. In our proposal, it does not work, any misleading of the channel measurements will cause failure on the key generation. Manipulative jamming on channel probing process will deduce the secret-key rate instead of compromising the generated key.
A possible way of manipulative jamming attack is that the attacker can transmit signals with high enough power in substage ST1-F and ST2-F, then the receiving signals of A and MC are mainly controlled by the attacker, thus the generated key is compromised by the malicious third-party. Since the attacker acts just like a normal helper, this type of attack is hard to be defended against. One possible solution to address the issue is power detecting: After generating a new key, all the helpers can report through the open channel about the average power during the period they transmitted to MC (This action will leak part of the channel information to the eavesdropper, which should be considered in ST3). Then MC can compare the average power that he has received and the messages helpers reports. If there be an attacker, the receiving signal power will be higher than the sum of all the reported power, which means the generated key is possibly manipulated. The more attacker controls, the more easy he would be detected.
Channel manipulation attack: Because the key is not generated from the channel information, channel manipulation attack can not influence the key generation process of this paper. On the other hand, since the system is equipped in a train which travels through a long distance, channel manipulation is almost impossible for any potential attacker.
4.3. Multi-Antenna Attack
From the scheme, we know that the eavesdropper cannot get any information of the channel gain from his receiving signals and . When information theory secrecy is not achievable, the unknown of can still help the system to achieve computationally secure secrecy.
The term of secret-key rate is based on information theory secrecy, or unconditionally secure secrecy. When the eavesdropper is equipped with multiple antennas, or else there are multiple cooperative eavesdroppers, the performance of the system will be lower. If number of the antennas or the eavesdroppers is infinite, the eavesdropper could possible almost know what the helpers send. In this case the system cannot achieve unconditional secrecy any more. However, the system is still computationally secure, which means cracking the secret-key is equivalent to the solution of some problem known to be laborious.
The analysis of this case is valuable, because in a practical system, we cannot limit the number of the eavesdropper’s antenna. The proposed scheme is computationally secure when there are infinite eavesdroppers. The reason is that in our system what the helpers send is different from what the users receive, multiple antennas can help the eavesdropper to get the sending signals but not the receiving signals. Without knowledge of the legitimate user’s channels, the eavesdropper still cannot crack the secret-key.
We assume that when there are infinite eavesdroppers, they could know exactly what the helpers send during the first two stages in an ideal situation. The eavesdroppers know the symbols and , i =1, ..., N, but they have no idea about . The method for the eavesdroppers to crack the secret-key is just guess. Then the problem is what is the probability for the eavesdropper to crack the key for one trial.
Since the receiving signals of the two legitimate users and the eavesdropper are all Gaussian signals, we consider
as the effective signals, then user A, B and the eavesdropper all get a noisy version of
s. We re-model the signals at the legitimate users as
When the eavesdropper tries to estimate
with
, where
denotes random selected complex vector as the estimation of the CSI, then the estimated signals can be written as
where
denotes the equivalent noise of the estimation and is independent of
s, then we have
The SNRs of the legitimate users and the eavesdropper are
If we consider the trial of the eavesdropper as an observation of the random source s, we can compute a secret-key rate of the system, , where . When the legitimate users want to generate secret-key with rate , any secret-key generation process of our model with lower than could be secure. In addition, if the secret-key generating rate is higher than , the system is no longer secure, or else, we can say in this case the eavesdropper can crack the key. Then we have the following theorem:
Theorem 2. If the system channel gain is statistically independent complex Gaussian random variables with the same variance ,
is the channel estimation vector, the legitimate users tend to generate secret-key with rate , the average probability of one trial for the eavesdroppers to crack the key is Theorem 2 means when we want to achieve secret-key rate of , if the SNRs at the legitimate users, or , are large enough, and the number of the helpers is also large enough, the probability for the eavesdropper to crack the key can be arbitrary small.
For example, when and are both 20 dB and we want to achieve the secret-key rate of 1 nats/symbol, and there are 20 helpers. Then the probability is about , this means if the eavesdropper wants to ensure probability to get the proper secret-key, he have to do about independent trials, which is almost impossible to be done.
Note that Theorem 2 does not limit the probability density function of , which means the eavesdropper can guess the channel gains in any way as he will, and the this will not affect the average probability for him to get the secret-key.
5. Numerical Results
In this section, we demonstrate the performance of the proposed schemes numerically. We perform the simulations with three types of configure: fixed channels, Rayleigh fading channels and line-of-sight (LOS) channels. In this section, all the notion of secret-key rate is actually the upper bound of the secret-key capacity.
We randomly generate some channel gains shown in
Table 1, the noise power
is 0 dBm. We compare the
and
in
Figure 5 . We can see that
and
are very close to each other, especially when the transmitting power is high, they are almost the same.
Figure 5 shows that
is a tight upper bound of the secret-key capacity, and verifies the result of Theorem 1.
In
Figure 6 we consider wireless communication system in fading environment. The channels of the users and the eavesdropper are all Rayleigh fading channels which are independent of each other, and follow the unit variance zero mean complex Gaussian distribution. The noise power
is also set as 0 dBm. We compare the results with different numbers of the helpers and different algorithms, 1000 times of experiments are performed to get average secret-key rate, which are shown in
Figure 6. The secret-key rates increase linearly with the total transmitting power, as shown in (
31), and more helpers result in better performance. Even when we have no idea about the eavesdropper, the performance of the algorithm is fairly good.
Figure 6 also shows the performance of a simple key generation scheme of channel model (Section 4.1 in [
35]). In this model, the common randomness is from user A, that is, user A sends random signals to MC, and then the two users try to generate a secret-key from these random signals by public discussion. This is a typical channel model for secret-key generation. We implement the same transmitting power configure as the proposed schemes. It is shown in
Figure 6 that the performance of channel model saturates with the increasing of the transmitting power, the secret-key rate is much lower than the proposed schemes in high SNR region.
Secret-key generation process is not a common communication process, whose transmitting rate increases with the transmitting power. Consider the lower bound of secret-key capacity, . The first part of is the mutual information of x and y, which increases with the transmitting power. In addition, the second part of will also increase with the transmitting power. This means when antenna power increases, the mutual information of x and y is larger, and the leaked information to the eavesdropper is also larger. Then if the correlated random source observed by the two legitimate parties is some types of radio signals, higher sending power would not benefit the system performance much. However, the proposal in this paper performs just like a common communication system. The reason is that the leaked information to the eavesdropper is almost fixed, which is mainly determined by the correlation coefficient of x, y and . The eavesdropper does not get more information when the signal power is higher, that is, does not increase with the signal power.
Figure 7 illustrates the secret-key rate of the system versus helper number with the same configure as in
Figure 6. It is shown that more helpers result in better performance in average, but the secret-key rate increases more slowly when the helper number becomes large. The reason is when the number of helpers is large enough, the receiving signals of the eavesdropper are almost independent of the legitimate users’ (as shown in
Figure 4), the secret-key rate is almost saturated to its upper bound
. Then more helpers benefit a little to the system performance when the helper number is large. In a practical system, more helpers cause higher system complexity, then there would be a trade-off between system complexity and secret-key rate.
Figure 8 compares the proposed schemes and the channel model key generation scheme in LOS channel. In the experiment, the eavesdropper moves along the horizonal line between A and MC. Channels between any two nodes are modeled by a simple line-of-sight channel model including the path loss effect and a random phase:
where
d is the distance between any two nodes,
is the path loss exponent,
is the random phase uniformly distributed within
. There are four helpers placed along the horizonal line, with equal spacing of 20 m. The noise power is
= −60 dBm. It is shown that the secret-key rate of channel model is almost zero when the eavesdropper is placed close to user A, while the proposed schemes model can still achieve fairly good performance near the two users. In this case, the channels of the system are highly correlated to each other, most of the physical layer secret-key generation schemes could not achieve fairly good performance except for the algorithm in this paper. This is ascribed to the sensitivity of the proposal.
We do not compare the proposed schemes with other CSI based key generation algorithms of [
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32,
33] in numerical simulations, because the system configures are different. All the CSI based algorithms depend on the coherence time and bandwidth of the wireless channel models, while the performance our scheme mainly depends on the symbol rate of the helpers. Typically, coherent time of a wireless communication system would be longer than 10 milliseconds, then for a narrow band system, there will be up to several thousands of bits secret-key generated by the system per second in high SNR region with small number of relays. In addition, for the scheme in this paper, hundreds of thousands of bits secret-key could be generated per second with only 10 kHz bandwidth in high SNR region.
6. Conclusions
In this paper, we have investigated the design of the correlated Gaussian sources with multiple cooperative helpers for physical layer secret-key generation for TSSN. The proposed scheme can help to update the secret-key of the wireless sensors in the system dynamically and securely.
In traditional distributed physical layer security communication systems, cooperative helpers are used as relays or interference sources. The basic idea of these algorithms is beam-forming. The main difference between the proposed schemes and the traditional algorithms is that the helpers send independent random signals in our schemes. What the helpers send is different from what the users receive, even the helpers themselves have no idea of the receiving signals of the users. This helps to create spatial differences between the legitimate users and the eavesdropper.
The proposed scheme provides an artificial random source for secret-key generation, then it is possible to get high secret-key rate by increasing the symbol rate of the helpers. Traditional CSI based secret-key generation schemes can only achieve up to several hundreds of bits secret-key per second for narrow band system, and highly depend on the coherence time of the channels. The proposed scheme, by contrast, can achieve hundreds of thousands of bits per second for narrow band system, and could possibly generate several mega bits secret-key per second for wideband system.
Note that when there are too many helpers, the synchronization is difficult for a practical system. The estimation of the channel CSI could not be accurate, the estimation errors accumulate with the increasing of the helper number. These facts will destroy the correlationship of the legitimate users, and suffer the performance of the system. How to find a balance or an optimal solution for a practical system with these issues is left for future work.