1. Introduction
Network time synchronization refers to the process of synchronizing time across nodes in a network using network protocols. For surface time synchronization, it usually combines the standard time reference provided by a GNSS and propagates time information through the network. Network time synchronization is widely used in data centers, distributed systems, communication networks, and other scenarios requiring time synchronization. It focuses on achieving time synchronization through network protocols in distributed computing environments and involves a clock hierarchical architecture and adaptation to different network topologies.
The Network Time Protocol (NTP) is the most widely used network time protocol. It uses a hierarchical structure to organize time servers. Research in recent years has focused mainly on improving NTP security and resilience to interference, such as the work of Burbank et al. [
1]. The Precision Time Protocol (PTP) is a typical centralized network protocol in which one device in the network is selected as the master clock and other devices act as slave clocks, obtaining their time from the master clock. Researchers have been working to improve the accuracy of PTP synchronization using hardware timestamps and clock filtering algorithms [
2].
Network time synchronization methods are also widely applied in cloud computing and distributed databases. Research has focused on achieving reliable time synchronization in high-latency and unstable network environments [
3]. It includes FTSP [
4], RBS [
5], and DMTS [
6] for wireless sensor networks. The emergence of Time-Sensitive Networking (TSN) [
7,
8,
9] further demonstrates that the future direction of sensor network data lies in reliable and real-time communication, emphasizing efficient and reliable data transmission. A key feature of TSN is the design of redundant paths, which ensures that network traffic can continue smoothly even if one path fails, a capability that is often challenging for most centralized networks to achieve.
The network structures of these protocols can be referenced for underwater time synchronization, but the synchronization process is not applicable due to the large propagation delay underwater. Compared with surface applications, underwater sensor nodes are usually powered by batteries, and the limited power supply severely restricts the computational and communication capabilities of the sensor nodes. Therefore, the structure of underwater networks is relatively simple and can generally be divided into three types based on network methods: centralized, distributed, and multi-hop, as shown in
Figure 1.
In 2003, Ganeriwal et al. [
10] proposed the Time Synchronization Protocol for Sensor Networks (TPSN) based on centralized network structures. Similarly to the NTP protocol, it classifies network nodes hierarchically and assigns each node a level number. The root node, which serves as the clock source for the entire network, is designated level 0, while other nodes synchronize with a node from the previous level, ultimately synchronizing all nodes with the root node. However, this method cannot prevent network paralysis caused by root node failure and is not suitable for networks with highly mobile nodes.
In 2017, Wang Shuo et al. [
11] proposed an energy-optimal clustering method for underwater sensor networks, dividing time synchronization into inter- and intra-cluster synchronization [
12] and calculating the optimal number of clusters by solving mathematical expectations. This method is not suitable for highly mobile networks, as significant changes in network topology require recalculation of the optimal number of clusters and re-clustering.
In 2020, Kong Weiquan et al. [
13] proposed a dual-cluster head-time synchronization algorithm for underwater sensor networks based on clustering. By selecting two nodes as primary and secondary cluster heads, it improved the network robustness and improved the accuracy and efficiency of the time synchronization. It also considered the impact of node mobility on the synchronization accuracy, reducing estimation errors to some extent by introducing advanced node mobility models. However, given the complexity of the mobility of the underwater network, this method is difficult to implement.
For time synchronization in mobile networks, the accuracy of synchronization can be improved by providing the assistance of dynamic nodes [
14,
15,
16] or by employing more accurate models of network node mobility [
17,
18,
19,
20]. However, despite the benefits of clustering nodes in reducing the messaging overhead for time synchronization in underwater sensor networks and improving the overall network synchronization accuracy, there are still deficiencies when it comes to dealing with mobile nodes. Furthermore, in small areas, the efficiency gains from centralized networking synchronization are not significant.
Multi-hop networking offers higher robustness compared with clustering models. For example, Sun C et al. [
21] proposed Multi-Hop Time Synchronization for Underwater Acoustic Networks (MSUAN) in 2012, emphasizing the reduction in synchronization errors through multilevel timestamp exchanges. Wen J et al. [
22] proposed an Improved Multi-Hop Time Synchronization for Underwater Acoustic Networks (IMSUAN) in 2013, improving synchronization accuracy by establishing a time synchronization tree, optimizing the node-listening mechanisms, and using global bias thresholds to filter out inaccurate timestamps. In 2021, Shams R et al. [
23] proposed an algorithm that simultaneously performs node localization and time synchronization in multi-hop underwater sensor networks using a single anchor node. The algorithm employs gradient techniques to solve unconstrained optimization problems and optimizes node localization and time synchronization through calculations of angle information and propagation delays.
However, both clustering models and multi-hop network models are more suitable for large-scale network nodes, i.e., when node communication distances cannot cover the entire network. In small-scale networks (where node communication distances can cover the network), both models are not different from centralized networks and are limited by central nodes. Moreover, the actual synchronization algorithms do not differ from the time synchronization algorithms between two points (such as DE-Sync [
24]) and do not fully utilize the network information. In most cases, underwater sensor nodes, whether mobile submersibles or stationary underwater beacons, have some ability to acquire location information. Integrating location information with sound velocity information can greatly simplify the time synchronization process. Higher location information accuracy can lead to higher time synchronization accuracy. However, the computational complexity brought about by multi-information fusion and the communication pressure from distributed network structures increases the energy consumption burden of underwater networks. How to achieve rapid time synchronization in underwater networks while maintaining a certain level of energy consumption was the research objective of this study.
To address these issues, this paper proposes a time synchronization method based on a probability graph model for underwater networks. This method solves the marginal probability density function of the system clock offset parameters and performs binary simplification to quickly calculate the clock offset parameters of various sensors within the local network, achieving dynamic time synchronization in distributed networks. Compared with conventional network time synchronization methods, the probability-graph-model-based time synchronization method employs distributed networking, reducing the dependence on central nodes and improving network robustness. This method improves multi-user protocols, integrates location and sound velocity information, significantly enhances the synchronization efficiency, and reduces the computational requirements for nodes through binary simplification of the algorithm.
This paper briefly introduces the application scenarios and the necessary measurement parameters of the proposed method in
Section 2.
Section 3 provides a detailed overview of the topologies and limitations of distributed networks suitable for probabilistic graphs, the advantages of the message-passing scheme compared with other underwater network time synchronization methods, the mathematical and physical models when the probabilistic graph method processes single-cycle and multicycle data, the binary simplification method to reduce the model complexity and improve the computational efficiency, the impacts of different parameter errors on the clock offset estimation, and the pseudocode for the calculations of the single-cycle and multi-cycle single-node models.
Section 4 presents real field experiment validations that compared the improvements in multicycle data against single-cycle data, as well as performance comparisons with traditional underwater time synchronization methods under multi-cycle data conditions.
3. Principles of Underwater Network Time Synchronization Methods
To simplify the existing network time synchronization protocol and increase synchronization efficiency, additional information is clearly needed as a support. Information fusion would increase the computational burden on nodes while reducing the central node’s constraints on the network. Adopting a distributed network would require further network communication resources. This paper attempts to utilize factor graph models to simplify the entire network time synchronization process, thereby reducing the communication and computational demands of the network synchronization.
3.1. Network Topology
The probabilistic graph method adopts a distributed hierarchical network approach. Taking the underwater network in
Figure 3 as an example (it is generally assumed that the communication range of the cluster head nodes is greater than that of the ordinary nodes), the network can be divided into six networks based on the communication ranges of the nodes. Each network forms its own distributed network, while the five cluster head nodes form a larger distributed network. In this case, each node is not a central node for its local network, and the cluster head nodes only serve as time reference standards for the local network rather than initiators of all synchronization activities. When a cluster head node fails, the distributed network can use the time of any node within the network as a reference, achieving unification of the local network.
The scope of hierarchical distributed networking is limited by the communication range of the cluster head nodes. For larger-scale networking, the communication distance of the cluster head nodes cannot cover the entire underwater acoustic network, which requires multi-hop-assisted networking. As shown in
Figure 4, distributed networking is used within clusters, while multi-hop communication is employed between clusters.
3.2. Message-Passing Scheme
Cluster-based networking and multi-hop networking can significantly improve the synchronization efficiency in large-scale networks. However, these improvements are primarily based on optimizations in a topological structure and offer no enhancement in local areas (within the communication coverage of a single node) compared with inter-platform time synchronization methods.
The intra-cluster information transmission scheme (multiuser protocol) is shown in
Figure 5. It is evident that existing network time synchronization methods, while needing to consider information differentiation between multiple users, are fundamentally not different from one-way or two-way time synchronization methods. They do not fully utilize the additional information available in sensor networks, thus missing the opportunity to simplify the time synchronization process in underwater sensor networks.
The probability graph method for time synchronization, as illustrated in
Figure 6, differs from methods previously used that require multiple interaction cycles to complete a single time synchronization. In contrast, the probability graph method can complete a time synchronization in each interaction cycle (
). In addition, multiple sets of interactions can further improve the accuracy of the synchronization.
3.3. Factor Graph Models
Underwater network time synchronization methods based on probabilistic graphical models are generally applied to underwater sensor networks (clusters, reference networks, etc.) that require high refresh rates. In this scenario, the reference points have certain initial position values and the unification of the time reference can be achieved through relative ranging. The mathematical model for the clock source of this method is
where
T is the local time axis,
t is the standard reference time axis,
a is the clock drift (clock frequency error), and
is the clock offset (clock phase error). Due to the ability to achieve a real-time reference alignment of time, in most cases, the model does not require a correction for clock drift
a.
The fusion parameters include reference position information, reference clock offset, relative ranging information, time delay difference measurement information, and relative ranging signal reception moments. At this point, the global function is represented as
In Equation (
2),
represents the set of clock offsets for each reference beacon,
T denotes the set of all reception times of the measurement signal,
represents the set of measurements of the time delay difference,
D is the set of range measurements, and
P is the set of underwater reference positions.
3.3.1. Single-Group Interaction Time Synchronization Model
Taking the range between three beacons A, B, and C (with A as the reference beacon,
as an example, the single-cycle factor graph model as shown in
Figure 7:
represents the reference for the signal reception time, time delay, and reference clock offset. represents the functional relationship between the information from the range and the time delay difference. represents the functional relationship between the information from the range and the reference position.
Let
and the expression of the edge function is
The relationship between the clock offset
, reception time
T, and measured delay
is
represents the measurement error of time, which follows
. The function
can be expressed as
The relationship between the range information
D and the time delay
is
represents the measurement error of the sound velocity; it follows that
. Therefore, the function
can be expressed as [
25]
The relationship between the reference position information
P and the range information
D is
represents the positional error, which can represent the distributions
and
. Then,
and
.
and
denote the horizontal and vertical coordinates of the reference points
i, respectively, while
and
represent the horizontal and vertical components of the distance
, respectively.
Let
and
; then
u follows a non-central chi-square distribution with two degrees of freedom [
26]:
, and the function
can be expressed as
By analogy with the derivation of the reference B, the marginal probability density function for the clock offset of reference C can be obtained similarly.
3.3.2. Comprehensive Time Synchronization Model for Multiple Group Interactions
Compared with single-cycle interactions, a factor graph model that integrates information from multiple interactions can achieve a higher accuracy of clock offset estimation. Taking multiple interactions between reference A and mobile platform B as an example, the single-node time synchronization factor graph model is shown in
Figure 8.
expresses the functional relationship between the clock offset in the adjacent time interval. The multi-cycle interaction model, as shown in
Figure 9, is a combination of the single-cycle model and the single-node model.
Let
n is the maximum number of interactions. Taking ‘
’ as an example, the expression for its marginal function is
The relationship between the clock offset at time
i (denoted as
) and the clock offset at time
j (denoted as
is
Here,
represents the clock drift and
represents the jitter error in the clock phase, which follows
.
3.4. Binary Simplification
Under normal circumstances, the position, distance, delay, clock offset , and other information about the target to be estimated are distributed continuously, and the correlation functions between variables are also distributed continuously. Directly estimating the clock offset by calculating the expectation of the continuous distribution function has a high computational complexity. Therefore, the sampling concept is introduced to discretize the continuous distribution function. The most probable result of the clock offset distribution is divided into grids, with each grid representing an estimated clock offset value. The final estimated result of the clock offset is obtained by the weighted averaging of the estimated values for each grid. The main steps for calculating the weights are as follows. If the measured data set is known, fixed-interval scattered points can be used to represent the grid area . The weight of this grid area is represented by the value of the probability density function at the position . The calculation of the factor graph mainly focuses on the transmission and updating of messages. To reduce the complexity, we adopted a binary mode for the message transmission. The binary mode transforms the sum operations in the message transmission process into logical OR operations and product operations into logical AND operations, greatly reducing the computational load.
Based on the binary mode, the probability density function of the distance information can be simplified to
The probability density function of the delay information can be simplified to
Under a single-group interaction, the probability density function of the node clock offset can be simplified to
The probability density function of the clock offset between nodes under multiple interactions can be separately simplified to
, and
represent the standard deviations of the respective probability density functions. The binary representation of the message transmission is
3.5. Error Analysis
3.5.1. Impact of Time Measurement Error on Synchronization Accuracy
From Equations (
5) and (
6), it can be seen that under the condition that the time measurement values follow a normal distribution
, the synchronization accuracy also follows a normal distribution
. If a standard deviation is used as the accuracy of the estimation, then
.
3.5.2. Impact of Sound Speed Measurement Error on Synchronization Accuracy
The sound speed measurement error is introduced in Equation (
7). Similarly, using one standard deviation as the accuracy of the measurement of the sound speed
, we have
,
, given that
Substituting into Equation (
5) yields
3.5.3. Impact of Position Measurement Error on Synchronization Accuracy
From Equation (
10), it can be seen that
u follows a noncentral chi-square distribution with two degrees of freedom; thus,
Let
represent the accuracy of position measurement
. The error of
is a standard deviation. We have
Given
, we can express
Substituting into Equation (
5) gives
3.5.4. Comprehensive Impact
When considering the input parameter errors
, and
simultaneously, substituting
and
into
yields
Now, substituting
and
into Equation (
5) gives
It can be seen that the main source of error in the estimation of the clock deviation arises from the time measurement error.
3.5.5. Impact of Resolution on Synchronization Precision
The resolution mentioned in this article refers to the minimum unit of clock deviation after binary simplification, which can be expressed as
When , the simplified deviation estimation error is greater than ; when , the simplified deviation estimation error is greater than or equal to the clock deviation estimation error . Therefore, the resolution should ideally be less than the accuracy of the time measurement.
3.6. Algorithm Process
This section focuses on the time synchronization scenario between underwater benchmarks. After establishing a factor graph model, the sum–product algorithm was employed to calculate the marginal probability function. Finally, to simplify the computational process, discretization and binary representation were adopted for the calculations.
The flow of the simplified algorithm is given in Algorithm 1.
Algorithm 1: Single-cycle probability graph time synchronization algorithm. |
Procedure-estimated clock offset ( |
Reference node A and n nodes to be synchronized |
Sound velocity and reciprocal range observations |
Position, speed of sound, time-instant-observed value accuracy |
) |
Select any two points |
Calculate the clock offset of node i at , path 1.1 |
|
Similarly |
4 For all , there exists a that satisfies (16) |
5 For all , there exists a that satisfies (17) |
6 For all , there exists a that satisfies (18) |
Calculating the clock offset at node id when , path 1.2 |
Same as steps |
Calculate the clock offset of node i in set , path 1.3 |
For |
Same as steps |
Calculate the final clock offset of node i |
|
Output |
Repeating for
N cycles, the clock offset
can be obtained for a single node, and by combining the single-node factor graph model, the estimated value of the comprehensive multicycle clock offset can be obtained by Algorithm 2.
Algorithm 2: Comprehensive multi-cycle probability graph time synchronization algorithm. |
Procedure Estimated clock offset ( |
Multiple clock offset measurements for node i |
//Single-cycle clock offset estimation accuracy |
) |
Select the clock offset at time j for node i |
Calculate the clock offset at time j for node i, following path 2.1 to end |
|
3 For all , there exists that satisfies 20 |
|
Calculate the final clock offset at time j |
|
Output |