1. Introduction
With the continuous evolution of network attack methods, the importance of information security is becoming increasingly prominent [
1,
2]. As an important part of the national economy, the information security of ports is crucial. With the development of port intelligence, data link security becomes particularly important. Currently, ports are facing increasingly sophisticated cyber-attacks that may pose a serious threat to port operations and national security [
3,
4,
5,
6]. For example, ports in various countries have suffered from frequent hacking attacks in the past few years, which have caused severe economic losses and operational disruptions. To cope with this challenge, ports need to adopt effective cyber-security protection measures. However, traditional cyber-defense techniques are lacking, and port systems are facing stronger cyber-attack threats. In the field of network security, vulnerability detection technology plays a crucial role. However, relying solely on these technologies is not sufficient to fully guarantee information security. Therefore, it is necessary to conduct a comprehensive security assessment of the port network system and strengthen the security monitoring and detection capabilities of data links. At the same time, it is necessary to enhance the identification and response capabilities of network attacks in order to promptly respond to potential network threats and ensure the stability and security of port network systems. This multi-level protection strategy will more effectively maintain the overall security situation of the port network.
Complex networks exhibit complex structural relationships between nodes and links, effectively representing a large amount of unstructured data in real life. Link prediction, as a key issue in complex network research, utilizes observable network structures to explore the basic principles of network formation, predict possible future links, or identify missing links in the network. This not only helps to understand the evolution and stability of networks deeply but also provides important ideas for solving practical problems in reality. Port systems can be described as complex networks [
7,
8,
9], where nodes represent entities and interactions between entities can be described as edges or links. Link prediction in networks refers to how to predict the possibility of generating a connection between two nodes in a network that have not yet generated a connected edge by using known information about the network structure and other information, which helps to excavate the network structure and evolution. With the wide application of complex networks in modern society, link prediction has become an important network analysis technique with far-reaching practical applications. Intending to make the link prediction model better adapted to different types of complex networks and improve the prediction ability of potential connections in the network, the study of link prediction with the help of graph neural networks combined with higher-order topology information provides a more in-depth and effective method for understanding and applying complex networks.
Ensuring accurate acquisition of node information in link prediction is a challenging task, especially when relying solely on the structural information of the network. Taking the port system network applied in this article as an example, considering the possible privacy protection measures, direct access to certain key information is restricted, which increases the complexity and challenge of information acquisition. Traditional hypergraph models often overly rely on node attribute information and overlook the importance of network topology. Therefore, it is urgent to further explore how to effectively integrate network structures and attribute information to improve the accuracy and robustness of link prediction models.
Although the use of complex networks in link prediction can better apply technology to real life, and although the study of high-order topologies can improve the accuracy and robustness of link prediction, there are still some problems: 1. In some current practical applications, the relationship between nodes can be extremely complex, and the edges of the general graph can only be connected to two nodes. It is difficult to accurately represent many-to-many, many-to-one and other complex relationships; thus, the general graph can not meet the needs; 2. Traditional feature extraction methods only focus on the local neighborhood information of nodes, and they cannot effectively capture the information in the global graph structure, and for large-scale graphs, the traditional feature extraction methods increase the computational and storage costs and reduce the generalization ability of the model intending to extract high-dimensional feature vectors; 3. In some cases, link prediction must rely on network structure information. Therefore, it is important to obtain accurate node attributes. However, in a real network, the information of many users is private, and it is difficult to obtain node information.
Given the above problems, the purpose of the method in this study is to extract and apply the high-order information existing in the graph structure, and this paper starts from the initial input data of the experiment. Firstly, the HWA algorithm is proposed to construct the hypergraph. This involves constructing a hypergraph through a general graph with basic information about the network structure. Secondly, based on the GCN model proposed in this paper, the feature vectors of nodes are extracted according to the input hypergraph correlation matrix. At the same time, residuals are added to solve the problem of gradient disappearance due to the increase in the graph data. Finally, MLP is used for link prediction, and the model is evaluated by AUC and F1 values to comprehensively evaluate the performance and effects of the model and to verify the effectiveness and reliability of the proposed method.
The sections of this paper are organized as follows.
Section 2 introduces an overview of the foundational ideas that support this article.
Section 3 reviews in detail the related work on graph convolutional neural networks and multilayer perceptrons.
Section 4 describes the HWA data modeling method and the node feature extraction method proposed in this paper.
Section 5 focuses on the experimental steps and experimental results. The conclusions of this paper are presented in
Section 6.
2. Related Work
In response to increasingly complex social and information networks, researchers have proposed numerous link prediction methods in order to comprehensively understand and utilize these intricate networks. Qiao Lilin et al. [
10] proposed a key node recognition algorithm, KNIA-MWF, based on multi-attribute weighted fusion. This algorithm combines the objective entropy weight method (EWM) and subjective analytic hierarchy process (AHP) to quantitatively analyze the impact of different attributes on node importance and to comprehensively evaluate the structural importance of nodes in the network through the final topological importance (FTI). This provided important ideas for link prediction in subsequent research. Kuanyang Li et al. [
11] proposed an integrated model-based link prediction algorithm (EMLP) based on the idea of the integrated model, combined with logistic regression and superposition techniques, to improve the stability and accuracy of link prediction, and they experimentally proved that the link prediction of this method has better stability and accuracy. Jing Guo [
12] proposed a single-mechanism link prediction algorithm based on node mechanism similarity, a hybrid link prediction algorithm based on an information entropy improved principal component analysis model and a hybrid link prediction algorithm based on ensemble learning. The reliability and stability of the prediction accuracy of the proposed three algorithms in the network were obtained by experimental tests. Sai Munikoti et al. [
13] proposed a framework based on extensible and universal graph neural networks. The model learns the critical scores of nodes/links on a small representative subset of nodes/links and predicts large graphs by small graphs. The critical scores of nodes/links in large graphs are used to prove the scalability and accuracy of the framework. Zhiwei Zhang et al. [
14] proposed an edge-convolution-based graph neural network called EdgeConvNorm for link prediction. The Hadamard product of the final learned link representation is used to determine the link between two nodes via a binary classifier sigmoid. Intending to solve the over-smoothing problem of the model, a normalization strategy for the link representation is proposed. Yuxuan Xiu et al. [
15] proposed an extended network self-representation (ENSR) model based on the absence of predefined assumptions about the mechanism of network link formation, which produces an optimization model of the optimal ENSR function by minimizing the Frobenius paradigm between the network adjacency matrix and its ENSR transform. Moreover, the ENSR model performs well in link prediction and can adaptively discover potential link formation mechanisms in complex networks. Yanan Wang [
16] proposed a link prediction model called DME based on neighbor contribution and path propagation for the intra-layer link prediction of single-layer networks. Aiming at the inter-layer link prediction of a two-layer network, a link prediction model called SWLEV based on the weighted embedding of edges in the network layer is proposed. By combining edge betweenness and degree centrality, an edge-weighted model is proposed, and an appropriate embedding method is added. The network structure is captured by combining the first-order proximity of the edge strength and the second-order proximity of the number of common neighbors to reduce the dimension of the network. These approaches demonstrate rich exploration and innovation in link prediction for complex networks. From integrated models to graph neural networks, as well as models based on mechanisms without assumptions, all of them meet the experimental expectations. In intricate networks, the in-depth understanding of higher-order structural information is of great significance in solving practical problems.
With the wide application of complex networks, link prediction has become an important task. Graph neural networks have been adopted by many scholars for link prediction due to their advantages of effectively capturing topology information, fusing node attributes, end-to-end learning, applicability to networks of different sizes and effective modeling of contextual information. However, most of the current link prediction methods mainly focus on low-order networks, ignoring the expression of complete semantic information between multiple entities, and, thus, there is a need to identify high-order topological structure information to more adequately capture the topological nature and complex associations of relational data. Tianyu Cheng [
17] proposed a higher-order link prediction algorithm based on higher-order structural information around nodes, which uses biased random wandering to abstract the higher-order structural information around nodes and maps the wandering sequences of the nodes into low-dimensional feature vectors using a representation learning approach to capture the higher-order structural features around nodes. To cope with the effect of network disconnection, an algorithm based on edge relation matrix decomposition is proposed to combine non-negative matrix decomposition with higher-order link prediction to capture the similarity between nodes and edges in order to synthesize the likelihood of generating higher-order structures between nodes and edges in the network. Tao Yi et al. [
18] proposed an efficient primitive neighbor matrix learning algorithm for extracting higher-order structures in directed networks and constructing multiple primitive neighbor matrices, as well as to study a new encoder scheme consisting of three modules: node embedding, information fusion and link prediction. Ru Huang et al. [
19] proposed a link prediction model based on the Weisfeiler–Lehman simplex neural network (WL-SNN), which represents higher-order topological information by simplex features and uses the WL algorithm to construct the local neighborhood to extract higher-order information, in which a third-order Laplace operator is set to capture the simplex features. In addition, a two-channel neural network structure consisting of a simplex neural network and graph convolutional neural network is constructed. Yifan Lu et al. [
20] proposed a heterogeneous hypergraph neural network (NOH) for sensing neighborhood overlap, which learns higher-order semantics while maintaining the original topology. It adaptively combines a heterogeneous hypergraph variational self-encoder with a neighborhood-overlapping perceptual graph neural network to take into account the heterogeneity of the graphs and learn the structural information of the graphs and estimate the overlapping neighborhoods for link prediction. Wenqiang Liu et al. [
21] proposed a knowledge graph embedding method called KANE, inspired by graph convolutional networks, to express high-order structural relationships between entities. Yuyuan Ren [
22] designed a prediction method with strong universality by making full use of the edge connection principle of a heterogeneous, attribute and time-series supernetwork. Aiming at the agglomeration characteristics of different links in heterogeneous supernetworks, a supernetwork link prediction model based on a hypergraph clustering parser is proposed. Kang Fu et al. [
23] proposed two methods for link prediction, H-CCNC and FH-CCNC. These two methods combine node centrality and high-order clustering coefficients to predict missing links in the network from both global and local perspectives, demonstrating the potential to significantly improve prediction accuracy on multiple real datasets. These proposed high-order link prediction algorithms fully consider the high-order structural information around the nodes in the network and use different methods to capture and utilize this information to improve the accuracy and robustness of link prediction.
4. Research Methodology
In this paper, for the security problems of port information dissemination, a research method named port information security link prediction based on the HWA algorithm is proposed, and the overall flowchart of the method is shown in
Figure 2. It is generally divided into three parts. Firstly, the data construction part involves constructing the basic relationships of the ports. By understanding the basic relationships between various entities in the ports, a general graph adjacency matrix is formed. Then, using the hypergraph-based self-attention forest prediction (HWA) algorithm proposed in this paper, the adjacency matrix of the general graph is constructed as a hypergraph association matrix. Next is the data preprocessing part, which uses the hypergraph adjacency correlation matrix as the input and the GCN as the model for training, continuously optimizing the loss function to obtain node feature vectors. Finally, there is the model evaluation section; taking node feature vectors as inputs, the MLP learns through node feature vectors to obtain edge scores for link prediction. Throughout the process, gradient descent and other optimization algorithms are used to update the model parameters to improve the predictive performance of the model. Finally, the model was evaluated using the area under the curve (AUC) and the F1 value as evaluation indicators.
4.1. Constructing the Data Model Based on the HWA Algorithm
In the research method of port information security link prediction based on the HWA algorithm, the first part is the data construction part, which is based on the adjacency matrix of the general graph to understand the relationship between nodes, and an algorithm based on HWA is proposed to construct the hypergraph association matrix. For some current real-world applications, the relationship between nodes can be extremely complex, and the edges of the general graph can only connect two nodes, and it is difficult to accurately represent complex relationships such as many-to-many, many-to-one, and so on; at this time, the general graph can not satisfy the demand, so the hypergraph is used to capture the complex relationship structure. For hypergraphs where node information is difficult to obtain, the first thing that should be carried out is to construct a hypergraph, and determining the number of hyperedges is a key issue in constructing a hypergraph, so this experiment represents the hypergraph association matrix by the adjacency matrix of the general graph. The target output of the algorithm defined in this experiment is the hypergraph association matrix
, and
Figure 3 describes the HWA algorithm as a flow diagram.
Based on the existing network topology, this paper proposes the HWA algorithm to construct the hypergraph. The hypergraph constructed by this algorithm mainly reflects the higher-order attribute information, and in order to better capture the higher-order relationships in the hypergraph and improve the performance of the prediction, the self-attention mechanism is used. Through the self-attention mechanism, these relationships can be transformed into hyperedges in the hypergraph while preserving the complex relationships between nodes, thus better representing the connection patterns between nodes.
A major problem in the implementation of this algorithm is how to determine the total number of hyperedges. To solve this problem, this paper uses the node as the center node of the hyperedges in the hypergraph and then determines the non-center nodes in each hyperedge by the optimal coefficient matrix of the HWA algorithm . Finally, the association matrix of the hypergraph is obtained by combining the variables of the adjacency matrix of the general graph .
The optimal coefficient matrix reflects the interactions between nodes in a general graph based on the local structure. The HWA algorithm to find the optimal coefficient matrix is shown in Equation (9), where
is the unit matrix,
is the transpose of the adjacency matrix of the general graph and
is the penalty coefficient.
The hypergraph association matrix equation shown in Equations (10) and (11) determines the higher-order nodes contained in each hyperedge based on a matrix
containing all possible interactions between the local structures of the network, where
is a matrix mapping, controlled by a threshold
.
denotes the higher-order nodes contained in the hyperedge.
denotes the hypergraph association matrix, and
is the adjacency matrix of the general graph.
By constructing the hypergraph through HWA algorithm, this paper obtains the association matrix of the hypergraph from the adjacency matrix of the general graph based on the network topology without the information of the hypergraph nodes. The algorithm utilizes the self-attention mechanism to demonstrate the degree of association between nodes during the formation of the hypergraph, thus better capturing the complex relationships between nodes. The method helps to subsequently capture the higher-order relationships in the hypergraph, enabling the hypergraph to more accurately express the feature information between the nodes and providing a reliable basis for further analysis and application.
4.2. Obtaining Node Feature Vectors Based on GCN
Traditional feature extraction methods only focus on the local neighborhood information of the nodes and cannot effectively capture the information in the global graph structure, and for large-scale graphs, traditional feature extraction methods increase the computational and storage costs intending to extract high-dimensional feature vectors, which reduces the generalization ability of the model. To avoid the above problems, GCNs are used to extract data features. The use of a GCN can fully utilize the provided graph structure information to obtain the feature representation of each node through inter-node information transfer, and residual connections are added to the GCN to solve the gradient propagation problem and maintain the integrity of information in deep networks.
In the data processing section, the hypergraph correlation matrix constructed in
Section 4.1 is transformed into node features, and the GCN is used as the model for training during the transformation process. During the entire data processing process, the GCN first constructs the input hypergraph correlation matrix into a hypergraph structure; then, the node features are initialized. Next, the node features are multiplied with the hypergraph association matrix using the hypergraph convolution operation combined with a learnable hypergraph convolution kernel to capture the complex relationship between nodes and hyperedges and generate the updated node features. Subsequently, a nonlinear transformation is introduced via a nonlinear activation function to enhance the model representation. The updated node features are then aggregated to obtain a higher-level representation of the nodes. These steps are iterated several times to propagate the information throughout the network. Finally, the iterated node features are input to the output layer for link prediction task processing. The whole process completes the extraction of node features and task processing by optimizing the cross-entropy loss function and continuously adjusting the model parameters to minimize the difference between the predicted values and the real labels.
Figure 4 describes the process of feature extraction by the GCN.
Adopting the GCN effectively utilizes the graph structure information to realize the information transfer between nodes and obtains the feature representation of each node through inter-node information transfer, which enables the model to better capture the local and global information of the graph data and improves its ability to characterize the node features and the performance of the model in terms of representing the node’s feature vectors well. Meanwhile, since graph data usually have a complex structure and a large number of nodes, the problem of gradient vanishing occurs in the deep network as the network depth increases. Therefore, in the above GCN process of obtaining node feature vectors, the residual connection is realized by adding the output of the first hidden layer with the input of the second layer so that the gradient can be directly propagated back to the earliest layer of the network through the residual connection, which, in turn, improves the training efficiency and performance of this model. The addition of residual connections also maintains the integrity of the input features and reduces the loss of information in the inter-layer transfer process.
The residual
, which is the gap between the model output and the actual output, is shown in Equation (12), where
is the output of the GCN model and
is the desired output.
In order to better capture the relationship between nodes and improve the representation ability of the model, a double-headed attention mechanism is added. It dynamically adjusts the importance between nodes by learning the attention coefficient and aggregates the nodes according to the characteristics of the nodes and the attention washing. The double-head attention mechanism introduces additional parameters and nonlinear transformation to improve the representation ability of the model and learn the attention weight. The model can automatically learn the importance and correlation of each node feature, thereby improving the ability of the model to model the different nodes of the soil structure in this paper.
The GCN is used to extract node feature information, fully utilize the graph structure to capture relationships and features between nodes, extract local features and preserve the global structure. This not only improves the flexibility of the model but also enhances the training efficiency and model performance. The hypergraph convolution kernel needs to maintain optimal learning performance throughout the entire iteration process. Therefore, in this paper, the node feature vectors are initialized in the data processing section, and residual connections are added in subsequent iterations to help effectively propagate node information between layers and alleviate the problem of gradient vanishing in the model. Due to the involvement of hypergraphs in this article, the high-order information contained in the hypergraph structure is processed between the layers of the model through activation functions. However, these activation functions may also face the problem of gradient vanishing during gradient propagation, so special attention needs to be paid to the impact of this phenomenon on the model’s performance. The saturation characteristics of the activation function and the complex high-order feature dependencies may lead to a gradual weakening of the gradient transmitted to each layer, thereby limiting the ability of effective learning. For the challenges encountered in this article, we chose the Relu activation function and added residual connections to alleviate the problem of gradient vanishing. The learnability of the hypergraph convolution kernel is effectively improved through the above two parts. Due to the involvement of hypergraphs in this article, the high-order information contained in the hypergraph structure is processed between the layers of the model through activation functions. However, these activation functions may also face the problem of gradient vanishing during gradient propagation, so special attention needs to be paid to the impact of this phenomenon on the model’s performance. The saturation characteristics of the activation function and the complex high-order feature dependencies may lead to a gradual weakening of the gradient transmitted to each layer, thereby limiting the ability of effective learning. For the challenges encountered in this article, we chose the Relu activation function and added residual connections to alleviate the problem of gradient vanishing. Residual connections not only help to address the problem of gradient vanishing, accelerate training and improve the model’s depth and expressiveness, but it can also help prevent overfitting and optimize the model’s performance. The addition of a dual-head attention mechanism improves the model’s ability to represent the graph structure applied in this paper. In summary, this model is suitable for processing complex graphical data, improving the model’s performance and generalization ability and providing a reliable basis for subsequent link prediction tasks and applications.