1. Introduction
The Tang Dynasty represents a pinnacle era in Chinese history, marked by flourishing culture and art. The excavation of Tang Dynasty gold and silver artifacts not only showcases the treasures of ancient Chinese craftsmanship but also provides crucial physical evidence for studying the social, political, economic, and cultural aspects of the Tang Dynasty. Many of these artifacts are classified as first-class national cultural relics or are prohibited from being exhibited abroad, highlighting their significance and value in world cultural heritage. These artifacts are diverse, encompassing a wide range of items, including utensils, drinking vessels, containers, medical tools, everyday miscellaneous items, ornaments, and religious instruments. As a valuable part of cultural heritage, Chinese scholars have conducted detailed research on the historical and artistic value of these artifacts. Yang [
1] pointed out that, since its discovery in 1970, the Tang Dynasty gold and silver hoard from Hejiacun has received widespread academic attention and the functions and historical significance of the objects have been clarified. Zhang [
2] explored the influence of Sogdian foreign culture on the creation, development, transformation, and integration of the octagonal cup with Tang Dynasty culture. Qi [
3,
4,
5] studied nearly a thousand pieces of Tang Dynasty gold and silverware, providing detailed identification and in-depth analysis of each object, resulting in the most comprehensive and in-depth research on Tang Dynasty gold and silverware to date, both domestically and internationally. A rich cultural heritage allows designers to fully utilize their knowledge for innovation and development, enabling the brilliance of Tang Dynasty gold and silverware to be carried forward in contemporary times. The design knowledge of Tang Dynasty gold and silver artifacts can be categorized and analyzed through interdisciplinary research across various fields, such as art history, cultural studies, materials science, and craftsmanship technology, reflecting their diversity and cross-cultural characteristics.
In the digital age, utilizing cultural heritage to rapidly aggregate diverse knowledge and construct a comprehensive knowledge system is a crucial step in the entire design process. This approach provides data and information support for preliminary research and information collection, helping designers make more reasonable and efficient design decisions while saving time and resources. In less technologically advanced eras, knowledge collection mainly relied on paper documents, field investigations, and manual collection. Clearly, traditional recording methods have become increasingly difficult to use. With the development of big data technology, the use of digital tools in the field of knowledge collection has become increasingly widespread. During the knowledge collection phase, to quickly obtain information and conduct effective preliminary research, both domestic and international studies have focused on the development of design knowledge databases. Google Arts & Culture [
6] collaborates with numerous world-renowned museums and institutions, providing online access to a variety of artworks and cultural artifacts. Additionally, the platform offers high-resolution images, virtual reality tours, interactive exhibitions, and detailed background information. The Palace Museum’s Digital Artifact Database is based on the Knowledge Graph of Ancient Chinese Movable Cultural Relics. Building on the catalog information of the museum’s collection, it expands to include interdisciplinary concepts and vocabulary from fields such as Forbidden City studies, art history, iconography, and biology. This allows users to access and utilize the information resources of the museum’s collection from multiple dimensions. Chinese scholar Wei Tong [
7] constructed a multilingual terminology electronic dictionary using Ming and Qing dynasty porcelain vases as an example, providing a new perspective for the digital preservation of cultural heritage.
From the perspective of knowledge management, researchers have constructed various models, such as an OWL-based and ontology-based building lifecycle management model, a lightweight and efficient large-scale RDF (resource description framework) data management system, and an XML topic map and ontology-based product development knowledge representation model to support knowledge sharing during the product development process [
8]. Analysis reveals that during the knowledge acquisition process in the aforementioned knowledge management models, there are issues of inefficiency and the inability to quickly and effectively obtain the necessary knowledge. Additionally, the architectural design of these platforms lacks sufficient visualization capabilities. Cultural heritage knowledge is complex and multifaceted, often encompassing multiple interdisciplinary fields. Designers need to extract effective knowledge information from vast amounts of data. Thus, establishing a visual data model based on cultural heritage knowledge has become an urgent problem to address.
The formation of modern knowledge graphs began in 2007, with landmark projects including DBpedia and Freebase [
9]. Knowledge graphs demonstrate unique advantages in the field of data integration, particularly in handling large-scale datasets that span various industries and formats. They are commonly used for data integration, search engine optimization, and intelligent recommendation systems. By analyzing existing big data modeling methods, they can be categorized into two types: metamodeling and ontology modeling. Each provides guidance and structural support at different levels for the construction and application of knowledge graphs. These two modeling methods have different focuses: metamodeling primarily concentrates on the integration, sharing, and exchange of data, while ontology modeling focuses on knowledge representation and logical reasoning [
10]. Ontology modeling and metamodeling have become critical steps in the construction of knowledge graphs. Therefore, this paper proposes a unified data model for cultural heritage knowledge information aimed at design, based on knowledge graph technology.
The main steps in constructing a knowledge graph include knowledge modeling, knowledge storage, knowledge extraction, knowledge fusion, knowledge computation, and knowledge application. Among these, knowledge extraction is the key step in establishing a knowledge graph, encompassing entity recognition and relationship extraction. Currently, most researchers adopt the traditional pipeline approach, dividing named entity recognition and relationship extraction into two separate subtasks processed sequentially. First, named entity recognition is performed, then the entities extracted by the entity model are paired for relationship matching, and, finally, relationship classification is achieved [
11]. The pipeline approach has distinct advantages in modularization, specialization, and ease of maintenance. However, in the traditional pipeline method, separating entity recognition and relationship extraction can lead to the propagation of errors. Joint extraction, on the other hand, effectively connects the tasks of entity recognition and relationship extraction. Joint extraction typically relies on deep learning methods, using a single model to simultaneously identify entities and extract the relationships between them. Tahsin [
12] used three variants of BERT (BERT, DistilBERT, RoBERTa) to train models for analyzing consumer complaints about laptops. Barroso [
13] explored how federated learning (FL) can be used in natural language processing tasks to handle disagreements among annotators. He proposed the FLEAD (federated learning for exploiting annotators’ disagreements) method, which uses FL technology to independently learn from all annotators’ opinions. Islam [
14] proposed a federated learning-based method that improves model performance through collaborative training among multiple clients without sharing data. Huang [
15] and others constructed a design knowledge graph framework and developed a design knowledge extraction model based on federated learning, enhancing efficiency and improving the structuring and visualization of design knowledge. Li Chao [
16] and colleagues pointed out the lack of reannotated data for named entity recognition in the field of cultural relics naming and the issue of nested entities in cultural relic names. They established the “Few Relics Data” dataset to address the problem of nested entities. Traditional federated learning has pioneered a new approach to joint entity-relationship extraction models, employing a method that first extracts relationships and then progressively predicts entities.
The aforementioned studies have effectively strengthened the interaction between subtasks during the learning process, significantly improving the error accumulation issue inherent in pipeline extraction. However, these methods mostly rely on unidirectional semantic features for entity recognition, which limits their ability to comprehensively identify contextual information within text segments and address the problem of entity redundancy.
The research is based on the construction of knowledge graphs using graph attention networks (GAT) and the BERT pretrained model. GAT is a deep learning model designed to handle graph-structured data. A graph is composed of nodes and edges, commonly found in scenarios such as social networks, biological networks, and knowledge graphs. Each node can contain a feature vector and edges represent the connections between nodes.
In traditional graph convolutional networks (GCNs), Liu [
17] explores a hybrid approach that combines transformer and GCNs for text classification tasks. Meanwhile, Ullah [
18] addresses the issue of over-smoothing in GCNs, where node representations become indistinguishable as the network depth increases. The paper suggests adding fully connected layers to mitigate this problem and prevent information from becoming overly smooth. Finally, Senior [
19] discusses the relationship between transformer architecture and graph neural networks (GNNs), pointing out that transformer can be considered a special type of GNN and that GNNs may offer better inductive biases in certain tasks. In contrast to GCN, graph attention networks (GAT) introduce an attention mechanism that allows each node to dynamically assign different weights to its neighbors, thereby capturing the dependencies between nodes more flexibly. Li [
20] proposed a new framework called multirelational graph attention network (MRGAT), aimed at better modeling the complex relationships and semantic information in knowledge graphs. Peng et al. [
21] employed dependency-GAT to capture long-distance dependencies between natural language questions and database schemas, improving the accuracy of SQL generation through alignment-enhanced generation.
BERT (bidirectional encoder representations from transformers) is a pretrained language model based on the transformer architecture, introduced by Google in 2018. Based on the transformer architecture, BERT aims to enhance text comprehension by learning sentence context in a bidirectional manner. The emergence of BERT has revolutionized the field of NLP, significantly improving performance in various tasks such as question-answering systems, sentiment analysis, and text classification. Asudani [
22] reviews traditional word embedding models (such as TF-IDF and bag of words) and distributed word embedding models (such as Word2Vec, GloVe, and fastText). It also introduces context-based embedding models like ELMo, GPT, and BERT. Kanakarajan [
23] introduces BioELECTRA, a model based on ELECTRA, pretrained specifically for the biomedical domain using full texts from PubMed and PMC (PubMed Central). Compared to BERT, these models lack the contextual awareness and deep understanding provided by BERT, which can pose challenges in more complex fine-tuning tasks and limit performance in tasks requiring deeper contextual understanding. Its innovation lies in its ability to understand the contextual semantics bidirectionally, greatly improving performance in natural language processing tasks. Rouabhi [
24] used BERT and BioBERT models, both transformer based, with a primary focus on improving multilabel classification performance through data augmentation. Kim et al. [
25] demonstrated significant performance improvements with the pretrained BERT model in multiple medical NLP tasks, particularly excelling in processing Korean medical texts.
The combination of GAT and BERT allows for better handling of complex relationships and contextual dependencies in graph structures, making them suitable for processing complex texts. Given the richness and diversity of knowledge surrounding Tang Dynasty gold and silver artifacts, integrating these two models can significantly enhance the effectiveness of knowledge graph construction and multilabel classification tasks.
3. Unified Data Modeling in the Cultural Heritage Knowledge Collection Stage for Design
3.1. Data Feature Analysis
Design knowledge spans various stages, including preliminary data collection, conceptual design, and other phases of the design process, involving multiple participants such as designers and user groups. Each group may have different understandings of design knowledge. The usage scenarios and recording standards also vary and the characteristics of design knowledge information are as follows:
- (1)
Information diversity: design knowledge encompasses structured, semistructured, and unstructured data. The diverse data forms make integration and processing cumbersome.
- (2)
Information overload: the sheer volume of design knowledge includes a significant amount of irrelevant or low-quality information, making it challenging for designers to promptly and accurately extract the relevant information they need.
- (3)
Ambiguity in meaning: different teams and organizations use various standards to describe design knowledge, leading to potential discrepancies in interpretation across different contexts, which increases communication costs.
- (4)
Dynamic iteration: design knowledge is continuously updated and iterated upon, with its collection progressing alongside the design project’s development and evolving requirements.
3.2. Unified Data Modeling Process for Knowledge Information Based on Knowledge Graphs
Given the unique and complex nature of design knowledge information related to cultural heritage artifacts, this section analyzes information integration, innovation needs, and interdisciplinary fusion in the design process. By combining knowledge management and application requirements, we propose a unified data modeling technique for cultural heritage design knowledge information based on knowledge graphs.
As illustrated in
Figure 1, the unified data modeling process for design knowledge information based on knowledge graphs consists of three modules: design knowledge data sources, design knowledge information data model construction, and design knowledge information data model application.
3.3. Key Technology Research
The construction of the design knowledge information data model encompasses several stages: knowledge modeling, knowledge storage, knowledge extraction, knowledge fusion, knowledge computation, and knowledge application. This study employs a bottom-up construction method, focusing on the design knowledge collection stage within the entire design process to build a preliminary design knowledge graph ontology framework. Based on the diverse data within design knowledge information, knowledge representation is carried out to assist in knowledge extraction, knowledge fusion, and knowledge storage. Among these stages, knowledge extraction and knowledge fusion are critical steps in constructing a knowledge graph. Using machine learning methods, we designed techniques for knowledge extraction and knowledge fusion and stored the resulting data.
Knowledge extraction technology involves the automatic identification and extraction of structured knowledge from textual data. This includes entity recognition, relationship extraction, and attribute extraction, which are used to construct knowledge graphs, databases, or other knowledge management systems. For example, in the sentence The gilded tortoise-pattern silver plate, peach-shaped, features a gilded tortoise in the center, entities such as gilded tortoise-pattern silver plate and tortoise can be extracted. The relationship between these entities is decorates, and the attribute of the gilded tortoise-pattern silver plate entity is gold and silver artifact, describing the shape attribute of the gilded tortoise-pattern silver plate entity.
In this extraction task, the attribute extraction task is redefined as a problem that combines entity recognition and relationship extraction. The core of this method lies in not only identifying entities in the text but also recognizing attribute values associated with these entities and linking these attribute values to the corresponding entities through specific relationships. Therefore, the focus of knowledge extraction should be on entity recognition and relationship extraction.
Structured data typically employs manually mapped rules. For data with an explicit structure, such as tables in databases, predefined rules can directly map entities and their relationships within the data. These rules are manually created based on the structural characteristics of the data and are used to identify and extract specific information. Semistructured data generally uses wrapper induction methods to identify and standardize the source code paths of the information to be extracted, which is used to extract information from semistructured web data. Design knowledge information, such as traditional cultural knowledge, often comprises unstructured data. Unstructured data (such as text files, text in images, etc.) is converted into an editable format through text recognition and other technologies. Specialized recognition techniques are needed to extract knowledge information from this data, identifying specific entities, relationships, and more.
Due to the generally low quality of most design knowledge information data, this study proposes an improved text information extraction method, namely an entity-relationship joint extraction method based on a segmental attention fusion mechanism. During the process of entity recognition and relationship extraction, the method fully considers the contextual information in the text data, enhancing the accuracy of entity recognition. The text is divided into multiple segments and entity recognition and relationship extraction are independently performed for each segment, addressing the issue of overlapping entity relationships.
After extracting knowledge from the text, it is necessary to merge entities with the same name. This is because the same name might refer to different entities or the same entity might be referred to by different names in various contexts. To effectively manage the extracted knowledge, these ambiguous entities with the same name need to be merged to ensure consistency in the knowledge base.
When dealing with situations where the same name refers to different entities, a pretrained language model based on contextual features is used to encode sentences containing these ambiguous entities. This approach captures the contextual information surrounding each entity. Then, the similarity between two entities with the same name is calculated based on their contextual encodings and compared to a predetermined threshold. If the similarity between the two entities is higher than this threshold, they are considered to have the same meaning. If the similarity is lower than the threshold, they are considered to refer to different entities.
When different names refer to the same entity, it is necessary to determine whether two or more expressions refer to the same entity. This ensures that the entities being compared and evaluated logically belong to the same category. To achieve this, sentences are encoded using a pretrained language model based on contextual features. This encoding is combined with a graph neural network (GNN) normalization model and a biaffine attention model with a feedforward neural network layer, utilizing two scoring mechanisms.
These scoring mechanisms leverage the encoded contextual information and the structural information from the GNN to calculate the likelihood that different named entities refer to the same entity. This score is then compared to a predetermined threshold. If the score is higher than the threshold, it can be inferred that the two differently named entities indeed refer to the same entity. Conversely, if the score is lower than the threshold, they are considered to refer to different entities.
When entity recognition faces the challenge of nested entities, such as entities embedded within other entities in the text, specialized techniques are required. For instance, in the sentence Apple’s iPhone 15, both Apple and iPhone 15 are entities, with iPhone 15 nested within the larger entity Apple’s iPhone 15. Similarly, a pretrained language model can be used to extract contextual features. These features are then processed using a graph neural network (GNN) and a biaffine attention model to capture the nested relationships between entities. Finally, a feedforward neural network layer is used for detailed feature analysis and recognition.
7. Discussion and Outlook
7.1. Association between Knowledge Graphs and Artifact Information
This paper takes the knowledge of Tang Dynasty gold and silver artifacts, part of China’s material heritage, as an example, focusing on the knowledge collection phase during the early design stage, addressing challenges such as knowledge data diversity, data redundancy, and data ambiguity. A unified data model for Tang Dynasty gold and silver artifact knowledge was constructed, using graph structures to express complex relationships and semantics between entities. By integrating semantic web technology and inference engines and leveraging linked data technology, the model enables the expression of more diversified knowledge. It can integrate heterogeneous data sources to form a unified knowledge representation. During the construction of the knowledge graph, users can intuitively and efficiently browse information through the integrated knowledge graph visualization function.
In prior research on Tang Dynasty gold and silver artifacts, researchers have generally followed a pattern in their descriptions of these artifacts: first providing a unified description of the form, components, and connection relationships of the artifacts, and then detailing each part’s features, patterns, and cultural connotations in a vertical order, such as from top to bottom or starting from the connection points. Based on Chinese language logic, different word types in the text are deconstructed, with irrelevant adjectives and auxiliary words being removed. For example, in the description of the silver cup with pointed lotus petals and wild goose patterns: “flared mouth, slightly outward-turned rim, curved belly expanding outward, trumpet-shaped octagonal high foot ring, with a circular petal hoop at the middle of the foot ring, and a circular tray at the top connecting the cup body. Formed by hammering, flat-chased patterns, gilded decorations, fish-roe ground. The cup body consists of two layers of lotus petals, nine petals per layer, with engraved birds, flowing clouds, trees, and landscapes within each petal. The foot ring tray is engraved with ruyi cloud patterns, while the foot surface is engraved with birds or symmetrical flowers”. In this description, “flared mouth, slightly outward-turned rim, curved belly expanding outward, trumpet-shaped octagonal high foot ring” provides a general description of the overall shape of the artifact, followed by “the cup body consists of two layers of lotus petals… the foot surface is engraved with birds or symmetrical flowers”, which describes the details of the artifact’s shape and explains the connection between the cup body and the foot ring. The silver cup with pointed lotus petals and wild goose patterns has two main components: the cup body and the high foot ring. The cup body features a flared mouth, an outward-turned rim, and an outward-expanding belly, while the high foot ring is trumpet shaped with eight petals and a circular petal hoop, connected to the cup body by a circular tray. The decorations on the cup body and foot ring are also described separately.
It becomes evident that the textual descriptions of various gold and silver artifacts closely align with the underlying construction logic of knowledge graphs. Important components can be extracted as nodes in the knowledge graph while connection and nesting structures can be extracted as edges (i.e., relationships), with material and excavation information serving as attributes. This structured approach simplifies these texts, making it easier for users to understand the complex knowledge system.
7.2. Effectiveness of Model Construction
The research combines graph attention networks (GAT) and BERT with the training model for joint extraction, significantly enhancing the ability to represent and integrate complex knowledge systems. By deeply encoding the text data related to gold and silver artifacts using the BERT model, its bidirectional contextual understanding capability precisely captures subtle semantic information within the text, ensuring the accurate extraction of each entity and its related descriptions. This process is particularly suitable for the complex and multilayered descriptive structures of gold and silver artifacts, from the overall shape of the artifact to the detailed features of its patterns and connections, all of which BERT can effectively identify.
After semantic encoding is completed, the dependency analysis module further parses the structural relationships between entities. For instance, in the description of the two key components “cup body” and “foot ring”, the dependency analysis can clearly determine their connection method and the interaction between them. Through this analysis, the model is not only able to recognize individual entities but can also systematically extract the structural dependency information between entities.
The graph attention network (GAT) further processes these entity nodes and their relationships by assigning weights. When handling complex descriptions of gold and silver artifacts, GAT excels at capturing the interdependencies between elements such as shape, pattern, and material. For example, in the case of the “silver cup with pointed lotus petals and wild goose patterns”, the cup body and high foot ring are treated as key modules and set as nodes in the knowledge graph while their connection and pattern details are precisely represented through edges and attributes in the graph structure. Through this structured approach, the model not only faithfully reconstructs the dependency relationships between complex entities but also ensures that the representation of this knowledge in the graph is more intuitive and organized.
By combining semantic analysis with multidimensional graph structure processing, the joint application of GAT and BERT, compared to traditional knowledge extraction methods, not only automates the extraction of entities and relationships but also ensures the stable operation of the entire knowledge system through deep exploration of the text’s semantics and graph structures.
7.3. Digital Innovation in Cultural Heritage Preservation
By constructing a knowledge graph platform for Tang Dynasty gold and silver artifacts, users can quickly access and integrate knowledge resources within the field. Due to the large volume of Tang Dynasty gold and silver artifacts and the varying descriptions across different periods, the knowledge information is extensive and complex. The platform unifies and organizes the descriptive data from various monographs, enabling users to conveniently and efficiently access accurate heritage knowledge, helping them to deeply explore and apply cultural heritage knowledge, thereby significantly improving the efficiency of knowledge integration.
For designers, the knowledge resources integrated into the platform allow them to flexibly apply elements such as the shapes and patterns of Tang Dynasty gold and silverware in modern design, accelerating design innovation. For the general public, the platform provides a quick and effective way to understand the complex and difficult knowledge of artifacts, not only preserving cultural heritage but also promoting its recreation and application in contemporary society. Through semantic analysis and reasoning functions, the platform uncovers potential knowledge hidden behind different entities and relationships. For example, by analyzing similarities between different artifacts and the inheritance of craftsmanship techniques, it can reveal more about the development patterns and cultural background of Tang Dynasty gold and silver artifacts, offering new perspectives and directions for academic research on cultural heritage.
This research introduces knowledge graph technology, providing a new method for the digital preservation of cultural heritage. This approach not only systematizes and visualizes vast, dispersed heritage knowledge but also forms a comprehensive knowledge network based on data linkage. With this technical approach, cultural heritage is no longer just static historical data but becomes a dynamic digital asset that can be interactively displayed, studied, and researched.
7.4. Limitations and Outlook
The model primarily relies on existing textual data for training and knowledge extraction and the descriptions related to Tang Dynasty gold and silver artifacts may vary due to differences in time periods and literary styles across sources. In the early stages of annotation, a certain level of manual intervention is required, especially during the construction and validation phases of the knowledge graph. Reducing the need for human involvement and increasing automation will help accelerate the construction of the knowledge graph and enhance its applicability across different fields. While the current visualization interface has improved the user experience to some extent, the interactivity and user-friendliness still need enhancement. The needs of different user groups, such as designers, researchers, and the general public, may differ. Therefore, balancing the demands of various users and providing personalized knowledge displays and interactive interfaces is a direction for future system optimization.
Future research will continue to enhance data collection efforts, integrating more heterogeneous data sources, such as archaeological literature, images, and physical records, to form a more comprehensive and enriched knowledge graph of cultural heritage. In response to practical needs, research will also focus on developing a knowledge-based Q&A system for cultural heritage design, utilizing the knowledge graph to handle more complex natural language processing tasks, making the retrieval and collection of design knowledge more efficient. Additionally, research will explore methods for integrating cross-domain data to establish connections between different cultural heritages. The knowledge management experience from Tang Dynasty gold and silver artifacts will be expanded to other cultural heritage fields, such as bronzeware, porcelain, calligraphy, and painting, to build a larger-scale knowledge graph system. By promoting this experience, a unified knowledge management platform encompassing a broader range of cultural heritage categories can be established, further advancing the digital preservation of China’s material cultural heritage.