Next Article in Journal
Boosting Winter Green Travel: Prioritizing Built Environment Enhancements for Shared Bike Users Accessing Public Transit in the First/Last Mile Using Machine Learning and Grounded Theory
Previous Article in Journal
The Impact of Land-Use Carbon Efficiency on Ecological Resilience—The Moderating Role of Heterogeneous Environmental Regulations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning

1
School of Mechanical Engineering, Beijing Institute of Technology, Beijing 100081, China
2
Yangtze Delta Region Academy, Beijing Institute of Technology, Jiaxing 314000, China
*
Author to whom correspondence should be addressed.
Sustainability 2024, 16(22), 9841; https://doi.org/10.3390/su16229841
Submission received: 4 August 2024 / Revised: 4 November 2024 / Accepted: 6 November 2024 / Published: 12 November 2024

Abstract

:
Large language model (LLM) and Crowd Intelligent Innovation (CII) are reshaping the field of engineering design and becoming a new design context. Generative generic-field design can solve more general design problems innovatively by integrating multi-domain design knowledge. However, there is a lack of knowledge representation and design process model in line with the design cognition of the new context. It is urgent to develop generative generic-field design methods to improve the feasibility, innovation, and empathy of design results. This study proposes a method based on design cognition and knowledge reasoning. Firstly, through the problem formulation, a generative universal domain design framework and knowledge base are constructed. Secondly, the knowledge-based discrete physical structure set generation method and system architecture generation method are proposed. Finally, the application tool Intelligent Design Assistant (IDA) is developed, verified, and discussed through an engineering design case. According to the design results and discussion, the design scheme is feasible and reflects empathy for the fuzzy original design requirements. Therefore, the method proposed in this paper is an effective technical scheme of generative generic-field engineering design in line with the design cognition in the new context.

1. Introduction

In modern industry and smart manufacturing, industries are being reshaped by two forces: the power of crowds and artificial intelligence (AI) represented by large language models (LLMs) [1]. Under these two forces, the fields of engineering design and product innovation design are rapidly developing along two paths: crowd intelligence innovation (CII) represented by the Problem–Solution (P-S) Co-evolution Model [2], and AI-Generated Content (AIGC) [3], which utilizes AI technology to generate content.
First of all, we should clarify the following concepts: intra-field design is a process in which a single-field design problem is solved directly in the same field, cross-field design is a process in which a single-field design problem is solved in a multi-field solution space, and generic-field design is a process in which a multi-field design problem is solved in a multi-field solution space. Clearly, cross-field design is a special form of generic-field design. It can be seen that the fundamental basis behind the shift from the intra-field design to generic-field design paradigm is the shift in the product conceptual design process from the traditional sequential model of problem-solving to the P-S Co-evolution Model [4]. Therefore, generic-field design has also become the most fundamental innovation mechanism and the most effective design problem-solving model in the context of CII and AIGC.
In the classical design theory, engineering design is categorized into heuristic design and generative design based on the process and output: heuristic design can output knowledge elements within various design domains such as technical, physical, and functional domains for stimulating the designer’s inspiration, whereas generative design needs to give relevant design solutions, i.e., system architectures that contain the physical structure and their relationships with each other. Compared with heuristic reasoning, generative reasoning needs to consider more dimensional knowledge elements at the technical level, and the technical methods are more complex. The fundamental task of generative reasoning is to provide a physical architecture that meets the conditions based on the input design requirements (including technical domains, functional requirements, etc.).
As shown in Figure 1, this study divides the product concept generation process into two phases: the first phase is responsible for discrete physical structure reasoning, and the second phase is responsible for physical structure relationship reasoning and interpretation to obtain a complete knowledge of the system architecture. In the first phase, the inference engine reasons about the set of discrete physical structures that satisfy the conditions based on the given technical fields and functional requirements. It should be pointed out that although the reasoning result is a discrete set and does not require an explicit expression of the relationship, the organic relationship between the physical structures must still exist implicitly, which means that the reasoning method in the first stage must take this into account. The second stage of the reasoning method is responsible for revealing the implicit correlations in the discrete set of physical structures and interpreting the generated results to a certain extent according to the designer’s needs.
Although generative intra-field design and heuristic generic-field design techniques have been widely used, generative generic-field design methods are still seriously lacking, and their main technical difficulties include the following:
(1) Lack of engineering design knowledge base that contains global coupling correlations: The essence of this problem is that there is no corresponding knowledge representation model based on the design cognitive model of global coupling, which leads to a structural solution set obtained by the heuristic method that shows a lack of systematic correlation and does not conform to the design cognitive model;
(2) Lack of AI method for engineering design with empathy: The essence of this problem is that AI’s understanding of design problems is too limited to the input text, which makes it difficult to form a generic-field problem-solving model, resulting in the lack of feasibility and innovation of the output scheme, and the lack of empathy with users in terms of use situation;
(3) Lack of system-level scheme generation method in line with product conceptual architecture: The causes of this problem are complex, including the lack of global coupling connection of the physical structure solution set obtained in the early stage, the lack of knowledge base to support architecture search, and the difficulty in learning the physical structure connection mode.
In the context of LLM and CII, the unreasonable use of design resources and computing power leads to problems such as low quality of design outputs, repetitive design processes, and difficult convergence of design schemes [5]. Therefore, the lack of appropriate generative methods may lead to new design sustainability problems [6].
To address the generative generic-field design process and the above technical difficulties, this study proposes a method based on design cognition and knowledge reasoning. Firstly, based on the topic model, a statistical model widely used in semantic analysis and text mining, ConceptTM, is proposed to generate a set of physical structures containing implicit connections for a given technical field and functional requirements. Second, in order to represent the hierarchical physical architecture of engineering products, this study proposes the Semantic Graph-based Techspecs Concept Ontology (SG-TCO) model based on semantic graph as the output form of the conceptual design solution, and with this goal in mind, the knowledge base G-Base is improved and constructed to support generic-field design. Based on this, we propose a system architecture generation method, G-Designer, which realizes scheme generation in the form of SG-TCO by finding similar instances from the knowledge base and learning structural connectivity patterns.
This paper will introduce the research progress in related fields in Section 2. Section 3 will clarify the research problem and scope, introduce the knowledge base constructed in the previous stage to support this research, and propose the research framework. Section 4 will introduce the physical structure set solving method ConceptTM, and Section 5 will introduce the system architecture generation method G-Designer. Section 6 will introduce the application platform constructed according to the methodology proposed in this research, and verify and discuss the feasibility and efficiency of the methodology based on a specific case. Finally, the contributions of this research and future research work will be summarized.

2. Literature Review

2.1. Knowledge-Based Design Combined with Design Cognition

Knowledge-based design (KBD) combined with design cognition (DC) mostly utilizes data sources in the form of natural language text [7,8]. It integrates natural language processing (NLP) to conduct heuristic knowledge reasoning that aligns with design cognition. Luo [9] assigns different AI techniques to different design task requirements and proposes a Double Hump Innovation Process Model based on the divergence–convergence cognitive process. Andrade et al. [10] used NLP techniques to alert designers to possible pitfalls in design concepts when appropriate, reducing design iterations. Tan et al. [11] focused on the design work of washing machine products. There are also studies related to design inspiration and knowledge recommendations using online consumer reviews [12] and project work texts [13]. Generic-field design involves a large amount and rich type of knowledge, which can be better utilized with AI technology to play the role of knowledge. With the continuous development of big data and big computing power, a series of emerging AI technologies such as the famous vector model [14], pre-training model [15], contrastive learning [16], generative model [17], and so on, have been derived. The development of AI technology has also contributed to the boom of AI applications in various other fields, and AI4Art [18,19], AI4Game [20,21,22,23], AI4Science [24,25,26], and other related work have emerged. In the field of engineering design, a blueprint for the work of AI4Design is gradually taking shape.
In terms of knowledge representation, knowledge graphs and semantic networks are the most popular approaches among researchers. Li et al. [27] constructed a knowledge graph based on medical requirements and technical documents to provide designers with design algorithms inspired by smart medical devices. Liu et al. [28] constructed a knowledge graph in the field of smart home product design. Wang et al. [29] achieved the goal of providing designers with risk prevention and feasibility analysis at the early design stage based on knowledge graphs. Hao et al. [30] constructed a knowledge graph on supply chain system design. Jia et al. [31] constructed a knowledge graph on mechanical product design. Liu et al. [32] proposed a Functional Structured Concept Network (FSCN) for mining users’ explicit and implicit requirements.
In addition to heuristic reasoning for designing knowledge entities, Chen et al. [33] proposed a method to inspire designers to think (QCue) based on the human reflective process, which mainly promotes thinking by answering users’ questions and asking them rhetorical questions. Jiang et al. [34] proposed a convolutional neural network for extracting the feature vectors of patent images for image classification and retrieval. Bai et al. [35] proposed a DMGAN model for transforming sketch features into real image features to retrieve the realistic counterparts of their design concepts. There are also some technical aspects of research available. For example, Bayesian networks as a form of knowledge representation [36], heuristic design using machine learning and optimization algorithms [37,38,39], manually constructing an expert system using expert experience [40], integrating multiple heuristic reasoning methods to construct a design system [41,42,43,44,45], etc.

2.2. Knowledge-Based Generic-Field Conceptual Design

The complexity of knowledge-based generic-field conceptual design lies in the fact that on the one hand, the knowledge sources used in the generic-field design must cover multiple engineering fields; on the other hand, the generic-field design must support cross-field knowledge fusion and integration, and therefore, needs to abstract the core conceptual elements within each engineering field.
Patents are the best source of knowledge used to support generic-field design. Sarica et al. [46] collected patent data from the USPTO and constructed TechNet, a large semantic network of engineering designs, for designers to perform design concept searches and evaluations. Siddharth et al. [47] improved upon TechNet and constructed a large and extensible knowledge graph for engineering design. Design knowledge characterization methods using semantic networks or knowledge graphs do not need to distinguish the class attributes of entities and can provide a unified description of design concepts in generic fields. However, if a more precise characterization of the conceptual scheme is required, entity category attributes still need to be defined and discerned. In order to clearly describe the connection between technical fields, Luo et al. [48] take IPCs of patents as technical field entities and utilize the citation relationship between patents to construct a technical map InnoGPS, which is used to calculate the knowledge distance between different fields so as to more effectively guide designers in cross-field knowledge integration. Liu et al. [49] built a technical terminology knowledge base based on the IPC of patents and standard terminology of functional flow, which is used to assist designers in searching for design solutions, and Ye et al. [50] also used IPC to classify design concepts into technical fields and realized the retrieval and recommendation of design concepts.
Combining the three domains of generic-field design, KBD and DC, heuristic design based on biological analogies also has a place in the existing research. The starting point of bio-inspired design lies in nature’s biodiversity and evolutionary system; in order to adapt to the requirements of the environment, organisms have evolved diverse coping mechanisms, and in this way, the structures formed are the result of natural selection. If a certain biological structure or mechanism is suitable for solving an engineering problem, it can often bring better results. Chen et al. [51] combined natural language processing technology to extract the structure and function of organisms from biological databases, which was used to inspire the design of wind turbines for noise reduction. Cao et al. [52] assisted in the design of natural resource collection systems in a similar way. In addition to functional–structural analogies, sometimes visual analogies can also provide references for designers. Based on human visual analogical thinking, Zhang et al. [53] proposed a deep clustering model that mines the similarity of visual edges for the visual analogies of sketches to provide inspiration for designers. However, the visual analogy method lacks a principled interpretation of engineering techniques, so applications tend to favor industrial design.
In terms of generic-field design tool development, Design Ideator is a conceptual design toolkit built by Narsale et al. [54] that provides designers with a variety of design inspirations such as context-switching, analogical reasoning, stimulus arousal, and combinatorial constructions, while Linkage is an online tool built by Eliot et al. [55] that provides cross-domain bionic design for design teams. ONPS is an ontology-based intelligent recommendation system for new product development proposed by Hsu et al. [56].

2.3. AI Based Generative Design

AI algorithms such as flow-based models and diffusion models, as well as deep learning methods such as the variable auto-encoder (VAE) and generative against network (GAN), are widely used in generative design. In terms of design objects, due to the simple composition of drug molecules and the availability of a large number of structured databases, studies related to the generative design of drug molecules are more abundant. Mahmood et al. [57] proposed a generative model for mask map modeling to generate drug molecules with similar physicochemical properties to the original data. Zang et al. [58] proposed a generative model based on the reversible flow model. Luo et al. [59] proposed a generative model based on a discrete flow model to accommodate the discrete nature of drug molecules. In terms of output, the generation of drug molecules is often encoded as a graph structure; i.e., a two-dimensional molecular map is used as the output rather than a three-dimensional molecular structure. The architectural domain more often uses 3D structures for generation, which is due to the fact that the structure of buildings is more fixed and regularity is more obvious. Wen et al. [60] achieve the automatic generation of 3D building models based on the structural elements of the line segments. Chang et al. [61] propose a GAN-based method for generating 3D building designs (Building GAN). Sydora et al. [62] proposed an automatic building design generation method based on building information modeling (BIM) rules. Compared to engineering design, which is concerned with functionality and systematicity, building design is more inclined to exterior and industrial design.
In the field of mechanical and structural design, the generative methods are different based on different design representations. Yaroslav et al. [63] use 2D CAD graphics as design representations, extract lines and constraints from them as the basic elements of the design, mimic the natural language generation, and use autoregressive modeling to learn the design. Ayush et al. [64] learn the design from the designer’s behavioral sequences and generate designs by capturing and mimicking the designer’s behavior. Wang et al. [65] proposed a GAN-based generative model for generating the geometrical structure of a crystalline cell that satisfies a given condition. Lee et al. [66] proposed a GAN-based method for generating tire treads using tire performance as the input. Zheng et al. [67] proposed a GAN-based method for generating tire treads in response to standardized program instructions and semantic reasoning mechanisms, and proposed a rule-based robot generation method.
From the studies collated above, it is not difficult to find that most of the current research for generic-field design problems adopt heuristic design methods, and it is difficult to form a methodology that realizes cross-domain knowledge fusion and conceptual program solving for engineering design problems. In the research of generative methods, most of the design problems and schemes belong to the same technical field. On the other hand, studies on generative design require good design characterization methods, large-scale datasets, or explicit generation rules as a prerequisite. Therefore, this study proposes a generative approach for engineering design that is consistent with design cognition, starting from problem formulation, design characterization, and a knowledge base that supports generic-field design.

2.4. Topic Modeling Approaches and Application

Topic model is a method of mining hidden topics in a corpus through modeling, and its essence is a statistical model for clustering the hidden semantic structure of the corpus [68]. The topic model can map words or phrases with the same topic to the same dimension through text dimensionality reduction, and effectively discover the topic structure hidden in large-scale text corpus [69], forming the effect of soft clustering on the vocabulary set. The implementation methods of the topic model mainly include the probabilistic topic model and neural topic model.

2.4.1. Probabilistic Topic Model

The modeling thinking of the probabilistic topic model is to regard the text as the probability distribution of the topic; that is, each text is composed of several topics, and the topic is regarded as the probability distribution of words; that is, each topic is represented by several words.
Blei et al. [70] proposed the first full Bayesian probabilistic topic model Latent Dirichlet Allocation (LDA), which is mainly used to speculate the topic distribution of documents. LDA selects a topic from each document with a certain probability, and then selects a word from this topic with a certain probability, and repeats the above steps continuously to obtain the final whole document. The model structure of LDA mainly contains two distributions; that is, the document is modeled as the distribution of topic, and the topic is modeled as the distribution of words. LDA has greatly promoted the development of the topic model, and a large number of improvements have been proposed based on the LDA model framework. Blei et al. [71] proposed the Correlated Topic Model (CTM) to model the correlation between topics by introducing the covariance matrix. LDA and CTM are both static topic models, which cannot model the timing of text. Therefore, researchers have proposed Dynamic Topic Models (DTMs) [72] and their extension to analyze the evolution of topics in text over time. On the other hand, a collapsed Gibbs Sampling Algorithm for the Dirichlet Multinomial Mixture Model (GSDMM) [73], Biterm Topic Model (BTM) [74], Multi-attribute Latent Dirichlet Allocation (MA-LDA) [75], and other models have also been proposed one after another, which are applied to sparse short text analysis.
Most of the above topic models are unsupervised machine learning models, which pay too much attention to maintaining text data and ignore other predictive information. The topic model based on supervised learning can effectively use text labels [76,77,78,79] to learn the topic distribution implied in documents to train the prediction model and produce more interpretable topics [80]. Representative, such as the Author Topic Model (ATM) [81], is a document generation model based on LDA. It believes that there is a correlation between the polynomial distribution of authors and topics, and models the document as a mixed distribution of topics and authors.

2.4.2. Neural Topic Model

When the data scale is expanded, the complexity of solving the topic model is greatly increased, and the topic model needs to be able to find the multi-level and multi-faceted feature information of the data. In the face of these new challenges, the neural topic model effectively avoids complex model derivation by combining the deep learning method and using the black box mechanism, and is better applied to downstream tasks with the flexibility of neural networks. The neural topic model optimizes the text generation process by combining word embedding [82,83,84,85], Variational Auto-Encoders (VAEs), neural networks, and other frameworks.
Word embedding is a kind of vocabulary vector representation learning technology in natural language processing, which maps the lexical features from high-dimensional space to low-dimensional continuous space into dense vectors. The Topic2Vec model [86] proposes a hybrid method of extracting features from documents. LDA is used to model the global relationship from each document to all topics, and Word2Vec is used to learn target words from the context to capture these relationships so as to realize the integration of context relationships between words.
VAE is a coding–decoding network structure proposed by Kingma et al. [87] that compresses input data into potential features and reconstructs text data distribution. Researchers have proposed a series of VAE-based topic model designs [88,89,90,91,92], providing an extensible and powerful deep generation framework for potential topic modeling [93,94,95,96].
The early neural topic model was based on the multilayer perceptron neural network with forward propagation [97]. Later, aiming at the sparsity of distribution, a neural topic model with sparse constraint was proposed [98,99,100]. The existing neural topic model can learn the complex nonlinear distribution of text and effectively adapt to various natural language processing tasks for mining text topics [101,102]. However, in the face of complex text expression and context dependence, the learned document features still have problems with inconsistent topics and the low utilization of data and knowledge in the text [103,104].
Topic clustering is also common in engineering design. For example, a technical field can be divided into more detailed sub-fields, and the physical structure distribution under each sub-field is different. In this case, a given technical field, divided sub-fields, and all physical structures can be regarded as specific articles, topics, and vocabulary, respectively. Therefore, this study will also use the topic model to reconstruct Bayesian design thinking and build a generative design model.

3. Problem Formulation, Preliminary Preparations, and Research Framework

3.1. Problem Formulation

In the stage of problem formulation, the IPO (Input–Process–Output) model of generic-field generative design should be specified first.
  • Input: While the input of generative design is often the functional requirements, in the generic-field design process, the concept of technical fields needs to be introduced as another type of element in the input. Considering that this study uses patents as the knowledge source, IPC code can be used as the input form of technical field class. Many complete functional effect bases have been formed in the field of engineering design, and each term can be used as a functional requirement.
  • Process: In this study, it is proposed to realize the generic-field design through two processes: discrete physical structure set generation and system architecture generation.
  • Output: In order to specify the output form of the task, a hierarchical characterization of the physical architecture needs to be proposed. An ontology model Techspecs Concept Ontology (TCO) [105] for expressing the hierarchical architecture of engineering products has been proposed in the research related to Product Family Design (PFD) based on a Modular Platform. The architecture of TCO is shown in Figure 2a, including the component layer, Modular Layer, and Product Layer. However, the TCO model ignores the interactive correlations between different units under the same layer, so this study proposes the Semantic Graph-based TCO (SG-TCO) model, as shown in Figure 2b. The SG-TCO model uses a graph structure at the component level to characterize the correlations between physical structures, and the modules at the module level are no longer simple collections of discrete physical structures, but communities or clusters in the semantic graph at the component level.
Figure 2. Difference between TCO and SG-TCO.
Figure 2. Difference between TCO and SG-TCO.
Sustainability 16 09841 g002
In response to the above generic-field design process, this research also needs to focus on solving the following four problems:
  • Problem 1: Global coupling knowledge correlation affecting conceptual scheme generation.
As shown in Figure 3 for a simple lighting system, two models are used here to represent the relationships between the three elements in the system.
Figure 3 shows a directed semantic network consisting of two pieces of knowledge “weather affects lighting” and “light affects lighting”. The information obtained is that both weather and light affect lighting, but this information is still Pair-wise and does not explain the global coupling between the three. In this small lighting system, the coupling of weather and lighting can be accomplished by the designer’s brain; however, once large and complex systems are involved, simple human considerations may ignore these influences, leading to the generation of low-quality design solutions. In the process of solving generic-field design problems, it is necessary to establish joint distributions across different design domains (technical, functional, and structural) for reasoning about physical structure solutions, and thus, it is necessary to understand the coupling and interactions between various elements in the system, which cannot be fulfilled by fragmented modes of knowledge representation, such as knowledge graphs or semantic networks.
In this study, we consider the use of the Probabilistic Graphical Model (PGM) to model the global correlations between conceptual elements and to measure the coupling effects and uncertainties within the system from a quantitative perspective. A Bayesian network, as illustrated in the figure, quantifies the causal relationships and gives the coupling effects of weather and lighting on lighting, providing designers with more comprehensive information.
  • Problem 2: A discrete physical structure set solving model in accordance with the designer’s cognitive mode.
Studies in cognitive science and neuroscience have shown that the thinking process of the human brain is likely to follow the Bayesian paradigm, and scholars call this hypothesis the “Bayesian brain” [106,107,108]. The core view of the Bayesian paradigm is that human understanding of things is prior. On the basis of this prior assumption, likelihood assessment is conducted on the various manifestations or products of things. After the assessment, the posterior is obtained by modifying the prior according to the assessment results. This view can be expressed by the famous Bayesian formula as follows:
P x | D P x P D | x
where x is the thing or hypothesis to be evaluated, and D is the phenomenon or data obtained from observation or experiment on the thing or hypothesis. The symbol indicates that there is a proportional relationship between the two before and after the symbol. The product concept design process is a thinking process with strong cognitive attributes. Considering the conceptual scheme as a collection of structures, the concept design process is a structure-by-structure sampling process, and the assumption of the “Bayesian brain” is obviously very suitable for this process, which is called Bayesian design thinking (BDT).
In the physical structure set solving process, the goal is to obtain a physical structure that satisfies the conditions based on the given technical fields and functional requirements. From the perspective of BDT, the physical structure is the object to be evaluated, and the a priori and likelihood should be based on the technical fields and functional requirements, respectively. In the general design context, on the one hand, the physical structure is considered to be subordinate to a specific technical field, e.g., a notebook belongs to the field of electronic information, and the concept of “electronic information” provides the a priori technical domain for “notebook”; on the other hand, “structure determines function” has been widely recognized by structure-functionalism [69] in the natural sciences and the FS theory in the design sciences [109], so the likelihood term in the BDT is the likelihood that a particular structure has a particular function.
In summary, the physical structure set solving process based on BDT in this study can be expressed as follows:
P S | T , F P S | T P F | S
where S represents the physical structure and T and F are the technical field and functional requirements, respectively.
  • Problem 3: Physical structure set solving method.
As described in Problem 2, considering the conceptual design process as a structure-by-structure sampling process, this study considers the topic clustering method for conceptual solution: firstly, a sub-field topic is sampled based on the distribution of sub-field topics under the technical field of the product; secondly, a structure is sampled based on the distribution of the structures under the sub-field topics, and the process is repeated until the generation of the structure set is completed.
Therefore, in the process of solving P S | T , F , P S | T can be expressed as a topic model based on the above process: construct distributions on all the physical structures as sub-domain topics, and then construct distributions on all these topics as technical domains. Similarly, the P F | S topic model is constructed. In summary, this study uses a topic model to solve the problem of generating a conceptual scheme—i.e., a collection of discrete physical structures—(the topic model is called ConceptTM), thus capturing the clustering phenomenon in design concepts and reducing the parameters of the model for generalization and computation.
  • Problem 4: System architecture generation method.
For generic-field design output in the form of SG-TCO, it is necessary to first establish connections at the physical structure (component) level, and then further cluster the components according to functional modules and construct connections at the module level.
In this study, we first improve the existing knowledge base by constructing structural node connections between the component level and the module level. Based on the knowledge base, for the new generative generic-field design, search-based generation and connect-based generation methods are proposed to construct connections in the discrete physical structure set to generate the system architecture.

3.2. Preliminary Preparations

3.2.1. Knowledge Base Building for Discrete Structure Set Generation

Based on the Function–Behavior–Structure (FBS) model and axiomatic design theory, this study incorporates functional and physical domains into the proposed multi-domain design knowledge representation model. In addition, it is necessary to introduce a technical domain to characterize the relationship between different technical fields, which is the essential requirement of generic-field design.
In summary, this study proposes a three-domain (technical, physical, and functional) generic-field design knowledge representation model consisting of three basic design elements (technical domain, physical structure, and functional effect). Based on the three-domain model, an engineering design product is the sum of the three basic elements, so a design concept can be represented as follows:
P n = { T i } i = 1 I n , { S j } j = 1 J n , { F k } k = 1 K n , R n
In the formula, P n represents the design concept described in the nth patent; T i is the ith technical element that the corresponding design concept has, and I n is the total number of technical elements; S j is the jth physical element that the corresponding design concept has, and J n is the total number of physical elements; F k is the kth functional element that the corresponding design concept has, and K n is the total number of functional elements; and R n and P n are the original texts of the corresponding patent describing the detailed relationship between T i , S j , and F k . All the existing design concepts constitute a base knowledge base:
K B = { P n } n = 1 N
In this study, Chinese invention patents are selected as the knowledge source, and the relevant data can be downloaded from the Patent Data Service Pilot System of the State Intellectual Property Office. The three domain elements in the knowledge base can be extracted from the invention patent data using natural language processing techniques:
  • In this study, IPC codes are used as the technical domain elements, and each code represents a technical field. The first four bits of the IPC code are taken as the characterization of the technology element; i.e., the technology coverage of the technology element is taken as the degree of the subcategory, e.g., B64C is a technology element representing the technology domain of the flight-related technology.
  • Physical domain elements are represented using physical structure terms, and all the terms together constitute the physical domain. In this study, the main physical structure terms referred to in the patent are extracted directly from the illustration section of the accompanying drawings using patent characteristics based on the method of constructing regular expressions.
  • Functional domain elements are represented by functional effect terms, and in the patent text, the abstract, the declaration of rights, the content of the invention, and the specific embodiment are the parts on which the functional effect terms are concentrated. In this study, function effect terms are extracted for the above content based on the word division tool and TF-IDF algorithm, and all the terms together constitute the functional domain.
Finally, this study extracts the above design concepts from 39,463 patents to form the technical domain set T (|T| = 582), the physical structural domain set S (|S| = 4420), and the functional domain set F (|F| = 24246). The constructed knowledge base will support the discrete physical structure set solving process.

3.2.2. Construction of SG-TCO Knowledge Base for System Architecture Generation

The base element of the SG-TCO knowledge base (later referred to as G-Base) is SG-TCO instances; in this study, one SG-TCO instance can be extracted from each patent based on constructing correlations at two levels, component and module, to form the G-Base. The G-Base can be expressed by the following equation:
G B a s e = { SG - TCO n } n = 1 N
SG - TCO = G c , S m , P , G c = V c , E c , S m = { M l } l = 1 L .
In the formula, G c represents the component semantic graph, S m represents the set of module subgraphs, and P represents the description of the other attributes of the product such as name and IPC code. V c represents the set of the nodes of the underlying physical structures, E c represents the set of the connected edges of the semantic correlation relations between the physical structures, M l represents the set of underlying physical structures that the module l has, and L is the number of modules.
This study adopts the bottom-up construction method, starting from the component layer to build the component semantic graph, then extracting the corresponding physical modules at the module layer, and finally adding the ancillary information to form a complete SG-TCO instance.
(1) Component level: semantic graph-based relationship recognition.
Component semantic graph refers to the graph relations between interrelated underlying physical structures, and the essence of constructing a component semantic graph under a given patent is the problem of calculating the strength of semantic correlations between the underlying physical structures that the patent has. The physical structure elements in the patent can be used as the component layer nodes of SG-TCO instances. The number of co-occurrences between two nodes is counted by the regular matching method, which is used as the semantic correlation edge of the component semantic graph. During the construction process, the component semantic graph can be constructed into two modes: undirected graph and directed graph.
(2) Module level: module clustering based on community detection algorithm.
In G-Base, module-level physical modules are defined as a subset of more closely connected physical structures in the component semantic graph. In the field of graph computing, the above methods for mining the subset of tightly connected nodes are collectively known as community detection [110,111].
For the undirected graph pattern component semantic graph, Modularity-based Louvain’s algorithm [112] is used to extract physical modules. Modularity is a metric to evaluate the good or bad results of community detection, and its physical meaning is the difference between the sum of the weights of the connected edges of nodes inside the community and the random case, which is calculated as follows:
Q = 1 2 m i , j A i , j d i d j 2 m δ c i , c j .
In the formula, A is the undirected graph adjacency matrix, d i d j is the node degree value, m = 1 2 i , j A i , j is the total weight of all the connected edges, c i c j is the community into which the corresponding node is divided, and δ c i , c j is the Dirichlet function:
δ c i , c j = 1 ,   c i = c j 0 ,   c i c j
For directed graph schema component semantic graph, the Map Equation-based Infomap algorithm [113] is used to extract the physical modules. The Map Equation is of the following form:
L C = q H Q + c p c H P c , H Q = c q c q l o g q c q , H P c = q c p c l o g q c p c i c p i p c l o g p i p c .
In the formula, p i is the ith element of the smooth distribution p with the directed adjacency matrix B as the transfer probability, i.e., the probability of node i appearing in the wandering sequence after an infinitely long period of random wandering. There
p = B p
The other parameters are calculated as follows:
q c = i c j c p i B i , j , q = c q c , p c = q c + i c p i .
The physical meaning of Map Equation is the information entropy of the sequence obtained by randomly wandering in the network system after performing the community division, so the goal of the Infomap algorithm is to find the optimal community division C-minimization equation.
In addition to physical modules based on the community form, G-Base also focuses on physical modules based on the clique form. A clique is a subgraph that constitutes a complete graph in a graph structure, and any two nodes of these subgraphs have edges. As a supplement to the community form, the clique can improve the recall rate of potential physical modules. In G-Base, both the community and the clique require no less than three component nodes to be recognized as a physical module.

3.3. Research Framework

To address the above problems in the generative generic-field design process, a research framework is proposed as shown in the Figure 4:

4. ConceptTM: Structure Set Generation Method Based on Topic Model

4.1. Graphical Representation of ConceptTM

The infrastructure used by ConceptTM is a directed probability graph, that is, a Bayesian network. More specifically, it is a topic model similar to LDA. As a generative model, ConceptTM describes the generation process of engineering products including technical fields, physical structures, and functional requirements:
(1) Input the technical fields { T n , i } i = 1 I n involved in the nth engineering product;
(2) Sample one of the T n , j from the given technical fields;
(3) Sample technical topics Z n , j s according to the distribution of technical topics in the technical field θ T n , j s ;
(4) Sample the physical structure S n , j according to the distribution of physical structures in the technical topics   β Z n , j s ;
(5) Execute the above physical structure sampling J n times to complete the sampling of physical structures;
(6) Sample one of the S n , k from a given physical structure;
(7) Sample a physical topic Z n , k f according to the distribution of physical topics in the physical structures θ S n , k f ;
(8) Sample functional requirements F n , k according to the distribution of functional requirements in the physical topics β Z n , k f ;
(9) Execute the above functional structure sampling K n times to complete the sampling of functional requirements.
Repeat the above generation process N times to obtain engineering product datasets:
D = { { T n , i } i = 1 I n , { S n , j } j = 1 J n , { F n , k } k = 1 K n } n = 1 N
For PGM, graph representation implies a predefinition of the model based on a priori knowledge or assumptions, including the definition of the graph structure and of the variable types. Figure 5 illustrates the graph representation of ConceptTM.
The variable elements represented by shaded nodes in the figure are Visible Variables, which are the elements described in the KB; the variables represented by non-shaded nodes are assumptions of the model, which will not be observed in the actual dataset, and are Latent Variables. The Technical Topic Distributions θ t s , Topic Distributions β z s , Structural Topic Distributions θ s f , and Functional Topic Distributions β z f are sampled from their respective priors, and are the most important model parameters of ConceptTM. The inference process of ConceptTM describes the joint distribution of elements within an engineering product dataset:
P θ t s , β z s , T n , j , Z n , j s , S n , j , S n , k , θ s f , β z f , Z n , k f , F n , k | T n , i
= t = 1 T P θ t s z = 1 Z s P β z s s = 1 S P θ s f z = 1 Z f P β z f
× n = 1 N j = 1 J n P T n , j | { T n , i } i = 1 I n P Z n , j s | θ T n , j s P S n , j | β Z n , j s s
× n = 1 N k = 1 K n P S n , k | { S n , j } j = 1 J n P Z n , k f | θ S n , k s P F n , k | β Z n , k f f
Equation (14) is the joint distribution corresponding to the generation process of each topic distribution. Equation (15) is the joint distribution corresponding to the generation process of the physical structure, i.e., the solving process of P S | T ; Equation (16) is the joint distribution corresponding to the generation process of the functional requirements, i.e., the solving process of P F | S . The technical fields T n , i of the individual engineering products are not derived from sampling, but are given by the designer, which facilitates the explicit control of technical field integration.
Due to the architecture of the topic model with the use of latent variables, the individual physical structures are not conditionally independent from each other for a given technical fields and functional requirements, i.e.:
P { S j } j = 1 J | T , F j = 1 J P S j | T , F .
This suggests that ConceptTM is capable of mining global coupling connections between physical structures, and can construct sets of discrete physical structures with implicit correlations that are amenable to generative reasoning.

4.2. Training Method of ConceptTM Based on MVI

The mechanism of ConceptTM is Bayesian inference, i.e., the computation of P S | T , F . According to Equation (13),   θ t s , β z s , θ s f , a n d   β z f need to be estimated by a training method of fitting dataset D.
Mean field Variational Inference (MVI) [114] is an approximation algorithm for solving complex multivariate posterior distributions, which assumes that the posterior distributions are independent among multiple variables. After obtaining the approximate posterior distribution of a variable, the mean of the approximate posterior is often used as the point estimate of the variable, which is how ConceptTM obtains the point estimate of the variable θ t s , β z s , θ s f , β z f . The original true posterior distributions of ( θ t s , β z s ) are conditionally independent, i.e.,:
P θ t s , β z s , θ s f , β z f | D = P θ t s , β z s | D P θ s f , β z f | D
And in ConceptTM, P S | T and P F | S obviously have the same structural characteristics, so it is sufficient to apply to ( θ s f , β z f ) after studying the estimation method of ( θ t s , β z s ). The graphical representation of the process of estimating the posterior distribution of ( θ t s , β z s ) by inference from the mean variational field is shown in Figure 6:
In this case, the MVI of variables only plays an auxiliary role in solving, and the results can be discarded after the model training is completed. The approximate posterior distribution of ( θ t s , β z s ) is expressed using a parameterization as follows (~represents the distribution that the variable follows):
q T n , j | λ n , j C a t e g o r i c a l λ n , j , q Z n , j s | γ n , j C a t e g o r i c a l γ n , j , q θ t s | α t s D i r i c h l e t α t s , q β z s | η z s D i r i c h l e t η z s .
The MVI is solved with the optimization objective of minimizing the Kullback–Leibler Divergency (KLD) between the true posterior distribution and the approximate posterior distribution, and its equivalent problem is maximizing the Evidence Lower BOund (ELBO):
m a x ϕ E L B O = z q z | ϕ l o g P x , z q z | ϕ
x = ( T n , i , S n , j )
z = ( T n , j , Z n , j s , θ t s , β z s )
ϕ = λ n , j , γ n , j , α t s , η z s
ϕ is the parameter to be solved and has the following:
P x , z = t = 1 T P θ t s | α s z = 1 Z s P β z s | η s n = 1 N j = 1 J n P T n , j | { T n , i } i = 1 I n P Z n , j s | θ T n , j s P S n , j | β Z n , j s s
q z | ϕ = t = 1 T q θ t s | α t s z = 1 Z s q β z s | η z s n = 1 N j = 1 J n q T n , j | λ n , j q Z n , j s | γ n , j
Since the model features of ConceptTM are consistent with the Author Topic Model (ATM) [115], training can be performed directly using the algorithm in Gensim [116]. The a posteriori means are calculated as the point estimates of the parameters after the optimization is complete:
θ ^ t s = α t s s α t s
β ^ t s = η z s s η z s
θ ^ s f = α s f f α s f
θ ^ z f = η z f f η z f
Furthermore, compute P S | T and P F | S :
P ( { S j } j = 1 J | { T i } i = 1 I ) = 1 I J S j { S j } j = 1 J T j { T i } i = 1 I z Z s θ ^ T j z β ^ z S j
P ( { F k } k = 1 K | { S j } j = 1 J ) = 1 J K F k { F k } k = 1 K S k { S j } j = 1 J z Z f θ ^ S k z β ^ z F k
At this point, the generation of a discrete physical structure set with implicit global coupling correlations can be accomplished by Bayesian inference.

4.3. Reasoning Method of ConceptTM Based on Sampling

The core of ConceptTM’s generative inference is the Bayesian formula:
P { S j } j = 1 J | { T i } i = 1 I , { F k } k = 1 K P { S j } j = 1 J | { T i } i = 1 I × P { F k } k = 1 K | { S j } j = 1 J
In the formula, { T i } i = 1 I and { F k } k = 1 K are the given technology fields and functional requirements, and { S j } j = 1 J is the set of discrete physical structures that possess implicit correlations. The event space of the posterior distribution is a power set of all the physical structures, and if the technical fields and functional requirements that are conditions are also considered, then the total space will be so huge that it will not be possible to accurately obtain the probabilities of each subset. In this case, this study provides two methods for sampling the above distributions, namely Gibbs and Maximum A Posterior (MAP).

4.3.1. ConceptTM’s Gibbs Sampling

Gibbs sampling as an iterative algorithm can greatly reduce the computational complexity to obtain sampled samples that approximately obey the distribution. The specific process is shown in the following Algorithm 1:
Algorithm 1 Gibbs sampling of ConceptTM
Input :   Posterior   P { S j } j = 1 1 | { T i } j = 1 1 , { F k } k = 1 K
Output :   Samples   S G = { { S l , j } j = 1 L } i = 1 L
1 : S G 2 : S e t   { S j } j = 1 J   as   some   random   physical   concepts 3 : f o r   l   in   range 1 , L   do 4 : f o r   j   in   range 1 , J   do 5 : S a m p l e   S l , j P S j | { T i } i = 1 I , { F k } k = 1 K , { S j } j = 1 , j j J 6 : S j S l , j 7 : e n d   f o r 8 : A d d   S l , j j = 1 J   t o   S G 9 : end   for 10 :   return   Samples S G

4.3.2. ConceptTM’s MAP Sampling

MAP is a point estimation method in Bayesian statistics, and applying maximum a posteriori estimation to ConceptTM implies taking the posterior distribution P { S j } j = 1 J | { T i } i = 1 I , { F k } k = 1 K of the set of discrete physical structures as the inference result:
S M = a r g m a x { S j } j = 1 J P { S j } j = 1 J | { T i } i = 1 I , { F k } k = 1 K
Again, due to the large size of the event space, the following greedy Algorithm 2 was used for the approximate solution:
Algorithm 2 Greedy MAP Sampling of ConceptTM
Input :   Posterior   P { S j } j = 1 1 | { T i } j = 1 1 , { F k } k = 1 K
Output :   Approximate   MAP   Estimation   S G = { { S l , j } j = 1 L } i = 1 L
1 : S M 2 : f o r   j   i n   r a n g e   1 , J   d o 3 : S j a r g m a x S j P S M { S j } | { T i } i = 1 I , { F k } k = 1 K 4 : S M S M { S j } 5 : e n d   f o r 6 : f o r   l   i n   r a n g e   1 , L   d o 7 : f o r   j   i n   r a n g e   1 , J   d o 8 :   S l , j a r g m a x S j P { S j } S M / { S j } | { T i } i = 1 I , { F k } k = 1 K 9 : U p d a t e   S M   w i t h   S j S l , j 10 :   end   for 11 :   end   for 12 :   return   Estimation   S M
At this point, ConceptTM has completed the entire training and reasoning process, realizing the generative design of a discrete physical structure set for products.

5. G-Designer: System Architecture Generation Based on Semantic Graph

5.1. Architecture of G-Designer

As can be seen from problem formulation, G-Designer, as a knowledge-based system architecture generation method, consists of a knowledge base G-Base and a knowledge inference engine G-Infer:
G - Designer = { G - B a s e , G - I n f e r }
In the formula, the composition and construction method of G-Base has been introduced in the previous section. G-Infer is the executor of knowledge reasoning and is responsible for transforming the collection of discrete physical structures into physical architectures ( indicates that the left side of the symbol represents the input and the right side represents the output):
G - Infer : S i S G - TCO i , s . t . S i S
In the formula, S is the set of physical domains and Si is a subset of them. G-Infer includes two implementation methods: one is knowledge reuse and the other is knowledge generalize.
Knowledge reuse refers to the feedback of possible physical architectures based on the correlation with the set of input discrete physical architectures by using the existing SG-TCO instances in G-Base as the output. The knowledge reuse-based reasoning method is called G-Searcher. G-Searcher does not generate new instances by itself, but the existing instances have been screened and certified, and its feasibility and reliability are highly guaranteed:
G - Searcher : S i SG - TCO i , s . t . S i S , SG - TCO i G - Base
Knowledge generalization refers to capturing the physical architecture to form patterns by learning patterns from a large number of instances in G-Base so as to complete the complementation of relationships between discrete physical structures and form the system architecture. The knowledge generalization mechanism is based on relational connectivity, so the reasoning method is called G-Connector:
G - Connector : S i SG - TCO i , s . t . S i S , V c ( SG - TCO i ) = S i
In the formula, Vc(SG TCOi) represents the set of physical structures at the component level of SG-TCOi instances. The G-Connector complements the G-Searcher in that it can generate new SG-TCO instances on its own, providing greater potential for innovation.

5.2. Knowledge Infer Method of G-Designer

5.2.1. G-Searcher Based on Knowledge Reuse

G-Searcher is a reasoning method based on knowledge reuse, whose main reasoning task is to find instances corresponding to a given set of discrete physical structures from G-Base, which essentially constructs a correlation between two structures:
{ S i } i = 1 I SG - TCO , s . t . i , S i S ,   SG - TCO G - Base
In this study, two correlation calculation methods, node symbol similarity and structural semantic similarity, are used.
(1) Node symbol similarity calculation
Calculating the similarity of the set of physical structures contained in an instance to a given set of physical structures can be performed using the Jaccard coefficient:
J a c c a r d ( { S i } i = 1 I , SG - TCO ) = | { S i } i = 1 I { S j } j = 1 J | | { S i } i = 1 I { S j } j = 1 J |
Or using the Dice factor:
D i c e ( { S i } i = 1 I , SG - TCO ) = 2 | { S i } i = 1 I { S j } j = 1 J | | { S i } i = 1 I | + | { S j } j = 1 J | = 2 | { S i } i = 1 I { S j } j = 1 J | I + J
It can be seen that the node symbol similarity is characterized by precise matching; however, the structural relationship between the underlying physical structures is ignored and the fact that different structures may not contribute the same to the instance as a whole is not taken into account.
(2) Structural Semantic Similarity Calculation
Different underlying physical structures contained in a product play different roles and have different degrees of importance, and this different importance can be calculated through the component semantic graph.
Therefore, the first step in structural semantic similarity computation is to utilize the semantic graph information of instances to assign different contribution weights to different underlying physical structures [117]. In this study, the node degree in the undirected graph model is used as an evaluation metric, which is normalized to obtain the importance weights:
j [ 1 , , J ] : w j = d j j = 1 J d j
Second, the semantic correlation of two structures is computed based on semantic correlations. After obtaining the importance weights, the correlation between a given set of physical structures and SG-TCO instances can be calculated as follows:
R e l e v a n c e ( { S i } i = 1 I , S G - T C O ) = ( i = 1 I w i v ( S i ) ) ( j = 1 J w j v ( S j ) ) = i = 1 I j = 1 J w i w j v ( S i ) v ( S j ) = i = 1 I j = 1 J w i w j κ ( S i , S j ) .
In the formula, κ ( S i , S j ) is the synthesized semantic correlation between two physical elements, including contextual semantic correlation and morphological semantic correlation:
κ e i , e j = β κ c e i , e j + 1 β κ m e i , e j 0,1 , β 0,1
  • Calculating contextual semantic correlation s κ c ( e i , e j ) based on Skip-gram.
The principle of the Skip-gram model lies in the use of central words to predict their contextual correlations [118]. In this study, the full text of patent text is utilized to train the Skip-gram model so as to obtain the contextual semantic vectors of all the relevant knowledge elements [119]. After the training is completed, the contextual semantic correlations between the knowledge elements can be calculated based on the following normalized cosine similarity:
κ c ( e i , e j ) = ( v i v j v i v j + 1 ) / 2 [ 0 ,   1 ]
In the formula, e represents the knowledge element and v represents the corresponding contextual semantic vector.
  • Computing morphosyntactic correlations κ s n ( s , t ) based on string kernel [120].
The construction of terms ensures the existence of morphosyntactic correlations between knowledge elements. String kernel refers to a function that is used to compute the similarity between strings by mapping the substrings of the strings into a high-dimensional feature space, where the similarity between the strings is computed using the inner product in the feature space:
κ s n ( s , t ) = ϕ n ( s ) ϕ n ( t ) ϕ n s ϕ n t [ 0 ,   1 ]
ϕn(s) and ϕn(t) represent the feature vectors of strings s and t in the feature space constructed by the substrings of length n. Since the length of Chinese terminology vocabulary is usually short and the size of the character set is large, this study will use string kernels with n = 1 and n = 2 and weight both of them to obtain the morphosyntactic correlations between the knowledge elements:
κ m ( e i , e j ) = α κ s 1 ( e i , e j ) + ( 1 α ) κ s 2 ( e i , e j ) [ 0 ,   1 ]
In the formula, α ∈ [0, 1] is the adjustable weight parameter.
So far, the G-Searcher inference process based on knowledge reuse is completed, and the system architecture search and recommendation of the existing relevant SG-TCO instances is realized.

5.2.2. G-Connector Based on Knowledge Generalization

G-Connector is a knowledge generalization-based reasoning approach that speculates on the system architecture for a given collection of discrete physical structures based on connection patterns between physical structures learned from training data:
{ S i } i = 1 I S G - TCO , s . t . i , S i S , V c ( G - TCO ) = { S i } i = 1 I
The G-Connector inference generation process is divided into two parts: component semantic graph generation and module division.
The key to constructing the component semantic graph lies in judging the strength of semantic correlations between the underlying physical structures, and this study adopts contextual semantic correlations to calculate the strength of semantic correlations between the underlying physical structures:
A ^ i , j = A ^ j , i = κ c ( S i , S j ) , i j , 0 , i = j .
Once the semantic component graph is constructed, the community detection algorithm (in Section 3) is utilized to delineate the modules in it, thus constructing a complete example of the system architecture.

6. Platform Development, Case Study, and Discussion

6.1. Training and Case Study of ConceptTM

6.1.1. Training and Evaluation of ConceptTM

In order to obtain the best performance, several ConceptTM models are trained under different hyperparameter settings, and then the best one is selected according to the evaluation index. In ConceptTM, the hyperparameters that have the greatest impact on the performance of the model are the number of topics in the technical–physical part |ZS| and the number of topics in the physical–functional part |ZF|. The performance of ConceptTM is evaluated using the degree of perplexity [68] and coherence [115]. The degree of perplexity refers to the degree of unfamiliarity of the model with the data sample set. Coherence refers to the degree to which the model fits the data sample set. The smaller the model perplexity and the higher the coherence, the better the performance. The specific index calculation method can refer to the references. A total of 80% of the data in KB is used as the model training set, and the remaining 20% is used as the model test set. During the training process, 7 models of the technical–physical part and 7 models of the physical–functional part were obtained, respectively, so a total of 49 ConceptTM models were obtained. All the model instances are trained on the training set, and the evaluation index is calculated on the test set. The results are shown in Figure 7 and Figure 8.
It can be seen from the evaluation results that the concepttm model with the best performance should take |ZS| = 10 and |ZF| = 50.

6.1.2. A Case Study of ConceptTM

Take a simple robot design as an example. Firstly, the technical field (IPC code) of the robot is determined as A61B. Secondly, set four related functional requirements, including “robot”, “material”, “grab”, and “move”, to indicate the need to design a mobile robot for grabbing materials. The required discrete physical structure set size is 10. Under the above settings, the reasoning results obtained by ConceptTM through Gibbs sampling and MAP are shown in Table 1.
Since a large number of samples can be obtained through Gibbs sampling, the table shows the last three samples taken from 100 rounds of sampling. It can be seen that Gibbs sampling is more diversified, and MAP can provide more stable results. Most of the physical structures obtained belong to mechanical parts (base, support, shell, various “boxes” and “plates”), which can be used to assemble the body of the robot. “Motor”, “servo motor”, and “charging port” can be used as mechanical power and energy sources. The robot posture can be measured by “angle sensor”, and can be adjusted by various driving media (spring, slider, guide rail, etc.). It can be proved that ConceptTM can realize the generation of physical structure, which means that ConceptTM can indeed provide practical design inspiration.
Next, verify that ConceptTM can realize the integration of technical fields. ConceptTM can perform explicit technology field integration, mainly because it clearly divides the technology fields from other design concepts based on the three-domain model. Still taking the robot design as an example, in addition to the basic electromechanical execution ability, the current high-tech robots are expected to have interactive perception ability. Therefore, in addition to the above basic technical fields and functional requirements, we also added the settings related to vision (G06K was added in the technical field, and “vision” and “image” were added in the functional requirements). The results are shown in Table 2.
In order to stabilize the results and facilitate comparison, only the knowledge reasoning method based on map estimation is used here. It can be seen that in addition to the general electromechanical components previously obtained, “camera” and “controller” are added to the reasoning results. ConceptTM does add the physical structure of visual processing to the robot as needed, which means that the implementation of technology integration in ConceptTM is successful.

6.2. Platform Development

Intelligent Design Assistant (IDA), a generic-field design knowledge service system, was developed by integrating the knowledge base and the generative generic-field design method obtained in this study into the software, and adopting a convenient user interaction interface design. The main operation interface of the software is shown in Figure 9. Inspired by ChatGPT, IDA adopts the method of “Dialog Interaction, Command Input”, which allows designers to input commands in a certain format to have a dialog with IDA. The main operation interface of IDA mainly consists of two parts: the dialog display part and the command control part.
There are four access objects in IDA, namely:
  • Access object @IDA, which is used for the introduction of the system;
  • Access object @ConceptTM, which is used to call ConceptTM reasoning interface;
  • Access object @GDesigner, for calling GDesigner reasoning interface;
  • Access object @PSearcher for patent searching.

6.2.1. @ConceptTM Command System

Entering the command “@ConceptTM: Help”, the system replies are shown in Figure 10. What can be seen is that in addition to the help command, @ConceptTM also gives two other commands, namely the MAP command and the Gibbs command. The results of MAP inference are fixed under the same command, while the results of Gibbs sampling are more diverse.

6.2.2. @G Designer Command System

Entering the command “@GDesigner: Help”, the system replies are shown in Figure 11. What can be seen is that besides the help command, @GDesigner also gives four other commands, which are the Set Mode command, the G-TCO command, the Search command, and the Connect command.
  • The “Modes” are the methods used by G-Designer to calculate the correlation between physical structures, and there are three of them: contextual semantic correlation (the default method), morphological semantic correlation, and synthesized semantic correlation.
  • The G-TCO command is the retrieval and visualization of SG-TCO instances from G-Base, idx is the number of instances to be viewed, and alg is the module partitioning method used (Louvain/Infomap). The system response to the command will be divided into two parts, the component semantic graph of the instance and the individual module subgraphs.
  • An example of G-Seacher reasoning is shown in Figure 12. The system reply contains the number (“#37764”) and the title (“A New All-in-One Microbiological Testing Device”) of the corresponding instance, and the results of the specific SG-TCO instance visualization can be obtained via the instance view command.
  • G-Connector inference uses the correlation calculation method set by “Set Mode” to calculate the correlation relationship between the provided physical structures and control a certain threshold (eps), and then utilizes the community detection method to classify the sub-modules therein.

6.2.3. @PSearcher Command System

Entering the command “@PSearcher: Help”, and the system replies are shown in Figure 13. In addition to the help command, @PSearcher also gives two other commands, namely, the patent search command “Search” and the patent content display command “Patent #”.
  • The patent search command is used to search for patent documents that contain key conceptual elements. The corresponding patent number, title, and key design concepts are given in the system response.
  • Once the number of a particular patent is obtained, designers can use the display command to view the specific contents of the patent, including the textual parts and abstract drawings.

6.3. Case Study

The application of the methodology and software system proposed in this study is demonstrated by the design of “a product that helps people to move objects”. The scope of the original requirement is very broad, and only one basic requirement is mentioned: “to move objects”. The designer can use “moving” as the basic functional requirement concept to explore the requirements and further define the basic function of the product: it can move freely and handle objects, and its corresponding general physical form is a vehicle with robotic arms and manipulators. At the same time, the technical fields of the product to be designed are identified as B25J (mainly related to industrial robots) and B62D (mainly related to vehicle construction).
Based on the acquired basic technical fields, physical structures, and functional requirements, the designer can utilize PSearcher to conduct patent searches. They can run the command “@PSearcher: Search maxn = 10 tech B25J B62D phys robot robot arm vehicle func handling handling robot handling vehicle”, and its system reply is shown in Figure 14.
Based on the above results, the designer can run the command “@PSearcher: Patent #idx” to view the profile of the relevant patent for inspiration. Figure 15 shows the brief introduction and figures for patent #135233. It can be seen that the patent fits the basic features of the product to be designed: an on-board robotic arm and a gripper, thanks to the accuracy of the underlying design concepts derived from the above reasoning.
Patent #135233 can be used as a reference for the basic form of the product to be designed, but it does not describe the physical architecture of the key component, the gripper, so the next step is to consider using a generative approach to complete the design of this component. Using ConceptTM to generate a discrete set of physical structures and running the command “@ConceptTM: MAP maxn = 10 tech B25J func Gripping Release Clamping Driving Torque”, the system response is shown in Figure 16.
Among the 10 physical structures, some of them are selected as the core components. After obtaining the set of discrete physical structures, G-Designer is utilized to generate the architecture by running the command “@GDesigner: Connect eps = 0.8 alg = louvain phys Motor Worm Gear Clamp Claw Fixed Pin Bracket Shaft” and the result is shown in Figure 17.
It can be seen that the “mechanical claw” mainly includes two Louvain modules, one is the power module (“motor”, “shaft”, “worm”, and “bracket”), and the other is the clamping claw module (“clamping claw”, “fixing pin”, and “gear”). The basic working principle of the “mechanical claw” is that the “motor” drives the “shaft” to rotate, the “shaft” drives the “worm”, the “worm” drives the “gear” (according to the general mechanical structure, this actually refers to the “turbine”), the “gear” drives the “clamping claw”, and the “bracket” and “fixing pin” play the role of fixation and support.
In order to show the final product more clearly, Figure 18 shows the Lego3D model of the “mechanical claw” (some connecting and supporting parts on the view side have been removed in order to see the internal structure), and Figure 19 shows the Lego3D model of the final product “Mechanical Moving Trolley” and its three views. It should be noted that the Lego model does not have any obvious “motor” parts, but the first and second transmission parts that should be connected to the motor are marked in red and orange, such as the “shaft”, “worm”, and “gear” (“turbine”) in the “mechanical claw”.

6.4. Discussion

(1) Performance of ConceptTM
ConceptTM adopts the PGM, especially the infrastructure of the topic model, to systematically express the complex and implicit correlations among physical structures. The advantages of ConceptTM’s features and capabilities are summarized as follows:
  • Based on the topic model architecture, ConceptTM can express global coupling correlations among knowledge elements, which is superior to general fragmented knowledge representation models.
  • With the help of design theory and cognitive theory, ConceptTM reveals the causal relationships among three basic types of design concepts, i.e., technical fields, physical structures, and functional requirements.
  • ConceptTM is able to perform the topic clustering of physical structures, thus mining the internal features of design concepts. Further, it is capable of accomplishing the generation of discrete physical structure collections with implicit correlations for the conceptual design of physical architectures.
  • ConceptTM is capable of explicitly performing technical field fusion, providing a source of innovation for engineering design.
(2) Performance of G-Designer
G-Base obtained a total of 38,473 SG-TCO instances, 149,270 Louvain community modules, 143,397 Infomap community modules, and 544,714 cluster modules. On average, each SG-TCO instance has 3.88 Louvain community modules, 3.73 Infomap community modules, and 14.16 cluster modules.
In the case study, the ability to obtain a clear division of labor between modules is not only the ability of the community detection algorithm, but also an effect of the component semantic graph that captures the underlying physical structures in a certain way. The successful realization of the generative design example proves the following three points: first, the reasonableness of the SG-TCO representation of the product hierarchical system architecture; second, the effectiveness of the G-Base construction method; and third, the feasibility and innovation of the G-Infer architecture generation method.

7. Conclusions

In summary, this study solved a series of problems of generative generic-field design in the context of large language model (LLM) and Crowd Intelligent Innovation (CII). In addition, by improving the utilization efficiency of data resources and putting forward the innovation mechanism in line with design cognition, this research promotes the sustainable innovation generation mode in the field of engineering design [121].
By analyzing the actual needs of designers, combining cognitive science and knowledge engineering, and based on the design cognitive process model, this study proposes a generative generic-field design method. ConcepttTM is based on the Bayesian brain hypothesis. The technical field provides a priori and functional requirements providing the likelihood to complete Bayesian reasoning, and generates a set of discrete physical structures with implicit systematic correlation. G-Designer constructs the architecture knowledge base of patents based on the improved SG-TCO model using a graph algorithm, and generates the system architecture for a given set of discrete physical structures based on the mechanisms of “knowledge reuse” and “knowledge generalization”.
In the course of this research, the development of AI technology has reached a new level: the emergence of ChatGPT has shown the world that “the future is here”. However, ChatGPT is a general-purpose language model, which only has advantages in general text understanding and generation.
In the field of engineering design, generative design based on LLM still has two shortcomings: the design results are considered to lack feasibility, and the output of the model lacks empathy with the designer and design context. The design process has poor interpretability and lacks support from the design cognitive theory. Compared with related models such as ChatGPT and BingChat, the generative generic-field design method proposed in this study can summarize the following three contributions in terms of originality, theoretical significance, and practical significance:
  • In the fields of conceptual design and engineering design, the research gap of generative generic-field design methods has been supplemented.
  • Proposed a design framework that conforms to the new context of design cognition, formed a knowledge representation model and process model that supports generative generic-field design, and reflected the completion and empathy of fuzzy original requirements.
  • Implemented generative design for generic-field problems, outputting a set of design structures containing modular themes and implicit associations, as well as design result layouts and system architectures that meet engineering feasibility.
This does not mean that similar work cannot be carried out in the field of engineering design; therefore, this paper concludes with the following research outlook:
  • The construction of a large professional knowledge base: The knowledge base related to engineering design currently displayed in the academic world is still “too small”, which includes the problem of the type of knowledge in addition to the insufficiency of quantity. Engineering design itself is a task involving rich types of knowledge, including the knowledge of natural language, image visualization, symbolic formulas, computer 2D/3D models, and so on. Therefore, the construction of a large quantity and high-quality multimodal engineering design knowledge base must be a future consideration in this field.
  • Realization and innovation of specialized knowledge reasoning methods: At a time when AI technology is so advanced, it has provided relevant researchers with a rich variety of algorithms that can be used; however, generalized methods do not necessarily fully meet the needs of the task of engineering design. Therefore, the development of specialized knowledge reasoning methods in line with engineering design remains a must for future research, with generative methods being a top priority. At the same time, in order to compare and evaluate the advantages and disadvantages of different reasoning methods, the engineering design field also needs to construct the corresponding evaluation index system and benchmark dataset, which lays the foundation for the synergistic development of reasoning methods.
  • The development and application of interaction technology: Although engineering designers have certain professional abilities, more convenient interaction technology can completely reduce the issues of engineering designers in the learning of design tools, applications, and other cognitive and use loads [122]. This will greatly improve work efficiency, to a certain extent, due to reducing the burden on the brain which can also stimulate the innovation ability of designers. Based on this consideration, therefore, continuing to explore the development and application of interactive technology in the field of engineering design is also one of the directions that can be continuously focused on in the future.

Author Contributions

Conceptualization, Z.M. and L.G.; methodology, Z.M. and M.Z.; software, Z.M. and M.Z.; validation, Z.M. and M.Z.; formal analysis, M.Z; investigation, Z.M.; data curation, M.Z.; writing—original draft preparation, Z.M.; writing—review and editing, J.L.; visualization, Z.M.; supervision, L.G.; project administration, L.G.; funding acquisition, L.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number No.52375229, Theory and Method of Collective Intelligence Innovative Design Space Collaborative Exploration and Innovation Generation.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Filippi, S. Measuring the impact of ChatGPT on fostering concept generation in innovative product design. Electronics 2023, 12, 3535. [Google Scholar] [CrossRef]
  2. Strong, D.M.; Tulu, B.; Agu, E.; Pedersen, P.C. Search and evaluation of coevolving problem and solution spaces in a complex healthcare design science research project. IEEE Trans. Eng. Manag. 2020, 70, 912–926. [Google Scholar] [CrossRef] [PubMed]
  3. Pataranutaporn, P.; Danry, V.; Leong, J.; Punpongsanon, P.; Novy, D.; Maes, P.; Sra, M. AI-generated characters for supporting personalized learning and well-being. Nat. Mach. Intell. 2021, 3, 1013–1022. [Google Scholar] [CrossRef]
  4. Jung, J.; Kleinsmann, M.; Snelders, D. A vision for design in the era of collective computing. J. Eng. Des. 2022, 33, 305–342. [Google Scholar] [CrossRef]
  5. Ruffino, R. Sustainable Design: Aspects of Sustainable Product Development. Bioresources 2021, 16, 6548–6550. [Google Scholar] [CrossRef]
  6. Han, J.; Jiang, P.; Childs, P.R. Metrics for measuring sustainable product design concepts. Energies 2021, 14, 3469. [Google Scholar] [CrossRef]
  7. Cash, P.; Gonçalves, M.; Dorst, K. Method in their madness: Explaining how designers think and act through the cognitive co-evolution model. Des. Stud. 2023, 88, 101219. [Google Scholar] [CrossRef]
  8. Martinec, T.; Škec, S.; Perišić, M.M.; Štorga, M. Revisiting problem-solution co-evolution in the context of team conceptual design activity. Appl. Sci. 2020, 10, 6303. [Google Scholar] [CrossRef]
  9. Luo, J. Data-driven innovation: What is it? IEEE Trans. Eng. Manag. 2022, 70, 784–790. [Google Scholar] [CrossRef]
  10. Andrade, S.R.; Walsh, H.S. Discovering a failure taxonomy for early design of complex engineered systems using natural language processing. J. Comput. Inf. Sci. Eng. 2023, 23, 31001. [Google Scholar] [CrossRef]
  11. Tan, L.; Zhang, H. An approach to user knowledge acquisition in product design. Adv. Eng. Inform. 2021, 50, 101408. [Google Scholar] [CrossRef]
  12. Goldberg, D.M.; Abrahams, A.S. Sourcing product innovation intelligence from online reviews. Decis. Support. Syst. 2022, 157, 113751. [Google Scholar] [CrossRef]
  13. Li, X.; Chen, C.; Zheng, P.; Jiang, Z.; Wang, L. A context-aware diversity-oriented knowledge recommendation approach for smart engineering solution design. Knowl.-Based Syst. 2021, 215, 106739. [Google Scholar] [CrossRef]
  14. Church, K.W. Word2Vec. Nat. Lang. Eng. 2017, 23, 155–162. [Google Scholar] [CrossRef]
  15. Ethayarajh, K. How contextual are contextualized word representations? Comparing the geometry of BERT, ELMo, and GPT-2 embeddings. arXiv 2019, arXiv:1909.00512. [Google Scholar]
  16. Tian, Y.; Sun, C.; Poole, B.; Krishnan, D.; Schmid, C.; Isola, P. What makes for good views for contrastive learning? Adv. Neural Inf. Process. Syst. 2020, 33, 6827–6839. [Google Scholar]
  17. Harshvardhan, G.M.; Gourisaria, M.K.; Pandey, M.; Rautaray, S.S. A comprehensive survey and analysis of generative models in machine learning. Comput. Sci. Rev. 2020, 38, 100285. [Google Scholar]
  18. Ramesh, A.; Dhariwal, P.; Nichol, A.; Chu, C.; Chen, M. Hierarchical text-conditional image generation with clip latents. arXiv 2022, arXiv:2204.06125. [Google Scholar]
  19. Saharia, C.; Chan, W.; Saxena, S.; Li, L.; Whang, J.; Denton, E.L.; Ghasemipour, K.; Gontijo Lopes, R.; Karagol Ayan, B.; Salimans, T. Photorealistic text-to-image diffusion models with deep language understanding. Adv. Neural Inf. Process. Syst. 2022, 35, 36479–36494. [Google Scholar]
  20. Silver, D.; Huang, A.; Maddison, C.J.; Guez, A.; Sifre, L.; Van Den Driessche, G.; Schrittwieser, J.; Antonoglou, I.; Panneershelvam, V.; Lanctot, M. Mastering the game of Go with deep neural networks and tree search. Nature 2016, 529, 484–489. [Google Scholar] [CrossRef]
  21. Silver, D.; Schrittwieser, J.; Simonyan, K.; Antonoglou, I.; Huang, A.; Guez, A.; Hubert, T.; Baker, L.; Lai, M.; Bolton, A. Mastering the game of go without human knowledge. Nature 2017, 550, 354–359. [Google Scholar] [CrossRef] [PubMed]
  22. Silver, D.; Hubert, T.; Schrittwieser, J.; Antonoglou, I.; Lai, M.; Guez, A.; Lanctot, M.; Sifre, L.; Kumaran, D.; Graepel, T. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 2018, 362, 1140–1144. [Google Scholar] [CrossRef] [PubMed]
  23. Schrittwieser, J.; Antonoglou, I.; Hubert, T.; Simonyan, K.; Sifre, L.; Schmitt, S.; Guez, A.; Lockhart, E.; Hassabis, D.; Graepel, T. Mastering atari, go, chess and shogi by planning with a learned model. Nature 2020, 588, 604–609. [Google Scholar] [CrossRef] [PubMed]
  24. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
  25. Bryant, P.; Pozzati, G.; Elofsson, A. Improved prediction of protein-protein interactions using AlphaFold2. Nat. Commun. 2022, 13, 1265. [Google Scholar] [CrossRef]
  26. Clyde, A. AI for science and global citizens. Patterns 2022, 11, 3. [Google Scholar] [CrossRef]
  27. Li, X.; Chen, C.; Zheng, P.; Wang, Z.; Jiang, Z.; Jiang, Z. A knowledge graph-aided concept–knowledge approach for evolutionary smart product–service system development. J. Mech. Des. 2020, 142, 101403. [Google Scholar] [CrossRef]
  28. Liu, Q.; Wang, K.; Li, Y.; Liu, Y. Data-driven concept network for inspiring designers’ idea generation. J. Comput. Inf. Sci. Eng. 2020, 20, 31004. [Google Scholar] [CrossRef]
  29. Wang, Z.; Zhang, B.; Gao, D. A novel knowledge graph development for industry design: A case study on indirect coal liquefaction process. Comput. Ind. 2022, 139, 103647. [Google Scholar] [CrossRef]
  30. Hao, J.; Zhao, L.; Milisavljevic-Syed, J.; Ming, Z. Integrating and navigating engineering design decision-related knowledge using decision knowledge graph. Adv. Eng. Inform. 2021, 50, 101366. [Google Scholar] [CrossRef]
  31. Jia, J.; Zhang, Y.; Saad, M. An approach to capturing and reusing tacit design knowledge using relational learning for knowledge graphs. Adv. Eng. Inform. 2022, 51, 101505. [Google Scholar] [CrossRef]
  32. Liu, Q.; Wang, K.; Li, Y.; Chen, C.; Li, W. A novel function-structure concept network construction and analysis method for a smart product design system. Adv. Eng. Inform. 2022, 51, 101502. [Google Scholar] [CrossRef]
  33. Chen, T.; Mohanty, R.R.; Krishnamurthy, V.R. Queries and cues: Textual stimuli for reflective thinking in digital mind-mapping. J. Mech. Des. 2022, 144, 21402. [Google Scholar] [CrossRef]
  34. Jiang, S.; Luo, J.; Ruiz-Pava, G.; Hu, J.; Magee, C.L. Deriving design feature vectors for patent images using convolutional neural networks. J. Mech. Des. 2021, 143, 61405. [Google Scholar] [CrossRef]
  35. Bai, C.; Chen, J.; Ma, Q.; Hao, P.; Chen, S. Cross-domain representation learning by domain-migration generative adversarial network for sketch based image retrieval. J. Vis. Commun. Image R. 2020, 71, 102835. [Google Scholar] [CrossRef]
  36. Wang, Y.; Blache, R.; Zheng, P.; Xu, X. A knowledge management system to support design for additive manufacturing using Bayesian networks. J. Mech. Des. 2018, 140, 51701. [Google Scholar] [CrossRef]
  37. Hsu, H.; Chang, S.; Chen, C.; Wu, I. Knowledge-based system for resolving design clashes in building information models. Autom. Constr. 2020, 110, 103001. [Google Scholar] [CrossRef]
  38. Koziel, S.; Pietrenko-Dabrowska, A. Knowledge-based performance-driven modeling of antenna structures. Knowl.-Based Syst. 2022, 237, 107698. [Google Scholar] [CrossRef]
  39. Zhang, J.; Cao, G.; Peng, Q.; Tan, R.; Liu, W.; Zhang, H. A systematic knowledge-based method for design of transformable product. Adv. Eng. Inform. 2022, 52, 101638. [Google Scholar] [CrossRef]
  40. Sarazin, A.; Bascans, J.; Sciau, J.; Song, J.; Supiot, B.; Montarnal, A.; Lorca, X.; Truptil, S. Expert system dedicated to condition-based maintenance based on a knowledge graph approach: Application to an aeronautic system. Expert. Syst. Appl. 2021, 186, 115767. [Google Scholar] [CrossRef]
  41. Mladineo, M.; Zizic, M.C.; Aljinovic, A.; Gjeldum, N. Towards a knowledge-based cognitive system for industrial application: Case of personalized products. J. Ind. Inf. Integr. 2022, 27, 100284. [Google Scholar] [CrossRef]
  42. Long, X.; Li, H.; Du, Y.; Mao, E.; Tai, J. A knowledge-based automated design system for mechanical products based on a general knowledge framework. Expert. Syst. Appl. 2021, 178, 114960. [Google Scholar] [CrossRef]
  43. Kim, D.; Jeong, D.; Seo, Y. Automated composition and execution of web-based simulation systems through knowledge designing and reasoning. Adv. Eng. Inform. 2021, 48, 101263. [Google Scholar] [CrossRef]
  44. Zhong, D.; Fan, J.; Yang, G.; Tian, B.; Zhang, Y. Knowledge management of product design: A requirements-oriented knowledge management framework based on Kansei engineering and knowledge map. Adv. Eng. Inform. 2022, 52, 101541. [Google Scholar] [CrossRef]
  45. Zheng, C.; An, Y.; Wang, Z.; Qin, X.; Yu, F. Application of configuration principle on knowledge-based engineering for manufacturing system design. Procedia CIRP 2021, 104, 1378–1383. [Google Scholar] [CrossRef]
  46. Sarica, S.; Luo, J.; Wood, K.L. TechNet: Technology semantic network based on patent data. Expert. Syst. Appl. 2020, 142, 112995. [Google Scholar] [CrossRef]
  47. Siddharth, L.; Blessing, L.T.; Wood, K.L.; Luo, J. Engineering knowledge graph from patent database. J. Comput. Inf. Sci. Eng. 2022, 22, 21008. [Google Scholar] [CrossRef]
  48. Luo, J.; Sarica, S.; Wood, K.L. Guiding data-driven design ideation by knowledge distance. Knowl.-Based Syst. 2021, 218, 106873. [Google Scholar] [CrossRef]
  49. Liu, H.; Li, W.; Li, Y. A new computational method for acquiring effect knowledge to support product innovation. Knowl.-Based Syst. 2021, 231, 107410. [Google Scholar] [CrossRef]
  50. Ye, F.; Fu, T.; Gong, L.; Gao, J. Cross-domain knowledge discovery based on knowledge graph and patent mining. J. Phys. Conf. Ser. 2021, 1744, 42155. [Google Scholar] [CrossRef]
  51. Chen, C.; Tao, Y.; Li, Y.; Liu, Q.; Li, S.; Tang, Z. A structure-function knowledge extraction method for bio-inspired design. Comput. Ind. 2021, 127, 103402. [Google Scholar] [CrossRef]
  52. Cao, G.; Sun, Y.; Tan, R.; Zhang, J.; Liu, W. A function-oriented biologically analogical approach for constructing the design concept of smart product in Industry 4.0. Adv. Eng. Inf. 2021, 49, 101352. [Google Scholar] [CrossRef]
  53. Zhang, Z.; Jin, Y. An Unsupervised Deep Learning Model to Discover Visual Similarity Between Sketches for Visual Analogy Support, International Design Engineering Technical Conferences and Computers and Information in Engineering Conference; American Society of Mechanical Engineers: New York, NY, USA, 2020; p. V8T. [Google Scholar]
  54. Narsale, S.; Chen, Y.; Mohan, M.; Shah, J.J. Design ideator: A conceptual design toolbox. J. Comput. Inf. Sci. Eng. 2019, 19, 41007. [Google Scholar] [CrossRef]
  55. Graeff, E.; Maranzana, N.; Aoussat, A. Linkage, an online tool to support interdisciplinary biomimetic design teams. J. Mech. Des. 2021, 143, 101401. [Google Scholar] [CrossRef]
  56. Hsu, Y.; Chaing, Y. The Strategic Advantages of Artificial Intelligence System for Product Design Teams with Diverse Cross-Domain Knowledge, Cross-Cultural Design. In Proceedings of the Experience and Product Design Across Cultures: 13th International Conference, CCD 2021, Held as Part of the 23rd HCI International Conference, HCII 2021, Virtual Event, 24–29 July 2021; Proceedings, Part I 23, 2021. Springer: Berlin/Heidelberg, Germany, 2021; pp. 408–419. [Google Scholar]
  57. Mahmood, O.; Mansimov, E.; Bonneau, R.; Cho, K. Masked graph modeling for molecule generation. Nat. Commun. 2021, 12, 3156. [Google Scholar] [CrossRef]
  58. Zang, C.; Wang, F. Moflow: An invertible flow model for generating molecular graphs. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Virtual Event, 6–10 July 2020; pp. 617–626. [Google Scholar]
  59. Luo, Y.; Yan, K.; Ji, S. Graphdf: A discrete flow model for molecular graph generation. In Proceedings of the International Conference on Machine Learning PMLR, Virtual Event, 18–24 July 2021; pp. 7192–7203. [Google Scholar]
  60. Wen, Q.; Zhu, R. Automatic generation of 3D building models based on line segment vectorization. Math. Probl. Eng. 2020, 2020, 8360706. [Google Scholar] [CrossRef]
  61. Chang, K.; Cheng, C.; Luo, J.; Murata, S.; Nourbakhsh, M.; Tsuji, Y. Building-GAN: Graph-conditioned architectural volumetric design generation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 11956–11965. [Google Scholar]
  62. Sydora, C.; Stroulia, E. Rule-based compliance checking and generative design for building interiors using BIM. Autom. Constr. 2020, 120, 103368. [Google Scholar] [CrossRef]
  63. Ganin, Y.; Bartunov, S.; Li, Y.; Keller, E.; Saliceti, S. Computer-aided design as language. Adv. Neural Inf. Process. Syst. 2021, 34, 5885–5897. [Google Scholar]
  64. Raina, A.; Cagan, J.; McComb, C. Design strategy network: A deep hierarchical framework to represent generative design strategies in complex action spaces. J. Mech. Des. 2022, 144, 21404. [Google Scholar] [CrossRef]
  65. Wang, J.; Chen, W.W.; Da, D.; Fuge, M.; Rai, R. IH-GAN: A conditional generative model for implicit surface-based inverse design of cellular structures. Comput. Method. Appl. M. 2022, 396, 115060. [Google Scholar] [CrossRef]
  66. Lee, M.; Park, Y.; Jo, H.; Kim, K.; Lee, S.; Lee, I. Deep generative tread pattern design framework for efficient conceptual design. J. Mech. Des. 2022, 144, 71703. [Google Scholar] [CrossRef]
  67. Zheng, C.; Xing, J.; Wang, Z.; Qin, X.; Eynard, B.; Li, J.; Bai, J.; Zhang, Y. Knowledge-based program generation approach for robotic manufacturing systems. Robot. Comput.-Int. Manuf. 2022, 73, 102242. [Google Scholar] [CrossRef]
  68. Papadimitriou, C.H.; Tamaki, H.; Raghavan, P.; Vempala, S. Latent semantic indexing:a probabilistic analysis. J. Comput. Syst. Sci. 2000, 61, 217–235. [Google Scholar] [CrossRef]
  69. Blei, D.M. Probabilistic topic models. Commun. ACM 2012, 55, 77–84. [Google Scholar] [CrossRef]
  70. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 2003, 993–1022. [Google Scholar]
  71. Blei, D.M.; Lafferty, J.D. Correlated topic models. Adv. Neural Inf. Process. Syst. 2006, 2006, 147. [Google Scholar]
  72. Blei, D.M.; Lafferty, J.D. Dynamic topic models. In Proceedings of the 23rd International Conference on Machine Learning, Pittsburg, PA, USA, 25–29 June 2006. [Google Scholar]
  73. Xu, H.; Wang, W.; Liu, W.; Carin, L. Distilled wasserstein learning for word embedding and topic modeling. In Proceedings of the 32nd Conference on Neural Information Processing Systems, Montreal, QC, Canada, 2–8 December 2018; Volume 31. [Google Scholar]
  74. Yan, X.; Guo, J.; Lan, Y.; Cheng, X. A biterm topic model for short texts. In Proceedings of the 22nd International Conference on World Wide Web, Rio de Janeiro, Brazil, 13–17 May 2013. [Google Scholar]
  75. Liu, G.; Xu, X.; Zhu, Y.; Li, L. An improved latent dirichlet allocation model for hot topic extraction. In Proceedings of the 2014 IEEE Fourth International Conference on Big Data and Cloud Computing, Sydney, Australia, 3–5 December 2014. [Google Scholar]
  76. Boyd-Graber, J.; Resnik, P. Holistic sentiment analysis across languages: Multilingual supervised latent Dirichlet allocation. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, USA, 9–11 October 2010. [Google Scholar]
  77. Rosen-Zvi, M.; Griffiths, T.; Steyvers, M.; Smyth, P. The author-topic model for authors and documents. arXiv 2012, arXiv:1207.4169. [Google Scholar]
  78. McCallum, A.; Corrada-Emmanuel, A.; Wang, X. The author-recipient-topic model for topic and role discovery in social networks: Experiments with enron and academic email. Comput. Sci. Dep. Fac. Publ. Ser. 2005, 2005, 44. [Google Scholar]
  79. Li, D.; Yang, Y.; Tang, H.; Wang, J.; Xu, T.; Wu, W.; Chen, E. VIRT: Improving representation-based models for text matching through virtual interaction. arXiv 2021, arXiv:2112.04195. [Google Scholar]
  80. Peng, S.; Cui, H.; Xie, N.; Li, S.; Zhang, J.; Li, X. Enhanced-RCNN: An efficient method for learning sentence similarity. In Proceedings of the 2020 International Conference on World Wide Web, Taipei, Taiwan, 20–24 April 2020. [Google Scholar]
  81. Xu, C.; Lin, Z.; Wu, S.; Wang, H. Multi-level matching networks for text matching. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Paris, France, 21–25 July 2019. [Google Scholar]
  82. Nguyen, D.Q.; Billingsley, R.; Du, L.; Johnson, M. Improving topic models with latent feature word representations. Trans. Assoc. Comput. Linguist. 2015, 3, 299–313. [Google Scholar] [CrossRef]
  83. Wang, Y.; Zhang, J. Keyword extraction from online product reviews based on bi-directional LSTM recurrent neural network. In Proceedings of the 2017 IEEE International Conference on Industrial Engineering and Engineering Management, Singapore, 10–13 December 2017. [Google Scholar]
  84. Xun, G.; Li, Y.; Zhao, W.X.; Gao, J.; Zhang, A. A correlated topic model using word embeddings. In Proceedings of the International Joint Conference on Artificial Intelligence, Melbourne, Australia, 19–25 August 2017. [Google Scholar]
  85. Sengupta, A.; Paka, W.S.; Roy, S.; Ranjan, G.; Chakraborty, T. An embedding-based joint sentiment-topic model for short texts. In Proceedings of the International AAAI Conference on Web and Social Media, Virtual Event, 7–10 June 2021. [Google Scholar]
  86. Wang, Z.; Ma, L.; Zhang, Y. A hybrid document feature extraction method using latent Dirichlet allocation and word2vec. In Proceedings of the 2016 IEEE first International Conference on Data Science in Cyberspace, Changsha, China, 13–16 June 2016. [Google Scholar]
  87. Kingma, D.P.; Welling, M. Auto-encoding variational bayes. arXiv 2013, arXiv:1312.6114. [Google Scholar]
  88. Srivastava, A.; Sutton, C. Autoencoding variational inference for topic models. arXiv 2017, arXiv:1703.01488. [Google Scholar]
  89. Gou, Z.; Han, L.; Sun, L.; Zhu, J.; Yan, H. Constructing dynamic topic models based on variational autoencoder and factor graph. IEEE Access 2018, 6, 53102–53111. [Google Scholar] [CrossRef]
  90. Lin, T.; Hu, Z.; Guo, X. Sparsemax and relaxed wasserstein for topic sparsity. In Proceedings of the twelfth ACM International Conference on Web Search and Data Mining, Melbourne, Australia, 11–15 February 2019. [Google Scholar]
  91. Feng, J.; Zhang, Z.; Ding, C.; Rao, Y.; Xie, H.; Wang, F.L. Context reinforced neural topic modeling over short texts. Inf. Sci. 2022, 607, 79–91. [Google Scholar] [CrossRef]
  92. Isonuma, M.; Mori, J.; Bollegala, D.; Sakata, I. Tree-structured neural topic model. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Virtual Event, 5–10 July 2020. [Google Scholar]
  93. Liu, L.; Huang, H.; Gao, Y.; Zhang, Y.; Wei, X. Neural variational correlated topic modeling. In Proceedings of the 2019 International Conference on World Wide Web, San Francisco, CA, USA, 13–17 May 2019. [Google Scholar]
  94. Ding, R.; Nallapati, R.; Xiang, B. Coherence-aware neural topic modeling. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October–4 November 2018. [Google Scholar]
  95. Gui, L.; Leng, J.; Pergola, G.; Zhou, Y.; Xu, R.; He, Y. Neural topic model with reinforcement learning. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language, Hong Kong, 3–7 November 2019. [Google Scholar]
  96. Miao, Y.; Grefenstette, E.; Blunsom, P. Discovering discrete latent topics with neural variational inference. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017. [Google Scholar]
  97. Li, S.; Zhang, Y.; Pan, R.; Mao, M.; Yang, Y. Recurrent attentional topic model. In Proceedings of the AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 4–9 February 2017. [Google Scholar]
  98. Lau, J.H.; Baldwin, T.; Cohn, T. Topically driven neural language model. arXiv 2017, arXiv:1704.08012. [Google Scholar]
  99. Kang, G.; Xiao, Y.; Liu, J.; Cao, Y.; Cao, B.; Zhang, X.; Ding, L. Tatt-BiLSTM: Web service classification with topical attention-based BiLSTM. Concurr. Comput. Pract. Exp. 2021, 33, e6287. [Google Scholar] [CrossRef]
  100. Huang, P.; Huang, P.; Ai, W.; Ding, J.; Zhang, J. Latent Topic Attention for Domain Classification. In Proceedings of the 2019 Conference of the International Speech Communication Association, Graz, Austria, 15–19 September 2019. [Google Scholar]
  101. Wang, C.; Wang, B.; Xu, M. Tree-structured neural networks with topic attention for social emotion classification. IEEE Access 2019, 7, 5505–95515. [Google Scholar] [CrossRef]
  102. Wang, X.; Yang, Y. Neural topic model with attention for supervised learning. In Proceedings of the 2020 International Conference on Artificial Intelligence and Statistics, Virtual Event, 26–28 August 2020. [Google Scholar]
  103. Panwar, M.; Shailabh, S.; Aggarwal, M.; Krishnamurthy, B. TAN-NTM: Topic attention networks for neural topic modeling. arXiv 2020, arXiv:2012.01524. [Google Scholar]
  104. Li, I.; Yasunaga, M.; Nuzumlalı, M.Y.; Caraballo, C.; Mahajan, S.; Krumholz, H.; Radev, D. A neural topic-attention model for medical term abbreviation disambiguation. arXiv 2019, arXiv:1910.14076. [Google Scholar]
  105. Newman, M.E. Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA 2006, 103, 8577–8582. [Google Scholar] [CrossRef]
  106. Yon, D.; Frith, C.D. Precision and the Bayesian brain. Curr. Biol. 2021, 31, R1026–R1032. [Google Scholar] [CrossRef] [PubMed]
  107. Shi, X.; Hu, X. Using the support vector machine algorithm to predict β-Turn types in proteins. Engineering 2013, 5, 386–390. [Google Scholar] [CrossRef]
  108. Umeda, Y.; Tomiyama, T.; Yoshikawa, H. FBS modeling: Modeling scheme of function for conceptual design. In Proceedings of the 9th Int. Workshop on Qualitative Reasoning, Amsterdam, The Netherlands, 16–19 May 1995; pp. 271–278. [Google Scholar]
  109. Hofmann, T. Probabilistic latent semantic indexing. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkley, CA, USA, 15–19 August 1999; pp. 50–57. [Google Scholar]
  110. Blondel, V.D.; Guillaume, J.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef]
  111. Rosvall, M.; Axelsson, D.; Bergstrom, C.T. The map equation. Eur. Phys. J. Spec. Top. 2009, 178, 13–23. [Google Scholar] [CrossRef]
  112. Zeng, A.; Liu, X.; Du, Z.; Wang, Z.; Lai, H.; Ding, M.; Yang, Z.; Xu, Y.; Zheng, W.; Xia, X. Glm-130b: An open bilingual pre-trained model. arXiv 2022, arXiv:2210.02414. [Google Scholar]
  113. Du, Z.; Qian, Y.; Liu, X.; Ding, M.; Qiu, J.; Yang, Z.; Tang, J. Glm: General language model pretraining with autoregressive blank infilling. arXiv 2021, arXiv:2103.10360. [Google Scholar]
  114. Řehůřek, R.; Sojka, P. Gensim—Statistical Semantics in Python. Retrieved from Genism. Org. 2011. Available online: https://www.semanticscholar.org/paper/Gensim-Statistical-Semantics-in-Python-Rehurek-Sojka/b55fe23d7290f59d14e51e7813f5950f5ff08b2b (accessed on 6 February 2024).
  115. Röder, M.; Both, A.; Hinneburg, A. Exploring the space of topic coherence measures. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, Shanghai, China, 31 January–6 February 2015; pp. 399–408. [Google Scholar]
  116. Moon, S.K.; Kumara, S.R.; Simpson, T.W. Knowledge representation for product design using techspecs concept ontology. In Proceedings of the IRI-2005 IEEE International Conference on Information Reuse and Integration, Las Vegas, NV, USA, 15–17 August 2005; IEEE: Piscataway, NJ, USA, 2005; pp. 241–246. [Google Scholar]
  117. Traag, V.A.; Waltman, L.; Van Eck, N.J. From Louvain to Leiden: Guaranteeing well-connected communities. Sci. Rep. 2019, 9, 5233. [Google Scholar] [CrossRef]
  118. Teramoto, R.; Aoki, M.; Kimura, T.; Kanaoka, M. Prediction of siRNA functionality using generalized string kernel and support vector machine. FEBS Lett. 2005, 579, 2878–2882. [Google Scholar] [CrossRef]
  119. Lavancier, F.; Møller, J.; Rubak, E. Determinantal point process models and statistical inference. J. R. Stat. Soc. Ser. B: Stat. Methodol. 2015, 77, 853–877. [Google Scholar] [CrossRef]
  120. Yang, C.S.; Li, T.; Cao, Z. Pairwise diversity measures based selective ensemble method. Pattern Recognit. Artif. Intell. 2010, 23, 265–571. [Google Scholar]
  121. Hernandez, R.J.; Goñi, J. Responsible design for sustainable innovation: Towards an extended design process. Processes 2020, 8, 1574. [Google Scholar] [CrossRef]
  122. Lee, J.H.; Ostwald, M.J. The impacts of digital design platforms on design cognition during remote collaboration: A systematic review of protocol studies. Heliyon 2022, 8, e11247. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Generative reasoning framework for generic-field design.
Figure 1. Generative reasoning framework for generic-field design.
Sustainability 16 09841 g001
Figure 3. Different representations of lighting systems.
Figure 3. Different representations of lighting systems.
Sustainability 16 09841 g003
Figure 4. Research framework.
Figure 4. Research framework.
Sustainability 16 09841 g004
Figure 5. Graphical representation of ConceptTM.
Figure 5. Graphical representation of ConceptTM.
Sustainability 16 09841 g005
Figure 6. Graphical characterization of ConceptTM.
Figure 6. Graphical characterization of ConceptTM.
Sustainability 16 09841 g006
Figure 7. The number of topics in the technical–physical part |ZS|.
Figure 7. The number of topics in the technical–physical part |ZS|.
Sustainability 16 09841 g007
Figure 8. The number of topics in the physical–functional part |ZF|.
Figure 8. The number of topics in the physical–functional part |ZF|.
Sustainability 16 09841 g008
Figure 9. Main operation interface of IDA.
Figure 9. Main operation interface of IDA.
Sustainability 16 09841 g009
Figure 10. System replies to “@ConceptTM: Help”.
Figure 10. System replies to “@ConceptTM: Help”.
Sustainability 16 09841 g010
Figure 11. System replies to “@GDesigner: Help”.
Figure 11. System replies to “@GDesigner: Help”.
Sustainability 16 09841 g011
Figure 12. An example of a G-Searcher.
Figure 12. An example of a G-Searcher.
Sustainability 16 09841 g012
Figure 13. System replies of “@PSearcher: Help”.
Figure 13. System replies of “@PSearcher: Help”.
Sustainability 16 09841 g013
Figure 14. System replies of “@PSearcher: Search” of this case.
Figure 14. System replies of “@PSearcher: Search” of this case.
Sustainability 16 09841 g014
Figure 15. The brief introduction and figures for patent #135233.
Figure 15. The brief introduction and figures for patent #135233.
Sustainability 16 09841 g015
Figure 16. The generated structure set of this case.
Figure 16. The generated structure set of this case.
Sustainability 16 09841 g016
Figure 17. The generated system architecture of this case.
Figure 17. The generated system architecture of this case.
Sustainability 16 09841 g017
Figure 18. The Lego3D model of the “mechanical claw”.
Figure 18. The Lego3D model of the “mechanical claw”.
Sustainability 16 09841 g018
Figure 19. The Lego3D model of the “Mechanical Moving Trolley”.
Figure 19. The Lego3D model of the “Mechanical Moving Trolley”.
Sustainability 16 09841 g019
Table 1. The outputs of ConceptTM in the design case “a mobile robot for grabbing materials”.
Table 1. The outputs of ConceptTM in the design case “a mobile robot for grabbing materials”.
Technical FieldA61B
Functional RequirementsRobot Material Grab Move
Reasoning MethodsGibbs SamplingMAP
Physical StructureBracketProtective coverHorizontal boardProtective cover
ShellFeed pipeCompression springSpring
Storage boxArticulated axisRotating axisAxis
Water tankScrewSliderShell
Protective boxExhaust portPiston rodSlider
MotorServo motorGuideBase
PartitionRulerAngle sensorMotor
ScrollGuiding pulleyDriving axisFixed board
CavityCharging portCavitySupporting board
Storage cabinetsMain bodyRegulating screwAssembly table
Table 2. The outputs of ConceptTM implementation technical field integration.
Table 2. The outputs of ConceptTM implementation technical field integration.
Technical Field+G06K
Functional Requirements+Vision, +Image
PhysicalStructureFixed board, protective cover, spring, camera, motor, shell, base, controller, slider, and support column
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mo, Z.; Gong, L.; Zhu, M.; Lan, J. The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning. Sustainability 2024, 16, 9841. https://doi.org/10.3390/su16229841

AMA Style

Mo Z, Gong L, Zhu M, Lan J. The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning. Sustainability. 2024; 16(22):9841. https://doi.org/10.3390/su16229841

Chicago/Turabian Style

Mo, Zhenchong, Lin Gong, Mingren Zhu, and Junde Lan. 2024. "The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning" Sustainability 16, no. 22: 9841. https://doi.org/10.3390/su16229841

APA Style

Mo, Z., Gong, L., Zhu, M., & Lan, J. (2024). The Generative Generic-Field Design Method Based on Design Cognition and Knowledge Reasoning. Sustainability, 16(22), 9841. https://doi.org/10.3390/su16229841

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop