In this section, we first analyze the evolution of digital twins research over time, featuring visualizations of emerging and declining topics. Then, we examine the application of sensor technology across different fields, highlighting principal keywords in each area. Finally, we present specific case studies from each application field, showcasing the practical implementations of digital twin technology.
3.1. Research Landscape Evolution over Time
From the KCN analysis results, we see a clear growth and diversification trend in the digital twin field.
Table 2 presents the statistics of KCNs from 2000 to 2023. Here, we observe a substantial rise in the number of articles, keywords, and links, particularly after 2020. The increase in articles indicates a surge in research activities, while the growth in links points to an expanding web of interconnected topics.
The development stage of this research field can also be assessed with the
K value, based on Kuhn’s model of scientific progression [
27]. The
K value is calculated by dividing the number of unique keywords by the frequency of those keywords within a discipline. Derived from
Table 2, the
K values for four different time windows are 0.174, 0.155, 0.119, and 0.105, respectively. The declining trend in the
K value, inversely proportional to the growing number of publications, suggests that the field of digital twins is in the midst of an evolution, aligning with Kuhn’s pre-revolution or revolution stage.
Figure 3 reinforces this observation by showing the distribution of articles, keywords, and links across the four time periods, with significant growth in the latter two years. This suggests not only an increase in the research volume but also expansion in the complexity within the field.
Figure 4 expands on these data by comparing the average network strength and the maximum weight of the network, both of which indicate an increase in inter-article and inter-topic connections.
Figure 5 offers a distribution of keyword degrees, strengths, and link weights. The upward trends in average and maximum network degrees from
Figure 4 and
Figure 5 hint at a broadening scope of individual topics and articles, suggesting an increasingly collaborative research environment where topics are more interconnected. Notably, the outliers represent the keywords that are highly connected and centric to this research field. We will present and discuss these topics in the following sections.
Figure 6 provides various insights into the dynamics of the network.
Figure 6a shows the probability density function of keyword degree. A shift toward the right over time indicates that certain keywords are becoming increasingly prominent within the network.
Figure 6b examines the average weight as a function of endpoint degree. The positive linear trend suggests that keywords with a higher degree tend to form stronger connections with other keywords. However, it is not clear from this graph alone whether these connections tend to be with other highly connected keywords or emerging keywords. In addition, the subtle shift toward the right with time means that the combination of keyword degrees associated with a given average weight has been growing with time, suggesting popular keywords start to be the hubs that connect newer topics into the network, facilitating the network’s growth.
Figure 6c presents the relationship between the average weighted neighbor’s degree and the node degree. In all four time windows, there is no clear correlation between the degree of a node and its neighbor’s degree. This complements the insights from
Figure 6b and shows that highly connected keywords connect with a diverse range of nodes rather than only with other highly connected nodes. To accompany this insight,
Figure 6d shows the decreasing trend in the weighted clustering coefficient, indicating that highly connected nodes act more as bridges than remaining within isolated clusters, pointing to an expanding and diversifying field.
These visualizations and metrics depict the characteristics of a rapidly growing complex field, with foundational research expanding and certain topics gaining more prominence. However, the consistent average weight across years also suggests that the additional links may not always contribute to the foundational research, raising questions about the depth and influence of recent publications. This nuanced view of the field’s evolution indicates both robust growth and areas requiring further investigation to understand the research impact.
3.2. Emerging and Declining Research Topics
Figure 7 and
Figure 8 trace the changes in keyword relevance over time, from the earliest time window of 2000–2020 to the most recent time window of 2023. In each time window, we ranked keywords based on their strength, which is determined by the number of connections each keyword has. We then compared the ranking of keywords from both time windows to assess emerging or declining trends.
Because there was a substantial increase in overall keyword strength from the earlier to the later time window, we used rank as a proxy for a keyword’s relevance within its specific period. Additionally, we categorized the keywords into two groups: those relating to digital twin applications and those relating to the sensing ecosystem, which includes sensors, machine learning methods, and computational systems.
It is important to note that a slight decline in a keyword’s rank does not necessarily indicate a decrease in its importance or research focus. Instead, it may indicate a natural transition of the keyword from a novel research area to a more established topic that no longer occupies the forefront of emerging research themes. This shift can be seen as a maturation process within the research landscape, where once-novel concepts become foundational elements of the field.
Figure 7 presents keywords related to digital twin applications. Notably, the top five keywords, namely digital twins, Internet of Things, Cyber-Physical Systems, Industry 4.0, and simulation, have maintained the top five positions with no rank change, indicating that they have endured centrality and significance in digital twin research for over two decades. Digital twins, as the literature searching criteria, is naturally included in all research. The Internet of Things is significant for providing the sensor data that feeds digital twins. Cyber-Physical Systems are essential as they constitute the framework in which digital twins operate, integrating computation with physical processes to enable automated decision making. Industry 4.0 represents the current trend of automation and data exchange in manufacturing technologies, including Cyber-Physical Systems, the IoT, and cloud computing, which are inherently linked to the concept of digital twins. In addition, simulation serves as the analytical engine that enables the virtual representation to predict the behavior and performance of its physical counterpart.
There are two types of keywords that indicate emerging trends: application fields (areas where digital twins are being applied) and functions (what digital twins help achieve). The emerging application fields for digital twins include smart cities, energy consumption, healthcare, the construction industry, power systems, smart grids, and autonomous vehicles. The more digitalized and intelligent infrastructure in these areas enables the implementation of digital twins. The increasingly diversified application fields for digital twins also explain the slight decline of smart manufacturing and the manufacturing industry in the right panel.
The emerging functions include digital transformation, decision making, resource allocation, predictive maintenance, fault diagnosis, and real-time monitoring. The trend can be attributed to advancements in machine learning and sensor technologies. As machine learning algorithms have become more sophisticated, digital twins are now able to not only replicate physical systems but also transform and optimize them. Digital twins also have enhanced decision-making capabilities, enabling automated and informed decisions based on predictive analytics and real-time data.
Figure 8 presents keywords related to sensor and machine learning technology. The top two keywords are machine learning and artificial intelligence. The emerging keywords related to sensors are real time, point cloud, and sensor network, highlighting the growing demand for sensors that can deliver immediate, interconnected, and diverse data types. Regarding the computation architecture that supports digital twins and machine learning functions, we notice a rising trend in edge computing and metaverse and a declining trend in cloud computing. This points to a research area pivoting toward distributed computing paradigms, suggesting a move to bring processing closer to the data source for quicker insights. This trend implies that while cloud computing has become a well-established field, the frontier of research is moving toward systems that can handle analytics at the edge of networks.
As for the machine learning-related keywords, emerging models include deep learning, reinforcement learning, federated learning, surrogate models, and convolutional neural networks. This emergence corresponds to the need for sophisticated analytical tools capable of processing complex, multimodal sensor data. These methods are particularly suited to the demands of digital twins, offering enhanced capabilities for privacy preservation and data security.
3.3. Mapping Keywords to Application Fields
The previous analysis has focused on the temporal characteristics of the research field and the trends in the relevance of the top keywords. In this section, we will delve deeper and examine how digital twin technology, specifically sensing, machine learning, and computation technologies, are being applied in different fields.
In the methodology section, we mentioned that we classified the literature into six application categories. In this section, we draw insights from each category and visualize the insights using a Sankey diagram. The left column of the Sankey diagram lists the keywords of interest, while the right column represents the application fields. The numbers on the left represent the number of papers that contain the keyword of interest. The number on the right is a summation of all streams of numbers from the left. Since we only selected the top keywords in each category to visualize, the number on the right should not be confused with the total number of papers in each category.
Figure 9 displays the mapping from sensor technology to different digital twin application fields. Real-time data and point cloud emerge as the most prevalent keywords, which validates the trend from the slope graph. From this graph, there are two types of sensor keywords: focused sensing technologies and cross-domain technologies. As for focused keywords, process data and vibration have found their place in manufacturing settings as they are common practices for machine and equipment monitoring. Electrocardiograms and cardiac electrophysiology in healthcare may be linked to their potential to create high-fidelity visualizations for cardiac twins. LiDAR shows a strong association with infrastructure applications, likely due to its precision in capturing environmental data for smart city applications.
Cross-domain keywords such as point cloud, data acquisition, and human–robot interaction point toward the versatility of these technologies. Point cloud data, with their high-resolution spatial information, are crucial not only in manufacturing and logistics but also in infrastructure for transportation and urban planning. Data acquisition stands out as a foundational element in the sensing ecosystem to ensure the quality of high-frequency and multimodal sensing data. Human–robot interaction emphasizes the increasing collaboration between humans and automated systems. In healthcare, this could translate to robotic surgery or patient care systems, while, in manufacturing, this can pertain to collaborative robots working alongside human operators.
Figure 10 displays the mapping from machine learning methods to different digital twin application fields. Machine learning bestows active digital twins with decision-making capabilities in various applications. Neural networks and deep learning algorithms play a prominent role in pattern recognition and predictive analytics. The strong presence of reinforcement learning, particularly in fundamental research, signals an interest in developing digital twins capable of autonomous decision making and optimization—a critical feature for systems that learn and adapt over time.
The emergence of federated learning points to a growing concern for data privacy and distributed computation, enabling collaborative model training without centralized data storage. This approach aligns well with digital twins, which often require the synthesis of distributed data while sustaining confidentiality, particularly in healthcare and business settings.
The strong connection between optimization techniques and manufacturing and supply chain applications underlines the role of digital twins in process improvement and efficiency gains. Meanwhile, the intersection of convolutional neural networks with infrastructure and transportation highlights their importance in image and video processing tasks relevant to these fields.
Interestingly, the relatively modest numbers attached to healthcare and human-centric technology may reflect the nascent integration of machine learning into these regulated domains, where safety and validation are paramount.
Overall,
Figure 11 displays the mapping from computational technologies to different digital twin application fields. Blockchain’s notable presence across multiple fields, especially in business and asset management, highlights its role in enhancing security, transparency, and traceability. Its application within manufacturing and supply chain domains indicates its potential to revolutionize how data across the digital twin lifecycle are securely managed and shared.
The Metaverse, often associated with immersive virtual environments, shows a substantial intersection with infrastructure and transportation. This could point toward the Metaverse’s capacity for sophisticated simulations and virtual testing environments, which are crucial for planning and managing large-scale infrastructural projects. As suggested by the slope chart above, cloud computing displays a slightly declining influence, indicating a shift toward distributed computing paradigms such as edge computing. Big data and data-driven keywords maintain a steady connection with fundamental research, reflecting the ongoing need to process and analyze large datasets within the digital twin sphere to extract meaningful insights. In addition, semantic interoperability and data fusion, though not as dominant, indicate niche but vital areas in ensuring that digital twins can communicate effectively across systems and synthesize information from disparate sources.
3.4. Specific Cases in Each Application Field
Guided by the insights from the previous section, we select and review specific instances of digital twin research in this section. The tables in this section are a curated collection of publications based on the highlighted keywords from our Sankey analysis. This section aims to transition from high-level trends to individual research efforts, providing examples of how sensing, machine learning, and computation technologies are implemented within various application areas.
Table 3 presents a selection of studies in fundamental research of digital twins. The study on sensor calibration within building systems [
28] demonstrates the ongoing effort to synchronize physical and virtual sensor data, which is a crucial step for accurate digital twin simulations. Research into sensor reliability [
29] tackles the challenge of predictive maintenance by using redundant digital sensors to foresee potential sensor failures. Both studies emphasize the significance of sensor calibration in maintaining the operational integrity of digital twins. Challenges remain in improving the accuracy of a virtual model while maintaining the complex system built upon multiple sensors. Wearable ECG sensors [
30] have been studied for low-latency signal analysis, enhancing the responsiveness of digital twins. This research resonates with the need to make digital twins interactive and user-centric. In the future, digital twins will serve not only as tools for simulation and monitoring but also as an end-to-end platform for interaction, providing intuitive feedback to users. The integration of tactile sensors in tactile devices [
31] offers insight into the sensory augmentation possibility within digital twins, while the use of LiDAR for user interface design [
32] highlights the importance of high-resolution spatial data in creating intuitive teleoperation systems. Both studies suggest that as the digital twin userbase grows, the user experience will become an essential factor, particularly in how objects are identified and interacted with within these virtual environments. Researchers should recognize that the usability of digital twins is as important as their technical accuracy [
9].
Table 4 presents a selection of research efforts showcasing the application of digital twin technology in the manufacturing and supply chain areas. In CNC machining, force sensors monitor the cutting torque in end milling processes, supplying data for a comprehensive dashboard that integrates real and simulated torque signals for condition monitoring [
35]. The predictive maintenance capacity enables real-time adjustments and machine downtime reduction. Another study develops nonlinear multi-variant dynamic models of multi-axis machine tools with onboard CNC sensing data and visualizes the servo system’s dynamics [
36]. The digital twins in the form of real-time visualization can help optimize the machine tool performance and reduce production errors.
Further, in CNC machining, the fusion of tool, workpiece, and process monitoring data, is visualized on a dashboard, providing a complete view of the manufacturing process [
37]. This digital process twin supports operators in making informed decisions by simulating part geometry and process analytics. The use of optical sensors in a cyber-physical production cell to create an interactive visual replica [
38] signifies the importance of high-fidelity models for understanding and optimizing complex production systems. The above approaches to building digital twin models have made significant progress in unveiling the relations between key indicators and tool performance in the machining process. The sensor-based digital twins allow autonomous monitoring and troubleshooting within smart manufacturing environments. Future work could investigate the scalability of the method to consistently deliver accurate responses and optimize processes as the numbers of machine types and operational parameters scale. Additionally, exploring the integration of machine learning across different manufacturing environments would be valuable.
In additive manufacturing, embedded distributed fiber sensors are used for Finite Element Analysis (FEA) simulations of temperature and strain [
39]. The ability to model these parameters with high precision is indicative of the move toward high-fidelity simulations in digital twins, ensuring product quality and process reliability.
Lastly, the production planning process benefits from the fusion of CPS indicators, production data, and LiDAR-generated point clouds to create a 3D model of a production plant [
40]. This example demonstrates the potential of digital twins in providing a comprehensive three-dimensional context for production planning, facilitating better spatial understanding and resource allocation.
Table 5 presents the selected applications of digital twin sensor technology in the energy and power grid sector. The energy equipment monitoring example showcases a condition monitoring digital twin of a small hydro turbine, enabled by a wireless sensor network, including accelerometers, temperature, and inductive current sensors. The digital twins operating on sensor readings and environmental data provide a condition indicator visualization [
41]. This approach can detect faults early and reduce downtime, which is crucial in the energy sector where continuity is essential.
In electric power conversion, the photovoltaic (PV) dc–dc converter’s efficiency is augmented by thermal cameras and scanning electron microscope imagery. FEM simulations predict temperatures at critical converter components, enabling fast estimations of device conditions under various operational stresses [
42]. The two studies highlight the importance of digital twin predictive maintenance in infrastructure reliability.
Wind engineering research utilizes wind pressure sensors to develop an optimal sensor placement algorithm [
43]. This algorithm aims to reconstruct wind pressure fields accurately, which is indispensable for assessing the structural integrity of wind turbines and optimizing their design for maximum energy capture. The reconstruction of the wind pressure field can also be utilized for creating digital twin infrastructure.
For hydropower generation, pressure sensors are deployed within the hydraulic network, informing the development of a control system that maximizes hydropower production while adhering to hydraulic constraints [
44]. Such digital twin applications ensure the harmonization of power generation with environmental and infrastructural considerations.
Lastly, in the field of smart grids, event loggers are utilized to develop digital twins with autonomous proactive agents [
45]. These agents interact within a coordination platform to manage the complex dynamics of energy demand and supply, thus enhancing grid stability and operational resilience. This research points out that integrating an agent-coordination model into digital twins can address complex energy management issues at the microgrid level. The findings provide an example of how to create resilient and user-centric energy networks.
Table 6 presents the applications of digital twin sensor technology in the healthcare and human-centric area, where sensors are broadly referred to as any device or system that detects events or changes in a given environment, transmitting the information to other devices. In the cardiology field, ECG sensors are integral in developing a digital twin of the human heart [
46]. This innovative approach merges ECG data with medical records to construct a “Cardio Twin”, a proof of concept that offers heart condition visualizations for both local and remote diagnosis. In another example, clinical 12-lead ECGs and Magnetic Resonance Imaging (MRI) create biophysically detailed digital twins for cardiac electrophysiology [
47]. These models simulate intricate heart structures, including Purkinje networks, paving the way for in silico clinical trials and advanced cardiac care.
For rural healthcare, IoT sensors and devices are leveraged to bring medical services to remote areas [
48]. Here, sensors encompass a variety of medical devices that collect health-related data, which, when coupled with blockchain technology, ensures secure data management and analysis in resource-limited settings. In space medicine, sensors include mixed reality devices such as HoloLens and haptic systems, which create a digitized interactive training environment [
49]. This expands the sensory experience by providing real-time feedback and immersive scenarios for astronaut medical training. The above research highlights the potential and success of digital twin technology in the field of personalized and predictive healthcare.
In the educational sector, sensors refer to the instrumentation of a remote lab, where equipment control and monitoring are critical [
50]. These sensor systems enable a hybrid remote laboratory for various learning scenarios, fostering interactive and multimodal educational experiences. This also encourages future researchers to explore digital twin solutions for better learning outcomes and operational safety [
52].
Lastly, in the context of human–robot collaboration, force/torque sensors on a battery pack assembly line provide data for a digital twin that visualizes and analyzes the collaborative environment [
51]. This digital twin assists in designing, developing, and operating a safe and efficient human–robot interactive system. Future study can revolve around the safety and optimization of these systems with the aid of digital twins [
53].
Table 7 presents examples of the use of digital twins in the optimization of infrastructure and transportation systems. For infrastructure modeling, LiDAR sensors are utilized to capture detailed point cloud data of campus buildings [
54]. This technology enables the creation of accurate digital replicas of large structures and facilitates the creation of accurate digital twins of extensive structures, enabling efficient maintenance planning and historical preservation.
In transportation infrastructure, the fusion of 2D images from cameras and 3D point clouds from LiDAR leads to a comprehensive digital twin of a magnetic levitation track [
55]. This detailed representation bridges the gap between macroscopic project management and microscopic engineering analysis, underscoring the capacity for digital twins to offer multiscale insights into transportation systems. This study points out the importance of having efficient and automated processes for managing large LiDAR datasets to enhance the scalability of digital twins in civil engineering. Future study should include developing advanced algorithms that could automate the conversion of point cloud data into information modeling.
Urban logistics can benefit from the integration of sensors and actuators within the infrastructure. This can be achieved through a platform architecture for digital twins that informs policy making via interactive dashboards [
56]. This approach allows for real-time sensor data and logistics system documentation to drive simulation models that can pinpoint gaps and opportunities for transformation within city ecosystems. It is challenging to convert this framework into physical systems for city planners and logistics stakeholders to use and improve urban logistics.
Factory logistics can be revolutionized by incorporating Automated Guided Vehicles (AGVs) that track and monitor the movement of goods on the assembly line [
34]. The development and application of a multi-objective AGV scheduling method based on digital twins reflect a shift toward intelligent and efficient logistics systems.
The planning of long-distance freight flows can be analyzed by integrating IoT sensors, GPS, and GIS into a virtual infrastructure and transportation model [
57]. This digital twin serves as a powerful tool for analyzing and synchronizing transport, demonstrating the potential of digital twins to streamline logistics operations across vast distances. Future study can focus on achieving interconnection between real-time data and virtual models for different transport modes in an operational context.
The predictive maintenance of agriculture equipment can be improved by digital twin technologies integrated with sensors and data pipeline systems [
58]. This study streamlines computational fluid dynamics (CFD) simulation data, sensor readings, and historical information to replicate a virtual cyclone bag filter system in grain milling plants. The digital twins of the system can monitor the filter status and perform precise predictions of a system’s remaining useful life. This research marks the potential of digital twins in improving operational efficiency for smart agriculture through monitoring and predictive analytics.
Table 8 explores the digital twin applications in business and asset management. These sensors are not limited to traditional physical devices but also include digital and social data sources. In production management, the term “sensor” encompasses product documentation throughout production [
59]. This documentation acts as a sensor by providing continuous feedback on the product lifecycle, enabling the development of a digital twin for efficient tracking in high-volume production environments. This study set an example of using Asset Administration Shell to standardize and simplify the digital twin representations of manufacturing assets. Another study in this domain proposes a hybrid digital twin approach that integrates traditional onboard sensors with telemetry data sources to create virtual production line properties [
60]. Their innovative usage of Apache StreamPipes for handling high-volume data streams features a solution to the data preprocessing for digital twins.
The notion of sensors expands further in environmental monitoring, where social media posts on platforms such as Twitter become inputs. These “digital sensors” capture real-time data on the spread of invasive species [
61], offering a novel approach to environmental monitoring by harnessing crowd-sourced information. This study manifests the versatile nature of digital twins by bridging it with ecological management and leveraging Natural Language Processing to model the spread of an invasive species. Additionally, in social issue alleviation and network sentiment analysis, chat rooms and business intelligence data act as sensors by providing communication data and social network dynamics [
62,
63]. These data allow for real-time sentiment analysis and conversation facilitation via chatbots. The concept of semantic digital twins for simulating human behavior for analytical purposes is an innovative idea. In future studies, it is important to address privacy and data security concerns, such as modeling complex human behavior and ensuring ethical use of personal data.