Data-Driven Enterprise Architecture for Pharmaceutical R&D
Abstract
:1. Introduction
1.1. Data-Driven Enterprise. Why? What? How?
1.1.1. Why Become a Data-Driven Enterprise?
- Accelerated Drug Discovery and Design: Leveraging machine learning algorithms and AI, data-driven R&D methodologies enable pharmaceutical companies to model and analyze vast datasets encompassing genetic, molecular, and clinical information. These approaches streamline the drug discovery process by uncovering hidden patterns, identifying novel drug targets, and predicting therapeutic outcomes, thus reducing time and resource requirements. Thus, several studies have demonstrated promising outcomes in such areas as drug molecular design, retrosynthetic analysis, chemical reaction outcome prediction, adverse events detection, virtual screening, peptide synthesis, biomarker discovery, and others [22,23,24,25,26,27,28,29,30].
- Precision Medicine and Targeted Therapies: By harnessing comprehensive patient data, including genetic profiles, biomarkers, and clinical histories, pharmaceutical firms can develop personalized treatment regimens tailored to individual patient characteristics. Data-driven insights facilitate the identification of the patient subpopulations likely to respond positively to specific therapies, enabling the development of targeted and more efficacious treatments [29,31,32,33].
- Optimized Clinical Trials: Data-driven analytics enhance the design and execution of clinical trials, enabling pharmaceutical companies to identify suitable patient cohorts, optimize trial protocols, optimize site-selection process, increase trial participant recruitment, and maintain engagement [24,34].
- Drug Repurposing and Combination Therapies: Data analytics enable pharmaceutical researchers to explore existing datasets and identify opportunities for drug repurposing and combination therapies [35,36]. By leveraging insights from large-scale genomic, transcriptomic, and phenotypic datasets, researchers can uncover novel indications for existing drugs or synergistic combinations with enhanced therapeutic efficacy.
- Pharmacokinetics and Pharmacodynamics: By implementing data-driven approaches including AI methods to pharmacovigilance, pharmacodynamics, and adverse event monitoring, companies can detect and respond to safety concerns faster, more cost-effectively, and with greater accuracy, decreasing regulatory risks and safeguarding patient welfare [37,38,39,40].
1.1.2. What Does Data-Driven Enterprise Mean?
1.1.3. How to Become a Data-Driven Enterprise?
1.1.4. Challenges on the Journey towards Becoming a Data-Driven Enterprise
A data-driven architecture is a nervous system of the entire digital transformation, with data as nervous impulses circulating within the whole enterprise organism and accelerating its performance.
2. Materials and Methods
2.1. Enterprise Architecture: Definition and Evolution of the Concept
Why is centralized and layered Enterprise Architecture outdated?
2.2. The Data-Driven Enterprise Architecture framework
The DDA represents a pioneering framework for distributed and federated architecture at a scale that breaks down architecture into domain-specific components, including application, business, and data architectures, while keeping continuous harmonization across these components and domains.
DDA Framework Principles
- 1.
- Domain decomposition
- 2.
- Productization and customer centricity
- Definition and Boundaries: Products are well-defined entities with clear boundaries. This means knowing exactly what the product is, what it encompasses, and what it does not.
- Identifiable Purpose: Each product serves a distinct purpose, aligned with the organization’s ecosystem and value delivery.
- Domain Ownership: Each product is owned and proactively managed by the respective Domain.
- Customers or Potential Customers: Products revolve around customers, whether internal or external, ensuring alignment with their specific needs.
- Tangible or Measurable Value: Products are designed to generate tangible or measurable value, with well-defined value propositions for their customers or stakeholders.
- Tailored Solutions: Application architecture and data products are tailored to meet specific business needs, enhancing operational efficiency.
- Continuous Evolution: Products are designed for ongoing evolution, fostering long-term collaboration among cross-functional domain teams throughout the product lifecycle that differentiates from a project approach with a predefined timeline, scope, and budget.
- Data-Driven Decision-Making: Data products empower teams with actionable insights, facilitating data-driven decision-making.
- Resource Optimization: Concentrating resources on products that directly contribute to business capabilities improves resource allocation.
- Competitive Advantage: Organizations leveraging application architecture and data products gain a competitive edge by aligning products with customer needs and industry trends.
- Scalability and Innovation: Products are inherently scalable and drive innovation by encouraging creative thinking to enhance functionality and user experiences.
- Flexibility: Application architecture and data products are agile and adaptable, ensuring long-term relevance.
- Application Product (AP)—a group of software components that provides specified digital capabilities.
- Data Product (DP)—data entities that have a distinct purpose and value for the organization and are logically grouped for effective management and improvement. Besides general product attributes (defined above), the additional data-specific attributes of DPs encompass reliability (reflecting business accuracy that directly depends on the quality of data); discoverability and understandability (encompassing self-describing semantics, syntax, and usage metadata); composability of integral components such as metadata, data lineage, access, and governance; and semantics.
- 3.
- Actionable, Measurable, and Adaptable
- 4.
- Domain ownership and Federated Governance
3. Results of a Use Case of the Data-Driven Enterprise Architecture at a Pharma Company
3.1. Background of Pharma Company
3.2. Methodology
3.2.1. Phase I—Gain Input and Analysis
3.2.2. Phase II—Future State Design
- to brainstorm and ideate the future state of Data Fabric around the following five categories: Technology, People, Processes, Metrics, and Capabilities
- to detail the five categories by defining four core components for each category that correspond to the grouped and categorized needs derived from Phase I (see Figure A1 in Appendix A)
- Leadership: Data Domain (DD) leadership organizations responsible for defining directions of DD evolvement, representing DD, and driving cross-domain alignments.
- Execution: Cross-functional and autonomous DD teams that have end-to-end accountability for DD Data Products (DPs) and drive the execution of DD strategies and roadmaps. In addition to DD Teams, the FLM defines Data Fabric High Performing Teams (HPTs) that are responsible for implementing integration, data visualization, a shared semantic layer, and other data engineering tasks, as defined by the DD Teams.
- Journey: The Data Fabric Maturity Journey is represented by the iterative implementation and steady evolution of Data Fabric capabilities that are driven by DD business needs and derived from DD roadmaps.
3.2.3. Phase III—Roadmap
- 1.
- Data Ownership Model that, in turn, consists of the following:
- DDs and their leadership organizations
- DPs are the most valuable data for the organization that is grouped to manage it effectively. They are defined and owned by DDs.
- DD roadmaps represent an actionable way to address, align, implement, and track the continuous improvement of the DPs.
- 2.
- Federated governance framework that defines roles, responsibilities, decision-making, and escalation pathways, enabling the efficient handling of data in a federated way.
- DDs execute their strategies and roadmaps, empowered by Data Fabric HPTs.
- Data Fabric HPTs implement integration, data visualization, and a shared semantic layer.
- The Data Fabric Maturity Journey, which represents the progression of Data Fabric capabilities through structured levels, with an increasing ability to deliver.
Data Mesh and Data Fabric can complement each other effectively. While Data Mesh lays the organizational and mindset foundation for how enterprise data are organized and managed, Data Fabric establishes the technological basis for the data architecture.
3.2.4. Phase IV—Execute the roadmap
- Transparency of Global R&D Data: By structuring DDs and DPs, the DOM enhances the transparency of data, ensuring its clear organization and accessibility.
- Effective Data Organization: The DOM defines the most valuable data for the organization, efficiently organizing and managing it to maximize its utility.
- Selective Data Improvement: The DOM facilitates targeted improvements in the organization’s data and their governance, guided by the DD roadmap.
- Enhanced Decision-Making: Business-oriented data ownership is identified, promoting informed decision-making and data quality enhancement.
- Autonomous DDs: Autonomous DDs are established, reducing reliance on central data teams and enabling end-to-end accountability for DPs within each domain.
- 1.
- Data Domains
- 2.
- Data Products
- Value on its own: DPs are selective combinations of data entities that actively support the organization’s business capabilities and subsequently furnish tangible business value. This inherent value is characterized by self-sufficiency, rendering each DP meaningful in its autonomous context or “on its own”. Thus, for instance, metadata could not be classified as a DP because it has no meaning without a related dataset.
- Domain Ownership: The DOM framework embodies a principle of entrusting the stewardship, accountability, and progressive improvements of DPs to respective DDs. While individual DP ownership is not mandated initially (only DD ownership), it remains a dynamic prerogative for DD teams, with their Leads, to implement it, if needed. This approach provided more flexibility and autonomy to DDs and reduced the framework implementation’s complexity.
- Reliability: A hallmark of DPs lies in their capacity to faithfully mirror business accuracy. To do so, a DP is adhering to quality benchmarks tailored to its distinctive purpose. For instance, clinical data for regulatory submissions adhere to stringent quality standards in terms of consistency, completeness, and accuracy, but could be less stringent in terms of timeliness because it might imply lags between the trial event occurring and its capture in a Clinical Trial Management (CTM) system. However, timeliness of data captured from IoT devices could be critical to generate a valid insight. Consequently, a DP should have a clearly defined purpose that determines its quality requirements, which satisfy it becoming reliable. However, together with the evolution of a DP and extension of use cases, the purposes of DPs will also be extended. For example, a data model that consumes, among others, a participant recruitment DP could generate some valuable insights for a reduction in retention rates and could be incorporated into recruitment operations. In this case, based on the new consumption purposes, the DP could have extended quality standards (e.g., additional metadata could be recorded). It is notable that although each DP has its unique purpose, all of them should contribute to the common vision defined for a DD.
- Discoverability and Understandability: DPs are seamlessly discoverable through standard tools like data catalogs, ensuring their accessibility within the system landscape. Additionally, their inherent structure promotes self-describing semantics, syntax, usage, and inter-relationships.
- Composability: DPs consist of one or more constituent datasets, serving as their fundamental building blocks. Beyond datasets, other integral components such as Metadata, Data Lineage, Data Access and Governance, and Semantics contribute to the holistic identity of a DP. Comparable to the assembly of Lego blocks or a Rubik’s cube, where each individual block has a limited meaning, while their logical composition into a DP generates a value [88].
- 3.
- Data Domain Roadmaps
- 4.
- Federated Governance Framework
- Centralized Facets of Governance
- A DGC Charter that outlines the scope and objectives for creating and managing data governance across the Global R&D, including accountabilities for functions supporting data management efforts across the company.
- An operational model underpinned by distinct roles, responsibilities, and a robust decision-making framework.
- Integration with the DOM Framework to harmonize and streamline data initiatives.
- Decentralized Empowerment
- 1.
- Raising a Data Request or Initiative: Data requests or initiatives primarily originate at the domain level, where any DD Team member, such as a Business Owner, System Owner, Data Steward, or Data Domain/Subdomain Lead, can identify challenges, risks, inconsistencies, or improvement opportunities. They can raise a request using a specific request tool or directly during the DGC meeting by nominating the topic in advance. The Request Tool ensures that every data challenge, proposal, or initiative is captured and assessed.
- 2.
- Request Intake by the DGC: The DGC reviews and takes in the request for further assessment.
- 3.
- Assessment and Alignment: The DGC assesses the request, identifying the impacted data scope or DPs and the scale of change using the “T-shirt approach”. This leads to the definition of stakeholders to drive collaborative decision-making. If a relevant DD is identified, decision-making authority shifts to the DD Lead. The DD Lead aligns the change with stakeholders, such as system owners, business owners, data stewards and sponsors, and presents the suggested decision to the DGC in the regular DGC forum. For major changes or data investment initiatives, the DGC orchestrates a validation of the suggested decision within the context of the overarching business strategy, technical feasibility, and alignment with data governance and architecture principles to ensure strategic and technological viability.
- 4.
- DGC Decision: The request is approved if the assessment and alignment confirm its validity. Otherwise, it is either rejected or postponed, pending further clarification.
- 5.
- Actions and Metrics: The DGC supports the DD Lead to define actions and metrics for the approved request or initiative that is incorporated into DP cards.
- 6.
- Alignment with DD Roadmap: If an approved initiative or major change request (T-size bigger than M) impacts DP or DD scope, it is integrated into the DD Roadmap. This integration provides a clear timeline for execution, milestones, and metrics to monitor outcomes. The DD Roadmap serves as the guiding blueprint for the evolution of the Global R&D data capabilities, ensuring synchronized and harmonized initiatives.
- 7.
- Execution and Iteration: Data initiatives are executed within the scope of the DD Roadmap, led by DD Leads. As initiatives are implemented, the framework supports an iterative approach, allowing for continuous improvement and adaptation based on emerging needs and insights. The Data Fabric HPTs, once established (see Section “Phase II: Execution and Maturing of Data Fabric Capabilities”), will drive the implementation of complex data engineering and integration tasks, effectively bridging the gap between DDs and the Integration Competence Center.
3.3. Preliminary Results
Each big change should incorporate one or more interim phases.
- 1.
- In the journey to embed DDA, what has helped you the most? (a free-text question)
- 2.
- Which aspects of support have been the most useful to date? (a dropdown question)
- 3.
- In your opinion, how self-sustaining are the Domains? e.g., meeting on a regular cadence, acting autonomously, tracking the performance of their OKRs? (with 1 being the lowest self-sustaining and 5 being the most self-sustaining)
3.4. Limitations and Recommendations
- Diverse Transformation Approaches
Striking a balance between centralization and decentralization of the data strategy is crucial. A corporate strategy should aim to establish a common “framework language” to ensure interoperability, while providing the necessary agility and flexibility to localize the framework towards division-specific needs.
- DDA: one size does not fit all.
- Productization and Customer Orientation
- Replication of the Use Case
4. Conclusions
- Leadership and Stakeholder Commitment: Unwavering leadership support and stakeholder commitment are crucial for the success of any transformative efforts. The use case evidenced that intensive stakeholder involvement in defining the framework, its objectives, and implementation strategy fostered ownership, broad acceptance, and commitment to the change.
- Actionable, Measurable, and Adaptable Strategy: A data-driven transformation strategy must not only be conceptual, but also actionable, measurable, and adaptable to succeed. The Pharma company’s Data Fabric strategy, the AAF, DOM, and other structural elements of DDA addressed these objectives. The success of this phase is primarily reflected in the acceptance and commitment garnered within the organization. In the long term, an accelerative effect on the socio-economic outcomes of each DD and AD initiative is anticipated that is facilitated by the DDA foundations (see the Chapter “Value-realization”).
- Prioritizing Organizational and Mindset Change: Successful socio-technological transformation prioritizes organizational and mindset change before technology adoption. DDA provides the foundation for this cultural and organizational shift that must accelerate data-driven transformation in an agile and scalable manner during the subsequent phases. While the concept of Data Fabric is inherently technology-centric, it became evident during the exploration phase that merely implementing Data Fabric technological advancements (such as knowledge graphs and active metadata) would not move the Global R&D closer to its goal of fostering a data-driven culture and empowering digital transformation for innovative medicine development. The critical factor influencing the value realization of any technological advancement is its adoption rate, which hinges on having a solid organizational and mindset foundation.
- 4.
- Federalization: DDA introduces the concept of federalization as a strategic balance between centralization and decentralization. This approach emphasizes the development of modularized domain architectures, marking a departure from traditional monolithic methods. While encouraging autonomy within DDs and ADs, DDA maintains interoperability and contextual consistency on a global scale.
- 5.
- Navigating Complexity Through Decomposition: DDA’s approach of decomposing EA into smaller, cohesive domains such as DD and AD addresses the challenge of managing complexity. Structuring these domains around core business capabilities allows the Pharma company to focus on specific business functions and data requirements within each domain, while empowering DD Teams with required resources and mechanisms to develop targeted solutions.
- 6.
- Assess Different Baseline Concepts for Optimal Data Architecture: The Pharma company’s Data Fabric approach evidenced the importance of the exploratory assessment of various baseline concepts related to data architecture, insights from industry experiences, internal expertise, subject matter experts’ opinions, and the unique internal business needs of the enterprise to design an optimal Data Architecture. It is crucial not to be confined to a single mainstream model, such as Data Mesh or Data Fabric, as there is no one-size-fits-all solution in the realm of Data Architecture. Instead, this process should be envisioned as constructing a customized model, akin to assembling a set of Lego blocks. This model is constructed by selectively integrating elements from various baseline concepts, tailored precisely to meet the specific needs and goals of the enterprise.
- 7.
- Synergy of Data Mesh and Data Fabric. The Data Fabric Strategy evidenced efficient integration of the Data Mesh and Data Fabric principles. While Data Mesh laid the foundational base of organizational structure and mindset for data management, Data Fabric formed the foundation of the technological underpinnings. This harmonious blend should facilitate structural realization of data-driven objectives through robust and scalable technological solutions.
- 8.
- Collaboration and Alignment: Continuous collaboration and alignment with ongoing projects within the Global R&D, such as AAF, as well as related initiatives across the whole organization, along with the iterative and adaptable implementation of DDA, are essential to mitigate the risk of creating “framework silos” and to emphasize that all dependent elements of various frameworks work in harmony. Like Lego blocks, these frameworks need not be identical in size or shape, but they must fit together cohesively. Thus, the focus lies on ensuring the necessary interoperability and compatibility between related elements of different frameworks. Even in cases where DD and AD structures exhibit deviations, these frameworks incorporate mechanisms like an integration of AD Leads into the DGC, mapping DPs with APs, and aligning DDs with AD Leads to guarantee close collaboration, interoperability, and alignment.
- 9.
- Phased Approach: Breaking down transformation into manageable chunks, such as the data-driven capabilities, domains, with their initiatives and implementation roadmaps, improve efficiency of the program management by delivering transparency, accountability, and contributing to stakeholder acceptance. In addition, transformational changes should incorporate one or more incremental interim phases. The number of required interim phases depends on numerous factors, including the size of the change, corporate culture, organizational structure, complexity, and the organization’s scale.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
Data Domains | Planning and Forecasting | Study Participant | Study Management | Investigative Staff Engagement | Product Management | Others/Backlog |
Subdomains | Portfolio Planning Budget Management Feasibility | Data Delivery Standards Medical Review Statistical programming RWE | Clinal Operations Country Operations Business Operations | Investigative Staff Engagement | Chemical Process R&D Discovery, Product Development and Supply Distribution | Safety Management Quality Management Regulatory Management |
Data Products | Resource Forecasting OOP Forecasting Project and Product Attributes Project Milestones Forecasting Drivers Resource Capacity Project Accounting (Valuations, Actuals, etc.) Time reported Site information PI information Study information Site feasibility | Pharmacokinetics/Pharmacodynamics datasets and analysis Phase 1 trials patient data (Case Report Form) Phase 1 trials source data ADaM datasets and metadata Study intake data DH Early Assessment data EHR data used in a study Lab data used in a study External real-world databases Tokens DH Technology Enablement board intake data DH Capability Master requirements data DH Technology/Supplier Qualification data Training data eConsent study data eCOA study data | In progress | Study site data Site identity and access management data Training data Contracts Enrollment Data Document collaboration Document repository Managed Access Requests Site relations management Site communication Site feedback collection | In progress | - |
Elements | DOM | AAF | Harmonization | Distinctions |
---|---|---|---|---|
Structure | Data Domains (DDs) | Application Domains (ADs) | Iterative harmonization towards optimal view (not full integration) | Exceptional deviation based on logical grouping of data flows and data categories |
Products | Data Products (DPs) | Application Products (APs) | Alignment and Mapping | DP and AP are distinct elements (although interdependent) |
Product cards | DP cards | AP cards | Interlinked within DP and AP cards | Cards have a different structure specific to their focus area |
Roadmaps | DD roadmaps | AD roadmaps | Strategic alignment between DD and AD roadmaps and cross-reference of the selective items | Distinct focus on data and applications correspondingly |
Federated Governance | ||||
Centralized Facets | Data Governance Council (DGC) | Acceleration Squad | Strategic alignment of interrelated topics | Distinct focus on data and applications correspondingly |
Decentralized Facets | Autonomous cross-functional DD Teams | Autonomous Squads of cross-functional AD teams | Harmonization through close correlation between DD and AD Leads. Thus, a Business AD Lead is always a member of DD team | DD and AD Teams are autonomous in their distinct focus areas such as data and applications |
References
- Hannila, H.; Silvola, R.; Härkönen, J.; Haapasalo, H. Data-Driven Begins with DATA; Potential of Data Assets. J. Comput. Inf. Syst. 2022, 62, 29–38. [Google Scholar] [CrossRef]
- Coyle, D.; Manley, A. University of Cambridge. What Is the Value of Data? A Review of Empirical Methods; Bennett Institute for Public Policy of the University of Cambridge: Cambridge, UK, 2022. [Google Scholar]
- Assur, N.; Rowshankish, K. The Data-Driven Enterprise of 2025. McKinsey & Company. 28 January 2022. Available online: https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-data-driven-enterprise-of-2025 (accessed on 12 November 2023).
- NewVantage Partners. Data and Analytics Leadership Annual Executive Survey 2023: Executive Summary of Findings. 2023. Available online: https://static1.squarespace.com/static/62adf3ca029a6808a6c5be30/t/63a477e23d7f9162b7771d6b/1671722982363/WAVESTONE+NVP--+Data+%26+Analytics+Executive+Leadership+2023--Survey+Findings.pdf (accessed on 12 November 2023).
- Neely, A. Why You Should Be Treating Your Data as an Asset. Anmut. Available online: https://www.anmut.co.uk/why-you-should-be-treating-your-data-as-an-asset/ (accessed on 12 November 2023).
- Johnston, M. Biggest Companies in the World by Market Cap. Investopedia. Available online: https://www.investopedia.com/biggest-companies-in-the-world-by-market-cap-5212784 (accessed on 12 November 2023).
- Ponemon Institute; IBM. Cost of a Data Breach Report 2022; IBM Corporation: Armonk, NY, USA, 2022; Available online: https://www.ibm.com/downloads/cas/3R8N1DZJ (accessed on 12 November 2023).
- Brynjolfsson, E.; Wang, J.; McElheran, K. The Power of Prediction: Predictive Analytics, Workplace Complements, and Business Performance. Bus. Econ. 2021, 56, 217–239. [Google Scholar] [CrossRef]
- Ertz, M.; Sun, S.; Latrous, I. The Impact of Big Data on Firm Performance. In Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2021. [Google Scholar] [CrossRef]
- Müller, O.; Fay, M.; Brocke, J.V. The Effect of Big Data and Analytics on Firm Performance: An Econometric Analysis Considering Industry Characteristics. J. Manag. Inf. Syst. 2018, 35, 488–509. [Google Scholar] [CrossRef]
- Mikalef, P.; Boura, M.; Lekakos, G.; Krogstie, J. Big Data Analytics and Firm Performance: Findings from a Mixed-Method Approach. J. Bus. Res. 2019, 98, 261–276. [Google Scholar] [CrossRef]
- Gul, R.; Ellahi, N. The Nexus between Data Analytics and Firm Performance. Cogent Bus. Manag. 2021, 8, 1923360. [Google Scholar] [CrossRef]
- Schweikl, S.; Obermaier, R. Lessons from Three Decades of IT Productivity Research: Towards a Better Understanding of IT-Induced Productivity Effects. Manag. Rev. Q. 2019, 70, 461–507. [Google Scholar] [CrossRef]
- Hariry, R.E.; Barenji, R.V.; Paradkar, A. Towards Pharma 4.0 in clinical trials: A future-orientated perspective. Drug Discov. Today 2022, 27, 315–325. [Google Scholar] [CrossRef] [PubMed]
- Sharma, D.K.; Patel, P.K.; Shah, M. A comprehensive study on Industry 4.0 in the pharmaceutical industry for sustainable development. Environ. Sci. Pollut. Res. 2023, 30, 90088–90098. [Google Scholar] [CrossRef] [PubMed]
- Tetteh, M.G.; Jagtap, S.; Salonitis, K. Pharma 4.0: Revealing drivers of the digital transformation in the pharma sector. In Lecture Notes in Mechanical Engineering; Springer: Cham, Switzerland, 2023; pp. 528–535. [Google Scholar] [CrossRef]
- Dulko, D.; Kwong, M.; Palm, M.E.; Trinquart, L.; Selker, H.P. From a decentralized clinical trial to a decentralized and clinical-trial-in-a-box platform: Towards patient-centric and equitable trials. J. Clin. Transl. Sci. 2023, 7, e236. [Google Scholar] [CrossRef]
- Eltoukhy, A.E.E.; Shaban, I.A.; Chan, F.T.S.; Abdel-Aal, M.a.M. Data Analytics for Predicting COVID-19 Cases in Top Affected Countries: Observations and Recommendations. Int. J. Environ. Res. Public Health 2020, 17, 7080. [Google Scholar] [CrossRef]
- Hariry, R.E.; Barenji, R.V.; Paradkar, A. From industry 4.0 to pharma 4.0. In Handbook of Smart Materials, Technologies, and Devices; Springer: Cham, Switzerland, 2021; pp. 1–22. [Google Scholar] [CrossRef]
- Inuwa, H.M.; Raja, A.R.; Kumar, A.; Singh, B.; Singh, S. Status of Industry 4.0 applications in healthcare 4.0 and Pharma 4.0. Mater. Today Proc. 2022, 62, 3593–3598. [Google Scholar] [CrossRef]
- Malheiro, V.; Duarte, J.M.; Veiga, F.; Mascarenhas-Melo, F. Exploiting Pharma 4.0 Technologies in the Non-Biological Complex Drugs Manufacturing: Innovations and Implications. Pharmaceutics 2023, 15, 2545. [Google Scholar] [CrossRef] [PubMed]
- Dong, Y.; Wang, W.; Xing, Y.; Du, J.; Meng, Q. Data-Driven modeling methods and techniques for pharmaceutical processes. Processes 2023, 11, 2096. [Google Scholar] [CrossRef]
- Kumar, S.A.; Kumar, T.D.A.; Beeraka, N.M.; Pujar, G.V.; Singh, M.; Akshatha, H.S.; Bhagyalalitha, M. Machine learning and deep learning in data-driven decision making of drug discovery and challenges in high-quality data acquisition in the pharmaceutical industry. Future Med. Chem. 2022, 14, 245–270. [Google Scholar] [CrossRef] [PubMed]
- Vora, L.K.; Gholap, A.D.; Jetha, K.; Thakur, R.R.S.; Solanki, H.K.; Chavda, V.P. Artificial intelligence in pharmaceutical technology and drug delivery design. Pharmaceutics 2023, 15, 1916. [Google Scholar] [CrossRef] [PubMed]
- Dara, S.; Dhamercherla, S.; Jadav, S.S.; Babu, C.M.; Ahsan, M.J. Machine Learning in Drug Discovery: A review. Artif. Intell. Rev. 2021, 55, 1947–1999. [Google Scholar] [CrossRef] [PubMed]
- Koutroumpa, N.-M.; Papavasileiou, K.D.; Papadiamantis, A.G.; Melagraki, G.; Afantitis, A. A Systematic Review of Deep Learning Methodologies Used in the Drug Discovery Process with Emphasis on In Vivo Validation. Int. J. Mol. Sci. 2023, 24, 6573. [Google Scholar] [CrossRef]
- Tang, M.; Li, B.; Chen, H. Application of message passing neural networks for molecular property prediction. Curr. Opin. Struct. Biol. 2023, 81, 102616. [Google Scholar] [CrossRef]
- Paul, D.; Sanap, G.; Shenoy, S.; Kalyane, D.; Kalia, K.; Tekade, R.K. Artificial intelligence in drug discovery and development. Drug Discov. Today 2021, 26, 80–93. [Google Scholar] [CrossRef]
- Bhattamisra, S.K.; Banerjee, P.; Gupta, P.; Mayuren, J.; Patra, S.; Candasamy, M. Artificial intelligence in pharmaceutical and healthcare research. Big Data Cogn. Comput. 2023, 7, 10. [Google Scholar] [CrossRef]
- Nag, S.; Baidya, A.T.; Mandal, A.; Mathew, A.T.; Das, B.; Devi, B.; Kumar, R. Deep learning tools for advancing drug discovery and development. 3 Biotech 2022, 12, 110. [Google Scholar] [CrossRef] [PubMed]
- Huo, L.; Tang, Y. Multi-Objective Deep Reinforcement learning for personalized dose optimization based on Multi-Indicator Experience replay. Appl. Sci. 2022, 13, 325. [Google Scholar] [CrossRef]
- Mhatre, S.; Shukla, S.; Chavda, V.P.; Gandikota, L.; Patravale, V. AI and ML for Development of Cell and Gene Therapy for Personalized Treatment. In Bioinformatics Tools for Pharmaceutical Drug Product Development; Scrivener Publishing LLC: Beverly, MA, USA, 2023; pp. 371–400. [Google Scholar] [CrossRef]
- Quazi, S. Artificial intelligence and machine learning in precision and genomic medicine. Med. Oncol. 2022, 39, 120. [Google Scholar] [CrossRef]
- Elkin, M.E.; Zhu, X. Predictive modeling of clinical trial terminations using feature engineering and embedding learning. Sci. Rep. 2021, 11, 3446. [Google Scholar] [CrossRef]
- Yang, F.; Zhang, Q.; Ji, X.; Zhang, Y.; Li, W.; Peng, S.; Xue, F. Machine learning applications in drug repurposing. Interdiscip. Sci. Comput. Life Sci. 2022, 14, 15–21. [Google Scholar] [CrossRef]
- Krishnamurthy, N.; Grimshaw, A.; Axson, S.A.; Choe, S.H.; Miller, J.E. Drug repurposing: A systematic review on root causes, barriers and facilitators. BMC Health Serv. Res. 2022, 22, 970. [Google Scholar] [CrossRef] [PubMed]
- Kumar, Y.; Koul, A.; Singla, R.; Ijaz, M.F. Artificial intelligence in disease diagnosis: A systematic literature review, synthesizing framework and future research agenda. J. Ambient Intell. Humaniz. Comput. 2022, 14, 8459–8486. [Google Scholar] [CrossRef]
- Obrezanova, O. Artificial intelligence for compound pharmacokinetics prediction. Curr. Opin. Struct. Biol. 2023, 79, 102546. [Google Scholar] [CrossRef] [PubMed]
- Chou, W.-C.; Lin, Z. Machine learning and artificial intelligence in physiologically based pharmacokinetic modeling. Toxicol. Sci. 2022, 191, 1–14. [Google Scholar] [CrossRef]
- Chapman, A.; Peterson, K.; Alba, P.R.; DuVall, S.L.; Patterson, O.V. Detecting Adverse Drug Events with Rapidly Trained Classification Models. Drug Saf. 2019, 42, 147–156. [Google Scholar] [CrossRef]
- Teubner, R.A.; Stockhinger, J. Literature Review: Understanding Information Systems Strategy in the Digital Age. J. Strateg. Inf. Syst. 2020, 29, 101642. [Google Scholar] [CrossRef]
- Riedl, R.; Stieninger, M.; Muehlburger, M.; Koch, S.; Heß, T. What Is Digital Transformation? A Survey on the Perceptions of Decision-Makers in Business. Inf. Syst. E-Bus. Manag. 2023. [Google Scholar] [CrossRef]
- Hanelt, A.; Bohnsack, R.; Marz, D.; Marante, C. A Systematic Review of the Literature on Digital Transformation: Insights and Implications for Strategy and Organizational Change. J. Manag. Stud. 2020, 58, 1159–1197. [Google Scholar] [CrossRef]
- Mahboub, H.; Sadok, H. Contribution to a Better Digital Transformation Implementation: An Integrative Approach. In Machine Learning and Artificial Intelligence; Frontiers in Artificial Intelligence and Applications; IOS Press: Amsterdam, The Netherlands, 2023. [Google Scholar] [CrossRef]
- Schneider, M.; Kanbach, D.K.; Kraus, S.; Dabić, M. Transform Me If You Can: Leveraging Dynamic Capabilities to Manage Digital Transformation. IEEE Trans. Eng. Manag. 2023, 1–15. [Google Scholar] [CrossRef]
- Rashedi, J. The Data-Driven Organization: Using Data for the Success of Your Company; Springer: Cham, Switzerland, 2023. [Google Scholar] [CrossRef]
- Hupperz, M.; Gür, I.; Möller, F.; Otto, B. What Is a Data-Driven Organization? In Proceedings of the Conference AMCIS 2021. Digital Innovation and Entrepreneurship, Virtual, 13–19 August 2021. [Google Scholar]
- Hebbar, P. 10 Indicators of a Truly Data-Driven Organisation; Analytics India Magazine: Bengaluru, India, 2019; Available online: https://analyticsindiamag.com/10-indicators-of-a-truly-data-driven-organisation/ (accessed on 10 April 2024).
- Janmeskens. The Road to a Data-Driven Organization: Navigating Data Dilemmas. Medium. 3 December 2023. Available online: https://medium.com/@meskensjan/the-road-to-a-data-driven-organization-navigating-data-dilemmas-898c612895af (accessed on 12 November 2023).
- Thusoo, A.; Sarma, J.S. Creating a Data-Driven Enterprise with DataOps; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2017. [Google Scholar]
- Yang, H.-L.; Morgan, S.L. Business Strategy and Corporate Governance: Theoretical and Empirical Perspectives. In Business Strategy and Corporate Governance in the Chinese Consumer Electronics Sector; Elsevier: Amsterdam, The Netherlands, 2011; pp. 23–46. [Google Scholar] [CrossRef]
- Warner, K.; Wäger, M. Building Dynamic Capabilities for Digital Transformation: An Ongoing Process of Strategic Renewal. Long Range Plan. 2019, 52, 326–349. [Google Scholar] [CrossRef]
- Ellström, D.; Holtström, J.; Berg, E.; Josefsson, C. Dynamic Capabilities for Digital Transformation. J. Strategy Manag. 2021, 15, 272–286. [Google Scholar] [CrossRef]
- Verhoef, P.C.; Broekhuizen, T.; Bart, Y.; Bhattacharya, A.; Dong, J.Q.; Fabian, N.E.; Haenlein, M. Digital Transformation: A Multidisciplinary Reflection and Research Agenda. J. Bus. Res. 2021, 122, 889–901. [Google Scholar] [CrossRef]
- Teece, D.J. The Evolution of the Dynamic Capabilities Framework. In FGF Studies in Small Business and Entrepreneurship; Springer: Cham, Switzerland, 2022; pp. 113–129. [Google Scholar] [CrossRef]
- Cannas, R. Exploring Digital Transformation and Dynamic Capabilities in Agrifood SMEs. J. Small Bus. Manag. 2021, 61, 1611–1637. [Google Scholar] [CrossRef]
- Chirumalla, K. Building Digitally-Enabled Process Innovation in the Process Industries: A Dynamic Capabilities Approach. Technovation 2021, 105, 102256. [Google Scholar] [CrossRef]
- Helfat, C.E.; Peteraf, M.A. The Dynamic Resource-based View: Capability Lifecycles. Strateg. Manag. J. 2003, 24, 997–1010. [Google Scholar] [CrossRef]
- Curry, E.; Tuikka, T. An Organizational Maturity Model for Data Spaces: A Data Sharing Wheel Approach. In Data Spaces; Springer: Cham, Switzerland, 2022; pp. 21–42. [Google Scholar] [CrossRef]
- Wade, M.; Hulland, J. Review: The Resource-Based View and Information Systems Research: Review, Extension, and Suggestions for Future Research. Manag. Inf. Syst. Q. 2004, 28, 107. [Google Scholar] [CrossRef]
- Teece, D.J.; Leih, S. Uncertainty, Innovation, and Dynamic Capabilities: An Introduction. Calif. Manag. Rev. 2016, 58, 5–12. [Google Scholar] [CrossRef]
- Teece, D.J. Explicating Dynamic Capabilities: The Nature and Microfoundations of (Sustainable) Enterprise Performance. Strateg. Manag. J. 2007, 28, 1319–1350. [Google Scholar] [CrossRef]
- Teece, D.J. Dynamic Capabilities and (Digital) Platform Lifecycles. In Advances in Strategic Management; Emerald Publishing Limited: Leeds, UK, 2017; pp. 211–225. [Google Scholar] [CrossRef]
- Soluk, J.; Kammerlander, N. Digital Transformation in Family-Owned Mittelstand Firms: A Dynamic Capabilities Perspective. Eur. J. Inf. Syst. 2021, 30, 676–711. [Google Scholar] [CrossRef]
- Witschel, D.; Döhla, A.; Kaiser, M.; Voigt, K.; Pfletschinger, T. Riding on the Wave of Digitization: Insights How and under What Settings Dynamic Capabilities Facilitate Digital-Driven Business Model Change. J. Bus. Econ. 2019, 89, 1023–1095. [Google Scholar] [CrossRef]
- Chui, M.; Hazan, E.; Roberts, R.; Singla, A.; Smaje, K.; Sukharevsky, A.; Yee, L.; Zemmel, R. The Economic Potential of Generative AI: The next Productivity Frontier; McKinsey: New York, NY, USA, 2023; Available online: https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier#industry-impacts (accessed on 18 January 2024).
- Chen, Z.; Ayinde, O.R.; Fuchs, J.R.; Sun, H.; Ning, X. G2Retro as a Two-Step Graph Generative Models for Retrosynthesis Prediction. Commun. Chem. 2023, 6, 102. [Google Scholar] [CrossRef] [PubMed]
- Bertelsmann Stiftung; Thiel, R.; Deimel, L.; Schmidtmann, D.; Piesche, K.; Hüsing, T.; Rennoch, J.; Stroetmann, V.; Stroetmann, K. #SmartHealthSystems Digitalisierungsstrategien im internationalen Vergleich. Available online: https://www.bertelsmann-stiftung.de/de/publikationen/publikation/did/smarthealthsystems (accessed on 12 November 2023).
- Dykes, B. The Four Key Pillars to Fostering a Data-Driven Culture. Forbes. 28 March 2019. Available online: https://www.forbes.com/sites/brentdykes/2019/03/28/the-four-key-pillars-to-fostering-a-data-driven-culture/?sh=6cf7bc367d90 (accessed on 12 November 2023).
- LaBerge, L.; Smaje, K.; Zemmel, R.W. Three New Mandates for Capturing a Digital Transformation’s Full Value; McKinsey & Company: New York, NY, USA, 2022; Available online: https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/three-new-mandates-for-capturing-a-digital-transformations-full-value#/ (accessed on 12 November 2023).
- Forth, P.; Reichert, T.; De Laubier, R.; Chakraborty, S. Flipping the Odds of Digital Transformation Success. BCG Global. Available online: https://www.bcg.com/publications/2020/increasing-odds-of-success-in-digital-transformation (accessed on 12 November 2023).
- Shein, E. 8 Reasons Why Digital Transformations Fail. CIO. Available online: https://www.cio.com/article/228268/12-reasons-why-digital-transformations-fail.html (accessed on 14 November 2023).
- Lawton, G.; Roy, M. 11 Reasons Why Digital Transformations Fail, Explained by Pros. TechTarget. 20 October 2023. Available online: https://www.techtarget.com/searchcio/tip/Top-6-reasons-why-digital-transformation-failures-happen (accessed on 14 November 2023).
- Van De Wetering, R. The Role of Enterprise Architecture-Driven Dynamic Capabilities and Operational Digital Ambidexterity in Driving Business Value under the COVID-19 Shock. Heliyon 2022, 8, e11484. [Google Scholar] [CrossRef]
- Van De Wetering, R. Enterprise Architecture Resources, Dynamic Capabilities, and Their Pathways to Operational Value. In Proceedings of the Conference: International Conference on Information Systems (ICIS), Munich, Germany, 15–18 December 2019. [Google Scholar]
- Van Den Berg, M.; Slot, R.; Van Steenbergen, M.; Faasse, P.; Van Vliet, H. How Enterprise Architecture Improves the Quality of IT Investment Decisions. J. Syst. Softw. 2019, 152, 134–150. [Google Scholar] [CrossRef]
- Ahlemann, F.; Legner, C.; Lux, J. A Resource-Based Perspective of Value Generation through Enterprise Architecture Management. Inf. Manag. 2021, 58, 103266. [Google Scholar] [CrossRef]
- Pattij, M.; Van De Wetering, R.; Kusters, R.J. Enhanced Digital Transformation Supporting Capabilities through Enterprise Architecture Management: A fsQCA Perspective. Digit. Bus. 2022, 2, 100036. [Google Scholar] [CrossRef]
- Van De Wetering, R. Understanding the Impact of Enterprise Architecture Driven Dynamic Capabilities on Agility: A Variance and FSQCA Study. Pac. Asia J. Assoc. Inf. Syst. 2021, 13, 32–68. [Google Scholar] [CrossRef]
- Törmer, R.L. Dynamic Capability Building In The Lego Group—Prospective Activities Vs. Reflective Learning In Preparation for a Turbulent Digital Future. AIS Electronic Library (AISeL). Available online: https://aisel.aisnet.org/ecis2019_rp/136/ (accessed on 12 November 2023).
- Enterprise Architecture Artifacts Facilitating The Strategy Planning Process for Digital Transformations: A Systematic Literature Review and Multiple Case Study. IADIS Int. J. Comput. Sci. Inf. Syst. 2021, 16, 46–62. [CrossRef]
- Kotusev, S.; Kurnia, S.; Dilnutt, R. The Practical Roles of Enterprise Architecture Artifacts: A Classification and Relationship. Inf. Softw. Technol. 2022, 147, 106897. [Google Scholar] [CrossRef]
- Zachman, J.A. Business Systems Planning and Business Information Control Study: A Comparison. Ibm Syst. J. 1982, 21, 31–53. [Google Scholar] [CrossRef]
- Buckl, S.; Schweda, C.M. On the State-of-the-Art in Enterprise Architecture Management Literature, Technical Report; Technical University of Munich: München, Germany, 2011.
- Herscovitch, L.; Meyer, J.P. Commitment to Organizational Change: Extension of a Three-Component Model. J. Appl. Psychol. 2002, 87, 474–487. [Google Scholar] [CrossRef] [PubMed]
- Armenakis, A.A.; Harris, S.G. Crafting a Change Message to Create Transformational Readiness. J. Organ. Chang. Manag. 2002, 15, 169–183. [Google Scholar] [CrossRef]
- Meyer, J.P.; Herscovitch, L. Commitment in the Workplace: Toward a General Model. Hum. Resour. Manag. Rev. 2001, 11, 299–326. [Google Scholar] [CrossRef]
- Broda, E. Managing a Federated Data Product Ecosystem—Towards Data Science. Medium. 16 January 2023. Available online: https://towardsdatascience.com/managing-a-federated-data-product-ecosystem-3c6bff94c728 (accessed on 12 November 2023).
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for Scientific Data Management and Stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
- Dehghani, Z. What Is Data Mesh? In Data Mesh: Delivering Data-Driven Value at Scale, 1st ed.; O’Reilly Media: Sebastopol, CA, USA, 2022; pp. 49–50. [Google Scholar]
- Teece, D.J.; Peteraf, M.A.; Leih, S. Dynamic capabilities and organizational agility: Risk, uncertainty, and strategy in the innovation economy. Calif. Manag. Rev. 2016, 58, 13–35. [Google Scholar] [CrossRef]
Characteristics | Application Product Example | Data Product Example |
---|---|---|
Definition and boundaries (What is it? What is it not? What are the components?) | Investigative Staff Training Application | Phase 2 Clinical Study Data for COVID-19 Vaccine ABC |
Feasibility Assessment Tool for Clinical Studies (as software components delivering this digital capability) | Feasibility Assessment Report for Clinical Studies (as a combination of data components) | |
Identifiable purpose (What it serves for?) | Distributing training resources for site staff involved in clinical studies and recording the completion | Providing evidence of safety and efficiency of the vaccine |
A repository for a collaborative site selection, based on their capabilities and previous experiences with them | Support decision-making about the feasibility of a study and incorporated risks. Keep transparency of the decision and ability to use it for future assessments. | |
Customer(s) (Whom it serves?) | Site staff, PII, and other study team members | Internal: Study team, submission team, and other R&D departments. External: regulatory agencies, research organizations, etc. |
Study planning team and other R&D departments and teams | Study planning team and other R&D departments and teams, as well as cross-organizational departments | |
Tangible or measurable value (What value does it generate? How could it be measured?) | Improved site satisfaction and clinical trial operations (e.g., yy FTE savings, zz points increase in the “Sponsor of choice” assessment) | A prerequisite for an approval to market the vaccine and receive a return on investments. In addition, the data would help to further investigate the disease |
Increases efficiency and accuracy of the feasibility assessment by digitalization (e.g., xy FTE savings) | Increase in accuracy and efficiency of decision-making about the feasibility of the study and incorporated risks |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Uzhakova, N.; Fischer, S. Data-Driven Enterprise Architecture for Pharmaceutical R&D. Digital 2024, 4, 333-371. https://doi.org/10.3390/digital4020017
Uzhakova N, Fischer S. Data-Driven Enterprise Architecture for Pharmaceutical R&D. Digital. 2024; 4(2):333-371. https://doi.org/10.3390/digital4020017
Chicago/Turabian StyleUzhakova (née Sabirzyanova), Nailya, and Stefan Fischer. 2024. "Data-Driven Enterprise Architecture for Pharmaceutical R&D" Digital 4, no. 2: 333-371. https://doi.org/10.3390/digital4020017
APA StyleUzhakova, N., & Fischer, S. (2024). Data-Driven Enterprise Architecture for Pharmaceutical R&D. Digital, 4(2), 333-371. https://doi.org/10.3390/digital4020017