Exploring the Molecular Terrain: A Survey of Analytical Methods for Biological Network Analysis
Abstract
:1. Introduction
- To provide a holistic view of the various approaches available for studying molecular interactions at a network level.
- To examine the strengths, limitations, and applications of these diverse analytical tools.
- To enhance our understanding of the intricate dance of molecules within the cellular landscape.
- To facilitate advancements in research by equipping researchers with the necessary knowledge and tools to explore the fascinating world of biological networks.
2. Conventional Methods for Network Analysis
2.1. Analytic Perspective on Network Architecture with Graph Theory
- Degree centrality: this metric simply counts the number of connections a node has, indicating its overall connectivity within the network [17].
- Betweenness centrality: this measure reflects a node’s role as a bridge between different network communities, highlighting its potential to influence information flow within the network [30].
- Closeness centrality: this metric signifies how quickly information can propagate from a particular node to all other nodes in the network [31].
2.2. Network Reconstruction
2.3. Visualizing Network Patterns
- Explore the overall network structure: by observing the layout and distribution of nodes and edges, researchers can gain a holistic understanding of a network’s organization and identify potential patterns or anomalies.
- Identify key network features: visualization tools facilitate the identification of densely connected clusters, isolated nodes, and central hubs, offering valuable insights into potential functional modules and critical players within the network.
- Communicate network insights: visual representations act as powerful communication tools, enabling researchers to effectively share their findings with collaborators and the broader scientific community.
3. Advanced Network Analysis Techniques
3.1. Network Inference
- -
- Mutual information: this technique measures the statistical dependence between two variables, helping identify potential interactions between entities based on their co-occurrence patterns in the data.
- -
- Correlation analysis: by calculating the correlation coefficients between different data points, this approach can reveal co-regulations or associations between entities, suggesting potential interactions within the network.
3.2. Network Dynamics
- ✓
- Differential network analysis is a method that compares networks under different conditions (e.g., healthy vs. diseased state) to identify changes in network topology and connectivity patterns, revealing potential mechanisms underlying biological transitions [63].
- ✓
- Time-series analysis is the approach that analyzes network data collected over time points, allowing researchers to track the evolution of the network and identify dynamic changes in network structure and function [64].
3.3. Machine Learning in Network Analysis
3.4. Topological Analysis
4. Applications of Biological Network Analysis
4.1. Understanding Disease Mechanisms
- Identify key drivers: One crucial application of network analysis lies in pinpointing key drivers of disease progression. These drivers are often highly connected nodes or critical pathways within the network that exert a significant influence on the disease state. By identifying these key drivers, researchers can prioritize potential therapeutic targets. Molecules or pathways identified as key drivers become prime candidates for therapeutic interventions. Disrupting their function or targeting them with specific drugs could potentially halt or reverse disease progression. Moreover, unraveling disease etiology is another critical outcome of identifying key drivers [87]. Understanding the identity and role of key drivers sheds light on the root causes of the disease, providing valuable insights into disease development and progression. This deeper understanding can inform future research directions, personalized treatment strategies, and ultimately contribute to improved patient outcomes.
- Uncover disease modules: Network analysis allows researchers to identify disease modules, which are clusters of interconnected nodes (e.g., genes, proteins) within the network that are functionally associated with the disease [88]. Studying these modules provides valuable insights into coordinated molecular processes: By analyzing the interactions and functions within a disease module, researchers can understand how different molecules within the module work together to contribute to the disease phenotype. This reveals the coordinated molecular processes underlying the disease state. Novel therapeutic strategies: Identifying the components and functions of disease modules opens avenues for the development of novel therapeutic strategies. Instead of targeting individual molecules, these strategies could potentially disrupt entire pathways or processes orchestrated within the disease module, leading to more effective therapeutic interventions. Furthermore, studying disease modules can lead to the discovery of novel therapeutic strategies. Identifying the components and functions of disease modules can pave the way for the development of novel therapeutic strategies that target not just individual molecules but entire pathways or processes contributing to the disease [89]. This holistic approach to targeting disease modules can potentially lead to more effective treatments with fewer side effects and improved outcomes for patients.
- Explore network dynamics: Diseases are rarely static entities but, rather, evolve over time. Network analysis allows researchers to study the dynamics of disease-associated networks, revealing how the network structure and function change throughout disease progression [90]. This information can be crucial for understanding disease progression. By tracking changes in network connections and properties over time, researchers can gain a deeper understanding of how the disease progresses from its early stages to more advanced forms. Moreover, analyzing network dynamics can help pinpoint critical junctures in disease progression where targeted interventions might be most effective in halting or reversing the course of the disease. By analyzing changes in network structure and function over time, researchers can gain a deeper understanding of how the disease progresses and identify potential points for intervention.
4.2. Drug Discovery
4.3. Personalized Medicine
4.4. Additional Applications
- ✓
- Evolutionary biology: Understanding the evolution of biological systems by analyzing changes in network structure and function across different species [97].
- ✓
- Systems biology: Integrating diverse biological datasets (e.g., genomics, proteomics) into network models to gain a holistic understanding of complex biological processes [98].
- ✓
5. Future Directions
5.1. Embracing the Multi-Omics Revolution
- ✓
- Unveil hidden connections: by bridging the gap between different data layers, multi-omics network analysis can reveal previously unseen connections and dependencies, offering a holistic understanding of biological processes.
- ✓
- Enhance disease understanding: integrating data from multiple omics sources can provide deeper insights into disease mechanisms, identifying key drivers and potential therapeutic targets with greater accuracy and specificity.
5.2. Advancing Network Inference Methods
- Machine learning and deep learning algorithms are used by leveraging the power of these techniques; they can enable the development of more robust and accurate methods for inferring network structures and dynamics, especially in the context of multi-omics integration.
- Network dynamics and temporal modeling are applied by developing methods that capture the temporal evolution of networks; they will be crucial for understanding how biological systems change over time, offering insights into disease progression and cellular response to stimuli.
5.3. Personalized Medicine: A Network-Based Approach
- ✓
- Predict individual responses: understanding the unique network characteristics of each patient can enable the prediction of their response to specific treatments, guiding personalized therapeutic decisions and improving clinical outcomes.
- ✓
- Identify novel biomarkers: network analysis can help identify potential biomarkers associated with disease progression or drug response, facilitating early diagnosis, personalized treatment strategies, and improved patient monitoring.
5.4. Exploration of Dynamic and Spatial Aspects of Biological Networks
5.5. Ethical Considerations
Direction | Description | Potential Impact |
---|---|---|
Embracing the Multi-Omics Revolution | Integrate data from various sources (genomics, transcriptomics, proteomics, etc.) into network models. | Unveil hidden connections and dependencies. Enhance disease understanding and identify novel therapeutic targets [104]. |
Advancing Network Inference Methods | Develop sophisticated methods (machine learning, deep learning) to handle large and complex datasets. | Infer network structures and dynamics more accurately, especially for multi-omics data [105]. Capture the temporal evolution of networks to understand dynamic biological processes [106]. |
Personalized Medicine: A Network-Based Approach | Analyze patient-specific molecular networks to guide personalized treatment strategies. | Predict individual responses to treatments and improve clinical outcomes [107]. Identify novel biomarkers for early diagnosis, treatment selection, and patient monitoring. |
Exploration of Dynamics | Organized entities that undergo constant changes in response to internal and external stimuli. | Developing innovative experimental techniques, imaging technologies, and computational models [108]. Gain insights into the spatiotemporal dynamics of biological networks, unraveling the complex interplay between molecular components. |
Ethical Considerations | Address data privacy, security, and algorithmic bias concerns. | Ensure responsible data practices and mitigate potential biases in network analysis [109]. Promote fairness and inclusivity in the application of network analysis. |
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Salem, M.S.Z. Biological networks: An introductory review. J. Proteom. Genom. Res. 2018, 2, 41–111. [Google Scholar] [CrossRef]
- Ji, X.; Freudenberg, J.M.; Agarwal, P. Integrating biological networks for drug target prediction and prioritization. Comput. Methods Drug Repurposing 2019, 203–218. [Google Scholar]
- Junker, B.H.; Schreiber, F. Analysis of Biological Networks; Wiley Online Library: Hoboken, NJ, USA, 2008; Volume 2, ISBN 0470253460. [Google Scholar]
- Serban, M. Exploring modularity in biological networks. Philos. Trans. R. Soc. B 2020, 375, 20190316. [Google Scholar] [CrossRef] [PubMed]
- Wuchty, S.; Ravasz, E.; Barabási, A.-L. The architecture of biological networks. In Complex Systems Science in Biomedicine; Springer: Boston, MA, USA, 2006; pp. 165–181. [Google Scholar]
- Zhu, X.; Gerstein, M.; Snyder, M. Getting connected: Analysis and principles of biological networks. Genes Dev. 2007, 21, 1010–1024. [Google Scholar] [CrossRef]
- Kenhub Human Anatomy Diagram with Illustration of Human Anatomy. Available online: https://www.kenhub.com/ (accessed on 14 March 2024).
- NIH Graphic Explaining Health Information. Available online: https://newsinhealth.nih.gov/2023/12 (accessed on 14 March 2024).
- Kepes, F. Biological Networks; World Scientific: Singapore, 2007; Volume 3, ISBN 9812772367. [Google Scholar]
- Somvanshi, P.R.; Venkatesh, K.V. A conceptual review on systems biology in health and diseases: From biological networks to modern therapeutics. Syst. Synth. Biol. 2014, 8, 99–116. [Google Scholar] [CrossRef]
- Bertamini, M.; Makin, A.D.J. Brain activity in response to visual symmetry. Symmetry 2014, 6, 975–996. [Google Scholar] [CrossRef]
- Beisser, D.; Klau, G.W.; Dandekar, T.; Müller, T.; Dittrich, M.T. BioNet: An R-Package for the functional analysis of biological networks. Bioinformatics 2010, 26, 1129–1130. [Google Scholar] [CrossRef]
- Geppert, T.; Koeppen, H. Biological networks and drug discovery—Where do we stand? Drug Dev. Res. 2014, 75, 271–282. [Google Scholar] [CrossRef]
- Gonzalez-Angulo, A.M.; Hennessy, B.T.J.; Mills, G.B. Future of personalized medicine in oncology: A systems biology approach. J. Clin. Oncol. 2010, 28, 2777. [Google Scholar] [CrossRef]
- Stephanopoulos, G. Synthetic biology and metabolic engineering. ACS Synth. Biol. 2012, 1, 514–525. [Google Scholar] [CrossRef]
- McCarty, N.S.; Ledesma-Amaro, R. Synthetic biology tools to engineer microbial communities for biotechnology. Trends Biotechnol. 2019, 37, 181–197. [Google Scholar] [CrossRef] [PubMed]
- Wang, M.; Wang, H.; Zheng, H. A mini review of node centrality metrics in biological networks. Int. J. Netw. Dyn. Intell. 2022, 1, 99–110. [Google Scholar] [CrossRef]
- Artwork, C. A Piece of Creative Artwork. Available online: https://in.pinterest.com/pin (accessed on 14 March 2024).
- Shankar, N.R.; Rao, P.P.B.; Siresha, S.; Madhuri, K.U. Critical path method in a project network using ant colony optimization. Int. J. Comput. Intell. Res. 2011, 7, 7–16. [Google Scholar]
- Cottrell, W.D. Simplified program evaluation and review technique (PERT). J. Constr. Eng. Manag. 1999, 125, 16–22. [Google Scholar] [CrossRef]
- Bansal, S.; Khandelwal, S.; Meyers, L.A. Exploring biological network structure with clustered random networks. BMC Bioinform. 2009, 10, 405. [Google Scholar] [CrossRef]
- Despalatović, L.; Vojković, T.; Vukicević, D. Community structure in networks: Girvan-Newman algorithm improvement. In Proceedings of the 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia, 26–30 May 2014; pp. 997–1002. [Google Scholar]
- Yao, B.; Zhu, J.; Ma, P.; Gao, K.; Ren, X. A Constrained Louvain Algorithm with a Novel Modularity. Appl. Sci. 2023, 13, 4045. [Google Scholar] [CrossRef]
- Ghosh, S.; Halappanavar, M.; Tumeo, A.; Kalyanaraman, A.; Lu, H.; Chavarria-Miranda, D.; Khan, A.; Gebremedhin, A. Distributed louvain algorithm for graph community detection. In Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Vancouver, BC, Canada, 21–25 May 2018; pp. 885–895. [Google Scholar]
- Zhang, P.; Wang, T.; Yan, J. PageRank centrality and algorithms for weighted, directed networks. Phys. A Stat. Mech. Its Appl. 2022, 586, 126438. [Google Scholar] [CrossRef]
- Zhan, J.; Gurung, S.; Parsa, S.P.K. Identification of top-K nodes in large networks using Katz centrality. J. Big Data 2017, 4, 16. [Google Scholar] [CrossRef]
- Barnes, J.A.; Harary, F. Graph theory in network analysis. Soc. Netw. 1983, 5, 235–244. [Google Scholar] [CrossRef]
- Pavlopoulos, G.A.; Secrier, M.; Moschopoulos, C.N.; Soldatos, T.G.; Kossida, S.; Aerts, J.; Schneider, R.; Bagos, P.G. Using graph theory to analyze biological networks. BioData Min. 2011, 4, 10–27. [Google Scholar] [CrossRef]
- Bullmore, E.; Sporns, O. Complex brain networks: Graph theoretical analysis of structural and functional systems. Nat. Rev. Neurosci. 2009, 10, 186–198. [Google Scholar] [CrossRef]
- Dolev, S.; Elovici, Y.; Puzis, R. Routing betweenness centrality. J. ACM 2010, 57, 1–27. [Google Scholar] [CrossRef]
- Veremyev, A.; Prokopyev, O.A.; Pasiliao, E.L. Finding critical links for closeness centrality. INFORMS J. Comput. 2019, 31, 367–389. [Google Scholar] [CrossRef]
- Mohyedinbonab, E.; Jamshidi, M.; Jin, Y.-F. A review on applications of graph theory in network analysis of biological processes. Int. J. Intell. Comput. Med. Sci. Image Process. 2014, 6, 27–43. [Google Scholar] [CrossRef]
- Rice, J.J.; Tu, Y.; Stolovitzky, G. Reconstructing biological networks using conditional correlation analysis. Bioinformatics 2005, 21, 765–773. [Google Scholar] [CrossRef]
- Chen, N.; del Val, I.J.; Kyriakopoulos, S.; Polizzi, K.M.; Kontoravdi, C. Metabolic network reconstruction: Advances in in silico interpretation of analytical information. Curr. Opin. Biotechnol. 2012, 23, 77–82. [Google Scholar] [CrossRef]
- Fang, X.; Lloyd, C.J.; Palsson, B.O. Reconstructing organisms in silico: Genome-scale models and their emerging applications. Nat. Rev. Microbiol. 2020, 18, 731–743. [Google Scholar] [CrossRef]
- Su, G.; Morris, J.H.; Demchak, B.; Bader, G.D. Biological network exploration with Cytoscape 3. Curr. Protoc. Bioinform. 2014, 47, 8–13. [Google Scholar] [CrossRef]
- Bruns, A.; Snee, H. How to Visually Analyse Networks Using Gephi; SAGE Publications, Limited: London, UK, 2022; ISBN 1529609755. [Google Scholar]
- Papadopoulou, O.; Makedas, T.; Apostolidis, L.; Poldi, F.; Papadopoulos, S.; Kompatsiaris, I. MeVer NetworkX: Network analysis and visualization for tracing disinformation. Future Internet 2022, 14, 147. [Google Scholar] [CrossRef]
- Horvát, S.; Podkalicki, J.; Csárdi, G.; Nepusz, T.; Traag, V.; Zanini, F.; Noom, D. IGraph/M: Graph theory and network analysis for Mathematica. J. Open Source Softw. 2023, 8, 4899. [Google Scholar] [CrossRef]
- Hu, Z.; Snitkin, E.S.; DeLisi, C. VisANT: An integrative framework for networks in systems biology. Brief. Bioinform. 2008, 9, 317–325. [Google Scholar] [CrossRef]
- Gustavsen, J.A.; Pai, S.; Isserlin, R.; Demchak, B.; Pico, A.R. RCy3: Network biology using Cytoscape from within R. F1000Research 2019, 8, 1774. [Google Scholar] [CrossRef]
- Kohl, M.; Wiese, S.; Warscheid, B. Cytoscape: Software for visualization and analysis of biological networks. Data Min. Proteom. Stand. Appl. 2011, 696, 291–303. [Google Scholar]
- Zhang, W.; Thill, J.-C. Detecting and visualizing cohesive activity-travel patterns: A network analysis approach. Comput. Environ. Urban Syst. 2017, 66, 117–129. [Google Scholar] [CrossRef]
- Panditrao, G.; Bhowmick, R.; Meena, C.; Sarkar, R.R. Emerging landscape of molecular interaction networks: Opportunities, challenges and prospects. J. Biosci. 2022, 47, 24. [Google Scholar] [CrossRef] [PubMed]
- Javed, M.A.; Younis, M.S.; Latif, S.; Qadir, J.; Baig, A. Community detection in networks: A multidisciplinary review. J. Netw. Comput. Appl. 2018, 108, 87–111. [Google Scholar] [CrossRef]
- Tandon, G.; Yadav, S.; Kaur, S. Pathway modeling and simulation analysis. In Bioinformatics; Elsevier: Amsterdam, The Netherlands, 2022; pp. 409–423. [Google Scholar]
- Liu, C.; Ma, Y.; Zhao, J.; Nussinov, R.; Zhang, Y.-C.; Cheng, F.; Zhang, Z.-K. Computational network biology: Data, models, and applications. Phys. Rep. 2020, 846, 1–66. [Google Scholar] [CrossRef]
- Ovens, K.; Eames, B.F.; McQuillan, I. Comparative analyses of gene co-expression networks: Implementations and applications in the study of evolution. Front. Genet. 2021, 12, 695399. [Google Scholar] [CrossRef] [PubMed]
- Perez De Souza, L.; Alseekh, S.; Brotman, Y.; Fernie, A.R. Network-based strategies in metabolomics data analysis and interpretation: From molecular networking to biological interpretation. Expert Rev. Proteom. 2020, 17, 243–255. [Google Scholar] [CrossRef] [PubMed]
- Muzio, G.; O’Bray, L.; Borgwardt, K. Biological network analysis with deep learning. Brief. Bioinform. 2021, 22, 1515–1530. [Google Scholar] [CrossRef]
- Silverman, E.K.; Schmidt, H.H.H.W.; Anastasiadou, E.; Altucci, L.; Angelini, M.; Badimon, L.; Balligand, J.; Benincasa, G.; Capasso, G.; Conte, F. Molecular networks in Network Medicine: Development and applications. Wiley Interdiscip. Rev. Syst. Biol. Med. 2020, 12, e1489. [Google Scholar] [CrossRef]
- Saint-Antoine, M.M.; Singh, A. Network inference in systems biology: Recent developments, challenges, and applications. Curr. Opin. Biotechnol. 2020, 63, 89–98. [Google Scholar] [CrossRef] [PubMed]
- Cantó-Pastor, A.; Mason, G.A.; Brady, S.M.; Provart, N.J. Arabidopsis bioinformatics: Tools and strategies. Plant J. 2021, 108, 1585–1596. [Google Scholar] [CrossRef]
- Novelli, L.; Lizier, J.T. Inferring network properties from time series using transfer entropy and mutual information: Validation of multivariate versus bivariate approaches. Netw. Neurosci. 2021, 5, 373–404. [Google Scholar] [CrossRef] [PubMed]
- Käser, S.; Vazquez-Salazar, L.I.; Meuwly, M.; Töpfer, K. Neural network potentials for chemistry: Concepts, applications and prospects. Digit. Discov. 2023, 2, 28–58. [Google Scholar] [CrossRef] [PubMed]
- Dautle, M.; Zhang, S.; Chen, Y. scTIGER: A Deep-Learning Method for Inferring Gene Regulatory Networks from Case versus Control scRNA-seq Datasets. Int. J. Mol. Sci. 2023, 24, 13339. [Google Scholar] [CrossRef]
- Zhang, T.; Wong, G. Gene expression data analysis using Hellinger correlation in weighted gene co-expression networks (WGCNA). Comput. Struct. Biotechnol. J. 2022, 20, 3851–3863. [Google Scholar] [CrossRef]
- Mercatelli, D.; Scalambra, L.; Triboli, L.; Ray, F.; Giorgi, F.M. Gene regulatory network inference resources: A practical overview. Biochim. Biophys. Acta BBA Gene Regul. Mech. 2020, 1863, 194430. [Google Scholar] [CrossRef]
- Scutari, M.; Graafland, C.E.; Gutiérrez, J.M. Who learns better Bayesian network structures: Accuracy and speed of structure learning algorithms. Int. J. Approx. Reason. 2019, 115, 235–253. [Google Scholar] [CrossRef]
- Han, X.; Wang, B.; Situ, C.; Qi, Y.; Zhu, H.; Li, Y.; Guo, X. scapGNN: A graph neural network–based framework for active pathway and gene module inference from single-cell multi-omics data. PLoS Biol. 2023, 21, e3002369. [Google Scholar] [CrossRef]
- Ramezani, M.; Ahadinia, A.; Ziaei Bideh, A.; Rabiee, H.R. Joint Inference of Diffusion and Structure in Partially Observed Social Networks Using Coupled Matrix Factorization. ACM Trans. Knowl. Discov. Data 2023, 17, 1–28. [Google Scholar] [CrossRef]
- Powell, W.W.; White, D.R.; Koput, K.W.; Owen-Smith, J. Network dynamics and field evolution: The growth of interorganizational collaboration in the life sciences. Am. J. Sociol. 2005, 110, 1132–1205. [Google Scholar] [CrossRef]
- Cornelius, S.P.; Kath, W.L.; Motter, A.E. Realistic control of network dynamics. Nat. Commun. 2013, 4, 1942. [Google Scholar] [CrossRef] [PubMed]
- Zou, Y.; Donner, R.V.; Marwan, N.; Donges, J.F.; Kurths, J. Complex network approaches to nonlinear time series analysis. Phys. Rep. 2019, 787, 1–97. [Google Scholar] [CrossRef]
- Karakurt, H.U.; Pir, P. Integration of transcriptomic profile of SARS-CoV-2 infected normal human bronchial epithelial cells with metabolic and protein-protein interaction networks. Turk. J. Biol. 2020, 44, 168–177. [Google Scholar] [CrossRef] [PubMed]
- Galindez, G.; Sadegh, S.; Baumbach, J.; Kacprowski, T.; List, M. Network-based approaches for modeling disease regulation and progression. Comput. Struct. Biotechnol. J. 2023, 21, 780–795. [Google Scholar] [CrossRef]
- Karabekmez, M.E.; Taymaz-Nikerel, H.; Eraslan, S.; Kirdar, B. Time-dependent re-organization of biological processes by the analysis of the dynamic transcriptional response of yeast cells to doxorubicin. Mol. Omics 2021, 17, 572–582. [Google Scholar] [CrossRef] [PubMed]
- Wu, Y.; Judge, M.T.; Edison, A.S.; Arnold, J. Uncovering in vivo biochemical patterns from time-series metabolic dynamics. PLoS ONE 2022, 17, e0268394. [Google Scholar] [CrossRef] [PubMed]
- Snijders, T.A.B. Network Dynamics; Sage: Thousand Oaks, CA, USA, 2011. [Google Scholar]
- Xu, X.; Wang, W.; Hong, T.; Chen, J. Incorporating machine learning with building network analysis to predict multi-building energy use. Energy Build. 2019, 186, 80–97. [Google Scholar] [CrossRef]
- Asquith, W.H. The use of support vectors from support vector machines for hydrometeorologic monitoring network analyses. J. Hydrol. 2020, 583, 124522. [Google Scholar] [CrossRef]
- Biau, G. Analysis of a random forests model. J. Mach. Learn. Res. 2012, 13, 1063–1095. [Google Scholar]
- Dong, Y.; Huang, W.; Liu, Z.; Guan, S. Network analysis of time series under the constraint of fixed nearest neighbors. Phys. A Stat. Mech. Its Appl. 2013, 392, 967–973. [Google Scholar] [CrossRef]
- Kanakia, H.; Raundale, P.; Britto, R.; Sawardekar, R. Analysis of Social Networks using Naive Bayes. In Proceedings of the 2019 International Conference on Intelligent Computing and Control Systems (ICCS), Madurai, India, 15–17 May 2019; pp. 88–91. [Google Scholar]
- Charles, S.; Natarajan, J. Identification of Key Gene Modules and Novel Transcription Factors in Tetralogy of Fallot Using Machine Learning and Network Topological Features. Medinformatics 2023, 1, 27–34. [Google Scholar] [CrossRef]
- Bansal, M.; Goyal, A.; Choudhary, A. A comparative analysis of K-nearest neighbor, genetic, support vector machine, decision tree, and long short term memory algorithms in machine learning. Decis. Anal. J. 2022, 3, 100071. [Google Scholar] [CrossRef]
- Sebek, M.; Menichetti, G. Network Science and Machine Learning for Precision Nutrition. In Precision Nutrition; Elsevier: Amsterdam, The Netherlands, 2024; pp. 367–402. [Google Scholar]
- Ganaie, M.A.; Hu, M.; Malik, A.K.; Tanveer, M.; Suganthan, P.N. Ensemble deep learning: A review. Eng. Appl. Artif. Intell. 2022, 115, 105151. [Google Scholar] [CrossRef]
- Kufel, J.; Bargieł-Łączek, K.; Kocot, S.; Koźlik, M.; Bartnikowska, W.; Janik, M.; Czogalik, Ł.; Dudek, P.; Magiera, M.; Lis, A. What is machine learning, artificial neural networks and deep learning?—Examples of practical applications in medicine. Diagnostics 2023, 13, 2582. [Google Scholar] [CrossRef] [PubMed]
- Khemani, B.; Patil, S.; Kotecha, K.; Tanwar, S. A review of graph neural networks: Concepts, architectures, techniques, challenges, datasets, applications, and future directions. J. Big Data 2024, 11, 18. [Google Scholar] [CrossRef]
- Zhao, X.; Wu, J.; Peng, H.; Beheshti, A.; Monaghan, J.J.M.; McAlpine, D.; Hernandez-Perez, H.; Dras, M.; Dai, Q.; Li, Y. Deep reinforcement learning guided graph neural networks for brain network analysis. Neural Netw. 2022, 154, 56–67. [Google Scholar] [CrossRef]
- Veličković, P. Everything is connected: Graph neural networks. Curr. Opin. Struct. Biol. 2023, 79, 102538. [Google Scholar] [CrossRef]
- Hensel, F.; Moor, M.; Rieck, B. A survey of topological machine learning methods. Front. Artif. Intell. 2021, 4, 681108. [Google Scholar] [CrossRef]
- Rieck, B.; Sadlo, F.; Leitte, H. Topological machine learning with persistence indicator functions. In Topological Methods in Data Analysis and Visualization V: Theory, Algorithms, and Applications; Springer: Cham, Switzerland, 2020; pp. 87–101. [Google Scholar]
- Horvath, S. Weighted Network Analysis: Applications in Genomics and Systems Biology; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2011; ISBN 144198819X. [Google Scholar]
- Andrejevic, N.; Andrejevic, J.; Bernevig, B.A.; Regnault, N.; Han, F.; Fabbris, G.; Nguyen, T.; Drucker, N.C.; Rycroft, C.H.; Li, M. Machine-learning spectral indicators of topology. Adv. Mater. 2022, 34, 2204113. [Google Scholar] [CrossRef] [PubMed]
- Deng, Y.; Ye, X.; Du, X. Predictive modeling and analysis of key drivers of groundwater nitrate pollution based on machine learning. J. Hydrol. 2023, 624, 129934. [Google Scholar] [CrossRef]
- Zheng, P.-F.; Chen, L.-Z.; Guan, Y.-Z.; Liu, P. Weighted gene co-expression network analysis identifies specific modules and hub genes related to coronary artery disease. Sci. Rep. 2021, 11, 6711. [Google Scholar] [CrossRef] [PubMed]
- Jin, S.; Zeng, X.; Xia, F.; Huang, W.; Liu, X. Application of deep learning methods in biological networks. Brief. Bioinform. 2021, 22, 1902–1917. [Google Scholar] [CrossRef] [PubMed]
- Guerreiro, L.; Silva, F.N.; Amancio, D.R. Recovering network topology and dynamics from sequences: A machine learning approach. Phys. A Stat. Mech. Its Appl. 2024, 638, 129618. [Google Scholar] [CrossRef]
- Lavecchia, A. Machine-learning approaches in drug discovery: Methods and applications. Drug Discov. Today 2015, 20, 318–331. [Google Scholar] [CrossRef] [PubMed]
- Infante, T.; Del Viscovo, L.; De Rimini, M.L.; Padula, S.; Caso, P.; Napoli, C. Network medicine: A clinical approach for precision medicine and personalized therapy in coronary heart disease. J. Atheroscler. Thromb. 2020, 27, 279–302. [Google Scholar] [CrossRef] [PubMed]
- Glaab, E.; Rauschenberger, A.; Banzi, R.; Gerardi, C.; Garcia, P.; Demotes, J. Biomarker discovery studies for patient stratification using machine learning analysis of omics data: A scoping review. BMJ Open 2021, 11, e053674. [Google Scholar] [CrossRef] [PubMed]
- Vougas, K.; Krochmal, M.; Jackson, T.; Polyzos, A.; Aggelopoulos, A.; Pateras, I.S.; Liontos, M.; Varvarigou, A.; Johnson, E.O.; Georgoulias, V. Deep learning and association rule mining for predicting drug response in cancer. A personalised medicine approach. BioRxiv 2016, 70490. [Google Scholar] [CrossRef]
- Hammad, A.; Elshaer, M.; Tang, X. Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning. Math. Biosci. Eng. 2021, 18, 8997–9015. [Google Scholar] [CrossRef]
- Kurz, F.T.; Kembro, J.M.; Flesia, A.G.; Armoundas, A.A.; Cortassa, S.; Aon, M.A.; Lloyd, D. Network dynamics: Quantitative analysis of complex behavior in metabolism, organelles, and cells, from experiments to models and back. Wiley Interdiscip. Rev. Syst. Biol. Med. 2017, 9, e1352. [Google Scholar] [CrossRef] [PubMed]
- Redhu, N.; Thakur, Z. Network biology and applications. In Bioinformatics; Elsevier: Amsterdam, The Netherlands, 2022; pp. 381–407. [Google Scholar]
- Barabasi, A.-L.; Oltvai, Z.N. Network biology: Understanding the cell’s functional organization. Nat. Rev. Genet. 2004, 5, 101–113. [Google Scholar] [CrossRef] [PubMed]
- Guo, P.; Meng, C.; Zhang, S.; Cai, Y.; Huang, J.; Shu, J.; Wang, J.; Cai, C. Network-based analysis on the genes and their interactions reveals link between schizophrenia and Alzheimer’s disease. Neuropharmacology 2024, 244, 109802. [Google Scholar] [CrossRef]
- Wang, R.C.; Wang, Z. Precision medicine: Disease subtyping and tailored treatment. Cancers 2023, 15, 3837. [Google Scholar] [CrossRef] [PubMed]
- Yamada, T.; Bork, P. Evolution of biomolecular networks—Lessons from metabolic and protein interactions. Nat. Rev. Mol. Cell Biol. 2009, 10, 791–803. [Google Scholar] [CrossRef]
- Ekici, A.; Ekici, S.O. A Bayesian network analysis of ethical behavior. J. Macromark. 2016, 36, 96–115. [Google Scholar] [CrossRef]
- Tyson, J.J.; Laomettachit, T.; Kraikivski, P. Modeling the dynamic behavior of biochemical regulatory networks. J. Theor. Biol. 2019, 462, 514–527. [Google Scholar] [CrossRef] [PubMed]
- Ali, A.M.; Mohammed, M.A. A Comprehensive Review of Artificial Intelligence Approaches in Omics Data Processing: Evaluating Progress and Challenges. Int. J. Math. Stat. Comput. Sci. 2024, 2, 114–167. [Google Scholar] [CrossRef]
- Luo, M.; Zhu, J.; Jia, J.; Zhang, H.; Zhao, J. Progress on network modeling and analysis of gut microecology: A review. Appl. Environ. Microbiol. 2024, 90, e00092-24. [Google Scholar] [CrossRef]
- Dao, T.-K.; Ngo, T.-G.; Pan, J.-S.; Nguyen, T.-T.-T.; Nguyen, T.-T. Enhancing Path Planning Capabilities of Automated Guided Vehicles in Dynamic Environments: Multi-Objective PSO and Dynamic-Window Approach. Biomimetics 2024, 9, 35. [Google Scholar] [CrossRef]
- Ayar, E.S.; Dadmand, S.; Tuncbag, N. Network medicine: From conceptual frameworks to applications and future trends. IEEE Trans. Mol. Biol. Multi-Scale Commun. 2023, 9, 374–381. [Google Scholar] [CrossRef]
- Panayides, A.S.; Amini, A.; Filipovic, N.D.; Sharma, A.; Tsaftaris, S.A.; Young, A.; Foran, D.; Do, N.; Golemati, S.; Kurc, T. AI in medical imaging informatics: Current challenges and future directions. IEEE J. Biomed. Health Inform. 2020, 24, 1837–1857. [Google Scholar] [CrossRef] [PubMed]
- Sohail, S.S.; Farhat, F.; Himeur, Y.; Nadeem, M.; Madsen, D.Q.; Singh, Y.; Atalla, S.; Mansoor, W. Decoding ChatGPT: A taxonomy of existing research, current challenges, and possible future directions. J. King Saud Univ. Comput. Inf. Sci. 2023, 35, 101675. [Google Scholar] [CrossRef]
- Green, S.; Şerban, M.; Scholl, R.; Jones, N.; Brigandt, I.; Bechtel, W. Network analyses in systems biology: New strategies for dealing with biological complexity. Synthese 2018, 195, 1751–1777. [Google Scholar] [CrossRef]
Category | Approach | Description | Application Examples |
---|---|---|---|
Network Description and Visualization | Degree Centrality | Measures the number of connections a node has. | Identifying important genes, proteins, or metabolites in a network [44]. |
Betweenness Centrality | Identifies nodes that act as bridges between different network regions. | Identifying potential bottlenecks, key regulators, or drug targets [17]. | |
Closeness Centrality | Measures how quickly information can flow from one node to others. | Identifying central players in information dissemination or regulatory processes [17]. | |
Community Detection | Identifies clusters of nodes with dense connections. | Discovering functional modules, co-regulated genes, or pathways [45]. | |
Network Function and Dynamics | Shortest Path Analysis | Identifies the most efficient pathway for information/material flow. | Understanding signal transduction, metabolic pathways, or drug action [46]. |
Network Motif Analysis | Detects recurring patterns of interactions. | Identifying fundamental regulatory units, signaling modules, or potential drug targets [44]. | |
Differential Network Analysis | Compares networks under different conditions. | Identifying network alterations in disease, drug treatment, or environmental changes [47]. | |
Network Diffusion Analysis | Models information/influence propagation through the network. | Simulating drug or signal spread, studying disease progression, or understanding information flow in cellular processes [1]. | |
Integration with Other Data | Gene Co-expression Analysis | Overlaps gene expression data with interaction networks. | Identifying co-expressed genes potentially involved in the same biological process [48]. |
Network Enrichment Analysis [49] | Identifies statistically overrepresented pathways/functions within network modules. | Linking network structure to known biological functions or disease mechanisms [50]. | |
Network-Based Disease Gene Prioritization | Prioritizes candidate genes for disease association based on network connections. | Identifying novel disease genes or therapeutic targets [51]. |
Technique | Description | Application | Strengths | Limitations |
---|---|---|---|---|
Mutual information | Measures statistical dependence between variables, suggesting potential interactions. | Identifying co-expressed genes potentially involved in the same biological process. Discovering protein–protein interactions based on co-localization data [53]. | Simple and computationally efficient. Less sensitive to noise compared to correlation [54]. | May miss weak or non-linear relationships. Requires careful selection of appropriate dependence measures [55]. |
Correlation analysis | Calculates correlation coefficients to reveal associations between entities, suggesting potential interactions. | Identifying co-regulated genes in gene expression networks. Uncovering potential links between metabolites based on their abundance profiles [56]. | Easy to interpret and implement. Suitable for linear relationships [49]. | Sensitive to outliers and data scaling. May not capture non-linear dependencies [57]. |
Boolean networks | Represents biological systems as logical rules governing interactions between entities. | Simulating network dynamics and identifying critical regulatory points. Modeling cellular differentiation or signal transduction pathways [57]. | Intuitive and interpretable. Enables qualitative analysis of network behavior. | Limited to discrete states and Boolean logic, potentially oversimplifying complex systems. Can become computationally expensive for large networks. |
Bayesian networks | Probabilistic graphical models capture conditional dependencies between variables, allowing for the integration of prior knowledge and reasoning about the likelihood of specific network configurations. | Predicting gene expression levels based on regulatory network structure. Inferring missing links in protein–protein interaction networks [58]. | Incorporates prior knowledge and uncertainty. Enables probabilistic reasoning about network interactions. | Computational complexity can increase with network size. Relies on accurate prior knowledge, which may not always be available [59]. |
Matrix factorization | Decomposes network data matrices into lower-dimensional matrices, revealing hidden patterns and facilitating the identification of potential interactions. | Identifying functionally related genes or proteins based on co-occurrence patterns. Discovering hidden communities within biological networks [60]. | Reduces data dimensionality for efficient analysis. Uncovers hidden patterns in complex networks [61]. | Can be sensitive to noise and outliers in the data. Interpretation of the decomposed factors can be challenging. |
Technique | Description | Application | Strengths | Limitations |
---|---|---|---|---|
Differential network analysis | Compares networks under different conditions to identify changes in connectivity patterns. | Identifying differentially expressed genes and their interactions in disease vs. control samples. Unveiling alterations in protein–protein interaction networks upon drug treatment [65]. | Reveals changes in network structure and function under different conditions. Helps identify potential disease drivers or drug targets [66]. | Requires comparable network data from different conditions. Statistical significance testing can be complex for large networks. |
Time-series analysis | Analyzes network data collected over time points to track network evolution. | Studying the dynamic reconfiguration of metabolic networks during cellular processes. Monitoring the temporal changes in gene regulatory networks during cell cycle progression [67]. | Captures dynamic network changes at a high resolution. Provides insights into network remodeling and adaptation. | Requires extensive time-series network data, which can be expensive and time-consuming to collect. Data analysis can be computationally intensive [68]. |
Technique | Description | Application | Strengths | Limitations |
---|---|---|---|---|
Support vector machines (SVMs) | Classify network modules based on specific features to identify functionally distinct groups. | Clustering genes into co-expressed modules based on their network interactions. Classifying protein complexes based on their network topology [75]. | Effective for high-dimensional data and non-linear relationships. Relatively robust to noise and outliers [71]. | May require careful parameter tuning to achieve optimal performance. Can be computationally expensive for large datasets [76]. |
Random Forests | Predict missing interactions within a network with high accuracy. | Filling gaps in protein–protein interaction networks to improve their coverage. Predicting missing metabolic reactions to complete pathway maps [77]. | Handles data heterogeneity well. Provides ensemble learning for improved accuracy and stability [78]. | Less interpretable compared to some other techniques. Feature selection and parameter tuning can be crucial. |
Nearest neighbors | Classify data points based on their similarity to their nearest neighbors. | Link prediction, node classification [73]. | Simple to implement, interpretable results. | Sensitive to noise, suffers from the curse of dimensionality. |
Naive Bayes | Probabilistic classifier based on Bayes’ theorem. | Spam filtering, social network analysis [74]. | Efficient for large datasets, handles categorical data well | Assumes independence of features, may not be suitable for complex relationships. |
Deep learning | Utilizes deep neural network architectures to learn complex relationships from network data. | Identifying potential drug targets by analyzing protein–protein interaction networks. Classifying different disease subtypes based on their gene regulatory network properties [44]. Predicting the temporal evolution of biological networks. | Capable of capturing intricate non-linear relationships and hidden features. Can handle high-dimensional and complex network data. | Requires large amounts of data for training, which may not always be available [79]. Interpretability and explainability can be challenging. |
Graph neural networks (GNNs) | Specialized deep learning models designed specifically for analyzing graph-structured data, like networks. | Identifying communities of functionally related genes within co-expression networks. Predicting drug–target interactions based on protein–protein interaction networks. Classifying different cell types based on their gene regulatory network features [44]. | Tailored for network data, incorporating node features and network topology. Can capture complex dependencies and relationships within networks [80]. | Relatively new area with ongoing development. May require specialized hardware and software for training and implementation. |
Application | Description | Examples |
---|---|---|
Understanding Disease Mechanisms | Identify key drivers and pathways. Uncover disease modules. Explore network dynamics. | Identifying key genes involved in cancer progression by analyzing cancer gene networks. Discovering co-expressed genes potentially contributing to Alzheimer’s disease. Studying the temporal changes in protein–protein interaction networks during viral infection [99]. |
drug discovery | Identify potential drug targets. Predict drug–target interactions. Repurpose existing drugs. | Prioritizing candidate drug targets based on their network connectivity in disease networks [2]. Predicting the potential side effects of a drug candidate by analyzing its interactions within the network. Identifying existing drugs that may be effective for treating a new disease based on network analysis. |
personalized medicine | Stratify patients based on network profiles. Predict patient response to treatment. Identify biomarkers. | Grouping patients with similar network patterns associated with a specific disease for targeted therapy [100]. Predicting an individual’s response to chemotherapy based on their tumor gene expression network. Identifying potential biomarkers for early diagnosis of a disease by analyzing the network properties of relevant genes. |
Additional Applications | Evolutionary biology: Studying network evolution across species. Systems biology: Integrating diverse data into network models. Ecology and environmental biology: analyzing species interactions using networks. | Understanding the evolution of protein–protein interaction networks in different primates [101]. Building a network model integrating gene expression, protein–protein interaction, and metabolic data to study cellular processes. Analyzing the network of interactions between predator and prey species to understand ecosystem dynamics. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Nguyen, T.-T.; Dao, T.-K.; Pham, D.-T.; Duong, T.-H. Exploring the Molecular Terrain: A Survey of Analytical Methods for Biological Network Analysis. Symmetry 2024, 16, 462. https://doi.org/10.3390/sym16040462
Nguyen T-T, Dao T-K, Pham D-T, Duong T-H. Exploring the Molecular Terrain: A Survey of Analytical Methods for Biological Network Analysis. Symmetry. 2024; 16(4):462. https://doi.org/10.3390/sym16040462
Chicago/Turabian StyleNguyen, Trong-The, Thi-Kien Dao, Duc-Tinh Pham, and Thi-Hoan Duong. 2024. "Exploring the Molecular Terrain: A Survey of Analytical Methods for Biological Network Analysis" Symmetry 16, no. 4: 462. https://doi.org/10.3390/sym16040462
APA StyleNguyen, T. -T., Dao, T. -K., Pham, D. -T., & Duong, T. -H. (2024). Exploring the Molecular Terrain: A Survey of Analytical Methods for Biological Network Analysis. Symmetry, 16(4), 462. https://doi.org/10.3390/sym16040462