Next Article in Journal
Feasibility and Tolerability of Daily Microgreen Consumption in Healthy Middle-Aged/Older Adults: A Randomized, Open-Label, Controlled Crossover Trial
Previous Article in Journal
Immature Sword Bean (Canavalia gladiata) Pod Alleviates Allergic Rhinitis (A Double-Blind Trial) Through PI3K/Akt/mTOR Signaling
Previous Article in Special Issue
Association of Mucin-Degrading Gut Microbiota and Dietary Patterns with Colonic Transit Time in Constipation: A Secondary Analysis of a Randomized Clinical Trial
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Data-Driven Approach to Enhance the Prediction of Bacteria–Metabolite Interactions in the Human Gut Microbiome Using Enzyme Encodings and Metabolite Structural Embeddings

by
Gopal Srivastava
1 and
Michal Brylinski
1,2,*
1
Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
2
Center for Computation and Technology, Louisiana State University, Baton Rouge, LA 70803, USA
*
Author to whom correspondence should be addressed.
Nutrients 2025, 17(3), 469; https://doi.org/10.3390/nu17030469
Submission received: 26 December 2024 / Revised: 22 January 2025 / Accepted: 24 January 2025 / Published: 28 January 2025
(This article belongs to the Special Issue Nutrition, Gut Microbiota and Immunity)

Abstract

Background: The human gut microbiome is critical for host health by facilitating essential metabolic processes. Our study presents a data-driven analysis across 312 bacterial species and 154 unique metabolites to enhance the understanding of underlying metabolic processes in gut bacteria. The focus of the study was to create a strategy to generate a theoretical (negative) set for binary classification models to predict the consumption and production of metabolites in the human gut microbiome. Results: Our models achieved median balanced accuracies of 0.74 for consumption predictions and 0.95 for production predictions, highlighting the effectiveness of this approach in generating reliable negative sets. Additionally, we applied a kernel principal component analysis for dimensionality reduction. The consumption model with a polynomial kernel, and the production model with a radial basis function with 32 reduced features, showed median accuracies of 0.58 and 0.67, respectively. This demonstrates that biological information can still be captured, albeit with some loss, even after reducing the number of features. Furthermore, our models were validated on six previously unseen cases, achieving five correct predictions for consumption and four for production, demonstrating alignment with known biological outcomes. Conclusions: These findings highlight the potential of integrating data-driven approaches with machine learning techniques to enhance our understanding of gut microbiome metabolism. This work provides a foundation for creating bacteria–metabolite datasets to enhance machine learning-based predictive tools, with potential applications in developing therapeutic methods targeting gut microbes.
Keywords: bacteria–metabolite interactions; enzyme–metabolite interactions; theoretical (negative) dataset generation; kernel principal component analysis; enzyme encodings; chemical embeddings; chemical metabolism; human gut microbiome bacteria–metabolite interactions; enzyme–metabolite interactions; theoretical (negative) dataset generation; kernel principal component analysis; enzyme encodings; chemical embeddings; chemical metabolism; human gut microbiome

Share and Cite

MDPI and ACS Style

Srivastava, G.; Brylinski, M. A Data-Driven Approach to Enhance the Prediction of Bacteria–Metabolite Interactions in the Human Gut Microbiome Using Enzyme Encodings and Metabolite Structural Embeddings. Nutrients 2025, 17, 469. https://doi.org/10.3390/nu17030469

AMA Style

Srivastava G, Brylinski M. A Data-Driven Approach to Enhance the Prediction of Bacteria–Metabolite Interactions in the Human Gut Microbiome Using Enzyme Encodings and Metabolite Structural Embeddings. Nutrients. 2025; 17(3):469. https://doi.org/10.3390/nu17030469

Chicago/Turabian Style

Srivastava, Gopal, and Michal Brylinski. 2025. "A Data-Driven Approach to Enhance the Prediction of Bacteria–Metabolite Interactions in the Human Gut Microbiome Using Enzyme Encodings and Metabolite Structural Embeddings" Nutrients 17, no. 3: 469. https://doi.org/10.3390/nu17030469

APA Style

Srivastava, G., & Brylinski, M. (2025). A Data-Driven Approach to Enhance the Prediction of Bacteria–Metabolite Interactions in the Human Gut Microbiome Using Enzyme Encodings and Metabolite Structural Embeddings. Nutrients, 17(3), 469. https://doi.org/10.3390/nu17030469

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop