1. Introduction
Selenoprotein P (SELENOP), an integral component of selenium metabolism synthesized and secreted by hepatocytes, has garnered escalating attention for its multifaceted roles in diverse physiological processes. It serves as a pivotal antioxidant, facilitating selenium transport from the liver to tissues and safeguarding against oxidative stress [
1]. The expanding body of evidence suggests an intricate correlation between SELENOP expression and diverse aspects of health and disease. For example, modulation of SELENOP has been linked to obesity and diabetes, attributed to its association with insulin resistance and glucose metabolism [
2,
3]. SELENOP, central to thyroid hormone metabolism, influences the activation and regulation of thyroid hormones, while indirectly supporting thyroid function by contributing to the overall selenium supply and ensuring the proper activity of selenoproteins, including deiodinases crucial for thyroid hormone metabolism [
4]. Additionally, SELENOP, with its selenium-containing compounds, could modulate sexual hormonal pathways, affecting the delicate balance of blood pressure regulation in women. Elevated SELENOP levels in females may influence blood pressure differently than in males, with estrogen’s vasodilatory effects impacting endothelial function and vascular tone [
5].
Moreover, conflicting evidence, particularly in advanced liver disease, indicates lowered circulating selenium and SELENOP levels in hepatic diseases. Recent studies have unveiled connections between aberrant SELENOP levels and metabolic dysfunctions like non-alcoholic fatty liver disease [
1]. For example, lower SELENOP levels have been observed in patients with definite non-alcoholic steatohepatitis [
6]. Furthermore, lower circulating SELENOP levels [
7] and hepatic mRNA SELENOP expression were observed in patients with cirrhosis [
8].
Also, individuals diagnosed with hepatocellular carcinoma (HCC) exhibited decreased hepatic mRNA SELENOP expression in comparison to both those with liver cirrhosis and individuals with normal liver [
8]. Another study reveals concentrations of selenium and SELENOP falling below healthy ranges in HCC patients, suggesting the plausible utility of SELENOP as a biomarker for monitoring and prognostic purposes during convalescence [
9]. This underscores the significance of serum SELENOP as an established biomarker, not only for detecting selenium levels, but also for its ready detection in blood samples, making it particularly relevant in the context of liver-related conditions [
10]. Thus, we postulate that SELENOP holds promise as a novel biomarker for prognosticating overall survival and predicting Ragnum hypoxia scores (a quantitative measure of hypoxia in tissues, derived from the expression levels of hypoxia-responsive genes [
11]). Notwithstanding the longstanding utilization of serum alpha-fetoprotein (AFP) in HCC diagnosis and its historical inclusion in international guidelines, concerns regarding its diagnostic accuracy have prompted its exclusion from updated guidelines. However, debate persists regarding the continued utility of AFP [
12]. This underscores the imperative for alternative biomarkers such as SELENOP, which may offer enhanced prognostic capabilities, thereby addressing potential limitations associated with AFP.
In addition, Hypoxia-Inducible Factor 1 Alpha (HIF1A) emerges as a pivotal marker for hypoxia across clinical and research domains. Functioning as a transcription factor, HIF1A orchestrates the expression of genes crucial for cellular adaptation to low oxygen levels. Elevated HIF1A expression in cancer frequently correlates with tumour hypoxia, exerting profound influences on tumour progression, metastasis, and resistance to therapeutic response. Immunohistochemistry and molecular assays represent standard methodologies for quantifying HIF1A levels, facilitating the assessment of tumour hypoxia [
13]. Moreover, hypoxia reduces SELENOP export and alters selenium metabolism to favour the production of essential selenoproteins like glutathione peroxidase 4; low levels of SELENOP in the blood could indicate an adaptive response to hypoxic conditions. Monitoring SELENOP levels might help in assessing the extent of hypoxia and the body’s selenium status in various pathological conditions [
14]. Therefore, our exploration of SELENOP as a potential biomarker for predicting Ragnum hypoxia scores aligns with the imperative to identify robust markers capable of informing therapeutic strategies and patient outcomes in the context of tumour hypoxia, thereby complementing the established role of HIF1A.
In general, this study aims to comprehensively investigate SELENOP’s diverse roles in liver cancer, guided by the rationale that its multifaceted functions warrant exploration using machine learning approaches for the discovery of novel biomarkers in HCC translational medicine [
15]. Recent findings indicate SELENOP’s potential as a biomarker for monitoring hepatic diseases, including HCC, and as a prognostic indicator, potentially surpassing traditional markers like serum AFP. Additionally, exploring SELENOP’s correlation with Ragnum hypoxia scores complements HIF1A’s role and aids in identifying robust therapeutic indicators. Leveraging data from The Cancer Genome Atlas (TCGA), this study seeks to bridge existing knowledge gaps by evaluating SELENOP expression across various disease stages, grades, and racial and gender groups in liver cancer. It aims to ascertain SELENOP’s potential as both a prognostic marker for overall survival and a predictor for hypoxia. Furthermore, the investigation aims to unravel the intricate connections between SELENOP and hormonal/metabolic biomarkers, shedding light on the molecular mechanisms underlying SELENOP’s involvement in liver cancer for the development of novel diagnostic and therapeutic strategies.
2. Methods
The study utilized mRNA expression data from the Liver Hepatocellular Carcinoma (LIHC) collection in TCGA via cBioPortal (
https://www.cbioportal.org/, accessed on 19 December 2023), encompassing 372 patients diagnosed with HCC. Genetic data were log-transformed mRNA expression z-scores, calculated relative to normal samples based on log RNA Seq V2 RSEM values to ensure a normalized and standardized representation of mRNA expression levels. The Shapiro–Wilk test assessed the normality of the “SELENOP” variable (W = 0.93522,
p-value = 1.578 × 10
−11), indicating a significant departure from the normal distribution.
2.1. Python Programming
The methodology employed Python alongside essential libraries, including NumPy (version 1.23.5), Pandas (version 1.5.3), Matplotlib (version 3.7.0), Seaborn (version 0.12.2), and Statsmodels (version 0.13.5). The dataset, stored as an Excel file, was initially imported using Pandas for comprehensive data manipulation and analysis. Preprocessing steps, such as converting non-numeric values to NaN and subsequently removing corresponding rows with missing data, were conducted. Statistical analysis involved Scipy’s Kruskal–Wallis test to evaluate differences among groups, generating a test statistic and a p-value, with a low p-value indicating significant differences between groups (α = 0.05). Additionally, post hoc tests, such as Dunn’s test or pairwise Mann–Whitney U tests, were applied to conduct pairwise comparisons between groups, elucidating specific group differences in “SELENOP” levels across different categories such as grade, race, category, and stage. The relationship between “SELENOP” and a derived “Category” variable, where categorization was based on “Sex” and a −0.5 to 0.5 “SELENOP” threshold of log RNA expression, was analyzed. Visualizations were addressed through Seaborn and Matplotlib, with Seaborn’s boxplot functions employed to visualize the distribution of “SELENOP” values across distinct categories. Jittered points were overlaid on the boxplots to enhance data visibility. The resulting multipanel boxplots were optimized for publication standards using Matplotlib.
The analysis also focused on exploring relationships between variables (SELENOP, Ragnum Hypoxia Score and Overall Survival) through regression modelling using Generalized Linear Models (GLM) from the Statsmodels library, specifically employing Gaussian family functions. Visualizations were crafted with Seaborn and Matplotlib, with each plot annotated to exhibit statistical metrics like p-values, Bayesian Information Criterion (BIC), Akaike Information Criterion (AIC), and pseudo-R-squared values. Subsequently, the analysis transitioned towards predictive modelling utilizing machine learning techniques. Although the provided code snippet lacks explicit implementation of machine learning algorithms such as Support Vector Regression or Random Forest Regression, it is essential to specify and train these models using relevant features like “SELENOP”, “Ragnum Hypoxia Score”, and “HIF1A” to predict outcomes such as “Overall Survival” or “Ragnum Hypoxia Score”. To ensure the accuracy and effectiveness of the analysis, it is crucial to compute evaluation metrics like Mean Squared Error (MSE), R-squared, or cross-validated performance to assess the predictive accuracy of the models.
2.2. R Programming
Correlation and multiple regression analyses were conducted using the R programming language, version 4.3.1. Considering the non-normal distribution, subsequent statistical tests utilized non-parametric methods to ensure robustness in the analysis. To maintain analysis integrity, missing values and non-numeric data were systematically omitted from the dataset, mitigating potential biases and inaccuracies. Spearman correlations among selected variables that exhibited highly significant concurrence (“SELENOP”, “androgen receptor (AR)”, “Estrogen Receptor 1 (ESR1)”, “Thyroid hormone receptor β (THRB)”, “peroxisome proliferator-activated receptor α (PPARA)”, “Apolipoprotein C3 (APOC3)” and “Apolipoprotein 5 (APOA5)”) were conducted using R with the “Spearman” method from the psych package. The correlation matrix was printed, and a scatterplot matrix with correlation values was generated using the same package.
A conditional check verified sufficient data points for linear regression analysis. If met, a multiple linear regression model was created using the lm function from the stats package, regressing “SELENOP” on “ESR1”, “THRB”, “AR”, and “APOC3”.
4. Discussion
In summary, SELENOP displays nuanced relationships across diverse disease stages and grades, racial and gender groups, and overall survival. Noteworthy distinctions in SELENOP levels among specific subgroups, including varying cancer grades and different racial or gender categories, accentuate the complexity of its involvement in liver cancer. These findings imply a potentially distinct role for SELENOP in diverse HCC subpopulations. Notably, SELENOP levels were lower in high-grade (G3–G4) compared to low-grade (G1) cancers, suggesting a correlation with cancer severity. Racial disparities were observed, with individuals of Asian descent exhibiting lower SELENOP expression than Black or African American individuals. Gender-specific variations were evident, with female patients, particularly in altered states, displaying lower SELENOP levels than males. The Spearman correlation matrix illuminates compelling connections between SELENOP and biomarkers involved in both hormonal dynamics and lipid metabolism. Robust positive correlations are evident with key hormonal markers such as AR (ρ = 0.668), ESR1 (ρ = 0.547), and THRB (ρ = 0.424), underscoring the intricate involvement of SELENOP in hormonal pathways within the realm of liver cancer. Additionally, the matrix reveals strong-to-moderate positive correlations between SELENOP and biomarkers implicated in lipid metabolism, further enriching our understanding. Noteworthy associations include those with PPARA (ρ = 0.507), APOC3 (ρ = 0.448), and APOA5 (ρ = 0.485). Furthermore, the subsequent robust regression model not only corroborates these intricate correlations but accentuates the substantial impact of both hormonal and lipid metabolic factors on SELENOP mRNA expression in the context of liver cancer. The model’s high significance, well-defined coefficients, and strong goodness-of-fit metrics collectively underscore the complexity and efficacy of the associations observed in the Spearman correlation matrix.
In addition, through advanced statistical analyses and machine learning techniques, we have demonstrated a statistically significant association between SELENOP expression levels and overall survival rates among HCC patients, highlighting its potential as a prognostic indicator. However, our investigation also reveals the complexity of SELENOP’s predictive capabilities, with the machine learning model for overall survival exhibiting limited performance; this underscores the complexity of HCC prognosis, which likely involves multifactorial interactions beyond SELENOP expression alone. Consequently, while SELENOP shows promise as a prognostic biomarker, its predictive utility may be enhanced when integrated with other clinical, pathological, and molecular factors.
In contrast, the model for hypoxia demonstrated moderate efficacy, shedding light on SELENOP’s predictive potential for hypoxia in HCC, and offering valuable insights into its role in tumour microenvironment dynamics.
4.1. Biological Plausibility and Mechanistic Insights
The multifaceted nature of SELENOP in cancer encompasses both selenium transport and antioxidant functions. Unlike most selenoproteins, SELENOP, primarily produced in the liver, is known for its roles in selenium transport. Incorporating selenium into up to 10 selenocysteine residues, SELENOP is secreted into plasma, reaching distant tissues before lysosomal degradation, and contributing to the synthesis of other selenoproteins (e.g., GPXs) [
16]. While SELENOP is often a reliable marker for overall selenium levels [
17], epidemiological studies reveal varied correlations with cancer, often with decreased levels observed in several tumour types, including hepatobiliary cancer [
17]. Our analysis also confirms the same trend in HCC as well.
The central role of SELENOP in regulating thyroid hormones [
4] and sex-specific hormones like estrogen have already been discussed [
5]. In our study, the strong positive correlations with biomarkers associated with hormonal changes, such as AR, ESR1, and THRB, suggest a role in estrogenic and thyroid dynamics.
In the context of lipid metabolism, several inconsistent reports show either negative or positive correlations between SELENOP and lipoproteins (e.g., HDL and LDL) [
18]. SELENOP engages in cellular uptake through apoER2 and megalin receptors, both part of the lipoprotein receptor family. ApoER2 mediates endocytosis in the brain and testicles, while megalin serves as the primary receptor in the kidneys. These receptors, with additional functions impacting tumour growth, highlight the complexity of SELENOP’s role. ApoER2 regulates cell motility, and increased megalin expression influences tumour cell proliferation [
16]. The observed correlation of reduced SELENOP levels with fatty liver disease has also been documented [
1,
6].
APOA5, exclusively synthesized by the liver, plays a vital role in triglyceride metabolism, being associated with chylomicrons, LDL, and HDL in plasma [
19]. APOC3 is also a widely known key player in triglyceride metabolism [
20]. PPARA similarly plays a vital role in lipid metabolism [
21]. To our knowledge, our study is the inaugural report establishing a robust correlation and association between SELENOP and markers of lipid and triglyceride metabolism in HCC. The identified connections with key lipid/triglyceride metabolism biomarkers, namely PPARA, APOC3, and APOA5, suggest the involvement of SELENOP in lipid homeostasis within the context of liver cancer.
4.2. Clinical Implications
The identified associations between SELENOP and various clinical parameters in liver cancer underscore its potential as a biomarker for diagnostic, prognostic, and therapeutic purposes [
9]. Furthermore, the detectability of SELENOP in serum and/or plasma underscores its potential as a non-invasive liquid biopsy marker, presenting clinicians with a minimally intrusive avenue for diagnostic insights [
10]. The consistent expression profile of SELENOP across distinct stages of HCC, as revealed by this analysis, suggests its promise as a diagnostic biomarker, particularly in the early stages of the disease. The stability of SELENOP levels across disease stages, coupled with its presence in easily accessible biofluids, positions SELENOP as a prospective asset in advancing early detection strategies for HCC [
22]. More importantly, SELENOP shows promise as a biomarker for predicting hypoxia, potentially rivalling established markers such as HIF1A.
In addition, based on our results, monitoring SELENOP levels alongside key biomarkers, including hormonal markers (AR, ESR1, THRB) and those associated with lipid/triglyceride metabolism (PPARA, APOC3, APOA5), may offer a comprehensive approach for refining risk stratification and guiding personalized treatment strategies in liver cancer. Regular assessment of SELENOP, in conjunction with these relevant biomarkers, holds promise for improving the effectiveness of personalized treatment approaches and ultimately enhancing patient outcomes.
Integrating SELENOP into routine clinical practice could revolutionize liver cancer diagnostics and prognosis. As a liquid biopsy marker, SELENOP enables non-invasive monitoring of disease status and treatment response, facilitating timely, tailored interventions that improve outcomes and reduce costs.
Additionally, SELENOP’s role in predicting hypoxia could enhance HCC management by identifying patients at higher risk, allowing for early, targeted therapies that mitigate hypoxia’s adverse effects, and improving survival and quality of life.
Moreover, combining SELENOP with existing biomarkers could create a comprehensive diagnostic tool, enhancing prognostication and personalized treatment. Future research should validate these findings in diverse populations and explore SELENOP’s clinical applications.
4.3. Limitations
Some limitations should be considered when interpreting the results of this study. First, the cross-sectional nature of TCGA data limits the establishment of causal relationships between SELENOP and various factors in liver cancer [
23]. Longitudinal genetic analysis unveils changes in disease phenotypes over time, refining disease status definitions. Repeat measurements enhance understanding of disease progression, capturing time-varying covariates and interactions often overlooked in cross-sectional studies focused on a single time point [
24].
Additionally, the reliance on mRNA expression data from TCGA introduces inherent limitations, such as potential variations in sample collection methods and platforms [
23]. Also, the dataset’s retrospective nature lacks specific details on lifestyle factors and comorbidities like hepatitis and cirrhosis that could influence SELENOP expression and impact the result as confounders.
4.4. Future Directions
In future, investigating SELENOP’s dynamic interactions with hormonal and lipid/triglyceride metabolism biomarkers is vital for nuanced comprehension of its mechanistic involvement in liver cancer. Scrutinizing correlations with specific molecular entities relevant to hormonal dynamics (e.g., AR, ESR1, THRB) and lipid/triglyceride metabolism (e.g., PPARA, APOC3, APOA5) holds promise for elucidating underlying molecular pathways. Integrating multi-omics data and advanced analytical methodologies will provide a comprehensive insight into SELENOP’s associations within these pathways, potentially identifying therapeutic targets for HCC and refining personalized treatment approaches.
To bolster the prognostic value of SELENOP for overall survival and its diagnostic utility for hypoxia in HCC, validating its expression in serum is paramount. For instance, in a multicentered study, SELENOP was inversely associated with all-cause mortality following breast cancer diagnosis [
25]. Thus, further large-scale prospective studies correlating serum SELENOP levels with patient outcomes, including overall survival, are essential, particularly in liver cancer. Longitudinal analyses can track SELENOP variations over time and their association with disease progression. Integration of serum SELENOP levels with clinical parameters can enhance prognostic accuracy. Additionally, validating SELENOP as a diagnostic marker for hypoxia through correlation with imaging and hypoxia biomarkers is crucial. Prospective trials assessing SELENOP-guided treatment strategies’ impact on patient outcomes are necessary.
5. Conclusions
Our comprehensive study provides a thorough examination of SELENOP’s involvement in HCC, revealing its diverse associations across disease stages and grades, racial and gender groups, and overall survival outcomes. Significant variations in SELENOP expression levels were observed among specific subgroups, particularly across different cancer grades and demographic categories, highlighting its nuanced role in HCC pathogenesis. Lower SELENOP levels in high-grade cancers suggest a potential correlation with disease severity, while disparities across racial and gender groups underscore the complexity of SELENOP’s influence in diverse patient populations. Additionally, robust regression analyses uncover connections between SELENOP and biomarkers involved in hormonal dynamics and lipid metabolism, shedding light on its mechanisms of action in HCC progression. Notably, our study identifies SELENOP as a potential prognostic biomarker, with significant associations observed with overall survival rates among HCC patients. While its predictive capabilities are moderately limited, SELENOP shows promise as a biomarker for predicting hypoxia, potentially rivalling established markers such as HIF1A. Nonetheless, it is essential to acknowledge the limitations of our study, including the cross-sectional nature of TCGA data and potential biases inherent in mRNA expression analyses. Future research should focus on validating SELENOP’s prognostic and diagnostic utility in serum through large-scale prospective studies while integrating multi-omics data to elucidate its intricate molecular pathways in HCC.