Next Article in Journal
Population-Level Cell Trajectory Inference Based on Gaussian Distributions
Next Article in Special Issue
The Macrophage–Fibroblast Dipole in the Context of Cardiac Repair and Fibrosis
Previous Article in Journal
Single-Cell RNA-Seq Analysis Links DNMT3B and PFKFB4 Transcriptional Profiles with Metastatic Traits in Hepatoblastoma
Previous Article in Special Issue
Novel Biomarkers and Advanced Cardiac Imaging in Aortic Stenosis: Old and New
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fibroblast-Specific Protein-Protein Interactions for Myocardial Fibrosis from MetaCore Network

by
Klaus M. Frahm
1,
Ekaterina Kotelnikova
2,
Oksana Kunduzova
3,4,*,† and
Dima L. Shepelyansky
1,*,†
1
Laboratoire de Physique Théorique, Université de Toulouse, CNRS, UPS, 31062 Toulouse, France
2
Clarivate Analytics, 08025 Barcelona, Spain
3
National Institute of Health and Medical Research (INSERM) U1048, 31432 Toulouse, Cedex 4, France
4
Université de Toulouse, UPS, 31062 Toulouse, France
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Biomolecules 2024, 14(11), 1395; https://doi.org/10.3390/biom14111395
Submission received: 4 September 2024 / Revised: 9 October 2024 / Accepted: 29 October 2024 / Published: 31 October 2024

Abstract

:
Myocardial fibrosis is a major pathologic disorder associated with a multitude of cardiovascular diseases (CVD). The pathogenesis is complex and encompasses multiple molecular pathways. Integration of fibrosis-associated genes into the global MetaCore network of protein-protein interactions (PPI) offers opportunities to identify PPI with functional and therapeutic significance. Here, we report the generation of a fibrosis-focused PPI network and identification of fibroblast-specific arbitrators driving reparative and reactive myocardial fibrosis. In TGF- β -mediated fibroblast activation, developed network analysis predicts new regulatory mechanisms for fibrosis-associated genes. We introduce an efficient Erdös barrage approach to suppress activation of a number of fibrosis-associated nodes in order to reverse fibrotic cascades. In the network model each protein node is characterized by an Ising up or down spin corresponding to activated or repairing state acting on other nodes being initially in a neutral state. An asynchronous Monte Carlo process describes fibrosis progression determined by a dominant action of linked proteins. Our results suggest that the constructed Ising Network Fibrosis Interaction model offers network insights into fibrosis mechanisms and can complement future experimental efforts to counteract cardiac fibrosis.

1. Introduction

Myocardial fibrosis is a major pathologic disorder associated with a multitude of cardiovascular diseases (CVD) [1]. In the heart, tissue fibrotic remodeling is characterized by abnormal fibroblast activation and excessive extracellular matrix (ECM) protein accumulation [2]. Although a number of factors have been implicated in orchestrating the fibrotic response, tissue fibrosis is dominated by a central mediator: transforming growth factor- β (TGF- β ) [3]. Sustained TGF- β production leads to a continuous cycle of growth factor signaling and deregulated matrix turnover [4]. TGF- β binds a heteromeric receptor complex composed of type I and type II serine/threonine kinase receptors that activate smad-dependent gene transcription [5]. The converging lines of evidence suggest that activation of the Smad 3 cascade plays an essential role in extracellular matrix protein gene expression and regulates fibrosis tissue deposition in fibrotic remodeling of the infarcted heart [6]. Abnormalities in TGF- β /Smad3 signaling underlie inflammatory diseases and promote tumor emergence. TGF- β is also central to immune suppression within the tumor microenvironments, and several studies have reported roles in tumor immune evasion and poor responses to cancer immunotherapy [7]. In addition to activation of Smad-dependent signaling, the TGF- β receptor complex can also modulate TGF- β -activated kinase 1 (TAK1 or MAP3K7) [8].
However, despite intensive research, the biomolecules that orchestrate fibrosis are still poorly understood and as a result, effective strategies for limiting fibrosis are lacking [2,4]. Considering the complex heterogeneity of fibrosis, research strategy on a system-level understanding of the disease using mathematical modeling approaches is a driving force to dissect the complex processes involved in fibrotic disorders. Recently, we have reproduced the classic hallmarks of aberrant cardiac fibroblast activation leading to fibrosis and provided a powerful toolbox for characterization of cardiac fibroblast activation [9]. Although the pathogenesis of fibrotic remodeling has not been well identified, accumulated evidence suggests that multiple genes/proteins and their interactions play important roles in disease scenario [10].
Fibrotic remodeling is a complex process caused by genetic abnormalities that alter protein-protein interactions (PPI) [9,11]. In the heart, PPI interfaces represent a highly promising, although challenging, class of potential targets for therapeutic options. In fibrosis, PPI form signaling nodes and hubs that transmit pathophysiological cues along molecular networks to achieve an integrated biological output, thereby promoting fibrogenesis and fibrosis progression. Thus, pathway perturbation, through disruption of PPI critical for fibrosis, offers a novel and effective strategy for curtailing the transmission of pro-fibrotic signals. Deciphering of fibrosis-specific PPI would uncover new mechanisms of fibrotic signaling for therapeutic interrogation. Thus a mathematical analysis of the global PPI MetaCore network [12] can provide useful deep insights in the understanding of fibrosis progression processes.
In this work we use the PPI MetaCore network [12] to perform a mathematical modeling of fibrosis progression. The results reported in [9] determined the protein pro-fibrotic responses as a feedback on TGF protein stimulation. Thus these results [9] support the known fact that the TGF protein plays an important role in fibrosis tissue [3] and establish proteins with most positive and most negative response in cardiac fibroblasts.
A variety of biological applications of the MetaCore network are described in [13,14]. The general statistical properties of the MetaCore network are presented in [15]. The applications of Google matrix algorithms to the fibrosis PPI based on the MetaCore network are described in [16] using the fibrosis responses obtained in [9]. We note that the Google matrix algorithms [17,18,19] find a variety of useful applications in modern complex networks [20] including World Wide Web, Wikipedia, world trade etc.
In our opinion an important advantage of the MetaCore network is that it presents a global network structure of PPI with about 40,000 proteins and important molecules. This allows to perform a mathematical modeling of fibrosis progression over the whole network. Thus in this work we present such a modeling which assumes that at an initial stage there is a certain number of proteins which activate a fibrosis progression of other proteins via their network links to other proteins. These initial proteins are always marked as red network nodes and assumed to be always red permanently generating fibrosis progression (thus being always red or with fixed Ising spin up with σ i = 1 ). Among the other proteins, we assume that there is an initial group of proteins which inhibit fibrosis progression being of fixed blue color (thus being always blue or with Ising spin down with σ i = 1 ). The remaining other proteins (or network nodes) are assumed to be in neutral state of white color or spin being zero ( σ i = 0 ). The fibrosis progression is modeled by the asynchronous Monte Carlo process when at each step a spin of a given node (which is not fixed) is determined as the total spin sign of other nodes linked to it with certain weights of Markov chain transitions. In a certain sense this rule corresponds to a situation when a person takes an average opinion of his network friends linked to him. After many of such Monte Carlo steps the system converges to a steady-state when all network spins obtain a fixed polarization being up (red or fibrosis activated) or down (blue or healthy protected) and some of the nodes may remain at zero neutral spin state (white) being practically disconnected from the red or blue network nodes.
We note that the above Monte Carlo steps describe a process of opinion formation and the global vote of nodes (society members) between red and blue options. For complex social networks [20] there are numerous studies of opinion formation, reviewed in [21]. Various interesting features of voter models and opinion formation on networks had been obtained and described in [21,22,23,24,25,26,27]. Recently it was argued that the opinion formation on the world trade network can be linked with country preference to trade in one or another currency (e.g., US dollar or hypothetical BRICS currency) [28]. The important new element appeared in [28] is that opinion of certain network nodes (countries) is considered to be fixed since it is assumed that they prefer to trade always with fixed currency of USD or BRICS. A further development of this approach with nodes of fixed opinion is done in [29] for Wikipedia networks when initially there are two groups of nodes of fixed red and blue opinions but all other nodes have neutral state (white opinion or spin zero) and their fixed opinion emerges in the result of asynchronous Monte Carlo process. This process is viewed as the Ising Network Opinion Formation (INOF) model. In fact this INOF model is directly suitable for the modeling of fibrosis progression in the PPI MetaCore network and we are using it here with certain slight modifications described in the next Section. We call this modified system as the Ising Network FIbrosis (INFI) model. It should be pointed out that a somewhat similar asynchronous Monte Carlo process is used in models of associative memory (see e.g., [30,31]).
In this work, using the results reported in [9,16], we take in the INFI model an initial state of 10 fixed red nodes that produce positive activation response of fibrosis on growth factor- β (TGF- β ) considering them as fixed 10 red protein nodes. We also consider 6 (or 10, 14) fixed blue protein nodes with negative response (see details in next Section). As a result of fibrosis progression via asynchronous Monte Carlo iterations we obtain the steady-state of affected and healthy protein nodes with about 60–80 percents of proteins affected by fibrosis. This corresponds to a strong fibrosis activation resolution of a large part of the system.
To reduce this strong fibrosis activation we develop an efficient barrage approach based on the analysis of Erdös proteins linked with fixed red nodes. We show that this Erdös barrage constructed in a clever manner allows to obtain a striking reduction of fibrosis activated resolution nodes by a factor of 300 so that almost all protein nodes are transformed in the healthy phase. We hope that the barrage strategy developed in this work will find applications in clinical experiments with fibrosis progression.
The paper constructing as follows: Section 2 describes the data sets, construction of Markov chain transitions, Google matrix and the INFI model, Section 3 presents the obtained results and Discussion and conclusion are given in Section 4. In the Appendix A we provide additional data and Figures.

2. Data Sets and INFI Model Description

2.1. Network Data Sets

In this work we use the same global PPI MetaCore network as in [16]. It contains N = 40 , 079 nodes with N = 292 , 191 links (without self connections which existed in [15]). The number of directed activation/inhibition links is N + / N = 65 , 157 / 49 , 321 1.3 and the number of neutral directed links is N n = N N + N = 177 , 713 . Here we do not take into account the bi-functional activation/inhibition nature of links. Thus we simply have a directed network with N nodes and number of directed links being N .
For convenience, we present in Table 1, taken from [16], 54 selected fibrosis proteins (nodes). These nodes are composed with 4 TGF- β proteins/nodes ( K t = 1 , 2 , 3 , 4 ), 20 “up-proteins” ( K u = 1 , , 20 ), 20 “down-proteins” ( K d = 1 , , 20 ), both obtained from experiments [9] (as described above) and 10 new “X-proteins” (or “X-nodes”; K x = 1 , , 10 ) which have a significant influence on the other 44 nodes. The 4 TGF- β nodes correspond to different isoforms of this protein. As in [16] we have 4 groups of proteins and we use a specific index for each group: TGF- β proteins with index K t = 1 , 2 , 3 , 4 ; up-proteins with a strongest positive response noted by index K u = 1 , , 20 (ordered by the positive response with the strongest response for K u = 1 ); down-proteins with a strongest negative response noted by index K d = 1 , , 20 (ordered by the modulus of negative response with the strongest response modulus for K d = 1 ); external proteins noted by index K x ordered by their local PageRank index (strongest PageRank probability of these 10 proteins is at K x = 1 ; see more detail below). All these 54 proteins have their global index K g = 1 , , 54 used in Table 1.

2.2. Without Formulas: Methods, Characteristics and Expected Network Results

As in [16] here we present qualitative explanations without formulas of the mathematical methods and characteristics described in the next Subsections. Our aim here is to give a global view on our approach for a common reader.
We use the MetaCore directed network [12] which represents an action of a protein A on a protein B in a form of a directed link (edge) for N = 40 , 079 proteins forming the network nodes (proteins). These links are obtained on the basis of careful and detailed analysis of scientific literature about thousands of experiments of various research groups that allowed to collect information about PPI and thus generated a network database with N = 40 , 079 nodes and N = 292 , 191 links. Certain medical applications of the MetaCore network can be found in [13,14].
The universal mathematical methods to analyze such networks are generic and based on the concept of Markov chains and Google matrix [17,18,19]. The validity of these methods has been confirmed for various directed networks from various fields of science. Therefore, since the Google matrix analysis is based on a generic mathematical foundation, we expect that this analysis will also work efficiently for PPI networks.
The Google matrix of the global MetaCore PPI network G is constructed with specific rules briefly described in the next Section 2.3 and in detail in [17,18,19]. The matrix G is obtained from a matrix of Markov chain transition elements S i j that give weights of transitions between nodes. The important property of G is that its application (multiplication) to an arbitrary initial vector v preserves the probability and the normalization of this vector (sum of all vector elements) remains constant (taken to be unity). As a result of multiple multiplications by G any initial vector converges to the steady-state distribution given by the PageRank vector P. The components of this vector represent the probabilities of each node (protein) in this limit. The nodes with the highest probabilities are the most influential nodes of the network (all nodes are monotonically ordered by decreasing values of the PageRank components which provides the “PageRank index” K such K ( j ) = 1 , 2 , for nodes j with largest values P ( j ) ). These nodes have typically many ingoing links and it is likely that some of these ingoing links come from other nodes that also have large PageRank values.
For the inverse network, in which all link directions are inverted, the corresponding PageRank is called CheiRank vector P * [19]. The highest probabilities P * ( j ) are for nodes j with the CheiRank index K * ( j ) = 1 , 2 , being the most communicative nodes. They typically have many outgoing links.
In the INFI model Ising spins with polarization ± 1 (red or blue) or 0 (white) are placed at each node of network. They describe fibrosis activated protein state (red or σ j = + 1 ) which can activate other proteins, healthy protein state (blue or σ j = 1 ) which can prepare other proteins and neutral state proteins (white or σ j = 0 ) that cannot affect other proteins but can be activated or repaired by red or blue proteins. The initial spin configuration has certain fixed red proteins with index K g = 1 , , 10 (see Table 1) and fixed blue proteins with index K g = 25 , , 30 (or 34 or 38) whose colors always remain unchanged, all other proteins are white (or σ j = 0 ). These white proteins, and also red or blue non-fixed proteins, can change their color depending on the majority color of nodes directly linked to it (taking into account the weights of ingoing and outgoing links of the matrix S i j taken without dangling nodes). This choice is similar to social relations when a member of society takes his opinion to be imposed by opinions of other members related with him. An asynchronous Monte Carlo process performs multiple changes of spins leading to the convergence to a steady-state polarization of all spins. The fraction of red spins f r determines if the fibrosis progression affected a major part of network or not. We give the mathematical definitions of the INFI model in next Subsections.

2.3. Markov Chains, Google Matrix, PageRank and CheiRank

In this section we remind some basic mathematical definitions of Markov chains and how the Google matrix and the related PageRank and CheiRank vectors for the MetaCore network are constructed. For this we use a simple version of the MetaCore network where initial links are not weighted (see Refs. [15,16] for some details about a more refined version with bi-functional links). First, one introduces the adjacency matrix with elements A i j = 1 if a node j points to another node i (different from j) and A i j = 0 if there is no link from j to i or if i = j (we use the same convention as in Ref. [16] where all diagonal elements are chosen to be A j j = 0 even if there is a link from a node j to itself). Then we define a matrix S ˜ by S ˜ i j = A i j / l A l j if l A l j 0 and S ˜ i j = 0 otherwise (i.e., for columns j where all elements A l j are 0). This matrix does not yet describe a Markov process since it has potentially some zero columns j (such nodes j are also called dangling nodes). However, in this work we will use a Monte Carlo process based on this particular matrix (see next section).
To obtain a proper stochastic matrix S one replaces S i j = 1 / N for the dangling nodes j and S i j = S ˜ i j for the other nodes (dangling nodes have no outgoing links so that a column of dangling node j has only zero elements in the matrix S ˜ i j ). Here S i j represents the transition probability for a random surfer from node j to i and the column sum normalization i S i j = 1 ensures the conservation of probability.
The Google matrix elements G i j are typically defined by
G i j = α S i j + ( 1 α ) / N
where α = 0.85 is the usual damping factor [17,18]. The Google matrix is also a proper stochastic matrix (column sum normalized) and here the random surfer jumps with probability α on the network according to S and with a probability ( 1 α ) to an arbitrary random node of the network. The damping factor modification helps to avoid possible isolated communities and ensures that the Markov process converges for long times rather quickly to a uniform stationary probability distribution. The latter defines the PageRank vector P which is actually the right eigenvector of the Google matrix G corresponding to the leading eigenvalue λ = 1 , i.e., G P = P . The elements P ( j ) of the PageRank vector correspond to the probability to find the random surfer on the node j in the stationary limit of the Markov process. The PageRank index K ( j ) is obtained by ordering the nodes with decreasing values of P ( j ) , i.e., the highest (lowest) PageRank probability P ( j ) corresponds to K ( j ) = 1 ( K ( j ) = N ). In this work we will use the K-rank of a node to characterize this node in a unique way. The PageRank probability P ( j ) is typically related to the number of ingoing links pointing to node j. However, it also takes into account the “importance” (i.e., PageRank probability) of the nodes having a direct link to j.
One can also consider the network obtained by the inversion of all link directions. The same construction as described above provides a Google matrix noted as G * and the corresponding PageRank vector is called the CheiRank vector P * , defined by G * P * = P * . The CheiRank probability P * ( j ) is typically related to the number of outgoing links weighted by the value P * ( i ) of nodes i having a link from j to i. The CheiRank index K * ( j ) is obtained by ordering the CheiRank vector with decreasing values of P * ( j ) .
For further details about the properties of Google matrix, PageRank and CheiRank vectors, with many different example networks (also with proteins networks), we refer to [15,16,18,19].

2.4. Ising Spin Network, Monte Carlo Process for INFI Model

In this work, we consider an Ising type of model using the symmetrized matrix S ˜ + S ˜ T to model the effective spin interactions. We denote by σ i the spin associated to the node (protein) i but here we allow for three different values being σ i = + 1 (“red state”, “fibrosis state”), σ i = 1 (“blue state”, “healthy state”) and also σ = 0 (“white state”, “undetermined or neutral state”). We study this model using a specific Monte Carlo process similar to [29]. The white state will only be used in the initial condition and is supposed to be only temporary.
First, we fix 10 specific nodes permanently to the red state, those with K g = 1 , , K g = 10 in the set of Table 1 corresponding to the four TGF proteins and the first 6 K u -proteins with strongest positive response on the TGF stimulation [16]. We also fix n b nodes permanently to the blue state being the nodes with K g = 25 , , K g = 24 + n b corresponding to the first n b   K d -proteins with strongest negative TGF-response. Here we choose mostly n b = 6 (same number as the 6 K u proteins with fixed red state) but we also present some data for the cases n g = 10 (same total number of fixed red proteins) and n g = 14 . Therefore 10 + n b nodes have already a permanently fixed spin value σ i = ± 1 .
We implement a Monte Carlo procedure that modifies the other N v = N 10 n b remaining nodes with “variable” spin values (non fixed). For this at given iteration time τ , we update the N v variable nodes by computing the sum over all nodes j of the network (fixed and variable):
Z i = j ( i ) ( S ˜ i j + S ˜ j i ) σ j
and choosing σ i ( τ + 1 ) = + 1 if Z i > 0 , σ i ( τ + 1 ) = 1 if Z i < 0 and σ i ( τ + 1 ) = σ i ( τ ) (unchanged) if Z i = 0 . Without going into technical details, we mention that the numerical implementation of (2) exploits of course the sparse structure of the matrix S ˜ i j . This update is done for every node i in the set of N v variable nodes in a random order using a random permutation (thus no repetitions) which is chosen at the beginning of the iteration process. Once a spin value for a node i has been updated, the new value σ i ( τ + 1 ) will be used for the update of the subsequent nodes i ˜ . We note that somewhat similar asynchronous Monte Carlo process what used for bio-networks in [32,33] but there the matrix elements S ˜ i j + S ˜ j i had values ± 1 and network sizes were relatively small (about 100).
This full update procedure for one time step and the full set of N v variable nodes is repeated up to the iteration time τ max = 100 (with different random permutations for the update order at each time step). We also verify if at any given value of τ there is ideal convergence where the spins no longer change. (This happens typically at τ l a s t 10 ; see below for details).
The full iteration procedure with τ max = 100 time steps is repeated R = 100 , 000 times with different random permutations and eventually also different random positions of initial blue or white nodes (depending on the precise type of initial condition).
During this procedure, we compute the fraction of red outcome f r ( i ) for each node as the number of times the node i has red state at τ max (at different realizations) divided over R and also the overall average f r = i f r ( i ) / N v over the nearly full network of variable nodes. To test the convergence, we also compute the overall average at intermediate values of τ .
In order to test the procedure and also in the direct influence of the permanently fixed nodes, we first choose an initial condition where all N v variable nodes are initialized to the white state. Figure A1 illustrates for this case the convergence of (the overall network average) f r ( τ ) with iteration time τ . The convergence seems to be very good at τ > 5 with nearly constant values on graphical precision. However, a closer look at the data shows that | f r ( τ + 1 ) f r ( τ ) | 10 5 for τ = 10 and the value of f r ( τ ) becomes constant only at τ τ l a s t 70 with typical last non-vanishing differences | f r ( τ l a s t 1 ) f ( τ l a s t ) | 10 10 and | f r ( τ l a s t + 1 ) f ( τ l a s t ) | = 0 . We have verified that this behavior is due to the fact that typically at τ 10 there is exact convergence for a given realization of the random permutations (random pathways) with no further modifications of the spins at τ > 10 . However, there are rare realizations with larger values, up to τ = τ l a s t 70 for exact convergence.
We stress that the approach of INFI model described above is based on physics of interacting Ising spins placed at each node of MetaCore network so that the configuration space of the system becomes 2 N instead of size N used in the Google matrix methods. Thus the INFI model provides a qualitatively new approach for analysis of PPI networks. For Wikipedia networks a similar approach was used in [29]. We state that this method previously was never used for analysis of PPI networks.
In the next section, we present the results for different quantities and also other initial conditions.

3. Results

3.1. Numerical Results

All results presented in this work were obtained with maximal iteration time τ m a x = 100 and R = 100 , 000 realizations for (potential) different initial conditions and random permutations for the Monte Carlo procedure. Also when we describe a particular initial condition it is implicitly understood that the permanently fixed 10 red and n b blue nodes given above are indeed fixed to these values from the very beginning, i.e., if we say “that there are n i b initial blue values” (with n i b = 0 , 1 , ) we mean that there are n i b blue nodes on the set of N v variable (non-fixed) nodes, other variable nodes are initialized to white values and the fixed nodes have still their 10 red and n b blue values (in particular the value of n i b does not include the number of n b fixed blue nodes).
In Figure 1, we show for the case of white or zero initial spins (for the N v variable nodes; i.e., n i b = 0 ) and the three cases of n b = 6 , 10 , 14 initial blue fixed nodes (and 10 fixed nodes as explained above) the probability distribution (normalized by an integral) of fraction of red outcomes f r ( i ) for the nodes obtained by a histogram with bin width Δ f r = 0.01 . These distributions are strongly peaked at values close to f r , p e a k 0.93 ( n b = 6 ), f r , p e a k 0.81 ( n b = 10 ), f r , p e a k 0.65 ( n b = 14 ) for roughly 82% of nodes. The secondary peak at f r = 0 corresponds to the fraction (number) 0.18 (7230) of nodes (same value for the three cases of n b ) which stay white after 100 iterations for all R = 100 , 000 pathway realizations. This set of stable white nodes correspond to nodes not connected to the small number of fixed red and blue nodes. However, despite the small value of only 10 initial fixed red notes the majority of the other nodes has a red outcome, especially for n b = 6 and with slightly reduced values for n b = 10 , 14 . Note that the bin width Δ f r = 0.01 in Figure 1 is still the rather large and histogram computations with Δ f r = 0.001 and Δ f r = 0.0001 show that the actual peaks of the distributions are much sharper as visible in Figure 1.
It seems that with the white initial condition (except for the fixed nodes) nearly all nodes (except those in the stable white set) have a large probability for red outcome. Therefore, we try to reduce this red outcome by choosing a certain number n i b of initial blue (variable) nodes and other nodes with white initial values. The question is also where to place these blue nodes. In a first model we choose the n i b blue nodes at different random positions (for each of the R = 100 , 000 pathway realizations) on the full set of variable nodes giving a set of N r = N v of possible blue initial nodes with possible values n i b = 0 , , N r and a corresponding fraction f i b = n i b / N r . Figure 2 shows for the case n b = 6 (the results for n b = 10 and n b = 14 are very similar) the overall network average f r at maximum iteration time as a function of f i b (or n i b ). The initial value f r 0.75855 at n i b = 0 corresponds to the average which can also be obtained from the histogram data shown in Figure 1 which is roughly 0 × 0.18 + 0.93 × 0.82 0.76 . With increasing f i b (or n i b ) the value of f r decreases, e.g., f r 0.1 for n i b 60 and f r 10 2 for n i b 100 . However, this decrease is not optimal since the initial n i b blue nodes can be on arbitrary random positions of available variable nodes. Therefore, we also try different network subsets for the potential initial blue nodes.
In Figure 3, we show the dependence of f r on f i b (or n i b ) for a reduced subset of N r = 38 nodes corresponding to the 38 proteins of Table 1 which are not used for fixed red/blue values, i.e., K g = 11 , , K g = 24 and K g = 31 , , K g = 54 . Now, n i b represents the number of initial blue nodes on random positions in this subset (and white initial nodes on every other variable node). Now, the decrease of f r with n i b seems somewhat stronger, i.e., f r 0.1 for n i b 8 and f r 10 2 for n i b 16 , and we may conclude that this subset is more effective to reduce the red outcome (“to block fibrosis”) than the full set of available variable nodes.
To determine a still more effective subset, we determine all nodes which have a direct link or inverse link to one of the 10 fixed red nodes (those with K g = 1 , , K g = 10 in Table 1). This defines a particular set, of size N E = 353 , with Erdös number being unity with respect to the 10 fixed red nodes as HUB and using the symmetrized link matrix S ˜ i j + S ˜ j i . Figure 4 shows the dependence of f r on f i b (or n i b ) for this Erdös set with N r = N E . Now the decrease is even more effective with f r 0.1 for n i b 6 and f r 10 2 for n i b 14 .
In Figure 5, we show in a color plot the dependence of f r ( i ) for the 54 nodes i belonging to the set of Table 1 on the index n g which is a monotonic function of n i b (essentially linear for n i b 10 and logarithmic for n i b > 10 ; see caption of Figure 5 and Appendix Figure A1 for details). The data of Figure 5 correspond to the data of Figure 4 using the Erdös set with N E = 353 as potential initial blue nodes (i.e., for each value of n i b { 0 , , 353 } we have n i b initial blue nodes at random positions in this set). In Figure 5, we can of course identify the 10 fixed red nodes ( K g = 1 , , 10 ) and the n b = 6 fixed blue nodes ( K g = 25 , , 30 ) with either f r ( i ) = 1 or f r ( i ) = 0 respectively for all values of n g . The other nodes follow quite closely the decrease of Figure 4 for the global average of f r . However, for certain specific nodes ( K g = 17 , 23 , 36 , 38 , 41 , 44 ) the decay of f r with n g (or n i b ) is less pronounced for n g > 10 . Apparently, these nodes are less likely to be blocked by a modest number of initial blue nodes.
We have also analyzed the data of Figure 2 (using all N v variable nodes as potential initial blue nodes) and Figure 3 (using the remaining set of 38 non-fixed nodes of Table 1 as potential initial blue nodes) with similar color plots and in both cases we observe the same qualitative behavior as in Figure 5: identification of fixed red/blue nodes, similar decrease of f r ( i ) with increasing n i b for the other nodes and less pronounced decrease for the 6 specific nodes mentioned above.
The question arises which of the nodes of the Erdös set, or more generally, which configurations of few selected nodes of this set, are most effective to reduce the red outcome when selected as initial blue nodes. To answer this question, we compute for each node i of the Erdös set the red outcome f r c ( i ) averaged over the full network when this specific node i is selected as single initial blue node, i.e., with n i b = 1 but now with different cases of given fixed positions (instead of random positions). Note that this quantity is different from f r ( i ) used in the histogram of Figure 1 which is the probability of red outcome of node i (not averaged over the network) with full white initial condition. The nodes of the Erdös set can be ordered with increasing values of the new quantity f r c ( i ) which provides a specific ranking index K f r in this set. In Table 2, we present in K f r -order the nodes (proteins) of the Erdös set which have either K 40 or K * 40 , either low K- or K * -rank. It turns out these nodes also provide the lowest values of f r c and K f r (with some holes for K f r > 17 ).
Then we choose as example four “optimal” nodes with K = 3 , 4 , 17 , 342 (marked with an asterix in the first column of Table 2). The nodes K = 4 , 17 , 342 occupy indeed the top three places in the K f r -rank while the node K = 3 “only” corresponds to the position K f r = 11 . The reason for this choice is related to the fact that these four nodes are more uniformly optimal if we also choose a small number of n i b = 2 , , 10 initial blue nodes. In this case, using the data of Figure 4 (or the code to produce these data) it is actually possible to compute the conditional probability f r c ( i ) of a red outcome when the node i is by chance selected by the random initial condition for n i b > 1 (as one of the initial n i b blue nodes). For example for n i b = 4 with R = 100 , 000 different random initial conditions of 4 blues nodes out of 353 we have typically a bit more than 1000 realizations where an arbitrary fixed node i belongs to the random set of 4 initial nodes. This provides enough data for a reasonable average to compute the conditional probability of red outcome of the node i. In this way, it is possible to compute more general K f r -rankings as in Table 2, also for modest values of n i b > 1 . It turns out that these rankings produce roughly the same sets of nodes in the first places (with possible permutations between different n i b -values) and the four selected nodes K = 3 , 4 , 17 , 342 are indeed optimal as a group for the three values n i b = 2 , 3 , 4 (and also some larger values).
In Figure 6, we present (for n b = 6 ) the dependence of f r on n i for this small optimal set of 4 possible initial blue nodes (red square data points). Here the decrease of f r with increasing n i is indeed very strong with f r 0.035 for n i b = 1 and f r 0.00318 for n i b = 4 . Furthermore, Figure 6 also provides the conditional probabilities of red outcome f r c ( [ K 1 , , K n i b ] (black small circle data points) for specific configurations [ K 1 , , K n i b ] with one configuration for n i b = 0 , 4 , four configurations for n i b = 1 , 3 and six configurations for n i b = 2 . We see that there are considerable fluctuations between the different configurations in the red outcome for n i b = 1 and n i b = 2 with optimal values being f r c ( [ 17 ] ) 0.009477 and f r c ( [ 3 , 17 ] ) 0.004863 . Appendix Figure A3 provides two panels of similar figures for the other cases n b = 10 and n b = 14 .
Figure 7 is a similar color plot as Figure 5 but for the specific set of 4 optimal nodes [ 3 , 4 , 17 , 342 ] . The conclusions are similar to Figure 5 (confirmation of fixed red/blue nodes, strong decrease with increasing n i b for other nodes and certain specific nodes with less pronounced decrease.
In Figure 8, we show the 353 nodes of the Erdös set in the global K- K * -plane (in a double logarithmic representation) with colored data points such that the color provides the value of f r c ( K ) for each node at given K-value (obtained at n i b = 1 with the corresponding node as initial blue node). Optimal nodes with low red outcome (blue color) have typically small values in their K- and K * -rank and nodes with large red outcome (red color) have typically large values in their K- and K * -rank. Furthermore, Figure 8 also provides the positions of the 10 fixed red and the 6 blue nodes in the global K- K * -plane with typically quite large values for K and K * . Using the same data Appendix Figure A4 shows the dependence of f r c (at n i b = 1 ) on the rank K f r . Obviously, this curve is monotonically increasing and the range with small values of f r c is rather small, i.e., f r c < 0.1 corresponds to K f r 11 (see also Table 2). Note that the average of the curve in Figure A4 corresponds to the data point at n i b = 1 in Figure 4 which is f r ( n i b = 1 ) 0.5818 .
We have also analyzed the statistical properties of the individual f r ( i ) values, same quantity as in Figure 4, but using the data of Figure 6 for n i b = 4 with the initial blue node configuration [ 3 , 4 , 17 , 342 ] and global (network averaged) value f r 0.00318 . For example Appendix Figure A5 shows the 100 nodes with largest values of this quantity in the K- K * plane. Obviously, among the 26 nodes with f r ( i ) = 1 we have the 10 permanently fixed red nodes ( K g = 1 , , 10 in Table 1) but there are also further 16 nodes with f r ( i ) = 1 probably with (mostly) exclusive links to the fixed 10 red nodes, i.e., no or few links to other nodes, therefore explaining the fixed outcome f r ( i ) = 1 . There are about 1% (4%) of network nodes with values f r ( i ) > 0.5 ( f r ( i ) > 0.08850 ).
Furthermore, Appendix Figure A6 shows the histogram distribution p ( f r ) for the same data with a rapid decay at f i < 1 and a small peak at f r = 1 corresponding to the 26 nodes i with f r ( i ) = 1 . We have also computed the related global probability P ( f r ) for a node i to have a value f r ( i ) > f r . This quantity is shown in Appendix Figure A7, confirming the rapid decay which is quite well algebraic as P ( f r ) f r 1.5 (for f r 5 × 10 3 ) and which is similar to the Poincare recurrences decay in symplectic chaotic maps with Ulam networks [34].
The slow algebraic decay of P ( f r ) has important consequences. It shows that the average values of f r shown in Figure 6 even underestimate the effect. Explicitely, the average value of f r 0.00318 for n i b = 4 in Figure 6 corresponds, according to Appendix Figure A7, to a fraction of nodes P ( 0.00318 ) 0.22 having an f r ( i ) value above this average and 78% of nodes have a smaller f r ( i ) value. It is well known, that for such long tail distributions one should also focus on the median value f r , m e d i a n defined by P ( f r , m e d i a n ) = 0.5 corresponding in our case to f r , m e d i a n = 10 5 (only 1 red outcome in the R = 100 , 000 pathway realizations) indicating that (slightly more than) 50% of nodes have an f r ( i ) value below or equal to the median value with nearly perfect reduction of the red outcome.

3.2. Results Without Formulas

In this work we presented a mathematical model of fibrosis progression in the PPI MetaCore network describing the global interaction structure of almost all proteins and important molecules (nodes). We show that even with only 10 fibrosis activated proteins the fibrosis progression can spread over a great majority of nodes (about 70%). The developed analysis of network structure allows us to propose an efficient strategy which allows to reduce a number of fibrosis activated nodes by a factor 300 and disease elimination. The method is based on the Erdös barrage construction: we determine the Erdös nodes directly linked to the fixed 10 activated red nodes; this number can be relatively large (353 in our case); however, we show that a barrage with only 4 blue repairing Erdös nodes, corresponding to the four proteins c-Myc, p53, c-Fos and β -catenin, gives a reduction of the average number of fibrosis activated nodes by a factor 300; these 4 nodes belong to network nodes with high PageRank and CheiRank indexes of global MetaCore network.
Furthermore, this average actually underestimates the effect since it is determined by a relative small number of nodes with a modest reduction (e.g., factor of 100 ) while for more than 50% of nodes the reduction factor is even 100,000 (only 1 infection outcome in the 100,000 statistical realizations of our simulation). We have also identified in the group of Table 1 six interesting proteins HTR2B, ACSBG1, LGI2, ADORA2A, IL1R2-2 and COX4I2 (corresponding to the vertical green lines in Figure 5 and Figure 7) where the reduction effect is somewhat less pronounced (compared to the 50% of nodes with nearly perfect reduction). We expect that our INFI model can be tested with other PPI networks (e.g., those of [35,36]).

4. Discussion and Conclusions

Decoding fibrosis-associated proteins and fibrotic remodeling progression is a critical issue in treating heart failure. An experimental determination of the fibrosis resolution is extremely time consuming and dynamic process. In this work we present and describe a mathematical model of the fibrosis progression using the global PPI MetaCore network. The performed analysis shows that in TGFβ-mediated fibroblast activation, PPI hubs predict new regulatory proteins for fibrosis progression. We developed an efficient method of the Erdos barrage to identify the proteins driving fibrosis activation cascades in a physiopathological scenario using MetaCore network.
In cardiac tissue, resident cardiac fibroblasts respond to stress or injury by evoluting through phases determined by a inflammatory motifs, a proliferative capacity, and finally, a reparative status, characterized by the deposition of ECM that culminates in fibrosis. The molecular mechanisms orchestrating the dynamic control of the fibroblast phenotype remain obscure, hampering the development of therapeutic strategies to combat cardiac fibrosis. In our study, we found that c-Myc, p53, c-Fos and Androgen receptor arbitrate cardiac fibroblast activation in cardiac remodeling. The gene encoding the c-MYC is the most frequently perturbed transcription factor in the majority of human cancers, with its activation of downstream target genes in the vast majority of the cancer types [37]. Dysfunction of c-MYC can occur through elevated transcription of the gene, amplification or protein stabilization, all of which release the otherwise tight control and altered expression levels of the protein in the cell [38]. nhibition of c-MYC has been observed to favor apoptosis, growth arrest, differentiation, senescence, metabolic changes and tumour regression in several human cancer models, clearly reflecting its potential as a target for anti-cancer therapy [39]. In activated cardiac fibroblasts, p53 transcriptional targets are among the most deregulated gene programs in pressure overload-induced tissue remodeling [40]. In cancerous tumors, p53 is an important component of the DNA damage and transcriptional regulator of cell cycle [41]. Interestingly, p53-sensitive cellular senescence has also been linked to myocardial remodeling in animal models ou tissue fibrosis [42]. Indeed, over expression of the p53 target gene in transgenic mice inhibits fibroblast activation and reduces scar formation after myocardial infarction [43]. An important factor C-fos is a member of a family of early genes which are involved in the signal transduction cascade coupling extracellular stimuli to intracellular events [44]. In particular, C-fos is critical to the regulation of transcription, and may mediate long-term effects of growth factors and membrane-depolarising signals on neural responses to stress [45]. Androgen receptor (AR) signalling in fibroblasts is important in carcinogenesis and prostate development [46]. However, the molecular mechanisms of AR action in fibroblasts and other non-cardiacl cell types are largely unknown.
In the past few years, the receptors and signal transduction pathways mediating the effects of TGF- β on fibroblasts have been identified, enabling the dissection of the specific pathways involved in pathogenic events. TGF- β type I and type II transmembrane receptor serine/threonine kinases transduce downstream signals via novel cytoplasmic latent transcription factors, Smad proteins [47]. Both Smad2 and Smad3 are phosphorylated directly by the type I receptor kinase and able to translocate to the nucleus, where they act as transcriptional regulators of target genes controlling cell apoptosis and differentiation [48]. Interestingly, deletion of Smad3 results primarily in impaired status of immunity in mice and shortening their lifespan [49]. The Smad pathway is believed to be the major signaling mechanism through which active TGF- β stimulates the induction of profibrotic cascades.
In a profibrotic scanario, TGF- β -activated kinase 1 (TAK1) is one of the best characterized non-Smad signal transducers critical for TGF- β functions in EMT and apoptosis through activating the c-Jun N-terminal kinase (JNK) and p38 MAPK cascade [50]. TAK1 also plays an essential role in mediating TGF- β activation of I-kappa B kinase (IKK) and the master transcription factor nuclear factor kappa B (NF-kB) that is required for cell survival [51]. In analogy to the mechanism defined in interleukin-1/Toll-like receptor pathways, TGF- β -induced activation of TAK1 requires TRAF6, a RING domain ubiquitin ligase that itself is modified by a K63-linked polyubiquitin chain, which acts as a scaffold to recruit TAK1 to the TGF- β receptor complex and triggers TAK1 activation [50]. Activity of TAK1 is also regulated by its binding proteins, including TAK1-binding protein 1 (TAB1) that binds constitutively the kinase domain and TAB2 or TAB3 that binds the C-terminal domain and functions as an adaptor linking TRAF6 to TAK1 [52]. However, how TGF- β coordinates TAK1 kinase to dictate the opposing responses of cell survival and apoptosis in different cellular scenarios.
Thus the PPI network leads to a very complex system properties. In view of that we hope that the proposed INFI model may lead to better understading of PPI systems.
The presented method of INFI model is based on mathematical properties of PPI networks and thus it is generic and universal. It can be applied to any other disease progression on PPI network and construction of Erdös barrage for disease propagation. As initial income for the application of this method a medical researcher should provide positive and negative responses on certain protein stimulation similar to those of Table 1 with TGF- β stimulation of fibrosis.
Thus we developed an efficient method of the Erdös barrage when a small group of e.g., 4 repairing protein can reduce the number of fibrosis activated proteins (nodes) by a factor 300 leading to a healthy state of the global system described here by the MetaCore network. We expect that similar results can be obtained for other disease progression. We think that it would be very interesting to test this INFI approach with other global PPI networks like [35,36]. We hope that the described INFI method will lead to new efficient medical treatments of fibrosis and other various diseases.

Author Contributions

Conceptualization, D.L.S.; Investigation, K.M.F., E.K. and O.K. All authors equally contributed to all stages of this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part through the grant NANOX No ANR-17-EURE-0009, (project MTDINA) in the frame of the Programme des Investissements d’Avenir, France; it was granted access to the HPC resources of CALMIP (Toulouse) under the allocation 2024-P0110; it was also supported by INSERM funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We thank L. Ermann for useful discussions.

Conflicts of Interest

Author Ekaterina Kotelnikova was employed by Clarivate Analytics. The authors declare no conflict of interest.

Appendix A. Additional Figures

Here we present additional Appendix Figure A1, Figure A2, Figure A3, Figure A4, Figure A5, Figure A6, and Figure A7 for the main part of this article.
Figure A1. Illustration of the convergence of spin-configurations for n b = 6 , 10 , 14 and n i b = 0 . Shown is average f r ( τ ) versus iteration time τ . For practical reasons the convergence seems to be quite good at τ 5 . However, the value of f r ( τ ) becomes constant only at τ τ l a s t 70 with typical last non-vanishing differences | f r ( τ l a s t 1 ) f ( τ l a s t ) | 10 10 and | f r ( τ l a s t + 1 ) f ( τ l a s t ) | = 0 . All data in this work have been computed with values up to τ = 100 to verify that exact convergence is achieved.
Figure A1. Illustration of the convergence of spin-configurations for n b = 6 , 10 , 14 and n i b = 0 . Shown is average f r ( τ ) versus iteration time τ . For practical reasons the convergence seems to be quite good at τ 5 . However, the value of f r ( τ ) becomes constant only at τ τ l a s t 70 with typical last non-vanishing differences | f r ( τ l a s t 1 ) f ( τ l a s t ) | 10 10 and | f r ( τ l a s t + 1 ) f ( τ l a s t ) | = 0 . All data in this work have been computed with values up to τ = 100 to verify that exact convergence is achieved.
Biomolecules 14 01395 g0a1
Appendix Figure A1 illustrates the convergence of the time evolution of spin configurations.
Appendix Figure A2 illustrates the link between n i b and the coarse-grained index n g used in Figure 5.
Figure A2. Dependence of n i b on the coarse-grained index n g used in Figure 5 (blue/star data points). The green line shows for comparison the linear behavior n i b = n g (exactly identical to blue line for n g 10 ) and the red line shows the exponential behavior n i b = 10 × 1 . 2 n g 10 (good approximation for n g > 10 ).
Figure A2. Dependence of n i b on the coarse-grained index n g used in Figure 5 (blue/star data points). The green line shows for comparison the linear behavior n i b = n g (exactly identical to blue line for n g 10 ) and the red line shows the exponential behavior n i b = 10 × 1 . 2 n g 10 (good approximation for n g > 10 ).
Biomolecules 14 01395 g0a2
Figure A3. As Figure 6 but for n b = 10 (top panel) and n b = 14 (bottom panel).
Figure A3. As Figure 6 but for n b = 10 (top panel) and n b = 14 (bottom panel).
Biomolecules 14 01395 g0a3
Appendix Figure A3 shows f r for optimal configurations at n b = 10 and n b = 14 (as Figure 6).
Appendix Figure A4 shows the values of f r c ( K ) (for n b = 6 ) computed from one initial (variable) blue node K belonging to the Erdös group ordered by increasing values of f r c ( K ) inside this group.
Figure A4. f r c ( K ) versus K f r ( K ) using the data of Figure 8 (i.e., n b = 6 , f r c ( K ) being computed for n i b = 1 with one single initial blue node at node K belonging to the Erdös group.) The index K f r ( K ) is the index obtained by ordering the N E = 353 nodes of the Erdös set with increasing values of f r c ( K ) .
Figure A4. f r c ( K ) versus K f r ( K ) using the data of Figure 8 (i.e., n b = 6 , f r c ( K ) being computed for n i b = 1 with one single initial blue node at node K belonging to the Erdös group.) The index K f r ( K ) is the index obtained by ordering the N E = 353 nodes of the Erdös set with increasing values of f r c ( K ) .
Biomolecules 14 01395 g0a4
Figure A5. The positions of the 100 nodes with top 100 f r ( i ) values of the (non network averaged) red outcome for individual nodes i in the K- K * plane in a zoomed double logarithmic representation (colored full circles) using the data for n i b = 4 of Figure 6 with the configuration [ 3 , 4 , 17 , 342 ] for the initial blue nodes. The color of each data point corresponds to the value of f r ( i ) according to the color bar. There are 26 nodes i with f r ( i ) = 1 including the 10 permanently fixed red nodes (black cross symbols; K g = 1 , , 10 in Table 1) and 16 additional nodes and there are 36 (100) nodes i with f r ( i ) > 0.5 ( f r ( i ) 0.08850 ).
Figure A5. The positions of the 100 nodes with top 100 f r ( i ) values of the (non network averaged) red outcome for individual nodes i in the K- K * plane in a zoomed double logarithmic representation (colored full circles) using the data for n i b = 4 of Figure 6 with the configuration [ 3 , 4 , 17 , 342 ] for the initial blue nodes. The color of each data point corresponds to the value of f r ( i ) according to the color bar. There are 26 nodes i with f r ( i ) = 1 including the 10 permanently fixed red nodes (black cross symbols; K g = 1 , , 10 in Table 1) and 16 additional nodes and there are 36 (100) nodes i with f r ( i ) > 0.5 ( f r ( i ) 0.08850 ).
Biomolecules 14 01395 g0a5
Appendix Figure A5 shows for the data for n i b = 4 of Figure 6 with the initial blue node configuration [ 3 , 4 , 7 , 342 ] the 100 nodes with top individual f r ( i ) values of the red outcome in the K- K * plane.
Appendix Figure A6 shows the distribution p ( f r ) for the data of Figure A5. The peak at f r = 1 corresponds to the 26 nodes i with f r ( i ) = 1 . The large peak at f r = 0 corresponds to 95% of probability for 0 f r 0.01 .
Figure A6. Histogram distribution p ( f r ) for the data of Figure A5 using the values f r ( i ) for all nodes with bin width Δ f r = 0.01 . The normalization is as in Figure 1 by an integral.
Figure A6. Histogram distribution p ( f r ) for the data of Figure A5 using the values f r ( i ) for all nodes with bin width Δ f r = 0.01 . The normalization is as in Figure 1 by an integral.
Biomolecules 14 01395 g0a6
Figure A7. Fraction P ( f r ) of nodes i with f r ( i ) f r versus f r (red curve) for the data of Figure A5 and Figure A6. The presentation is double logarithmic except for the data points at f r = 0 which have artificially been added at a finite value below 10 5 and the 26 nodes i with f r ( i ) = 1 have been excluded in the computation of P ( f r ) . The steps/vertical lines at f i = 0 ( f i = 10 5 ) correspond to 7539 (13,696) network nodes i having 0 (1) case(s) of red outcome out of the R = 100 , 000 random pathway realizations (note that there are 7230 nodes who have always white outcome). The green straight line corresponds to the power law fit P ( f r ) a f r b with a = ( 6.07 ± 0.03 ) × 10 5 and b = 1.457 ± 0.001 obtained for the interval f i [ 5 × 10 3 , 4 × 10 2 ] .
Figure A7. Fraction P ( f r ) of nodes i with f r ( i ) f r versus f r (red curve) for the data of Figure A5 and Figure A6. The presentation is double logarithmic except for the data points at f r = 0 which have artificially been added at a finite value below 10 5 and the 26 nodes i with f r ( i ) = 1 have been excluded in the computation of P ( f r ) . The steps/vertical lines at f i = 0 ( f i = 10 5 ) correspond to 7539 (13,696) network nodes i having 0 (1) case(s) of red outcome out of the R = 100 , 000 random pathway realizations (note that there are 7230 nodes who have always white outcome). The green straight line corresponds to the power law fit P ( f r ) a f r b with a = ( 6.07 ± 0.03 ) × 10 5 and b = 1.457 ± 0.001 obtained for the interval f i [ 5 × 10 3 , 4 × 10 2 ] .
Biomolecules 14 01395 g0a7
Appendix Figure A7 shows the probability P ( f r ) for a node i to have a value f r ( r ) f r for the data of Figure A5 and Figure A6. The decay of P ( f r ) is rather well algebraic as P ( f r ) 6.07 × 10 5 f r 1.46 for f r 5 × 10 3 if the 26 nodes i with f r ( i ) = 1 are excluded in the computation of P ( f r ) . (If these nodes are included the decay is still algebraic in the fit interval f i [ 5 × 10 3 , 4 × 10 2 ] but P ( f r ) is significantly above the power law for f i > 4 × 10 2 .) Note that P ( f r ) corresponds to the integrated probability P ( f r ) = f r 1 p ( f ˜ r ) d f ˜ r using the probability density p ( f r ) visible in Appendix Figure A6 (apart from the numerical approximation due to the histogram with finite bin width).

References

  1. Murtha, L.A.; Schuliga, M.J.; Mabotuwana, N.S.; Hardy, S.A.; Waters, D.W.; Burgess, J.K.; Knight, D.A.; Boyle, A.J. The processes and mechanisms of cardiac and pulmonary fibrosis. Front. Physiol. 2017, 8, 777. [Google Scholar] [CrossRef] [PubMed]
  2. Liu, T.; Song, D.; Dong, J.; Zhu, P.; Liu, J.; Liu, W.; Ma, X.; Zhao, L.; Ling, S. Current understanding of the pathophysiology of myocardial fibrosis and its quantitative assessment in heart failure. Front. Physiol. 2017, 8, 238. [Google Scholar] [CrossRef] [PubMed]
  3. Meng, X.-M.; Nikolic-Paterson, D.J.; Lan, H.Y. TGF-β: The master regulator of fibrosis. Nat. Rev. Nephrol. 2016, 12, 325–338. [Google Scholar] [CrossRef] [PubMed]
  4. Wynn, T.A. Cellular and molecular mechanisms of fibrosis. J. Pathol. 2008, 214, 199–210. [Google Scholar] [CrossRef]
  5. Miyazawa, K.; Itoh, Y.; Fu, H.; Miyazono, K. Receptor-activated transcription factors and beyond: Multiple modes of Smad2/3-dependent transmission of TGF-β signaling. J. Biol. Chem. 2024, 300, 107256. [Google Scholar] [CrossRef]
  6. Bujak, M.; Ren, G.; Kweon, H.J.; Dobaczewski, M.; Reddy, A.; Taffet, G.; Wang, X.-F.; Frangogiannis, N.G. Essential Role of Smad3 in Infarct Healing and in the Pathogenesis of Cardiac Remodeling. Circulation 2007, 116, 2127–2138. [Google Scholar] [CrossRef]
  7. Millet, C.; Zhang, Y.E. Roles of Smad3 in TGF-β Signaling During Carcinogenesis. Crit. Rev. Eukaryot. Gene Expr. 2007, 17, 281–293. [Google Scholar] [CrossRef]
  8. van Caam, A.; Madej, W.; Garcia de Vinuesa, A.; Goumans, M.-J.; ten Dijke, P.; Davidson, E.B.; van der Kraan, P. TGFβ1-induced SMAD2/3 and SMAD1/5 phosphorylation are both ALK5-kinase-dependent in primary chondrocytes and mediated by TAK1 kinase activity. Arthritis Res. Ther. 2017, 19, 112. [Google Scholar] [CrossRef]
  9. Pintus, S.S.; Sharipov, R.N.; Kel, A.; Timotin, A.; Keita, S.; Martinelli, I.; Boal, F.; Tronchere, H.; Kolpakov, F.; Kunduzova, O. Drug Repositioning for Cardiac Fibrosis Through Molecular Signature of Aberrant Fibroblast Activation. INSERM Prepr. 2021; to be published. Available online: https://www.quantware.ups-tlse.fr/QWLIB/fibrosisIsingnetwork/INSERMpreprint2021.pdf (accessed on 8 October 2024).
  10. Karimizadeh, E.; Sharifi-Zarchi, A.; Nikaein, H.; Salehi, S.; Salamatian, B.; Elmi, N.; Gharibdoost, F.; Mahmoudi, M. Analysis of gene expression profiles and protein-protein interaction networks in multiple tissues of systemic sclerosis. BMC Med. Genom. 2019, 12, 199. [Google Scholar] [CrossRef]
  11. Pchejetski, D.; Foussal, C.; Alfarano, C.; Lairez, O.; Calise, D.; Guilbeau-Frugier, C.; Schaak, S.; Seguelas, M.H.; Wanecq, E.; Valet, P.; et al. Apelin prevents cardiac fibroblast activation and collagen production through inhibition of sphingosine kinase 1. Eur. Heart J. 2012, 33, 2360–2369. [Google Scholar] [CrossRef]
  12. MetaCore. Available online: https://clarivate.com/cortellis/solutions/early-research-intelligence-solutions/ (accessed on 6 September 2024).
  13. Ekins, S.; Bugrim, A.; Brovold, L.; Kirillov, E.; Nikolsky, Y.; Rakhmatulin, E.; Sorokina, S.; Ryabov, A.; Serebryiskaya, T.; Melnikov, A.; et al. Algorithms for network analysis in systems-ADME/Tox using the MetaCore and MetaDrug platforms. Xenobiotica 2006, 36, 877–901. [Google Scholar] [CrossRef] [PubMed]
  14. Bessarabova, M.; Ishkin, A.; JeBailey, L.; Nikolskaya, T.; Nikolsky, Y. Knowledge-based analysis of proteomics data. BMC Bioinform. 2012, 13 (Suppl. 16), 13. [Google Scholar] [CrossRef] [PubMed]
  15. Kotelnikova, E.; Frahm, K.M.; Lages, J.; Shepelyansky, D.L. Statistical properties of the MetaCore network of protein-protein interactions. Appl. Netw. Sci. 2022, 7, 7. [Google Scholar] [CrossRef]
  16. Kotelnikova, E.; Frahm, K.M.; Shepelyansky, D.L.; Kunduzova, O. Fibrosis protein-protein interactions from Google matrix analysis of MetaCore network. Int. J. Mol. Sci. 2022, 23, 67. [Google Scholar] [CrossRef] [PubMed]
  17. Brin, S.; Page, L. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 1998, 30, 107–117. [Google Scholar] [CrossRef]
  18. Langville, A.M.; Meyer, C.D. Google’s PageRank and Beyond: The Science of Search Engine Rankings; Princeton University Press: Princeton, NJ, USA, 2006. [Google Scholar]
  19. Ermann, L.; Frahm, K.M.; Shepelyansky, D.L. Google matrix analysis of directed networks. Rev. Mod. Phys. 2015, 87, 1261. [Google Scholar] [CrossRef]
  20. Dorogovtsev, S. Lectures in Complex Networks; Oxford University Press: Oxford, UK, 2010. [Google Scholar]
  21. Castellano, C.; Fortunato, S.; Loreto, V. Statistical physics of social dynamics. Rev. Mod. Phys. 2009, 81, 591–646. [Google Scholar] [CrossRef]
  22. Sznajd-Weron, K.; Sznajd, J. Opinion evolution in closed community. Int. J. Mod. Phys. C 2000, 11, 1157–1165. [Google Scholar] [CrossRef]
  23. Sood, V.; Redner, S. Voter Model on Heterogeneous Graphs. Phys. Rev. Lett. 2005, 94, 178701. [Google Scholar] [CrossRef]
  24. Watts, D.J.; Dodds, P.S. Influentials, Networks, and Public Opinion Formation. J. Consum. Res. 2007, 34, 441–458. [Google Scholar] [CrossRef]
  25. Galam, S. Sociophysics: A review of Galam models. Int. J. Mod. Phys. C 2008, 19, 409–440. [Google Scholar] [CrossRef]
  26. Schmittmann, B.; Mukhopadhyay, A. Opinion formation on adaptive networks with intensive average degree. Phys. Rev. E 2010, 82, 066104. [Google Scholar] [CrossRef] [PubMed]
  27. Kandiah, V.; Shepelyansky, D.L. PageRank model of opinion formation on social networks. Physica A 2012, 391, 5779. [Google Scholar] [CrossRef]
  28. Coquide, C.; Lages, J.; Shepelyansky, D.L. Opinion Formation in the World Trade Network. Entropy 2024, 25, 141. [Google Scholar] [CrossRef] [PubMed]
  29. Ermann, L.; Shepelyansky, D.L. Confrontation of capitalism and socialism in Wikipedia Networks. Information 2024, 15, 571. [Google Scholar] [CrossRef]
  30. Hopfield, J.J. Neural networks and physical systems with emergent collective computational abilities. Proc. Nat. Acad. Sci. USA 1982, 79, 2554–2558. [Google Scholar] [CrossRef]
  31. Benedetti, M.; Carillo, L.; Marinari, E.; Mezard, N. Eigenvector dreaming. J. Stat. Mech. 2024, 013302. [Google Scholar] [CrossRef]
  32. Albert, R.; Thakar, J. Boolean modeling: A logic-based dynamic approach for understanding signaling and regulatory networks and for making useful predictions. WIREs Syst. Biol. Med. 2014, 6, 353–369. [Google Scholar] [CrossRef]
  33. Tripathi, S.; Kessler, D.A.; Levine, H. Biological Networks Regulating Cell Fate Choice Are Minimally Frustrated. Phys. Rev. Lett. 2020, 125, 088101. [Google Scholar] [CrossRef]
  34. Frahm, K.M.; Shepelyansky, D.L. Poincare recurrences and Ulam method for the Chirikov standard map. Eur. Phys. J. B 2013, 86, 322. [Google Scholar] [CrossRef]
  35. TRANSPATH. Available online: https://genexplain.com/transpath/ (accessed on 6 September 2024).
  36. REACTOME. Available online: https://reactome.org/ (accessed on 6 September 2024).
  37. Yoon, J.; Jeon, T.; Kwon, J.A.; Yoon, S.Y. Characterization of MYC Rearrangements in Multiple Myeloma: An Optical Genome Mapping Approach. Blood Cancer J. 2024, 14, 165. [Google Scholar] [CrossRef] [PubMed]
  38. Wang, C.; Zhang, J.; Yin, J.; Gan, Y.; Xu, S.; Gu, Y.; Huang, W. Alternative approaches to target Myc for cancer treatment. Signal Transduct. Target. Ther. 2021, 6, 117. [Google Scholar] [CrossRef] [PubMed]
  39. Lama, D.; Vosselman, T.; Sahin, C.; Liano-Pons, J.; Cerrato, C.P.; Nilsson, L.; Teilum, K.; Lane, D.P.; Landreh, M.; Henniksson, M.A. A druggable conformational switch in the c-MYC transactivation domain. Nat. Commun. 2024, 15, 1865. [Google Scholar] [CrossRef] [PubMed]
  40. Liu, X.; Burke, R.M.; Lighthouse, J.K.; Baker, C.D.; Dirkx, R., Jr.; Kang, B.; Chakraborty, Y.; Mickelsen, D.M.; Twardowski, J.J.; Mello, S.S.; et al. p53 Regulates the Extent of Fibroblast Proliferation and Fibrosis in Left Ventricle Pressure Overload. Circ. Res. 2023, 133, 271–287. [Google Scholar] [CrossRef]
  41. Abuetabh, Y.; Wu, H.H.; Chai, C.; Al Yousef, H.; Persad, S.; Sergi, C.M.; Leng, R. DNA damage response revisited: The p53 family and its regulators provide endless cancer therapy opportunities. Exp. Mol. Med. 2022, 54, 1658–1669. [Google Scholar] [CrossRef]
  42. Zhu, F.; Li, Y.; Piano, C.; Liu, T.; Li, H.-H.; Du, J. Senescent Cardiac Fibroblast Is Critical for Cardiac Fibrosis after Myocardial Infarction. PLoS ONE 2013, 8, e74535. [Google Scholar] [CrossRef]
  43. Burgos Villar, K.N.; Liu, X.; Small, E.M. Transcriptional regulation of cardiac fibroblast phenotypic plasticity. Curr. Opin. Physiol. 2022, 28, 100556. [Google Scholar] [CrossRef]
  44. Hoffman, G.E.; Smith, M.S.; Verbalis, J.G. c-Fos and Related Immediate Early Gene Products as Markers of Activity in Neuroendocrine Systems. Front. Neuroendocrinol. 1993, 14, 173–213. [Google Scholar] [CrossRef]
  45. Fields, R.D. Signaling from Neural Impulses to Genes. Neuroscientist 1996, 2, 315–325. [Google Scholar] [CrossRef]
  46. Di Donato, M.; Zamagni, A.; Galasso, G.; Di Zazzo, E.; Giovannelli, P.; Barone, M.V.; Zanoni, M.; Gunelli, R.; Costantini, M.; Auricchio, F.; et al. The androgen receptor/filamin A complex as a target in prostate cancer microenvironment. Cell Death Dis. 2021, 12, 127. [Google Scholar] [CrossRef]
  47. Heldin, C.-H.; Mousatakas, A. Signaling Receptors for TGF-β Family Members. Cold Spring Harb. Perspect. Biol. 2016, 8, a022053. [Google Scholar] [CrossRef] [PubMed]
  48. Dennle, S.; Itoh, S.; Vivien, D.; ten Dijke, P.; Huet, S.; Gauthier, J.-M. Direct binding of Smad3 and Smad4 to critical TGF-β-inducible elements in the promoter of human plasminogen activator inhibitor-type 1 gene. EMBO J. 1998, 17, 3091–3100. [Google Scholar] [CrossRef] [PubMed]
  49. Yang, X.; Letterio, J.J.; Lechleider, R.J.; Chen, L.; Hayman, R.; Gu, H.; Roberts, A.B.; Deng, C. Targeted disruption of SMAD3 results in impaired mucosal immunity and diminished T cell responsiveness to TGF-β. EMBO J. 1999, 18, 1280–1291. [Google Scholar] [CrossRef] [PubMed]
  50. Tripathi, V.; Shin, J.-H.; Stuelten, C.H.; Zhang, Y.E. TGF-β-induced alternative splicing of TAK1 promotes EMT and drug resistance. Oncogene 2019, 38, 3185–3200. [Google Scholar] [CrossRef] [PubMed]
  51. Takaesu, G.; Surabhi, R.M.; Park, K.-J.; Ninomiya-Tsuji, J.; Matsumoto, K.; Gaynor, R.B. TAK1 is Critical for IkB Kinase-mediated Activation of the NF-kB Pathway. J. Mol. Biol. 2003, 326, 105–115. [Google Scholar] [CrossRef]
  52. Hirata, Y.; Takahashi, M.; Morishita, T.; Noguchi, T.; Matsuzawa, A. Post-Translational Modifications of the TAK1-TAB Complex. Int. J. Mol. Sci. 2017, 18, 205. [Google Scholar] [CrossRef]
Figure 1. Probability density p ( f r ) of f r for n b = 6 , 10 , 14 obtained from a histogram with bin width Δ f r = 0.01 using N v = N 10 n b data points f r ( i ) corresponding to the number of variable nodes. For each node i the value f r ( i ) is obtained as the fraction of red outcome ( σ i = + 1 ) of R = 100 , 000 pathway realizations for this node using the initial condition with no initial (variable) blue nodes, i.e., n i b = 0 . The distributions are normalized by 0 1 p ( f r ) d f r and there are strongly peaked at values close to f r , p e a k 0.93 ( n b = 6 ), f r , p e a k 0.81 ( n b = 10 ), f r , p e a k 0.65 ( n b = 14 ) for roughly 82% of nodes. The secondary peak at f r = 0 corresponds to the fraction 0.180 of nodes (same value for the three values of n b ) which stay white after 100 iterations for all R = 100 , 000 pathway realizations.
Figure 1. Probability density p ( f r ) of f r for n b = 6 , 10 , 14 obtained from a histogram with bin width Δ f r = 0.01 using N v = N 10 n b data points f r ( i ) corresponding to the number of variable nodes. For each node i the value f r ( i ) is obtained as the fraction of red outcome ( σ i = + 1 ) of R = 100 , 000 pathway realizations for this node using the initial condition with no initial (variable) blue nodes, i.e., n i b = 0 . The distributions are normalized by 0 1 p ( f r ) d f r and there are strongly peaked at values close to f r , p e a k 0.93 ( n b = 6 ), f r , p e a k 0.81 ( n b = 10 ), f r , p e a k 0.65 ( n b = 14 ) for roughly 82% of nodes. The secondary peak at f r = 0 corresponds to the fraction 0.180 of nodes (same value for the three values of n b ) which stay white after 100 iterations for all R = 100 , 000 pathway realizations.
Biomolecules 14 01395 g001
Figure 2. Probabilityof red outcome f r = i f r ( i ) / N v averaged over all (variable) N v network nodes versus fraction f i b of random initial blue nodes in the full set of N r = N v variables nodes and for the case n b = 6 . The top x-axis shows the number n i b = N r f i b of initial (variable) blue nodes. For each value of n i b the initial condition corresponds to n i b initial blue nodes ( σ i ( τ = 0 ) = 1 ) with different random positions in the set of size N r (for the R = 100 , 000 pathway realizations). The representation is logarithmic on both axis except for the first data point at n i b = 0 which has artificially been placed at a finite position below n i b = 1 for practical reasons. Note that the value f r ( n i b = 0 ) 0.75855 of this data point can also be obtained from the distribution average 0 1 f r p ( f r ) d f r from the data of Figure 1 for the case n b = 6 .
Figure 2. Probabilityof red outcome f r = i f r ( i ) / N v averaged over all (variable) N v network nodes versus fraction f i b of random initial blue nodes in the full set of N r = N v variables nodes and for the case n b = 6 . The top x-axis shows the number n i b = N r f i b of initial (variable) blue nodes. For each value of n i b the initial condition corresponds to n i b initial blue nodes ( σ i ( τ = 0 ) = 1 ) with different random positions in the set of size N r (for the R = 100 , 000 pathway realizations). The representation is logarithmic on both axis except for the first data point at n i b = 0 which has artificially been placed at a finite position below n i b = 1 for practical reasons. Note that the value f r ( n i b = 0 ) 0.75855 of this data point can also be obtained from the distribution average 0 1 f r p ( f r ) d f r from the data of Figure 1 for the case n b = 6 .
Biomolecules 14 01395 g002
Figure 3. As Figure 2 for n b = 6 but using a reduced set of potentially initial blue nodes with N r = 38 using the non-fixed nodes of the set given in Table 1 (i.e., the nodes with K g = 11 , 24 and K g = 31 , , 54 ). For each value of n i b the initial condition corresponds to n i b initial blue nodes ( σ i ( τ = 0 ) = 1 ) in this reduced set of 38 nodes (with random positions for each of the R = 100 , 000 pathway realizations). All other variable nodes have initial white values ( σ i ( τ = 0 ) = 0 ). The first data point at n i b = 0 has the same value as in Figure 2 and it has also artificially been placed at a finite position below n i b = 1 for practical reasons.
Figure 3. As Figure 2 for n b = 6 but using a reduced set of potentially initial blue nodes with N r = 38 using the non-fixed nodes of the set given in Table 1 (i.e., the nodes with K g = 11 , 24 and K g = 31 , , 54 ). For each value of n i b the initial condition corresponds to n i b initial blue nodes ( σ i ( τ = 0 ) = 1 ) in this reduced set of 38 nodes (with random positions for each of the R = 100 , 000 pathway realizations). All other variable nodes have initial white values ( σ i ( τ = 0 ) = 0 ). The first data point at n i b = 0 has the same value as in Figure 2 and it has also artificially been placed at a finite position below n i b = 1 for practical reasons.
Biomolecules 14 01395 g003
Figure 4. As Figure 3 for n b = 6 but using the Erdös set (all nodes with direct links in both directions to the 10 fixed red nodes) as reduced set of potentially initial blue nodes with N r = N E = 353 . The first data point at n i b = 0 has the same value as in Figure 2 and it has also artificially been placed at a finite position below n i b = 1 for practical reasons.
Figure 4. As Figure 3 for n b = 6 but using the Erdös set (all nodes with direct links in both directions to the 10 fixed red nodes) as reduced set of potentially initial blue nodes with N r = N E = 353 . The first data point at n i b = 0 has the same value as in Figure 2 and it has also artificially been placed at a finite position below n i b = 1 for practical reasons.
Biomolecules 14 01395 g004
Figure 5. Color plot of f r ( i ) for the 54 nodes of the set of Table 1 for the case of Figure 4 (i.e., n b = 6 , using the Erdös group with N r = 353 nodes as potential initial blue nodes and n i b being the number of initial blue nodes at random positions in the Erdös group). The x-axis corresponds to the index K g of Table 1 and the y-axis represents to the coarse-grained index n g = 0 , , 31 which corresponds to n i b = n g for n g 10 (linear scale) and n i b 10 × 1 . 2 n g 10 for n g > 10 (logarithmic scale). See also Figure A2 which shows the link between n i b and n g . The values of the color bar correspond to f r ( i ) (i.e., red for f r ( i ) = 1 , green for f r ( i ) 0.06 and blue for f r ( i ) = 0 ). Here small f r ( i ) values have been amplified to improve the visibility (non-linear scale in the Colombo).
Figure 5. Color plot of f r ( i ) for the 54 nodes of the set of Table 1 for the case of Figure 4 (i.e., n b = 6 , using the Erdös group with N r = 353 nodes as potential initial blue nodes and n i b being the number of initial blue nodes at random positions in the Erdös group). The x-axis corresponds to the index K g of Table 1 and the y-axis represents to the coarse-grained index n g = 0 , , 31 which corresponds to n i b = n g for n g 10 (linear scale) and n i b 10 × 1 . 2 n g 10 for n g > 10 (logarithmic scale). See also Figure A2 which shows the link between n i b and n g . The values of the color bar correspond to f r ( i ) (i.e., red for f r ( i ) = 1 , green for f r ( i ) 0.06 and blue for f r ( i ) = 0 ). Here small f r ( i ) values have been amplified to improve the visibility (non-linear scale in the Colombo).
Biomolecules 14 01395 g005
Figure 6. As Figure 3 and Figure 4 but with linear x-axis and using the optimal set with four K-values [ 3 , 4 , 17 , 342 ] for potential initial blue nodes (i.e., n b = 6 , N r = 4 and n i b = 0 , , 4 being the number of initial blue nodes at certain positions in this set). The red square data points, connected by red line, represent the average f r with respect to all possible configurations of n i b initial blue nodes and the black data points represent the conditional average f r c ( [ K 1 , , K n i b ] ) for particular configurations [ K 1 , , K n i b ] of initial (variable) blue nodes (with K j being K-rank values of nodes and n i b = 0 , , 4 ). For each column the top (bottom) black data point corresponds to the top (bottom) configuration shown above. For n i b = 0 (“empty” configuration “ [ ] ” with no initial variable blue node) and n i b = 4 (full configuration “ [ 3 , 4 , 17 , 342 ] ” of all four nodes) there is only one configuration and therefore only one associated black data point.
Figure 6. As Figure 3 and Figure 4 but with linear x-axis and using the optimal set with four K-values [ 3 , 4 , 17 , 342 ] for potential initial blue nodes (i.e., n b = 6 , N r = 4 and n i b = 0 , , 4 being the number of initial blue nodes at certain positions in this set). The red square data points, connected by red line, represent the average f r with respect to all possible configurations of n i b initial blue nodes and the black data points represent the conditional average f r c ( [ K 1 , , K n i b ] ) for particular configurations [ K 1 , , K n i b ] of initial (variable) blue nodes (with K j being K-rank values of nodes and n i b = 0 , , 4 ). For each column the top (bottom) black data point corresponds to the top (bottom) configuration shown above. For n i b = 0 (“empty” configuration “ [ ] ” with no initial variable blue node) and n i b = 4 (full configuration “ [ 3 , 4 , 17 , 342 ] ” of all four nodes) there is only one configuration and therefore only one associated black data point.
Biomolecules 14 01395 g006
Figure 7. Color plot of f r ( i ) as in Figure 5 but using the optimal set with four K-values [ 3 , 4 , 17 , 342 ] for potential initial blue nodes (i.e., n b = 6 , N r = 4 and n i b = 0 , , 4 being the number of initial blue nodes at random positions in this set). Here the y-axis corresponds directly to n i b and the x-axis to K g of Table 1. The values of the color bar correspond to f r ( i ) (with amplified scale as in Figure 5).
Figure 7. Color plot of f r ( i ) as in Figure 5 but using the optimal set with four K-values [ 3 , 4 , 17 , 342 ] for potential initial blue nodes (i.e., n b = 6 , N r = 4 and n i b = 0 , , 4 being the number of initial blue nodes at random positions in this set). Here the y-axis corresponds directly to n i b and the x-axis to K g of Table 1. The values of the color bar correspond to f r ( i ) (with amplified scale as in Figure 5).
Biomolecules 14 01395 g007
Figure 8. The positions of the 353 nodes of the Erös group in the K- K * plane in a double logarithmic representation (colored full circles). The color of each data point corresponds to the value of f r c ( K ) using the initial configuration [ K ] (i.e., n i b = 1 with initial blue node at given value of K). To be more precise, the values of the color bar correspond to f r c ( K ) / f r c , m a x with f r c , m a x 0.77775 being the maximum value of f r c (i.e red for maximum, green for intermediate and blue for zero values and with no amplification of small values). The red (blue) crosses indicate the positions of the fixed 10 red (6 blue) nodes.
Figure 8. The positions of the 353 nodes of the Erös group in the K- K * plane in a double logarithmic representation (colored full circles). The color of each data point corresponds to the value of f r c ( K ) using the initial configuration [ K ] (i.e., n i b = 1 with initial blue node at given value of K). To be more precise, the values of the color bar correspond to f r c ( K ) / f r c , m a x with f r c , m a x 0.77775 being the maximum value of f r c (i.e red for maximum, green for intermediate and blue for zero values and with no amplification of small values). The red (blue) crosses indicate the positions of the fixed 10 red (6 blue) nodes.
Biomolecules 14 01395 g008
Table 1. Table of the subset of N r = 54 selected fibrosis proteins (nodes). Here K g represents the global index of this group, K t , u , d , x represent the index of the four subgroups of 4 TGF- β proteins, 20 up-proteins, 20 down-proteins and 10 additional X-proteins; the K ( K * ) index represents the PageRank (CheiRank) index for the global MetaCore network of N = 40 , 079 nodes; the last column gives the associated protein names.
Table 1. Table of the subset of N r = 54 selected fibrosis proteins (nodes). Here K g represents the global index of this group, K t , u , d , x represent the index of the four subgroups of 4 TGF- β proteins, 20 up-proteins, 20 down-proteins and 10 additional X-proteins; the K ( K * ) index represents the PageRank (CheiRank) index for the global MetaCore network of N = 40 , 079 nodes; the last column gives the associated protein names.
K g K t , u , d , x K K * Protein
1 K t = 1 10,78026,299TGF- β 0
2 K t = 2 2355690TGF- β 1
3 K t = 3 96825,073TGF- β 2
4 K t = 4 472629,508TGF- β 3
5 K u = 1 28,73725,928ADAMTS16
6 K u = 2 347825,137FGF21
7 K u = 3 40,04828,152TNFSF18
8 K u = 4 246719,160ACAN
9 K u = 5 148924,511RPH3A
10 K u = 6 26,60029,559ADAMTS8
11 K u = 7 34,76939,960MEGF6
12 K u = 8 26,29527,326SV2B
13 K u = 9 27,11136,021C1QTNF3
14 K u = 10 34,61639,841ANO4
15 K u = 11 12,69616,566IL11
16 K u = 12 26,62423,640CDH10
17 K u = 13 726330,243HTR2B
18 K u = 14 46476551LAMA1-1
19 K u = 15 834226,295LAMA1-2
20 K u = 16 40218252RAPGEF4
21 K u = 17 29,94536,964DNER
22 K u = 18 22,1598569GALNT3
23 K u = 19 29,14515,531ACSBG1
24 K u = 20 24,7868735OLFM2
25 K d = 1 19,03928,262CLEC3B
26 K d = 2 26,47728,290SCARA5
27 K d = 3 26,10911,185SLC10A6
28 K d = 4 636029,204CXCL5
29 K d = 5 14,9528729MYOC
30 K d = 6 596122,288IFITM1
31 K d = 7 55994483ANGPTL4
32 K d = 8 25,53817,434SELENBP1
33 K d = 9 18,93833,179FMO1
34 K d = 10 34,08039,427GPR88
35 K d = 11 627622,141HMGCS2
36 K d = 12 37,06028,328LGI2
37 K d = 13 91622485PTN
38 K d = 14 5135974ADORA2A
39 K d = 15 778922,652GFRA1
40 K d = 16 67188844IL1R2-1
41 K d = 17 35,44628,306IL1R2-2
42 K d = 18 12,1483444PEG10
43 K d = 19 27,82936,195FMO2
44 K d = 20 197324,994COX4I2
45 K x = 1 313 β -catenin
46 K x = 2 46p53
47 K x = 3 1110ESR1
48 K x = 4 1325STAT3
49 K x = 5 2211RelA
50 K x = 6 3882PPAR- γ
51 K x = 7 111767IKK- β
52 K x = 8 179198SNAIL1
53 K x = 9 2371520MMP-14
54 K x = 10 5782123Flotillin-1
Table 2. Table of selected fibrosis proteins (nodes) belonging to a smaller subset of the Erdös group such that either K 40 or K * 40 where the K ( K * ) index represents the PageRank (CheiRank) index for the global MetaCore network of N = 40 , 079 nodes. f r c represents the average fraction of red nodes obtained when using the corresponding node as single initial non-fixed blue node and the index K f r is obtained by ordering the Erdös group of 353 elements with increasing values of f r c ; the last column gives the associated protein names. The shown subset of the Erdös group corresponds essentially to the nodes with lowest f r c -value (up to K f r = 17 ). The four nodes marked by “*” in the first column are the nodes used for the subsequent example computations using 4 specific optimal nodes.
Table 2. Table of selected fibrosis proteins (nodes) belonging to a smaller subset of the Erdös group such that either K 40 or K * 40 where the K ( K * ) index represents the PageRank (CheiRank) index for the global MetaCore network of N = 40 , 079 nodes. f r c represents the average fraction of red nodes obtained when using the corresponding node as single initial non-fixed blue node and the index K f r is obtained by ordering the Erdös group of 353 elements with increasing values of f r c ; the last column gives the associated protein names. The shown subset of the Erdös group corresponds essentially to the nodes with lowest f r c -value (up to K f r = 17 ). The four nodes marked by “*” in the first column are the nodes used for the subsequent example computations using 4 specific optimal nodes.
K f r K K * f rc Protein
*11710.010691c-Myc
*2460.029542p53
*334270.038899c-Fos
414140.053526Androgen receptor
522110.075088RelA (p65 NF-kB subunit)
694340.07674HDAC1
7188190.078157p300
8627230.078989IGF2BP3
923290.081749SP1
1064290.083725HIF1A
*113130.083939 β -catenin
12203210.13861E2F1
1372850.14694SOX9
14432240.14808BRG1
1581150.15027EGFR
1672180.15149EZH2
1713250.15649STAT3
20480400.17501C/EBP β
2138820.20211PPAR- γ
24298330.20997ELAVL1 (HuR)
26394370.22213CREB1
316370170.28502PUM2
3217,71180.29183CUX1 (p110)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Frahm, K.M.; Kotelnikova, E.; Kunduzova, O.; Shepelyansky, D.L. Fibroblast-Specific Protein-Protein Interactions for Myocardial Fibrosis from MetaCore Network. Biomolecules 2024, 14, 1395. https://doi.org/10.3390/biom14111395

AMA Style

Frahm KM, Kotelnikova E, Kunduzova O, Shepelyansky DL. Fibroblast-Specific Protein-Protein Interactions for Myocardial Fibrosis from MetaCore Network. Biomolecules. 2024; 14(11):1395. https://doi.org/10.3390/biom14111395

Chicago/Turabian Style

Frahm, Klaus M., Ekaterina Kotelnikova, Oksana Kunduzova, and Dima L. Shepelyansky. 2024. "Fibroblast-Specific Protein-Protein Interactions for Myocardial Fibrosis from MetaCore Network" Biomolecules 14, no. 11: 1395. https://doi.org/10.3390/biom14111395

APA Style

Frahm, K. M., Kotelnikova, E., Kunduzova, O., & Shepelyansky, D. L. (2024). Fibroblast-Specific Protein-Protein Interactions for Myocardial Fibrosis from MetaCore Network. Biomolecules, 14(11), 1395. https://doi.org/10.3390/biom14111395

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop