A Unification between Dynamical System Theory and Thermodynamics Involving an Energy, Mass, and Entropy State Space Formalism

Haddad, Wassim M.

doi:10.3390/e15051821

The School of Aerospace Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA

Entropy 2013, 15(5), 1821-1846; https://doi.org/10.3390/e15051821

Submission received: 29 March 2013 / Revised: 10 May 2013 / Accepted: 10 May 2013 / Published: 16 May 2013

(This article belongs to the Special Issue Dynamical Systems)

Abstract

In this paper, we combine the two universalisms of thermodynamics and dynamical systems theory to develop a dynamical system formalism for classical thermodynamics. Specifically, using a compartmental dynamical system energy flow model involving heat flow, work energy, and chemical reactions, we develop a state-space dynamical system model that captures the key aspects of thermodynamics, including its fundamental laws. In addition, we show that our thermodynamically consistent dynamical system model is globally semistable with system states converging to a state of temperature equipartition. Furthermore, in the presence of chemical reactions, we use the law of mass-action and the notion of chemical potential to show that the dynamic system states converge to a state of temperature equipartition and zero affinity corresponding to a state of chemical equilibrium.

Keywords:

system thermodynamics; energy flow; interconnected systems; entropy; Helmholtz free energy; Gibbs free energy; chemical thermodynamics; mass action kinetics; chemical potential; neuroscience and thermodynamics

1. Introduction

Thermodynamics is a physical branch of science that governs the thermal behavior of dynamical systems from those as simple as refrigerators to those as complex as our expanding universe. The laws of thermodynamics involving conservation of energy and nonconservation of entropy are, without a doubt, two of the most useful and general laws in all sciences. The first law of thermodynamics, according to which energy cannot be created or destroyed but is merely transformed from one form to another, and the second law of thermodynamics, according to which the usable energy in an adiabatically isolated dynamical system is always diminishing in spite of the fact that energy is conserved, have had an impact far beyond science and engineering. The second law of thermodynamics is intimately connected to the irreversibility of dynamical processes. In particular, the second law asserts that a dynamical system undergoing a transformation from one state to another cannot be restored to its original state and at the same time restore its environment to its original condition. That is, the status quo cannot be restored everywhere. This gives rise to a monotonically increasing quantity known as entropy. Entropy permeates the whole of nature, and unlike energy, which describes the state of a dynamical system, entropy is a measure of change in the status quo of a dynamical system.

There is no doubt that thermodynamics is a theory of universal proportions whose laws reign supreme among the laws of nature and are capable of addressing some of science’s most intriguing questions about the origins and fabric of our universe. The laws of thermodynamics are among the most firmly established laws of nature and play a critical role in the understanding of our expanding universe. In addition, thermodynamics forms the underpinning of several fundamental life science and engineering disciplines, including biological systems, physiological systems, chemical reaction systems, ecological systems, information systems, and network systems, to cite but a few examples. While from its inception its speculations about the universe have been grandiose, its mathematical foundation has been amazingly obscure and imprecise [1,2,3,4]. This is largely due to the fact that classical thermodynamics is a physical theory concerned mainly with equilibrium states and does not possess equations of motion. The absence of a state space formalism in classical thermodynamics, and physics in general, is quite disturbing and in our view largely responsible for the monomeric state of classical thermodynamics.

In recent research [4,5,6], we combined the two universalisms of thermodynamics and dynamical systems theory under a single umbrella to develop a dynamical system formalism for classical thermodynamics so as to harmonize it with classical mechanics. While it seems impossible to reduce thermodynamics to a mechanistic world picture due to microscopic reversibility and Poincaré recurrence, the system thermodynamic formulation of [4] provides a harmonization of classical thermodynamics with classical mechanics. In particular, our dynamical system formalism captures all of the key aspects of thermodynamics, including its fundamental laws, while providing a mathematically rigorous formulation for thermodynamical systems out of equilibrium by unifying the theory of heat transfer with that of classical thermodynamics. In addition, the concept of entropy for a nonequilibrium state of a dynamical process is defined, and its global existence and uniqueness is established. This state space formalism of thermodynamics shows that the behavior of heat, as described by the conservation equations of thermal transport and as described by classical thermodynamics, can be derived from the same basic principles and is part of the same scientific discipline.

Connections between irreversibility, the second law of thermodynamics, and the entropic arrow of time are also established in [4,6]. Specifically, we show a state irrecoverability and, hence, a state irreversibility nature of thermodynamics. State irreversibility reflects time-reversal non-invariance, wherein time-reversal is not meant literally; that is, we consider dynamical systems whose trajectory reversal is or is not allowed and not a reversal of time itself. In addition, we show that for every nonequilibrium system state and corresponding system trajectory of our thermodynamically consistent dynamical system, there does not exist a state such that the corresponding system trajectory completely recovers the initial system state of the dynamical system and at the same time restores the energy supplied by the environment back to its original condition. This, along with the existence of a global strictly increasing entropy function on every nontrivial system trajectory, establishes the existence of a completely ordered time set having a topological structure involving a closed set homeomorphic to the real line, thus giving a clear time-reversal asymmetry characterization of thermodynamics and establishing an emergence of the direction of time flow.

In this paper, we reformulate and extend some of the results of [4]. In particular, unlike the framework in [4] wherein we establish the existence and uniqueness of a global entropy function of a specific form for our thermodynamically consistent system model, in this paper we assume the existence of a continuously differentiable, strictly concave function that leads to an entropy inequality that can be identified with the second law of thermodynamics as a statement about entropy increase. We then turn our attention to stability and convergence. Specifically, using Lyapunov stability theory and the Krasovskii–LaSalle invariance principle [7], we show that for an adiabatically isolated system, the proposed interconnected dynamical system model is Lyapunov stable with convergent trajectories to equilibrium states where the temperatures of all subsystems are equal. Finally, we present a state-space dynamical system model for chemical thermodynamics. In particular, we use the law of mass-action to obtain the dynamics of chemical reaction networks. Furthermore, using the notion of the chemical potential [8,9], we unify our state space mass-action kinetics model with our thermodynamic dynamical system model involving energy exchange. In addition, we show that entropy production during chemical reactions is nonnegative and the dynamical system states of our chemical thermodynamic state space model converge to a state of temperature equipartition and zero affinity (i.e., the difference between the chemical potential of the reactants and the chemical potential of the products in a chemical reaction).

The central thesis of this paper is to present a state space formulation for equilibrium and nonequilibrium thermodynamics based on a dynamical system theory combined with interconnected nonlinear compartmental systems that ensures a consistent thermodynamic model for heat, energy, and mass flow. In particular, the proposed approach extends the framework developed in [4] addressing closed thermodynamic systems that exchange energy but not matter with the environment to open thermodynamic systems that exchange matter and energy with their environment. In addition, our results go beyond the results of [4] by developing rigorous notions of enthalpy, Gibbs free energy, Helmholtz free energy, and Gibbs’ chemical potential using a state space formulation of dynamics, energy and mass conservation principles, as well as the law of mass-action kinetics and the law of superposition of elementary reactions without invoking statistical mechanics arguments.

2. Notation, Definitions, and Mathematical Preliminaries

In this section, we establish notation, definitions, and provide some key results necessary for developing the main results of this paper. Specifically,

R

denotes the set of real numbers,

{\bar{Z}}_{+}

(respectively,

Z_{+}

) denotes the set of nonnegative (respectively, positive) integers,

R^{q}

denotes the set of

q \times 1

column vectors,

R^{n \times m}

denotes the set of

n \times m

real matrices,

P^{n}

(respectively,

N^{n}

) denotes the set of positive (respectively, nonnegative) definite matrices,

{(\cdot)}^{T}

denotes transpose,

I_{q}

or I denotes the

q \times q

identity matrix,

e

denotes the ones vector of order q, that is,

e ≜ {[1, \dots, 1]}^{T} \in R^{q}

, and

e_{i} \in R^{q}

denotes a vector with unity in the ith component and zeros elsewhere. For

x \in R^{q}

we write

x \geq \geq 0

(respectively,

x > > 0

) to indicate that every component of x is nonnegative (respectively, positive). In this case, we say that x is nonnegative or positive, respectively. Furthermore,

{\bar{R}}_{+}^{q}

and

R_{+}^{q}

denote the nonnegative and positive orthants of

R^{q}

, that is, if

x \in R^{q}

, then

x \in {\bar{R}}_{+}^{q}

and

x \in R_{+}^{q}

are equivalent, respectively, to

x \geq \geq 0

and

x > > 0

. Analogously,

{\bar{R}}_{+}^{n \times m}

(respectively,

R_{+}^{n \times m}

) denotes the set of

n \times m

real matrices whose entries are nonnegative (respectively, positive). For vectors

x, y \in R^{q}

, with components

x_{i}

and

y_{i}

,

i = 1, \dots, q

, we use

x \circ y

to denote component-by-component multiplication, that is,

x \circ y ≜ {[x_{1} y_{1}, \dots, x_{q} y_{q}]}^{T}

. Finally, we write

\partial S

,

\overset{\circ}{S}

, and

\bar{S}

to denote the boundary, the interior, and the closure of the set

S

, respectively.

We write

∥ \cdot ∥

for the Euclidean vector norm,

V^{'} (x) ≜ \frac{\partial V (x)}{\partial x}

for the Fréchet derivative of V at x,

B_{ε} (α)

,

α \in R^{q}

,

ε > 0

, for the open ball centered at α with radius ε, and

x (t) \to M

as

t \to \infty

to denote that

x (t)

approaches the set

M

(that is, for every

ε > 0

there exists

T > 0

such that dist

(x (t), M) < ε

for all

t > T

, where dist

(p, M) ≜ {inf}_{x \in M} ∥ p - x ∥

). The notions of openness, convergence, continuity, and compactness that we use throughout the paper refer to the topology generated on

D \subseteq R^{q}

by the norm

∥ \cdot ∥

. A subset

N

of

D

is relatively open in

D

if

N

is open in the subspace topology induced on

D

by the norm

∥ \cdot ∥

. A point

x \in R^{q}

is a subsequential limit of the sequence

\{x_{i}\}_{i = 0}^{\infty}

in

R^{q}

if there exists a subsequence of

\{x_{i}\}_{i = 0}^{\infty}

that converges to x in the norm

∥ \cdot ∥

. Recall that every bounded sequence has at least one subsequential limit. A divergent sequence is a sequence having no convergent subsequence.

Consider the nonlinear autonomous dynamical system

\begin{matrix} \dot{x} (t) & = & f (x (t)), x (0) = x_{0}, t \in I_{x_{0}} \end{matrix}

(1)

where

x (t) \in D \subseteq R^{n}

,

t \in I_{x_{0}}

, is the system state vector,

D

is a relatively open set,

f : D \to R^{n}

is continuous on

D

, and

I_{x_{0}} = [0, τ_{x_{0}})

,

0 \leq τ_{x_{0}} \leq \infty

, is the maximal interval of existence for the solution

x (\cdot)

of Equation (1). We assume that, for every initial condition

x (0) \in D

, the differential Equation (1) possesses a unique right-maximally defined continuously differentiable solution which is defined on

[0, \infty)

. Letting

s (\cdot, x)

denote the right-maximally defined solution of Equation (1) that satisfies the initial condition

x (0) = x

, the above assumptions imply that the map

s : [0, \infty) \times D \to D

is continuous ([Theorem V.2.1] [10]), satisfies the consistency property

s (0, x) = x

, and possesses the semigroup property

s (t, s (τ, x)) = s (t + τ, x)

for all

t, τ \geq 0

and

x \in D

. Given

t \geq 0

and

x \in D

, we denote the map

s (t, \cdot) : D \to D

by

s_{t}

and the map

s (\cdot, x) : [0, \infty) \to D

by

s^{x}

. For every

t \in R

, the map

s_{t}

is a homeomorphism and has the inverse

s_{- t}

.

The orbit

O_{x}

of a point

x \in D

is the set

s^{x} ([0, \infty))

. A set

D_{c} \subseteq D

is positively invariant relative to Equation (1) if

s_{t} (D_{c}) \subseteq D_{c}

for all

t \geq 0

or, equivalently,

D_{c}

contains the orbits of all its points. The set

D_{c}

is invariant relative to Equation (1) if

s_{t} (D_{c}) = D_{c}

for all

t \geq 0

. The positive limit set of

x \in R^{q}

is the set

ω (x)

of all subsequential limits of sequences of the form

\{s (t_{i}, x)\}_{i = 0}^{\infty}

, where

\{t_{i}\}_{i = 0}^{\infty}

is an increasing divergent sequence in

[0, \infty)

.

ω (x)

is closed and invariant, and

{\bar{O}}_{x} = O_{x} \cup ω (x)

[7]. In addition, for every

x \in R^{q}

that has bounded positive orbits,

ω (x)

is nonempty and compact, and, for every neighborhood

N

of

ω (x)

, there exists

T > 0

such that

s_{t} (x) \in N

for every

t > T

[7]. Furthermore,

x_{e} \in D

is an equilibrium point of Equation (1) if and only if

f (x_{e}) = 0

or, equivalently,

s (t, x_{e}) = x_{e}

for all

t \geq 0

. Finally, recall that if all solutions to Equation (1) are bounded, then it follows from the Peano–Cauchy theorem ([7] [p. 76]) that

I_{x_{0}} = R

.
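To make the consistency and semigroup properties concrete, the following minimal sketch checks them numerically for the scalar system x' = -x, whose flow s(t, x) = x e^{-t} is known in closed form. The example system and the function name `flow` are illustrative assumptions and are not part of the development above.

```python
import math


def flow(t: float, x: float) -> float:
    """Closed-form solution s(t, x) = x * exp(-t) of the illustrative system x' = -x."""
    return x * math.exp(-t)


x0, t, tau = 2.5, 0.7, 1.3

# Consistency property: s(0, x) = x.
consistency_gap = abs(flow(0.0, x0) - x0)

# Semigroup property: s(t, s(tau, x)) = s(t + tau, x), up to rounding error.
semigroup_gap = abs(flow(t, flow(tau, x0)) - flow(t + tau, x0))
```

Both gaps should vanish up to floating-point rounding, which is all that a numerical check of these identities can confirm.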

Definition 2.1 ([11] [pp. 9, 10] ) Let

f = {[f_{1}, \dots, f_{n}]}^{T} : D \subseteq {\bar{R}}_{+}^{n} \to R^{n}

. Then f is essentially nonnegative if

f_{i} (x) \geq 0

, for all

i = 1, \dots, n

, and

x \in {\bar{R}}_{+}^{n}

such that

x_{i} = 0

, where

x_{i}

denotes the ith component of x.

Proposition 2.1 ([11] [p. 12] ) Suppose

{\bar{R}}_{+}^{n} \subset D

. Then

{\bar{R}}_{+}^{n}

is an invariant set with respect to Equation (1) if and only if

f : D \to R^{n}

is essentially nonnegative.
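As an informal illustration of Definition 2.1, the sketch below tests the essential nonnegativity condition on a finite set of sample points of the boundary of the nonnegative orthant. A finite sample can only falsify the condition, never prove it; the two vector fields and all function names are hypothetical examples introduced here.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def is_essentially_nonnegative_on_samples(
    f: Callable[[Sequence[float]], Vector],
    samples: Sequence[Sequence[float]],
    tol: float = 1e-12,
) -> bool:
    """Return False if some sample x has x_i == 0 but f_i(x) < 0; True otherwise."""
    for x in samples:
        fx = f(x)
        for i, xi in enumerate(x):
            if xi == 0.0 and fx[i] < -tol:
                return False
    return True


def two_compartment_flow(x: Sequence[float]) -> Vector:
    # Illustrative exchange model: x1' = x2 - x1, x2' = x1 - x2.
    return [x[1] - x[0], x[0] - x[1]]


def constant_drain(x: Sequence[float]) -> Vector:
    # Illustrative counterexample: x1' = -1 points out of the orthant at x1 = 0.
    return [-1.0, 0.0]


boundary_samples = [[0.0, 0.0], [0.0, 1.0], [2.0, 0.0], [0.0, 3.5]]
ok_flow = is_essentially_nonnegative_on_samples(two_compartment_flow, boundary_samples)
ok_drain = is_essentially_nonnegative_on_samples(constant_drain, boundary_samples)
```

The exchange model passes the sampled check, whereas the constant drain fails it at the origin, in line with Proposition 2.1: a trajectory of the second system started at x1 = 0 leaves the nonnegative orthant immediately.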

Definition 2.2 ([11] [pp. 13, 23] ) An equilibrium solution

x (t) \equiv x_{e} \in {\bar{R}}_{+}^{n}

to Equation (1) is Lyapunov stable with respect to

{\bar{R}}_{+}^{n}

if, for all

ε > 0

, there exists

δ = δ (ε) > 0

such that if

x \in B_{δ} (x_{e}) \cap {\bar{R}}_{+}^{n}

, then

x (t) \in B_{ε} (x_{e}) \cap {\bar{R}}_{+}^{n}

,

t \geq 0

. An equilibrium solution

x (t) \equiv x_{e} \in {\bar{R}}_{+}^{n}

to Equation (1) is semistable with respect to

{\bar{R}}_{+}^{n}

if it is Lyapunov stable with respect to

{\bar{R}}_{+}^{n}

and there exists δ > 0 such that if

x_{0} \in B_{δ} (x_{e}) \cap {\bar{R}}_{+}^{n}

, then

{lim}_{t \to \infty} x (t)

exists and corresponds to a Lyapunov stable equilibrium point with respect to

{\bar{R}}_{+}^{n}

. The system given by Equation (1) is said to be semistable with respect to

{\bar{R}}_{+}^{n}

if every equilibrium point of Equation (1) is semistable with respect to

{\bar{R}}_{+}^{n}

. The system given by Equation (1) is said to be globally semistable with respect to

{\bar{R}}_{+}^{n}

if Equation (1) is semistable with respect to

{\bar{R}}_{+}^{n}

and, for every

x_{0} \in {\bar{R}}_{+}^{n}, {lim}_{t \to \infty} x (t)

exists.

Proposition 2.2 ([11] [p. 22]) Consider the nonlinear dynamical system given by Equation (1) where f is essentially nonnegative and let

x \in {\bar{R}}_{+}^{n}

. If the positive limit set of Equation (1) contains a Lyapunov stable (with respect to

{\bar{R}}_{+}^{n}

) equilibrium point y, then

y = {lim}_{t \to \infty} s (t, x)

.

3. Interconnected Thermodynamic Systems: A State Space Energy Flow Perspective

The fundamental and unifying concept in the analysis of thermodynamic systems is the concept of energy. The energy of a state of a dynamical system is the measure of its ability to produce changes (motion) in its own system state as well as changes in the system states of its surroundings. These changes occur as a direct consequence of the energy flow between different subsystems within the dynamical system. Heat (energy) is a fundamental concept of thermodynamics involving the capacity of hot bodies (more energetic subsystems with higher energy gradients) to produce work. As in thermodynamic systems, dynamical systems can exhibit energy (due to friction) that becomes unavailable to do useful work. This in turn contributes to an increase in system entropy, a measure of the tendency of a system to lose the ability of performing useful work. In this section, we use the state space formalism to construct a mathematical model of a thermodynamic system that is consistent with basic thermodynamic principles.

Specifically, we consider a large-scale system model with a combination of subsystems (compartments or parts) that is perceived as a single entity. For each subsystem (compartment) making up the system, we postulate the existence of an energy state variable such that the knowledge of these subsystem state variables at any given time

t = t_{0}

, together with the knowledge of any inputs (heat fluxes) to each of the subsystems for time

t \geq t_{0}

, completely determines the behavior of the system for any given time

t \geq t_{0}

. Hence, the (energy) state of our dynamical system at time t is uniquely determined by the state at time

t_{0}

and any external inputs for time

t \geq t_{0}

and is independent of the state and inputs before time

t_{0}

.

More precisely, we consider a large-scale interconnected dynamical system composed of a large number of units with aggregated (or lumped) energy variables representing homogeneous groups of these units. If all the units comprising the system are identical (that is, the system is perfectly homogeneous), then the behavior of the dynamical system can be captured by that of a single plenipotentiary unit. Alternatively, if every interacting system unit is distinct, then the resulting model constitutes a microscopic system. To develop a middle-ground thermodynamic model placed between complete aggregation (classical thermodynamics) and complete disaggregation (statistical thermodynamics), we subdivide the large-scale dynamical system into a finite number of compartments, each formed by a large number of homogeneous units. Each compartment represents the energy content of the different parts of the dynamical system, and different compartments interact by exchanging heat. Thus, our compartmental thermodynamic model utilizes subsystems or compartments to describe the energy distribution among distinct regions in space, with intercompartmental flows representing the heat transfer between these regions. Decreasing the number of compartments results in a more aggregated or homogeneous model, whereas increasing the number of compartments leads to a higher degree of disaggregation resulting in a heterogeneous model.

To formulate our state space thermodynamic model, consider the interconnected dynamical system

G

shown in Figure 1 involving energy exchange between q interconnected subsystems. Let

E_{i} : [0, \infty) \to {\bar{R}}_{+}

denote the energy (and hence a nonnegative quantity) of the ith subsystem, let

S_{i} : [0, \infty) \to R

denote the external power (heat flux) supplied to (or extracted from) the ith subsystem, let

ϕ_{i j} : {\bar{R}}_{+}^{q} \to R

,

i \neq j, i, j = 1, \dots, q

, denote the net instantaneous rate of energy (heat) flow from the jth subsystem to the ith subsystem, and let

σ_{i i} : {\bar{R}}_{+}^{q} \to {\bar{R}}_{+}, i = 1, \dots, q

, denote the instantaneous rate of energy (heat) dissipation from the ith subsystem to the environment. Here, we assume that

ϕ_{i j} : {\bar{R}}_{+}^{q} \to R

,

i \neq j

,

i, j = 1, \dots, q

, and

σ_{i i} : {\bar{R}}_{+}^{q} \to {\bar{R}}_{+}

,

i = 1, \dots, q

, are locally Lipschitz continuous on

{\bar{R}}_{+}^{q}

and

S_{i} : [0, \infty) \to R, i = 1, \dots, q

, are bounded piecewise continuous functions of time.

Figure 1. Interconnected dynamical system

G

.

An energy balance for the ith subsystem yields

\begin{matrix} E_{i} (T) & = & E_{i} (t_{0}) + [\sum_{j = 1, j \neq i}^{q} \int_{t_{0}}^{T} ϕ_{i j} (E (t)) d t] - \int_{t_{0}}^{T} σ_{i i} (E (t)) d t + \int_{t_{0}}^{T} S_{i} (t) d t, T \geq t_{0} \end{matrix}

(2)

or, equivalently, in vector form,

\begin{matrix} E (T) & = & E (t_{0}) + \int_{t_{0}}^{T} w (E (t)) d t - \int_{t_{0}}^{T} d (E (t)) d t + \int_{t_{0}}^{T} S (t) d t, T \geq t_{0} \end{matrix}

(3)

where

E (t) ≜ {[E_{1} (t), \dots, E_{q} (t)]}^{T}

,

t \geq t_{0}

, is the system energy state,

d (E (t)) ≜ {[σ_{11} (E (t)), \dots, σ_{q q} (E (t))]}^{T}

,

t \geq t_{0}

, is the system dissipation,

S (t) ≜ {[S_{1} (t), \dots, S_{q} (t)]}^{T}

,

t \geq t_{0}

, is the system heat flux, and

w = {[w_{1}, \dots, w_{q}]}^{T} : {\bar{R}}_{+}^{q} \to R^{q}

is such that

\begin{matrix} w_{i} (E) = \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (E), E \in {\bar{R}}_{+}^{q} \end{matrix}

(4)

Since

ϕ_{i j} : {\bar{R}}_{+}^{q} \to R

,

i \neq j, i, j = 1, \dots, q

, denotes the net instantaneous rate of energy flow from the jth subsystem to the ith subsystem, it is clear that

ϕ_{i j} (E) = - ϕ_{j i} (E)

,

E \in {\bar{R}}_{+}^{q}

,

i \neq j

,

i, j = 1, \dots, q

, which further implies that

e^{T} w (E) = 0

,

E \in {\bar{R}}_{+}^{q}

.
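The implication from the skew-symmetry of the flows to the identity e^T w(E) = 0 can be checked numerically. The sketch below assumes, purely for illustration, the linear flow law ϕ_ij(E) = E_j - E_i; this particular law and the names `heat_flow` and `coupling_vector` are hypothetical and are not prescribed by the model.

```python
from typing import List


def heat_flow(E: List[float], i: int, j: int) -> float:
    """Hypothetical phi_ij(E): net energy flow from subsystem j to subsystem i."""
    return E[j] - E[i]


def coupling_vector(E: List[float]) -> List[float]:
    """w_i(E) = sum over j != i of phi_ij(E), as in Equation (4)."""
    q = len(E)
    return [sum(heat_flow(E, i, j) for j in range(q) if j != i) for i in range(q)]


E = [3.0, 1.0, 0.5, 2.5]
w = coupling_vector(E)

# e^T w(E): every pairwise flow appears once with each sign, so the sum cancels.
total = sum(w)
```

Any other flow law with ϕ_ij(E) = -ϕ_ji(E) could be substituted for `heat_flow` and the total would still vanish up to rounding.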

Note that Equation (2) yields a conservation of energy equation and implies that the energy stored in the ith subsystem is equal to the external energy supplied to (or extracted from) the ith subsystem plus the energy gained by the ith subsystem from all other subsystems due to subsystem coupling minus the energy dissipated from the ith subsystem to the environment. Equivalently, Equation (2) can be rewritten as

\begin{matrix} {\dot{E}}_{i} (t) = [\sum_{j = 1, j \neq i}^{q} ϕ_{i j} (E (t))] - σ_{i i} (E (t)) + S_{i} (t), E_{i} (t_{0}) = E_{i 0}, t \geq t_{0} \end{matrix}

(5)

or, in vector form,

\begin{matrix} \dot{E} (t) & = & w (E (t)) - d (E (t)) + S (t), E (t_{0}) = E_{0}, t \geq t_{0} \end{matrix}

(6)

where

E_{0} ≜ {[E_{10}, \dots, E_{q 0}]}^{T}

, yielding a power balance equation that characterizes energy flow between subsystems of the interconnected dynamical system

G

. We assume that

ϕ_{i j} (E) \geq 0, E \in {\bar{R}}_{+}^{q}

, whenever

E_{i} = 0

,

i \neq j

,

i, j = 1, \dots, q

, and

σ_{i i} (E) = 0

, whenever

E_{i} = 0

,

i = 1, \dots, q

. The above constraint implies that if the energy of the ith subsystem of

G

is zero, then this subsystem cannot supply any energy to its surroundings or dissipate energy to the environment. In this case,

w (E) - d (E), E \in {\bar{R}}_{+}^{q}

, is essentially nonnegative [12]. Thus, if

S (t) \equiv 0

, then, by Proposition 2.1, the solutions to Equation (6) are nonnegative for all nonnegative initial conditions. See [4,11,12] for further details.
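A minimal numerical sketch of the power balance Equation (6) with S(t) ≡ 0 is given next. It assumes a hypothetical linear flow law ϕ_ij(E) = E_j - E_i and a hypothetical linear dissipation σ_ii(E) = k E_i, both of which satisfy the boundary conditions stated above, and it integrates the model with a forward Euler scheme. The step size, the dissipation constant, and the function names are illustrative choices only.

```python
from typing import List


def step(E: List[float], dt: float, k_diss: float) -> List[float]:
    """One forward Euler step of E' = w(E) - d(E) with zero external heat flux."""
    q = len(E)
    dE = []
    for i in range(q):
        w_i = sum(E[j] - E[i] for j in range(q) if j != i)  # assumed phi_ij
        d_i = k_diss * E[i]                                 # assumed sigma_ii
        dE.append(w_i - d_i)
    return [E[i] + dt * dE[i] for i in range(q)]


def simulate(E0: List[float], dt: float = 0.01, steps: int = 2000,
             k_diss: float = 0.1) -> List[List[float]]:
    """Return the list of energy states visited by the Euler iteration."""
    traj = [list(E0)]
    for _ in range(steps):
        traj.append(step(traj[-1], dt, k_diss))
    return traj


# The second subsystem starts with zero energy and can only receive energy.
traj = simulate([4.0, 0.0, 1.0])
min_energy = min(min(E) for E in traj)
totals = [sum(E) for E in traj]
```

In this sketch the subsystem energies remain nonnegative and the total energy decays only through the dissipation term, since the coupling terms cancel in the sum. Nonnegativity of the discrete iterates relies on the step size being small relative to the flow rates and is not guaranteed for arbitrary `dt`.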

Since our thermodynamic compartmental model involves intercompartmental flows representing energy transfer between compartments, we can use graph-theoretic notions with undirected graph topologies (i.e., bidirectional energy flows) to capture the compartmental system interconnections. Graph theory [13,14] can be useful in the analysis of the connectivity properties of compartmental systems. In particular, an undirected graph can be constructed to capture a compartmental model in which the compartments are represented by nodes and the flows are represented by edges or arcs. In this case, the environment must also be considered as an additional node.

For the interconnected dynamical system

G

with the power balance Equation (6), we define a connectivity matrix

C \in R^{q \times q}

such that for

i \neq j

,

i, j = 1, \dots, q

,

C_{(i, j)} ≜ 1

if

ϕ_{i j} (E) ≢ 0

and

C_{(i, j)} ≜ 0

otherwise, and

C_{(i, i)} ≜ - \sum_{k = 1, k \neq i}^{q} C_{(k, i)}

,

i = 1, \dots, q

. (The negative of the connectivity matrix, that is,

- C

, is known as the graph Laplacian in the literature.) Recall that if rank

C = q - 1

, then

G

is strongly connected [4] and energy exchange is possible between any two subsystems of

G

.
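The rank condition on the connectivity matrix can be illustrated on small examples. The following sketch builds C from an undirected edge list, where an edge (i, j) means that ϕ_ij is not identically zero, and computes its rank by exact Gaussian elimination. The two four-compartment topologies and the helper names are hypothetical.

```python
from fractions import Fraction
from typing import List, Sequence, Tuple


def connectivity_matrix(q: int, edges: Sequence[Tuple[int, int]]) -> List[List[int]]:
    """C_(i,j) = 1 for connected pairs, C_(i,i) = -(sum of column i off-diagonals)."""
    C = [[0] * q for _ in range(q)]
    for i, j in edges:
        C[i][j] = 1
        C[j][i] = 1
    for i in range(q):
        C[i][i] = -sum(C[k][i] for k in range(q) if k != i)
    return C


def rank(matrix: List[List[int]]) -> int:
    """Rank by row reduction over the rationals (no rounding error)."""
    M = [[Fraction(v) for v in row] for row in matrix]
    rows, cols = len(M), len(M[0])
    r = 0
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if pivot is None:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, rows):
            factor = M[i][c] / M[r][c]
            M[i] = [a - factor * b for a, b in zip(M[i], M[r])]
        r += 1
        if r == rows:
            break
    return r


chain = connectivity_matrix(4, [(0, 1), (1, 2), (2, 3)])  # 0 - 1 - 2 - 3
split = connectivity_matrix(4, [(0, 1), (2, 3)])          # two separate pairs
rank_chain = rank(chain)
rank_split = rank(split)
```

For the chain the rank equals q - 1 = 3, so energy can be exchanged between any two compartments, whereas the split topology has rank 2 and consists of two groups that never exchange energy.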

The next definition introduces a notion of entropy for the interconnected dynamical system

G

.

Definition 3.1 Consider the interconnected dynamical system

G

with the power balance Equation (6). A continuously differentiable, strictly concave function

S : {\bar{R}}_{+}^{q} \to R

is called the entropy function of

G

if

\begin{matrix} (\frac{\partial S (E)}{\partial E_{i}} - \frac{\partial S (E)}{\partial E_{j}}) ϕ_{i j} (E) \geq 0, E \in {\bar{R}}_{+}^{q}, i \neq j, i, j = 1, \dots, q \end{matrix}

(7)

and

\frac{\partial S (E)}{\partial E_{i}} = \frac{\partial S (E)}{\partial E_{j}}

if and only if

ϕ_{i j} (E) = 0

with

C_{(i, j)} = 1

,

i \neq j

,

i, j = 1, \dots, q

.

It follows from Definition 3.1 that for an isolated system

G

, that is,

S (t) \equiv 0

and

d (E) \equiv 0

, the entropy function of

G

is a nondecreasing function of time. To see this, note that

\begin{matrix} \dot{S} (E) & = & \frac{\partial S (E)}{\partial E} \dot{E} \\ & = & \sum_{i = 1}^{q} \frac{\partial S (E)}{\partial E_{i}} \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (E) \\ & = & \sum_{i = 1}^{q} \sum_{j = i + 1}^{q} (\frac{\partial S (E)}{\partial E_{i}} - \frac{\partial S (E)}{\partial E_{j}}) ϕ_{i j} (E) \\ & \geq & 0, E \in {\bar{R}}_{+}^{q} \end{matrix}

(8)

where

\frac{\partial S (E)}{\partial E} ≜ [\frac{\partial S (E)}{\partial E_{1}}, \dots, \frac{\partial S (E)}{\partial E_{q}}]

and where we used the fact that

ϕ_{i j} (E) = - ϕ_{j i} (E)

,

E \in {\bar{R}}_{+}^{q}

,

i \neq j

,

i, j = 1, \dots, q

.
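The two expressions for the entropy rate in Equation (8) can be compared numerically. The sketch below assumes, as an illustration only, the entropy function S(E) = Σ log(c + E_i) with c = 1 and the flow law ϕ_ij(E) = E_j - E_i, a pair that satisfies Equation (7); neither choice is dictated by Definition 3.1, and the function names are hypothetical.

```python
import random
from typing import List

C_SHIFT = 1.0  # hypothetical constant c > 0 in S(E) = sum_i log(c + E_i)


def dS(E: List[float], i: int) -> float:
    """Partial derivative of the assumed entropy with respect to E_i."""
    return 1.0 / (C_SHIFT + E[i])


def phi(E: List[float], i: int, j: int) -> float:
    """Assumed net energy flow from subsystem j to subsystem i."""
    return E[j] - E[i]


def entropy_rate_direct(E: List[float]) -> float:
    """Second line of Equation (8): sum_i dS/dE_i * sum_{j != i} phi_ij."""
    q = len(E)
    return sum(dS(E, i) * sum(phi(E, i, j) for j in range(q) if j != i)
               for i in range(q))


def entropy_rate_pairwise(E: List[float]) -> float:
    """Third line of Equation (8): sum over pairs i < j of (dS_i - dS_j) * phi_ij."""
    q = len(E)
    return sum((dS(E, i) - dS(E, j)) * phi(E, i, j)
               for i in range(q) for j in range(i + 1, q))


rng = random.Random(0)
states = [[rng.uniform(0.0, 5.0) for _ in range(4)] for _ in range(200)]
rates = [(entropy_rate_direct(E), entropy_rate_pairwise(E)) for E in states]
```

At every sampled state the two forms agree up to rounding, and the pairwise form is a sum of nonnegative terms, which is the content of the final inequality in Equation (8).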

Proposition 3.1 Consider the isolated (i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

) interconnected dynamical system

G

with the power balance Equation (6). Assume that rank

C = q - 1

and there exists an entropy function

S : {\bar{R}}_{+}^{q} \to R

of

G

. Then,

\sum_{j = 1}^{q} ϕ_{i j} (E) = 0

for all

i = 1, \dots, q

if and only if

\frac{\partial S (E)}{\partial E_{1}} = \dots = \frac{\partial S (E)}{\partial E_{q}}

. Furthermore, the set of nonnegative equilibrium states of Equation (6) is given by

E_{0} ≜ \{E \in {\bar{R}}_{+}^{q} : \frac{\partial S (E)}{\partial E_{1}} = \dots = \frac{\partial S (E)}{\partial E_{q}}\}

.

Proof. If

\frac{\partial S (E)}{\partial E_{i}} = \frac{\partial S (E)}{\partial E_{j}}

, then

ϕ_{i j} (E) = 0

for all

i, j = 1, \dots, q

, which implies that

\sum_{j = 1}^{q} ϕ_{i j} (E) = 0

for all

i = 1, \dots, q

. Conversely, assume that

\sum_{j = 1}^{q} ϕ_{i j} (E) = 0

for all

i = 1, \dots, q

, and, since

S

is an entropy function of

G

, it follows that

\begin{matrix} 0 & = & \sum_{i = 1}^{q} \sum_{j = 1}^{q} \frac{\partial S (E)}{\partial E_{i}} ϕ_{i j} (E) \\ & = & \sum_{i = 1}^{q - 1} \sum_{j = i + 1}^{q} (\frac{\partial S (E)}{\partial E_{i}} - \frac{\partial S (E)}{\partial E_{j}}) ϕ_{i j} (E) \\ & \geq & 0 \end{matrix}

where we have used the fact that

ϕ_{i j} (E) = - ϕ_{j i} (E)

for all

i, j = 1, \dots, q

. Hence,

(\frac{\partial S (E)}{\partial E_{i}} - \frac{\partial S (E)}{\partial E_{j}}) ϕ_{i j} (E) = 0

for all

i, j = 1, \dots, q

. Now, the result follows from the fact that rank

C = q - 1

. □

Theorem 3.1 Consider the isolated (i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

) interconnected dynamical system

G

with the power balance Equation (6). Assume that rank

C = q - 1

and there exists an entropy function

S : {\bar{R}}_{+}^{q} \to R

of

G

. Then the isolated system

G

is globally semistable with respect to

{\bar{R}}_{+}^{q}

.

Proof. Since

w (\cdot)

is essentially nonnegative, it follows from Proposition 2.1 that

E (t) \in {\bar{R}}_{+}^{q}

,

t \geq t_{0}

, for all

E_{0} \in {\bar{R}}_{+}^{q}

. Furthermore, note that since

e^{T} w (E) = 0

,

E \in {\bar{R}}_{+}^{q}

, it follows that

e^{T} \dot{E} (t) = 0

,

t \geq t_{0}

. In this case,

e^{T} E (t) = e^{T} E_{0}

,

t \geq t_{0}

, which implies that

E (t)

,

t \geq t_{0}

, is bounded for all

E_{0} \in {\bar{R}}_{+}^{q}

. Now, it follows from Equation (8) that

S (E (t))

,

t \geq t_{0}

, is a nondecreasing function of time, and hence, by the Krasovskii–LaSalle theorem [7],

E (t) \to R ≜ \{E \in {\bar{R}}_{+}^{q} : \dot{S} (E) = 0\}

as

t \to \infty

. Next, it follows from Equation (8), Definition 3.1, and the fact that rank

C = q - 1

, that

R = \{E \in {\bar{R}}_{+}^{q} : \frac{\partial S (E)}{\partial E_{1}} = \dots = \frac{\partial S (E)}{\partial E_{q}}\} = E_{0}

.

Now, let

E_{e} \in E_{0}

and consider the continuously differentiable function

V : R^{q} \to R

defined by

V (E) ≜ S (E_{e}) - S (E) - λ_{e} (e^{T} E_{e} - e^{T} E)

where

λ_{e} ≜ \frac{\partial S}{\partial E_{1}} (E_{e})

. Next, note that

V (E_{e}) = 0

,

\frac{\partial V}{\partial E} (E_{e}) = - \frac{\partial S}{\partial E} (E_{e}) + λ_{e} e^{T} = 0

, and, since

S (\cdot)

is a strictly concave function,

\frac{\partial^{2} V}{\partial E^{2}} (E_{e}) = - \frac{\partial^{2} S}{\partial E^{2}} (E_{e}) > 0

, which implies that

V (\cdot)

admits a local minimum at

E_{e}

. Thus,

V (E_{e}) = 0

, there exists δ > 0 such that

V (E) > 0

,

E \in B_{δ} (E_{e}) \setminus \{E_{e}\}

, and

\dot{V} (E) = - \dot{S} (E) \leq 0

for all

E \in B_{δ} (E_{e}) \setminus \{E_{e}\}

, which shows that

V (\cdot)

is a Lyapunov function for

G

and

E_{e}

is a Lyapunov stable equilibrium of

G

. Finally, since, for every

E_{0} \in {\bar{R}}_{+}^{q}

,

E (t) \to E_{0}

as

t \to \infty

and every equilibrium point of

G

is Lyapunov stable, it follows from Proposition 2.2 that

G

is globally semistable with respect to

{\bar{R}}_{+}^{q}

. □
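The convergence asserted by Theorem 3.1 can be observed in a small simulation. The sketch below assumes hypothetical heat capacities c_i, the entropy function S(E) = Σ c_i log E_i (defined here only for positive energies), and the flow law ϕ_ij(E) = E_j/c_j - E_i/c_i, so that the quantity E_i/c_i plays the role of the reciprocal of ∂S/∂E_i. These choices, the forward Euler scheme, and all names are illustrative assumptions rather than the model analyzed in the proof.

```python
import math
from typing import List


def temperatures(E: List[float], c: List[float]) -> List[float]:
    """T_i = (dS/dE_i)^(-1) = E_i / c_i for the assumed entropy function."""
    return [E_i / c_i for E_i, c_i in zip(E, c)]


def entropy(E: List[float], c: List[float]) -> float:
    """Assumed entropy S(E) = sum_i c_i * log(E_i), valid for positive energies."""
    return sum(c_i * math.log(E_i) for E_i, c_i in zip(E, c))


def euler_step(E: List[float], c: List[float], dt: float) -> List[float]:
    """One forward Euler step of the isolated system with phi_ij = T_j - T_i."""
    T = temperatures(E, c)
    q = len(E)
    return [E[i] + dt * sum(T[j] - T[i] for j in range(q) if j != i)
            for i in range(q)]


def run(E0: List[float], c: List[float], dt: float = 0.01,
        steps: int = 5000) -> List[float]:
    """Integrate from E0 and return the final energy state."""
    E = list(E0)
    for _ in range(steps):
        E = euler_step(E, c, dt)
    return E


c = [1.0, 2.0, 4.0]   # hypothetical heat capacities
E0 = [6.0, 1.0, 0.5]  # initial temperatures 6, 0.5, 0.125
E_final = run(E0, c)
T_final = temperatures(E_final, c)
```

In this example the subsystem energies settle at unequal values while the three temperatures approach the common value e^T E_0 / (c_1 + c_2 + c_3), the total energy is conserved up to rounding, and the entropy of the final state exceeds that of the initial state.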

In classical thermodynamics, the partial derivative of the system entropy with respect to the system energy defines the reciprocal of the system temperature. Thus, for the interconnected dynamical system

G

,

\begin{matrix} T_{i} ≜ {(\frac{\partial S (E)}{\partial E_{i}})}^{- 1}, i = 1, \dots, q \end{matrix}

(9)

represents the temperature of the ith subsystem. Equation (7) is a manifestation of the second law of thermodynamics and implies that if the temperature of the jth subsystem is greater than the temperature of the ith subsystem, then energy (heat) flows from the jth subsystem to the ith subsystem. Furthermore,

\frac{\partial S (E)}{\partial E_{i}} = \frac{\partial S (E)}{\partial E_{j}}

if and only if

ϕ_{i j} (E) = 0

with

C_{(i, j)} = 1

,

i \neq j

,

i, j = 1, \dots, q

, implies that temperature equality is a necessary and sufficient condition for thermal equilibrium. This is a statement of the zeroth law of thermodynamics. As a result, Theorem 3.1 shows that, for a strongly connected system

G

, the subsystem energies converge to the set of equilibrium states where the temperatures of all subsystems are equal. This phenomenon is known as equipartition of temperature [4] and is an emergent behavior in thermodynamic systems. In particular, all the system energy is eventually transferred into heat at a uniform temperature, and hence, all dynamical processes in

G

(system motions) would cease.
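
The temperature definition in Equation (9) and the direction of heat flow can be checked numerically. The sketch below assumes the entropy S(E) = Σ log_e(1 + E_i) and a hypothetical linear flow law ϕ_ij(E) = E_j − E_i; neither choice is prescribed by the text, and the partial derivatives are approximated by central differences.

```python
import math

def entropy(E, c=1.0):
    """Assumed entropy S(E) = sum_i log(c + E_i)."""
    return sum(math.log(c + Ei) for Ei in E)

def temperatures(S, E, h=1e-6):
    """T_i = (dS/dE_i)^(-1), Equation (9), using central differences."""
    temps = []
    for i in range(len(E)):
        up = list(E)
        up[i] += h
        down = list(E)
        down[i] -= h
        dS_dEi = (S(up) - S(down)) / (2.0 * h)
        temps.append(1.0 / dS_dEi)
    return temps

def heat_flow(E, i, j):
    """Hypothetical flow law phi_ij(E) = E_j - E_i (net flow from j into i)."""
    return E[j] - E[i]

E = [4.0, 1.0, 2.5]
T = temperatures(entropy, E)
# Sign check: the net flow into subsystem i from subsystem j is never
# directed from the colder subsystem to the hotter one.
consistent = all(
    (T[j] - T[i]) * heat_flow(E, i, j) >= 0.0
    for i in range(len(E))
    for j in range(len(E))
    if i != j
)
print(T, consistent)
```

With these assumptions the computed temperatures agree with c + E_i, and the flow between any pair of subsystems vanishes only when their temperatures coincide.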

The following result presents a sufficient condition for energy equipartition of the system, that is, a state in which the energies of all subsystems are equal. This state of energy equipartition is uniquely determined by the initial energy in the system.

Theorem 3.2 Consider the isolated (i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

) interconnected dynamical system

G

with the power balance Equation (6). Assume that rank

C = q - 1

and there exists a continuously differentiable, strictly concave function

f : {\bar{R}}_{+} \to R

such that the entropy function

S : {\bar{R}}_{+}^{q} \to R

of

G

is given by

S (E) = \sum_{i = 1}^{q} f (E_{i})

. Then, the set of nonnegative equilibrium states of Equation (6) is given by

E_{0} = {α e : α \geq 0}

and

G

is semistable with respect to

{\bar{R}}_{+}^{q}

. Furthermore,

E (t) \to \frac{1}{q} e e^{T} E (t_{0})

as

t \to \infty

and

\frac{1}{q} e e^{T} E (t_{0})

is a semistable equilibrium state of

G

.

Proof. First, note that since

f (\cdot)

is a continuously differentiable, strictly concave function, it follows that

(\frac{d f}{d E_{i}} - \frac{d f}{d E_{j}}) (E_{i} - E_{j}) \leq 0, E \in {\bar{R}}_{+}^{q}, i, j = 1, \dots, q

which implies that Equation (7) is equivalent to

(E_{i} - E_{j}) ϕ_{i j} (E) \leq 0, E \in {\bar{R}}_{+}^{q}, i \neq j, i, j = 1, \dots, q

and

E_{i} = E_{j}

if and only if

ϕ_{i j} (E) = 0

with

C_{(i, j)} = 1

,

i \neq j

,

i, j = 1, \dots, q

. Hence,

- \frac{1}{2} E^{T} E

is an entropy function of

G

. Next, with

S (E) = - \frac{1}{2} E^{T} E

, it follows from Proposition 3.1 that

E_{0} = {α e \in {\bar{R}}_{+}^{q}, α \geq 0}

. Now, it follows from Theorem 3.1 that

G

is globally semistable with respect to

{\bar{R}}_{+}^{q}

. Finally, since

e^{T} E (t) = e^{T} E (t_{0})

and

E (t) \to M

as

t \to \infty

, it follows that

E (t) \to \frac{1}{q} e e^{T} E (t_{0})

as

t \to \infty

. Hence, with

α = \frac{1}{q} e^{T} E (t_{0})

,

α e = \frac{1}{q} e e^{T} E (t_{0})

is a semistable equilibrium state of Equation (6). □

If

f (E_{i}) = {log}_{e} (c + E_{i})

, where

c > 0

, so that

S (E) = \sum_{i = 1}^{q} {log}_{e} (c + E_{i})

, then it follows from Theorem 3.2 that

E_{0} = {α e : α \geq 0}

and the isolated (i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

) interconnected dynamical system

G

with the power balance Equation (6) is semistable. In this case, the absolute temperature of the ith compartment is given by

c + E_{i}

. Similarly, if

S (E) = - \frac{1}{2} E^{T} E

, then it follows from Theorem 3.2 that

E_{0} = {α e : α \geq 0}

and the isolated (i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

) interconnected dynamical system

G

with the power balance Equation (6) is semistable. In both cases,

E (t) \to \frac{1}{q} e e^{T} E (t_{0})

as

t \to \infty

. This shows that the steady-state energy of the isolated interconnected dynamical system

G

is given by

\frac{1}{q} e e^{T} E (t_{0}) = \frac{1}{q} \sum_{i = 1}^{q} E_{i} (t_{0}) e

, and hence is uniformly distributed over all subsystems of

G

. This phenomenon is known as energy equipartition [4]. The aforementioned forms of

S (E)

were extensively discussed in the recent book [4] where

S (E) = \sum_{i = 1}^{q} {log}_{e} (c + E_{i})

and

- S (E) = \frac{1}{2} E^{T} E

are referred to, respectively, as the entropy and the ectropy functions of the interconnected dynamical system

G

.
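
Energy equipartition can be illustrated with a small simulation. The sketch below assumes an isolated, fully connected system with the hypothetical flow law ϕ_ij(E) = E_j − E_i and integrates the power balance with forward Euler; the step size, horizon, and initial energies are arbitrary illustrative values.

```python
import math

def simulate(E0, dt=0.01, steps=2000):
    """Forward-Euler integration of dE_i/dt = sum_{j != i} (E_j - E_i)."""
    E = list(E0)
    q = len(E)
    trajectory = [list(E)]
    for _ in range(steps):
        total = sum(E)
        # sum_{j != i} (E_j - E_i) equals total - q * E_i
        E = [Ei + dt * (total - q * Ei) for Ei in E]
        trajectory.append(list(E))
    return trajectory

def entropy(E, c=1.0):
    """Entropy form S(E) = sum_i log(c + E_i)."""
    return sum(math.log(c + Ei) for Ei in E)

def ectropy(E):
    """Ectropy form (1/2) E^T E."""
    return 0.5 * sum(Ei * Ei for Ei in E)

E0 = [4.0, 1.0, 0.0, 3.0]
trajectory = simulate(E0)
E_final = trajectory[-1]
equipartition = sum(E0) / len(E0)  # each entry of (1/q) e e^T E(t0)
print(E_final, equipartition)
```

In this example the total energy is conserved, every subsystem energy approaches the average of the initial energies, the entropy does not decrease along the computed trajectory, and the ectropy does not increase.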

4. Work Energy, Gibbs Free Energy, Helmholtz Free Energy, Enthalpy, and Entropy

In this section, we augment our thermodynamic energy flow model

G

with an additional (deformation) state representing subsystem volumes in order to introduce the notion of work into our thermodynamically consistent state space energy flow model. Specifically, we assume that each subsystem can perform (positive) work on the environment and the environment can perform (negative) work on the subsystems. The rate of work done by the ith subsystem on the environment is denoted by

d_{w i} : {\bar{R}}_{+}^{q} \times R_{+}^{q} \to {\bar{R}}_{+}

,

i = 1, \dots, q

, the rate of work done by the environment on the ith subsystem is denoted by

S_{w i} : [0, \infty) \to {\bar{R}}_{+}

,

i = 1, \dots, q

, and the volume of the ith subsystem is denoted by

V_{i} : [0, \infty) \to R_{+}

,

i = 1, \dots, q

. The net work done by each subsystem on the environment satisfies

\begin{matrix} p_{i} (E, V) d V_{i} = (d_{w i} (E, V) - S_{w i} (t)) d t \end{matrix}

(10)

where

p_{i} (E, V)

,

i = 1, \dots, q

, denotes the pressure in the ith subsystem and

V ≜ {[V_{1}, \dots, V_{q}]}^{T}

.

Furthermore, in the presence of work, the energy balance Equation (5) for each subsystem can be rewritten as

\begin{matrix} d E_{i} = w_{i} (E, V) d t - (d_{w i} (E, V) - S_{w i} (t)) d t - σ_{i i} (E, V) d t + S_{i} (t) d t \end{matrix}

(11)

where

w_{i} (E, V) ≜ \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (E, V)

,

ϕ_{i j} : {\bar{R}}_{+}^{q} \times R_{+}^{q} \to R

,

i \neq j, i, j = 1, \dots, q

, denotes the net instantaneous rate of energy (heat) flow from the jth subsystem to the ith subsystem,

σ_{i i} : {\bar{R}}_{+}^{q} \times R_{+}^{q} \to {\bar{R}}_{+}

,

i = 1, \dots, q

, denotes the instantaneous rate of energy dissipation from the ith subsystem to the environment, and, as in Section 3,

S_{i} : [0, \infty) \to R

,

i = 1, \dots, q

, denotes the external power supplied to (or extracted from) the ith subsystem. It follows from Equations (10) and (11) that positive work done by a subsystem on the environment leads to a decrease in the internal energy of the subsystem and an increase in the subsystem volume, which is consistent with the first law of thermodynamics.

The definition of entropy for

G

in the presence of work remains the same as in Definition 3.1 with

S (E)

replaced by

S (E, V)

and with all other conditions in the definition holding for every

V > > 0

. Next, consider the ith subsystem of

G

and assume that

E_{j}

and

V_{j}

,

j \neq i

,

i = 1, \dots, q

, are constant. In this case, note that

\begin{matrix} \frac{d S}{d t} & = & \frac{\partial S}{\partial E_{i}} \frac{d E_{i}}{d t} + \frac{\partial S}{\partial V_{i}} \frac{d V_{i}}{d t} \end{matrix}

(12)

and

\begin{matrix} p_{i} (E, V) = {(\frac{\partial S}{\partial E_{i}})}^{- 1} (\frac{\partial S}{\partial V_{i}}), i = 1, \dots, q \end{matrix}

(13)

It follows from Equations (10) and (11) that, in the presence of work energy, the power balance Equation (6) takes the new form involving energy and deformation states

\begin{matrix} \dot{E} (t) = w (E (t), V (t)) - d_{w} (E (t), V (t)) + S_{w} (t) - d (E (t), V (t)) + S (t), \\ E (t_{0}) = E_{0}, t \geq t_{0}, \end{matrix}

(14)

\begin{matrix} \dot{V} (t) & = & D (E (t), V (t)) (d_{w} (E (t), V (t)) - S_{w} (t)), V (t_{0}) = V_{0} \end{matrix}

(15)

where

w (E, V) ≜ {[w_{1} (E, V), \dots, w_{q} (E, V)]}^{T}

,

d_{w} (E, V) ≜ {[d_{w 1} (E, V), \dots, d_{w q} (E, V)]}^{T}

,

S_{w} (t) ≜ {[S_{w 1} (t), \dots, S_{w q} (t)]}^{T}

,

d (E, V) ≜ {[σ_{11} (E, V), \dots, σ_{q q} (E, V)]}^{T}

,

S (t) ≜ {[S_{1} (t), \dots, S_{q} (t)]}^{T}

, and

\begin{matrix} D (E, V) ≜ diag [(\frac{\partial S}{\partial E_{1}}) {(\frac{\partial S}{\partial V_{1}})}^{- 1}, \dots, (\frac{\partial S}{\partial E_{q}}) {(\frac{\partial S}{\partial V_{q}})}^{- 1}] \end{matrix}

(16)

Note that

\begin{matrix} (\frac{\partial S (E, V)}{\partial V}) D (E, V) = \frac{\partial S (E, V)}{\partial E} \end{matrix}

(17)

The power balance and deformation Equations (14) and (15) represent a statement of the first law of thermodynamics. To see this, define the work L done by the interconnected dynamical system

G

over the time interval

[t_{1}, t_{2}]

by

\begin{matrix} L ≜ \int_{t_{1}}^{t_{2}} e^{T} [d_{w} (E (t), V (t)) - S_{w} (t)] d t \end{matrix}

(18)

where

{[E^{T} (t), V^{T} (t)]}^{T}

,

t \geq t_{0}

, is the solution to Equations (14) and (15). Now, premultiplying Equation (14) by

e^{T}

and using the fact that

e^{T} w (E, V) = 0

, it follows that

\begin{matrix} Δ U = - L + Q \end{matrix}

(19)

where

Δ U = U (t_{2}) - U (t_{1}) ≜ e^{T} E (t_{2}) - e^{T} E (t_{1})

denotes the variation in the total energy of the interconnected system

G

over the time interval

[t_{1}, t_{2}]

and

\begin{matrix} Q ≜ \int_{t_{1}}^{t_{2}} e^{T} [S (t) - d (E (t), V (t))] d t \end{matrix}

(20)

denotes the net energy received by

G

in forms other than work.

This is a statement of the first law of thermodynamics for the interconnected dynamical system

G

and gives a precise formulation of the equivalence between work and heat. This establishes that heat and mechanical work are two different aspects of energy. Finally, note that Equation (15) is consistent with the classical thermodynamic equation for the rate of work done by the system

G

on the environment. To see this, note that Equation (15) can be equivalently written as

\begin{matrix} d L = e^{T} D^{- 1} (E, V) d V \end{matrix}

(21)

which, for a single subsystem with volume V and pressure p, has the classical form

\begin{matrix} d L = p d V \end{matrix}

(22)

It follows from Definition 3.1 and Equations (14)–(17) that the time derivative of the entropy function satisfies

\begin{matrix} \dot{S} (E, V) & = \frac{\partial S (E, V)}{\partial E} \dot{E} + \frac{\partial S (E, V)}{\partial V} \dot{V} \\ = \frac{\partial S (E, V)}{\partial E} w (E, V) - \frac{\partial S (E, V)}{\partial E} (d_{w} (E, V) - S_{w} (t)) \\ - \frac{\partial S (E, V)}{\partial E} (d (E, V) - S (t)) + \frac{\partial S (E, V)}{\partial V} D (E, V) (d_{w} (E, V) - S_{w} (t)) \\ = \sum_{i = 1}^{q} \frac{\partial S (E, V)}{\partial E_{i}} \sum_{j = 1, j \neq i}^{q} ϕ_{i j} (E, V) + \sum_{i = 1}^{q} \frac{\partial S (E, V)}{\partial E_{i}} (S_{i} (t) - d_{i} (E, V)) \\ = \sum_{i = 1}^{q} \sum_{j = i + 1}^{q} (\frac{\partial S (E, V)}{\partial E_{i}} - \frac{\partial S (E, V)}{\partial E_{j}}) ϕ_{i j} (E, V) \\ + \sum_{i = 1}^{q} \frac{\partial S (E, V)}{\partial E_{i}} (S_{i} (t) - d_{i} (E, V)) \\ \geq \sum_{i = 1}^{q} \frac{\partial S (E, V)}{\partial E_{i}} (S_{i} (t) - d_{i} (E, V)), (E, V) \in {\bar{R}}_{+}^{q} \times R_{+}^{q} \end{matrix}

(23)

Noting that

d Q_{i} ≜ [S_{i} (t) - σ_{i i} (E, V)] d t

,

i = 1, \dots, q

, is the infinitesimal amount of the net heat received or dissipated by the ith subsystem of

G

over the infinitesimal time interval

d t

, it follows from Equation (23) that

\begin{matrix} d S (E, V) \geq \sum_{i = 1}^{q} \frac{d Q_{i}}{T_{i}} \end{matrix}

(24)

Inequality (24) is the classical Clausius inequality for the variation of entropy during an infinitesimal irreversible transformation.

Note that for an adiabatically isolated interconnected dynamical system (i.e., no heat exchange with the environment), Equation (23) yields the universal inequality

\begin{matrix} S (E (t_{2}), V (t_{2})) \geq S (E (t_{1}), V (t_{1})), t_{2} \geq t_{1} \end{matrix}

(25)

which implies that, for any dynamical change in an adiabatically isolated interconnected system

G

, the entropy of the final system state can never be less than the entropy of the initial system state. In addition, in the case where

(E (t), V (t)) \notin M_{e}

,

t \geq t_{0}

, where

M_{e} ≜ {(E, V) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} : E = α e, α \geq 0, V \in R_{+}^{q}}

, it follows from Definition 3.1 and Equation (23) that Inequality (25) is satisfied as a strict inequality for all

(E, V) \in ({\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}) \ M_{e}

. Hence, it follows from Theorem 2.15 of [4] that the adiabatically isolated interconnected system

G

does not exhibit Poincaré recurrence in

({\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}) \ M_{e}

.
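
The first law in Equation (19) and the adiabatic entropy inequality (25) can be illustrated for a single subsystem. The sketch below assumes q = 1, no heat exchange, a constant net work rate d_w − S_w, and the hypothetical entropy S(E, V) = log_e(E) + log_e(V), for which D(E, V) = V/E; all numerical values are illustrative.

```python
import math

def simulate_adiabatic_work(E0, V0, net_work_rate, dt=1e-4, steps=10000):
    """Euler integration of Equations (14)-(15) for q = 1 with no heat exchange.

    Assumes S(E, V) = log(E) + log(V), so that D(E, V) = V / E.
    """
    E, V = E0, V0
    work_done = 0.0
    for _ in range(steps):
        dE = -net_work_rate * dt           # Equation (14): dE/dt = -(d_w - S_w)
        dV = (V / E) * net_work_rate * dt  # Equation (15): dV/dt = D (d_w - S_w)
        work_done += net_work_rate * dt    # integrand of Equation (18)
        E, V = E + dE, V + dV
    return E, V, work_done

def entropy(E, V):
    """Assumed single-subsystem entropy."""
    return math.log(E) + math.log(V)

E0, V0 = 2.0, 1.0
E1, V1, L = simulate_adiabatic_work(E0, V0, net_work_rate=0.5)
delta_U = E1 - E0
print(delta_U, L, entropy(E1, V1) - entropy(E0, V0))
```

Here Q = 0, so the change in energy equals −L, the volume increases as the subsystem does work, and the entropy stays constant up to discretization error, which corresponds to equality in Inequality (25).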

Next, we define the Gibbs free energy, the Helmholtz free energy, and the enthalpy functions for the interconnected dynamical system

G

. For this exposition, we assume that the entropy of

G

is a sum of individual entropies of subsystems of

G

, that is,

S (E, V) = \sum_{i = 1}^{q} S_{i} (E_{i}, V_{i})

,

(E, V) \in {\bar{R}}_{+}^{q} \times R_{+}^{q}

. In this case, the Gibbs free energy of

G

is defined by

\begin{matrix} G (E, V) & ≜ & e^{T} E - \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} S_{i} (E_{i}, V_{i}) + \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} (\frac{\partial S (E, V)}{\partial V_{i}}) V_{i} \\ (E, V) \in {\bar{R}}_{+}^{q} \times R_{+}^{q} \end{matrix}

(26)

the Helmholtz free energy of

G

is defined by

\begin{matrix} F (E, V) & ≜ & e^{T} E - \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} S_{i} (E_{i}, V_{i}), (E, V) \in {\bar{R}}_{+}^{q} \times R_{+}^{q} \end{matrix}

(27)

and the enthalpy of

G

is defined by

\begin{matrix} H (E, V) & ≜ & e^{T} E + \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} (\frac{\partial S (E, V)}{\partial V_{i}}) V_{i}, (E, V) \in {\bar{R}}_{+}^{q} \times R_{+}^{q} \end{matrix}

(28)

Note that the above definitions for the Gibbs free energy, Helmholtz free energy, and enthalpy are consistent with the classical thermodynamic definitions given by

G (E, V) = U + p V - T S

,

F (E, V) = U - T S

, and

H (E, V) = U + p V

, respectively. Furthermore, note that if the interconnected system

G

is isothermal and isobaric, that is, the temperatures of subsystems of

G

are equal and remain constant with

\begin{matrix} {(\frac{\partial S (E, V)}{\partial E_{1}})}^{- 1} = \dots = {(\frac{\partial S (E, V)}{\partial E_{q}})}^{- 1} = T > 0 \end{matrix}

(29)

and the pressure

p_{i} (E, V)

in each subsystem of

G

remains constant, respectively, then any transformation in

G

is reversible.
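
Definitions (26)–(28) can be evaluated directly once subsystem entropies are specified. The sketch below assumes the hypothetical subsystem entropy S_i(E_i, V_i) = log_e(E_i) + log_e(V_i) and approximates the partial derivatives by central differences; the helper names and the sample state are illustrative.

```python
import math

def subsystem_entropy(Ei, Vi):
    """Assumed S_i(E_i, V_i) = log(E_i) + log(V_i)."""
    return math.log(Ei) + math.log(Vi)

def partials(Ei, Vi, h=1e-6):
    """Central-difference estimates of dS_i/dE_i and dS_i/dV_i."""
    dS_dE = (subsystem_entropy(Ei + h, Vi) - subsystem_entropy(Ei - h, Vi)) / (2 * h)
    dS_dV = (subsystem_entropy(Ei, Vi + h) - subsystem_entropy(Ei, Vi - h)) / (2 * h)
    return dS_dE, dS_dV

def potentials(E, V):
    """Return (G, F, H) as defined in Equations (26)-(28)."""
    U = sum(E)
    TS = 0.0
    pV = 0.0
    for Ei, Vi in zip(E, V):
        dS_dE, dS_dV = partials(Ei, Vi)
        T_i = 1.0 / dS_dE   # temperature, Equation (9)
        p_i = T_i * dS_dV   # pressure, Equation (13)
        TS += T_i * subsystem_entropy(Ei, Vi)
        pV += p_i * Vi
    return U + pV - TS, U - TS, U + pV

E = [2.0, 3.0]
V = [1.0, 0.5]
G, F, H = potentials(E, V)
print(G, F, H)
```

For this entropy T_i = E_i and p_i = E_i / V_i, so H = 2 e^T E, and the three quantities satisfy G = F + H − U, in line with the classical relations quoted above.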

The time derivative of

G (E, V)

along the trajectories of Equations (14) and (15) is given by

\begin{matrix} \dot{G} (E, V) & = & e^{T} \dot{E} - \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} [\frac{\partial S (E, V)}{\partial E_{i}} {\dot{E}}_{i} + \frac{\partial S (E, V)}{\partial V_{i}} {\dot{V}}_{i}] \\ + \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} (\frac{\partial S (E, V)}{\partial V_{i}}) {\dot{V}}_{i} \\ = & 0 \end{matrix}

(30)

which is consistent with classical thermodynamics in the absence of chemical reactions.

For an isothermal interconnected dynamical system

G

, the time derivative of

F (E, V)

along the trajectories of Equations (14) and (15) is given by

\begin{matrix} \dot{F} (E, V) & = & e^{T} \dot{E} - \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} [\frac{\partial S (E, V)}{\partial E_{i}} {\dot{E}}_{i} + \frac{\partial S (E, V)}{\partial V_{i}} {\dot{V}}_{i}] \\ = & - \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} (\frac{\partial S (E, V)}{\partial V_{i}}) {\dot{V}}_{i} \\ = & - \sum_{i = 1}^{q} (d_{w i} (E, V) - S_{w i} (t)) \\ = & - L \end{matrix}

(31)

where L is the net amount of work done by the subsystems of

G

on the environment. Furthermore, note that if, in addition, the interconnected system

G

is isochoric, that is, the volumes of each of the subsystems of

G

remain constant, then

\dot{F} (E, V) = 0

. As we see in the next section, in the presence of chemical reactions the interconnected system

G

evolves such that the Helmholtz free energy is minimized.

Finally, for the isolated (

S (t) \equiv 0

and

d (E, V) \equiv 0

) interconnected dynamical system

G

, the time derivative of

H (E, V)

along the trajectories of Equations (14) and (15) is given by

\begin{matrix} \dot{H} (E, V) & = & e^{T} \dot{E} + \sum_{i = 1}^{q} {(\frac{\partial S (E, V)}{\partial E_{i}})}^{- 1} (\frac{\partial S (E, V)}{\partial V_{i}}) {\dot{V}}_{i} \\ = & e^{T} \dot{E} + \sum_{i = 1}^{q} (d_{w i} (E, V) - S_{w i} (t)) \\ = & e^{T} w (E, V) \\ = & 0 \end{matrix}

(32)

5. Chemical Equilibria, Entropy Production, Chemical Potential, and Chemical Thermodynamics

In its most general form, thermodynamics can also involve reacting mixtures and combustion. When a chemical reaction occurs, the bonds within molecules of the reactant are broken, and atoms and electrons rearrange to form products. The thermodynamic analysis of reactive systems can be addressed as an extension of the compartmental thermodynamic model described in Section 3 and Section 4. Specifically, in this case the compartments would qualitatively represent different quantities in the same space, and the intercompartmental flows would represent transformation rates in addition to transfer rates. In particular, the compartments would additionally represent quantities of different chemical substances contained within the compartment, and the compartmental flows would additionally characterize transformation rates of reactants into products. In this case, an additional mass balance is included for addressing conservation of energy as well as conservation of mass. This additional mass conservation equation would involve the law of mass-action enforcing proportionality between a particular reaction rate and the concentrations of the reactants, and the law of superposition of elementary reactions ensuring that the resultant rate for a particular species is the sum of the elementary reaction rates for the species.

In this section, we consider the interconnected dynamical system

G

where each subsystem represents a substance or species that can exchange energy with other substances as well as undergo chemical reactions with other substances forming products. Thus, the reactants and products of chemical reactions represent subsystems of

G

with the mechanisms of heat exchange between subsystems remaining the same as delineated in Section 3. Here, for simplicity of exposition, we do not consider work done by the subsystem on the environment or work done by the environment on the system. This extension can be easily addressed using the formulation in Section 4.

To develop a dynamical systems framework for thermodynamics with chemical reaction networks, let q be the total number of species (i.e., reactants and products), that is, the number of subsystems in

G

, and let

X_{j}

,

j = 1, \dots, q

, denote the jth species. Consider a single chemical reaction described by

\begin{matrix} \sum_{j = 1}^{q} A_{j} X_{j} \overset{k}{⟶} \sum_{j = 1}^{q} B_{j} X_{j} \end{matrix}

(33)

where

A_{j}

,

B_{j}

,

j = 1, \dots, q

, are the stoichiometric coefficients and k denotes the reaction rate. Note that the values of

A_{j}

corresponding to the products and the values of

B_{j}

corresponding to the reactants are zero. For example, for the familiar reaction

\begin{matrix} 2 H_{2} + O_{2} \overset{k}{⟶} 2 H_{2} O \end{matrix}

(34)

X_{1}

,

X_{2}

, and

X_{3}

denote the species

H_{2}

,

O_{2}

, and

H_{2} O

, respectively, and

A_{1} = 2

,

A_{2} = 1

,

A_{3} = 0

,

B_{1} = 0

,

B_{2} = 0

, and

B_{3} = 2

.

In general, for a reaction network consisting of

r \geq 1

reactions, the ith reaction is written as

\begin{matrix} \sum_{j = 1}^{q} A_{i j} X_{j} \overset{k_{i}}{⟶} \sum_{j = 1}^{q} B_{i j} X_{j}, i = 1, \dots, r \end{matrix}

(35)

where, for

i = 1, \dots, r

,

k_{i} > 0

is the reaction rate of the ith reaction,

\sum_{j = 1}^{q} A_{i j} X_{j}

is the reactant of the ith reaction, and

\sum_{j = 1}^{q} B_{i j} X_{j}

is the product of the ith reaction. Each stoichiometric coefficient

A_{i j}

and

B_{i j}

is a nonnegative integer. Note that each reaction in the reaction network given by Equation (35) is represented as being irreversible. Irreversibility here refers to the fact that part of the chemical reaction involves generation of products from the original reactants. Reversible chemical reactions that involve generation of products from the reactants and vice versa can be modeled as two irreversible reactions, one involving generation of products from the reactants and the other involving generation of the original reactants from the products. Hence, reversible reactions can be modeled by including the reverse reaction as a separate reaction. The reaction network given by Equation (35) can be written compactly in matrix-vector form as

\begin{matrix} A X \overset{k}{⟶} B X \end{matrix}

(36)

where

X = {[X_{1}, \dots, X_{q}]}^{T}

is a column vector of species,

k = {[k_{1}, \dots, k_{r}]}^{T} \in R_{+}^{r}

is a positive vector of reaction rates, and

A \in R^{r \times q}

and

B \in R^{r \times q}

are nonnegative matrices such that

A_{(i, j)} = A_{i j}

and

B_{(i, j)} = B_{i j}

,

i = 1, \dots, r

,

j = 1, \dots, q

.

Let

n_{j} : [0, \infty) \to {\bar{R}}_{+}

,

j = 1, \dots, q

, denote the mole number of the jth species and define

n ≜ {[n_{1}, \dots, n_{q}]}^{T}

. Invoking the law of mass-action [15], which states that, for an elementary reaction, that is, a reaction in which all of the stoichiometric coefficients of the reactants are one, the rate of reaction is proportional to the product of the concentrations of the reactants, the species quantities change according to the dynamics [11,16]

\begin{matrix} \dot{n} (t) = {(B - A)}^{T} K n^{A} (t), n (t_{0}) = n_{0}, t \geq t_{0} \end{matrix}

(37)

where

K ≜ diag [k_{1}, \dots, k_{r}] \in P^{r}

and

\begin{matrix} n^{A} ≜ [\begin{matrix} \prod_{j = 1}^{q} n_{j}^{A_{1 j}} \\ ⋮ \\ \prod_{j = 1}^{q} n_{j}^{A_{r j}} \end{matrix}] = [\begin{matrix} n_{1}^{A_{11}} \dots n_{q}^{A_{1 q}} \\ ⋮ \\ n_{1}^{A_{r 1}} \dots n_{q}^{A_{r q}} \end{matrix}] \in {\bar{R}}_{+}^{r} \end{matrix}

(38)

For details regarding the law of mass-action and Equation (37), see [11,15,16,17]. Furthermore, let

M_{j} > 0

,

j = 1, \dots, q

, denote the molar mass (i.e., the mass of one mole of a substance) of the jth species, let

m_{j} : [0, \infty) \to {\bar{R}}_{+}

,

j = 1, \dots, q

, denote the mass of the jth species so that

m_{j} (t) = M_{j} n_{j} (t)

,

t \geq t_{0}

,

j = 1, \dots, q

, and let

m ≜ {[m_{1}, \dots, m_{q}]}^{T}

. Then, using the transformation

m (t) = M n (t)

, where

M ≜ diag [M_{1}, \dots, M_{q}] \in P^{q}

, Equation (37) can be rewritten as the mass balance

\begin{matrix} \dot{m} (t) = M {(B - A)}^{T} \tilde{K} m^{A} (t), m (t_{0}) = m_{0}, t \geq t_{0} \end{matrix}

(39)

where

\tilde{K} ≜ diag [\frac{k_{1}}{\prod_{j = 1}^{q} M_{j}^{A_{1 j}}}, \dots, \frac{k_{r}}{\prod_{j = 1}^{q} M_{j}^{A_{r j}}}] \in P^{r}

.

In the absence of nuclear reactions, the total mass of the species during each reaction in Equation (36) is conserved. Specifically, consider the ith reaction in Equation (36) given by Equation (35) where the mass of the reactants is

\sum_{j = 1}^{q} A_{i j} M_{j}

and the mass of the products is

\sum_{j = 1}^{q} B_{i j} M_{j}

. Hence, conservation of mass in the ith reaction is characterized as

\begin{matrix} \sum_{j = 1}^{q} (B_{i j} - A_{i j}) M_{j} = 0, i = 1, \dots, r \end{matrix}

(40)

or, in general for Equation (36), as

\begin{matrix} e^{T} M {(B - A)}^{T} = 0 \end{matrix}

(41)

Note that it follows from Equations (39) and (41) that

e^{T} \dot{m} (t) \equiv 0

.
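
The mass-action dynamics (37) and the mass conservation condition (40) can be checked on the reaction 2H2 + O2 → 2H2O from Equation (34). In the sketch below, the rate constant, the mole numbers, and the rounded molar masses are illustrative assumptions, and the helper name mass_action_rates is hypothetical.

```python
def mass_action_rates(A, B, k, n):
    """Right-hand side of Equation (37): (B - A)^T K n^A."""
    r, q = len(A), len(n)
    # n^A has one monomial per reaction: prod_j n_j ** A_ij
    monomials = []
    for i in range(r):
        term = 1.0
        for j in range(q):
            term *= n[j] ** A[i][j]
        monomials.append(term)
    return [
        sum((B[i][j] - A[i][j]) * k[i] * monomials[i] for i in range(r))
        for j in range(q)
    ]

# 2 H2 + O2 -> 2 H2O with species ordered as (H2, O2, H2O)
A = [[2, 1, 0]]
B = [[0, 0, 2]]
k = [0.5]                    # illustrative rate constant
M = [2.016, 31.998, 18.015]  # approximate molar masses in g/mol
n = [2.0, 1.0, 0.0]          # illustrative mole numbers

n_dot = mass_action_rates(A, B, k, n)
m_dot = [M[j] * n_dot[j] for j in range(len(M))]
# Left-hand side of Equation (40) for each reaction
mass_balance = [
    sum((B[i][j] - A[i][j]) * M[j] for j in range(len(M)))
    for i in range(len(A))
]
print(n_dot, sum(m_dot), mass_balance)
```

The mole numbers of H2 and O2 decrease in the ratio 2:1 while H2O is produced, and the total mass rate e^T ṁ vanishes to within rounding, as Equation (41) requires.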

Equation (39) characterizes the change in masses of substances in the interconnected dynamical system

G

due to chemical reactions. In addition to the change of mass due to chemical reactions, each substance can exchange energy with other substances according to the energy flow mechanism described in Section 3; that is, energy flows from substances at a higher temperature to substances at a lower temperature. Furthermore, in the presence of chemical reactions, the exchange of matter affects the change of energy of each substance through the quantity known as the chemical potential.

The notion of the chemical potential was introduced by Gibbs in 1875–1878 [8,9] and goes far beyond the scope of chemistry, affecting virtually every process in nature [18,19,20]. The chemical potential has a strong connection with the second law of thermodynamics in that every process in nature evolves from a state of higher chemical potential towards a state of lower chemical potential. It was postulated by Gibbs [8,9] that the change in energy of a homogeneous substance is proportional to the change in mass of this substance with the coefficient of proportionality given by the chemical potential of the substance.

To elucidate this, assume the jth substance corresponds to the jth compartment and consider the rate of energy change of the jth substance of

G

in the presence of matter exchange. In this case, it follows from Equation (5) and Gibbs’ postulate that the rate of energy change of the jth substance is given by

\begin{matrix} {\dot{E}}_{j} (t) & = & [\sum_{k = 1, k \neq j}^{q} ϕ_{j k} (E (t))] - σ_{j j} (E (t)) + S_{j} (t) + μ_{j} (E (t), m (t)) {\dot{m}}_{j} (t), E_{j} (t_{0}) = E_{j 0}, \\ t \geq t_{0} \end{matrix}

(42)

where

μ_{j} : {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \to R

,

j = 1, \dots, q

, is the chemical potential of the jth substance. It follows from Equation (42) that

μ_{j} (\cdot, \cdot)

is the chemical potential of a unit mass of the jth substance. We assume that if

E_{j} = 0

, then

μ_{j} (E, m) = 0

,

j = 1, \dots, q

, which implies that if the energy of the jth substance is zero, then its chemical potential is also zero.

Next, using Equations (39) and (42), the energy and mass balances for the interconnected dynamical system

G

can be written as

\begin{matrix} \dot{E} (t) & = & w (E (t)) + P (E (t), m (t)) M {(B - A)}^{T} \tilde{K} m^{A} (t) - d (E (t)) + S (t), E (t_{0}) = E_{0}, \\ t \geq t_{0}, \end{matrix}

(43)

\begin{matrix} \dot{m} (t) & = & M {(B - A)}^{T} \tilde{K} m^{A} (t), m (t_{0}) = m_{0} \end{matrix}

(44)

where

P (E, m) ≜ diag [μ_{1} (E, m), \dots, μ_{q} (E, m)] \in R^{q \times q}

and where

w (\cdot)

,

d (\cdot)

, and

S (\cdot)

are defined as in Section 3. It follows from Proposition 1 of [16] that the dynamics of Equation (44) are essentially nonnegative and, since

μ_{j} (E, m) = 0

if

E_{j} = 0

,

j = 1, \dots, q

, it also follows that, for the isolated dynamical system

G

(i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

), the dynamics of Equations (43) and (44) are essentially nonnegative.

Note that, for the ith reaction in the reaction network given by Equation (36), the chemical potentials of the reactants and the products are

\sum_{j = 1}^{q} A_{i j} M_{j} μ_{j} (E, m)

and

\sum_{j = 1}^{q} B_{i j} M_{j} μ_{j} (E, m)

, respectively. Thus,

\begin{matrix} \sum_{j = 1}^{q} B_{i j} M_{j} μ_{j} (E, m) - \sum_{j = 1}^{q} A_{i j} M_{j} μ_{j} (E, m) \leq 0, (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(45)

is a restatement of the principle that a chemical reaction evolves from a state of a greater chemical potential to that of a lower chemical potential, which is consistent with the second law of thermodynamics. The difference between the chemical potential of the reactants and the chemical potential of the products is called affinity [21,22] and is given by

\begin{matrix} ν_{i} (E, m) = \sum_{j = 1}^{q} A_{i j} M_{j} μ_{j} (E, m) - \sum_{j = 1}^{q} B_{i j} M_{j} μ_{j} (E, m) \geq 0, i = 1, \dots, r \end{matrix}

(46)

Affinity is a driving force for chemical reactions and is equal to zero at the state of chemical equilibrium. A nonzero affinity implies that the system is not in equilibrium and that chemical reactions will continue to occur until the system reaches an equilibrium characterized by zero affinity. The next assumption provides a general form for the inequalities (45) and (46).

Assumption 5.1 For the chemical reaction network (36) with the mass balance Equation (44), assume that

μ (E, m) > > 0

for all

E \neq 0

and

\begin{matrix} (B - A) M μ (E, m) \leq \leq 0, (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(47)

or, equivalently,

\begin{matrix} ν (E, m) = (A - B) M μ (E, m) \geq \geq 0, (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(48)

where

μ (E, m) ≜ {[μ_{1} (E, m), \dots, μ_{q} (E, m)]}^{T}

is the vector of chemical potentials of the substances of

G

and

ν (E, m) ≜ {[ν_{1} (E, m), \dots, ν_{r} (E, m)]}^{T}

is the affinity vector for the reaction network given by Equation (36).

Note that equality in Equation (47) or, equivalently, in Equation (48) characterizes the state of chemical equilibrium when the chemical potentials of the products and reactants are equal or, equivalently, when the affinity of each reaction is equal to zero. In this case, no reaction occurs and

\dot{m} (t) = 0

,

t \geq t_{0}

.

Next, we characterize the entropy function for the interconnected dynamical system

G

with the energy and mass balances given by Equations (43) and (44). The definition of entropy for

G

in the presence of chemical reactions remains the same as in Definition 3.1 with

S (E)

replaced by

S (E, m)

and with all other conditions in the definition holding for every

m > > 0

. Consider the jth subsystem of

G

and assume that

E_{k}

and

m_{k}

,

k \neq j

,

k = 1, \dots, q

, are constant. In this case, note that

\begin{matrix} \frac{d S}{d t} & = & \frac{\partial S}{\partial E_{j}} \frac{d E_{j}}{d t} + \frac{\partial S}{\partial m_{j}} \frac{d m_{j}}{d t} \end{matrix}

(49)

and recall that

\begin{matrix} \frac{\partial S}{\partial E} P (E, m) + \frac{\partial S}{\partial m} = 0 \end{matrix}

(50)

Next, it follows from Equation (50) that the time derivative of the entropy function

S (E, m)

along the trajectories of Equations (43) and (44) is given by

\begin{matrix} \dot{S} (E, m) & = \frac{\partial S (E, m)}{\partial E} \dot{E} + \frac{\partial S (E, m)}{\partial m} \dot{m} \\ = \frac{\partial S (E, m)}{\partial E} w (E) + (\frac{\partial S (E, m)}{\partial E} P (E, m) + \frac{\partial S (E, m)}{\partial m}) M {(B - A)}^{T} \tilde{K} m^{A} \\ + \frac{\partial S (E, m)}{\partial E} S (t) - \frac{\partial S (E, m)}{\partial E} d (E) \\ = \frac{\partial S (E, m)}{\partial E} w (E) + \frac{\partial S (E, m)}{\partial E} S (t) - \frac{\partial S (E, m)}{\partial E} d (E) \\ = \sum_{i = 1}^{q} \sum_{j = i + 1}^{q} (\frac{\partial S (E, m)}{\partial E_{i}} - \frac{\partial S (E, m)}{\partial E_{j}}) ϕ_{i j} (E) + \frac{\partial S (E, m)}{\partial E} S (t) - \frac{\partial S (E, m)}{\partial E} d (E), \end{matrix}

\begin{matrix} (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(51)

For the isolated system

G

(i.e.,

S (t) \equiv 0

and

d (E) \equiv 0

), the entropy function of

G

is a nondecreasing function of time and, using identical arguments as in the proof of Theorem 3.1, it can be shown that

(E (t), m (t)) \to R ≜ \{(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} : \frac{\partial S (E, m)}{\partial E_{1}} = \dots = \frac{\partial S (E, m)}{\partial E_{q}}\}

as

t \to \infty

for all

(E_{0}, m_{0}) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}

.

The entropy production in the interconnected system

G

due to chemical reactions is given by

\begin{matrix} d S_{i} (E, m) & = & \frac{\partial S (E, m)}{\partial m} d m \\ = & - \frac{\partial S (E, m)}{\partial E} P (E, m) M {(B - A)}^{T} \tilde{K} m^{A} d t, (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(52)

If the interconnected dynamical system

G

is isothermal, that is, all subsystems of

G

are at the same temperature

\begin{matrix} {(\frac{\partial S (E, m)}{\partial E_{1}})}^{- 1} = \dots = {(\frac{\partial S (E, m)}{\partial E_{q}})}^{- 1} = T \end{matrix}

(53)

where $T > 0$ is the system temperature, then it follows from Assumption 5.1 that

\begin{matrix} d S_{i} (E, m) & = & - \frac{1}{T} e^{T} P (E, m) M {(B - A)}^{T} \tilde{K} m^{A} d t \\ & = & - \frac{1}{T} μ^{T} (E, m) M {(B - A)}^{T} \tilde{K} m^{A} d t \\ & = & \frac{1}{T} ν^{T} (E, m) \tilde{K} m^{A} d t \\ & \geq & 0, \quad (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(54)

Note that since the affinity of a reaction is equal to zero at the state of a chemical equilibrium, it follows that equality in Equation (54) holds if and only if $ν (E, m) = 0$ for some $E \in {\bar{R}}_{+}^{q}$ and $m \in {\bar{R}}_{+}^{q}$.
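The sign condition in Equation (54) can be checked pointwise on a toy example. The sketch below is illustrative only and rests on several assumptions that are not taken from the paper: a single reversible reaction X1 <-> X2 written as two one-way reactions, unit molar masses (M = I), an ideal chemical potential μ_j = μ0_j + T log m_j in nondimensional units, and rate constants chosen to satisfy detailed balance. The affinity is read off Equation (54) as ν = (A - B) M μ. The function names and all numerical values are hypothetical.

```python
import numpy as np


def affinity(m, A, B, M, mu0, T):
    """nu = (A - B) M mu, with the assumed ideal form mu_j = mu0_j + T*log(m_j)."""
    mu = mu0 + T * np.log(m)
    return (A - B) @ M @ mu


def entropy_production_rate(m, A, B, K, M, mu0, T):
    """(1/T) nu^T K m^A, the coefficient of dt in Equation (54)."""
    m_pow_A = np.prod(m ** A, axis=1)  # i-th entry: prod_j m_j ** A[i, j]
    return float(affinity(m, A, B, M, mu0, T) @ (K @ m_pow_A)) / T


# Toy network X1 <-> X2 written as two one-way reactions (hypothetical data).
A = np.array([[1.0, 0.0], [0.0, 1.0]])  # reactant stoichiometry
B = np.array([[0.0, 1.0], [1.0, 0.0]])  # product stoichiometry
M = np.eye(2)                           # unit molar masses (assumption)
T = 1.0
mu0 = np.array([1.0, 0.0])
k_reverse = 1.0
k_forward = k_reverse * np.exp((mu0[0] - mu0[1]) / T)  # detailed balance
K = np.diag([k_forward, k_reverse])

rng = np.random.default_rng(0)
samples = rng.uniform(0.1, 5.0, size=(1000, 2))
rates = [entropy_production_rate(m, A, B, K, M, mu0, T) for m in samples]
print("minimum sampled rate:", min(rates))
```

For this toy network the rate reduces to (μ_1 - μ_2)(k_1 m_1 - k_2 m_2)/T, and under detailed balance both factors share the same sign, so the rate is nonnegative and vanishes exactly where the affinity is zero.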

Theorem 5.1. Consider the isolated (i.e., $S (t) \equiv 0$ and $d (E) \equiv 0$) interconnected dynamical system $G$ with the power and mass balances given by Equations (43) and (44). Assume that rank $C = q - 1$, Assumption 5.1 holds, and there exists an entropy function $S : {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \to R$ of $G$. Then $(E (t), m (t)) \to R$ as $t \to \infty$, where $(E (t), m (t))$, $t \geq t_{0}$, is the solution to Equations (43) and (44) with the initial condition $(E_{0}, m_{0}) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$ and

\begin{matrix} R = \{(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} : \frac{\partial S (E, m)}{\partial E_{1}} = \dots = \frac{\partial S (E, m)}{\partial E_{q}} \text{ and } ν (E, m) = 0\} \end{matrix}

(55)

where $ν (\cdot, \cdot)$ is the affinity vector of $G$.

Proof. Since the dynamics of the isolated system $G$ are essentially nonnegative, it follows from Proposition 2.1 that $(E (t), m (t)) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$, $t \geq t_{0}$, for all $(E_{0}, m_{0}) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$. Consider a scalar function $v (E, m) = e^{T} E + e^{T} m$, $(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$, and note that $v (0, 0) = 0$ and $v (E, m) > 0$, $(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$, $(E, m) \neq (0, 0)$. It follows from Equation (41), Assumption 5.1, and $e^{T} w (E) \equiv 0$ that the time derivative of $v (\cdot, \cdot)$ along the trajectories of Equations (43) and (44) satisfies

\begin{matrix} \dot{v} (E, m) & = & e^{T} \dot{E} + e^{T} \dot{m} \\ & = & e^{T} P (E, m) M {(B - A)}^{T} \tilde{K} m^{A} \\ & = & μ^{T} (E, m) M {(B - A)}^{T} \tilde{K} m^{A} \\ & = & - ν^{T} (E, m) \tilde{K} m^{A} \\ & \leq & 0, \quad (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(56)

which implies that the solution $(E (t), m (t))$, $t \geq t_{0}$, to Equations (43) and (44) is bounded for all initial conditions $(E_{0}, m_{0}) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$.

Next, consider the function $\tilde{v} (E, m) = e^{T} E + e^{T} m - S (E, m)$, $(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$. Then it follows from Equations (51) and (56) that the time derivative of $\tilde{v} (\cdot, \cdot)$ along the trajectories of Equations (43) and (44) satisfies

\begin{matrix} \dot{\tilde{v}} (E, m) & = & e^{T} \dot{E} + e^{T} \dot{m} - \dot{S} (E, m) \\ & = & - ν^{T} (E, m) \tilde{K} m^{A} - \sum_{i = 1}^{q} \sum_{j = i + 1}^{q} (\frac{\partial S (E, m)}{\partial E_{i}} - \frac{\partial S (E, m)}{\partial E_{j}}) ϕ_{i j} (E) \\ & \leq & 0, \quad (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(57)

which implies that $\tilde{v} (\cdot, \cdot)$ is a nonincreasing function of time, and hence, by the Krasovskii–LaSalle theorem [7], $(E (t), m (t)) \to R ≜ \{(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} : \dot{\tilde{v}} (E, m) = 0\}$ as $t \to \infty$. Now, it follows from Definition 3.1, Assumption 5.1, and the fact that rank $C = q - 1$ that

\begin{matrix} R & = & \{(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} : \frac{\partial S (E, m)}{\partial E_{1}} = \dots = \frac{\partial S (E, m)}{\partial E_{q}}\} \\ & & \cap \{(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} : ν (E, m) = 0\} \end{matrix}

(58)

which proves the result. □

Theorem 5.1 implies that the state of the interconnected dynamical system $G$ converges to the state of thermal and chemical equilibrium when the temperatures of all substances of $G$ are equal and the masses of all substances reach a state where all reaction affinities are zero corresponding to a halting of all chemical reactions.

Next, we assume that the entropy of the interconnected dynamical system $G$ is a sum of individual entropies of subsystems of $G$, that is, $S (E, m) = \sum_{j = 1}^{q} S_{j} (E_{j}, m_{j})$, $(E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q}$. In this case, the Helmholtz free energy of $G$ is given by

\begin{matrix} F (E, m) = e^{T} E - \sum_{j = 1}^{q} {(\frac{\partial S (E, m)}{\partial E_{j}})}^{- 1} S_{j} (E_{j}, m_{j}), (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(59)

If the interconnected dynamical system $G$ is isothermal, then the derivative of $F (\cdot, \cdot)$ along the trajectories of Equations (43) and (44) is given by

\begin{matrix} \dot{F} (E, m) & = & e^{T} \dot{E} - \sum_{j = 1}^{q} {(\frac{\partial S (E, m)}{\partial E_{j}})}^{- 1} {\dot{S}}_{j} (E_{j}, m_{j}) \\ & = & e^{T} \dot{E} - \sum_{j = 1}^{q} {(\frac{\partial S (E, m)}{\partial E_{j}})}^{- 1} [\frac{\partial S_{j} (E_{j}, m_{j})}{\partial E_{j}} {\dot{E}}_{j} + \frac{\partial S_{j} (E_{j}, m_{j})}{\partial m_{j}} {\dot{m}}_{j}] \\ & = & μ^{T} (E, m) M {(B - A)}^{T} \tilde{K} m^{A} \\ & = & - ν^{T} (E, m) \tilde{K} m^{A} \\ & \leq & 0, \quad (E, m) \in {\bar{R}}_{+}^{q} \times {\bar{R}}_{+}^{q} \end{matrix}

(60)

with equality in Equation (60) holding if and only if $ν (E, m) = 0$ for some $E \in {\bar{R}}_{+}^{q}$ and $m \in {\bar{R}}_{+}^{q}$, which determines the state of chemical equilibrium. Hence, the Helmholtz free energy of $G$ evolves to a minimum when the pressure and temperature of each subsystem of $G$ are maintained constant, which is consistent with classical thermodynamics. A similar conclusion can be arrived at for the Gibbs free energy if work energy considerations to and by the system are addressed. Thus, the Gibbs and Helmholtz free energies are a measure of the tendency for a reaction to take place in the interconnected system $G$, and hence, provide a measure of the work done by the interconnected system $G$.
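The monotone decrease expressed by the last three rows of Equation (60) can be observed along a simulated trajectory. The following is a rough sketch under assumptions that are not part of the paper: the same kind of toy isothermal network X1 <-> X2 with unit molar masses, an ideal chemical potential μ_j = μ0_j + T log m_j in nondimensional units, detailed-balance rate constants, and a stand-in free energy `free_energy` chosen only so that its gradient with respect to m equals μ. It is not the Helmholtz free energy of Equation (59). All names and numbers are hypothetical.

```python
import numpy as np

# Toy isothermal network X1 <-> X2 as two one-way reactions (hypothetical data).
A = np.array([[1.0, 0.0], [0.0, 1.0]])  # reactant stoichiometry
B = np.array([[0.0, 1.0], [1.0, 0.0]])  # product stoichiometry
M = np.eye(2)                           # unit molar masses (assumption)
T = 1.0
mu0 = np.array([1.0, 0.0])
K = np.diag([np.exp((mu0[0] - mu0[1]) / T), 1.0])  # detailed balance


def chemical_potential(m):
    """Assumed ideal form mu_j = mu0_j + T*log(m_j)."""
    return mu0 + T * np.log(m)


def free_energy(m):
    """Stand-in free energy whose gradient in m is chemical_potential(m)."""
    return float(np.sum(m * (mu0 + T * (np.log(m) - 1.0))))


def mass_rate(m):
    """Mass-action balance dm/dt = M (B - A)^T K m^A."""
    return M @ (B - A).T @ K @ np.prod(m ** A, axis=1)


def simulate(m0, dt=1e-3, steps=10000):
    """Forward-Euler integration of the mass balance, recording the free energy."""
    m = np.asarray(m0, dtype=float).copy()
    F_hist = [free_energy(m)]
    for _ in range(steps):
        m = m + dt * mass_rate(m)
        F_hist.append(free_energy(m))
    return m, np.array(F_hist)


m_final, F_hist = simulate([2.0, 0.5])
nu_final = (A - B) @ M @ chemical_potential(m_final)
print("final masses:", m_final)
print("final affinity:", nu_final)
print("free energy nonincreasing:", bool(np.all(np.diff(F_hist) <= 1e-12)))
```

Along the computed trajectory the total mass is conserved, the stand-in free energy does not increase, and the affinity tends to zero, which mirrors the chemical-equilibrium part of Theorem 5.1 for this simple case.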

6. Conclusion and Opportunities for Future Research

In this paper, we developed a system-theoretic perspective for classical thermodynamics and chemical reaction processes. In particular, we developed a nonlinear compartmental model involving heat flow, work energy, and chemical reactions that captures all of the key aspects of thermodynamics, including its fundamental laws. In addition, we showed that the interconnected compartmental model gives rise to globally semistable equilibria involving states of temperature equipartition. Finally, using the notion of the chemical potential, we combined our heat flow compartmental model with a state space mass-action kinetics model to capture energy and mass exchange in interconnected large-scale systems in the presence of chemical reactions. In this case, it was shown that the system states converge to a state of temperature equipartition and zero affinity.

The underlying intention of this paper as well as [4,5,6] has been to present one of the most useful and general physical branches of science in the language of dynamical systems theory. In particular, our goal has been to develop a dynamical system formalism of thermodynamics using a large-scale interconnected systems theory that bridges the gap between classical and statistical thermodynamics. The laws of thermodynamics are among the most firmly established laws of nature, and it is hoped that this work will help to stimulate increased interaction between physicists and dynamical systems and control theorists. Besides the fact that irreversible thermodynamics plays a critical role in the understanding of our physical universe, it forms the underpinning of several fundamental life science and engineering disciplines, including biological systems, physiological systems, neuroscience, chemical reaction systems, ecological systems, demographic systems, transportation systems, network systems, and power systems, to cite but a few examples.

An important area of science where the dynamical system framework of thermodynamics can prove invaluable is in neuroscience. Advances in neuroscience have been closely linked to mathematical modeling beginning with the integrate-and-fire model of Lapicque [23] and proceeding through the modeling of the action potential by Hodgkin and Huxley [24] to the current era of mathematical neuroscience; see [25,26] and the numerous references therein. Neuroscience has always had models to interpret experimental results from a high-level complex systems perspective; however, expressing these models with dynamic equations rather than words fosters precision, completeness, and self-consistency. Nonlinear dynamical system theory, and in particular system thermodynamics, is ideally suited for rigorously describing the behavior of large-scale networks of neurons.

Merging the two universalisms of thermodynamics and dynamical systems theory with neuroscience can provide the theoretical foundation for understanding the network properties of the brain by rigorously addressing large-scale interconnected biological neuronal network models that govern the neuroelectronic behavior of biological excitatory and inhibitory neuronal networks [27]. As in thermodynamics, neuroscience is a theory of large-scale systems wherein graph theory can be used in capturing the connectivity properties of system interconnections, with neurons represented by nodes, synapses represented by edges or arcs, and synaptic efficacy captured by edge weighting giving rise to a weighted adjacency matrix governing the underlying directed graph network topology. However, unlike thermodynamics, wherein energy spontaneously flows from a state of higher temperature to a state of lower temperature, neuron membrane potential variations occur due to ion species exchanges which evolve from regions of higher concentrations to regions of lower concentrations. And this evolution does not occur spontaneously but rather requires the opening and closing of specific gates within specific ion channels.
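The graph description above (neurons as nodes, synapses as directed edges, synaptic efficacy as edge weights) maps directly onto a weighted adjacency matrix. The snippet below is a small illustrative sketch, not a model from [27]: the index convention `W[i, j]` for a synapse from neuron `i` to neuron `j`, the use of a negative weight for an inhibitory synapse, and the example values are all assumptions.

```python
import numpy as np


def weighted_adjacency(num_neurons, synapses):
    """Return W with W[i, j] = total efficacy of synapses from neuron i to neuron j."""
    W = np.zeros((num_neurons, num_neurons))
    for pre, post, weight in synapses:
        W[pre, post] += weight
    return W


# Hypothetical three-neuron directed ring; the negative weight marks inhibition.
synapses = [(0, 1, 0.8), (1, 2, 0.5), (2, 0, -0.3)]
W = weighted_adjacency(3, synapses)
print(W)
```

Since the graph is directed, `W` is generally not symmetric, in contrast to the symmetric coupling typically assumed for heat exchange between compartments.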

A particularly interesting application of nonlinear dynamical systems theory to the neurosciences is to study phenomena of the central nervous system that exhibit nearly discontinuous transitions between macroscopic states. A very challenging and clinically important problem exhibiting this phenomenon is the induction of general anesthesia [28,29,30,31,32]. In any specific patient, the transition from consciousness to unconsciousness as the concentration of anesthetic drugs increases is very sharp, resembling a thermodynamic phase transition. In current clinical practice of general anesthesia, potent drugs are administered which profoundly influence levels of consciousness and vital respiratory (ventilation and oxygenation) and cardiovascular (heart rate, blood pressure, and cardiac output) functions. These variation patterns of the physiologic parameters (i.e., ventilation, oxygenation, heart rate, blood pressure, and cardiac output) and their alteration with levels of consciousness can provide scale-invariant fractal temporal structures to characterize the degree of consciousness in sedated patients.

In particular, the degree of consciousness reflects the adaptability of the central nervous system and is proportional to the maximum work output under a fully conscious state divided by the work output of a given anesthetized state. A reduction in maximum work output (and oxygen consumption) or elevation in the anesthetized work output (or oxygen consumption) will thus reduce the degree of consciousness. Hence, the fractal nature (i.e., complexity) of conscious variability is a self-organizing emergent property of the large-scale interconnected biological neuronal network since it enables the central nervous system to maximize entropy production and optimally dissipate energy gradients. In physiologic terms, a fully conscious healthy patient would exhibit rich fractal patterns in space (e.g., fractal vasculature) and time (e.g., cardiopulmonary variability) that optimize the ability for oxygenation and ventilation. Within the context of aging and complexity in acute illnesses, variation of physiologic parameters and their relationship to system complexity, fractal variability, and system thermodynamics have been explored in [33,34,35,36,37,38].

Merging system thermodynamics with neuroscience can provide the theoretical foundation for understanding the mechanisms of action of general anesthesia using the network properties of the brain. Even though simplified mean field models have been extensively used in the mathematical neuroscience literature to describe large neural populations [26], complex large-scale interconnected systems are essential in identifying the mechanisms of action for general anesthesia [27]. Unconsciousness is associated with reduced physiologic parameter variability, which reflects the inability of the central nervous system to adapt, thereby decomplexifying physiologic work cycles and decreasing energy consumption (ischemia, hypoxia), which leads to a decrease in entropy production. The degree of consciousness is a function of the numerous couplings of the network properties in the brain that form a complex, large-scale, interconnected system. Complexity here refers to the quality of a system wherein interacting subsystems self-organize to form hierarchical evolving structures exhibiting emergent system properties; hence, a complex dynamical system is a system that is greater than the sum of its subsystems or parts. This complex system, which involves numerous nonlinear dynamical subsystem interactions, has inherent emergent properties that depend on the integrity of the entire dynamical system and not merely on a mean field simplified reduced-order model.

Developing a dynamical system framework for neuroscience [27] and merging it with system thermodynamics [4,5,6] by embedding thermodynamic state notions (i.e., entropy, energy, free energy, chemical potential, etc.) will allow us to directly address the otherwise mathematically complex and computationally prohibitive large-scale dynamical models that have been developed in the literature. In particular, a thermodynamically consistent neuroscience model would emulate the clinically observed self-organizing spatio-temporal fractal structures that optimally dissipate energy and optimize entropy production in thalamocortical circuits of fully conscious patients. This thermodynamically consistent neuroscience framework can provide the necessary tools involving semistability, synaptic drive equipartitioning (i.e., synchronization across time scales), energy dispersal, and entropy production for connecting biophysical findings to psychophysical phenomena for general anesthesia.

In particular, we conjecture that as the model dynamics transition to an anesthetic state, the system will undergo a reduction in system complexity (defined as a reduction in the degree of irregularity across time scales), exhibiting semistability and synchronization of neural oscillators (i.e., thermodynamic energy equipartitioning). In other words, unconsciousness will be characterized by system decomplexification. In addition, connections between thermodynamics, neuroscience, and the arrow of time [4,5,6] can be explored by developing an understanding of how the arrow of time is built into the very fabric of our conscious brain. Connections between thermodynamics and neuroscience are not limited to the study of consciousness in general anesthesia and can be seen in biochemical systems, ecosystems, gene regulation and cell replication, as well as numerous medical conditions (e.g., seizures, schizophrenia, hallucinations, etc.), which are obviously of great clinical importance but have been lacking rigorous theoretical frameworks. This is a subject of current research.

Acknowledgements

This research was supported in part by the Air Force Office of Scientific Research under Grant FA9550-12-1-0192.

References

1. Truesdell, C. Rational Thermodynamics; McGraw-Hill: New York, NY, USA, 1969.
2. Truesdell, C. The Tragicomical History of Thermodynamics 1822–1854; Springer-Verlag: New York, NY, USA, 1980.
3. Arnold, V. Contact Geometry: The Geometrical Method of Gibbs’ Thermodynamics. In Proceedings of the Gibbs Symposium, New Haven, CT, USA, 15–17 May 1989; Caldi, D., Mostow, G., Eds.; American Mathematical Society: Providence, RI, USA, 1990; pp. 163–179.
4. Haddad, W.M.; Chellaboina, V.; Nersesov, S.G. Thermodynamics. A Dynamical Systems Approach; Princeton University Press: Princeton, NJ, USA, 2005.
5. Haddad, W.M.; Chellaboina, V.; Nersesov, S.G. Time-reversal symmetry, Poincaré recurrence, irreversibility, and the entropic arrow of time: From mechanics to system thermodynamics. Nonlinear Anal. Real World Appl. 2008, 9, 250–271.
6. Haddad, W.M. Temporal asymmetry, entropic irreversibility, and finite-time thermodynamics: From Parmenides–Einstein time–reversal symmetry to the Heraclitan entropic arrow of time. Entropy 2012, 14, 407–455.
7. Haddad, W.M.; Chellaboina, V. Nonlinear Dynamical Systems and Control. A Lyapunov-Based Approach; Princeton University Press: Princeton, NJ, USA, 2008.
8. Gibbs, J.W. On the equilibrium of heterogeneous substances. Trans. Conn. Acad. Sci. 1875, III, 108–248.
9. Gibbs, J.W. On the equilibrium of heterogeneous substances. Trans. Conn. Acad. Sci. 1878, III, 343–524.
10. Hartman, P. Ordinary Differential Equations; Birkhäuser: Boston, MA, USA, 1982.
11. Haddad, W.M.; Chellaboina, V.; Hui, Q. Nonnegative and Compartmental Dynamical Systems; Princeton University Press: Princeton, NJ, USA, 2010.
12. Haddad, W.M.; Chellaboina, V. Stability and dissipativity theory for nonnegative dynamical systems: A unified analysis framework for biological and physiological systems. Nonlinear Anal. Real World Appl. 2005, 6, 35–65.
13. Diestel, R. Graph Theory; Springer-Verlag: New York, NY, USA, 1997.
14. Godsil, C.; Royle, G. Algebraic Graph Theory; Springer-Verlag: New York, NY, USA, 2001.
15. Steinfeld, J.I.; Francisco, J.S.; Hase, W.L. Chemical Kinetics and Dynamics; Prentice-Hall: Upper Saddle River, NJ, USA, 1989.
16. Chellaboina, V.; Bhat, S.P.; Haddad, W.M.; Bernstein, D.S. Modeling and analysis of mass action kinetics: Nonnegativity, realizability, reducibility, and semistability. Contr. Syst. Mag. 2009, 29, 60–78.
17. Erdi, P.; Toth, J. Mathematical Models of Chemical Reactions: Theory and Applications of Deterministic and Stochastic Models; Princeton University Press: Princeton, NJ, USA, 1988.
18. Baierlein, R. The elusive chemical potential. Am. J. Phys. 2001, 69, 423–434.
19. Fuchs, H.U. The Dynamics of Heat; Springer-Verlag: New York, NY, USA, 1996.
20. Job, G.; Herrmann, F. Chemical potential–A quantity in search of recognition. Eur. J. Phys. 2006, 27, 353–371.
21. DeDonder, T. L’Affinité; Gauthier-Villars: Paris, France, 1927.
22. DeDonder, T.; Rysselberghe, P.V. Affinity; Stanford University Press: Menlo Park, CA, USA, 1936.
23. Lapicque, L. Recherches quantitatives sur l’excitation électrique des nerfs traitée comme une polarisation. J. Physiol. Gen. 1907, 9, 620–635.
24. Hodgkin, A.L.; Huxley, A.F. A quantitative description of membrane current and application to conduction and excitation in nerve. J. Physiol. 1952, 117, 500–544.
25. Dayan, P.; Abbott, L.F. Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems; MIT Press: Cambridge, MA, USA, 2005.
26. Ermentrout, B.; Terman, D.H. Mathematical Foundations of Neuroscience; Springer-Verlag: New York, NY, USA, 2010.
27. Hui, Q.; Haddad, W.M.; Bailey, J.M. Multistability, bifurcations, and biological neural networks: A synaptic drive firing model for cerebral cortex transition in the induction of general anesthesia. Nonlinear Anal. Hybrid Syst. 2011, 5, 554–572.
28. Mashour, G.A. Consciousness unbound: Toward a paradigm of general anesthesia. Anesthesiology 2004, 100, 428–433.
29. Zecharia, A.Y.; Franks, N.P. General anesthesia and ascending arousal pathways. Anesthesiology 2009, 111, 695–696.
30. Sonner, J.M.; Antognini, J.F.; Dutton, R.C.; Flood, P.; Gray, A.T.; Harris, R.A.; Homanics, G.E.; Kendig, J.; Orser, B.; Raines, D.E.; et al. Inhaled anesthetics and immobility: Mechanisms, mysteries, and minimum alveolar anesthetic concentration. Anesth. Analg. 2003, 97, 718–740.
31. Campagna, J.A.; Miller, K.W.; Forman, S.A. Mechanisms of actions of inhaled anesthetics. N. Engl. J. Med. 2003, 348, 2110–2124.
32. John, E.R.; Prichep, L.S. The anesthetic cascade: A theory of how anesthesia suppresses consciousness. Anesthesiology 2005, 102, 447–471.
33. Macklem, P.T.; Seely, A.J.E. Towards a definition of life. Perspectives Biol. Med. 2010, 53, 330–340.
34. Seely, A.J.E.; Macklem, P. Fractal variability: An emergent property of complex dissipative systems. Chaos 2012, 22, 1–7.
35. Bircher, J. Towards a dynamic definition of health and disease. Med. Health Care Philos. 2005, 8, 335–341.
36. Goldberger, A.L.; Rigney, D.R.; West, B.J. Science in pictures: Chaos and fractals in human physiology. Sci. Am. 1990, 262, 42–49.
37. Goldberger, A.L.; Peng, C.K.; Lipsitz, L.A. What is physiologic complexity and how does it change with aging and disease? Neurobiol. Aging 2002, 23, 23–27.
38. Godin, P.J.; Buchman, T.G. Uncoupling of biological oscillators: A complementary hypothesis concerning the pathogenesis of multiple organ dysfunction syndrome. Crit. Care Med. 1996, 24, 1107–1116.

© 2013 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
