Exact Boolean Abstraction of Linear Equation Systems

Allart, Emilie; Niehren, Joachim; Versari, Cristian

doi:10.3390/computation9110113

Open AccessArticle

Exact Boolean Abstraction of Linear Equation Systems

by

Emilie Allart

^1,2,†

,

Joachim Niehren

^1,3,†

and

Cristian Versari

^1,2,*,†

¹

CRIStAL—Centre de Recherche en Informatique, Signal et Automatique de Lille—UMR 9189, Université de Lille—Campus Scientifique, 59655 Villeneuve-d’Ascq, France

²

Faculte des Sciences et Technologies, University of Lille, 59650 Villeneuve-d’Ascq, France

³

Inria, Université de Lille, 59000 Lille, France

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Computation 2021, 9(11), 113; https://doi.org/10.3390/computation9110113

Submission received: 31 July 2021 / Revised: 11 October 2021 / Accepted: 12 October 2021 / Published: 21 October 2021

(This article belongs to the Special Issue Formal Method for Biological Systems Modelling)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

We study the problem of how to compute the boolean abstraction of the solution set of a linear equation system over the positive reals. We call a linear equation system ϕ exact for the boolean abstraction if the abstract interpretation of ϕ over the structure of booleans is equal to the boolean abstraction of the solution set of ϕ over the positive reals. Abstract interpretation over the booleans is thus complete for the boolean abstraction when restricted to exact linear equation systems, while it is not complete more generally. We present a new rewriting algorithm that makes linear equation systems exact for the boolean abstraction while preserving the solutions over the positive reals. The rewriting algorithm is based on the elementary modes of the linear equation system. The computation of the elementary modes may require exponential time in the worst case, but is often feasible in practice with freely available tools. For exact linear equation systems, we can compute the boolean abstraction by finite domain constraint programming. This yields a solution of the initial problem that is often feasible in practice. Our exact rewriting algorithm has two further applications. Firstly, it can be used to compute the sign abstraction of linear equation systems over the reals, as needed for analyzing function programs with linear arithmetics. Secondly, it can be applied to compute the difference abstraction of a linear equation system as used in change prediction algorithms for flux networks in systems biology.

Keywords:

linear equation systems; abstract interpretation; program analysis; systems biology

1. Introduction

We develop approaches to remedy the incompleteness of abstract interpretation [1] of linear equation systems over the reals, in the algebra of booleans

B = {0, 1}

and the structure of signs

S = {- 1, 0, 1}

. These abstractions have various applications: in systems biology, the boolean abstraction underlies abstractions of chemical reactions networks into Boolean networks [2,3]. In program analysis, the sign abstraction can be applied to functional programs with arithmetics for analyzing the signs of the possible values of floating-point variables [4,5].

The soundness of abstract interpretations of first-order logic formulas without negation was shown by John [6,7,8,9]. It applies to the interpretation in any concrete structure S, as long as it is connected by a homomorphism

h : S \to Δ

to the abstract structure

Δ

. The concrete interpretation of a first-order formula

ϕ

is the set of concrete solutions

s o l^{S} (ϕ)

, and its abstract interpretation is the set of its abstract solutions

s o l^{Δ} (ϕ)

. John’s soundness theorem (see Theorem 1 below) states that the set of abstract solutions of overapproximates the abstraction by h of the set of concrete solutions:

h \circ s o l^{S} (ϕ) \subseteq s o l^{Δ} (ϕ)

When choosing the operators in

Σ_{b o o l} = {+, *, 0, 1}

, the class of negation-free first-order formulas with operators in

Σ_{b o o l}

extends on the classes of linear and polynomial equation systems. In this article, we consider the boolean abstraction which is the unique homomorphism

h_{B} : R_{+} \to B

, and the sign abstraction which is the unique homomorphism

h_{S} : R \to S

with respect to the operators in

Σ_{b o o l}

. The boolean abstraction maps any strictly positive real to 1 and 0 to 0. The sign abstraction extends on the boolean abstraction while mapping all strictly negative reals to

- 1

. We note that the structure of signs

S

is not an algebra since the sum of a positive and a negative number may have any sign.

1.1. Problematics

A natural question is whether abstract interpretation is complete [10] for the abstraction of formulas induced by a homomorphisms

h : S \to Δ

, i.e, whether for all negation-free first-order formulas

ϕ

with the same operators:

h \circ s o l^{S} (ϕ) = s o l^{Δ} (ϕ)

We call a formula

ϕ

h-exact if it satisfies this property. A counter example against the completeness of abstraction interpretation for the boolean and the sign abstraction is the linear equation

ϕ_{0}

equal to

x + y \overset{\circ}{=} x + z

. Here, we write

\overset{\circ}{=}

for the equality symbol inside the logic, to point out its difference from equality in the language of mathematics. Formula

ϕ_{0}

is neither

h_{B}

-exact nor

h_{S}

-exact. This can be seen as follows. Over the reals

ϕ_{0}

is equivalent to

y \overset{\circ}{=} z

, so that all assignments

τ

that are abstractions of concrete solutions of

ϕ_{0}

must satisfy

τ (y) = τ (z)

. When interpreted abstractly over

B

or

S

, however,

ϕ_{0}

admits the abstract solution

τ = [x / 1, y / 1, z / 0]

which is not the abstraction of any concrete solution since

τ (y) \neq τ (z)

.

To deal with the incompleteness of abstract interpretation, we propose to study the following two questions for homomorphism

h : S \to Δ

where h is either the boolean abstraction

h_{B}

or the sign abstraction

h_{S}

.

Exact Rewriting Can we rewrite linear equation systems to h-exact formulas?

Computing Abstractions Can we can compute the abstraction

h \circ s o l^{S} (ϕ)

exactly for a given system of homogeneous linear equations

ϕ

?

Geometrically speaking, the concrete solution sets

s o l^{R_{+}} (ϕ)

and

s o l^{R} (ϕ)

of homogeneous linear equation systems

ϕ

are polytopes—i.e., finite intersections of half-spaces in

R^{fv (ϕ)}

. The problem of computing boolean abstractions or sign abstraction is thus to compute the

h_{Δ}

abstraction of a polytope given by a linear equation system.

For any h-exact formula

ϕ

, the computation of abstractions

h \circ s o l^{S} (ϕ)

is equivalent to the computation of

s o l^{Δ} (ϕ)

. Since the abstract structure

Δ

is finite for the boolean and sign abstraction, we can compute the set of abstract solutions in at most exponential time by a naive generate and test algorithm. Finite domain constraint programming [11] can by used to avoid the naive generation of all variable assignments to

Δ

in many practical cases. Therefore, any algorithm for exact rewriting induces an algorithm for computing abstractions that may be feasible in practice.

1.2. Contributions

Our main result is the first algorithm for exact rewriting that applies to linear equation systems for the Boolean abstraction. Based on this algorithm, we present a novel algorithm for computing the sign abstraction of linear equation systems.

Exact Rewriting for the Boolean Abstraction. In the first step, we study exact rewriting of (homogeneous) linear equation systems for boolean abstraction. The counter example

ϕ_{0}

, for instance, can be rewritten to

h_{B}

-exact formula

y \overset{\circ}{=} z

. The idea is to take the system of all linear consequences over

R_{+}

of the linear equation system. There may be infinitely many such consequences, but all of them are linear combinations of the extreme rays of the cone

s o l^{R_{+}} (ϕ_{0})

. Up to normalization, there are only finitely many extreme rays, which are known as the elementary modes of the linear equation system [12,13,14,15]. These can be computed by libraries from computational geometry [16] in at most exponential time. Nevertheless, the computation is often well-behaved in practice.

Based on the elementary modes (Folklore Theorem 2), we can rewrite any (homogeneous) linear equation system into quasi-positive and strongly-triangular linear equation system that is equivalent over

R_{+}

(Corollary 1), that can be computed in at most exponential time. As we prove, such systems are always

h_{B}

-exact (Theorem 3). Hence, any system of linear equations can be converted in at most exponential time to some

R_{+}

-equivalent

h_{B}

-exact formula.

Note that

h_{B}

-exact formulas may still not be

S

-exact. A counter example is the strongly-triangular quasi-positive linear system

u + v \overset{\circ}{=} x \land u + v \overset{\circ}{=} y

. It is not

h_{S}

-exact, since it entails

x \overset{\circ}{=} y

over

R

but still has the abstract solution

[u / 1, v / - 1, x / 1, y / - 1]

over

S

which maps x and y to distinct signs. Indeed, we don’t have any idea of how to do exact rewriting for the sign abstraction. The problem is that positivity is essential for our approach, and since the addition of positive and negative numbers may have any sign,

S

fails to be an algebra, making the analogous argument as in the proof of

B

-exactness fail.

Extension to

h_{B}

-Mixed Systems. In the second step, we introduce

h_{B}

-mixed systems, which by Theorem 4 generalize on systems of 1. linear equations, 2. positive polynomial equations

p \overset{\circ}{=} 0

, and 3. positive polynomial inequations

p \overset{\circ}{\neq} 0

, where p is a positive polynomial without constant term. We then show our main result:

Theorem 5 (Main). Any

h_{B}

-mixed systems can be converted to a

h_{B}

-exact formula by converting its linear subsystem to an

h_{B}

-exact formula.

The correctness of the algorithm for

h_{B}

-mixed systems relies on the notion of

h_{B}

-invariant formulas that we introduce. The class of

h_{B}

-invariant formulas subsume systems of positive polynomial equations

p \overset{\circ}{=} 0

and inequations

p \overset{\circ}{\neq} 0

, where p is a positive polynomials without constant terms.

Computing Sign Abstractions. In the third step, we present an algorithm for computing the sign abstraction of (homogeneous) systems of linear equations based on exact rewriting for the boolean abstraction (Theorem 6). For this, we decompose the sign abstraction into the boolean abstraction based on a function that is definable in first-order logic. This function decomposes real numbers into their positive part x and negative part y. At least one of these two parts must be zero, which can be expressed by the polynomial equation

x * y \overset{\circ}{=} 0

. The positivity of x can be expressed by

\exists z . x \overset{\circ}{=} z * z

and the positivity of y in analogy. In this way, we can reduce the problem of computing

h_{S} \circ s o l^{R} (ϕ)

to the problem of computing

h_{B} \circ s o l^{R_{+}} (ϕ^{'})

for some existentially quantified

h_{B}

-mixed system

ϕ^{'}

that we can make

h_{B}

-exact based on our main Theorem 5.

Application to Program Analysis. We show how to apply the computation of the sign abstraction of linear equation systems to improve the analysis of functional programs with arithmetics. For finding program errors there it can be useful to know about the possible signs of the values of program variables. We elaborate an example in the final Section 10.

Implementation. We implemented the

h_{B}

-exact rewriting algorithm for

h_{B}

-mixed systems from the main Theorem 5 in Python. For this we used a library from computational geometry [16] for computing elementary modes. We also used finite domain constraint programming with Minizinc [17] for computing the set of boolean solutions over logical formulas. Some successful experiments are mentioned in the related work subsection below. We did not yet implemented the algorithm for computing sign abstractions, nor its application to program analysis though.

1.3. Related Work

We start with related work by the authors, and then move to related work by others.

Change Prediction of Reaction Networks. Our main Theorem 5 was recently applied to the change prediction of reaction networks in systems biology [6]. Indeed, the development of the present article was originally motivated by this application. The problem there is to compute the difference abstraction of linear equation systems, expressing the steady state semantics of chemical reaction networks. Two difference abstractions were considered,

h_{Δ_{3}} : R_{+}^{2} \to {△, ▽, \approx}

and a refinement thereof

h_{Δ_{6}} : R_{+}^{2} \to {↑, ↓, ~, ⇑, ⇓, \approx}

. In analogy to the approach adopted for computing sign abstractions (step three above), the algorithmic approach presented there is to decompose the difference abstractions

h_{Δ_{3}}

and

h_{Δ_{6}}

into the boolean abstraction

h_{B}

and functions that are definable in first-order logic. The elaboration of this approach, however, is quite different for reflecting the nature of the difference abstractions.

Experimentation. We tested our implementation of the exact rewriting algorithm for the boolean abstraction successfully for computing difference abstractions in the application of change prediction in systems biology. The experimental results are presented in [6] are generally encouraging. They show that

h_{B}

-exact rewriting based on elementary modes in combination with finite domain constraint programming may indeed avoid naive generate and test in many practical examples. In some of these examples, however, the overall computation time still took some hours.

Abstracting Reaction Networks to Boolean Networks. Independently, the authors proposed an abstraction of chemical reaction networks to boolean networks in [18], whose precision can be improved by using the

h_{B}

-exact rewriting of

h_{B}

-mixed equation systems.

Alternative Algorithm for Computing Sign Abstractions. An alternative algorithm for computing the sign abstraction of linear equation systems (and thus also the boolean abstraction) can be obtained by John’s overapproximation Theorem 1. It shows that it is sufficient to generate the finitely many abstract solutions in

τ \in s o l^{S} (ϕ)

, and then to check for each of them whether there exists a concrete solution

σ

such that

τ = h_{S} \circ σ

. To perform this latter test, note that

h Δ (x) \overset{\circ}{=} 1

is equivalent to the strict inequation

x > 0

and

h_{S} (x) \overset{\circ}{=} 0

by the equation

x \overset{\circ}{=} 0

. Similarly,

h_{S} (x) \overset{\circ}{=} - 1

can be defined by the strict inequation

x < 0

. Whether there exists a concrete solution

σ \in s o l^{R} (ϕ)

such

τ = h \circ σ

is thus equivalent to the satisfiability of

ϕ \land ⋀_{x \in fv (ϕ)} h_{S} (x) \overset{\circ}{=} τ (x)

over

R

, where

fv (ϕ)

is the set of free variables of

ϕ

. The satisfiability of systems of strict linear inequations and homogeneous linear equations without constant terms over

R

are known to be decidable since at least 1926 [19]. But still, one has to generate the set of all abstract solutions

s o l^{S} (ϕ)

. The new algorithm presented above avoids generating this set.

Abstract Program Interpretation over Numerical Domains. In abstract interpretation [20], nonrelational domains permit to approximate the set of values of program variables while ignoring the relationship to the values of others. Well-known nonrelational numerical domains include the interval domain [21] describing invariants of the form

⋀_{i = 1}^{m} x_{i} \in [r_{i}, r_{i}^{'}]

with reals

r_{i} \leq r_{i}^{'}

and the constant propagation domain for invariants of the form

⋀_{i = 1}^{m} x_{i} \overset{\circ}{=} r_{i}

[22].

Abstract interpretation of relational domains may yield better approximations that with nonrelational domains, since relationships between the values of different variables can be taken into account. Well-known relational numerical domains include the polyhedral domain [4]. A polyhedron is the solution set of systems of inhomogeneous linear inequations of the form

n_{1} x_{1} + \dots + n_{m} x_{m} \leq r

. Alternatively, the linear equality domain [23] was considered. These are defined by system of inhomogeneous linear equations

n_{1} x_{1} + \dots + n_{m} x_{n} \overset{\circ}{=} r

.

In the present paper, we study the problem of computing the sign abstraction of polytopes represented by homogeneous linear equation systems. The polytopes can be obtained by existing methods for the abstract program interpretation over the polyhedral domain. One weakness of our approach is that we study the homogeneous case only, so that we can only abstract polytope and not more general polyhedrons.

1.4. Outline

In Section 2, we recall preliminaries on homomorphisms between

Σ

-structures. In Section 3, the first-order logic is recalled. John’s theorem and its relation to the soundness and completeness of abstract interpretation in the classical framework are discussed in Section 4. We discuss how to make linear equation system quasipositive and strongly triangular based on elementary modes in Section 5. These properties can be used to prove

h_{B}

-exactness as we show in Section 6, and thus to obtain an

h_{B}

-exact rewriting of linear equation systems. We introduce the notion of

h_{B}

-invariance in Section 7 and apply it in Section 8 to lift the

h_{B}

-exact rewriting algorithm from linear to

h_{B}

-mixed systems. This allows us to define the sign abstraction of linear equation systems on Section 9. We finally apply this result in Section 10 to the sign analysis of functional programs with arithmetic.

2. Homomorphisms on $Σ$ -Structures

We need some basic notation from set theory and standard notion of universal algebra such as

Σ

-algebras,

Σ

-structures, and homomorphism.

For any set A and

n \in N

, the set of n-tuples of elements in A is denoted by

A^{n}

. For finite sets A the number of elements of A is denoted by

| A |

. Furthermore, for any function

f : A \to B

we define the function

f^{2} : A^{2} \to B^{2}

such that

f^{2} (a, a^{'}) = (f (a), f (a^{'})

for all

a, a^{'} \in A

.

2.1. $Σ$ -Algebras

We next recall the notion of

Σ

-algebras. Let

Σ = \cup_{n \geq 0} F^{(n)} ⊎ C

be a ranked signature. We call the elements of

f \in F^{(n)}

are called n-ary function symbols, even though we may also use them as

n + 1

-ary relation symbols later on when moving to

Σ

-structures. The elements in

c \in C

are called the constants of

Σ

.

Definition 1.

A

Σ

-algebra

S = (dom (S), .^{S})

consists of a set

dom (S)

and an interpretation

.^{S}

such that

c^{S} \in dom (S)

for all

c \in C

, and

f^{S} : dom {(S)}^{n} \to dom (S)

for all

f \in F^{(n)}

.

Let

B = {0, 1}

be the set of booleans,

N

the set of natural numbers including 0,

Z

the set of integers,

R

the set of real numbers, and

R_{+}

the set of positive real numbers including 0. Note that

B \subseteq N \subseteq R_{+} \subseteq R

and

N \subseteq Z \subseteq R

. Let the addition on the reals be the binary function

+^{R} : R^{2} \to R

and the multiplication the binary function

*^{R} : R^{2} \to R

. Let the addition on the positive real numbers

+^{R_{+}} : R_{+}^{2} \to R_{+}

be equal to the restriction

{+^{R}}_{| R_{+} \times R_{+}}

and the multiplication

*^{R_{+}} : R_{+}^{2} \to R_{+}

be the restriction

{*^{R}}_{| R_{+} \times R_{+}}

.

Let

Σ_{b o o l} = {+, *} \cup {0, 1}

be the set of boolean operators where + and * are binary function symbols and 0 and 1 constants. Note that constant 0 is freely overloaded with the boolean 0 and the constant 1 with the boolean 1.

Example 1.

The set of positive reals

R_{+}

can be turned into a

Σ_{b o o l}

-algebra, in which the functions symbols are interpreted as binary functions

+^{R_{+}}

and

*^{R_{+}}

. The constants are interpreted by themselves

0^{R_{+}} = 0

and

1^{R_{+}} = 1

.

Example 2.

The set of Booleans

B = {0, 1} \subseteq R_{+}

equally defines a

Σ_{b o o l}

-algebra. There, the function symbols are interpreted as a disjunction

+^{B} = \lor^{B}

and conjunction

*^{B} = \land^{B}

on Booleans. The constants are interpreted by themselves

0^{B} = 0

and

1^{B} = 1

.

2.2. $Σ$ -Structures

We next recall the usual generalization of

Σ

-algebras to

Σ

-structures. The objective is to generalize from functions to relations. For this, we consider n-ary function symbols as

n + 1

-ary relation symbols.

Definition 2.

A

Σ

-structure

Δ = (dom (Δ), .^{Δ})

consists of a set

dom (Δ)

and an interpretation

.^{Δ}

such that

c^{Δ} \in dom (Δ)

for all

c \in C

and

f^{Δ} \subseteq dom {(Δ)}^{n + 1}

for all

f \in F^{(n)}

.

Clearly, any

Σ

-algebra is also a

Σ

-structure. Note also that symbols in

F^{(0)}

are interpreted as monadic relations, i.e., as subsets of the domain, in contrast to constants in C that are interpreted as elements of the domain.

We denote the subtraction on the reals by the binary function

-^{R} : R^{2} \to R

and the division on the reals by the ternary relation

/^{R} \subseteq R^{2} \times R

. Note that division by zero is undefined. Note also that subtraction on

R_{+}

would yield only a partial function.

Let

Σ_{a r i t h} = {+, *, -, /} \cup {0, 1}

be the arithmetic signature, where 0 and 1 are constants, and all other operators are binary function symbols. Again, we freely overlead to constant 0 with real number 0 and the constant 1 with the real number 1.

Example 3.

The set of reals

R

can be turned into a

Σ_{a r i t h}

-structure, with the interpretation of the binary functions symbols as the ternary relations

+^{R}

,

*^{R}

,

-^{R}

,

/^{R}

. The constants are interpreted by themselves

0^{R} = 0

and

1^{R} = 1

. Note that

/^{R}

is a partial but not a total function, since division by 0 is not defined. So we must see

/^{R}

as a ternary relation, so that

R

is not a

Σ_{a r i t h}

-algebra. It still is a

Σ_{b o o l}

-algebra though.

Example 4.

The set of signs

{- 1, 0, 1} \subseteq R

can be turned into a

Σ_{a r i t h}

-structure

S = ({- 1, 0, 1}, .^{S})

with the interpretation

+^{S}

,

-^{S}

,

*^{S}

and,

/^{S}

given in Figure 1. The constants are interpreted by themselves

0^{S} = 0

and

1^{S} = 1

. Note that all

+^{S}

contains

(- 1, 1, - 1)

,

(- 1, 1, 1)

and

(- 1, 1, 0)

meaning that the sum of a strictly negative and a strictly positive real has a sign in

- 1 +^{S} 1

, so it may either be strictly positive, strictly negative, or zero. So

S

is a

Σ_{a r i t h}

-structure and even when restricting the signature to

Σ_{b o o l}

it remains a

Σ_{b o o l}

-structure that is not a

Σ_{b o o l}

-algebra.

2.3. Homomorphisms

We recall the standard notion of homomorphism for

Σ

-structures which can also be applied to

Σ

-algebras.

Definition 3.

A homomorphism between two Σ-structures S and Δ is a function

h : dom (S) \to dom (Δ)

such that for

c \in C

,

n \in N

,

f \in F^{(n)}

, and

s_{1}, \dots, s_{n + 1} \in dom (S)

:

1: $h (c^{S}) = c^{Δ}$ , and
2: if $(s_{1}, \dots, s_{n + 1}) \in f^{S}$ then $(h (s_{1}), \dots, h (s_{n + 1})) \in f^{Δ}$ .

We can convert any

n + 1

-ary relation to a n-ary set valued functions. In this way, any n-function is converted to a n-ary set valued n-functions. In other words, functions of type

D^{n} \to D

are converted to functions of type

D^{n} \to 2^{D}

where

D = dom (Δ)

. In set-valued notation, the second condition on homomorphism can then be rewritten equivalently as

h (f^{S} (s_{1}, \dots, s_{n})) \subseteq f^{Δ} (h (s_{1}), \dots, h (s_{n}))

. A homomorphism for

Σ

-algebras thus satisfies

h (c^{S}) = c^{Δ}

and for all function symbols

f \in F^{(n)}

and

s_{1}, \dots, s_{n} \in dom (S)

it satisfies

h (f^{S} (s_{1}, \dots, s_{n})) = f^{Δ} (h (s_{1}), \dots, h (s_{n})) .

The boolean abstraction is the function

h_{B} : R_{+} \to B

with

h_{B} (0) = 0

and

h_{B} (r) = 1

if

r > 0

.

Lemma 1.

The boolean abstraction

h_{B}

is a homomorphism between

Σ_{b o o l}

-algebras.

Proof.

For all

r, r^{'} \in R_{+}

we have:

\begin{matrix} h_{B} (r +^{R_{+}} r^{'}) = 1 & \Leftrightarrow & r +^{R_{+}} r^{'} \neq 0 & \Leftrightarrow & r \neq 0 \lor r^{'} \neq 0 & \Leftrightarrow & h_{B} (r) = 1 \lor h_{B} (r^{'}) = 1 \\ h_{B} (r *^{R_{+}} r^{'}) = 1 & \Leftrightarrow & r *^{R_{+}} r^{'} \neq 0 & \Leftrightarrow & r \neq 0 \land r^{'} \neq 0 & \Leftrightarrow & h_{B} (r) = 1 \land h_{B} (r^{'}) = 1 \end{matrix}

Hence,

h_{B} (r +^{R_{+}} r^{'}) = h_{B} (r) +^{B} h_{B} (r^{'})

and

h_{B} (r *^{R_{+}} r^{'}) = h_{B} (r) *^{B} h_{B} (r^{'})

. Finally, for both constants

c \in C

we have that

h_{B} (c^{R_{+}}) = h_{B} (c) = c = c^{B}

. □

The sign abstraction is the function

h_{S} : R \to S

with

h_{S} (0) = 0

,

h_{S} (r) = - 1

for all strictly negative reals

r < 0

and

h_{S} (r) = 1

for all strictly positive reals

r > 0

.

Lemma 2.

The sign abstraction h_𝕊 is a homomorphism between

Σ_{a r i t h}

-structures.

Proof.

Let

r, r^{'} \in R

. For the multiplication we have

h_{S} (r *^{R} r^{'}) = h_{S} (r) *^{R} h_{S} (r^{'})

and thus

h_{S} (r *^{R} r^{'}) \in {h_{S} (r) *^{R} h_{S} (r^{'})} = h_{S} (r) *^{S} h_{S} (r^{'})

. For the addition, we have to distinguish cases. If r and

r^{'}

have the same sign, so

r +^{R} r^{'}

has the same sign, so that we have

h_{S} (r +^{R} r^{'}) \in h_{S} (r) +^{S} h_{S} (r^{'})

. If

r > 0

and

r^{'} < 0

or vice versa then we have

h_{S} (r) +^{S} h_{S} (r^{'}) = S

so that

h_{S} (r +^{R} r^{'}) \in S = h_{S} (r) +^{S} h_{S} (r^{'})

. The treatment of

-^{S}

and

/^{S}

is similar. For the constants, we have

h_{S} (0^{R}) = 0^{S}

and

h_{S} (1^{R}) = 1^{S}

. □

3. First-Order Logic

We recall the syntax and semantics of first-order logic with equality. For this, we fix a countably infinite set of variables

V

that will be ranged over by

x, y, z

.

3.1. Expressions

Given a ranked signature with constants and function symbols

Σ = C \cup ⋃_{n \geq 0} F^{(n)}

, the set of

Σ

-expressions contains all terms that can be constructed from constants and variables by using function symbols:

e_{1}, \dots, e_{n} \in E_{Σ}

::=

x ∣ c ∣ f (e_{1}, \dots, e_{n})

where

c \in C

,

n \geq 0

,

f \in F^{(n)}

,

x \in V

Let

fv (e)

be the set of all variables that occur in e. Given a subset

V \subseteq V

let

E_{Σ} (V)

be the subset of expression

e \in E_{Σ}

with

fv (e) \subseteq V

.

The semantics of

Σ

-expressions is defined in Figure 2. For any

Σ

-structure S and variable assignment

σ : V \to dom (S)

, any expression

e \in E_{Σ} (V)

denotes a set of values

〚 e 〛^{σ, S} \subseteq dom (S)

. This set is defined recursively by set-valued interpretation of the operators of the expressions in the structure S. If S is a

Σ

-algebra, then the result will always be a singleton.

3.2. Logic Formulas

The set of first-order formulas is the set of terms constructed with the usual first-order connectives from equations with symbols in

Σ

and variables in

V

:

ϕ \in F_{Σ}^{}

::=

e \overset{\circ}{=} e^{'} ∣ \exists x . ϕ ∣ ϕ \land ϕ ∣ \neg ϕ

where

e, e^{'} \in E_{Σ}

and

x \in V

A

Σ

-formula

ϕ \in F_{Σ}^{}

is a term, which either is a

Σ

-equation

e \overset{\circ}{=} e^{'}

with variables in

V

, an existentially quantified formula

\exists x . ϕ

, a conjunction

ϕ \land ϕ^{'}

, or a negation

\neg ϕ

. A system of

Σ

-equations is a conjunction of equations

e_{1} \overset{\circ}{=} e_{1}^{'} \land \dots \land e_{n} \overset{\circ}{=} e_{n}^{'}

where

e_{1}, e_{1}^{'}, \dots, e_{n}, e_{n}^{'} \in E_{Σ}

.

The set of free variables

fv (ϕ)

contains all those variables of

ϕ

that occur outside the scope of any existential quantifier in

ϕ

. Given a subset

V \subseteq V

we write

F_{Σ}^{} (V)

for the subset of formulas

ϕ \in F_{Σ}^{}

such that

fv (ϕ) \subseteq V

.

First-order formulas can be defined for providing the missing logical operators. Firstly, we can define disjunctions

ϕ \lor ϕ^{'} =_{def} \neg (\neg ϕ \land \neg ϕ^{'})

and implications

ϕ \to ϕ^{'} =_{def} \neg ϕ \lor ϕ^{'}

, and secondly, universally quantified formulas

\forall x . ϕ =_{def} \neg \exists x . \neg ϕ

. Note that these formulas are not negation-free (and thus John’s theorem cannot be applied to them). Third, we define the valid formula

true =_{def} \exists x . x \overset{\circ}{=} x

. Fourth, we write

⋀_{i = 1}^{n} ϕ_{i}

instead of

ϕ_{1} \land \dots \land ϕ_{n}

. In the case

n = 0

the conjunction is

true

. Fifth, for any vector of variables

y = (y_{1}, \dots, y_{n}) \in V^{n}

we will write

\exists y . ϕ

instead of

\exists y_{1} \dots \exists y_{n} . ϕ

.

For any

V \subseteq V

, the semantics of first-order formulas

ϕ \in F_{Σ}^{} (V)

for a

Σ

-structure S and a variable assignment

σ : V \to dom (S)

is the truth value

〚 ϕ 〛^{σ, S} \in B

defined in Figure 3.

Note that the equality symbol

\overset{\circ}{=}

is interpreted as nondisjointness, i.e., an equation

e \overset{\circ}{=} e^{'}

is true if and only if

〚 e 〛^{σ, S} \cap 〚 e^{'} 〛^{σ, S} \neq \emptyset

. In the case of

Σ

-algebras, the equality symbol

\overset{\circ}{=}

is indeed interpreted as equality of singletons. In the case of more general

Σ

-structures, though, it is not interpreted as set equality.

The set of solutions with domain V of a formula

ϕ \in F_{Σ}^{} (V)

over a

Σ

-algebra S is:

s o l_{V}^{S} (ϕ) = {σ : V \to dom (S) ∣ 〚 ϕ 〛^{σ, S} = 1}

If

V = fv (ϕ)

we omit the index V, i.e.,

s o l^{S} (ϕ) = s o l_{V}^{S} (ϕ)

.

Two formulas

ϕ, ϕ^{'} \in F_{Σ}^{}

are called S-equivalent if they have the same solution sets over S on

V = fv (ϕ) \cup fv (ϕ^{'})

, that is

s o l_{V}^{S} (ϕ) = s o l_{V}^{S} (ϕ^{'})

. Note that

y \overset{\circ}{=} y

is equivalent to

z \overset{\circ}{=} z

and also to

true

, i.e., to

\exists x . x \overset{\circ}{=} x

.

3.3. Examples

Since

B \subseteq R_{+}

we can define the boolean abstraction by a formula

y \overset{\circ}{=} h_{B} (x)

in

F_{Σ_{b o o l}}^{}

over

R_{+}

with two variables

x, y \in V

:

(x \overset{\circ}{=} 0 \land y \overset{\circ}{=} 0) \lor (\neg x \overset{\circ}{=} 0 \land y \overset{\circ}{=} 1)

Since

S \subseteq R

we can define the sign abstraction by a formula

y \overset{\circ}{=} h_{S} (x)

in

F_{Σ_{b o o l}}^{}

over

R

with two variables

x, y \in V

:

\begin{matrix} (x \overset{\circ}{=} 0 \land y \overset{\circ}{=} 0) \lor (x > 0 \land y \overset{\circ}{=} 1) \lor (x < 0 \land y + 1 \overset{\circ}{=} 0) \end{matrix}

where:

\begin{matrix} x \geq 0 & =_{def} & \exists z . x \overset{\circ}{=} z * z \\ x > 0 & =_{def} & x \geq 0 \land \neg (x \overset{\circ}{=} 0) \\ x < 0 & =_{def} & \neg x \geq 0 \end{matrix}

These definitions illustrate that both abstraction are closely related to strict inequations

x > 0

and

x < 0

. The boolean abstraction is concerned with strict positivity

x > 0

, while the sign abstraction is also concerned with strict negativity

x < 0

.

3.4. Semantic Properties of Free and Bound Variables

We need the following two standard lemmas on the role of free and bound variables for the solution sets of logic formulas. For any subset of variable assignments R of type

V^{'} \to dom (S)

and any disjoint sets of variables

V \cap V^{'} = \emptyset

we define

{ext}_{V}^{S} (R) = {σ \cup σ^{'} ∣ σ : V \to dom (S), σ^{'} \in R}

.

Lemma 3

(Cylindrification). If

V \cap fv (ϕ) = \emptyset

then

s o l_{V \cup fv (ϕ)}^{S} (ϕ) = {ext}_{V}^{S} (s o l^{S} (ϕ))

.

Proof.

We can show that

〚 e 〛^{σ, S} = 〚 e 〛^{σ_{| fv (e)}, S}

for all expressions

e \in E_{Σ}

with

fv (e)

disjoint to V and any variable assignment

σ : fv (e) \cup V \to dom (S)

by induction on the structure of expressions. From this, we can prove for all formulas

ϕ \in F_{Σ}^{}

such that

fv (ϕ)

is disjoint from V and

σ : fv (ϕ) \cup V \to dom (S)

that

〚 ϕ 〛^{σ, S} = 〚 ϕ 〛^{σ_{| fv (ϕ)}, S}

by induction on the structure of

Σ

-formulas. This implies the lemma. □

The projection

π_{a} (f)

of a function

f : A \to B

is its restriction

f_{| A \ {a}}

. The projection of a set F of functions

f : A \to B

is

π_{a} (F) = {π_{a} (f) ∣ f \in F}

.

Lemma 4

(Quantification is projection).

s o l^{S} (\exists x . ϕ) = π_{x} (s o l^{S} (ϕ))

.

Proof.

This is follows from the semantics of existential quantified formulas as follows:

s o l^{S} (\exists x . ϕ) = {σ_{| fv (ϕ) \ {x}} ∣ σ \in s o l^{S} (ϕ)} = π_{x} (s o l^{S} (ϕ))

□

4. Abstract Interpretation

We recall the notion of

Σ

-abstractions and use them for abstracting sets of concrete solutions of logic formulas within the usual framework of abstract interpretation. Due to John’s theorem, this abstraction can be soundly approximated by the abstract interpretation of logic formulas in the target structure of the

Σ

-abstraction. We will argue that John’s overapproximation shows the soundness of abstract interpretation in the classical framework of Cousot & Cousot [1]. We will then introduce the notion of exactness of a logic formula with respect to a

Σ

-abstraction and relate it to the completeness of abstract interpretation.

4.1. John’s Overapproximation for $Σ$ -Abstractions

The notion of

Σ

-abstraction from [6] generalizes at the same time over the boolean abstraction and the sign abstraction.

Definition 4.

A Σ-abstraction is a homomorphism

h : S \to Δ

between Σ-structures such that

dom (Δ) \subseteq dom (S)

.

The boolean abstraction

h_{B}

is a

Σ_{b o o l}

-abstraction by Lemma 1. The sign abstraction

h_{S}

is a

Σ_{b o o l}

-abstraction by Lemma 2.

Let

h : S \to Δ

be a

Σ

-abstraction and

V \subseteq V

. For any subset of assignments R of type

V \to dom (S)

, we define the abstraction:

h \circ R = {h \circ σ : V \to dom (Δ) ∣ σ \in R}

Theorem 1

(John’s Overapproximation [6,8,9]). For any Σ-abstraction

h : S \to Δ

between Σ-structures and any negation-free Σ-formula

ϕ \in F_{Σ}^{}

:

h \circ s o l^{S} (ϕ) \subseteq s o l^{Δ} (ϕ)

John’s theorem states that the abstraction with respect to h of the concrete solution set of a first-order formula can be overapproximated by abstract interpretation of the formula in the target structure of h.

We only give a brief sketch of the full proof, which can be found in [6]. Let

V = fv (ϕ)

and

σ : V \to dom (S)

. For any expression

e \in E_{Σ} (V)

, we can show

h (〚 e 〛^{σ, S}) = 〚 e 〛^{h \circ σ, Δ}

by induction on the structure of e. It then follows for any negation-free formula

ϕ \in F_{Σ}^{} (V)

that

〚 ϕ 〛^{σ, S} \leq 〚 ϕ 〛^{h \circ σ, Δ}

. This is equivalent to that

{h \circ σ ∣ σ \in s o l_{V}^{S} (ϕ)} \subseteq s o l_{V}^{Δ} (ϕ)

and thus

h \circ s o l_{V}^{S} (ϕ) \subseteq s o l_{V}^{Δ} (ϕ)

. Since

V = fv (ϕ)

, it follows that

h \circ s o l^{S} (ϕ) \subseteq s o l^{Δ} (ϕ)

as required.

4.2. Exactness of $Σ$ -Formulas for $Σ$ -Abstractions

As a new contribution, we introduce the notion of exactness of first-order formulas with respect to a

Σ

-abstraction.

Definition 5

(h-Exactness). Let

h : S \to Δ

be a Σ-abstraction and

ϕ \in F_{Σ}^{} (V)

a formula. We call ϕ h-exact with respect to V if h

\circ s o l_{V}^{S} (ϕ) = s o l_{V}^{Δ} (ϕ) .

We call ϕ h-exact if ϕ is h-exact with respect to

fv (ϕ)

.

For instance, the linear equation system

ϕ

equal to

x + y \overset{\circ}{=} x + z

is neither

h_{B}

-exact nor

h_{S}

-exact. However it is equivalent to

y \overset{\circ}{=} z

which is both

h_{B}

-exact and

h_{S}

-exact. To see this note that

τ = [x / 1, y / 1, z / 0]

belongs to

s o l^{B} (ϕ)

but not to

h_{B} \circ s o l^{R_{+}} (ϕ)

since

τ (y) \neq τ (z)

. The same assignment also belongs to

s o l^{S} (ϕ)

but not to

h_{S} \circ s o l^{R} (ϕ)

since

τ (y) \neq τ (z)

.

4.3. Soundness and Completeness of Abstract Interpretation

John’s theorem is related to the soundness of abstract interpretation and the notion of exactness to its completeness. To state the precise relationship, we need to embed our setting into the classical framework of abstract interpretation [1,10].

When considering formulas as programs, the usual framework of abstract interpretation of programs applies to the interpretation of the formulas (programs) in the target structure of the

Σ

-abstraction. More formally, we fix a finite subset of variables

V \subseteq V

and consider the subset of formulas as programs:

P = {ϕ \in F_{Σ}^{} (V) ∣ ϕ is negation - free}

The semantics of a program

ϕ \in P

over a given

Σ

-structure S is the set of its solutions over S:

〚 ϕ 〛^{} = s o l^{S} (ϕ)

The range of the semantics mapping is the space of concrete values

C = 2^{{σ ∣ σ : V \to dom (S)}}

. Note that

(C, \subseteq, \cap, \cup)

is a complete lattice. An abstract interpretation of a program

ϕ \in P

maps

ϕ

to the set of its solutions over

Δ

:

〚 ϕ 〛^{♯} = s o l^{Δ} (ϕ)

The range of the abstract interpretation is the abstract domain

A = 2^{{τ ∣ τ : V \to dom (Δ)}}

. Clearly,

(A, \subseteq, \cap, \cup)

is also a complete lattice. We define the abstraction function

α_{h} : C \to A

of our Galois connection such that for subsets of concrete assignments

R \subseteq C

:

α_{h} (R) = h \circ R

Definition 6

(Cousot & Cousot [1], Giacobazzi, Ranzato & Scozzari [10]). An abstract interpretation

〚 . 〛^{♯} : P \to A

is sound for an abstraction

α : C \to A

with respect to the program semantics

〚 . 〛^{} : P \to C

if for all programs

ϕ \in P

it holds that

α (〚 ϕ 〛^{}) \subseteq 〚 ϕ 〛^{♯}

. It is complete if all programs

ϕ \in P

satisfy

α (〚 ϕ 〛^{}) = 〚 ϕ 〛^{♯}

.

John’s theorem states that the abstract interpretation

α_{h}

of negation free-formulas

ϕ \in P

over

Δ

is sound for the abstraction of

s o l^{S} (ϕ)

with respect to the

Σ

-abstraction

h : S \to Δ

. Furthermore, if all formulas of

P

are h-exact then abstract interpretation over

Δ

is complete for abstraction

α_{h}

. As illustrated above, abstract interpretation over

B

fails to be complete for the abstraction

α_{h_{B}}

, and similarly, abstract interpretation over

S

fails to be complete for the abstraction

α_{h_{S}}

. Note that the completeness of abstract interpretations was largely studied in the context of program analysis (see e.g., Section 8 of [10] for an overview).

In the present article, we study the problem of exact rewriting for

h_{B}

. The question is how to rewrite a

Σ_{b o o l}

-formula into a

h_{B}

-exact formula that is

R_{+}

-equivalent. Note that exact rewriting of linear equation system for

h_{B}

is a different problem than to decide whether abstract interpretation is complete for

α_{h_{B}}

on linear equation systems. Still, both notions are closely related: exact rewriting can help to improve the precision of abstract interpretation just in the case where it is not already complete, i.e., maximally precise. Otherwise, exact rewriting is trivial.

In the case of the sign abstraction, we do not have any algorithmic idea of how to do exact rewriting for linear equation systems. Therefore, we study the easier problem of exact rewriting for the boolean abstraction of linear equation systems in the first place. Given an

h_{B}

-exact formula

ϕ

, we can compute the abstraction

h_{B} \circ s o l^{R_{+}} (ϕ) = s o l^{B} (ϕ)

by finite domain constraints programming. We then use exact rewriting for the boolean abstraction to compute sign abstractions of linear equation systems

h_{S} \circ s o l^{R} (ϕ)

, rather than relying on exact rewriting for the sign abstraction itself. For this, we use first-order definitions beside of finite domain constraint programming.

4.4. Galois Connection

We finally introduce the concretization operation that corresponds to the abstraction of the solution set of a logic formula with respect to a

Σ

-abstraction, and show that the pair of abstraction and concretization forms a Galois connection.

Given a

Σ

-abstraction

h : S \to Δ

, and a set R of variable assignments to

dom (Δ)

, we define the left-decomposition of R with respect to h as the following set of variable assignments to

dom (S)

:

\begin{matrix} h \circ - R & =_{def} {σ ∣ h \circ σ \in R} \end{matrix}

So let

α_{h} : C \to A

be the abstraction induced by

Σ

-abstraction h. We define the corresponding concretization function

γ_{h} : A \to C

such that for all abstract assignments

R \subseteq A

:

γ_{h} (R) = h \circ - R =_{def} {σ \in C ∣ h \circ σ \in R}

Lemma 5.

(A, C, α_{h}, γ_{h})

is a Galois connection, i.e., for all

R \in C

and

T \in A

:

α_{h} (R) \subseteq T if and only if R \subseteq γ_{h} (T)

Proof.

If

h \circ R \subseteq T

then

h \circ - h \circ R \subseteq \circ - T

and since

R \subseteq h \circ - h \circ R

we have

R \subseteq h \circ - T

. If conversely

R \subseteq h \circ - T

then

h \circ R \subseteq h \circ h \circ - T

and since

h \circ h \circ - T = T

it follows that

h \circ R \subseteq T

. □

5. Equation Systems, Positivity, and Triangularity

We study systems of

Σ_{b o o l}

-equations for positivity and triangularity. These notions will be essential for showing

B

-exactness. We are not only interested in homogeneous linear equations but also in more general polynomial equations without constant term.

5.1. Classes of Equation Systems

Let

e_{1}, \dots, e_{n} \in E_{Σ_{b o o l}}

be a sequence of expressions and

n \in N

. If

n \neq 0

we define

\sum_{i = 1}^{n} e_{i} =_{def} e_{1} + \dots + e_{n}

and

\prod_{i = 1}^{n} e_{i} =_{def} e_{1} * \dots * e_{n}

. For

n = 0

, we define

\sum_{i = 1}^{n} e_{i} = 0

and

\prod_{i = 1}^{n} e_{i} = 1

. Furthermore, for any expression

e \in E_{Σ_{b o o l}}

we define:

n e =_{def} \sum_{i = 1}^{n} e and e^{n} =_{def} \prod_{i = 1}^{n} e

A polynomial (with natural coefficients) is a

Σ_{b o o l}

-expression of the following form:

\sum_{j = 1}^{l} n_{j} \prod_{k = 1}^{i_{j}} x_{j, k}^{m_{j, k}}

where l and

i_{j}

are natural numbers,

x_{1, 1}, \dots, x_{l, i_{l}}

variables, all

n_{j} \neq 0

are natural numbers called the coefficients, and all

m_{j, k} \neq 0

are natural numbers called the exponents. The products

\prod_{k = 1}^{i_{j}} x_{j, k}^{m_{j, k}}

are called the monomials of the polynomial.

Definition 7.

A polynomial

\sum_{j = 1}^{l} n_{j} \prod_{k = 1}^{i_{j}} x_{j, k}^{m_{j, k}}

with natural coefficients

n_{j} \neq 0

has no constant term if none of its monomials are equal to 1, i.e.,

i_{j} \neq 0

for all

1 \leq j \leq l

. It is linear if all its monomials are variables, i.e.,

i_{j} = 1

and

m^{j, 1} = \dots = m^{j, i_{j}} = 1

for all

1 \leq j \leq l

.

A polynomial equation is a

Σ_{b o o l}

-equation

p \overset{\circ}{=} p^{'}

between polynomials. A polynomial equation system is a system of polynomial equations.

Linear polynomials have the form

\sum_{j = 1}^{l} n_{j} x_{j, 1}

where l and all

n_{j} \neq 0

are naturals and all

x_{j, 1}

are variables. In particular, linear polynomials do not have a constant term. Note that the constant 0 is equal to the linear polynomial with

l = 0

. A (homogeneous) linear equation is a polynomial equation with linear polynomials, so without constant terms. A (homogeneous) linear equation system is a system of linear equations.

A (homogeneous) integer matrix equation has the form

A y \overset{\circ}{=} 0

where A is a

n \times m

matrix of integers for some naturals

m, n

such that

y \in V^{m}

and

0 \in {0}^{n}

. Any integer matrix equation can be turned into a linear equation system with natural coefficients, by bringing the negative coefficients positively on the right-hand side. For instance, the linear integer matrix equation:

(\begin{matrix} 3 & 0 \\ 2 & - 5 \end{matrix}) (\begin{matrix} x \\ y \end{matrix}) \overset{\circ}{=} (\begin{matrix} 0 \\ 0 \end{matrix})

corresponds to the following system of linear

Σ_{b o o l}

-equations:

3 x \overset{\circ}{=} 0 \land 2 x \overset{\circ}{=} 5 y

Therefore, we will sometimes confuse an integer matrix equations with the corresponding system of linear

Σ_{b o o l}

-equations. Conversely, any system of linear

Σ_{b o o l}

-equations can be converted into a integer matrix equation by moving the positive right-hand sides negatively to the left and factorizing the expressions for the different occurrences of the same variable.

5.2. Positivity and Triangularity

such that

σ (y), σ (y^{'})

We next define positivity and triangularity properties for equation systems. These are key properties to show

B

-exactness of linear equation systems.

Definition 8.

A

Σ_{b o o l}

-equation is called positive if it has the form

e \overset{\circ}{=} 0

and quasi-positive if it has the form

e \overset{\circ}{=} n y

, where

n \in N

,

y \in V

, and

e \in E_{Σ_{b o o l}}

. We call a system of

Σ_{b o o l}

-equations positive respectively quasi-positive if all its equations are.

This definition makes sense, since all constants in

Σ_{b o o l}

-expressions are positive and all operators of

Σ_{b o o l}

-expressions preserve positivity. Note also that any positive equation is quasipositive since the constant 0 is equal to the polynomial

0 y

.

This above system of linear equations is quasipositive, but not positive since

5 y

appears on a right-hand side. More generally, the linear equation system for a integer matrix equation

A y \overset{\circ}{=} 0

is positive if and only if all integers in A are positive, and quasipositive if each line of A contains at most one negative integer.

Definition 9.

We call a quasipositive system of

Σ_{b o o l}

-equations triangular if it has the form

⋀_{l = 1}^{n} e_{l} \overset{\circ}{=} n_{l} y_{l}

such that the variables

y_{l}

are l-fresh for all

1 \leq l \leq n

, i.e.,

y_{l} \notin fv (⋀_{i = 1}^{l - 1} e_{i} \overset{\circ}{=} e_{i}^{'})

and if

n_{l} \neq 0

then

y_{l} \notin fv (e_{l})

. We call the quasi-positive polynomial system strongly-triangular if it is triangular and satisfies

n_{l} \neq 0

for all

1 \leq l \leq n

.

The above linear equation system is triangular, but not strongly triangular since the right-hand side of the first equation is 0. Consider an integer matrix equation

A y \overset{\circ}{=} 0

. If A is positive and triangular, then the corresponding linear equation system is positive and triangular too. For being quasipositive and strongly-triangular, the integers below the diagonal of A must be negative, those on the diagonal must be strictly negative, and those on the right of the diagonal must be positive.

5.3. Linear Equation Systems and Elementary Modes

We next show that elementary modes [12,13,14,15] can be used to transform systems of linear equations into

R_{+}

-equivalent systems that are quasi-positive and strongly-triangular.

We first recall the necessary definitions and folklore results on elementary modes and the double description method. We limit the presentation to equations with integer coefficients solved in

R_{+}

, since more general definitions and results for elementary modes in

R

are not needed for this paper.

Definition 10.

The support of a function

σ : V \to R

is

supp (σ) = {y \in V ∣ σ (y) \neq 0}

.

Definition 11

(Elementary Modes). An elementary mode of an integer matrix

A \in Z^{n, m}

is a vector

n \in N^{n}

such that for any sequence of pairwise distinct variables

y \in V^{n}

the function

σ = [y / n]

is a solution in

s o l^{R_{+}} (A y \overset{\circ}{=} 0)

such that:

$supp (σ)$ is minimal, i.e., there exist no $σ^{'} \in s o l^{S} (ϕ)$ such that $supp (σ^{'}) ⊊ supp (σ)$ ,
σ is normalized, i.e., there exist variables $y, y^{'}$ in $y$ such that $σ (y)$ and $σ (y^{'})$ are coprimes (their greatest common divisor is 1).

The elementary modes of a matrix A are the extreme directions of the polyhedral cone

s o l^{R_{+}} (A y \overset{\circ}{=} 0)

. This implies that any solution of the linear system can be expressed as a weighted sum of its elementary modes, where all the weights are non negative. Due to normalization, the number of elementary modes is finite for all integer matrices.

Theorem 2

(Folklore). For any integer matrix

A \in Z^{m, n}

one can compute a matrix of natural numbers

E \in N^{n, o}

in at most exponential time, such that the

Σ_{b o o l}

-formulas for

A y \overset{\circ}{=} 0

and

\exists x . E x \overset{\circ}{=} y

are

R_{+}

-equivalent for all vectors

y \in V^{n}

and

x \in V^{o}

of pairwise distince variables. Furthermore, the o columns of E are the elementary modes of A.

We note that Theorem 2 can be lifted to matrices of rational numbers

Q

, since any rational matrix equation

A y \overset{\circ}{=} 0

can be rewritten to a integer matrix equation with the same

R_{+}

-solution set, by multiplying with the natural numbers in the denominators of the rational numbers. The freely available cddlib tool in the rational mode [24] inputs a matrix

A \in Q^{n, m}

, and outputs the list of (integer) elementary modes of A. From this list, we can construct the matrix E for A by aligning the elementary modes of A as the columns of E.

Note that the interface of the cddlib tool is more general, in that it applies to rational matrix inequations interpreted over the reals, rather than to rational matrix equations interpreted over the positive reals: it permits to compute the normalized extreme directions of the polyedral cone

s o l^{R} (B y \geq 0)

for any rational matrix inequation B over the reals. If one wants to compute the elementary modes of a rational matrix A – that is the normalized extreme directions of polyhedral cones of over the positive reals

s o l^{R_{+}} (A y \overset{\circ}{=} 0)

– then one can chose

B = (\begin{matrix} A \\ - A \\ I d \end{matrix})

where

I d

is the identity matrix with as many columns as A.

Corollary 1

(Elementary Mode Rewriting). Given a system of linear equations

ϕ \in F_{Σ_{b o o l}}^{}

, one can compute in at most exponential time an

R_{+}

-equivalent formula

emr (ϕ)

that has the form

\exists x . ϕ^{'}

where

ϕ^{'}

is quasi-positive and strongly-triangular system of equations.

Proof.

Any system of linear equations

ϕ

can be converted into some integer matrix equation

A y \overset{\circ}{=} 0

where

y

is a vector that contains all variables in

fv (ϕ)

exactly once. Let E be a matrix of elementary modes of A from Theorem 2. This theorem states that

A y \overset{\circ}{=} 0

and thus

ϕ

is

R_{+}

-equivalent to

\exists x . E x \overset{\circ}{=} y

for some vector of fresh variables

x

. So let

emr (ϕ)

be

\exists x . ϕ^{'}

and

ϕ^{'}

be

E x \overset{\circ}{=} y

. Since all entries of E are positive, the variables in

y

are pairwise distinct, and the variables in

x

are chosen freshly, it follows that

ϕ^{'}

is both quasi-positive and strongly-triangular. □

We have implemented the elementary mode rewriting in Python based on the cddlib tool, and plan to make our tool publically available soon. An example input is the system of linear

Σ_{b o o l}

-equations

ϕ_{0}

given in Figure 4. The corresponding integer matrix equation system is given there too. The elementary modes of the matrix of this system are the vectors

(1, 0, 1, 1)

and

(1, 1, 0, 0)

. When putting these vectors in the columns of a new matrix, our tool returns the elementary mode rewriting

emr (ϕ_{0})

in Figure 5.

6. $h_{B}$ -Exact Rewriting of Linear Equation Systems

Our next objective is to study the preservation of h-exactness by logical operators. The main difficulty of this paper is the fact that h-exactness is not preserved by conjunction. Nevertheless, as we will show next, it is preserved by disjunction and existential quantification.

To do so we first show that h-exactness is preserved when adding variables. For this, we have to assume that the

Σ

-abstraction h is sujective, which will be the case of all

Σ

-abstractions of interest.

Lemma 6

(Variable extension preserves exactness). Let

h : S \to Δ

be a Σ-abstraction that is surjective and

ϕ \in F_{Σ}^{} (V)

a formula. Then the h-exactness of ϕ implies the h-exactness of ϕ with respect to V.

Proof.

This follows from that abstractions of solutions of

ϕ

can be extended arbitrarily to variables that do not appear freely in

ϕ

as stated by the following claim.

Claim 1.

For all

σ : V \to Δ

:

σ \in h \circ s o l^{S} (ϕ)

iff

σ_{| fv (ϕ)} \in h \circ s o l^{S} (ϕ)

.

For the one direction let

σ \in h \circ s o l_{V}^{S} (ϕ)

. Then, there exists

σ \in s o l_{V}^{S} (ϕ)

such that

σ = h \circ σ

. Since

V \supseteq fv (ϕ)

it follows that

σ_{| fv (ϕ)} \in s o l^{S} (ϕ)

. Furthermore

σ_{| fv (ϕ)} = h \circ σ_{| fv (ϕ)}

and thus

σ_{| fv (ϕ)} \in h \circ s o l^{S} (ϕ)

.

For the other direction let

σ_{| fv (ϕ))} \in h \circ s o l^{S} (ϕ)

. Then, there exists

σ \in s o l^{S} (ϕ)

such that

σ_{| fv (ϕ)} = h \circ σ

. For any

y \in V \ fv (ϕ)

let

s_{y} \in dom (S)

be such that

h (s_{y}) = σ (y)

. Such values exists since h is surjective. Now define

σ^{'} = σ [y / s_{y} ∣ y \in V \ fv (ϕ)]

. Since

V \supseteq fv (ϕ)

it follows that

σ^{'} \in s o l_{V}^{S} (ϕ)

. Furthermore,

σ = h \circ σ^{'}

, so

σ \in h \circ s o l_{V}^{S} (ϕ)

. □

For the case of disjunction, we need a basic property of unions (joins) which fails for intersections (meets).

Lemma 7

(Abstraction

α_{h}

preserves joins). Let V be a set of variables,

R_{1}

and

R_{2}

be subsets of assignments of type

V \to dom (S)

and

h : S \to Δ

be a Σ-abstraction. Then:

h \circ (R_{1} \cup R_{2}) = h \circ R_{1} \cup h \circ R_{2}

Proof.

This lemma follows from the following equivalences:

\begin{matrix} τ \in h \circ (R_{1} \cup R_{2}) & \Leftrightarrow & \exists σ . σ \in R_{1} \cup R_{2} \land τ = h \circ σ \\ \Leftrightarrow & \exists σ . (σ \in R_{1} \lor σ \in R_{2}) \land τ = h \circ σ \\ \Leftrightarrow & \exists σ . (σ \in R_{1} \land τ = h \circ σ) \lor (σ \in R_{2} \land τ = h \circ σ) \\ \Leftrightarrow & τ \in h \circ R_{1} \lor τ \in h \circ R_{2} \\ \Leftrightarrow & τ \in h \circ R_{1} \cup h \circ R_{2} \end{matrix}

□

Proposition 1.

The disjunction of h-exact formulas is h-exact.

Proof.

Let

ϕ_{1}

and

ϕ_{2}

be negation free formulas that are h-exact. Let

V = fv (ϕ_{1}) \cup fv (ϕ_{2})

. Lemma 6 shows that

ϕ_{1}

and

ϕ_{2}

are also h-exact with respect to the extended variable set V, i.e., for both

i \in {1, 2}

:

\begin{matrix} h \circ s o l_{V}^{S} (ϕ_{i}) & = & s o l_{V}^{Δ} (ϕ_{i}) \end{matrix}

The h-exactness of the disjunction

ϕ_{1} \lor ϕ_{2}

can now be shown as follows:

\begin{matrix} h \circ s o l^{S} (ϕ_{1} \lor ϕ_{2}) & = & h \circ (s o l_{V}^{S} (ϕ_{1}) \cup s o l_{V}^{S} (ϕ_{2})) \\ = & h \circ s o l_{V}^{S} (ϕ_{1}) \cup h \circ s o l_{V}^{S} (ϕ_{2}) & by Lemma 7 \\ = & s o l_{V}^{Δ} (ϕ_{1}) \cup s o l_{V}^{Δ} (ϕ_{2}) & by h - exactness of ϕ_{1} and ϕ_{2} wrt . V \\ = & s o l^{Δ} (ϕ_{1} \lor ϕ_{2}) \end{matrix}

□

Lemma 8

(Projection commutes with abstraction). For any Σ-abstraction

h : S \to Δ

, subset R of assignments of type

V \to S

, and variable

x \in V

:

\circ π_{x} (R) = π_{x} (h \circ R)

.

Proof.

For all

σ : V \to dom (S)

we have

h \circ π_{x} (σ) = h \circ σ_{| V \ {x}} = {(h \circ σ)}_{| V \ {x}} = π_{x} (h \circ σ)

. □

Proposition 2

(Quantification preserves exactness). For any surjective Σ-abstraction

h : S \to Δ

and formula

\exists x . ϕ \in F_{Σ}^{}

, if ϕ is h-exact then

\exists x . ϕ

is h-exact.

Proof.

Let

ϕ

be h-exact. By definition

ϕ

is h-exact with respect to

V = fv (ϕ)

. Since h is assumed to be surjective, Lemma 6 implies that

ϕ

is h-exact with respect to

V \cup {x}

(independently of whether x occurs freely in

ϕ

or not). Hence:

\begin{matrix} h (s o l^{S} (\exists x . ϕ)) & = & (π_{x} (s o l^{S} (ϕ))) & by Lemma 4 \\ = & π_{x} h ((s o l^{S} (ϕ))) & by Lemma 8 \\ = & π_{x} (s o l^{Δ} (ϕ)) & \sin ce ϕ is h - exact \\ = & s o l^{Δ} (\exists x . ϕ) & by Lemma 4 \end{matrix}

□

We next study the h-exactness for strongly-triangular systems of

Σ_{b o o l}

-equations, under the condition that h is an abstraction between

Σ_{b o o l}

-algebras with unique division (see Definition 12).

Lemma 9

(Singleton property). If S is a Σ-algebra,

e \in E_{Σ} (V)

, and

σ : V \to S

a variable assignment, then the set

〚 e 〛^{σ, S}

is a singleton.

Proof.

By induction on the structure of expressions

e \in E_{Σ} (V)

:

Case of constants

c \in C

. The set

〚 c 〛^{σ, S} = {c^{S}}

is a singleton.

Case of variables

x \in V

. The set

〚 x 〛^{σ, S} = {σ (x)}

is a singleton.

Case

f (e_{1}, \dots, e_{n})

where

e_{i} \in E_{Σ} (V)

and

f \in F^{(n)}

.

\begin{matrix} 〚 f (e_{1}, \dots, e_{n}) 〛^{σ, S} & = & {f^{S} (s_{1}, \dots, s_{n}) ∣ s_{i} \in 〚 e_{i} 〛^{σ, S}} \end{matrix}

This set is a singleton since

〚 e_{i} 〛^{σ, S}

are singletons by induction hypothesis, meaning that

f^{S} (〚 e_{1} 〛^{σ, S}, \dots, 〚 e_{n} 〛^{σ, S})

is also a singleton since S is a

Σ

-algebra. □

A

Σ

-algebra is a

Σ

-structure with the singleton property. Let

ele

be the function that maps any singleton to the element that it contains.

Definition 12.

We say that a

Σ_{b o o l}

-structure S has unique division if it satisfies the first-order formula

\forall x . \exists^{= 1} y . n y \overset{\circ}{=} x

for all nonzero natural numbers

n \in N

.

Clearly, the

Σ_{b o o l}

-structures

R_{+}

,

B

, and

S

have unique division. Note, however, that

S

is not a

Σ_{b o o l}

-algebra, so that the following two Propositions 3 and 4 cannot be applied to

S

instead of

B

.

For any element s of the domain of a

Σ_{b o o l}

-structure S with unique division and any nonzero natural number

n \in N

, we denote by

\frac{s}{n}

the unique element of

{σ (y) ∣ σ \in s o l^{S} (n y \overset{\circ}{=} z), σ (z) = s}

.

Lemma 10.

Let

ϕ \in F_{Σ_{b o o l}}^{}

be a formula and S a

Σ_{b o o l}

-algebra with unique division. For nonzero natural number n, variable

y \notin fv (ϕ)

, and expression

e \in E_{Σ} (fv (ϕ))

:

s o l^{S} (ϕ \land n y \overset{\circ}{=} e) = {σ [y / \frac{ele (〚 e 〛^{σ, S})}{n}] ∣ σ \in s o l^{S} (ϕ)}

Proof.

We fix some

σ : fv (ϕ) \to dom (S)

arbitrarily. Since S is a

Σ_{b o o l}

-algebra,

〚 e 〛^{σ, S}

is a singleton and

fv (e) \subseteq V (ϕ)

,

ele (〚 e 〛^{σ, S})

is defined uniquely. Furthermore S has unique division, so that

\frac{ele (〚 e 〛^{σ, S})}{n}

is a well-defined element of

dom (S)

. Therefore and since

y \notin fv (ϕ)

,

σ [y / \frac{ele (〚 e 〛^{σ, S})}{n}]

is the unique solution of the equation

n y \overset{\circ}{=} e

that extends on

σ

.

Firstly, we prove the inclusion “⊇”. Let

σ \in s o l^{S} (ϕ)

,

y \notin fv (ϕ)

, and

σ [y / \frac{ele (〚 e 〛^{σ, S})}{n}]

is a solution of

n y \overset{\circ}{=} e

, it follows that

σ [y / \frac{ele (〚 e 〛^{σ, S})}{n}]

is a solution of

ϕ \land n y \overset{\circ}{=} e

.

Secondly, we prove the inverse inclusion “⊆”. Let

σ \in s o l^{S} (ϕ \land n y \overset{\circ}{=} e)

. Since

σ [y / \frac{ele (〚 e 〛^{σ, S})}{n}]

is the unique solution of the equation

n y \overset{\circ}{=} e

that extends on

σ^{'} = σ_{| fv (ϕ)}

it follows that

σ (y) = \frac{ele (〚 e 〛^{σ, S})}{n}

so that

σ = σ^{'} [y / \frac{ele (〚 e 〛^{σ, S})}{n}]

while

σ^{'} \in s o l^{S} (ϕ)

. □

Proposition 3.

Let

ϕ \in F_{Σ_{b o o l}}^{} (V)

a formula,

n \neq 0

a natural number,

e \in E_{Σ_{b o o l}} (V)

an expression,

y \notin V

, and

h : S \to Δ

a

Σ_{b o o l}

-abstraction between

Σ_{b o o l}

-algebras with unique division. Under these conditions, if ϕ is h-exact then

ϕ \land e \overset{\circ}{=} n y

is h-exact.

Proof.

Let

e \in E_{Σ_{b o o l}} (V)

an expression. □

Claim 2.

For any

σ : V \to R_{+}

:

h (ele (〚 e 〛^{σ, S})) = ele (〚 e 〛^{h \circ σ, Δ})

.

This can be seen as follows. For any

σ : V \to S

Theorem 1 on homomorphism yields

h (〚 e 〛^{σ, S}) \subseteq 〚 e 〛^{h \circ σ, Δ}

. Since S and

Δ

are both

Σ

-algebras, the sets

〚 e 〛^{σ, S}

and

〚 e 〛^{h \circ σ, Δ}

are both singletons by Lemma 9, so that

h (ele (〚 e 〛^{σ, S})) = ele (〚 e 〛^{h \circ σ, Δ})

.

Claim 3.

For any

s \in dom (S)

and

n \neq 0

a natural number:

h (\frac{s}{n}) = \frac{h (s)}{n}

.

Since S is assumed to have unique division

s^{'} = \frac{s}{n}

is well-defined as the unique element of

dom (S)

such that

\underset{n}{\underset{︸}{s^{'} +^{S} \dots +^{S} s^{'}}} = s

. Hence,

h (\underset{n}{\underset{︸}{s^{'} +^{S} \dots +^{S} s^{'}}}) = h (s)

and since h is a homomorphism, it follows that

\underset{n}{\underset{︸}{h (s^{'}) +^{Δ} \dots +^{Δ} h (s^{'})}} = h (s)

. Since

Δ

is assumed to have unique division, this implies that

h (s^{'}) = \frac{h (s)}{n}

.

The Proposition can now be shown based on these two claims. Let

ϕ

be h-exact,

y \notin V

, and

fv (e) \subseteq V

. We have to show that

ϕ \land n y \overset{\circ}{=} e

is h-exact too:

\begin{matrix} h \circ s o l^{S} (ϕ \land e \overset{\circ}{=} n y) & = & h \circ {σ [y / \frac{ele (〚 e 〛^{σ, S})}{n}] ∣ σ \in s o l^{S} (ϕ)} & by Lemma 10 \\ = & {(h \circ σ) [y / h (\frac{ele (〚 e 〛^{σ, S})}{n})] ∣ σ \in s o l^{S} (ϕ)} & elementary \\ = & {σ [y / h (\frac{ele (〚 e 〛^{σ, S})}{n})] ∣ σ \in s o l^{Δ} (ϕ)} & h - exactness of ϕ \\ = & {σ [y / \frac{h (ele (〚 e 〛^{σ, S}))}{n}] ∣ σ \in s o l^{Δ} (ϕ)} & by Claim 3 \\ = & {σ [y / \frac{ele (〚 e 〛^{h \circ σ, Δ})}{n}] ∣ σ \in s o l^{Δ} (ϕ)} & by Claim 2 \\ = & s o l^{Δ} (ϕ \land e \overset{\circ}{=} n y) & by Lemma 10 \end{matrix}

Proposition 4.

Let

h : S \to Δ

be a

Σ_{b o o l}

-abstraction between algebras with unique division. Then any strongly-triangular system of

Σ_{b o o l}

-equations is h-exact.

Proof.

Any strongly-triangular system of equations has the form

⋀_{i = 1}^{n} e_{i} \overset{\circ}{=} n_{i} y_{i}

where n and

n_{i} \neq 0

are naturals and

y_{i}

is i-fresh for all

1 \leq i \leq n

. The proof is by induction on n. In the case

n = 0

, the conjunction is equal to

true

which is h-exact since

h (s o l^{S} (t r u e)) =

s o l^{Δ} (t r u e)

. In the case

n > 0

, we have by induction hypothesis that

⋀_{j = 1}^{i - 1} e_{j} \overset{\circ}{=} n_{j} y_{j}

is h-exact. Since

n_{i} \neq 0

it follows from Proposition 3 that that

e_{i} \overset{\circ}{=} n_{i} y_{i} \land ⋀_{j = 1}^{i - 1} e_{j} \overset{\circ}{=} n_{j} y_{j}

is h-exact. □

We notice that Proposition 4 remains true for triangular systems that are not strongly-triangular. This follows from results that we can only present in the next section (Theorem 4 and Proposition 5), since they require an additional argument.

Theorem 3

(

h_{B}

-Exactness). Quasi-positive strongly-triangular polynomial systems are

h_{B}

-exact.

Proof.

The

Σ_{b o o l}

-algebras

R_{+}

and

B

have unique division, so we can apply Proposition 4 for proving the theorem. □

We note that the analogous statement for

S

instead of

B

fails, even though

S

has unique division. The problem is that

S

is not a

Σ_{b o o l}

-algebra. As a counter-example, reconsider the strongly-triangular system of quasi-positive system equations:

u + v \overset{\circ}{=} x \land u + v \overset{\circ}{=} y

This system implies

x \overset{\circ}{=} y

over

R

but accept the abstract solution

[u / 1, v / - 1, x / 1, y / - 1]

mapping x and y to distinct signs, so it is not

h_{S}

-exact. Nevertheless, it is

h_{B}

-exact by Theorem 3.

Corollary 2

(

h_{B}

-exact rewriting of linear equation systems). For any linear

Σ_{b o o l}

-equations ϕ the elementary mode rewriting

emr (ϕ) \in F_{Σ_{b o o l}}^{}

is

R_{+}

-equivalent,

h_{B}

-exact, and can be computed in at most exponential time from ϕ.

Proof.

The elementary modes rewriting Corollary 1 shows that any linear

Σ_{b o o l}

-equation system

ϕ

is

R_{+}

-equivalent a formula

emr (ϕ)

of the form

\exists z . ϕ^{'}

such that

ϕ^{'}

is a quasi-positive strongly-triangular linear equation system. Theorem 3 shows that any quasi-positive strongly-triangular linear equation system is

h_{B}

-exact, so is

ϕ^{'}

. Existential quantification preserves

h_{B}

-exactness by Proposition 2, so

emr (ϕ)

is

h_{B}

-exact too. □

This

h_{B}

-exact rewriting permits us to compute the boolean abstraction of any system of linear

Σ_{b o o l}

-equations by computing the

B

-solutions of the

R_{+}

-equivalent

h_{B}

-exact formula. The latter can be done by finite domain constraint programming.

Our objective to find an algorithm for computing the sign abstraction of a system of linear

Σ_{b o o l}

-equations remains open. We finally approach it in Section 9. While the idea is to use the

h_{B}

-exact rewriting algorithm, we first need to generalize it from linear systems to mixed systems. This is done in Section 8. The generalization relies on the notion of

h_{B}

-invariance, which we discuss next in Section 7.

7. Invariance

A problem that we need to overcome is that conjunctions of two h-exact formulas may not be h-exact. The situation changes when assuming the following notion of h-invariance for at least one of the two formulas.

Definition 13

(Invariance). Let

h : S \to Δ

be a Σ-abstraction and

V \subseteq V

a subset of variables. We call a subset R of variable assignments of type

V \to dom (S)

h-invariant iff:

\forall σ, σ^{'} : V \to dom (S) . (σ \in R \land h \circ σ = h \circ σ^{'} \Rightarrow σ^{'} \in R) .

We call a Σ-formula ϕ h-invariant if its solution set

s o l^{S} (ϕ)

is.

The relevance of the notion of invariance for exactness of conjunctions—that we will formalize in Proposition 5—is due to the the following lemma:

Lemma 11.

If either

R_{1}

or

R_{2}

are h-invariant then:

h \circ (R_{1} \cap R_{2}) = h \circ R_{1} \cap h \circ R_{2}

.

Proof.

The one inclusion is straightforward without invariance:

\begin{matrix} h \circ (R_{1} \cap R_{2}) & = & {h \circ σ ∣ σ \in R_{1}, σ \in R_{2}} \\ \subseteq & {h \circ σ ∣ σ \in R_{1}} \cap {h \circ σ ∣ σ \in R_{2}} \\ = & h \circ R_{1} \cap h \circ R_{2} \end{matrix}

For the other inclusion, we can assume without loss of generality that

R_{1}

is h-invariant. So let

τ \in h \circ R_{1} \cap h \circ R_{2}

. Then, there exist

σ_{1} \in R_{1}

and

σ_{2} \in R_{2}

such that

τ = h \circ σ_{1} = h \circ σ_{2}

. By h-invariance of

R_{1}

it follows that

σ_{1} \in R_{2}

. So

σ_{1} \in R_{1} \cap R_{2}

, and hence,

τ \in h \circ (R_{1} \cap R_{2})

. □

We can now present the algebraic characterization of h-invariance based on the concretization function

γ_{h}

of the Galois connection of h. Recall that

R \subseteq h \circ - (h \circ R)

for all subsets of concrete variable assignments R. The inverse inclusion characterizes the h-invariance of R.

Lemma 12

(Algebraic characterization). Let

h : S \to Δ

be a Σ-abstraction. A subset R of concrete variable assignment

V \to dom (S)

is h-invariant for h iff h

\circ - (h \circ R) \subseteq R

.

Proof.

“⇒”. Let R be h-invariant and

σ \in h \circ - (h \circ R)

. Then, there exists

σ^{'} \in R

such that

h \circ σ = h \circ σ^{'}

. The h-invariance of R thus implies that

σ \in R

.

“⇐”. Suppose that

h \circ - (h \circ R) \subseteq R

. Let

σ, σ^{'} : V \to dom (S)

such that

h \circ σ = h \circ σ^{'}

and

σ \in R

. We have to show that

σ^{'} \in R

. From

h \circ σ = h \circ σ^{'}

and

σ \in R

it follows that

σ^{'} \in h \circ - (h \circ R)

and thus

σ^{'} \in R

as required. □

Lemma 13

(Variable extension preserves invariance). Let h be a surjective abstraction and R a subset of functions of type

V^{'} \to dom (S)

and V a subset of variables disjoint from

V^{'}

. If R is h-invariant then

{ext}_{V}^{S} (R)

is h-invariant too.

Proof.

This follows straightforwardly from the characterization of h-invariance in Lemma 12 and the following two claims:

Claim 4.

If h is surjective then

h \circ {ext}_{V}^{S} (R) = {ext}_{V}^{Δ} (h \circ R)

.

This follows from

h \circ {ext}_{V}^{S} (R) = {h \circ σ ∣ σ \in {ext}_{V}^{S} (R)} = {ext}_{V}^{Δ} ({h \circ σ^{'} ∣ σ^{'} \in R})

where we use the surjectivity of h in the last step.

Claim 5.

h \circ - {ext}_{V}^{Δ} (R^{'}) = {ext}_{V}^{S} (h \circ - R^{'})

for any subset

R^{'}

of functions of type

V^{'} \to dom (Δ)

.

\begin{matrix} h \circ - {ext}_{V}^{Δ} (R^{'}) & = & {σ : V \cup V^{'} \to dom (S) ∣ h \circ σ \in {ext}_{V}^{Δ} (R^{'})} \\ = & {σ : V \cup V^{'} \to dom (S) ∣ h \circ σ_{| V^{'}} \in R^{'}} \\ = & {ext}_{V}^{S} ({σ^{'} : V^{'} \to dom (S) ∣ h \circ σ^{'} \in R^{'}} \\ = & {ext}_{V}^{S} (h \circ - R^{'}) \end{matrix}

□

Lemma 14.

Let

h : S \to Δ

be a surjective Σ-abstraction, ϕ be a Σ-formula, and

V \supseteq fv (ϕ)

. Then, the h-invariance of ϕ implies the h-invariance of

s o l_{V}^{S} (ϕ)

.

Proof.

This follows from the cylindrification Lemma 3 and that extension preserves h-invariance as shown in Lemma 13. □

Proposition 5

(Exactness is preserved by conjunction when assuming invariance). Let h be a surjective Σ-abstraction. If

ϕ_{1}

and

ϕ_{2}

are h-exact Σ-formulas and

ϕ_{1}

or

ϕ_{2}

are h-invariant then the conjunction

ϕ_{1} \land ϕ_{2}

is h-exact.

Proof.

Let

ϕ_{1}

and

ϕ_{2}

be h-exact

Σ

-formulas. We assume without loss of generality that

ϕ_{1}

is h-invariant. Let

V = fv (ϕ_{1} \land ϕ_{2})

. Since

fv (ϕ_{2}) \subseteq V

the set

s o l_{V}^{S} (ϕ_{2})

is h-invariant too by Lemma 14. We can now show that

ϕ_{1} \land ϕ_{2}

is h-exact as follows:

\begin{matrix} h \circ s o l^{S} (ϕ_{1} \land ϕ_{2}) & = & h \circ (s o l_{V}^{S} (ϕ_{1}) \cap s o l_{V}^{S} (ϕ_{2})) \\ = & h \circ s o l_{V}^{S} (ϕ_{1}) \cap h \circ s o l_{V}^{S} (ϕ_{2}) & by Lemma 11 \\ = & s o l_{V}^{Δ} (ϕ_{1}) \cap s o l_{V}^{Δ} (ϕ_{2}) & by h - exactness of ϕ_{1} and ϕ_{2} wrt V \\ = & s o l^{Δ} (ϕ_{1} \land ϕ_{2}) \end{matrix}

□

Our next objective is to show that h-invariant formulas are closed under conjunction, disjunction, and existential quantification. The two former closure properties rely on the following two algebraic properties of abstraction decomposition.

Lemma 15

(Concretization

γ_{h}

preserves join and meet). For any Σ-abstraction

h : S \to Δ

, any subsets of assignments of type

V \to dom (S)

R_{1}

and

R_{2}

and V a subset of variables:

$h \circ - (R_{1} \cap R_{2}) = h \circ - R_{1} \cap h \circ - R_{2}$ .
$h \circ - (R_{1} \cup R_{2}) = h \circ - R_{1} \cup h \circ - R_{2}$ .

For general Galois connections, concretization is well-known to preserve joins but may not preserve meets. Still, meets are preserved for any Galois connections where the the concrete and abstract domains C and A are powersets as in our setting, so that joins are unions and meets intersections.

Proof.

The case of unions follows straightforwardly from the definitions:

\begin{matrix} h \circ - (R_{1} \cup R_{2}) & = & {σ ∣ h \circ σ \in R_{1} \cup R_{2}} \\ = & {σ ∣ h \circ σ \in R_{1} \lor h \circ σ \in R_{2}} \\ = & {σ ∣ h \circ σ \in R_{1}} \cup {σ ∣ h \circ σ \in R_{2}} \\ = & h \circ - R_{1} \cup h \circ - R_{2} \end{matrix}

The case of intersection is symmetric:

\begin{matrix} h \circ - (R_{1} \cap R_{2}) & = & {σ ∣ h \circ σ \in R_{1} \cap R_{2}} \\ = & {σ ∣ h \circ σ \in R_{1} \land h \circ σ \in R_{2}} \\ = & {σ ∣ h \circ σ \in R_{1}} \cap {σ ∣ h \circ σ \in R_{2}} \\ = & h \circ - R_{1} \cap h \circ - R_{2} \end{matrix}

□

Lemma 16

(Intersection and union preserve invariance). Let

h : S \to Δ

be a Σ-abstraction. Then, the intersection and union of any two h-invariant subsets

R_{1}

and

R_{2}

of variables assignments of type

V \to dom (S)

is h-invariant.

Proof.

This follows from the algebraic characterization Lemma 12 for invariance, in combination with the algebraic properties of composition and decomposition given in Lemmas 7, 11, and 15. □

Lemma 17

(Concretization

γ_{h}

commutes with projection).

h \circ - π_{x} (R) = π_{x} (h \circ - R)

.

Proof.

For all

σ : V \to dom (Δ)

we have

h \circ - π_{x} (σ) = h \circ - σ_{| V \ {x}} = {(h \circ - σ)}_{| V \ {x}} = π_{x} (h \circ - σ)

. □

Proposition 6

(Invariance is preserved by conjunction, disjunction, and quantification). If h is a surjective abstraction, then the class of h-invariant FO-formulas is closed under conjunction, disjunction, and existential quantification.

Proof.

Let

h : S \to Δ

be a

Σ

-abstraction.

Case of conjunction: Let

ϕ_{1}

and

ϕ_{2}

be h-invariant and

V = fv (ϕ_{1} \land ϕ_{2})

. By Lemma 14 the sets

s o l_{V}^{S} (ϕ_{1})

and

s o l_{V}^{S} (ϕ_{2})

are both h-invariant, and so by Lemma 16 is their intersection. Hence:

\begin{matrix} h \circ - (h \circ s o l^{S} (ϕ_{1} \land ϕ_{2})) \\ = & h \circ - (h \circ (s o l_{V}^{S} (ϕ_{1}) \cap s o l_{V}^{S} (ϕ_{2}))) \\ \subseteq & s o l_{V}^{S} (ϕ_{1}) \cap s o l_{V}^{S} (ϕ_{2}) & by h - invariance and Lemma 12 \\ = & s o l^{S} (ϕ_{1} \land ϕ_{2}) \end{matrix}

By Lemma 12 in the other direction, this implies that

ϕ_{1} \land ϕ_{2}

is h-invariant.

Case of disjunction: Analogous to the case of conjunction.

Case of existential quantification:

\begin{matrix} h \circ - (h \circ s o l^{S} (\exists x . ϕ_{1})) \\ = & h \circ - (h \circ π_{x} (s o l^{S} (ϕ_{1}))) & by Lemma 4 \\ = & h \circ - (π_{x} (h \circ s o l^{S} (ϕ_{1}))) & by Lemma 8 \\ = & π_{x} (h \circ - (h \circ s o l^{S} (ϕ_{1}))) & by Lemma 17 \\ \subseteq & π_{x} (s o l^{S} (ϕ_{1})) & by h - invariance of ϕ_{1} and Lemma 12 \\ = & s o l^{S} (\exists x . ϕ_{1}) & by Lemma 4 \end{matrix}

By Lemma 12, this implies that

\exists x . ϕ_{1}

is h-invariant. □

We do not know whether negation preserves h-invariance in general, but for finite

Δ

it can be shown that if

ϕ

is h-exact and h-invariant, then

\neg ϕ

is h-exact and h-invariant too.

Proposition 7.

Let h be a surjective Σ-abstraction. Then, the class of h-exact and h-invariant Σ-formulas is closed under conjunction, disjunction, and existential quantification.

Proof.

Closure under conjunction follows from Propositions 5 and 6, closure under disjunction from Propositions 1 and 6, and closure under existential quantification by Propositions 2 and 6. □

Theorem 4

(

h_{B}

-invariance and

h_{B}

-exactness of polynomial equations). Any positive polynomial equation

p \overset{\circ}{=} 0

such that p has no constant term is

h_{B}

-exact and

h_{B}

-invariant.

Proof.

Consider a positive polynomial equation

p \overset{\circ}{=} 0

such that p has no constant term and only positive coefficients. Thus, p has the form

\sum_{j = 1}^{l} n_{j} \prod_{k = 1}^{i_{j}} x_{j, k}^{m_{j, k}} \overset{\circ}{=} 0

where

l \geq 0

, and

n_{j}, i_{j}, m_{j, k} > 0

.

Claim 6.

For both algebras

S \in {B, R_{+}}

:

s o l^{S} (p \overset{\circ}{=} 0) = s o l^{S} (⋀_{j = 1}^{l} ⋁_{k = 1}^{i_{j}} x_{j, k} \overset{\circ}{=} 0) .

The polynomial has a value of zero if and only if all its monomials do, that is:

\prod_{k = 1}^{i_{j}} x_{j, k}^{m_{j k}} = 0

for all

1 \leq j \leq l

. Since constant terms are ruled out, we have

i_{j} \neq 0

. Furthermore, we assumed for all polynomials that

m_{j, k} \neq 0

. So for all

1 \leq j \leq l

there must exist

1 \leq k \leq i_{j}

such that

x_{j, k} = 0

.

Claim 7.

The equation

x \overset{\circ}{=} 0

is

h_{B}

-exact and

h_{B}

-invariant.

This proof of this claim is straightforward from the definitions.

With these two claims, we are now in the position to prove the Theorem 4. Since the class of

h_{B}

-exact and

h_{B}

-invariant formulas is closed under conjunction and disjunction by Proposition 7, it follows from by Claim 7 that

\land_{j = 1}^{l} \lor_{k = 1}^{i_{j}} x_{j, k} \overset{\circ}{=} 0

is both

h_{B}

-exact and

h_{B}

-invariant. Since this formula is equivalent over

R_{+}

to the polynomial equation by Claim 6, the

h_{B}

-invariance carries over to

p \overset{\circ}{=} 0

. The

h_{B}

-exactness also carries over based on the equivalence for both structures

R_{+}

and

B

:

\begin{matrix} h_{B} \circ s o l^{R_{+}} (p \overset{\circ}{=} 0) & = & h_{B} \circ s o l_{V}^{R_{+}} (\land_{j = 1}^{l} \lor_{k = 1}^{i_{j}} x_{j, k} \overset{\circ}{=} 0) & by Claim 6 for R_{+} \\ = & s o l^{B} (\land_{j = 1}^{l} \lor_{k = 1}^{i_{j}} x_{j, k} \overset{\circ}{=} 0) & by h_{B} exactness \\ = & s o l^{B} (p \overset{\circ}{=} 0) & by Claim 6 for B . \end{matrix}

□

8. $h_{B}$ -Exact Rewriting of $h_{B}$ -Mixed Systems

In this section, we lift our main result to

h_{B}

-mixed system, presenting a rewrite algorithm that makes any

h_{B}

-mixed system

h_{B}

-exact.

Definition 14.

A

h_{B}

-mixed system is a formula in

F_{Σ_{b o o l}}^{}

of the form

\exists z . ϕ \land ϕ^{'}

where ϕ is a system of linear

Σ_{b o o l}

-equations and

ϕ^{'}

a

h_{B}

-invariant and

h_{B}

-exact first-order formula.

Note that linear equation systems

A y \overset{\circ}{=} 0

, with A an integer matrix and

y

a sequence of pairwise distinct variables, need not to be

h_{B}

-exact, if A is not positive. However, as shown by the elementary mode rewriting Corollary 1 any linear equation systems is

R_{+}

-equivalent to some quasipositive strongly-triangular linear system, that is

h_{B}

-exact by Theorem 3.

Our next objective is to rewrite formulas to reduce the overapproximation coming with the abstract interpretation over the Booleans by John’s theorem. The idea is to make a linear equation system

h_{B}

-exact that are used as subformulas as for instance of

h_{B}

-mixed systems.

We recall from Corollary 1 that the elementary mode rewriting

emr (ϕ)

of a linear equation system is an

h_{B}

-exact formula that is

R_{+}

-equivalent to

ϕ

. We now introduce the boolean rewriting by lifting the elementary mode rewriting to a richer class of formulas. Given a vector

z \in V^{*}

, a linear equation system

ϕ \in F_{Σ_{b o o l}}^{}

, and a formula

ϕ^{'} \in F_{Σ_{b o o l}}^{}

, the boolean rewriting is defined by:

br (\exists z . (ϕ \land ϕ^{'})) =_{def} \exists z . (emr (ϕ) \land ϕ^{'})

The boolean rewriting may indeed reduce the overapproximation coming with abstract interpretation of formulas over the booleans, as show by the following proposition.

Proposition 8.

h_{B} \circ s o l^{R_{+}} (ψ) \subseteq s o l^{B} (br (ψ)) \subseteq s o l^{B} (ψ)

.

Proof.

Let

ϕ

be a linear equation system,

z \in V^{*}

,

ϕ^{'} \in F_{Σ_{b o o l}}^{}

and

ψ =_{def} \exists z . ϕ \land ϕ^{'}

. Since

ϕ

is

R_{+}

-equivalent to

emr (ϕ)

, it follows that

br (ψ)

is

R_{+}

-equivalent to

ψ

. Hence,

s o l^{R_{+}} (ψ) = s o l^{R_{+}} (br (ψ))

so that:

h_{B} \circ s o l^{R_{+}} (ψ) = h_{B} \circ s o l^{R_{+}} (br (ψ))

By John’s theorem, we have:

h_{B} \circ s o l^{R_{+}} (br (ψ)) \subseteq s o l^{B} (br (ψ))

Furthermore, by

h_{B}

-exactness,

R_{+}

-equivalence, and again John’s theorem, we have:

s o l^{B} (emr (ϕ)) = h_{B} \circ s o l^{R_{+}} (emr (ϕ)) = h_{B} \circ s o l^{R_{+}} (ϕ) \subseteq s o l^{B} (ϕ)

Therefore, it follows that:

s o l^{B} (br (ψ)) \subseteq s o l^{B} (ψ)

In combination this yields the inclusions of the proposition. □

Theorem 5

(Main). For any

h_{B}

-mixed system

ψ \in F_{Σ}^{}

the boolean rewriting

br (ψ)

is

h_{B}

-exact,

R_{+}

-equivalent to ψ, and can be computed in at most exponential time.

Proof.

Let

ψ

be a

h_{B}

-mixed system

\exists x . (ϕ \land ϕ^{'})

. where

ϕ

is a linear equation system and

ϕ^{'}

a first-order formula that is

h_{B}

-exact and

h_{B}

-invariant. Based on the elementary modes rewriting Corollary 1, the linear equation system

ϕ

can be transformed in at most exponential time to the form

emr (ψ) = \exists z . ϕ^{″}

where

ϕ^{″}

is a quasipositive strongly-triangular system of linear equations. Such polynomial equation systems are

h_{B}

-exact by Theorem 3, and so is

ϕ^{″}

. The Invariance Proposition 5 shows that the conjunction

ϕ^{″} \land ϕ^{'}

is

h_{B}

-exact too, since

ϕ^{'}

was assumed to be

h_{B}

-exact and

h_{B}

-invariant. The

h_{B}

-exactness is preserved by existential quantification by Proposition 2, so the formula

br (ψ) = \exists x . emr (ϕ) \land ϕ^{'}

is

h_{B}

-exact too. □

Corollary 3.

The

h_{B}

-abstraction of the

R_{+}

-solution set of a

h_{B}

-mixed system ϕ, that is

h_{B} \circ s o l^{R_{+}} (ϕ)

, can be computed in at most exponential time in the size of the system ϕ.

Proof.

Given a

h_{B}

-mixed system

ϕ

, we can apply Theorem 5 to compute in at most exponential time a

R_{+}

-equivalent formula

ϕ^{″}

that is

h_{B}

-exact. It is then sufficient to compute

s o l^{B} (ϕ^{″})

in exponential time in the size of

ϕ

. This can be done in the naive manner, that is by evaluating the formula

ϕ^{″}

—which may be of exponential size—over all possible boolean variable assignments, of which there may be exponentially many. For each assignment, the evaluation can be done in PSpace, and thus in exponential time. The overall time required is thus a product of two exponentials, which remains exponential. □

The algorithm from the proof Corollary 3 can be improved so that it becomes sufficiently efficient for practical use. For this the two steps with exponential worst case complexity must be made polynomial for the particular instances. Firstly, note that the computation of the elementary modes (Corollary 1) is known to be computationally feasible. Various algorithms for this purpose were implemented [16,24,25,26] and applied successfully to problems in systems biology [14]. The second exponential step concerns the enumeration of all boolean variable assignments. This enumeration may be avoided by using constraint programming techniques for computing the solution set

s o l^{B} (ϕ^{″})

. For those

h_{B}

-mixed systems for which both steps can be done in polynomial time, we can compute the boolean abstraction of the

R_{+}

-solution set in polynomial time too. The practical feasibility of this approach was demonstrated recently at an application to knockout prediction in systems biology [6], where previously only over-approximations could be computed.

9. Computing Sign Abstractions

We next show how to compute the sign abstraction

h S \circ s o l^{R} (ϕ)

for systems

ϕ

of linear

Σ_{b o o l}

-equations. To apply

h_{B}

-exact rewriting, we decompose the sign abstraction into the boolean abstraction and functions definable in first-order logic.

9.1. Decomposition

We can decompose any real number

r \in R

into a pair of two positive numbers

d e c (r) \in R_{+}^{2}

—negative and the positive part—as follows:

d e c (r) =_{def} \{\begin{matrix} (0, r) & if r \geq 0 \\ (- r, 0) & if r \leq 0 \end{matrix}

The image of this surjective function is

{0} \times R_{+}) \cup (R_{+} \times {0}

, so it has an inverse

dec^{- 1} : ({0} \times R_{+}) \cup (R_{+} \times {0}) \to R

, which satisfies for all pairs

(r_{1}, r_{2})

in the domain:

dec^{- 1} (r_{1}, r_{2}) = r_{2} -^{R} r_{1}

Furthermore, recall that

h_{B}^{2} : R_{+}^{2} \to B^{2}

satisfies

h_{B}^{2} (r_{1}, r_{2}) = (h_{B} (r_{1}), h_{B} (r_{2}))

.

Lemma 18

(Decomposition).

h S = d e c^{- 1} \circ h_{B}^{2} \circ d e c

Proof.

If r is negative then

dec^{- 1} (h_{B}^{2} (dec (r))) = dec^{- 1} (h_{B}^{2} ((- r, 0))) = dec^{- 1} ((h_{B} (- r), 0))

= - h_{B} (- r) = h_{S} (r)

. Otherwise if r is positive then

dec^{- 1} (h_{B}^{2} (dec (r))) = dec^{- 1} (h_{B}^{2} ((0, r)))

=

dec^{- 1} ((0, h_{B} (r))

=

h_{B} (r) = h_{S} (r)

. □

9.2. Positivity

We show in a first step that first-order formulas over the reals can be rewritten, such that interpretation over the positive reals is enough.

We call a formula

ϕ \in F_{Σ_{b o o l}}^{}

flat if all equations contained in

ϕ

have the form

x \overset{\circ}{=} x_{1} + x_{2}

,

x \overset{\circ}{=} x_{1} * x_{2}

,

x \overset{\circ}{=} 0

, or

x \overset{\circ}{=} 1

for some variables

x, x_{1}, x_{2}

. Note that any formula

ϕ \in F_{Σ_{b o o l}}^{}

can be converted to an equivalent flat formula in linear time by introducing fresh existentially quantified variables, so that we can assume flatness without loss of generality.

We fix two generators of fresh variable

ν_{⊖}

,

ν_{\oplus} : V \to V

. For any

x \in V

, the intention is that

ν_{\oplus} (x)

stands for the positive part of x and

ν_{⊖} (x)

for its negative part. We will preserve the invariants

x = ν_{\oplus} (x) - ν_{⊖} (x)

and

ν_{\oplus} (x) * ν_{⊖} (x) = 0

. Furthermore, we define

ν : V \to V^{2}

such that for all

x \in V

:

ν (x) =_{def} (ν_{⊖} (x), ν_{\oplus} (x))

For any flat formula

ϕ \in F_{Σ}^{} (V)

we define a formula

{dec}_{ν} (ϕ) \in F_{Σ}^{} (ν_{⊖} (V) \cup ν_{\oplus} (V))

with the variables

ν_{⊖} (x)

and

ν_{\oplus} (x)

instead of x for all

x \in V

. Otherwise the formula

{\tilde{dec}}_{ν} (ϕ)

has the same meaning as over the reals than

ϕ

.

{\tilde{dec}}_{ν} (ϕ) = {dec}_{ν} (ϕ) \land \underset{x \in V}{⋀} ν_{\oplus} (x) * ν_{⊖} (x) \overset{\circ}{=} 0

where

\begin{array}{l} {dec}_{ν} (x \overset{\circ}{=} x_{1} + x_{2}) = & {dec}_{ν} (x \overset{\circ}{=} x_{1} * x_{2}) = \\ ν_{\oplus} (x) + ν_{⊖} (x_{1}) + ν_{⊖} (x_{2}) \overset{\circ}{=} & ν_{\oplus} (x) + ν_{\oplus} (x_{1}) * ν_{⊖} (x_{2}) + ν_{⊖} (x_{1}) * ν_{\oplus} (x_{2}) \overset{\circ}{=} \\ ν_{⊖} (x) + ν_{\oplus} (x_{1}) + ν_{\oplus} (x_{2}) & ν_{⊖} (x) + ν_{\oplus} (x_{1}) * ν_{\oplus} (x_{2}) + ν_{⊖} (x_{1}) * ν_{⊖} (x_{2}) \\ {dec}_{ν} (x \overset{\circ}{=} 0) = ν_{\oplus} (x) \overset{\circ}{=} ν_{⊖} (x) & {dec}_{ν} (x \overset{\circ}{=} 1) = ν_{\oplus} (x) \overset{\circ}{=} ν_{⊖} (x) + 1 \\ {dec}_{ν} (\exists x . ϕ) = \exists ν_{⊖} (x) . \exists ν_{\oplus} (x) . & {dec}_{ν} (ϕ \land ϕ^{'}) = {dec}_{ν} (ϕ) \land {dec}_{ν} (ϕ^{'}) \\ ν_{\oplus} (x) * ν_{⊖} (x) \overset{\circ}{=} 0 \land {dec}_{ν} (ϕ) & {dec}_{ν} (\neg ϕ) = \neg {dec}_{ν} (ϕ) \end{array}

Note that the definition in the case of addition, the definition relies on that subtraction

-^{R}

in the structure of reals is the inverse of addition

+^{R}

. The expressions that are to be subtracted on one side of the equation are added to the other side instead. This is also used in the case of multiplication, in combination with the distributivity law for addition

+^{R}

and multiplication

*^{R}

. Furthermore,

{\tilde{dec}}_{ν} (ϕ)

belongs to

F_{Σ_{b o o l}}^{} (ν_{⊖} (V) \cup ν_{⊖} (V))

and can be computed in linear time from

ϕ

.

Proposition 9

(Positivity). For any flat formula

ϕ \in F_{Σ_{b o o l}}^{} (V)

:

\begin{matrix} d e c \circ s o l_{V}^{R} (ϕ) = {σ^{2} \circ ν_{| V} ∣ σ \in s o l^{R_{+}} ({\tilde{d e c}}_{ν} (ϕ))} \end{matrix}

Proof.

By induction on the structure of

ϕ

. In the first case of reals, can use that

-^{R}

is the inverse of

+^{R}

and that the distributivity laws holds for

+^{R}

and

*^{R}

. □

Lemma 19.

For any flat linear equation system ϕ, the formula

{\tilde{d e c}}_{ν} (ϕ)

is a

h_{B}

-mixed system.

Proof.

If

ϕ

is a flat linear system, then

{dec}_{ν} (ϕ)

is a linear system, so that

{\tilde{dec}}_{ν} (ϕ)

is a

h_{B}

-mixed system. □

9.3. Computing Sign Abstractions

We now have developed all the prerequisite for computing the sign abstraction of linear equation systems by using

h_{B}

-exact boolean rewriting of

h_{B}

-mixed systems.

Theorem 6.

For any linear equation system

ϕ \in F_{Σ_{b o o l}}^{} (V)

, the formula

br ({\tilde{d e c}}_{ν} (ϕ))

can be computed in at most exponential time and satisfies:

h_{S} \circ s o l_{V}^{R} (ϕ) = {[y / τ (ν_{\oplus} (y)) -^{R} τ (ν_{⊖} (y)) ∣ y \in V] ∣ τ \in s o l_{}^{B} (br ({\tilde{d e c}}_{ν} (ϕ)))}

Proof.

Let

ϕ \in F_{Σ_{b o o l}}^{} (V)

be a system of linear equations. Without loss of generality, we can assume that

ϕ

is flat. Let:

\tilde{ϕ} =_{def} {\tilde{d e c}}_{ν} (ϕ)

. The formula

\tilde{ϕ}

is a

h_{B}

-mixed system by Lemma 19 with

fv (\tilde{ϕ}) = ν_{⊖} (V) \cup ν_{\oplus} (V)

so that we can apply the Main Theorem 5 to it. It shows that boolean rewriting

br (\tilde{ϕ})

is an

R_{+}

-equivalent formula in

F_{Σ}^{} (ν_{\oplus} (V) \cup ν_{⊖} (V))

that is

h_{B}

-exact and can be computed in at most exponential time. We can now conclude as follows:

\begin{matrix} h_{S} & \circ s o l_{V}^{R} (ϕ) \\ = dec^{- 1} \circ h_{B}^{2} \circ dec \circ s o l_{V}^{R} (ϕ) & Decomposition Lemma 18 \\ = dec^{- 1} \circ h_{B}^{2} \circ {σ^{2} \circ ν_{| V} ∣ σ \in s o l_{}^{R_{+}} (\tilde{ϕ})} & Positivity Proposition 9 \\ = dec^{- 1} \circ h_{B}^{2} \circ {σ^{2} \circ ν_{| V} ∣ σ \in s o l_{}^{R_{+}} (br (\tilde{ϕ})} & R_{+} - equivalence of \tilde{ϕ} and br (\tilde{ϕ}) \\ = {dec^{- 1} \circ τ^{2} \circ ν_{| V} ∣ τ \in s o l_{}^{B} (br (\tilde{ϕ}))} & h_{B} - exactness of br (\tilde{ϕ}) \\ = {[y / τ (ν_{\oplus} (y)) -^{R} τ (ν_{⊖} (y)) ∣ y \in V] & definition of dec^{- 1} \\ ∣ τ \in s o l_{}^{B} (br (\tilde{ϕ}))} \end{matrix}

The sign abstraction of a system

ϕ

of

Σ_{b o o l}

-equations with free variables in

V = fv (ϕ)

can thus be computed by first computing the

h_{B}

-exact formula

br (\tilde{ϕ}) \in F_{Σ}^{} (ν_{\oplus} (V) \cup ν_{⊖} (V))

from Theorem 6 by applying the Positivity Proposition 9 and the Main Theorem 5, then computing

s o l_{}^{B} (br (\tilde{ϕ}))

by finite domain constraint programming, and finally inferring

h_{S} \circ s o l^{R} (ϕ)

thereof based on the equation of Theorem 6.

Corollary 4.

The sign abstraction

h_{S} \circ s o l_{V}^{R} (ϕ)

can be computed in at most single exponential time in the size of ϕ.

Proof.

The formula

br (\tilde{ϕ})

is of exponential size but contains only twice as many variables than

ϕ

. Let

n = | fv (ϕ) |

. We can then compute

h_{S} \circ s o l_{V}^{R} (ϕ)

by testing

6^{2 n}

variable assignments for membership to

s o l_{}^{R} (br (\tilde{ϕ}))

. Each such test is linear in the size of

br (\tilde{ϕ})

, and thus in

O (2^{m})

where m is the size of

ϕ

. So the overall time is in

O (6^{2 n} 2^{m})

and since

n \leq m

in

O (6^{3 m})

. □

We finally show that the same algorithm as for computing the sign abstraction for linear equation systems can be lifted to a richer class of formulas to obtain another and possibly more precise overapproximation of the sign abstraction than John’s.

Proposition 10.

Let

ψ = \exists z . ϕ \land ϕ^{'}

in

F_{Σ_{b o o l}}^{} (V)

for some linear equation system ϕ and formula

ϕ^{'} \in F_{Σ_{b o o l}}^{}

. The formula

br ({\tilde{d e c}}_{ν} (ψ))

then yields an overapproximation of the sign abstraction of ϕ:

h_{S} \circ s o l_{V}^{R} (ψ) \subseteq {[y / τ (ν_{\oplus} (y)) -^{R} τ (ν_{⊖} (y)) ∣ y \in V] ∣ τ \in s o l_{}^{B} (br ({\tilde{d e c}}_{ν} (ψ)))}

Proof.

Along the lines of the proof of Theorem except that

br ({\tilde{d e c}}_{ν} (ψ))

is not

h_{B}

-exact. Therefore, the equality where the

h_{B}

-exactness was used must be weakened to an inclusion. □

10. Application to Program Analysis

We illustrate our results by applying the sign abstraction for program analysis based on abstract interpretation. We consider the Python implementation in Figure 6 of the function

I : R^{2} \to R

. A call

I (a, s)

supposedly computes the approximation of the integral

\int_{0}^{a} f (x) d x

with step width

s

for some total function

f : R \to R

. Abstract interpretation allows us to find out the conditions that must hold on the input parameters for

I ((a : f l o a t, s : f l o a t)

to work properly, and in particular to avoid exception throwing.

We can first interpret numeric programs abstractly as a formula of first-order logic with signature

Σ_{a r i t h}

. We illustrate this in an ad hoc manner on the integral example

I

:

ϕ_{I} =_{def} \begin{matrix} \exists {ret}_{f} \exists {ret}_{I} \exists result . \\ (a < 0 \Leftrightarrow raise_exception \overset{\circ}{=} 1) \land \\ ((s > a \land do_recursion \overset{\circ}{=} 0 \land result \overset{\circ}{=} 0) \lor \\ (\neg (s > a) \land do_recursion \overset{\circ}{=} 1 \land a_{rec} \overset{\circ}{=} a - s \land s_{rec} \overset{\circ}{=} s \land \\ result \overset{\circ}{=} s \cdot {ret}_{f} + {ret}_{I})) \end{matrix}

The variables

a

and

s

are the formal parameters in the definition of

I (a : f l o a t, s : f l o a t)

. The others are fresh variables introduced to handle exceptions or function calls: the boolean flag

raise_exception

represents exception throwing, the boolean flag

do_recursion

has a true value only when a recursive call is made to

I

with actual parameters represented by the variables

a_{r e c}, s_{r e c}

and return value represented by

{ret}_{I}

, while

{ret}_{f}

is the variable for the return value of the call to the function

f

. The final return value of

I

is represented by the variable

result

. In what follows, we are not interested in the signs of the last three variables, so we quantify them existentially.

The sign behavior of function

I

is given by the formula’s sign abstraction

h_{S} \circ s o l^{R} (ϕ_{I})

. Given that

ϕ_{I}

is not

h_{B}

-mixed system, we cannot apply the algorithm from Theorem 6 directly to compute this sign abstraction. Nevertheless, it will be beneficial as we will illustrate below.

By John’s theorem, the sign abstraction

h_{S} \circ s o l^{R} (ϕ_{I})

can be overapproximated by the abstract interpretation

s o l^{S} (ϕ_{I})

. Since

S

is a finite structure, this abstract interpretation can be computed by finite domain constraint programming. For this, we implemented a solver for first-order formulas over the structure

S

with Minizinc [17]. When applied to

ϕ_{I}

it returns the set of abstract solutions

s o l^{S} (ϕ_{I})

given in Table 1. This set contains the 6 unjustified abstract solutions

2, 4, 10, 13, 15, 18

outside

h_{S} \circ s o l^{R} (ϕ_{I})

. In the table they are distinguished by gray background color. We also note that the last three solutions

17, 18, 19

could be ruled out when using a more precise abstract program interpretation, taking into account that no recursive call is possible when an exception is thrown.

The sets of abstract solutions provide information on possible sign of values of the parameters in a call

I (a : f l o a t, s : f l o a t)

. For example, solution 1 in Table 1 states that when called with values of signs

[a / 0, s / 1]

the function

I

will not raise an exceptions nor make a recursive call. Solution 8 states that when called with values of signs

[a / 1, s / 1]

function

I

may go into recursion with signs

[a_{r e c} / 0, s_{r e c} / 1]

without raising an exception.

Any set of abstract solutions defines an abstract call graph. The abstract call graphs of

s o l^{S} (ϕ_{I})

and

h_{S} \circ s o l^{R} (ϕ_{I})

from Table 1 are given in Figure 7. Solution 1 in Table 1 implies a solid edge from the node

I^{S} (1, 1)

to the node

I^{S} (0, 1)

. The edge is solid since solution 1 is justified. Edges induced by unjustified solutions are dashed. The unjustified solution 10 for instance induces the dashed edge from

I^{S} (1, 1)

to

I^{S} (1, - 1)

. Solutions with

do_recursion = 0

and

raise_exception = 0

do not induce any edge. Instead, they show that the computation may stop, producing final nodes that are surrounded by a double circle. The final nodes are

I^{S} (1, 1)

and

I^{S} (0, 1)

. Note that for all nonfinal nodes, either an exception is raised or the computation loops endlessly. Solutions with

raise_exception = 1

induce an edge to the except node.

Given that only 2 unjustified solutions with

do_recursion = 0

and

raise_exception = 0

(10 and 18), there are only 2 dashed edges in the graph. Furthermore, the edges induced by the last three solutions 17, 18, 19 are drawn in blue, since these could be removed with a more precise abstract program interpretation than

ϕ_{I}

.

The sign analysis without the unjustified dashed edges yields the following result: the program in state

I^{S} (1, 1)

, where

a > 0

and

s > 0

may either terminate, loop indefinitely, or go to state

I^{S} (0, 1)

and terminate there immediately. With the unjustified dashed edges, however, it wrongly seems possible that the program may also raise an exception by passing through

I^{S} (- 1, 1)

. This overapproximation would be particularly unfortunate since state

I^{S} (1, 1)

is the only useful state to call

I

.

We next show how to remove the unjustified solutions by applying the overapproxmation algorithm for the sign abstraction from Proposition 10, that lifts the algorithm for exact sign abstraction from Theorem 6 to a richer class of formulas. The idea is to split the formula

ϕ_{I}

into its linear part and the rest. Before doing so, we preprocess the inequation

s > a

: We introduce a fresh variable

signvar

, add the equation

s - a \overset{\circ}{=} signvar

, and rewrite

s > a

to

signvar > 0

. The linear part of

ϕ_{I}

then becomes:

s - a \overset{\circ}{=} signvar \land a_{rec} \overset{\circ}{=} a - s \land s_{rec} \overset{\circ}{=} s

We can then rewrite the linear part into the signature

Σ_{b o o l}

by moving the negative parts positively onto the other side. This yields the following linear equation system:

\begin{matrix} s \overset{\circ}{=} signvar + a \land a_{rec} + s \overset{\circ}{=} a \land s_{rec} \overset{\circ}{=} s \end{matrix}

The remainder of

ϕ_{I}

can be rewritten as follows:

\begin{matrix} ((a < 0 \land raise_exception > 0) \lor (a \geq 0 \land raise_exception \overset{\circ}{=} 0)) \\ \land ((signvar > 0 \land do_recursion \overset{\circ}{=} 0 \land result \overset{\circ}{=} 0) \lor \\ (signvar \leq 0 \land do_recursion > 0 \land result \overset{\circ}{=} s * {ret}_{f} + {ret}_{I})) \end{matrix}

It is not clear whether the conjunction of both parts is a

h_{B}

-mixed system, since it is not clear how to show the

h_{B}

-invariance of the equation

result \overset{\circ}{=} s * {ret}_{f} + {ret}_{I}

. Still, we can apply the overapproximation algorithm of the sign abstraction from Proposition 10. It indeed improves on John’s approximation, ruling out both unjustified solutions. The details are worked out in Appendix A.

In the general case, linear equation systems are not enough, in which case our algorithm from Theorem 6 for computing sign abstractions cannot be applied. But then we can still apply the overapproximation algorithm from Proposition 10 which rewrites a linear part of the formula exactly. As illustrated by the present example, this overapproximation is often way more precise than John’s.

11. Example for the Overapproximation of the Sign Abstraction

We reconsider conjunction of the linear part obtained and the rest of

ϕ_{I}

, that is

ϕ_{I}^{l i n} \land ϕ_{I}^{r e s t}

where:

ϕ_{I}^{l i n} =_{def} \{\begin{matrix} s \overset{\circ}{=} signvar + a \\ \land & a_{rec} + s \overset{\circ}{=} a \\ \land & s_{rec} \overset{\circ}{=} s \end{matrix}

ϕ_{I}^{r e s t} =_{def} \{\begin{matrix} ( & (a < 0 \land raise_exception > 0) \\ \lor & (a \geq 0 \land raise_exception \overset{\circ}{=} 0)) \\ \land & ( & (signvar > 0 \land do_recursion \overset{\circ}{=} 0 \land result \overset{\circ}{=} 0) \\ \lor & (signvar \leq 0 \land do_recursion > 0 \land result \overset{\circ}{=} s * {ret}_{f} + {ret}_{I})) \end{matrix}

The decomposition of the linear subsystem

{d e c}_{ν} (ϕ_{I}^{l i n})

for interpretation over

B

as defined in Section 9 is obtained by splitting each variable x into two fresh variables

ν_{\oplus} (x)

and

ν_{⊖} (x)

representing its positive and negative part:

{d e c}_{ν} (ϕ_{I}^{l i n}) = \{\begin{matrix} ν_{\oplus} (s) + ν_{⊖} (a) + ν_{⊖} (signvar) \overset{\circ}{=} ν_{⊖} (s) + ν_{\oplus} (a) + ν_{\oplus} (signvar) \\ \land & ν_{\oplus} (a_{rec}) + ν_{⊖} (a) + ν_{\oplus} (s) \overset{\circ}{=} ν_{⊖} (a_{rec}) + ν_{\oplus} (a) + ν_{⊖} (s) \\ \land & ν_{\oplus} (s_{rec}) + ν_{⊖} (s) \overset{\circ}{=} ν_{⊖} (s_{rec}) + ν_{\oplus} (s) \end{matrix}

The additional constraints on the decomposition variables are:

\begin{matrix} ν_{\oplus} (s) * ν_{⊖} (s) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} (a) * ν_{⊖} (a) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} (signvar) * ν_{⊖} (signvar) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} (a_{rec}) * ν_{⊖} (a_{rec}) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} (s_{rec}) * ν_{⊖} (s_{rec}) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} (result) * ν_{⊖} (result) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} ({ret}_{I}) * ν_{⊖} ({ret}_{I}) \overset{\circ}{=} 0 \\ \land & ν_{\oplus} ({ret}_{f}) * ν_{⊖} ({ret}_{f}) \overset{\circ}{=} 0 \end{matrix}

The elementary mode rewriting

emr ({d e c}_{ν} (ϕ_{I}^{l i n}))

is the following

R_{+}

-equivalent

h_{B}

-exact

Σ_{b o o l}

-formula obtained via Corollary 1:

\begin{matrix} \exists x_{0} \dots \exists x_{10} . \\ \land & ν_{⊖} (a) \overset{\circ}{=} x_{10} + x_{8} + x_{9} \\ \land & ν_{\oplus} (a) \overset{\circ}{=} x_{10} + x_{6} + x_{7} \\ \land & ν_{⊖} (a_{rec}) \overset{\circ}{=} x_{4} + x_{5} + x_{9} \\ \land & ν_{\oplus} (a_{rec}) \overset{\circ}{=} x_{3} + x_{5} + x_{7} \\ \land & ν_{⊖} (signvar) \overset{\circ}{=} x_{2} + x_{3} + x_{7} \\ \land & ν_{\oplus} (signvar) \overset{\circ}{=} x_{2} + x_{4} + x_{9} \\ \land & ν_{⊖} (s) \overset{\circ}{=} x_{1} + x_{3} + x_{8} \\ \land & ν_{\oplus} (s) \overset{\circ}{=} x_{1} + x_{4} + x_{6} \\ \land & ν_{⊖} (s_{rec}) \overset{\circ}{=} x_{0} + x_{3} + x_{8} \\ \land & ν_{\oplus} (s_{rec}) \overset{\circ}{=} x_{0} + x_{4} + x_{6} \end{matrix}

The nonlinear remainder also needs to be rewritten with the decomposition variables for interpretation over

B

. The formula below is

{d e c}_{ν} (ϕ_{I}^{l i n})

except that we simplified the rewriting of inequations a bit.

\begin{matrix} ( & (\neg ν_{⊖} (a) \overset{\circ}{=} 0 \land \neg ν_{\oplus} (raise_exception) \overset{\circ}{=} 0) \\ \lor & (ν_{⊖} (a) \overset{\circ}{=} 0 \land ν_{⊖} (raise_exception) \overset{\circ}{=} 0 \land ν_{\oplus} (raise_exception) \overset{\circ}{=} 0)) \\ \land & ( & (\neg ν_{\oplus} (signvar) \overset{\circ}{=} 0 \land ν_{⊖} (do_recursion) \overset{\circ}{=} 0 \land ν_{\oplus} (do_recursion) \overset{\circ}{=} 0 \\ \land ν_{⊖} (result) \overset{\circ}{=} 0 \land ν_{\oplus} (result) \overset{\circ}{=} 0) \\ \lor & (ν_{\oplus} (signvar) \overset{\circ}{=} 0 \land \neg ν_{\oplus} (do_recursion) \overset{\circ}{=} 0 \\ \land ν_{\oplus} (result) + ν_{⊖} (s) * ν_{\oplus} ({ret}_{f}) + ν_{\oplus} (s) * ν_{⊖} ({ret}_{f}) + ν_{⊖} ({ret}_{I})) \\ \overset{\circ}{=} ν_{⊖} (result) + ν_{\oplus} (s) * ν_{\oplus} ({ret}_{f}) + ν_{⊖} (s) * ν_{⊖} ({ret}_{f}) + ν_{\oplus} ({ret}_{I}))) \end{matrix}

For any solution

τ

of the conjunction of the above three blocks of formulas over the algebra of booleans

B

, we then obtain an assignment

σ \in h_{S} \circ s o l^{R} (ϕ_{I})

according to Theorem 6:

\begin{matrix} σ (s) = τ (ν_{\oplus} (s)) -^{R} τ (ν_{⊖} (s)) \\ σ (a) = τ (ν_{\oplus} (a)) -^{R} τ (ν_{⊖} (a)) \\ σ (signvar) = τ (ν_{\oplus} (signvar)) -^{R} τ (ν_{⊖} (signvar)) \\ σ (a_{rec}) = τ (ν_{\oplus} (a_{rec})) -^{R} τ (ν_{⊖} (a_{rec})) \\ σ (s_{rec}) = τ (ν_{\oplus} (s_{rec})) -^{R} τ (ν_{⊖} (s_{rec})) \\ σ (result) = τ (ν_{\oplus} (result)) -^{R} τ (ν_{⊖} (result)) \\ σ ({ret}_{f}) = τ (ν_{\oplus} ({ret}_{f})) -^{R} τ (ν_{⊖} ({ret}_{f})) \\ σ ({ret}_{I}) = τ (ν_{\oplus} ({ret}_{I})) -^{R} τ (ν_{⊖} ({ret}_{I})) \end{matrix}

12. Conclusions and Future Work

We showed that any

h_{B}

-mixed system can be rewritten into an

h_{B}

-exact formula by computing the elementary modes of the linear subsystem. In previous work,

h_{B}

-exact rewriting

h_{B}

-mixed systems was applied to compute difference abstractions exactly. In the present paper, we showed that

h_{B}

-exact rewriting can also be used to compute sign-abstractions exactly.

We have illustrated the usefulness of the computation of sign abstraction for linear formulas for the sign analysis of function programs. Using John’s overapproximation is often not good enough for such applications, since the relationships between the signs of different variables are quickly lost. We saw that elementary mode rewriting yields better a better approximation of the sign abstraction even for nonlinear equation systems, which may preserve these relationships.

The time for computing abstractions exactly strongly depends on the time needed to compute the elementary modes. Some experiments were reported in [6] in the case of the difference abstraction. There, one has to compute the elementary modes for a linear equation system that contains two copies of the linear equation system given with the input. The copying doubles the size and may increase the time for the computation of the elementary modes seriously. In the application of difference abstraction to change prediction of reaction networks, we observed cases where John’s overapproximation of the difference abstraction could be computed in circa 10 min, while the exact computation required circa 10 h.

In the future, it would we of interest to find heuristics for approximating abstractions of linear equation systems that reduce the computation time of the exact algorithm while improving John’s overapproximation in precision. In the case of difference abstractions, the minimal support heuristics was proposed for this purpose [6]. In the example mentioned above, this heuristics could be computed in circa 10 min, like John’s overapproximation, while yielding the exact result. In general, however, the minimal support heuristics is not exact.

Another interesting question for future work is how to compute more quantitative abstractions exactly, as for instance with intervals. In this case however the structure of abstract values is infinite, therefore finite domain constraint programming is no longer sufficient to compute the set of abstract solutions.

Author Contributions

These authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data sharing not applicable.

Acknowledgments

We would like to acknowledge the reviewers for the constructive feedback.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The system of linear

Σ_{b o o l}

-equations

{dec}_{ν} (ϕ_{I}^{l i n})

corresponds to the following linear integer matrix equation:

(\begin{matrix} 1 & - 1 & 0 & 0 & 1 & - 1 & - 1 & 1 & 0 & 0 \\ - 1 & 1 & 1 & - 1 & 0 & 0 & 1 & - 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & - 1 & - 1 & 1 \end{matrix}) (\begin{matrix} ν_{⊖} (a) \\ ν_{\oplus} (a) \\ ν_{⊖} (a_{rec}) \\ ν_{\oplus} (a_{rec}) \\ ν_{⊖} (signvar) \\ ν_{\oplus} (signvar) \\ ν_{⊖} (s) \\ ν_{\oplus} (s) \\ ν_{⊖} (s_{rec}) \\ ν_{\oplus} (s_{rec}) \end{matrix}) \overset{\circ}{=} (\begin{matrix} 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix})

The elementary mode rewriting

emr ({d e c}_{ν} (ϕ_{I}^{l i n}))

corresponds to the linear integer matrix equation:

(\begin{matrix} 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 1 \\ 0 & 0 & 1 & 0 & 0 & 0 & 0 & 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 1 & 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 1 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 1 & 0 & 0 & 0 & 0 & 1 \\ 0 & 1 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 0 & 0 \end{matrix}) (\begin{matrix} x_{0} \\ x_{1} \\ x_{10} \\ x_{2} \\ x_{3} \\ x_{4} \\ x_{5} \\ x_{6} \\ x_{7} \\ x_{8} \\ x_{9} \end{matrix}) \overset{\circ}{=} (\begin{matrix} ν_{⊖} (a) \\ ν_{\oplus} (a) \\ ν_{⊖} (a_{rec}) \\ ν_{\oplus} (a_{rec}) \\ ν_{⊖} (signvar) \\ ν_{\oplus} (signvar) \\ ν_{⊖} (s) \\ ν_{\oplus} (s) \\ ν_{⊖} (s_{rec}) \\ ν_{\oplus} (s_{rec}) \end{matrix})

References

Cousot, P.; Cousot, R. Systematic Design of Program Analysis Frameworks. In Proceedings of the Sixth Annual ACM Symposium on Principles of Programming Languages, San Antonio, TX, USA, 29–31 January 1979; pp. 269–282. [Google Scholar] [CrossRef]
Paulevé, L.; Sené, S. Non-Deterministic Updates of Boolean Networks. In Proceedings of the 27th IFIP WG 1.5 International Workshop on Cellular Automata and Discrete Complex Systems (AUTOMATA 2021), Marseille, France, 12–14 July 2021; pp. 10:1–10:16. [Google Scholar] [CrossRef]
Paulevé, L. Most Permissive Reaction Networks. Available online: https://loicpauleve.name/md/ak8WJ5d2TqKpmJBtP_8BaQ# (accessed on 2 September 2021).
Cousot, P.; Halbwachs, N. Automatic Discovery of Linear Restraints Among Variables of a Program. In Proceedings of the Fifth Annual ACM Symposium on Principles of Programming Languages, Tucson, AZ, USA, 23–25 January 1978; pp. 84–96. [Google Scholar] [CrossRef] [Green Version]
Granger, P. Static Analysis of Linear Congruence Equalities among Variables of a Program. In Colloquium on Trees in Algebra and Programming, Proceedings of the International Joint Conference on Theory and Practice of Software Development (TAPSOFT’91), Brighton, UK, 8–12 April 1991; Abramsky, S., Maibaum, T.S.E., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1991; Volume 493, pp. 169–192. [Google Scholar] [CrossRef] [Green Version]
Allart, E.; Niehren, J.; Versari, C. Computing Difference Abstractions of Linear Equation Systems. Theor. Comput. Sci. 2021. [Google Scholar] [CrossRef]
Allart, E.; Versari, C.; Niehren, J. Computing Difference Abstractions of Metabolic Networks Under Kinetic Constraints. In Computational Methods in Systems Biology, Proceedings of the 17th International Conference on Computational Methods in Systems Biology (CMSB 2019), Trieste, Italy, 18–20 September 2019; Bortolussi, L., Sanguinetti, G., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2019; Volume 11773, pp. 266–285. [Google Scholar] [CrossRef] [Green Version]
Niehren, J.; Versari, C.; John, M.; Coutte, F.; Jacques, P. Predicting changes of reaction networks with partial kinetic information. Biosyst. 2016, 149, 113–124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
John, M.; Nebut, M.; Niehren, J. Knockout Prediction for Reaction Networks with Partial Kinetic Information. In Verification, Model Checking, and Abstract Interpretation, Proceedings of the 14th International Conference on Verification, Model Checking, and Abstract Interpretation (VMCAI 2013), Rome, Italy, 20–22 January 2013; Giacobazzi, R., Berdine, J., Mastroeni, I., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2013; Volume 7737, pp. 355–374. [Google Scholar] [CrossRef] [Green Version]
Giacobazzi, R.; Ranzato, F.; Scozzari, F. Making abstract interpretations complete. J. ACM 2000, 47, 361–416. [Google Scholar] [CrossRef]
Nethercote, N.; Stuckey, P.J.; Becket, R.; Brand, S.; Duck, G.J.; Tack, G. MiniZinc: Towards a Standard CP Modelling Language. In Principles and Practice of Constraint Programming—CP 2007, Proceedings of the 13th International Conference on Principles and Practice of Constraint Programming (CP 2007), Providence, RI, USA, 23–27 September 2007; Bessiere, C., Ed.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2007; Volume 4741, pp. 529–543. [Google Scholar] [CrossRef]
Motzkin, T.; Raiffa, H.; Thompson, G.; Thrall, R. The double description method. In Contributions to the Theory of Games; Kuhn, H.W., Tucker, A.W., Eds.; Princeton University Press: Princeton, NJ, USA, 1953; Volume 2, pp. 51–74. [Google Scholar]
Fukuda, K.; Prodon, A. Double Description Method Revisited. In Combinatorics and Computer Science, Proceedings of the 8th Franco-Japanese and 4th Franco-Chinese Conference, Brest, France, 3–5 July 1995; Deza, M., Euler, R., Manoussakis, Y., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1995; Volume 1120, pp. 91–111. [Google Scholar] [CrossRef]
Gagneur, J.; Klamt, S. Computation of elementary modes: A unifying framework and the new binary approach. BMC Bioinform. 2004, 5, 175. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Zanghellini, D.; Ruckerbauer, D.E.; Hanscho, M.; Jungreuthmayer, C. Elementary flux modes in a nutshell: Properties, calculation and applications. Biotechn. J. 2013, 8, 1009–1016. [Google Scholar] [CrossRef] [PubMed]
Bagnara, R.; Hill, P.M.; Zaffanella, E. The Parma Polyhedra Library: Toward a complete set of numerical abstractions for the analysis and verification of hardware and software systems. Sci. Comput. Program. 2008, 72, 3–21. [Google Scholar] [CrossRef] [Green Version]
Rendl, A.; Guns, T.; Stuckey, P.J.; Tack, G. MiniSearch: A Solver-Independent Meta-Search Language for MiniZinc. In Principles and Practice of Constraint Programming, Proceedings of the 21st International Conference on Principles and Practice of Constraint Programming (CP 2015), Cork, Ireland, 31 August–4 September 2015; Pesant, G., Ed.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9255, pp. 376–392. [Google Scholar] [CrossRef]
Allart, E.; Niehren, J.; Versari, C. Reaction Networks to Boolean Networks. Available online: https://hal.archives-ouvertes.fr/hal-02279942 (accessed on 2 September 2021).
Dines, L.L. On Positive Solutions of a System of Linear Equations. Ann. Math. 1926, 28, 386–392. [Google Scholar] [CrossRef]
Miné, A. A Few Graph-Based Relational Numerical Abstract Domains. In Static Analysis, Proceedings of the 9th International Static Analysis Symposium (SAS 2002), Madrid, Spain, 17–20 September 2002; Hermenegildo, M.V., Puebla, G., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2002; Volume 2477, pp. 117–132. [Google Scholar] [CrossRef] [Green Version]
Cousot, P.; Cousot, R. Static determination of dynamic properties of programs. In Proceedings of the Second International Symposium on Programming, Paris, France, 13–15 April 1976; pp. 106–130. [Google Scholar]
Granger, P. Static analysis of arithmetical congruences. Int. J. Comput. Math. 1989, 30, 165–190. [Google Scholar] [CrossRef]
Karr, M. Affine relationships among variables of a program. Acta Inf. 1976, 6, 133–151. [Google Scholar] [CrossRef]
Fukuda, K. cddlib—An efficient implementation of the Double Description Method. 2018. Available online: https://github.com/cddlib/cddlib (accessed on 2 September 2021).
Klamt, S.; Stelling, J.; Ginkel, M.; Gilles, E.D. FluxAnalyzer: Exploring structure, pathways, and flux distributions in metabolic networks on interactive flux maps. Bioinformatics 2003, 19, 261–269. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Avis, D.; Jordan, C. mplrs: A scalable parallel vertex/facet enumeration code. Math. Program. Comput. 2018, 10, 267–302. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Evaluation in the

Σ_{a r i t h}

-structure of signs

S

.

Figure 1. Evaluation in the

Σ_{a r i t h}

-structure of signs

S

.

Figure 2. Set-valued interpretation of expressions

〚 e 〛^{σ, S} \subseteq dom (S)

.

Figure 2. Set-valued interpretation of expressions

〚 e 〛^{σ, S} \subseteq dom (S)

.

Figure 3. Interpretation of formulas

ϕ \in F_{Σ}^{} (V)

as truth values

〚 ϕ 〛^{σ, S} \in B

over a

Σ

-structure S given a variable assignment

σ : V \to dom (S)

.

Figure 3. Interpretation of formulas

ϕ \in F_{Σ}^{} (V)

as truth values

〚 ϕ 〛^{σ, S} \in B

over a

Σ

-structure S given a variable assignment

σ : V \to dom (S)

.

Figure 4. A linear equation system and the corresponding integer matrix equation.

Figure 5. The elementary mode rewriting and the corresponding matrix equation.

Figure 6. Python function approximating the integral

\int_{0}^{a} f (x) d x

for a given function

f : R \to R

.

Figure 6. Python function approximating the integral

\int_{0}^{a} f (x) d x

for a given function

f : R \to R

.

Figure 7. Sign call graph of function

I

in Figure 6 created from the sets of abstract solutions in Table 1. Solid lines correspond to abstract solutions in

h_{S} \circ s o l^{R} (ϕ_{I})

, while dashed lines correspond to unjustified abstract solutions in

s o l^{S} (ϕ_{I})

. For example,

I^{S} (1, - 1)

represents assignment

[a / 1, s / - 1]

, that is signs of

a

and

s

in calls

I (a : f l o a t, s : f l o a t)

where

a > 0

and

s < 0

. Light blue edges may be removed by improving

ϕ_{I}

so that solutions 17, 18, 19 become impossible. Computation may terminate without raising an exception in nodes surrounded by a double circle.

Figure 7. Sign call graph of function

I

in Figure 6 created from the sets of abstract solutions in Table 1. Solid lines correspond to abstract solutions in

h_{S} \circ s o l^{R} (ϕ_{I})

, while dashed lines correspond to unjustified abstract solutions in

s o l^{S} (ϕ_{I})

. For example,

I^{S} (1, - 1)

represents assignment

[a / 1, s / - 1]

, that is signs of

a

and

s

in calls

I (a : f l o a t, s : f l o a t)

where

a > 0

and

s < 0

. Light blue edges may be removed by improving

ϕ_{I}

so that solutions 17, 18, 19 become impossible. Computation may terminate without raising an exception in nodes surrounded by a double circle.

Table 1. Set of abstract solutions in

s o l^{S} (ϕ_{I})

. Six solutions with gray background color are unjustified since outside

h_{S} \circ s o l^{R} (ϕ_{I})

.

Table 1. Set of abstract solutions in

s o l^{S} (ϕ_{I})

. Six solutions with gray background color are unjustified since outside

h_{S} \circ s o l^{R} (ϕ_{I})

.

#	$raise_exception$	$do_recursion$	$a$	$s$	$a_{rec}$	$s_{rec}$
1.	0	0	0	1	−1	1
2.	0	0	1	1	0	1
3.	0	0	1	1	−1	1
4.	0	0	1	1	1	1
5.	0	1	0	0	0	0
6.	0	1	1	0	1	0
7.	0	1	0	−1	1	−1
8.	0	1	1	1	0	1
9.	0	1	1	−1	1	−1
10.	0	1	1	1	−1	1
11.	0	1	1	1	1	1
12.	1	0	−1	0	−1	0
13.	1	0	−1	−1	0	−1
14.	1	0	−1	−1	−1	−1
15.	1	0	−1	−1	1	−1
16.	1	0	−1	1	−1	1
17.	1	1	−1	−1	0	−1
18.	1	1	−1	−1	−1	−1
19.	1	1	−1	−1	1	−1

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Allart, E.; Niehren, J.; Versari, C. Exact Boolean Abstraction of Linear Equation Systems. Computation 2021, 9, 113. https://doi.org/10.3390/computation9110113

AMA Style

Allart E, Niehren J, Versari C. Exact Boolean Abstraction of Linear Equation Systems. Computation. 2021; 9(11):113. https://doi.org/10.3390/computation9110113

Chicago/Turabian Style

Allart, Emilie, Joachim Niehren, and Cristian Versari. 2021. "Exact Boolean Abstraction of Linear Equation Systems" Computation 9, no. 11: 113. https://doi.org/10.3390/computation9110113

APA Style

Allart, E., Niehren, J., & Versari, C. (2021). Exact Boolean Abstraction of Linear Equation Systems. Computation, 9(11), 113. https://doi.org/10.3390/computation9110113

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Exact Boolean Abstraction of Linear Equation Systems

Abstract

1. Introduction

1.1. Problematics

1.2. Contributions

1.3. Related Work

1.4. Outline

2. Homomorphisms on Σ -Structures

2.1. Σ -Algebras

2.2. Σ -Structures

2.3. Homomorphisms

3. First-Order Logic

3.1. Expressions

3.2. Logic Formulas

3.3. Examples

3.4. Semantic Properties of Free and Bound Variables

4. Abstract Interpretation

4.1. John’s Overapproximation for Σ -Abstractions

4.2. Exactness of Σ -Formulas for Σ -Abstractions

4.3. Soundness and Completeness of Abstract Interpretation

4.4. Galois Connection

5. Equation Systems, Positivity, and Triangularity

5.1. Classes of Equation Systems

5.2. Positivity and Triangularity

5.3. Linear Equation Systems and Elementary Modes

6. h B -Exact Rewriting of Linear Equation Systems

7. Invariance

8. h B -Exact Rewriting of h B -Mixed Systems

9. Computing Sign Abstractions

9.1. Decomposition

9.2. Positivity

9.3. Computing Sign Abstractions

10. Application to Program Analysis

11. Example for the Overapproximation of the Sign Abstraction

12. Conclusions and Future Work

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2. Homomorphisms on $Σ$ -Structures

2.1. $Σ$ -Algebras

2.2. $Σ$ -Structures

4.1. John’s Overapproximation for $Σ$ -Abstractions

4.2. Exactness of $Σ$ -Formulas for $Σ$ -Abstractions

6. $h_{B}$ -Exact Rewriting of Linear Equation Systems

8. $h_{B}$ -Exact Rewriting of $h_{B}$ -Mixed Systems