1. Introduction
Complex networks can be found everywhere, and they belong to an interdisciplinary field of science [
1,
2,
3]. Their modeling and analysis is of great importance. From a mathematical point of view, the networks appear in the theory of graphs. Topology can represent and characterize the properties of the entire network structure. A topology represents a real network and usually it is converted to either a directed or an undirected graph. The objects or events of the model are assigned to the vertices of the graph, and the edges are frequently used to describe various relations between the objects or events, e.g., dependency relations. The topological models are usually calculated based on some probabilities for large networks [
4,
5,
6,
7].
The other way to cope with the complexity of large networks is that we create a model that allows us to neglect some of the fine details of the network, but to highlight its large-scale structure, the topology. We neglect the details of the network by using the notion of subnetwork.
In this paper, we introduce the novel concept of resolvable networks, which are structures that have the following three properties: a resolvable network (1) consists of subnetworks, which are a set of nodes, and since subnetworks are sets, their inner structure is unconstrained; (2) non-overlapping subnetworks may communicate by passing messages over directed edges; and (3) subnetworks may overlap.
The inner structure of subnetworks is uninteresting from the viewpoint of resolvable networks, so we can highlight the topology. On the other hand, subnetworks may overlap, and we can use this feature to reveal the inner structure of subnetworks. However, we pay a price for that, as the graphical representation (in 2D) will be less and less intuitive if we add an increasing number of details. This is why we allow and use two different interpretations for subnetworks.
Before we detail these interpretations, let us recall the concept of Boolean satisfiability problems. From now on, we will use the short term SAT problem as it is widely used in the literature for these problems. By SAT problem, we mean the problem of propositional satisfiability for formulas in CNF (conjunctive normal form), i.e., formulas that are represented by clause sets. SAT is one of the most known NP-complete problems [
8], and it has connections to many subfields of computer science, including theoretical computer science, artificial intelligence, theory of algorithms, hardware design, and formal verification [
9].
Now, we are turning back to the possible interpretations of our subnetworks. The less constrained one is called the pessimistic interpretation of subnetworks. In this interpretation, there are no further constraints, i.e., the only constraints are those which are needed to define a resolvable network. In this interpretation, any resolvable network can be represented as a clause set, i.e., as a SAT problem. Moreover, it works also the other way around: any SAT problem can be represented by a resolvable network. This is the main property of the pessimistic interpretation of subnetworks, which allows people working on SAT to have a new graphical interpretation, a tool that helps to understand better the problem and helps to find its solution.
The main property of the pessimistic interpretation of subnetworks makes sure that we can use resolution to refine the structure of a resolvable network by adding more and more edges to it. We can also use this feature to depict any SAT problem as a resolvable network, although, the picture might be very complex as we are going to demonstrate it by some examples.
On the other hand, we discuss also the optimistic interpretation of subnetworks. In this case we have an extra constraint, inside a subnetwork each node can send a message to any other node directly or indirectly, so the subnetwork is strongly connected. In this interpretation any resolvable network can be represented as a clause set, i.e., as a SAT problem, but not the other way around; not all SAT problems can be represented by resolvable networks, if we use the optimistic interpretation. Resolution is still a valid operation in this interpretation. Furthermore, the system has the property of transitivity, which means that if we know that the subnetwork A can send messages to subnetwork B and B to C, then A can also send messages to C.
This work has several motivations. In general, since many people are visual, i.e., they can more easily obtain information from a diagram than from a text, graphical representations are important to provide better understanding of the concepts and thus also for teaching purposes.
This research is conducted as a part of the “Graph to SAT and SAT to Graph” project, (see
https://www.researchgate.net/project/Graph-to-SAT-and-SAT-to-Graph, accessed on 13 October 2021), which aims to find a suitable graph representation for the SAT problem. We believe that if we could visualize the SAT problem, then we would gain more insight on it that could help also to find more efficient solutions for them, at least in some subcases.
This paper is inspired by one of the recent papers of the above project, which introduces the so-called weak model of communication graphs [
10].
A communication graph is suitable to model, for example, a wireless sensor network. The sensors are the nodes of the communication graph. If a sensor can send a message to another one, then this fact is represented by a directed edge. Finally, each node should be labeled by a unique Boolean variable.
The weak model has some nice properties; for example, if the communication graph is strongly connected, then its weak model is a black and white SAT problem [
10]. On the other hand, the weak model is computationally very expensive, because we have to find all circuits in the input communication graph to be able to generate its weak model. We have to find all circuits, not only the elementary ones, and some graphs might have exponentially many circuits compared to the number of their nodes, just like in the case of the complete graph
, see [
11]. Therefore, our aim is to find a new digraph-based representation, which represents a digraph as a SAT problem.
In the next section,
Section 2, we overview similar works. In
Section 3, we give some basic definitions, and then we introduce the novel notion of the resolvable network in several subsections.
Section 4, among other results, shows the close relation between resolution and resolvable networks. In
Section 5, we present the pessimistic and the optimistic interpretation of subnetworks.
Section 6 presents further applications. Finally, in
Section 7, we present the conclusions and some possible future work.
3. Definitions and Basic Lemmas
In this section, first, we fix the notions and notations and recall some known facts that are needed to understand the paper. Then, in various subsections, we also give the definitions of our new concepts and we prove some facts about them.
Boolean variables are called literals; more precisely, they are positive literals. Their negations are called negative literals. Examples of literals are: .
A set of literals is called a clause. A set of them is called a clause set. Formally, a SAT problem is given by a clause set. An assignment is also represented by a set of literals. Let a set of literals be given as a clause or as an assignment. Any Boolean variable may occur in this set either as a positive literal or as a negative literal, but not as both, or it may not occur at all. Clauses are interpreted as disjunctions of their literals. Assignments are interpreted as conjunctions of their literals. Clause sets are interpreted as conjunctions of their clauses.
If an assignment or a clause contains exactly n literals, then it is an n-assignment or an n-clause, respectively. In special cases, when and in an n-clause, a unit and a binary clause are obtained, respectively. An n-SAT problem is a clause set where its clauses have at most n literals. A clause from a clause set is called a full-length clause if and only if it contains all variables of the clause set.
Some further notions will be used. Let K be a clause. Then, let denote the set of variables which occur in K; further, let denote the set of negative literals of K; and finally, let denote the set of positive literals of K. Obviously, , i.e., the clause as a set of literals, is the union of its negative and positive literals. Further, , that is, each literal of a clause, is either a negative or a positive literal, but cannot be both. Furthermore, as the positive literals appear in the clause exactly as variables.
The negation of a set L of literals is denoted by and it refers to elementwise negation, i.e., in each element of L is negated. Observe that .
Let H be the set of variables of a clause set. is said to be the white clause or the white assignment on variables H if and only if . Similarly, is the black clause or the black assignment on variables H if and only if . For instance, let , then , and . Notice that , and .
The clause Ksubsumes clause C if and only if K is a subset of C.
Further, the clause set Hsubsumes clause K if and only if there is a clause in H which subsumes K. Formally: .
The clause difference of two clauses C and D, denoted by , is defined as If , then we say that C differs from D. If , then we say that C and Ddiffers in k variables. Note that , i.e., C does not differ from C, and .
Further, , and Let the symmetric difference of two clauses C and D be defined as .
Examples: , and
Resolution can be performed on two clauses if and only if they differ in exactly one variable. If resolution can be performed, then the resolvent, denoted by , is defined as
The assignment S is said to be a solution of clause set H if and only if for all ,
Further, the clause set H is a BaW SAT problem if and only if it has exactly two solutions, the white and black assignments, i.e., and .
Now, we briefly recall some concepts of graph theory. The pair is a digraph, with the set of vertices and the set of edges . An edge is an ordered pair of vertices. If is an element of , then we may say that is an edge of .
We may also call a communication graph if the following conditions hold: for all e in , is not in ; moreover, the elements of are identified by Boolean variables, i.e., by positive literals. At communication graphs, the word node is also used as a synonym of vertex.
Now, we briefly recall a crucial property of the weak model of communication graphs from [
10]. If we have a circuit
in the input communication graph with exit points
, then this circuit is represented by the clause
.
In this paper, we are generalizing this idea as follows. If we have two subnetworks (see the next subsection for formal description) A and B, such that A consists of the nodes , and B consists of the nodes , and A can send messages to B, i.e., is an edge of the studied resolvable network, then we represent this by the same clause: . Since one edge of a resolvable network can be represented as a clause, the whole resolvable network can be represented by a SAT problem. This is the main idea of this work, and now we are ready to see the formal definition of the model.
3.1. Definition of Resolvable Networks
In this subsection, we formally define the notion of a resolvable network and also provide an example. Some basic results as lemmas are coming in some subsequent subsections.
Let be the set of vertices as it is usual in graph theory. Subnetworks are sets of vertices. Since subnetworks are sets, their inner structure is arbitrary, i.e., may not be precisely defined. A network consist of subnetworks connected by directed edges.
We say that a structure is a resolvable network if
- (1)
it consists of subnetworks;
- (2)
non-overlapping subnetworks may be connected by directed edges in it;
- (3)
its subnetworks may overlap.
More formally:
Definition 1. Let be the set of vertices. is a subnetwork if . The digraph is a resolvable network, where is the set of its subnetworks, and is the set of its edges. is the set of vertices of . An edge is an ordered pair of non-overlapping subnetworks, i.e., if is an element of , then .
If is an element of , then we may say that is an edge of , or is a reach of . The reach is depicted by .
has the property that there is no incoming edge to it. has the property that there is no outgoing edge from it. Both of them are representing the empty set, but in different roles.
If there is no edge of that contains the subnetwork A or or , then we may neglect them in graphical representations. In our graphical representations, we may use only those elements of that appear in at least one edge in .
Example: Let us have three subnetworks:
with the following two edges:
. This resolvable network is depicted in
Figure 1.
We may use the word reach as a synonym of edge to emphasize that the edge means a bit more than that subnetwork A may send messages to subnetwork B, because there are vertices in these subnetworks which realize the message passing—see the next subsection.
3.2. Interpretation of Reaches
The semantics of resolvable networks are motivated by the concept of message sending, which is a very basic operation of a network. In these semantics, vertices are called nodes, and nodes can be used as Boolean variables, similarly as in [
10] and as we have already mentioned.
The reach is interpreted as follows: The node a can send messages to node b, or in other words, we can send messages from a to b in the network. This reach can be represented by the logical formula . It is also worth noting here that, to make our description clear, we use ⇒ to represent the Boolean logical implication. Furthermore, the reach means that we can send messages from the subnetwork to , which abbreviates the fact that messages can be sent
from a to c; or
from a to d; or
from b to c; or
from b to d.
This means that at least one of the above possibilities works, but the possibility is not fixed. Thus, this reach, which is also represented by , can also be represented by the logical formula: , which is equivalent to this logical formula: , which is equivalent to this clause: .
This means that a reach can be represented by a clause, and vice versa, a clause can be represented by a reach; see later.
Definition 2 (Interpretation of a reach). The disjunctive interpretation, or for short, the interpretation of the reach , is defined as , and it reads that is the (disjunctive) interpretation of the reach .
The next lemma is essential since it enables us to represent a reach as a clause, and the other way around, to represent a clause as a reach.
Lemma 1. Let and . Assume that and are distinct sets of nodes. If and , then , i.e., formula F and the formula which is represented by clause G are logically equivalent to each other.
Proof. We know in general that . We know also that and are distinct sets of nodes. From the definition of interpretation of a reach, we know that . From these, using the basic properties of logical disjunction, we obtain that , which is represented by the clause . Hence, ; the formula F and the formula which is represented by clause G are logically equivalent to each other. □
This lemma enables us to represent a reach as a clause, and a clause as a reach, because if is a reach, then we know, by the definition of reach, that A, B are distinct sets, and in this case we have
For example, the reach can be represented by the clause , and the other way around: the clause can be represented by the reach .
Note that the clause is represented by the reach , and the other way around: a reach with represents a clause with only positive literals of .
Furthermore, the clause is represented by the reach , and the other way around: a reach with represents a clause with only negative literals of .
In addition, as a special case, the empty clause is represented by the reach , and vice versa.
3.3. The Double Meaning of ‘Reach’: The Noun and the Verb
The word ‘reach’ has two distinct types: it can be a noun or a verb. In this paper, we may use this word in both of its roles. However, these are not independent. In this subsection, we give some details on that.
If we have two reaches and , then the following question arises: Can we send messages from the subnetwork A to subnetwork C, or in other words, can we reach from A to C? This question shows how we use the verb meaning of the word reach to describe the expressiveness of resolvable networks. Thus, we can use the word reach as a noun, such as in “ is a reach”, and also as a verb, such as “from A we can reach C”.
Now, to come back to our research, let us see how can we deal with the above question: If we have two reaches
and
, then can we reach from
A to
C? The answer to this question, in general, is that it depends on
B or on its inner structure. If
B has only one node, then the answer is definitely yes. Otherwise, it can be either no or yes. If inside
B the nodes cannot send messages to each other, then the answer is probably no. If inside
B we find a strongly connected communication graph, see [
10], then the answer is yes. See also the pessimistic and the optimistic semantics of subnetworks. We will come back to this point in
Section 5.
In general, we have no knowledge of what is inside a subnetwork, but if we have two subnetworks, A, B, then we know that one of the following pairs of statements is true: either A, B are distinct or not; if they are distinct, then either is a reach or not, or is a reach or not. If they are not distinct, then neither is a reach, nor is a reach, but they overlap, so they have some common nodes. These structures might help us to find out the inner structure of a subnetwork using the clause representation of reaches and performing resolution over them.
Some of the reaches could be in a relation that helps us to discover some new relations among them. The following relation of the reaches plays a crucial role.
Definition 3 (Basic transitivity in a resolvable network). We say that two reaches and of a resolvable network are transitive if , and in this case, we say that from A we can reach C in the resolvable network N.
3.4. Interpretation of Resolvable Networks
In this subsection, we show that any resolvable network can be interpreted as a clause set. A resolvable network consist of reaches, and each reach can be represented by a clause. The question is what the interpretation is if we have two reaches , and . How should we interpret the relation of those? There are two obvious ways to connect them: either by ‘and’ or by ‘or’. Thus, the question is as follows. Does having both of the reaches and mean that A can reach B and C? Or, does this mean that A can reach B or C?
In the first case, it is very easy to create the SAT representation of a resolvable network, because between two reaches we have conjunction just like in the case of the SAT problem, between two clauses we have conjunction, and a reach can be represented as a clause, as we learned from Lemma 1.
In the second case, we have disjunction between two reaches, but still, using the ideas of the weak model of communication graphs, see [
10], we could create a meaningful logical representation. However, in this paper, we choose and work with the first possibility, and thus, we are ready to see what logical formulae are representing the resolvable networks.
Definition 4 (Interpretation of a resolvable network). The conjunctive interpretation, or for short, interpretation, of the resolvable network is defined as , and it reads that is the (conjunctive) interpretation of the resolvable network .
The following lemma states that any resolvable network can be represented by a SAT problem.
Lemma 2. Let be a resolvable network. Let . Let . Then, , i.e., formula F and the formula which is represented by clause set S are logically equivalent to each other.
Proof. By definition of interpretation of a resolvable network, we know that . By definition of a reach, we know that this can be written as . From this, by Lemma 1, we obtain that . From this, using the interpretation of a clause set, we obtain that F can be represented by the clause set . Hence, , i.e., formula F and the formula which is represented by clause set S are logically equivalent to each other. □
Example:
Let us take the resolvable network in
Figure 1, where we have three subnetworks:
, and two reaches:
. This resolvable network is represented by the following clause set:
.
The following lemma states that any SAT problem can be represented by a resolvable network. Before the lemma, we define an auxiliary function.
Definition 5. We define the function which creates a reach out of a clause, where the ’from’ part is created from the negative literals, and the ’to’ part is created from the positive literals. More formally, , where , if , otherwise , and , if , otherwise .
Examples:
.
.
.
.
Now, we are ready to state our lemma.
Lemma 3. Let S be a clause set. Let Let Let . Then,
Proof. The key idea of the proof is that if is a clause, then is the set of its negative literals, is the set of variables of its negative literals, and is the set of its positive literals. Note that . Furthermore, and are distinct sets. If we use Boolean variables as vertices, then is a reach, so we can use the function from Definition 5. From this, by Lemma 1 we have that . We know that and , since for any we know that is a reach, and we know that is a resolvable network. From this, being Lemma 2, we obtain that . □
We have proven that, in fact, SAT problems and resolvable networks are equivalent, i.e., any SAT problem can be represented by a resolvable network and vice versa. These results allow us to transfer some of the well-known and widely used concepts to our resolvable networks including, e.g.,
solution. We can also say that a resolvable network is
satisfiable. (It is if the represented SAT problem is satisfiable.) We provide also the formal definition of this latter notion:
Definition 6 (Satisfiable resolvable network). We say that resolvable network is satisfiable if and only if is satisfiable.
Now, as we have concluded that based on the results presented in the lemmas of this section, we are able to represent SAT problems by resolvable networks, we are presenting some examples.
Our examples correspond to the well-known pigeon hole problem [
19]. The pigeon hole problem with
m pigeons and
n holes is the problem to place
m pigeons in
n holes without placing two pigeons in the same hole. Depending on the values of
m and
n, the problem may have or may not have solutions, i.e., the formula representing the problem may or may not be satisfiable.
We use the DIMACS format to provide the examples:
c a pigeon hole problem with 2 pigeons and 1 hole
p cnf 2 3 % a problem with 2 variables and 3 clauses
1 0 % pigeon1 sits in hole1
2 0 % pigeon2 sits in hole1
-1 -2 0 % pigeon1 and 2 cannot sit in hole1 at the same time
There are two Boolean variables used in this formula; we refer to them as 1 and 2. The 1 is true if and only if the pigeon1 sits in the hole (we have only one hole in this example), and 2 is true if and only if pigeon2 sits in the hole. There are three clauses (the three bottom lines of the DIMACS description) describing our aims:
pigeon1 must sit somewhere, i.e., it should sit in the hole.
pigeon2 must also sit somewhere, i.e., it should sit in the hole.
there is only one hole; it is a place only of at most one pigeon, i.e., it could not happen that both pigeons are sitting there.
The problem can be written also as a SAT problem represented by the formula
; moreover,
Figure 2 shows the corresponding resolvable network:
.
To answer the question of whether the represented SAT problem is satisfiable or not, we have to check whether any message from
goes to
, or if the message can be lost. If the message can be lost, then the SAT problem is satisfiable, otherwise not. In
Figure 2, there are two outgoing edges from
, the blue ones. Each node broadcasts its message and sends it over using all of its outgoing edges. This means that the message from
arrives at 1 and 2. A subnetwork waits until all of its nodes get the message, then it broadcasts the message. In this case, we have the subnetwork
, since both nodes get the message from
it sends to its children, which is in this case the
. Since the message arrived from
to
, the represented SAT problem, this resolvable network is unsatisfiable.
In our next example, we have pigeons and holes. The DIMACS representation of the problem is shown below:
c a pigeon hole problem with 2 pigeons and 2 holes
p cnf 4 4 % a problem with 4 variables and 4 clauses
1 2 0 % pigeon1 sits in hole1 or in hole2
3 4 0 % pigeon2 sits in hole1 or in hole2
-1 -3 0 % pigeon1 and 2 cannot sit in hole1 at the same time
-2 -4 0 % pigeon1 and 2 cannot sit in hole2 at the same time
The problem is described by four variables, i.e.,
1 is representing if pigeon1 sits in hole1;
2 is representing if pigeon1 sits in hole2;
3 is representing if pigeon2 sits in hole1; and finally,
4 is representing if pigeon2 sits in hole2.
The four clauses refer to the four bottom lines of the DIMACS representation, representing that both pigeons need to sit somewhere, and in any hole there could not be more than one pigeon. The corresponding SAT problem can be written by the following formula:
.
Figure 3 shows the resolvable network of the problem with four reaches corresponding to the above-mentioned clauses.
To answer the question of whether
Figure 3 represents a satisfiable SAT instance or not, we have to check whether a message from
might be lost or not.
has two children,
and
. This means that one of these pairs of nodes gets the message:
, or
. In the first and in the last cases, the message is delivered to
, since we have two subnetworks
, and
which can send message to
, but in the second and third cases, the message cannot be delivered. Therefore, this problem represents a satisfiable SAT instance. Indeed, the solutions for the model of the formula of the problem are the assignments
and
. They mean that either pigeon1 sits in hole1 and pigeon2 in hole2, or pigeon1 sits in hole2 while pigeon2 sits in hole1; the problem is clearly solvable with these conditions.
Finally, let us take an example which corresponds to the pigeon hole problem with three pigeons and two holes. The DIMACS format of the example is as follows:
c a pigeon hole problem with 3 pigeons and 2 holes
p cnf 6 9 % a problem with 6 variables and 9 clauses
1 2 0 % pigeon1 sits in hole1 or in hole2
3 4 0 % pigeon2 sits in hole1 or in hole2
5 6 0 % pigeon3 sits in hole1 or in hole2
-1 -3 0 % pigeon1 and 2 cannot sit in hole1 at the same time
-1 -5 0 % pigeon1 and 3 cannot sit in hole1 at the same time
-3 -5 0 % pigeon2 and 3 cannot sit in hole1 at the same time
-2 -4 0 % pigeon1 and 2 cannot sit in hole2 at the same time
-2 -6 0 % pigeon1 and 3 cannot sit in hole2 at the same time
-4 -6 0 % pigeon2 and 3 cannot sit in hole2 at the same time
This SAT problem is depicted in
Figure 4. We use colors because there are many edges. For example, the light green edge connects the light green subnetwork to
.
We do not reveal any secrets in recalling that the pigeon hole problem is solvable if and only if the number of pigeons is not greater than the number of holes. In the usual logical representation, the problem with pigeons and n holes is considered with some value of n, as we have also shown examples of that type with and .
3.5. Equivalence of Resolvable Networks
A resolvable network is, in fact, interpreted as a formula of the SAT problem.
Definition 7 (Equivalence of resolvable networks). We say that two resolvable networks are equivalent if and only if their interpretations are equivalent. More formally, , where N and are resolvable networks.
We expand the definition of the relation of reach to subnetworks as follows.
Definition 8 (Subnetwork can reach subnetwork). We say that subnetwork A can reach subnetwork B in resolvable network N if and only if the interpretation of the reach is a logical consequence of the interpretation of N, i.e.,
Definition 9 (Refined resolvable network). Let be a resolvable subnetwork. If subnetwork A can reach subnetwork B in N, then is a refined resolvable network of N. We also overload the operator to abbreviate the above construction: If A can reach B in N, then , where . In this case, we say that is a refined resolvable network of N.
Lemma 4. If subnetwork A can reach subnetwork B in resolvable network N, then and N are equivalent.
Proof. From the assumption that A can reach B in N, we know by Definition 9 that is a refined resolvable network of N. From the same assumption, we know by Definition 8 that interpretation of the reach is a logical consequence of the interpretation of N. From this, by the well-known fact that , we know that , i.e., by applying Definition 7, they are equivalent resolvable networks. □
3.6. On the Scalability of the Method
On the one hand, as usual, visual techniques, e.g., Venn diagrams have strict limitations, i.e., they work well and are really useful for better understanding and teaching if the represented example/problem has a relatively small size. On the other hand, visual techniques could help to find new techniques and methods which may also work on large(r) size problems. As we know, industrial-sized SAT problems might have millions of clauses and some hundreds of thousands variables. We cannot visualize such a big SAT instance in a meaningful way, except if we perform some simplifications. Thus, in this subsection, we investigate the question of how to create a bird’s eye view perspective of big SAT instances.
One of the options is to split the set of nodes into two subsets, take them as subnetworks, abstract away the inner structure of these two subnetworks, and show the relations between them.
Since it is not easy to follow a raw text, we give this bird’s perspective algorithm in a more structured way:
Create two almost equal size distinct subnetworks, say A and B, such that each node belongs either to A or B.
Show the two subnetworks without any inner structures without detailing their elements by abstracting away each clause C which has the property or .
If there is a clause C such that and , then draw an edge from A to B.
If there is a clause C such that and , then draw an edge from B to A.
If there is a clause C such that or , then create an additional subnetwork D which intersects both A and B.
The last step could be more detailed. We could depict in different ways the cases if , or , or , and so on. It is the subject of further investigations which are the interesting cases.
One of the interesting cases is the following: If S is a satisfiable SAT instance and M is one of its solutions, then we can choose A and B in this way: , and . In this way, we will see that there is either an edge from A to B, or a subnetwork which intersects both A and B, or both, or neither of them, but there will be no edge from B to A. There will be no edge from B to A because the corresponding clause would not be satisfied by the solution M; see also Proposition 1.
Proposition 1. Let S be a satisfiable SAT problem. Let M be a solution of S. Then, S does not subsume .
Proof. This is a special case of the Lemma 4.3.1. case (b) in [
20]. □
We show this kind of bird’s perspective created from the well-known SAT instance, called:
. This problem can be downloaded from the SATLIB - Benchmark Problems; see
https://www.cs.ubc.ca/~hoos/SATLIB/Benchmarks/SAT/RND3SAT/uf20-91.tar.gz, accessed on 13 October 2021. We only show the first few lines of this problem since it has 91 clauses, but our method can be explained already based on these data.
c This Formula is generated by mcnf
c
c horn? no
c forced? no
c mixed sat? no
c clause length = 3
c
p cnf 20 91
4 -18 19 0
3 18 -5 0
-5 -8 -15 0
-20 7 -16 0
10 -13 -7 0
-12 -9 17 0
This problem is satisfiable, and one of its solutions is
; thus, we should choose A, B as follows: and .
Then, we get the bird’s perspective shown in
Figure 5. We can see that there is an edge from
A to
B (see step 3 of the bird’s perspective algorithm) because
contains, for example, the clause
(all its literals appear in the solution and it has both positive and negative literals). There is also a subnetwork which intersects
A and
B (see step 5 of the bird’s perspective algorithm) because of, for example, the clause
: here, we have a subnetwork
which has an intersection with both
A and
B.
There are also other options to obtain a simplified view on a SAT problem. For example, we can show only those clauses which have exactly negative literals and exactly positive literals. For example, if we have a 3-SAT problem and first we show the layer where and , then the layer and , then the layer where and , and finally the layer with and , then we could see some structure of the problem. This possibility is not investigated in this work.
In this subsection, we showed that not only small-sized problems can be handled by our model, but we may also have the chance to perform a kind of hierarchical modeling for large-sized problems; however, some of the technical details are left for a future study. Evidently, by making abstractions, some information is lost; however, for that price we can deal with large-sized problems.
7. Conclusions and Future Work
We have introduced a novel data structure, the resolvable network. In a sense, it is a generalization of digraphs, where a vertex represents a subnetwork. The inner structure of such a subnetwork is not constrained by the definition of resolvable network, but some usages might introduce constraint on them, just like in case of our optimistic interpretation of subnetworks. Subnetworks might overlap (there could be two different subnetworks such that they share some nodes), unlike digraphs (where distinct vertices could not have any common parts since they could be mapped to singleton sets). However, subnetworks are allowed to connect by edges only if they do not overlap, somehow inheriting that “non-overlapping” property of digraphs. Since digraphs are so widely used, we hope that this generalization will also be useful in some other areas of mathematics and modeling.
Our main result is the following. If we use the pessimistic interpretation of subnetworks, then any resolvable network can be represented by a SAT problem, and any SAT problem can be represented by a resolvable network. Thus, we can say that we have found an immediate connection between the SAT problems and their proposed graphical representations. This tight relationship between the SAT problems and resolvable networks allows us to enrich both sides by notions from the other side. For example, we can define the notion of satisfiable resolvable network (see Definition 6), or we can use resolution to obtain a refined resolvable network (see Definition 9 and Theorem 2). This bi-directional property is novel when comparing our new model to the existing ones mentioned in
Section 2.
Note that in the new representation, i.e., in resolvable networks, an edge means a logical consequence relation. As we saw, an edge from subnetwork to subnetwork is represented by the clause , which is interpreted as . This is equivalent to , i.e., an edge in a resolvable network can be interpreted as logical consequence. We believe that logical consequence is a natural way to understand the role of the edges of graphs representing logical formulae.
In
Section 2, we reviewed some other widely known graph representations of the SAT problem. As we have seen, an edge does not represent a logical consequence in the case of AIG, BDD, and ZDD; thus, although they are good for technical analysis, they do not support human understanding. On the other hand, one can represent each SAT problem by using these approaches, similarly to our new model. In the case of implication graphs, an edge means an implication relation, which is easy to follow, similarly to our proposed solution, but implication graphs can represent only 2-SAT problems, and not all SAT problems can be represented. Our new representation, the resolvable network, offers both benefits: an edge represents logical consequence, and, as we have already recalled, any SAT problem can be represented by a resolvable network. Based on the above-described reasons, we believe that our model contains more information about the nature of the SAT problems than other representations, and thus, further analysis of the model could bring real help and also more knowledge about the SAT problems and their solvability.
We studied some concrete examples of how to solve some SAT instances. Those examples suggest that we can solve the SAT problem without unit-propagation or resolution, which are the most basic techniques to solve SAT. To brew a generic SAT solver algorithm based on resolvable networks is our next goal.
Another direction of research, we should mention here, is based on the fact that in this work, we have defined and used the notion of disjunctive interpretation of reaches and the notion of conjunctive interpretation of resolvable networks. We can define alternative meaningful interpretations as follows. The conjunctive interpretation of the reach is defined as follows: . The disjunctive interpretation of the resolvable network is defined as . This later one is very interesting because in this case, we have disjunction between two reaches, so we could use the ideas of the weak model of communication graphs to create another SAT problem representation of resolvable networks, which might inherit the nice properties of the weak model of communication graphs.
In this work, each reach can be represented by a clause because we allow communication only of non-overlapping subnetworks; see “(2) non-overlapping subnetworks may communicate by message passing over directed edge” in the definition of resolvable networks. If we also allow communication of overlapping subnetworks, then the reach is represented by a tautology if . This does not change the property that each SAT problem can be represented as a resolvable network. However, those reaches of a resolvable network which describe communication of overlapping subnetworks cannot be represented as a clause.
Finally, we hope to report usages from the field of biology. Subnetworks might represent tissues or cells and reaches might represent communication of them.