1. Introduction
The interaction between modal logic and Aristotelian diagrams has a rich and well-documented history. The oldest kind of Aristotelian diagram is the square of opposition for syllogistics, which dates back to the second century CE [
1]. However, from the twelfth century onwards, philosophers started to make use of these diagrams to explicate their theorizing on modalities as well [
2,
3,
4,
5]. Furthermore, historical scholarship has shown that Aristotelian diagrams for modal logic can be reconstructed from the works of many earlier authors, such as Theophrastus [
6,
7], Chrysippus [
8,
9], and Avicenna [
10,
11]. Today, Aristotelian diagrams not only appear in well-known textbooks on modal logic [
12,
13], but they are also used in applications of modal logic to a variety of philosophical and logical topics, such as paraconsistency [
14,
15], logic-sensitivity [
16,
17,
18], and theories of truth [
19,
20]. It should be emphasized that all of these applications manifest a wide diversity of Aristotelian diagrams: next to the “ordinary” squares of oppositions, we also find several types of hexagons, octagons, and even more complex diagrams [
21].
In all of this work, Aristotelian diagrams are used to visualize the logical relations holding among a number of modal statements. In this paper, however, we take a completely different approach, as we will be using a modal language to describe Aristotelian diagrams. This boils down to “reversing the directionality” of the interaction between modal logic and Aristotelian diagrams. Whereas the aforementioned work was mainly concerned with Aristotelian diagrams for modal logic, in this paper, we rather study the modal logic of Aristotelian diagrams.
More concretely, we start from the basic observation that Aristotelian diagrams are graphs, i.e., relational structures consisting of vertices (viz., formulas) and edges (viz., Aristotelian relations) between them. Consequently, we can view these diagrams as Kripke frames, which allows us to interpret, i.e., to provide a relational semantics for, a modal language. (For reasons of expressivity, we will actually interpret a language of hybrid logic, rather than one of “basic” modal logic.) We can then express formulas in this modal language that correspond exactly with properties of Aristotelian diagrams that we are interested in. For example, we will exhibit a modal formula
that precisely captures the property of being a classical square of opposition: an Aristotelian diagram is a classical square if and only if
is valid on that diagram (considered as a Kripke frame). Similarly, the modal formulas
and
capture the properties of being a degenerate square and being a square of opposition “in general”, respectively. (Precise definitions of classical squares, degenerate squares, and squares in general are provided later in the paper.) We can take this idea of viewing Aristotelian diagrams as Kripke frames one step further, which gives rise to the modal consequence relation
. Many results from logical geometry (i.e., the systematic study of Aristotelian diagrams) can now be formally expressed as AD-valid arguments. Consider, for example, the well-known fact that every square of opposition is either a classical square or a degenerate square: this is formally captured by the argument
. (It should be noted that this approach was partially inspired by the work of Grossi [
22,
23]: he proposed to view argumentation frameworks as Kripke frames, which then allowed him to study abstract argumentation theory using modal logic).
There are several complementary reasons for studying the modal logic of Aristotelian diagrams from the perspective of logical geometry as well as from that of modal logic itself. First of all, it nicely illustrates the expressive power of modal—and in particular, hybrid—languages to describe relational structures. Secondly, even though this is not the main concern of this paper, we will show how a sound and strongly complete axiomatization of
can be obtained almost immediately from general results about hybrid logic. Thirdly, based on the intuitive idea of “viewing an Aristotelian diagram as a Kripke frame”, we formally define the notion of an AD-frame. We will prove that this definition completely captures the structural properties of Aristotelian diagrams, thus shedding new light on the very nature of these diagrams. Fourthly, and perhaps most importantly, the modal language developed here can be used to formulate generic descriptions of families of Aristotelian diagrams. For example, in order to define the family of all classical squares of opposition, we usually refer to an arbitrary logical system (or in more algebraically oriented parts of logical geometry, to an arbitrary Boolean algebra). Such a definition is indeed
general (because it works for all logical systems), but it is not
generic (because it still refers to some logical system—regardless of whether that system was chosen arbitrarily or not). For most purposes, such a general definition is appropriate, but there also exist certain theoretical contexts where we really want to work with generic descriptions of Aristotelian families (that do not refer to any logical system at all) [
24]. Using the modal language developed in this paper, we can define the family of classical squares of opposition as consisting of precisely those diagrams which (when viewed as Kripke frames) validate the formula
. Since this characterization does not refer to any logical system at all, the modal formula
provides a fully generic description of the family of classical squares of opposition.
The paper is organized as follows.
Section 2 provides some basic background information on Aristotelian diagrams in order to keep this paper self-contained.
Section 3 introduces all the key components of AD-logic, including the modal language
, the AD-frames that this language will be interpreted on, and the consequence relation
. In
Section 4, we prove that there exists a bijection between finite Aristotelian diagrams and finite AD-frames (up to isomorphism), which shows that AD-frames completely capture the structural properties of Aristotelian diagrams.
Section 5 shows how AD-logic allows us to express several key results from logical geometry (in particular, characterizations of and relations between various families of Aristotelian diagrams). In
Section 6, we demonstrate how AD-logic can be used to construct interesting new Aristotelian diagrams, which simultaneously contain certain formulas (when viewed as Aristotelian diagrams) and validate those same formulas (when viewed as AD-frames). Finally,
Section 7 wraps things up, and mentions some avenues for further research.
2. The Basic Building Blocks of Logical Geometry
Aristotelian diagrams can be defined in several ways [
25]. The most general definition is in terms of Boolean algebras, but for the purposes of this paper, it suffices to focus on Aristotelian diagrams relative to logical systems. We first introduce the Aristotelian relations in Definition 1 and then move on to the diagrams themselves in Definition 2.
Definition 1 (Aristotelian relations).
Let be a logical system, which is assumed to have Boolean operators and a model–theoretic semantics . The formulas are said to be-contradictory | iff | | and | , |
-contrary | iff | | and | , |
-subcontrary | iff | | and | , |
in -subalternation | iff | | and | . |
These relations are abbreviated as , , , and . Definition 2 (Aristotelian diagram). Let be a logical system, as shown in Definition 1, and let be a non-empty fragment of formulas. An Aristotelian diagram for is a vertex- and edge-labeled graph: its vertices are labeled by the formulas from , while its edges are labeled by the Aristotelian relations holding between those formulas (relative to ). Furthermore, the fragment is required to satisfy the following conditions:
- 1.
for every , there is a such that ,
- 2.
for every , there is no such that ,
- 3.
there is no such that or .
Figure 1 shows two basic examples of Aristotelian diagrams, viz., classical squares of opposition for
and for
, where
is the system of classical propositional logic, and
is the system of modal logic that is interpreted on Kripke frames with a serial accessibility relation [
13].
Definition 2’s three conditions on
are motivated by various theoretical and cognitive considerations [
26]. The vast majority of diagrams that are found in the extant literature indeed satisfy these conditions. In recent theoretical work on Aristotelian diagrams [
27], these conditions are sometimes dropped. However, for the purposes of this paper, we stick to the more traditional approach, and do include these conditions in our definition. Finally, note that because of conditions 1 and 2, we can always write the elements of a finite Aristotelian diagram for
(up to
-equivalence) as
, which shows that
is an even number.
The notion of Aristotelian isomorphism, which was first introduced in [
24,
28], is of crucial importance in logical geometry, because it allows us to start studying Aristotelian diagrams as objects of independent interest. More concretely, Aristotelian isomorphisms enable us to make abstraction of the specific formulas and logical system that define a given Aristotelian diagram, and thus, to say that two diagrams are “essentially the same”, despite being built from completely different fragments and logical systems.
Definition 3 (Aristotelian isomorphism). Consider Aristotelian diagrams D for and for . An Aristotelian isomorphism from D to is a bijection such that, for all and all Aristotelian relations , it holds that iff .
Upon visual inspection, it is immediately clear that function
f, defined by
,
,
and
, is an Aristotelian isomorphism between the two Aristotelian diagrams shown in
Figure 1. For example, we have
and
, i.e.,
.
4. The Relation between AD-Logic and Aristotelian Diagrams
At this point, there might seem to be a subtle mismatch between the informal motivation for this paper, as laid out in
Section 1, and the technical development of AD-logic in
Section 3. After all, the formulas of
are interpreted on AD-frames and AD-models (see Definitions 9–12), whereas the fundamental intuition was that we wanted to interpret such formulas on the Aristotelian diagrams
themselves. The distinction is a subtle one, as AD-frames and Aristotelian diagrams are clearly related to each other; for example, the notations
and
for the relations in an AD-frame are highly suggestive, and the same holds for the parallel observations regarding even numbers of worlds/formulas in finite AD-frames/Aristotelian diagrams. In this section, we will close this gap altogether. We first introduce the notion of an AD-frame being “based on” a given Aristotelian diagram, and then show that there exists a bijection between finite Aristotelian diagrams (up to Aristotelian isomorphism) and finite AD-frames (up to modal isomorphism). (The restriction to finite diagrams/AD-frames is needed for technical reasons; however, this restriction is not very significant, since all Aristotelian diagrams and AD-frames that we will be working with later in the paper are finite anyway).
4.1. AD-Frames Based on Aristotelian Diagrams
We start by introducing the idea of an AD-frame being based on an Aristotelian diagram. We first formulate this in terms of arbitrary birelational Kripke frames, and then prove that all such frames are indeed AD-frames (see Lemma 3).
Definition 13 (Birelational Kripke frame based on an Aristotelian diagram). Consider an Aristotelian diagram D for . A birelational Kripke frame is said to be based on D iff there exists a bijection such that for all :
- 1.
iff ,
- 2.
iff .
Note that all components of the Kripke frame
F and the Aristotelian diagram
D play key roles in Definition 13: the sets
W and
are the domain and codomain of the bijection
h, while conditions 1 and 2 specify a correspondence between the relations
and
of the Kripke frame
F and the actual Aristotelian relations that hold among the formulas of
D, relative to the logical system
(see Definition 1). Additionally, note that because of its bijective nature and the presence of conditions 1 and 2, the function
h from Definition 13 looks very much like an Aristotelian isomorphism (see Definition 3); the only difference is that
h connects an AD-frame with an Aristotelian diagram, whereas an Aristotelian isomorphism is defined between two Aristotelian diagrams. (One point of difference is that
h is concerned with just
two relations,
and
, whereas an Aristotelian isomorphism is concerned with all
four Aristotelian relations. However, this difference is very minor, since
and
can be defined in terms of
and
; see Lemmas 2 and 3 of [
26]).
We now prove that if a birelational Kripke frame is based on an Aristotelian diagram, then it is an AD-frame. This is exactly as it should be, and justifies our terminological choice for the word “AD-frame”.
Lemma 3. Let be a birelational Kripke frame that is based on an Aristotelian diagram D for . Then F is an AD-frame.
Proof. Since F is based on D, there exists a bijection that satisfies the two conditions mentioned in Definition 13. In order to show that F is an AD-frame, it suffices to check that it satisfies all the conditions of Definition 5. All these conditions follow, via the bijection h, from Definitions 1 and 2. For example, is symmetric because the Aristotelian relation of -contrariety is symmetric; in detail: for all , we have iff iff [ and ] iff [ and ] iff iff .
For a more elaborate example, we check that condition 4(c) of Definition 5 holds. Consider arbitrary worlds and suppose that and . It follows that and and . By the definition of and , it follows that and and , and hence, by transitivity, , i.e., (†). Furthermore, it follows from that and, hence, there exists an -model M such that and , i.e., . From and , it follows that and and, hence, by transitivity, . Since , it follows that , i.e., . Since and , it follows that and, hence, . From (†) and (‡), it follows that and, hence, , as required. □
We are now ready to entirely close the gap between Aristotelian diagrams and AD-frames. In particular, validity on an Aristotelian diagram D can straightforwardly be defined in terms of validity on all AD-frames that are based on D (see Definition 10).
Definition 14 (Validity on an Aristotelian diagram). The formula is said to be valid on an Aristotelian diagram D, written as , iff for every AD-frame F that is based on D, it holds that .
This notion of “diagrammatic validity” will play a crucial role in
Section 5 and
Section 6 later in the paper. However, in the remainder of the current section, we will first investigate the relationship between Aristotelian diagrams and AD-frames in more detail.
4.2. From Aristotelian Diagrams to AD-Frames
We have just introduced the notion of a birelational Kripke frame being based on a given Aristotelian diagram, and shown that such frames are always AD-frames. However, there are two questions that have not been answered thus far. First of all, given an Aristotelian diagram D, we can ask whether there always exists an AD-frame based on D. Secondly, we can ask to what extent the AD-frames based on a given Aristotelian diagram are unique. In order to provide an affirmative answer to the first question, we define the canonical frame for a given Aristotelian diagram D.
Definition 15. Consider an Aristotelian diagram D for . The canonical frame for D is the birelational Kripke frame , where , and .
Lemma 4. Consider an Aristotelian diagram D for . Then the canonical frame is based on D, and is thus an AD-frame.
Proof. To see that is based on D, it suffices to note that the identity function has all the required properties to fulfill the role of h in Definition 13. By Lemma 3, it immediately follows that is an AD-frame. □
Since an Aristotelian diagram
D for
yields an AD-frame
“via” the identity function
, there is a potentially confusing situation that needs our careful consideration. After all, given a formula
and a formula
of AD-logic, it now makes sense to ask whether (for any valuation
V on
and any assignment
g on
) we have
, or, using a common abbreviation, whether
. However, this latter expression does
not mean that we are dealing with a valid argument with a single premise
and conclusion
, but rather, that the formula
is true at the “formula-viewed-as-a-world”
(recall our notational convention regarding metavariables for
and
). For example, given the classical square of opposition for
that was already shown in
Figure 1b, we can say that
, since
and
and
. Furthermore, note that
does
not mean that the formula
is contrary to the formula
: this would involve a category mistake, since
belongs to the basic modal language
, whereas
belongs to
. (Formally speaking, there is no overarching logic that
and
both belong to, and with respect to which they could be said to be contrary to each other.)
We now turn to the second question about AD-frames based on a given Aristotelian diagram. It is easy to see that two distinct AD-frames can be based on the same Aristotelian diagram. For example, the three frames (i)
, (ii)
and (iii)
are all nominally distinct from each other, yet all three are based on the diagram for
(recall that
is short for classical propositional logic). We thus cannot hope for there to be a
unique AD-frame based on a given Aristotelian diagram; however, as the example above already suggests, we can show that AD-frames based on a given Aristotelian diagram are unique
up to modal isomorphism. In order to formulate this correctly, we first recall the notion of modal isomorphism (which is entirely standard from modal logic; see [
40], p. 58) in Definition 16. Furthermore, we will actually prove a slightly stronger result: if
and
are two Aristotelian diagrams that are Aristotelian isomorphic to each other, and
and
are AD-frames that are based on resp.
and
, then
and
are modally isomorphic to each other.
Definition 16. Consider AD-frames and . A modal isomorphism between these two frames is a bijection such that for all , it holds that iff and iff .
Lemma 5. Consider Aristotelian diagrams for and for , and consider an AD-frame based on and an AD-frame based on . If and are Aristotelian isomorphic, then and are modally isomorphic.
Proof. For , since is based on , there exists a bijection that satisfies the requirements of Definition 13. Furthermore, since and are Aristotelian isomorphic to each other, there exists a bijection that satisfies the requirements of Definition 3. We now show that is a modal isomorphism.
First of all, note that since
is a bijection, the function
exists and is a bijection as well. Furthermore, instead of saying that for all
,
iff
(see Definition 13), we can equivalently say that for all
,
iff
. (Exactly the same remark applies to
and
.) The function
is a composition of three bijections and is itself thus also a bijection. Furthermore, for all
, we have the following chain of equivalences:
| | |
iff | | (def. of functional composition) |
iff | | (by the remark about above) |
iff | | (since f is an Arist. isomorphism) |
iff | | (by the properties of ). |
In exactly the same way, we can show that
iff
. This shows that
is a modal isomorphism. □
We are now in a position to define a function from finite Aristotelian diagrams (up to Aristotelian isomorphism) to finite AD-frames (up to modal isomorphism). We introduce some auxiliary notions in Definition 17, and the actual function in Definition 18. It is quite easy to show that is injective, which we do in Lemma 6.
Definition 17. For each Aristotelian diagram D and AD-frame F, we define
,
.
Furthermore, we define
,
.
Definition 18. The function is defined by putting , for every . Note that this is a well-defined function because of Lemmas 4 and 5: if and are Aristotelian isomorphic, then and are modally isomorphic.
Lemma 6. The function λ is injective.
Proof. Consider arbitrary Aristotelian diagrams
for
and
for
and suppose that
, i.e.,
and
are modally isomorphic. We show that
, i.e.,
and
are Aristotelian isomorphic. Since
and
are modally isomorphic, there exists a modal isomorphism
. For all
, we have
| iff | | (by Definition 15) |
| iff | | (by Definition 16) |
| iff | | (by Definition 15). |
Similarly, we can show that
iff
. Furthermore, since subcontrariety and subalternation can be defined in terms of contradiction and contrariety (recall Lemmas 2 and 3 from [
26]), we also get
iff
and
iff
. This shows that
f is an Aristotelian isomorphism between
and
, as desired. □
Note that the proof of Lemma 6 does not rely on the fact that and contain only finite Aristotelian diagrams and AD-frames. This stands in sharp contrast with the proof of Lemma 8 below, which deals with the surjectivity of .
4.3. From AD-Frames to Aristotelian Diagrams
In this subsection, we finish our investigation of the relation between AD-frames and Aristotelian diagrams, by showing that the function
defined above is not only injective, but also surjective. This proof will be a bit more involved, so we first introduce the auxiliary notion of (maximal)
F-consistency in Definition 19 and prove Lemma 7. The latter is a Lindenbaum-type lemma, which states that every
F-consistent set can be extended to a maximally
F-consistent set. Finally, it bears emphasizing that the proof of Lemma 8 is closely related to the algorithm described in
Section 4 of [
24].
Definition 19. Let be an AD-frame. A set is said to be F-consistent iff there are no such that or . Furthermore, X is said to be maximally F-consistent iff (i) X is F-consistent and (ii) for all , the set is not F-consistent. Finally, is defined to be the set of all maximally F-consistent subsets of W.
Lemma 7. Let be an AD-frame. For every F-consistent set , there exists a maximally F-consistent set such that .
Proof. Let be the collection of all sets such that and S is F-consistent. Note that is non-empty (since ) and that is partially ordered by ⊆. We now prove that every non-empty chain in the poset has an upper bound in . Consider an arbitrary chain in . Clearly, is an upper bound for , so it suffices to show that , i.e., (i) and (ii) is F-consistent. Regarding (i), since is non-empty, there exists an , and hence . Regarding (ii), suppose, toward a reductio, that is not F-consistent. Hence, there exist such that or . Since , there exist such that and . Since is a chain, we have or . Consider the case (the case is analogous). It then follows that , and thus is not F-consistent, which contradicts .
We have now checked that every non-empty chain in the poset has an upper bound in . By Zorn’s lemma, it follows that has at least one maximal element, A. Since , we have, by definition of , that and that A is F-consistent. To show that , it thus suffices to check that A is maximal. Toward a reductio, assume that there exists some such that is F-consistent. Since and is F-consistent, we have . However, since , this contradicts the fact that A is a maximal element of . □
Lemma 8. The function is surjective.
Proof. Consider an arbitrary finite AD-frame . We will show that there exists an Aristotelian diagram D such that , i.e., such that F is modally isomorphic to . In order to specify D, we have to specify a logical system and a fragment . For our logical system, we take classical propositional logic, , while for , we take the image of W under the function f, which is defined below.
First of all, note that
, so since
F is finite, it follows that
is finite as well. Now fix an ordering on the elements of
:
. Secondly, we consider a partition
of
consisting of
formulas:
. The set
is called a “partition of
”, because its elements are mutually exclusive (i.e.,
for distinct
) and jointly exhaustive (i.e.,
). It is easy to see that such a partition
always exists; see below for a concrete example and see the proof of Theorem 4 in [
28] for a general construction method. For each
, we define
Now let
, and consider the Aristotelian diagram
D for
. (This can also be formulated using the well-known representation format of bitstrings [
24,
28,
41]: define
to be the bitstring that has a 1 in its
i bit position iff
.) We will show that
f is a modal isomorphism from
F to
.
We first check that f is bijective. Surjectivity follows immediately from the definition of f. For injectivity, consider arbitrary and suppose that . We will show that , by showing that there exists an such that iff . By the seriality of , there exist such that . Now consider the following case distinction:
The set is F-consistent.
By Lemma 7, there exists some such that . We have but (because is F-consistent, and ), as desired.
The set is F-consistent.
Analogously to the previous case, we can now show that there exists some such that but , as desired.
Neither nor is F-consistent.
We will show that this case cannot occur. Since is not F-consistent, we have or . If , then it would follow by the symmetry and functionality of that , which contradicts our assumption that . Hence, . Similarly, since is not F-consistent, it follows that . However, from and , it follows by condition 4(b) of Definition 5 that .
We now check that for all , we have iff . For the left to right direction, suppose that . By the definition of , it suffices to prove the following two claims:
.
Consider an arbitrary -model M and suppose, toward a contradiction, that . Since , there exists such that and . Similarly, since , there exists such that and . Since and are members of a partition of , it follows from and that . Hence, we have and , which, together with , contradicts the F-consistency of .
.
We have to show that , i.e., that . Since , it suffices to show that for each , we have or .
Consider some and suppose that . We will show that . Since is maximal, it follows from that is not F-consistent, i.e., there exist such that or . Now, if x and y both come from , then itself would be F-inconsistent. Furthermore, if x and y both come from , then , which contradicts the irreflexivity of and . So without loss of generality, we assume that and , and thus we have or .
In the first case, i.e., , it follows by the symmetry and functionality of that . Since , we thus have , as desired. We now turn to the second case: . Toward a contradiction, assume . Since is maximal, it follows that is not F-consistent, i.e., there exist such that or . Reasoning as before, we again assume, without loss of generality, that and , and thus we have or .
In the first subcase, i.e., , it follows by the symmetry and functionality of that . Since , we thus have , which contradicts our assumption that . In the second subcase, i.e., , it follows by condition 4(c) from Definition 5 that . However, since , this contradicts the F-consistency of .
For the right to left direction, suppose that and, toward a contradiction, that . We make the following case distinction:
.
Then the set is F-consistent, so by Lemma 7, there exists some such that . It follows from that . Hence, for all , if then . This is in contradiction with the fact that .
.
By the seriality of , there exist such that . Now, if , then by the symmetry and functionality of it would follow that and , so would mean that , and thus (by the symmetry of ) that , which contradicts . This shows that . Furthermore, it cannot happen that either, since together with and the symmetry of , that would contradict condition 4(b) of Definition 5. Since and , it follows that the set is F-consistent, and hence, by Lemma 7, there exists some such that . Note that , since together with and , this would violate the F-consistency of . Analogously, we show that . It follows from that , i.e., . Hence, for all , it holds that or . This contradicts the fact that .
In an analogous fashion, we can check that for all , we have iff . To summarize: f is a modal isomorphism from to the AD-frame that is based on the Aristotelian diagram D for . □
In order to better grasp the proof of Lemma 8, it may be useful to work through a concrete example. Consider the AD-frame
shown in
Figure 2a. An easy calculation yields
. Since
, we need to consider a partition
that consists of three formulas, for example
. Since
w only belongs to
, we define
. Since
v only belongs to
, we define
. Since
u belongs to
and
, we define
. Finally, since
s belongs to
and
, we define
. In this way, we obtain the Aristotelian diagram
D for
that is shown in
Figure 2b. Upon visual inspection of the left- and right-hand sides of
Figure 2, it should be immediately clear that
F is modally isomorphic to
.
We can now put everything together in Theorem 2. The importance of this theorem, and in particular, of Lemma 8, should not be underestimated. These results tell us that Definition 5 completely captures the structural properties of Aristotelian diagrams. More concretely, the various conditions stated in Definition 5 are sufficient to exhaustively describe the behavior of Aristotelian relations (consider how each of these properties played a role in the proof of Lemma 8!). Until now, there was a long list of various properties of Aristotelian relations (e.g., we know that subalternation is transitive), but it was unknown whether this list was exhaustive. In other words, it was conceptually possible that some complex property of the Aristotelian relations might exist that we did not yet know about, and that does not follow from the properties that are already known. Theorem 2 tells us definitively that no such properties exist.
Theorem 2. The function is bijective.
Proof. This follows immediately from Lemmas 6 and 8. □
To conclude this section, it is worth mentioning that the proof of Theorem 2 immediately yields another interesting result as well: every Aristotelian diagram for some finite fragment and some logical system is Aristotelian isomorphic to a diagram for a finite fragment and the logic (Lemma 9). This means that when we are studying Aristotelian diagrams, we can essentially restrict ourselves to diagrams for (fragments from) classical propositional logic.
Lemma 9. Consider an Aristotelian diagram for , where is finite. There exist a finite fragment and a diagram for such that is Aristotelian isomorphic to .
Proof. Consider the canonical frame that is based on . In the proof of Lemma 8, we constructed a finite fragment (namely, ) and an Aristotelian diagram for such that is modally isomorphic to . By the proof of Lemma 6, it follows that the Aristotelian diagrams and themselves are Aristotelian isomorphic. □
5. Characterizing Aristotelian Families in AD-Logic
In this section, we really start putting AD-logic to use, in order to capture some important insights from logical geometry. In particular, one of the main ongoing research efforts in logical geometry is to develop a systematic typology of
Aristotelian families (i.e., maximal classes of Aristotelian diagrams that are closed under Aristotelian isomorphism) [
24,
28,
42]. We will now prove frame correspondence results for several of the most important families of Aristotelian diagrams: for each such family
A, we exhibit a formula
, and show that
iff
D belongs to the Aristotelian family
A, for all diagrams
D. These frame correspondences do not refer to any underlying logical system, and thus provide fully generic descriptions of the Aristotelian families [
24] (recall the distinction between
general and
generic characterizations of Aristotelian families from
Section 1).
5.1. Expressing Frame Cardinalities
We start by mentioning an auxiliary result. Because our language is so expressive, it can be used to count the number of worlds in a (finite) AD-frame. We write “” to denote a sequence of n closing brackets.
Lemma 10. Consider an AD-frame . Then we have: | iff | | , |
| iff | |
|
| | |
|
| | |
|
| | | , |
and in general, for any even number n: | iff | |
|
| | |
|
| | |
|
| | | ⋮ |
| | |
|
| | | . |
The aforementioned -formula is henceforth abbreviated as , so iff . Proof. This is an easy exercise in frame correspondence theory for hybrid logic. The structure of the formula is clear: each variable gets bound to a world ; for all we have a formula of the form inside , which shows that ; finally, the subformula of means that are all the worlds in W, and thus . □
Note that the formula contains only the binder and the global modality, and hence, there is nothing specifically “(Aristotelian) diagrammatical” about (in particular, it does not contain or ). This is exactly as it should be, since merely describes the cardinality of the AD-frame F, which is (nearly) independent of F’s status as (based on) an Aristotelian diagram. (But do note that we define only for even numbers n, since finite Aristotelian diagrams/AD-frames have, by definition, an even number of formulas/worlds; see Definitions 2 and 5).
5.2. Characterizing the Aristotelian Family of Pairs of Contradictories
Within the typology of Aristotelian diagrams, there exists a unique smallest type of Aristotelian diagram, viz., the
pair of contradictories (PCD) (see
Figure 3). For a diagram for
to be a PCD, it suffices to require that
contains exactly two formulas; from the general Definition 2 of an Aristotelian diagram, it then follows that these two formulas are
-contradictory to each other (hence the term “pair of contradictories”). Consequently, it is also straightforward to characterize PCDs in AD-logic.
Definition 20. An Aristotelian diagram for is a pair of contradictories iff .
Theorem 3. Define . An Aristotelian diagram D for is a PCD iff .
Proof. For the left to right direction, suppose that D is a PCD and consider an arbitrary AD-frame that is based on D. By Definitions 13 and 20, it follows that , so by Lemma 10, we get , i.e., .
For the right to left direction, suppose that , and consider the canonical AD-frame . By Lemma 4, it follows that , i.e., , so by Lemma 10, we find that , which means that D is a PCD. □
Let us have another look at Definition 20 and Theorem 3. At first sight, it might look like the right-hand side of Definition 20 does not refer to any underlying logical system (and is thus fully generic in nature). However, recall that by Definition 2, it follows from the requirement that that there are such that , which does involve a reference to . Definition 20 is thus general but not generic in nature, because its right-hand side still (implicitly) refers to some underlying logical system (even though that system can be chosen arbitrarily). By contrast, the right-hand side of Theorem 3 does not refer to any underlying logical system whatsoever, and thus offers a fully generic characterization of the Aristotelian family of PCDs. (This contrast between official definitions and -characterizations will become even clearer when we are dealing with Aristotelian families of squares, hexagons, and octagons, because their official definitions refer much more explicitly to the underlying logical system ).
5.3. Characterizing the Two Aristotelian Families of Squares of Opposition
Within the typology of Aristotelian diagrams, there exist exactly two families of squares of opposition: the
classical square and the
degenerate square; see
Figure 4 [
24,
28,
42]. The classical square is by far the oldest and most well-known kind of Aristotelian diagram. In comparison to a classical square, a degenerate square has lost all Aristotelian relations, except for the two relations of contradiction (which are required to be present by the very definition of Aristotelian diagram). Both families of squares are straightforwardly characterized in AD-logic.
Definition 21. Consider an Aristotelian diagram D for . Then:
- 1.
D is a square of opposition iff ,
- 2.
D is a classical square of opposition iff and
there exist such that ,
- 3.
D is a degenerate square of opposition iff and
there do not exist such that .
Theorem 4. Define the -formulas , and . Then for any Aristotelian diagram D for , we have:1. | D is a square of opposition | iff | , |
2. | D is a classical square of opposition | iff | , |
3. | D is a degenerate square of opposition | iff | . |
Proof. Analogous to the proof of Theorem 3. For purposes of illustration, we prove the second item. For the left to right direction, suppose that D is a classical square of opposition, and consider an arbitrary AD-frame that is based on D. By Definitions 13 and 21, it follows that and that there exist such that . From , it follows by Lemma 10 that . From , it follows that . Hence, , i.e., .
For the right to left direction, suppose that and consider the canonical AD-frame . By Lemma 4, it follows that , i.e., . From , it follows by Lemma 10 that . From , it follows that there exist such that . By Definition 21, this means that D is a classical square of opposition. □
Just like with the PCDs, let us have another look at Definition 21 and Theorem 4. Note that items 2 and 3 of Definition 21 are general but not generic in nature, because their right-hand sides still refer to some underlying logical system (even though that system can be chosen arbitrarily). By contrast, the right-hand sides of items 2 and 3 of Theorem 4 do not refer to any underlying logical system whatsoever, and thus offer fully generic characterizations of the Aristotelian families of classical and degenerate squares. Entirely analogous remarks apply to the characterizations of Aristotelian families of hexagons and octagons in the remainder of this section (compare Definitions 22 and 23 with Theorems 5 and 6, respectively).
Next to characterizing the classical and degenerate squares, we can also easily express in AD-logic that these two types of diagrams jointly constitute an exhaustive classification of the squares of opposition. To illustrate the versatility of AD-logic, we prove the two parts of Lemma 11 in different fashions: the first item is proved directly “within” AD-logic, while the second is proved by reducing it to the corresponding result in logical geometry. (If we had only considered the proof’s difficulty and length, it would have been more straightforward to prove both items directly within AD-logic).
Lemma 11. The following hold:
- 1.
, i.e.,
and ,
- 2.
.
Proof. For item 1, propositional reasoning yields .
For item 2, consider an arbitrary AD-frame F and suppose, toward a contradiction, that . It follows that , so by Lemma 10, F is finite, and by Lemma 8, there exists an Aristotelian diagram D such that F is modally isomorphic to . Since modal isomorphisms preserve frame validity, it follows from that , and, by Lemma 5, that for every AD-frame that is based on D. By Definition 14, this means that , so by Theorem 4, it follows that D is a classical square of opposition. In exactly the same way, it follows from that D is a degenerate square of opposition. However, this is impossible, since by Definition 21, a diagram cannot simultaneously be a classical and a degenerate square of opposition. □
Note that the formula
contains the global modality, ⋄, both in its left conjunct
and in its right conjuct
. However, if one prefers to work without the global modality (e.g., because one adheres to the credo that modal logic should be
local; see [
40], p. ix) and is prepared to put the cardinality condition in the metalanguage instead of in
, then we can characterize classical squares of opposition without the global modality altogether. In particular, we have to replace
with the disjunction
. (Entirely analogous remarks apply to
).
Lemma 12. Consider an Aristotelian diagram D for and suppose that . Then D is a classical square of opposition iff .
Proof. For the left to right direction, suppose that D is a classical square of opposition, and consider an arbitrary AD-frame that is based on D. By Definitions 13 and 21, it follows that and that there exist such that (and thus also , by the symmetry of ). By the seriality and functionality of , there exist unique such that . By the irreflexivity of and , it follows that for all , so since , it follows that . Consider an arbitrary valuation V on F and assignment g on . Since , we have for , and for . Since , it follows that , as desired.
For the right to left direction, suppose that , and consider the canonical AD-frame . By Lemma 4, it follows that . Consider an arbitrary valuation V on and assignment g on , and a world . We thus have . On the one hand, if , then there exists such that . On the other hand, if , then there exists such that . By Definitions 1 and 2, there exist such that , and . In both cases, we find, by Definition 21, that D is a classical square of opposition. □
5.4. Characterizing the Five Aristotelian Families of the Hexagons of Opposition
One can show that there are exactly five families of hexagons of opposition: the Jacoby–Sesmat–Blanché (JSB) hexagon, the Sherwood–Czeżowski (SC) hexagon, and the unconnected-
n (U-
n) hexagon for
(see
Figure 5 and
Figure 6) [
42]. The JSB hexagons are named after Jacoby [
43], Sesmat [
44], and Blanché [
45], and the SC hexagons after William of Sherwood [
46] and Czeżowski [
47]. The U4, U8, and U12 hexagons are less well-known, and are named after the number of pairs of unconnected formulas that they contain. (Recall that two formulas are said to be
unconnected iff they do not stand in any Aristotelian relation to each other.) Once again, all families of hexagons are straightforwardly characterized in AD-logic.
Definition 22. Consider an Aristotelian diagram D for . Then:
- 1.
D is a hexagon of opposition iff ,
- 2.
D is a JSB hexagon iff and there exist such that
, and ,
- 3.
D is an SC hexagon iff and there exist such that
, , and ,
- 4.
D is a U4 hexagon iff and there exist such that and ,
and for all , if , then ,
- 5.
D is a U8 hexagon iff and there exist such that ,
and for all , if , then ,
- 6.
D is a U12 hexagon iff and there do not exist such that .
Theorem 5. Define the following -formulas:
- 1.
,
- 2.
,
- 3.
,
- 4.
,
- 5.
,
- 6.
.
Then for any Aristotelian diagram D for , we have:1. | D is a hexagon of opposition | iff | , |
2. | D is a JSB hexagon | iff | , |
3. | D is an SC hexagon | iff | , |
4. | D is a U4 hexagon | iff | , |
5. | D is a U8 hexagon | iff | , |
6. | D is a U12 hexagon | iff | . |
Proof. This is analogous to the proofs of Theorems 3 and 4. For purposes of illustration, we prove the third item. For the left to right direction, suppose that D is an SC hexagon, and consider an arbitrary AD-frame that is based on D. By Definitions 13 and 22, it follows that and that there exist such that and . If we let the state variables x, y and z get bound to the worlds w, , and v, respectively, then it is easy to check that . (Note that the world u does not get bound to any state variable; u is the world (i) that is -reachable from (the world bound to) z, and (ii) at which holds.) Furthermore, from it follows by Lemma 10 that . In sum, we find that .
For the right to left direction, suppose that , and consider the canonical AD-frame . By Lemma 4, it follows that , i.e., . From , it follows by Lemma 10 that . From , it follows that there exist such that , , , and . (Namely, , and are the worlds that get bound to the state variables x, y, and z, respectively; does not get bound to any state variable, but is the world (i) that is -reachable from (the world bound to) z and (ii) at which holds.) By Definition 22, this means that D is an SC hexagon. □
Just as before, for some of these characterizing formulas, we can get rid of the global modality, ⋄, if we are prepared to put the cardinality condition in the metalanguage instead of in . For example, in , we have to replace with the disjunctive formula .
Lemma 13. Consider an Aristotelian diagram D for and suppose that . Then D is a JSB hexagon iff .
Proof. This is analogous to the proof of Lemma 12. □
5.5. Characterizing Some Aristotelian Families of Octagons of Opposition
In the project of setting up a systematic typology of Aristotelian diagrams, it has been shown that there are exactly 18 families of octagons of opposition [
42]. In this paper, we will not deal with all these families, but rather focus on some of the most well-known ones: the Moretti–Pellissier octagons [
48,
49], the Lenzen octagons [
50,
51,
52], the Buridan octagons [
53,
54,
55], the Beziau octagons [
56,
57], and the Keynes–Johnson octagons [
58,
59] (see
Figure 7 and
Figure 8). Once again, these families of octagons (and also the other ones, wintroduced the notion of an Aristotelian diagramhich we do not deal with in this paper) can straightforwardly be characterized in AD-logic.
Definition 23. Consider an Aristotelian diagram D for . Then:
- 1.
D is an octagon of opposition iff ,
- 2.
D is a Moretti–Pellissier octagon iff and there exist such that
, , , , and ,
- 3.
D is a Lenzen octagon iff and there exist such that
, , , , , , , and ,
- 4.
D is a Buridan octagon iff and there exist such that
, , , , , and ,
and for all , if then ,
- 5.
D is a Beziau octagon iff and there exist such that
, , , , and ,
and for all , if then ,
- 6.
D is a Keynes–Johnson octagon iff and there exist such that
, , and ,
and for all , if , then .
Theorem 6. Define the following -formulas:
- 1.
,
- 2.
,
- 3.
,
- 4.
,
- 5.
,
- 6.
.
Then for any Aristotelian diagram D for , we have:1. | D is an octagon of opposition | iff | , |
2. | D is a Moretti–Pellissier octagon | iff | , |
3. | D is a Lenzen octagon | iff | , |
4. | D is a Buridan octagon | iff | , |
5. | D is a Beziau octagon | iff | , |
6. | D is a Keynes–Johnson octagon | iff | . |
Proof. This is analogous to the proofs of Theorems 3–5. For purposes of illustration, we prove the fifth item. For the left to right direction, suppose that D is a Beziau octagon, and consider an arbitrary AD-frame that is based on D. By Definitions 13 and 23, it follows that and that there exist such that and , and for all , if , then . If we let the state variables x, y and z get bound to the worlds w, s and v, respectively, then it is easy to check that . (Note that the worlds u and do not get bound to any state variables; these two worlds are -reachable from (the world bound to) z. At u, it holds that , and at , it holds that .) Furthermore, from , it follows by Lemma 10 that . In sum, we find that .
For the right to left direction, suppose that , and consider the canonical AD-frame . By Lemma 4, it follows that , i.e., . From , it follows by Lemma 10 that . From , it follows that there exist such that , , , , and , and that there are no other pairs of -contrary formulas in , apart from the aforementioned ones. (Namely, , and are the worlds that get bound to the state variables x, y, and z, respectively. The worlds and do not get bound to any state variables: these two worlds are -reachable from (the world bound to) z. At , it holds that , and at , it holds that .) By Definition 23, this means that D is a Beziau hexagon. □
6. Aristotelian Diagrams for AD-Logic
In
Section 2, we introduced the notion of an Aristotelian diagram for a fragment of formulas
, relative to some logical system
. For example, we can take
to be some system of modal logic, and thus study
Aristotelian diagrams for modal logic; for example, recall the classical square of opposition for
in
Figure 1b. Next, in
Section 3,
Section 4 and
Section 5, we presented the system of AD-logic. This gives rise to a modal consequence relation
, and can thus be viewed as the
modal logic of Aristotelian diagrams.
Already in the opening paragraphs of this paper, we drew a sharp distinction between these two approaches. More concretely, we distinguished the system of AD-logic from the underlying logic
of some Aristotelian diagram; recall the notational convention introduced at the end of
Section 3.1:
as metavariables over
versus
as metavariables over
. This distinction was, and still remains, important in order to avoid significant confusions. However, on a strictly technical level, there is no reason why we cannot take the underlying logic
of our Aristotelian diagrams to be the system of AD-logic itself. In light of the results of
Section 5, this might even be a very fruitful idea. In this way, we thus end up studying Aristotelian diagrams for AD-logic, or, putting together the two aforementioned italicized expressions:
Aristotelian diagrams for the modal logic of Aristotelian diagrams.
In this section, we briefly illustrate the viability and fruitfulness of this idea by means of a concrete example. Recall the formulas
that were introduced and studied in
Section 5. Since the language
is closed under negation, we also have
. Hence, we can define the following fragment of
-formulas:
We clearly have all the ingredients at our disposal for defining an Aristotelian diagram (see Definition 2). First of all, note that AD-logic is a logical system with Boolean operators and a model–theoretic semantics . Secondly, the fragment is non-empty, and moreover:
Lemma 14. The fragment satisfies the following properties:
- 1.
for every , there is a such that ,
- 2.
for every , there is no such that ,
- 3.
there is no such that or .
Proof. Item 1 follows trivially from the definition of . For item 2, we show that and are not AD-equivalent to each other (all other cases are analogous). Consider some degenerate square of opposition D. By Definition 21, D is a square of opposition but not a classical square. By Theorem 4, it follows that and . Hence, there exists an AD-frame F that is based on D such that and .
For item 3, we show that is AD-contingent, i.e., and (all other cases are analogous). Consider some degenerate square D and some classical square . By Definition 21 and Theorem 4, it follows that and . From , it follows that there exists an AD-frame F that is based on D such that (and thus ). From , it follows that (and thus ). □
It remains to be determined what the Aristotelian diagram for looks like exactly, i.e., which Aristotelian family it belongs to. This is straightforward to check:
Lemma 15. The following hold:
- 1.
,
- 2.
,
- 3.
,
- 4.
,
- 5.
,
- 6.
.
Proof. These are all easy exercises in reasoning about AD-frames. Note that item 1 has already been proved above, as item 2 of Lemma 11. □
Items 1–2 of Lemma 15 state that
and
are contrary to each other in AD-logic (see Definition 1), which we abbreviate as
. Completely analogously, items 3–4 state that
, and items 5–6 state that
. Since
, it follows by Definition 22 that the Aristotelian diagram for
is a JSB hexagon, as shown in
Figure 9.
We finish this section by making three comments about this JSB hexagon for AD-logic, in increasing order of conceptual importance. First of all, recall that it is common in logical geometry to distinguish two subtypes of JSB hexagons, viz., strong and weak [
28]. In general, given a JSB hexagon
H for
with
,
and
, we say that
H is a
strong JSB hexagon iff
and that
H is a
weak JSB hexagon iff
. With this definition in mind, it is easy to check that the hexagon for
, as shown in
Figure 9, is a
strong JSB hexagon, since it follows from Lemma 11 that
.
Secondly, next to the standard research on Aristotelian diagrams for (formulas coming from) the
object language of some logical system
, there is also a flourishing tradition of studying Aristotelian diagrams for
metalogical notions [
25,
60,
61,
62,
63,
64,
65]. For example, given a logic
, we can say that the Aristotelian relations of
-contrariety and
-subcontrariety are themselves contrary to each other (since two
-formulas cannot simultaneously be
-contraries and
-subcontraries, while they can be neither
-contraries nor
-subcontraries, e.g., when they are
-contradictories). From this perspective, the JSB hexagon for
, as shown in
Figure 9, is simultaneously an object-level and a meta-level Aristotelian diagram. On the one hand, this hexagon is situated at the object level, because it is concerned with formulas such as
and
, which come from the object language
of AD-logic. On the other hand, this hexagon is, at the same time, situated at the meta-level as well, because of the correspondence results established in
Section 5. For example,
means that the Aristotelian families of classical and degenerate squares are contrary to each other (since an Aristotelian diagram cannot simultaneously be a classical and a degenerate square, while it can be neither a classical nor a degenerate square, e.g., when it is a PCD).
Finally, and perhaps most subtly, note that because of their specific setup, Aristotelian diagrams for AD-logic can simultaneously contain and validate a given formula. Concretely, let us write
D to refer to the JSB hexagon in
Figure 9, and consider the formula
. On the one hand, we have
and, thus, the formula
occurs insideD when
D is viewed as an Aristotelian diagram for
. On the other hand,
D is a JSB hexagon, and thus not a square, which means that
, i.e., the formula
is
validated byD, when
D is viewed as an AD-frame. It bears emphasizing that there is nothing paradoxical or inconsistent about this situation. Nevertheless, we do acknowledge that this can quickly get very confusing, and that is precisely the reason why we postponed discussing Aristotelian diagrams for AD-logic until this section, at the very end of the paper.
7. Conclusions
In this paper, we developed the system of AD-logic, i.e., the (hybrid) modal logic of Aristotelian diagrams. We established a sound and strongly complete axiomatization for AD-logic, and proved that there exists a bijection between finite Aristotelian diagrams (up to Aristotelian isomorphism) and finite AD-frames (up to modal isomorphism). This means that the properties of AD-frames are sufficient to derive all structural properties of Aristotelian diagrams. We then showed how AD-logic can capture several major results from logical geometry; for example, for every well-known Aristotelian family A, we exhibited a formula and showed that an Aristotelian diagram D belongs to the family A iff is validated by D (when the latter is viewed as an AD-frame). These correspondence results do not refer to any underlying logical system, and thus offer fully generic characterizations of the Aristotelian families. Finally, we showed that AD-logic gives rise to interesting new Aristotelian diagrams, and we reflected on their profoundly peculiar status.
There are many avenues for future research. For example, continuing along the lines of
Section 6, we can investigate further Aristotelian diagrams for AD-logic. In the context of
Section 5, we can continue to formalize significant portions of logical geometry in AD-logic. For example, we can define characterizing formulas
for less well-known Aristotelian families
A. More fundamentally, in logical geometry, we not only consider Aristotelian (families of) diagrams in isolation of each other, but also study how they are related to each other. For example, there is work on the notions of
subdiagrams (e.g., in each JSB hexagon, three classical squares of opposition can be embedded as subdiagrams) and
complementarity (e.g., a JSB hexagon is complementary to a Buridan octagon, relative to an Aristotelian rhombic dodecahedron) [
66,
67,
68]. Is AD-logic sufficiently expressive to formalize such relational claims as well?
Finally, we also need to consider what
cannot be expressed in AD-logic. In particular, in the development of a systematic typology of Aristotelian diagrams, it has become clear that many (though not all) Aristotelian families can be divided into
Boolean subfamilies; for example, there are two Boolean subfamilies of JSB hexagons (usually called “strong” and “weak”; we briefly encountered these in
Section 6), there are three Boolean subfamilies of Buridan octagons, etc. These Boolean subfamilies
cannot be expressed in AD-logic; for example, there do not exist any formulas
such that
D is a strong JSB hexagon iff
and
D is a weak JSB hexagon iff
. At first sight, this might appear to be a disadvantage or a limitation of AD-logic. However, one could also argue that this is exactly what we want, in order to keep AD-logic fully generic, i.e., independent of any underlying logic
. For example, whether a JSB hexagon for
is strong or weak fundamentally depends on the underlying logic
; from the perspective of AD-logic, we do not (want to) have “access” to the entire logic
, but only to the Aristotelian relations
and
that it gives rise to (via the abstract relations
and
in the AD-frames). It is clear that more thorough reflection is needed regarding this subtle issue.