1. Introduction
Lakatos’ monumental play ‘Proofs and Refutations’ (see [
1]) can be considered a demonstration of applying the proof–refutation (or conjecture–refutation) method as a practical realization of the falsificationism concept advocated and supported at that time, among other authors, by [
2]. At the same time, the concept of proving–refuting–improving, demonstrated in the same play, can be used as an effective interactive class model.
Refutation, as an isolated process, plays an extremely important role in the development of a pupil’s critical thinking and has a crucial place in each study program syllabus. We deem that examples of finding and treating incorrectness in some reasoning and argumentation are at least of equal didactic importance as those with correct derivations and proofs. Such examples present and help incite critical thinking.
First, let us explain in brief what we mean under the term ‘extrapolation’. As we know, interpolation deals with finding statements C and D, which are in between A and B, when ; i.e., ‘A implies B’, is provable, meaning that all three sequents , and are provable. In this case, the sequent presents an interpolant for . On the other side, if is refutable, i.e., , then we are looking for two statements C and D, such that , and are all provable; in this case, the sequent will be an extrapolant for .
In this paper, we extend the proving–refuting method by its immediate result—improving—and place it in a wider logical context relating it with the well-known concept of interpolation, with a new concept, extrapolation, as its dual. Both these notions, extrapolation and interpolation, are closely connected with many aspects of abductive reasoning [
3]. The improving process, based deeply on the extrapolation method, is presented through several examples. Let me repeat here that once, a long time ago, my teacher Aleksandar Kron told me: ‘Oh, how many times I fell asleep with a proof, and woke up with a counterexample’. This was the essence of the proving–refuting–improving process, during the daily journey of any scientist from a conjecture to the truth (see [
4]). This process, consisting of proving and refuting attempts producing an improvement of the starting conjecture, is presented formally as an methodological procedure for discovering better statements. In fact, this can be considered a kind of Hegelian–Marxist dialectic scheme: thesis–antithesis–synthesis. However, the crucial cognition is that the essential step of this procedure is based on extrapolation, which is a dual to the well-known logical feature of reasoning—the interpolation property. We introduce the notion of extrapolation as a counterpart of interpolation. We do this in general form, independently of the basic logic. Namely, our definition depends neither on language—we do not use connectives—nor on logic—we suppose that our deduction relation is not necessarily linked to classical logic. Pure propositional logics open the problem of existence of a minimal extrapolant, which seems particularly interesting in case of infinitely valued systems.
2. Interpolation and Extrapolation—A General Idea
A typical form of a scientific statement is that ‘B follows from A’, denoted by , expressing a causal relationship between A and B. Refutation of such a statement consists of argumentation presenting at least one example (interpretation) where A is satisfied, but B is not.
The turnstyle symbol will be used in an informal way, not connected to any particular logical system, but assuming its rudimentary structural properties such as identity (), weakening ( implies ), permutation ( implies ), contraction ( implies ) and transitivity ( and imply ).
Establishing a statement as a conjecture means that we believe that holds, but also that this is partly under question; does ? In order to obtain a final conclusion regarding the truthfulness of our conjecture, we try to prove and to refute it. This process implies finding examples supporting and counterexamples refuting , as well as looking for similar statements that are, by their nature, weaker than in cases when is refutable, and stronger than in cases when is provable.
Let us consider the two apparently simplest cases of causal connection: (i)
is not proven and (ii)
is proven, where
A and
B are two arbitrary sentences. In the second case (ii), we can assert that there are two propositions
C and
D such that the following statements are provable:
,
and
. If
C and
D are logically equivalent, then we recognize here a form of the well-known Craig interpolation theorem (see [
5]), pointing out that the form presented here can be considered as its slight generalization. In a similar way, we will deal with the first case (i) and suppose that there are two propositions
C and
D such that the following statements are provable:
,
and
, obtaining a form that is somehow dual to interpolation (ii) and which could be treated as a kind of extrapolation.
We point out that the term ‘duality’ is used here in a quite different meaning than in classical two-valued logic. For each statement of the form , provable or unprovable, we consider a provable statement . If and are provable, then is called an interpolant, while when and are provable, then is called an extrapolant. Consequently, C and D as parts of an interpolant are in consequent of A and antecedent of B, respectively, but as parts of an extrapolant, they have ‘dually’ just the opposite roles; C is in antecedent of A and D is in consequent of B.
More accurately, if we suppose that is any statement, provable or not, then (i) is its extrapolant if all statements , and are provable; (ii) is its interpolant if all statements , and are provable. We omit here more formal details such as variable sharing and the context of a particular logical system for the deduction relation.
Note that in the case that an interpolant exists, the original statement is provable. However, in the case that an extrapolant exists, we can conclude nothing regarding the provability of . The most interesting cases in the sequel of this paper will be exactly those (i) when , i.e., is unprovable. The challenges before us here are how to find some ‘good’ extrapolants for and (ii) when is provable, how to find its ‘good’ interpolants. This is because in both these cases, the statement should present an improvement of , which will be explained below.
The term ‘interpolation’ is justified by the simple fact that we insert a new statement in between A and B, with an obvious possibility to infer from , and . Similarly, the extrapolation process involves looking for a statement C ‘before’ A, because , and a statement D ‘after’ B, because . Both requirements, interpolation and extrapolation, have some trivial solutions. If is proven, then both forms and present possible interpolants. Furthermore, for any , all statements , and present its extrapolants, where we use the symbols ⊤ and ⊥, respectively, to denote truth and absurdity constants. Later, after sharpening both notions, extrapolation and interpolation, following the spirit of Craig’s interpolation theorem and practical applications of extrapolation, we will see that trivial solutions have no importance (as usual).
Example 1. (Lakatos’ Proofs and Refutations).
In his famous work, by giving a picturesque presentation of the proving–refuting process, Lakatos (see [1]) begins with an incorrect and refutable formulation of Euler’s Polyhedral Theorem. The dialog between a teacher and his pupils starts with the teacher’s provocation: “In our last lesson we arrived at a conjecture concerning polyhedra, namely, that for all polyhedra , where V is the number of vertices, E the number of edges and F the number of faces. We tested it by various methods. But we have not yet proven it. Has anybody found a proof?” After that, through a few iterations, the teacher, together with his pupils, by using a proving–refuting–improving method, obtains and proves the correct form of Euler’s Polyhedral Theorem: for all convex polyhedra, . Example 2. (Elementary Geometry). Let denote any right triangle with sides , where c is its hypothenuse, and denotes any triangle with sides . Some of the known elementary geometric facts can be formulated by means of a deduction relation as follows:
Triangle Inequality: .
Pythagorean Theorem: and .
We also have two obvious facts: ; i.e., each right triangle is a triangle and, in elementary algebra, for any positive reals (see [6]). In order to illustrate the extrapolation phenomenon in this context, we consider the following negative statement:By the extrapolation approach, bearing in mind that and , we can infer the following statements: , and , as possible extrapolants. Deeming the proving–refuting–improving process one of the most important methods of knowledge growth, the author of this text, with a group of his brilliant students (Aleksanra Djoković, Bojana Tujković, Ivana Čekrdžić, Aleksandar Elezović, Doroteja Djordjević and Milan Perić), during the spring semester 2014, set up a musical performance under the title ‘Proofs and refutations: devoted to the glorious triangle’, at the Faculty of Economics, University of Belgrade. That performance was deeply inspired by [1] but, for the sake of better understanding the basic message, instead of Euler’s Polyhedral Theorem, considered in the original Lakatos’ play, we dealt with proofs and refutations of the Pythagorean Theorem. Example 3. (Propositional Calculus). Here, we present some more subtle examples of interpolants and extrapolants. Let ∧ and ∨ denote the conjunction and disjunction connectives, respectively. (i) The form is unprovable, i.e., , and it can be improved by the following forms: , , , and ; this is not a complete list of its extrapolants. (ii) The form is provable and it can be improved by the following interpolants: , , and ; this is not a complete list of its interpolants. Let us note that the examples of extrapolants and interpolants considered here are compatible not only with classical, but also with many non-classical propositional logics.
Example 4. (Set-Theoretic Interpretation). Due to the immediate link between the set-theoretic inclusion relation and the classical implication connective, the interpolation and extrapolation have a rough illustrative and a quite simple set-theoretic interpretation. Namely, if for two sets A and B we have , then the sets C and D, such that , can be considered as the basic constituents of an interpolant for . On the other side, if , then the sets C and D, such that , and , define an extrapolant for .
Example 5. (Impossibility Tradition).
In spite of their great methodological and logical importance (see [7,8]), impossibility theorems raise a natural question: how they can be transformed into the corresponding relevant possibility results? Each such transformation is based on some proving–refuting–improving process that starts with the improving, i.e., with an extrapolation step. Let us discuss two simple cases of impossibility theorems.Incommensurability of the diagonal and the side of a square: if a is the side of a square and d is its diagonal, then , i.e., , where is the set of natural numbers. By replacing (weakening) with , bearing in mind that where is the set of rational numbers, we also obtain by extrapolation an invalid statement . The next iteration is finding an appropriate extrapolant for the statement . Obviously, by replacing with , bearing in mind that where is the set of reals, we obtain a valid positive statement , i.e., a possibility result.
Unsolvability of the equation , , in the field of reals: leads to two simple positive possibilities. By antecedent weakening, from , we obtain , or, by consequent weakening, from where is the set of complex numbers, we have , i.e., solvability of that equation in the field of complex numbers.
In a similar way, but with more complex argumentation and context, Arrow’s Impossibility Theorem (see [7,8,9]), the most popular and important result in Social Choice Theory during the last century, has generated a number of possibility results. Variations of the corresponding possibility theorems (see [10]), obtained by weakening the antecedent or consequent of Arrow’s original theorem, can be considered as effective examples of applying the proving–refuting–improving process as well. Here, we will explain why we do believe that both interpolants and extrapolants present improvements of our initial statement . (i) If is not provable, then its extrapolant , which is provable, obtained from an unprovable statement, presents its improvement, bearing in mind that from the initial statement of low quality (unprovable), we obtain its extrapolant , a statement of higher quality (provable). (ii) If is provable, then its interpolant , which is provable together with and , can be used as a sufficient condition to infer the initial statement , and from this point of view it can be considered as its essence—its improvement—enabling us to prove and better understand the meaning of the initial statement .
3. Extrapolation—More Formally
Let us discuss a more subtle aspect of extrapolation including some views of relevance logic. A deduction of
B from hypothesis
A is acceptable relevance logic if this deduction employs every element of
A. Another syntactic relevance principle, known as
variable sharing condition, is that if
A entails
B, then
, where
denotes the set of all atomic formulae, i.e., propositional letters, occurring in
A (see [
11]). Variable sharing is not sufficient, but it is a necessary condition for relevance.
Now, we can formulate more ambitious expectations, including some kind of variable sharing principle.
Interpolation property: If and , then there exist C and D such that , , and .
Extrapolation property: If and , then there exist C and D such that , , and .
The interpolation property is defined in accordance with Craig’s well-known approach (see [
5]). The extrapolation property tends to find relevant, non-trivial and, in some sense, minimal statements
C and
D establishing an extrapolant.
Let us note here that Craig’s original definition deals with only one formula C, such that and , as an interpolant for . In this spirit, it would be possible to redefine our notion of extrapolant C for so that and . It is not difficult to see that this approach with one formula playing the role of interpolant (or extrapolant) is logically equivalent to our definition employing two formulae in both cases.
The logical, methodological, philosophical and, even algebraic aspects of interpolation have been analyzed, discussed and explained in detail as a necessary part of most textbooks in logic (see [
12,
13]). Here, we will attempt to elucidate the logical sense of extrapolation. Bearing in mind the following derivation:
the extrapolation can be considered to be a weakening of the antecedent and the consequent of
, respectively, by special statements
C and
D, such that
and
(Instead of
, we will write simply
, which, according to the traditional classical logic proof-theoretic interpretation, can be understood as
). The procedure will be satisfiable when, from an unprovable statement, we obtain a provable one, i.e., when, in fact, from
, we obtain
, where
C and
D are in the corresponding causal connections with
A and
B, respectively. In practice, when we search for an adequate statement, instead of reasoning starting with the explicit application of weakening rules, as above, the pure derivation with the cut rules
hides the presence of weakening. On the other side, we have to emphasize that it would be wrong to understand the extrapolation just as a simple weakening, because it is a very restricted and specific weakening in order to find the relevant extrapolant.
Extrapolation is formally, in the context of classical logic, equivalent to the left and the right side weakening rules, bearing in mind the following derivations
Nevertheless, the extrapolation, as defined, seems more restrictive and suggests some kind of ‘relevant’ weakening. Namely, the above two derivations are classically, and even intuitionistically, admissible, but not from the point of view of relevance logic. This is the reason why the extrapolation can be essentially considered as a process partly supported by relevant logic principles, bearing in mind that variable sharing conditions for C with A and B with D are satisfied, but not necessary for C with D.
In case of an unprovable statement , when we look for some of its improvements , in order to avoid trivial solutions and to find the best one, if possible, we define the notion of minimal sentences:
Minimal extrapolants: Suppose is not proven and is its extrapolant. C will be called a minimal sentence for A, B and D, in this order, if for each , such that is provable and is unprovable, one of the statements or is unprovable. In a dual way, D will be called a minimal sentence for A, B and C, in this order, if for each , such that is unprovable and is provable, one of the statements or is unprovable. In cases when both hold, C is a minimal sentence for A, B and D, and D is a minimal sentence for A, B and C; then, the statement is called a minimal extrapolant for .
The central question now is the following one: does a minimal extrapolant exist (and when)? It depends on the logical context, clearly. For instance, in m-valued propositional logics, due to the existence of finitely many nonequivalent formulae over any finite set of atomic formulae (propositional letters), we always have the possibility to choose the minimal sentences. The next question is: does the minimal nontrivial extrapolant exist (and when)? Moreover, how could a nontrivial statement be characterized?
Example 6. Obviously, for any two sentences A and B, such that and , the statement presents an extrapolant. This is a trivial example.
Example 7. Let us consider again some extrapolants , , and of the statement . In the case of extrapolant , the statement is not minimal for , and because there is a statement, p such that , and both and are provable. On the other hand, the statement p is a minimal one for , and , and this is a way to find a new and ‘better’ extrapolant . In the case of this extrapolant , although p is a minimal for , and , the proposition is not minimal for , and p because, for the statement p, we have that and both and are provable, while p is a minimal statement for , and p. Let us note also that the examples considered here have a general character and are compatible with both classical and intuitionistic propositional logics.
Example 8. In set-theoretic interpretation, when , the parts of minimal extrapolants will be the sets in between and with respect to the inclusion relation. In general, will be a minimal extrapolant for , and , and will be a minimal extrapolant for , and .
4. More Examples
The importance of propositional language is founded, inter alia, on its simplicity. Propositional context is usually suitable for explaining and understanding the differences between various philosophical concepts for the foundations of mathematics. For instance, the spirit of essential variations between Platonism, intuitionism and relevance is already visible on the level of classical, intuitionistic and relevant propositional logics. On the other side, the founding of any serious mathematical theory needs much more than a propositional language. Here, we will try to present the idea of extrapolation in the context of the first-order predicate language.
The general symbolic form of an Impossibility Theorem stating that ‘there does not exist an object
x such that
A implies
B’, is
The first-order sentence
can be presented in a classically equivalent way as
, or a bit more informally as “
does not imply
”, i.e.,
. Here, we want to describe an application of extrapolation method on
Namely, we are looking for sentences C and D such that , and , where the last statement presents an extrapolant and, simultaneously, a transformation of an ‘impossibility’ result into a ‘possibility’ one.
On the level of general first-order languages examples, we analyze an ‘impossibility case’.
Example 9. Let us consider the following statement: , having exactly the form of an impossibility theorem. If we try to weaken the antecedent by (1) or by (2) , and the consequent by (3) or by (4) , we do not obtain extrapolants by combining (1) with (3), (1) with (4) or (2) with (3); only the combination (2) with (4) gives an extrapolant, because , , and .
We also consider some relationships between binary relations properties.
Example 10. The logic of preferences is usually based on axioms concerning some properties of a binary relation P, called a preference relation. For instance, the list of axioms contains irreflexivity (Ir), , asymmetry (As), , transitivity (Tr), and connectivity (Cn), . It is an easy exercise to show that , but, bearing in mind that , and , we conclude that presents an extrapolant and an improvement of the initial statement. In a similar way, we can find that the same statement, is an extrapolant for both and .
5. A Proving–Refuting–Improving Procedure
Each theorem, or more generally, each scientific statement, can be expressed in the following form: if Γ, then Δ. presents a set of hypotheses (given context or a theory) and is a consequence (conclusion). This is the reason why the basic form we use in this part of the paper is , an informal deduction relation (entailment) ⊢ between two finite sets of sentences (antecedent) and (consequent), with the intended meaning that it is possible to infer a conclusion , interpreted as a disjunction of all elements of , from the hypotheses set , interpreted as a conjunction of all elements of . The Greek capitals , with or without subscripts or superscripts, will be used as metavariables over finite sets of sentences denoted by Latin capitals We also use with the usual model theoretic, meaning that if all elements of are true, then at least one element of is true. This will be the context enabling us to express that , or , is provable or unprovable, and that , or , is refutable or irrefutable.
The idea of a proving–refuting–improving procedure has been hinted at by [
4]. Here, we will develop it further. In both cases when
is provable or unprovable, i.e., when
is valid or refutable, we define the following four sets:
-antecedent,
-consequent,
-antecedent and
-consequent, respectively, as
,
,
and
, corresponding to the sets
and
, such that, for each
i,
and
are provable, and for each
j,
and
are provable.
The main problem here is to define concrete content of sets , , and in this general case, because the condition that is provable has infinitely many solutions for . On the other hand, each particular problem in some specific part of mathematics gives the researcher a freedom to use his intuition during the process of ‘looking for a better theorem’.
The two elementary steps in our proving–refuting–improving procedure as follows:
Step (i): if is not proven or is refuted, we are looking for some or some for which the provability of can be reconsidered, where is obtained from by substituting at least one occurrence of by in or at least one occurrence of by in ;
Step (ii): if is proven, or is not refuted, we are looking for some or for which the provability of can be reconsidered, where is obtained from by substituting at least one occurrence of by in or at least one occurrence of by in .
In both cases (i) and (ii), the result will be a statement . If is provable, then the procedure can be stopped and will present a generalized extrapolant or interpolant for in cases (i) and (ii), respectively. Otherwise, if we cannot decide if is provable or if is refutable, then we proceed with step (i) on .
Finally, in the sequel, we apply the same procedure on ; i.e., firstly, we try to prove or to falsify . If is not proven or is falsifiable, then we apply the procedure (i) on in order to obtain a new statement . If is provable or is not refuted, then we apply the procedure (ii) on in order to obtain a new statement . This process is called the proving–refuting– improving procedure.
Let us point out that a similar form of a generalized interpolant appears in S. Maehara’s approach to interpolation in the context of sequent calculi (see [
13]).
Note that the sentence ‘ is not proven’ does not exclude the case that can be provable, and sentence ‘ is not refuted’ does not exclude the case that can be refutable. Namely, if some fact is not proven, maybe, in the future, it could be proven, and if some fact has not been refuted up to now, it could be refuted later.
The above procedure, part (i), proving–refuting–improving, was based on methodological ideas promoted by Popper–Lakatos’ proof–refutation (also known as conjecture–refutation) falsificationism (see [
1,
2]). Furthermore, the transformation of
into
, generally, can be considered a kind of Hegelian–Marxist dialectic scheme: thesis–antithesis–synthesis, which is obviously parallel with our scheme consisting of (i) and (ii), defining the process of proving–refuting–improving.
The statement presents an improvement of in case (i), in a sense that from an unprovable statement , we obtain a statement , which may be provable; but if is provable, then is provable as well. On the other hand, the statement presents an improvement of in case (ii), in the sense that from a provable statement , we obtain a provable statement from which can be derived; i.e., is more general than . These are the reasons to treat as an improvement of in both cases. This also means that any possible application of our procedure to a provable statement cannot produce an unprovable statement.
If reconsideration of provides a statement , consisting of some new elements of , then, obviously, presents an improvement of . More accurately, we can justify our procedure by some kind of soundness statement:
Theorem 1. - (i)
If the statement is obtained from by applying step (i), then can be inferred from ;
- (ii)
If the statement is obtained from by applying step (ii), then can be inferred from .
Proof. By induction on —the number of statements belonging to : in case (i), from both, and , and and , by the hypothetical syllogism rule, we can infer . In case (ii), from both pairs, and , and from and , by the hypothetical syllogism rule, we can infer . □
Let us note that in the particular case when
is true or when
is a false statement, applying step (i) of our procedure produces the effects of enthymematic reasoning (see [
14]).
A rare and unexpected case, which is not covered by (i) and (ii), is when the statement is undecidable, i.e., the case when it is possible to show that is neither provable nor refutable. Such examples are connected with highly formalized concepts and will not be our focus.
This procedure can be considered a sequence of consecutive attempts to falsify a statement and then to save it as a supplementary conjecture or to give it a new semantic interpretation. In this way, a progressive improvement of the initial claim is enabled.
In order to visualize the transformation process of
into
with the help of
,
,
and
, we give a
-presentation of relationships between elements of
and
, with or without subscripts or superscripts:
where, for instance, the first column
of this
-presentation means that both
and
are provable. Consequently, by some replacements of
with
or with
,
, and some replacements of
with
or with
,
, we obtain this new form
. The symbol ‘
’, appearing above, stands for ‘
’ or ‘⊢’.
6. Concluding Remarks
An unproven statement of hypothetical character, a conjecture, is usually treated in one of the following two ways: we try to prove it, or we try to refute it. Then, for a proven statement, we try to find its interpolants, in order to simplify its proof and to better understand the nature of its proof, but for a refuted, i.e., unprovable, statement, we look for its extrapolants, trying to find a similar and relevant but provable statement.
Briefly, if we start with a statement of the form , then we have, syntactically, two possibilities to obtain from a better statement: if is unproven, we will look for its extrapolant presenting a provable statement relevant for , but if is proven, then we will find its interpolant relevant for , better explaining the nature of . Namely, the basic principle respected in the process of transforming into a ‘better statement’ is that all side statements occurring in derivations, such as and , are provable, except the principal statement , which can be, but does not have to be, provable, and that each step in the considered derivation is made strictly in accordance with the sound logical inference rules.
In working versions of this paper, we used the term ‘algorithm’ for the proving–refuting–improving process, but later we accepted the term ‘procedure’ as the appropriate one. Namely, it is not clear if the step transforming into is well defined, in the sense that we do not know if the problem of provability of both and is decidable.
Finally, let us note that while the phenomenon of interpolation is usually treated as a property of an axiomatic theory or a logical system, because even some natural propositional logics do not possess it (see [
15]), extrapolation, although observed as a dual to interpolation, presents essentially a method of transforming an unprovable statement
into a ‘similar’, but provable one:
.
We also point out that if there is a grain of suspicion that a counterexample to our conjecture exists, it will be of great didactic importance in developing and stirring the critical reasoning of students and researchers. This has to find a central place in all study programs as a basic goal of education, together with stimulating creative thinking.