1. Introduction
Decision tables (sometimes represented as datasets or finite information systems with a distinguished decision attribute) appear in data analysis [
1,
2,
3,
4,
5,
6] and in such areas as combinatorial optimization, computational geometry, and fault diagnosis, where they are used to represent and explore problems [
7,
8].
Decision trees [
1,
5,
6,
7,
9] and decision rule systems [
2,
3,
4,
8,
10,
11,
12] are widely used as classifiers, as a means for knowledge representation, and as algorithms for solving various problems of combinatorial optimization, fault diagnosis, etc. Decision trees and rules are among the most interpretable models in data analysis [
13].
In this paper, we consider classes of conventional decision tables closed under the removal of columns and changing of decisions. The most natural examples of such classes are closed classes of decision tables generated by information systems (see
Section 3.4 for an explanation). We study the dependence of the minimum complexity of deterministic and nondeterministic decision trees on the complexity of the set of attributes attached to columns of the decision table. We also study the dependence of the minimum complexity of deterministic decision trees on the minimum complexity of nondeterministic decision trees. Note that the nondeterministic decision trees can be considered as representations of systems of true decision rules that cover all rows of decision tables. Note also that the depth of deterministic and nondeterministic decision trees for computation Boolean functions was studied quite intensively [
14,
15,
16,
17].
This paper continues the study of closed classes of decision tables that began with work [
18] and continued with works [
19,
20]. To the best of our knowledge, there are no other papers that study the closed classes of decision tables.
Various classes of objects that are closed under different operations are intensively studied. Among them, in particular, are classes of Boolean functions closed under the operation of superposition [
21] and minor-closed classes of graphs [
22]. Decision tables represent an interesting and important mathematical object deserving of mathematical research, in particular, the study of closed classes of decision tables.
In [
18], we studied the dependence of the minimum depth of deterministic decision trees and the depth of deterministic decision trees constructed by a greedy algorithm on the number of attributes (columns) for conventional decision tables from classes closed under operations of removal of columns and changing of decisions.
In [
19], we considered classes of decision tables with many-valued decisions closed under operations of removal of columns, changing of decisions, permutation of columns, and duplication of columns. We studied relationships among three parameters of these tables: the complexity of a decision table (if we consider the depth of decision trees, then the complexity of a decision table is the number of columns in it), the minimum complexity of a deterministic decision tree, and the minimum complexity of a nondeterministic decision tree. We considered a rough classification of functions characterizing relationships and enumerated all possible seven types of relationships.
In [
20], we considered classes of decision tables with 0–1 decisions (each row is labeled with the decision 0 or the decision 1) closed relative to the removal of attributes (columns) and changing decisions assigned to rows. For tables from an arbitrary closed class, we studied the dependence of the minimum complexity of deterministic decision trees on various parameters of the tables: the minimum complexity of a test, the complexity of the set of attributes attached to columns, and the minimum complexity of a strongly nondeterministic decision tree. We also studied the dependence of the minimum complexity of strongly nondeterministic decision trees on the complexity of the set of attributes attached to columns. Note that a strongly nondeterministic decision tree can be interpreted as a set of true decision rules that covers all rows labeled with the decision 1.
In the previous papers, we did not consider in detail conventional decision tables in which rows are labeled with arbitrary decisions. These tables differ significantly from tables with many-valued decisions and from tables with 0–1 decisions considered previously. We now describe the results obtained in the present paper. Let A be a class of conventional decision tables closed under the removal of columns and changing of decisions, and let be a bounded complexity measure. In this paper, we study three functions: , and .
The function
characterizes the growth in the worst case of the minimum complexity of a deterministic decision tree for a decision table from
A with the growth of the complexity of the set of attributes attached to columns of the table. We prove that the function
is either bounded from above by a constant or grows as a logarithm of
n, or it grows almost linearly depending on
n (it is bounded from above by
n and is equal to
n for infinitely many
n). These results are generalizations of results obtained in [
20] for closed classes of decision tables with 0–1 decisions.
The function characterizes the growth in the worst case of the minimum complexity of a nondeterministic decision tree for a decision table from A with the growth of the complexity of the set of attributes attached to columns of the table. We prove that the function is either bounded from above by a constant or grows almost linearly depending on n (it is bounded from above by n and is equal to n for infinitely many n).
The function characterizes the growth in the worst case of the minimum complexity of a deterministic decision tree for a decision table from A with the growth of the minimum complexity of a nondeterministic decision tree for the table. This function is either not everywhere defined or is everywhere defined. Let be everywhere defined. We proved that this function is either bounded from above by a constant or it is greater than or equal to n for infinitely many n.
The novelty of the work is as follows:
For the function , which characterizes the complexity of deterministic decision trees, we have received an exhaustive description of the types of its behavior.
For the function , which characterizes the complexity of nondeterministic decision trees, we have received an exhaustive description of the types of its behavior.
For the function , which characterizes relationships between the complexity of deterministic and nondeterministic decision trees, we have received a preliminary description of the types of its behavior that requires additional study.
The obtained results allow us to point out the cases when the complexity of deterministic and nondeterministic decision trees is essentially less than the complexity of the set of attributes attached to columns of the table. This may be useful in applications.
The paper consists of six sections. In
Section 2, main definitions and notation are considered. In
Section 3, we provide the main results.
Section 4 contains auxiliary statements. In
Section 5, we prove the main results.
Section 6 contains short conclusions.
2. Main Definitions and Notation
Denote and, for any , denote . Let be the set of attributes (really names of attributes). Two attributes are considered different if .
2.1. Decision Tables
First, we define the notion of a decision table.
Definition 1. Let . Denote by the set of rectangular tables filled with numbers from in each of which rows are pairwise different, each row is labeled with a number from ω (decision), and columns are labeled with pairwise different attributes from P. Rows are interpreted as tuples of values of these attributes. Empty tables without rows belong also to the set . We will use the same notation Λ for these tables. Tables from will be called decision tables.
Example 1. Figure 1 shows a decision table from . Denote by the set of tables from in each of which all rows are labeled with the same decision. Let .
Let T be a nonempty table from . Denote by the set of attributes attached to columns of the table T. Let and . We denote by the table obtained from T by the removal of all rows that do not satisfy the following condition: in columns labeled with attributes , the row has numbers , respectively.
We now define two operations on decision tables: the removal of columns and changing of decisions. Let .
Definition 2. Removal of columns. Let . We remove from T all columns labeled with the attributes from the set D. In each group of equal rows on the remaining columns, we keep one with the minimum decision. Denote the obtained table by . In particular, and . It is obvious that .
Definition 3. Changing of decisions. Let (by definition, ). For each row of the table T, we replace the decision attached to this row with . We denote the obtained table by . It is obvious that .
Definition 4. Denote . The set is the closure of the table T under the operations of removal of columns and changing of decisions.
Example 2. Figure 2 shows the table , where is the table shown in Figure 1, and . Definition 5. Let and . Denote . The set is the closure of the set A under the considered two operations. The class (the set) of decision tables A will be called a closed class if .
Let and be closed classes of decision tables from . Then, is a closed class of decision tables from .
2.2. Deterministic and Nondeterministic Decision Trees
A finite tree with the root is a finite directed tree in which exactly one node called the root has no entering edges. The nodes without leaving edges are called terminal nodes.
Definition 6. A k-decision tree is a finite tree with the root, which has at least two nodes and in which
The root and edges leaving the root are not labeled.
Each terminal node is labeled with a decision from the set ω.
Each node, which is neither the root nor a terminal node, is labeled with an attribute from the set P. Each edge leaving such a node is labeled with a number from the set .
We denote by the set of all k-decision trees. Let . We denote by the set of attributes attached to nodes of that are neither the root nor terminal nodes. A complete path of is a sequence of nodes and edges of in which is the root of , is a terminal node of and, for , the edge leaves the node and enters the node . Let . If , then we correspond to the table T and the complete path a decision table . If , then . If and, for , the node is labeled with the attribute and the edge is labeled with the number , then .
Definition 7. Let . A deterministic decision tree for the table T is a k-decision tree Γ satisfying the following conditions:
Only one edge leaves the root of Γ.
For any node, which is neither the root nor a terminal node, edges leaving this node are labeled with pairwise different numbers.
.
For any row of T, there exists a complete path τ of Γ such that the considered row belongs to the table .
For any complete path τ of Γ, either or all rows of are labeled with the decision attached to the terminal node of τ.
Example 4. The 2-
decision tree shown in Figure 3 is a deterministic decision tree for the decision table shown in Figure 1. Definition 8. Let . A nondeterministic decision tree for the table T is a k-decision tree Γ satisfying the following conditions:
.
For any row of T, there exists a complete path τ of Γ such that the considered row belongs to the table .
For any complete path τ of Γ, either or all rows of are labeled with the decision attached to the terminal node of τ.
Example 5. The 2-
decision tree shown in Figure 4 is a nondeterministic decision tree for the decision table shown in Figure 1.
Figure 4.
A nondeterministic decision tree for the decision table shown in
Figure 1.
Figure 4.
A nondeterministic decision tree for the decision table shown in
Figure 1.
2.3. Complexity Measures
Denote by B the set of all finite words over the alphabet , which contains the empty word and on which the word concatenation operation is defined.
Definition 9. A complexity measure is an arbitrary function that has the following properties: for any words ,
if and only if —positivity property.
for any word obtained from by permutation of letters—commutativity property.
—nondecreasing property.
—boundedness from above property.
The following functions are complexity measures:
Function h for which, for any word , , where is the length of the word .
An arbitrary function
such that
, for any
,
and, for any nonempty word
,
An arbitrary function such that , for any , , and, for any nonempty word , .
Definition 10. A bounded complexity measure is a complexity measure ψ, which has the boundedness from below property: for any word , .
Any complexity measure satisfying the equality (
1), in particular the function
h, is a bounded complexity measure. One can show that if functions
and
are complexity measures, then the functions
and
are complexity measures, where for any
,
and
. If the function
is a bounded complexity measure, then the functions
and
are bounded complexity measures.
Definition 11. Let ψ be a complexity measure. We extend it to the set of all finite subsets of the set P. Let D be a finite subset of the set P. If , then . Let and . Then, .
2.4. Parameters of Decision Trees and Tables
Let and be a complete path of . We correspond to the path a word : if , then , and if and, for , the node is labeled with the attribute , then .
Definition 12. Let ψ be a complexity measure. We extend the function ψ to the set . Let . Then, , where the maximum is taken over all complete paths τ of the decision tree Γ. For a given complexity measure ψ, the value will be called the complexity of the decision tree Γ. The value will be called the depth of the decision tree Γ.
Let be a complexity measure. We now describe the functions , , , , , , , and N defined on the set and taking values from the set . By definition, the value of each of these functions for is equal to 0. Let .
, where the minimum is taken over all deterministic decision trees for the table T.
, where the minimum is taken over all nondeterministic decision trees for the table T.
A set is called a separating set for the table T if the sets of columns labeled with attributes from D rows of the table T are pairwise different. Then, is the minimum cardinality of a separating set for the table T.
. This is the complexity of the set of attributes attached to columns of the table T.
Let be a row of the table T. Denote , where the minimum is taken over all subsets D of the set such that in the set of columns of T labeled with attributes from D, the row is different from all other rows of the table T. Then, , where the maximum is taken over all rows of the table T.
.
If , then . Let , , and columns of the table T be labeled with the attributes . Let . Denote by the minimum number for which there exist attributes such that and . Then, .
is the number of rows in the table T.
For the complexity measure h, we denote , , , and . Note that is the number of columns in the table T.
Example 6. We denote by the decision table shown in Figure 1. One can show that , , , , , , , and . 3. Main Results
In this section, we consider results obtained for the functions , , and and discuss closed classes of decision tables generated by information systems.
3.1. Function
Let
be a bounded complexity measure and
A be a nonempty closed class of decision tables from
. We now define a function
. Let
. Then
The function
characterizes the growth in the worst case of the minimum complexity of a deterministic decision tree for a decision table from
A with the growth of the complexity of the set of attributes attached to columns of this table.
Let be an infinite subset of the set in which, for any , . Let us define a function . Let . If , then . If, for some , , then .
Theorem 1. Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from . Then, is an everywhere defined nondecreasing function such that for any and . For this function, one of the following statements holds:
- (a)
If the functions and N are bounded from above on class A, then there exists a positive constant such that for any .
- (b)
If the function is bounded from above on class A and the function N is not bounded from above on class A, then there exist positive constants , , , such that for any .
- (c)
If the function is not bounded from above on class A, then there exists an infinite subset D of the set ω such that for any .
Thus, for the function , we have received an exhaustive description of the types of its behavior. Type (a) is degenerate: the number of rows in decision tables from the closed class is limited from above by a constant. Type (b) is of most interest to us: the complexity of deterministic decision trees behaves in the worst case as the logarithm on the complexity of the set of attributes in the table. Type (c) is not of particular interest: the complexity of deterministic decision trees in the worst case is the same as the complexity of the set of attributes in the table.
3.2. Function
Let
be a bounded complexity measure and
A be a nonempty closed class of decision tables from
. We now define a function
. Let
. Then
The function
characterizes the growth in the worst case of the minimum complexity of a nondeterministic decision tree for a decision table from
A with the growth of the complexity of the set of attributes attached to columns of this table.
Theorem 2. Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from . Then, is an everywhere defined nondecreasing function such that for any and . For this function, one of the following statements holds:
- (a)
If the function is bounded from above on class A, then there exists a positive constant c such that for any .
- (b)
If the function is not bounded from above on the class A, then there exists an infinite subset D of the set ω such that for any .
Thus, for the function , we have received an exhaustive description of the types of its behavior. Type (a) is of most interest to us: the complexity of nondeterministic decision trees is bounded from above by a constant. Type (b) is not of particular interest: the complexity of nondeterministic decision trees in the worst case is the same as the complexity of the set of attributes in the table.
3.3. Function
Let be a bounded complexity measure and A be a nonempty closed class of decision tables from . We now define possibly partial function . Let . If the set is infinite, then the value is undefined. Otherwise, .
The function characterizes the growth in the worst case of the minimum complexity of a deterministic decision tree for a decision table from A with the growth of the minimum complexity of a nondeterministic decision tree for this table.
Theorem 3. Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from . Then, and is a nondecreasing function in its domain.
If is not an everywhere defined function, then its domain coincides with the set for some .
If the function is everywhere defined, then one of the following statements holds:
- (a)
If the function is bounded from above on the class A, then there is a nonnegative constant c such that for any .
- (b)
If the function is not bounded from above on the class A, then there exists an infinite subset D of the set ω such that for any .
Remark 1. From Theorem 1, it follows that the function is bounded from above on class A if and only if the functions and N are bounded from above on class A.
For the function , we have received a preliminary description of the types of its behavior. Type (a) is degenerate: the number of rows in decision tables from the closed class is limited from above by a constant. Type (b) is of most interest to us. However, more research is needed to understand how the function can behave within this type.
3.4. Family of Closed Classes of Decision Tables
Let
U be a set and
be a finite or countable set of functions (attributes) defined on
U and taking values from
. The pair
is called a
k-
information system. A
problem over
is an arbitrary tuple
, where
,
and
are functions from
with pairwise different indices
. The problem
z is to determine the value
for a given
. Various examples of
k-information systems and problems over these systems can be found in [
7].
We denote by a decision table from with n columns labeled with attributes . A row belongs to the table if and only if the system of equations has a solution from the set U. This row is labeled with the decision .
Let the algorithms for solving problem z be algorithms in which each elementary operation consists of calculating the value of some attribute from the set on a given element . Then, as a model of the problem z, we can use the decision table , and as models of algorithms for solving the problem z, we can use deterministic and nondeterministic decision trees for the table .
Denote by the set of problems over and . One can show that ; i.e., is a closed class of decision tables from generated by the information system .
Closed classes of decision tables generated by k-information systems are the most natural examples of closed classes. However, the notion of a closed class is essentially wider. In particular, the union , where and are k-information systems, is a closed class, but generally, we cannot find an information system such that .
3.5. Example of Information System
Let
be the set of real numbers and
be the set of functions defined on
and taking values from the set
such that, for any
and
,
Let be a bounded complexity measure and . One can prove the following statements:
The function N is not bounded from above on the set A.
The function is bounded from above on the set A if and only if there exists a constant such that for any .
The function is not bounded from above on the set A.
The function is everywhere defined if and only if, for any , the set is finite.
4. Auxiliary Statements
This section contains auxiliary statements.
It is not difficult to prove the following upper bound on the minimum complexity of deterministic decision trees for a table.
Lemma 1. For any complexity measure ψ and any table T from , The notions of a decision table and a deterministic decision tree used in this paper are somewhat different from the corresponding notions used in [
23]. Taking into account these differences, it is easy to prove the following statement, which follows almost directly from Lemma 1.3 and Theorem 2.2 from [
23].
Lemma 2. For any complexity measure ψ and any table T from , The following two statements are simple generalizations of similar results obtained in [
20] for decision tables with 0–1-decisions. For the sake of completeness, we present their proofs.
Lemma 3. For any complexity measure ψ and any table T from , Proof. Let . Then, . Therefore, .
Let , and be attributes attached to columns of the table T. Denote and . Evidently, . Taking into account that the function has the nondecreasing property, we obtain that any two rows of T are different in columns labeled with attributes from the set D. Let for the definiteness, .
Let . Let be a row of . Since , there exist attributes of the table such that the row is different from all other rows of in columns labeled with these attributes and . It is clear that . Therefore, .
Let
be not a row of
. We consider
m tables
,
, ⋯,
. If
, then
. Since
,
. Taking into account that
, we obtain
. Let
. Then, there exists
such that
and
. Denote
and
. Evidently,
. Therefore, in
, there are attributes
such that the row
is different from all other rows of the table
in columns labeled with these attributes and
. One can show that
Therefore,
. Hence,
Since
,
. Using the boundedness from above property of the function
, we obtain
. Therefore,
.
Thus, for any , . As a result, we obtain . □
Lemma 4. For any table T from , Proof. If , then and the considered inequality holds. Let . Then, . Denote . Evidently, for any row of the table T, there exist attributes and numbers such that the table contains only the row . Therefore, there is a one-to-one mapping of rows of the table T onto some set G of pairs of tuples of the kind where and . Evidently, . Therefore, . □
Lemma 5. For any table T from , there exists a mapping such that Proof. Let and be a mapping for which for any such that . Denote . Let be a deterministic decision tree for the table such that . Denote by the number of terminal nodes of . Evidently, . One can show that . Therefore, . Since , . Hence, . Taking into account that , we obtain . □
It is not difficult to prove the following upper bound on the minimum cardinality of a separating set for a table by the induction on the number of rows in the table.
Lemma 6. For any table T from , Lemma 7. For any complexity measure ψ and any table T from , Proof. Let . If , then . Let . It is clear that each deterministic decision tree for the table T is a nondeterministic decision tree for the table T. Therefore, . □
Lemma 8. For any complexity measure ψ and any table T from , which contains at least two rows, there exists a table such that Proof. Let be a row of the table T such that . Let D be a subset of the set with the minimum cardinality such that and in the set of columns labeled with attributes from D, the row is different from all other rows of the table T. Let be the tuple obtained from the row by the removal of all numbers that are in the intersection with columns labeled with attributes from the set . Let and, for any , if , then and if , then . Denote .
From the fact that
D has the minimum cardinality and from the properties of the function
, it follows that for any attribute from the set
D, there exists a row of
T, which is different from the row
only in the column labeled with the considered attribute among attributes from
D. Therefore, for any attribute of the table
, there exists a row of
, which is different from the row
only in the column labeled with this attribute. Thus,
Using properties of the function
, we obtain
. From this inequality and from (
2), it follows that
Let
be a nondeterministic decision tree for the table
such that
,
be a complete path of
such that the row
belongs to the table
, and
. It is clear that
is the only row of the table
. Therefore, in columns labeled with attributes
the row
is different from all other rows of the table
. Thus,
. Therefore,
and
. Using Lemmas 1 and 7, we obtain
. Therefore,
By the choice of the set
D,
. From this equality and from (
3) and (
4), it follows that
. □
It is not difficult to prove the following upper bounds on the minimum complexity of nondeterministic decision trees for a table.
Lemma 9. For any complexity measure ψ and any table T from , 5. Proofs of Theorems 1, 2 and 3
Proof of Theorem 1. Since A is a closed class, . By definition, . Using this fact and Lemma 1, we obtain that is an everywhere defined function and for any . Evidently, is a nondecreasing function. Let and . Using the positivity property of the function , we obtain . Therefore .
(a) Let the functions and N be bounded from above on the class A. Then, there are constants and such that and for any table . Let . Taking into account that A is a closed class, we obtain . By Lemma 3, . From this inequality, inequality and from Lemma 2, it follows that . Denote . Taking into account that T is an arbitrary table from the class A, we obtain that for any .
(b) Let the function
be bounded from above on class
A and the function
N be not bounded from above on
A. Then, there exists a constant
such that for any table
,
Let
and
T be an arbitrary table from
A such that
. If
; then, evidently,
. Let
. Using the boundedness from below property of the function
, we obtain
and
. From these inequalities and Lemma 4, it follows that
. From (
5) and Lemma 3, it follows that
. Using the last two inequalities and Lemma 2, we obtain
Denote
and
. Then,
. Taking into account that
n is an arbitrary number from
and
T is an arbitrary table from
A such that
, we obtain that for any
,
Let
. Denote
and
. We now show that
Denote
. Taking into account that the function
N is not bounded from above on set
A, we obtain that there exists a table
such that
. Let
C be a separating set for the table
T with the minimum cardinality such that
for any
. The existence of such a set follows from the inequality (
5) and properties of commutativity and nondecreasing of the function
. Evidently,
. Let
D be a subset of the set
C such that
. Denote
. One can show that for any attribute of
, there are two rows of the table
that differ only in the column labeled with this attribute. Therefore,
D is a separating set for the table
with the minimum cardinality. By Lemma 6,
.
Using Lemma 5, we obtain that there exists a mapping
such that
Denote
. Since
is a bounded complexity measure,
. Using the boundedness from above property of the function
, we obtain
. Hence, the inequality (
8) holds. From (
7) and (
8), it follows that
for any
.
(c) Let the function be not bounded from above on class A. Using Lemma 8, we obtain that the set is infinite. Since the class A is closed, , and since , . Evidently, for any , . Taking into account that is a nondecreasing function, we obtain that for any . □
Proof of Theorem 2. Since A is a closed class, . By definition, . Using this fact and Lemma 9, we obtain that is an everywhere defined function and for any . Evidently, is a nondecreasing function. Let and . Using the positivity property of the function , we obtain . Therefore, .
(a) Let the function be bounded from above on the class A. Then, there is a constant such that for any . By Lemma 9, for any . Therefore, for any , .
(b) Let the function be not bounded from above on class A. Using Lemma 8, we obtain that the set is infinite. Since the class A is closed, , and since , . Evidently, for any , . Taking into account that is a nondecreasing function, we obtain that for any . □
Proof of Theorem 3. Let and both n and belong to the domain of the function . Immediately from the definition of this function, it follows that . Let and . Using the positivity property of the function , one can show that . From Lemma 2, it follows that . Therefore, .
Let be not an everywhere defined function and m be the minimum number from such that the value is not defined. It is clear that and the value is not defined for each such that . Denote . Then, the domain of the function is equal to .
We now consider the case when is an everywhere defined function.
(a) Let there exist a nonnegative constant such that for any table . Then, evidently, for any .
(b) Let there be a nonnegative constant c such that for any table . Let us assume that there exists a nonnegative constant such that for any table . Then, the value is not defined, but this is impossible. Therefore, the set is infinite. Since , . By Lemma 7, for any table . Hence, for any . Taking into account that is a nondecreasing function, we obtain that for any . □