1. Introduction
Contemporary practices in artificial intelligence (AI) research have emphasized the significance of ontologies in formally representing knowledge within specific domains or scopes. According to Gruber [
1], an ontology serves as an explicit specification of a conceptualization, outlining the concepts, entities, and relationships inherent in a knowledge domain. Ontologies encode the formal structure of existence (formal ontologies) and categorize what exists in a domain (material or domain ontologies), connecting scientific disciplines by making their assumptions and commitments explicit. The proliferation of ontological research, sometimes called the “Ontological Turn”, demonstrates the growing recognition of the importance of formalizing knowledge [
2]. In a more applied context, viewing ontologies as the ideal references for accuracy in data science (e.g., in knowledge graphs) makes their creation, usage, and maintenance essential tasks for the robustness of results and their explainability.
In computer science, a formal ontology represents a precise and rigorous mathematical–logical framework for collectively understood concepts, presented in a format comprehensible and processable by computers [
3]. Achieving such conceptualization requires designing formal definitions for the vocabulary, expressed through axioms and rules. This facilitates the exact definition, constraint, and interrelation of concepts, thereby eliminating potential confusion or ambiguity.
Various formalisms can be utilized to represent a formal ontology, including first-order logic (FOL), higher-order logic, description logic (DL), or RDF, among others. The choice of formalism is determined not only by representational considerations, such as expressiveness, but also by the need to support automated reasoning, ensure conceptual consistency, infer new knowledge, and facilitate general computational analysis, among other practical considerations.
However, effectively leveraging ontologies is faced with challenges. As discussed by Gomez et al. [
4], ontologies must strike a balance between the complexity of formal representation and the reasoning capabilities they afford. Overly simplistic ontologies may fail to capture the nuances of complex domains, limiting inferential capabilities, while excessively complex ontologies can impede efficient computational reasoning. Ongoing research aims to develop techniques for constructing ontologies at an appropriate level of complexity, supporting robust knowledge representation and reasoning (KRR) paradigms. In addressing this challenge, a thorough analysis of the logical representation language is essential [
5].
Ontology engineering (OE) involves several stages that range from project conceptualization to ontology maintenance and documentation. The two initial extraction-based steps focus on requirement specification and conceptualization of the domain. In the conceptualization stage, relevant domain concepts and relations are identified to represent the informal conceptual model.
The next two stages focus on formalizing the conceptualization, both from a logical perspective (e.g., with description logic) and in terms of implementation (e.g., with OWL), enabling web standard representation as well as reasoning engines.
Lastly, other activities are necessary to enhance the quality and usability of the ontology. The first of these is the evaluation stage, which involves assessing the correctness, completeness, and conciseness of the implemented ontology relative to the requirements. The last stage centers around documentation and maintenance activities, which include documenting design decisions and keeping the ontology up to date in evolving domains via planned revisions and extensions, preserving backward compatibility when possible.
This paper is focused on the foundational aspects associated with the aforementioned process, particularly concentrating on the critical maintenance task, which can involve revising concepts and relationships when there are changes in the domain, particularly for ontology reuse, with a primary focus on foundational or formal ontologies. Foundational ontologies are pivotal in KRR and reuse because they contribute to the creation of shareable and reusable domain ontologies [
6]. A persistent challenge in OE is the need for a comprehensive analysis of the relationship between foundational and domain ontologies.
The OE process outlined in the introduction poses several logical challenges, many of which can be effectively addressed through the adoption of a robust methodology. A notable approach to (formal) ontology design is the “definitional methodology” proposed by Bennett [
7]. This approach is rooted in the recognition that definitions play a central role in ontology construction. The idea is to begin by establishing a small set of primitive concepts with precisely defined model-theoretic semantics upon which other concepts and relationships are built. Definability theorems are then introduced to identify concepts that can be defined in terms of others. The expressive power of definitions, the importance of primitives, and the comprehension of definability are all factors that contribute to the resilience of ontologies when constructing extensive ontological vocabularies.
Despite its advantages, the definitional strategy may be perceived as overly rigid due to stringent requirements such as categoricity, where any two models of ontology are equivalent in a certain sense. In reality, the characterization of classes and relations in the “intended models” could be more crucial than in the general FOL class of models to which logical categoricity refers. An illustrative case study can be found in QSR ontologies, where a recognized class of real spatial models represents the target of KRR practices. While such models may not be logically characterized without considerable complexity, in practice, an affordable approach that ensures essential features may be considered sufficient.
Research has demonstrated the value of developing and reusing ontologies that formalize general concepts across domains. There is a great deal of well-founded explicit knowledge formalizing general notions (such as time concepts and the part-of relation) [
8]. However, it is often the case that instead of reusing existing ontologies, engineers create custom procedural programs that implicitly encode such knowledge. This practice results in duplicated effort and lost opportunities to leverage prior work. The creation of formal ontologies represents a substantial investment of time and skill to form standardized, logic-based conceptual systems. As a result, enabling ontology reuse across different applications has become a key priority within knowledge engineering fields. Effective ontology reuse offers many potential benefits, including reducing redundant modeling work, promoting consistency across projects, and improving semantic interoperability for knowledge transfer.
As ontologies are reused in new contexts, it often becomes necessary to re-examine ontological commitments made in the original ontology development. Such commitments may include the definitions of classes, the relationships between them, and the properties of those classes. In some cases, the new application environment may require modifications or extensions to the ontology to represent the concepts and relationships in the domain accurately. Several factors can motivate such ontological revisions instead of simple ontology reuse, including the following:
Emergence of new entity types or activities in the domain not currently modeled in the ontology;
Changes in assumed relationships between existing domain concepts;
Novel user requirements and use cases not originally envisioned;
Inaccuracies in the original conceptual model exposed during ontology deployment.
When revising an ontology, it is crucial to ensure compatibility with existing models by properly interpreting changes. This becomes particularly critical when existing definitions require modification. For instance, when introducing new concepts or relationships that expand the ontology, it is advisable to provide theoretical justification connecting these additions to the original theory. This approach allows for the reconsideration of the class of representative models from the source theory (i.e., the “intended models” that the engineer aims to represent) to incorporate the new ontology elements in a principled manner. Overall, ontology revisions should strike a balance between innovation and continuity, achieved through sound theoretical grounding and methodical integration with existing elements.
Despite the aforementioned desiderata, it is common for some of new ontology components to be undefinable or only partially characterized based on the elements of the source ontology. Consequently, the definitional methodology may not be suitable in such cases. Alternative approaches are needed to effectively capture the relationships between the existing and newly introduced elements, ensuring the coherence and integrity of the revised ontology. This underscores the importance of a systematic and carefully planned expansion strategy when modifying ontologies, ensuring the ongoing relevance and applicability of the underlying conceptual framework.
1.1. From Ontologies to Relation Calculus
In advocating for ontology-building methodologies, computational solutions derived from the ontology characterize relationships and concepts. This facet plays a pivotal role in guaranteeing the resilience of the associated computation [
9] (e.g., spatial reasoning). By ensuring that the definitions within the ontology effectively capture the intricacies of relationships and concepts, the qualitative computation derived from such formalism becomes a reliable foundation. This not only strengthens the computational aspects of the ontology but also establishes a solid ground for employing spatial theory in the development and reasoning processes.
Analyzing the computational complexity of reasoning problems in the (revised) ontology is an important research direction. As Lutz et al. [
10] demonstrate, model–theoretical techniques can be used to study the complexity of constraint satisfaction problems (CSP). Such techniques can provide valuable insights into the scalability of ontology-based systems.
In the field of AI, qualitative calculus serves as a specific type of mathematical formalism crafted to represent and reason about relations and properties in a qualitative, rather than quantitative, manner. This formalism centers on qualitative properties, relations, and changes within the objects or processes under study, utilizing a finite set of symbols or qualitative values to encode these properties and relations.
Furthermore, qualitative calculus provides operators designed to combine qualitative values into meaningful expressions. For instance, it facilitates the composition of spatial relations and includes inference rules to derive new qualitative knowledge from established facts. Additionally, certain calculi permit the qualitative simulation and analysis of how values change over time, enhancing their utility in understanding dynamic systems and temporal aspects. As Gruter points out, these characteristics make qualitative calculi a valuable tool for AI applications, offering a natural and effective approach to handling qualitative aspects of KRR [
9].
According to [
11], the paradigm of algebraic constraint-based reasoning, embodied in the notion of a qualitative calculus, has been studied within two frameworks. One framework defines qualitative calculus as “a non-associative relation algebra (NA) with a qualitative representation”, while the other defines it as “an algebra generated by Jointly Exhaustive and Pairwise Disjoint (JEPD) relations”. These frameworks provide complementary perspectives. The divergence of definitions creates confusion around the notion of a qualitative calculus and revisits the “what” question (cf. [
12]).
Emerging from Allen’s examination of temporal relations, the concept of a composition table (CT) has evolved into a crucial technique, offering an efficient inference mechanism for a broad spectrum of theories [
13]. In Allen’s interval algebra (IA), reasoning is manageable only for specific subalgebras and is executed through path consistency, a constraint propagation algorithm. Its rule-based implementation necessitates
rules for a subalgebra with n relations [
14].
The work in [
13] focuses on the challenge of providing comprehensive characterization of the class of theories and relational constraint languages. It investigates situations in which a complete proof procedure can be specified by a CT, shedding light on the potential and limitations of this approach. Likewise, in order to optimize the calculus, redundancy checking is a crucial task [
15].
Formally, in the context of a theory
T wherein
is a JEPD set of defined base relations, the composition of two of them is entailed by
T. That is, given
, the problem consists of determining the smallest subset
such that
In semantic terms, this definition ensures that whenever and hold in M a model of T, some i must exist such that a and c are related.
One of the primary challenges in qualitative spatial reasoning (QSR) revolves around integrating or expanding existing theories to incorporate new aspects within the same formalism while ensuring the existence of a sound calculus [
16,
17]. Even when a qualitative calculus is derived from a formal ontology, a common practice in ontology reuse involves reconsidering ontological commitments that would affect the calculus. For instance, in QSR, the dilemma between crisp and fuzzy spatial sets is often reevaluated. Addressing such issues requires taking various features and perspectives into account. On the one hand, there is a desire to maintain the characteristics of existing objects; on the other hand, there is a necessity to reassess some definitions and the associated qualitative calculus. This balance underscores the need to relax the definitional methodology, requiring only certain axioms and properties of the definitions to exhibit categoricity (i.e., common to all intended models).
Lastly, it is worth noting that qualitative calculus-based techniques have applications beyond foundational studies. For example, in [
14], a practical approach to rule-based temporal reasoning over RDF/OWL using Allen’s interval algebra (IA) is outlined, showcasing the versatility of the methodology. This example provides an idea of both the utility of having foundational ontologies and the need to adapt these ontologies to different applications through revision to a greater or lesser extent.
1.2. Formalization for Qualitative Spatial Representation and Reasoning
Representation and reasoning with qualitative spatial relations constitute an important problem in AI, with wide-ranging applications in fields such as geographic information systems, computer vision, autonomous robot navigation, natural language understanding, spatial databases, and image mining [
18]. One of the necessary elements (though not necessarily sufficient) for the effectiveness of a QSR proposal (e.g., for prediction) is the availability of a reasonably high-quality theory, especially when complete information is available and the range of scales involved, both temporal and spatial, is not extreme. When these conditions are not met, simulation becomes less effective or entirely inappropriate. In [
19], an insightful discussion on twelve features of physical reasoning problems that present challenges for simulation-based reasoning is developed. The paper also surveys alternative techniques for physical reasoning that do not rely on simulation.
Mereotopology is a field in logic and ontology concerned with parts, wholes, and connections. The term ‘mereotopology’ combines ‘mereology’ (the study of part-whole relations) and ‘topology’ (the study of connections and boundaries without considering the form). Mereotopology aims to provide a formal framework for describing concepts such as parts, wholes, boundaries, contact, and connection. It builds upon mereology by incorporating topological notions such as closure and neighborhood into the theory of parts and wholes. This expansion allows for a richer characterization of concepts that involve both parthood and connection relations.
The importance of mereotopology in qualitative spatial and temporal reasoning (QSTR) cannot be overstated. Serving as an alternative to set theory and Euclidean geometry, mereotopology finds applications across diverse fields beyond spatial and temporal reasoning (e.g., [
20]).
The ongoing debate surrounding the point versus object dilemma persists in various manifestations. Mereotopologies, as point-free methodologies, offer an alternative for modeling information while sidestepping certain perplexing assumptions inherent in set–theoretical systems. The essence of point-free approaches lies in introducing points into mereology or mereotopology only when their existence is unequivocally justified.
The task of constructing spatial ontologies is not without representational challenges, especially when dealing with diverse ontological entities such as bodies, space, and situated objects. An illustration of this is [
21], where the exploration of location ontologies is grounded in mereotopological pluralism.
Regarding language design itself, the scope encompasses both time and space, as well as the structural properties of objects, adopting a way of considering the structure of space and the entities that inhabit it. This design should draw inspiration from robust classifications and specifications.
The need for a sound language for representation is intrinsic to any ontology with a remarkable level of abstraction. For instance, in [
22], a taxonomy of part types is proposed based on the manner of attachment of a part to its parent whole, its degree of dependence on that whole, or external factors. As another example, in [
23], a novel approach to qualitative shape recognition and matching in design is introduced. The method employs an ordering information approach for the qualitative description of shapes, considering features such as angles, relative side lengths, concavities, convexities of the boundary, and color.
The design challenge in the case of ontologies for mereotopology involves difficulties from both a logical and representational standpoint, as well as in OE. Nuances come into play that are challenging to represent or interpret appropriately. This situation can lead to major issues that affect the ontology’s design itself. In addressing these issues, the authors of [
24] propose corrections and additions to the axiomatization of SUMO (
https://www.ontologyportal.org/ (accessed on 27 february 2024). Their work focuses on rectifying problems related to omitted and unintended models of classical mereology. Additionally, a more recent study by Silva in [
25] demonstrates that the existing axiomatization of SUMO fails to capture some of the intended models of classical mereology.
The logical approach to QSTR becomes ubiquitous in situations where there is a desire to formalize, almost at the verification level, the relationships between information, space, and objects. In this context, an example could be the paper [
26], where a spatio-temporal specification language resulting from the integration of temporal and spatial logic is introduced. Recent research on spatial and spatio-temporal model checking provides novel image analysis methodologies rooted in logical methods for topological spaces, as seen in [
27] in biomedicine.
Continuing in the field of biomedicine, the development of biomedical ontologies necessitates a meticulous formal analysis of foundational relations, utilizing various formal tools in the process. Ideally, an analysis in a highly expressive language, like FOL, should be complemented by analyses in less expressive but computationally tractable deductive systems, such as description logics [
28].
In the context of biomedical ontologies, the qualitative and time-dependent nature of spatial relations is highlighted in [
29]. The motivation stems from the relatively well-understood formal representation of mereological aspects, specifically parthood relations, in canonical anatomy. However, other aspects, including connectedness and adjacency relations between anatomical parts, their shape and size, as well as the spatial arrangement within larger anatomical structures, are less comprehensively represented in existing computational, anatomical, and biomedical ontologies. The paper [
30] also illustrates the utility of a formal spatial theory in clarifying spatial information within biomedical ontologies and enhancing their automated reasoning capabilities.
In the field of image mining, as presented in [
18], the authors introduce an innovative framework for robot vision based on concept lattice theory and
cloud model theory. The concept lattice serves as a representation of the human conceptualization process using mathematical formal language, while the cloud model provides a transformation model between qualitative concepts and quantitative numerical values. Notably, the authors contend that the problem at hand possesses a rough nature, implying a certain degree of indistinctness in the relationships addressed.
Alongside the logical dimension, computational considerations are crucial. Spatial–temporal reasoning can benefit from various tools derived from computational logic. For instance, in [
31], the authors propose an approach to trajectory calculus that can be implemented using answer set programming (ASP). ASP is also valuable for designing a formal framework for representing and reasoning about cardinal directions between extended spatial objects on a plane, as demonstrated in [
32].
The QSR team at the University of Leeds pioneered the development of several logical calculi designed for the representation and analysis of QSR on regions, with a particular focus on the so-called Region Connection Calculus (RCC) (see
Section 5 below and [
17,
33]). RCC has proven useful across several AI domains such as GIS. As articulated in [
34], their motivation lies in utilizing regions as the fundamental spatial entity and showcasing how a rich language can be constructed using few primitives. In the case of RCC, this begins with the concept of connection.
The reasoning within RCC encompasses both static and dynamic aspects, providing a comprehensive framework for QSR. Although RCC serves as a notable example of a mereotopological system, some discussion on its semantic nature is necessary [
35].
Certainly, alternative approaches to RCC exist, deviating from the definitional strategy inherent in the RCC axiom system. In [
36], the relations of contact, nontangential inclusion, and dual contact are treated as non-definable primitives. This extends the language of distributive contact lattices significantly. Part of the paper focuses on appropriately axiomatizing the new language termed Extended Distributive Contact Lattice (EDC-lattice) using universal first-order axioms that hold in all contact algebras. EDC-lattices can also be regarded as an algebraic tool for a specific subarea of mereotopology [
37]. Ref. [
38] provides an overview of research examining the cognitive validity of qualitative spatial calculi, with a focus on the RCC and Egenhofer’s intersection models (IM), in which both topological theories are frequently claimed to align with human spatial cognition and extensions.
1.3. On the Use of Automated Reasoning Systems in Ontology Engineering
Automated reasoning systems (ARS) provide tools to contrast hypotheses, specifications, and axioms. Results obtained by automated theorem provers (ATPs) ensure the correctness of certain ontological reusing practices. This allows verification of the logical validity of extensions or transformations without relying on unspecified intuitive properties or hidden ontological commitments [
39]. In QSR, using an ATP to establish properties ensures that only the knowledge expressed in the axioms is utilized, avoiding reliance on spatial intuitions not formalized in the theory.
In this paper, ATP Prover9 (
https://www.cs.unm.edu/~mccune/prover9/ (accessed on 27 February 2024)) and model finder Mace4 (
https://www.mcs.anl.gov/research/projects/AR/mace4/ (accessed on 27 February 2024)) are utilized to support findings in our work. Prover9 is software for automated first-order logic theorem proving, employing resolution and paramodulation inference. Key features include equality reasoning, customizable search strategies, proof output in natural deduction format, and stand-alone proof checking.
Complementing Prover9, the Mace4 system searches for finite models satisfying input statements and their negations. It clarifies axiomatic systems by testing conjectures and revealing counterexamples, further aiding Prover9 proof construction. Together, these tools enable robust development, testing, and formal proof of mathematical conjectures. Their versatility supports applications in mathematics, computer science, and AI research. In this paper, both systems are actively utilized in proofs.
1.4. Aim and Contributions of the Paper
Ontology reuse remains a challenge for the ontology community [
40]. The lack of a consensus on this notion poses a barrier in OE. Without understanding what constitutes reuse, it is difficult to perform or assist in ontology design for reuse.
The aim of this paper is to address the issue. More concretely, the main contributions of this paper are the following:
A proposal for the formal definition of ontology reusability (
Section 6), which includes extension and revision as tasks.
Beyond clarifying these concepts, the paper proposes a methodology for formal ontology reuse (
Section 6.3). It characterizes reuse operations that can guide reuse to meet specified requirements. This not only directs the reuse task but also assesses the implications of reusing an ontology.
As an illustrative case, the paper tackles a second aim: the problem of obtaining a sound interpretation (by conferring spatial meaning) to the extensions of RCC obtained-according to the methodology-by inserting an “undefined” relationship (
Section 7).
The application case of the methodology establishes the mereological paradigm in 3WD, providing a formal framework that encompasses various possibilities of representing 3WD mereological relations (as used, for example, in [
41]).
In a broad scope, the aim is to describe how the rudiments of first-order model theory (and computational logic) can be used to increase knowledge on generic extensions of the theory.
This paper substantially extends a previous contribution [
42].
1.5. Structure of the Paper
The next section provides a description of state-of-the-art work related to this paper, referring to the specific representational challenges of spatial KRR.
Section 3 offers a brief summary of the basic ideas of the three-way decision paradigm and provides some notes on three-way mereo(topo)logy.
Section 4 contains some examples motivating ontology revision, covering spatial notions in socio-geodemographic systems, spatial interpretation for ontology reasoning, and extending the formalism of description logic.
Section 5 presents background information on the mereotopological theory RCC required for the development of the paper.
The subsequent sections represent the formal instantiation of the ideas outlined earlier. Both contain the main contributions of the paper.
Section 6 is dedicated to presenting a type of ontological extension with a certain degree of categoricity and defining the methodology for ontology extension and revision. As the second main contribution of the paper, this methodology is then applied in
Section 7 to RCC.
Section 8 is devoted to comparing the results obtained with our methodology with other different ontological redesigns, such as the “egg-yolk” approach. In
Section 9, some conclusions, a discussion on the strengths and weaknesses of our proposal, and some ideas for future work are presented.
3. On the Three-Way Paradigm
The so-called three-way decision paradigm (3WD) has been proposed to study decision-making processes more comprehensively [
72,
73]. It is a decision-making framework that extends traditional binary classification systems by introducing a third decision option: uncertainty or no decision has been made. The 3WD framework acknowledges that, in certain situations, the available information may not be sufficient to confidently assign instance to either the positive or negative class. Thus, the inclusion of an undecided category offers a more realistic representation of uncertainty and augments the flexibility of decision models. It aims to unify various decision-making approaches that address indefiniteness or uncertainty within the same framework (cf. [
74] for an introduction). This approach aligns with a cognitive perspective [
75].
In the 3WD paradigm, decisions are based on three possibilities (or regions within the workspace): positive, negative, and undecided or boundary, enabling a more nuanced representation of uncertainty in decision-making processes.
Some 3WD approaches include working with posets and decision determination, interval-based evaluations, rough set theory, multi-valued logics, and extensions of formal concept analysis (FCA) [
76] or granular computing [
77].
According to Y. Yao [
75], trisecting the universe of discourse—positive, negative, and undecided or boundary—involves additional activities beyond trisection, designing evaluation functions, and finding thresholds. Namely, appropriately describing each region to identify where a situation belongs, transferring objects between regions based on the agent’s interests, and designing behavioral strategies for each region. In addition to such research focuses, a third should be added [
77] to evaluate the effectiveness of the designed strategy.
These aspects are intricately connected to complementary decision-making tasks, such as data collection/indexes, analyzing past decisions, and justification/explanation. The selection among these options depends on the problem’s nature and the designer’s choices (e.g., [
78]). Consequently, the principles of 3WD can be applied in fields like data science (including Big Data [
79]) and AI, with applications ranging from establishing machine learning processes using rough techniques. The 3WD techniques provide solutions applicable under uncertainty or when dealing with missing data/information [
80], handling imbalanced data [
81], addressing ambiguity in labeling [
82], or requiring incremental concept learning [
83] and concept mining within machine learning [
76]. More challenging applied areas include sentiment and opinion analysis [
84,
85].
Ref. [
86] is an example of the usefulness of 3WD in computational–logic environments. In such work, the notion of knowledge harnessing is introduced to study how to extract valid information from conflicting or uncertain data within the 3WD paradigm. The utility of information is measured by assessing the semantic closeness of the new information to the existing knowledge within the knowledge base or by estimating the change in the structure of the information relative to the original [
87].
A classic framework for 3WD suggests working with evaluations with rank on a poset. In the particular case of the range of values of the evaluation is the (ordered) interval
, identifying 0 with
false and 1 with
true. An acceptance–rejection evaluation is a function
and the three decision regions are as follows:
;
;
.
In mereotopology, the 3WD paradigm is naturally suitable for accommodating two possible extensions of reasoning with regions. The first is considering regions with areas of indefiniteness (“fringes”), and the second is applying ideas of 3WD to spatial relations [
41,
88]. Although the cited papers are of a mereological nature, there is no doubt that the ideas are applicable to spatially extended regions (because of the usual domain universe in
). We will explore this concept further later on. In [
89], a novel approach is introduced and assessed that merges time-based granulation with three-way decisions. This method aims to aid decision-makers in comprehending and analyzing the granular structures acquired through the conceptualization of spatio-temporal events.
In QSTR, several approaches could be considered as 3WD-like. For example, building on the assumption that a vague object can be constructed as the conceptualization of a field, the work presented in [
50] classifies vague objects into five distinct types: direct field-cutting objects, focal operation-based field-cutting objects, element-clustering objects, object-referenced objects, and dynamic boundary objects. Subsequently, a categorization system is introduced, employing fuzzy set theory to formally articulate the semantic nuances among these diverse types of vague objects.
Ideas towards three-way mereology are a key component of recent approaches to classic classification/decision methods. For example, the two main types of three-way clustering described in [
88] would be completely characterized in this paper in the case of clusters characterized as regions (similar analysis could be done in other contexts, as in [
90,
91]).
The first of the 3WD clustering methods is the so-called three-way hard clustering, a type of clustering where each object is assigned to only one cluster. In this approach, there is no overlap between clusters, and each object is distinctly categorized into a single cluster. The clusters are represented by two disjoint sets: the core region and the fringe (boundary) region. The second method is three-way overlapping clustering, which allows objects to belong to more than one cluster. In this approach, clusters can overlap, meaning that an object may have partial membership in multiple clusters. This overlapping nature of clusters helps capture complex relationships and patterns in the data that may not be easily captured by traditional clustering methods. By allowing objects to have partial memberships in multiple clusters, three-way overlapping clustering can provide a more nuanced understanding of the data and improve clustering accuracy.
5. Formal Elements of the Mereotopological Theory RCC
The application of the proposed ontology extension method will be illustrated grounded in RCC as the foundational theory (
Section 6). In this section, we provide a more technical description of the key elements of this theory.
Roughly speaking, RCC can be seen as the outcome of applying the definitional methodology to the primitive connection relation between two regions, denoted as C(.,.). Axioms asserting that this relation is reflexive and symmetrical are introduced. The intended models of RCC operate on connected regular regions from topological spaces. In the design of RCC, the intended interpretation of connection is the topological closures of the two connected regions intersect.
The axiom set presented in
Table 1 expresses the remaining relations’ properties and definitions, which can be regarded as RCC’s set of axioms [
17]. We denote by
the first-order logic language used in these axioms. A key property of the set of RCC relations is that it possesses the mathematical structure of a lattice, derived from the partial order “provably subrelation of”.
Figure 5 illustrates the Hasse diagram of this lattice. Recall that the Hasse diagram of a partially ordered set is a directed graph constructed using the elements of the set as nodes, where the edges consist of all ordered pairs
such that
, and there exists no
z such that
.
Definition 1. Let Ω be a topological space and T be a mereotopological theory. An interpretation on Ω is an interpretation of the language of T whose universe is the class of regular sets in Ω. T is interpretable on Ω if there exists an interpretation on Ω, which is model of T.
The set of eight RCC binary relations shown in
Figure 1 is denoted as RCC8. These relations form a JEPD set. As a JEPD set, the set RCC8 is exploited as a calculus for constraint satisfaction problems (CSPs) (see [
12]). Another useful calculus is based on the JEPD
. In short, the key difference between RCC8 and RCC5 is that the former enables enriching the knowledge representation by utilizing region frontiers, while the latter does not. Although empirical evidence has found that RCC8 better suits representing topological relations distinguished by humans than RCC5 [
105], both are used here.
5.1. Relational Calculus in RCC
Since RCC8 forms a JEPD set, we can define relation composition and study the relation calculus over RCC8 as described in
Section 1.1.
The composition table is shown in
Table 2. For instance, the following fact is represented in the table: if region
A is a proper part (PP) of region
B, and region
B is a proper part (PP) of region
C, then by composition, we can infer that region
A is a proper part (PP) of region
C. The task of building the table involves analyzing how to compose relations.
The exhaustive analysis of composition, reflected in
Table 2, is provable from RCC (for example, using Prover9, it is possible to obtain formal proofs of each case).
5.2. The Challenge of the Interpretability of RCC Extensions
There are two types of extension to consider in OE: one involving a specific model (or class of models) while respecting the original ontology, and the other involving the redefinition of the model class in the ontology extension. This paper focuses on the latter.
Regarding the first type, an example is demonstrated in [
55], where it is shown how any consistent RCC8 binary constraint network (RCC8 network) can be consistently extended. This result ensures the transfer of tractability from a set of RCC8 relations to its closure under composition, intersection, and converse. In this way, we can enrich the mereotopological configuration of the regions under study with more regions. In reality, these types of extensions affect the population of the ontology (the new elements must be added while preserving consistency) and not the design of its axioms.
The second type of extension is foundational in nature as it pertains to the theories that underpin the models. Therefore, the task could be more complex. If an abstract extension of a standard QSR theory is obtained, it is necessary to extend the standard interpretation to accommodate the new concepts/relationships as spatial regions/relations. This type would encompass revisions of mereological ontologies implicit in machine learning to accommodate new methods, such as 3WD clustering [
41,
77,
80] commented on in
Section 3. If, in the new theory, the elements of the initial theory can be considered objects of the universe, then it can be interpreted within the former. We formalize this idea below.
Firstly, it is worth noting that the notion of interpretability between theories can be considered from different perspectives. For example, if our framework is OE, and we only want to use the knowledge represented in one theory in another target theory, it would suffice to find a syntactic translation mapping that operates on the axioms (e.g., [
25]). In this case, we are not relativizing the information to a special type of objects and properties.
Nevertheless, if we want to work with more complex interpretations that use the same language, we can resort to relativization. The relativization of a formula with respect to another consists of restricting the property expressed by the former to those elements that satisfy the latter (in our case, the objects considered in the models of the former ontology).
Definition 2. Let be a formula in a FOL language .
The idea is to use this type of interpretability to relativize any T extension of RCC to the regions considered in RCC. For example, a property in a theory about spatial vagueness could be relativized to the special case of (crisp) regular regions, which must be characterized by such a formula .
6. Lattice Categorical Ontology Extensions
From a certain perspective, ontologies could be viewed as qualitative theories aimed at representing the conceptualization of knowledge space. This standpoint includes the representation of class (concept) and relation hierarchies. Therefore, a fundamental requirement for a qualitative theory is the ability to entail basic relationships (those that the ontological engineer intends to preserve under extensions) among its concepts or relations. For instance, it has already been mentioned that RCC entails relationships between spatially defined relations, exhibiting a lattice structure of
Figure 5, and a composition calculus, (as shown in
Table 3) [
39].
Likewise, any extension or revision of the ontology should thus meet similar requirements. In an effort inspired by foundational questions in the semantic web [
106], a formal definition of a “robust ontology”, termed “lattice categorical extension” [
87], was proposed. In brief, a “lattice categorical” theory is one that entails the lattice structure of its basic relations. Next, we formalize this notion.
Given a fixed language , let be a finite set of concept/relation symbols, and let T be a theory. Given , a model M of a T, consider the structure in the language defined as follows. The universe of consists of interpretations in M of , with interpreted as for each . Here, ⊤ represents M, ⊥ represents ∅, and ≤ denotes the subset relation. We denote by the cardinal of set X.
It is assumed that is required to have a lattice structure, although this requirement can be relaxed as the poset of relations among the concepts in can be embedded into a lattice structure (even if it necessitates introducing dummy concept/relation names).
In order to work with the relationships between the elements of , it is necessary to work with equations that identify the basic operations with these elements.
Definition 3. Consider the language .
- 1.
The set of lattice closed equations in this vocabulary is denoted by .
- 2.
Given E is a set of equations, it is denoted by the translation of E to the set of first-order logic formulas in the language .
An example is presented in
Figure 6, where a set of equstions
E, and its translation
, is the set of formulas that appear in
Figure 7.
A relationship between
and model
M can be established through the union of two sets of formulas. The first is a finite set
(for example, that of
Figure 6 for RCC). The second is a set of formulas
that categorizes the lattice axioms under completion. The latter contains the following:
Idem: A lattice skeleton is a set E of lattice equations that characterizes (modulo ) the lattice of a model excluding isomorphism. The idea of a lattice skeleton is to have a set of interesting properties that we wish to preserve, and which are provable in the theory and its extensions.
Definition 4. Let E be an theory. We say that E is a lattice skeleton (l.s.) for a theory T if E verifies that.
There are such that , and
has a unique model except isomorphism.
Idem: The definition formalizes the concept of a comprehensive specification of the class or relation hierarchy, which, akin to description logic ontologies, a lattice skeleton would resemble a -box with a specific inferential completeness, showcasing all the “subclass of” relations between classes in its concept hierarchy, thereby ensuring that all models share these selected relationships (expressed in the lattice language).
Note that no condition of minimality on the set of equations
E is requested. The reason is that the designer aims to represent in
E all the information to be preserved in the extensions, even if some of it is redundant to satisfy both properties of the above definition. If this was not the case, the application of techniques developed in [
108] would allow (after passing to propositional formulas) us to reduce the size of the skeleton if necessary while preserving the information.
Theorem 1. Every consistent theory has a lattice skeleton.
Proof. (sketch) The proof is based on the description of a lattice skeleton for the theory T from a model :
First, the lattice is constructed.
Next, a set of lattice equations in the language is chosen so that there exists a single model of size (these two additional elements correspond to ⊤ and ⊥). Then, has a unique model.
□
6.1. Lattice Categorical Theories
Idem: It is worth noting that lattice skeletons are essentially model-based, so it is possible for there to be non-equivalent ones (modulo ). This poses challenges to reasoning with relations. While lattice skeletons characterize with intentional purposes, the existence of a unique lattice structure, up to isomorphism (thus entailed by T), would enhance relational reasoning. This uniformity across models of T ensures consistency in relationships among the relations and enables the entailment of the transition table by T.
Definition 5. A theory T is called a lattice categorical (l.c.) theory if every pair of lattice skeletons for T are equivalent theories modulo ; that is, for any and skeletons for T,as lattice theories. A key result regarding the extensions we aim to specify is the following, ensuring their existence:
Theorem 2. Every consistent theory T has an extension which is lattice categorical.
Proof. Idem: Let T be consistent and M be a model of T. Consider the partially ordered set whose elements are the interpretations for each , and the relation ≤ represents the subset relation. In terms of sets, the lattice operations ∧ and ∨ are interpreted as ∩ and ∪, respectively.
Consider the lattice generated by . Since is finite, this lattice is also finite. Therefore, we can select E as a finite set of equations such that has as its only model up to isomorphism.
Then, the theory would serve as such an extension. □
Idem: The most important feature of lattice categorical theories is that they guarantee reasoning only with the lattice equations is sufficient to work with the relations of the theory.
allows focusing the reasoning on the relationships among the elements of
, avoiding the use of the more complex
T. Lattice categoricity has been used for extending ontologies by the decision of the user, motivated by data and designed by the user [
87], and ontology merging [
109]. It is also necessary to emphasize that from a set
, one can reconstruct a theory that has
E as its skeleton, namely
. An important example for our purposes is RCC:
Theorem 3. RCC is a lattice categorical theory.
Proof. (sketch) The proof is assisted by Mace4 (to determine the models of the skeleton) and Prover9 (to find the justifying proofs).
In the input file for Mace4, we introduce the following:
The axioms characterizing the lattice theory (
Figure 8).
The axioms for unique names and domain closure would be missing. To simplify the coding work, each relation is assigned a number from 0 to 16, as shown in
Figure 5. This ensures the principle of unique names when working with Mace4 without explicitly stating the axiom in
. Similarly, if models of size
are sought with Mace4, the domain closure axiom of
is implicitly satisfied and does not need to be explicitly stated with formulas.
The result follows from the following facts:
The only model obtained by Mace4 (
Figure 9) has a Hasse diagram that matches that of
Figure 5.
entails all the formulas in
(provided in
Figure 7). It is necessary to prove this to verify that the operations of the elements of the model found by Mace4 (
Figure 6) are indeed theorems of
.
□
Figure 9.
Lattice operations of the model of
founded by Mace4, with
E being the lattice skeleton from
Figure 6.
Figure 9.
Lattice operations of the model of
founded by Mace4, with
E being the lattice skeleton from
Figure 6.
Finally, a lattice ontology is defined as a “complete” hierarchical view of a classic ontology as follows:
Definition 6. A lattice ontology is a pair , where T is the formal theory and E a lattice skeleton. If T is l.c., it will be called l.c. ontology.
An example of a lattice categorical ontology is
, where
E is the set of equations provided in
Figure 6. It is important to note that the concept of a lattice ontology is more constrained than other notions, such as the usual pair
(i.e.,
-box,
-box) in description logics. Let us consider a simple example of obtaining an lattice categorical extension in the form of a lattice ontology of a DL-ontology. Consider the ontology formed only with the
T-box (
):
Following the proof of Theorem 1, it suffices to select a model:
Next, the poset
is embedded into a lattice, for example,
L, whose Hasse diagram is depicted in
Figure 10. It is easy to see that the following lattice equation set is a skeleton for this lattice:
Therefore,
would be a lattice categorical extension, or, by explicating a DL-theory from
E,
, being
:
6.2. Ontological Extension and Revision
The following idea builds on concepts from [
86], where we examine how information inconsistent with the base theory can be retrieved. The operators from that paper are not directly applicable here because we must select the information through its corresponding translation into lattice language. Recall that this information is included in the skeleton because we aim to preserve it as valid in the extension. The following formalization of
robust ontological extension translates the notion of categorical extension in first-order logic (FOL) into a lattice-based ontology.
Definition 7. Given two lattice ontologies we will say that is a lattice categorical extension of (denoted by ) with respect to the sets of concepts and respectively, if
Please note that the above definition does not explicitly require any direct relationship between
and
, although such a relationship actually exists and can be extracted through
and
. Therefore, a type of lattice-based extension of the form
can be found, where
is the theory in the base first-order language without axioms. For example, for
E from
Figure 6, this would be effectively equivalent to
, where
consists of the formulas appearing as theorems in
Figure 7.
6.3. Extension Methodology
Taking into account the construction carried out in this section, the proposed methodology for extending (possibly with revision) a formal ontology consists of the following stages:
Obtain , a lattice categorical extension of the formal ontology to extend. In a skeleton , include the relations between the concepts/relations of the set of that are to be preserved. The goal is to extend the revise the theory, possibly extending with new concepts/relations, resulting in the language .
Extend the equations of to obtain a new set of equations, , where the fundamental relations that the new concepts/relations would have among themselves and with the original ones appear. Then, extend to obtain such that the theory has a unique model (lattice) up to isomorphism. The sought-after extension is .
Finally, interpret the new lattice ontology, possibly reinterpreting the elements of , to align the extension with the new intended models.
7. A Case Study: Extending RCC
The purpose of the second part of the paper is to explore, using the proposed methodology, extensions of RCC that accommodate relations of indefiniteness or vagueness among spatial regions. A motivation for this study is to provide a certified and complete blueprint of all possible definitions of uncertainty for RCC. Thus, the analysis should cover the various solutions existing in the literature. Additionally, this study would underpin the 3WD paradigm in RCC by offering all the possibilities of mereotopological combinations of regions with uncertainty.
The problem ventures into a meta-meteorological realm as it necessitates reasoning within the RCC theory and adjusting its axioms accordingly for an extension of the type
where
I will denote the undefined relationship under study.
According to the third stage of the extension methodology, to complete the study of the extensions, all new relations must be reinterpreted in the new intended models while respecting the global lattice structure of the relations.
Given the importance of transition tables in QSR, the new spatial relation
I, signifying “with a certain degree of indefiniteness”, should complement RCC8. That is,
should form a JEPD and should be incorporated into a new table of relation composition. Refer to [
87] for alternative extension approaches. Based on this condition, using
, it should demonstrate, with the assistance of Prover9, the new transition table.
If the intention is to interpret the new relation as “vague”, two options are available. The first option involves changing the semantics, as discussed in [
110], while the second option entails reinterpreting the nature of the intended models of the original ontology. This would require revising the semantics, but it would be tailored specifically for that new class of intended models, which is common in 3WD reinterpretations. In this paper, we opt for the second option, which, as we will demonstrate in the following sections, allows for the derivation of classical interpretations in QSR through lattice categorical extensions of RCC.
The main result on this kind of RCC extensions is the following:
Theorem 4. There are only eight l.c. extensions of (being E the lattice skeleton of Figure 6) by insertion of a new relation I such that is a JEPD set. Proof. (Sketch) The idea behind the proof is as follows: The language
is considered, and on this, a set of equations
is considered along with the equations asserting that
forms a JEPD. It is worth noting that, by the principle of unique names, the new
I will not necessarily be interpreted as ⊥ in any possible lattice model of this theory of size
. By applying Mace4, eight models are found (whose Hasse diagrams are described in
Figure 11).
Finally, the procedure is carried out as in Theorem 3 to verify with Prover9 that the chosen equation set for each extension constitutes a lattice skeleton with respect to . The l.c. extension being sought is . □
Roughly speaking, the analysis of the complete catalogue of lattice categorical (l.c.) extensions (
Figure 11) allows us to interpret the new relations as representing “undefinition up to a degree”. In other words, the relation
I indicates that the RCC8 relationship between the regions is not precisely known, but a less specific relationship between the regions can be determined. This concept requires further development with a robust model–theoretic interpretation, which would induce, in turn, an ontological revision. Similarly, as we will illustrate, an interpretation of this relation
I in terms of 3WD regions is naturally provided using RCC5 as a basis.
7.1. Interpreting with Pulsation/Contraction
Due to space constraints, the paper does not address the issue of interpreting spatial indefiniteness relations in a fuzzy generalization of RCC (as in [
61]). The primary focus lies in verifying that the methodology for obtaining extensions encompasses classical RCC extensions with indefiniteness as particular cases. Additionally, another aim is to illustrate the process of redefining the interpretation of the original relations in the new intended models.
Recall that, in the context of RCC, the intended models are those that consider their elements belonging to , which represents the regular sets of a topological space (typically Euclidean spaces). The illustrations provided aims to illuminate practical applications within this intended domain.
Definition 8. A pulsation on a topological space is a mapsuch that the topological closure of contains that of X; . The pair , where Ω is nontrivial, connected, and regular and is called a topological space with pulsation.
The first reinterpretation we will consider is inspired by 3WD, taking into account that each region of space induces a trisection of the space itself: the positive region (the region itself), an extending region that adds indefiniteness or a boundary (boundary region “a fringe” [
41]), and, lastly, the complement of both regions (negative region). See
Figure 12.
The reinterpretation of the relations on a topological space with pulsation is based on considering the pairs
as the intended regions. In three-way clustering [
88] (particularly in those inspired by Mathematical Morphology [
41]), pulsation can be understood as a natural operation on sets or regions that relates a core set with a fringe set representing indefiniteness in the cluster (
Figure 12).
Theorem 5. Seven of the eight extensions from Theorem 4 are interpretable in topological spaces with pulsation using the vague regions specified as , where x is a regular set.
Proof. Given , the natural interpretation of R in the topological space will be denoted by ().
For the sake of simplicity, we make use of the following conventions:
Let be a topological space with pulsation . To define the interpretation, is the structure on the language of (), and the interpretation of in is obtained by combination of relations. A modo de ilustración, only two of such interpretations, y , are described. The others are similar.
if
,
and
if ,
,
, and
□
Corollary 1. A mereotopological representation of 3WD clustering types is as follows:
From a logical perspective, the interpretability of RCC in the new extensions is feasible by adding a function symbol to the language for representing pulsation. This is possible because, as a reminder, the extension is determined by the lattice skeleton and not by the theory in the original first-order logic (FOL) language.
Theorem 6. The theory is interpretable in each extension .
Proof. (sketch) Please note that
is not in the language of
but it can be added to the language of the extension for representational purposes. Thus, the lattice categorical (l.c.) extension
is utilized, where
is a skeleton, and
The formula
for interpretation is as follows:
That is, in mereotopological terms, it expresses that , the closure of x, is equal to and the closure of ; thus, can be considered a classic RCC region. □
The relation does not have an interpretation in terms of pulsation. It necessitates the concept of “contraction”, as elaborated in the following section.
Theorem 7. cannot be interpreted in topological spaces with pulsation.
Proof. (sketch) The idea of the proof is to reason with the lattice of interpretations in a topological space. Let be a topological space with pulsation, and the lattice that induces it.
It has . Since , it follows that . Moreover, , hence . As for O, we have ; therefore, .
Concerning
, we know that
. From all this, it is concluded that
Let us see that, in this case, . Let be such that . That is, . If, in addition, , then ; that is, . Therefore, . □
To understand the spatial relation expressed by , it is necessary to consider relationships where the vague part is located “within” the region.
Definition 9. A contraction in a topological space is a map such that for each A witha nonempty interior. The pair , where Ω is nontrivial and connected, is called a topological space with contraction.
Theorem 8. is interpretable in a topological space with contraction.
Proof. Given , it is defined as the structure of the language as follows:
if ;
if ;
.
□
In
Figure 13, all the interpretations are described. The interpretations correspond, in essence, to a l.s. of every possible l.c. extension of RCC.
Theorem 9. Under each interpretation , defined above endows the lattice structure depicted in Figure 11. Proof. (sketch)
The proof of this theorem is very laborious; it consists of carrying out four stages to a successful conclusion:
Firstly, we need to demonstrate that for all , if in , then in , for all . This is equivalent to showing that the possets of RCC relations can be embedded into the lattice for each .
Secondly, we must demonstrate that for each , the relation occupies the position indicated in the corresponding Hasse diagram of the lattice . To do this, it is necessary to verify the following conditions:
;
;
;
;
;
;
.
As the third stage, it is necessary to demonstrate that for each , it holds that forms a pairwise disjoint set of relations.
Finally, it is verified that for each , the Hasse diagram of represents the union operation in the relation lattice of .
The details of the process are provided in
Appendix A.
□
Corollary 2. The set is a JEPD set under the interpretation for .
Proof. The lattice structure, faithfully represented in the corresponding Hasse diagram, illustrates such fact. □
7.2. Building the Composition Tables for the Extensions
One of the advantages of interpreting l.c. extensions is that it allows us to build a composition table for the extension in the case of the new JEPDs. As for RCC8, it is possible to prove the composition table for the new JEPD sets
. To illustrate the method, the table for
has been computed. This can be computed from the Hasse diagram
. As an example,
Table 3 shows the composition table for
. That is, the table represents, for each
, the minimal set
such that
Roughly speaking, the table that corresponds to the composition of relations where , coincides with the table we obtain for , except
If from the composition of two relations in is obtained or (or both of them), then it will appear or (or both of them), besides the relation .
As a consequence of the above, if in the composition table of the result of composing two relations is , then the result is set , which we have denoted as .
As an example, we present one of the calculations carried out. Specifically,
in models with pulsation. Given
regions in such a model:
|
|
∩ |
∪
|
∪ |
∪ |
|
∪ |
|
∪
|
|
8. Comparing Results with Other Interpretations of Vagueness
In this section, the results obtained by applying our methodology to RCC will be compared with two classical approaches to reasoning with vagueness in mereology. It will be shown that (1) the relationships introduced in the classical treatment of vague regions in RCC, known as the “egg-yolk” paradigm, are classified by the indefiniteness relationships obtained in the different lattice categorical extensions derived from RCC, and (2) the notions of contraction and pulsation provide mereotopological counterparts to the classical approaches used in rough set theory.
8.1. Interpretation in the “Egg-Yolk” Approach
The ’Egg-Yolk’ (e-y) representation [
63] introduces a novel approach to handling spatial vagueness by employing a pair of concentric regions to represent a vague region. This representation includes an inner region, termed the ’yolk’, which captures the core or most certain part of the vague region. Surrounding this yolk is the outer region, known as the ’white’, which encompasses the entire vague region, including its uncertain or borderline areas. This dual-region model allows for the explicit representation of vagueness in spatial entities, offering a more nuanced and flexible approach to spatial representation and reasoning.
In this section, an interpretation for will be described, performed on the egg-yolk. A complete picture of the relationship between undefined relations will be shown (as well as with RCC5) instead of a separate interpretation for each one.
The e-y idea reconciles with the mereological approach in the 3WD paradigm, as used in 3WD clustering [
41,
80,
91]. Consequently, RCC5 is employed in this paradigm instead of RCC8 for obvious reasons. A comprehensive description of e-y relations is depicted in
Figure 14.
Please note that the 46 possible relations in the e-y paradigm can be specified using RCC5 relations for the egg and the yolk. For example, given
and
are two e-y regions, relation 3 from
Figure 14 is specified in RCC5 by the following:
That is, the graphical representation n provided in the figure is a simplified representation of the relation between the e-y relationships specified with RCC5, similar to the example above. Thus, enumerating the possibilities will be very useful for representing the interpretation in the e-y paradigm in a simplified manner.
To simplify the interpretation of the relationships, we will use the following notation: when we write , we will be expressing that the interpretation of in the e-y paradigm corresponds to .
Given two e-y regions
,
and
, it is defined
by the following:
Since the objective to demonstrate is that the relations from are interpretable in this paradigm, we must find a specification in terms of the e-y paradigm relations.
Theorem 10. The set is interpretable in the egg-yolk paradigm.
Proof. The interpretation (using the enumeration of
Figure 14) is as follows:
and
which agree with (acording to lattice ). Thus:
.
,
By symmetry, and
.
, thus
.
.
□
Corollary 3. induces, in the egg-yolk interpretation, the lattice structure depicted in Figure 15. Proof. The proof follows from the classification of the interpretations of the relations e-y given in Theorem 10. □
Corollary 4. The set is JEPD, and also y . Moreover, it is possible to build composition tables for their calculus from the lattice of Figure 15. Theorem 11. The theory is interpretable in the class of models of the e-y paradigm.
Proof. Since, in the egg-yolk paradigm, regions are represented as pairs , it is necessary to add two function symbols to the language, and , to denote each component of a region.
The formula
for the interpretation of RCC is as follows:
□
It is also possible to interpret the new relations of indefiniteness through contraction in the e-y paradigm. Given a vague region a, the “egg-yolk” representation would be the pair . To make the reasoning more legible, we denote by the contraction of a. Using RCC5, the relations between each component of two regions can be specified.
Given
, we denote
by the set
; that is,
This way, RCC5 is naturally included. The interpretation of RCC5 and the new indefiniteness relations is as follows:
The interpretation of
coincides with the intuitive idea of being “proper part”. In our case, if a region
A is a proper part of another
B, then
a and
are proper parts of
and, consequently, of
b. This situation is represented by relation 24 in
Figure 14. Thus,
. Similarly, the remaining RCC5 relations can be interpreted.
On the other hand, the interpretation of
would be
, which, in turn, is the interpretation of
due to its position in
. Therefore,
The position of
in
suggests the interpretation
,
where .
By symmetry,
and
are defined similarly to
and
, respectively:
, therefore, , i.e.,
.
.
is nothing but a subset of
, as it contains figures where regions overlap but their contractions are discrete. Thus,
Finally, as
represents total indefiniteness, it will be represented by any of the figures in
Figure 14, so
As seen, it is not necessary to interpret relation although it is indeed possible since , which coincidentally is equal to .
This observation is part of the proof of the following result:
Theorem 12. In the egg-yolk interpretation described above, the relations comprise the lattice in Figure 16, along with RCC5. Proof. It is necessary to proceed following the same strategy as in Theorem 6. □
As a corollary, different JEPDs are available for constructing a calculus through transition tables:
Corollary 5. In the interpretation described above, the set of relations forms a JEPD
8.2. Interpretation Inspired by Rough Set Theory
In spatial reasoning, the focus on approximating a set is often associated with understanding its constituents (as illustrated in Example I), as well as with approximating or decomposing regions using others with a particular geometric or topological structure. This concept underlies the significance of rough approximation in naive set theory, which can be interpreted using concepts from RCC.
Given the ambient space
and a family of sets
, including subset
, it is approximated using a function
, where the values
represent the relations of full overlap, partial overlap, and no overlap. The function
indicates the degree of overlap between the subset
b and the elements
of the partition of
X.
The approximations and of the set b can be defined in this context as follows:
To offer a fresh interpretation of the indefiniteness relations employing rough notions, we will generalize the partition concept as follows. The underlying idea in this formalization is to consider sets with non-empty interiors as the “empty sets”.
Definition 10. A topological space Ω is called DR-Rough
if there exists a family of regular subsets of Ω such that:
- 1.
(i.e. );
- 2.
the set is regular;
- 3.
it is a covering of X, .
The family is called a DR-Rough cover.
Note that, in the previous definition, condition 2 is necessary since, in general, the union of regular sets is not regular. Consider the example in
Figure 17. In
with the usual topology, let
be the family of closed balls with center
and radius
. It is verified that
, since
but
.
The idea of working with regions as DR-Rough spaces is interesting because it relates rough approximation with the approximation provided in this paper. This concept cuts across many research lines. For example, when working with boundary regions in 3WD or in the geometric case and when attempting to restructure concept spaces using specific regions (e.g., [
111]). Additionally, in the case of the local treatment of massive data, it could be useful (cf. [
79]).
Example 1. An example of a DR-Rough topological space, consider and the cover where (Figure 18). In DR-rough spaces, the relations
can be interpreted with rough-type approximations. For this, consider a DR-rough cover
of non-empty regular closed sets. It is easy to prove the following (see
Figure 18):
Theorem 13. Let and . Then:
- 1.
is a pulsation, and
- 2.
is a contraction.
Thus, can be considered as an a rough fringe region of indecision in three-way clustering.
9. Conclusions and Future Work
The need to revise formal ontologies for reuse presents logical and computational challenges, making this task in OE difficult to execute safely while adhering to representational principles and coherence. An ontology revision lacking formal grounding may compromise the quality of the ontology and the accuracy of the data it supports. Conversely, while the definitional methodology is theoretically suitable for extending ontologies, it encounters challenges in practical application, particularly in the case of non-lightweight ontologies.
This paper presents a proposal for robustly extending and reusing formal ontologies, aiming to overcome the stringent constraints of the definitional methodology. The approach involves replacing the requirement of definability with weaker conditions associated with the (finite) ⊆ structure induced by the theory on its concepts/relations. By focusing on finite structures, it becomes feasible to effectively analyze extensions, facilitated by the use of ARS, enabling result verification. Moreover, this approach allows for the determination of all structures of a given type, with fewer requirements than the definitional methodology, facilitating the categorization and interpretation of generic extensions of formal ontologies.
The second part of the paper focuses on applying the extension methodology to classify and interpret various classes of RCC extensions for vagueness, as well as computing their composition tables. These extensions are part of the logical foundation for ontology cleaning with RCC, as described in [
99], and implemented using the associated tool. Furthermore, classic extensions of RCC have been reinterpreted as particular cases encompassed within the computed extensions. The case study also provides a solid foundation for reinterpreting mereotopological ontologies in the 3WD paradigm.
The strength of the methodology we present for obtaining ontological extensions lies in considering a notion of lattice categoricity that is weaker than that of first-order logic categorical theory. This notion focuses on the relationships among concepts/relations that are the representational objective of the ontology, avoiding the concept of categoricity in the sense of first-order logic, which is much more complex. In this way, with the assistance of automated reasoners, we can find the required extensions.
As a trade-off for the loss of “uniqueness” that the definitional methodology provided, the method has limitations due to the weakening of conditions. The first is that it is not a method that can be applied independently of the domain expert or ontology user. It would be advisable for the expert to choose the relationships (the lattice skeleton) they wish to preserve in the extension. Likewise, as observed in the case of the RCC extension studied in the paper, there may be multiple extensions satisfying those constraints. This fact necessitates that, once calculated, the expert must be consulted to select the appropriate one.
Although the first part of the paper conducts an analysis of the state of the art on RCC extensions and variants, it is important to emphasize that the logical–algebraic approach in studies on RCC is not entirely novel. For instance, in [
112], RCC5 is studied by defining relations between regions using Boolean tuples. A partial order is induced between the five RCC5 relations.
Ref. [
113] introduces a method for approximately analyzing binary topological relations between regions with indeterminate boundaries. It includes examples demonstrating the determination of the eight relations in 2D space. Additionally, spatial reasoning within a fuzzy region connection calculus is explored in [
61]. Notably, the paper establishes a close relationship with the egg–yolk paradigm and demonstrates that satisfiable knowledge bases can be realized by fuzzy regions in any dimension.
In [
114], multi-level topological relations of RCC-8 are introduced using concepts such as separation number and types of spatial elements. The paper focuses on detailing four relations: EC, PO, TPP, and TPPi at additional levels. On the other hand, in [
62], there is a proposal to extend algebraic and topological relations between sets and fuzzy sets to bipolar fuzzy sets.
Computing abstract RCC extensions may not be adequate when the new relations lack a mereotopological basis. Alternative methods like lattice alignment may be more appropriate. This issue will be the subject of future study. Likewise, estimating the complexity of constraint satisfaction problems (CSP) in the extensions is also an interesting research line. Additionally, we intend to apply the methodology to foundational ontologies, for example, attempting to resolve anomalies found in them through revisions (e.g., [
25]).