1. Introduction
Remote sensing allows one to acquire information from a distance from cameras, sensors, microphones, and other external devices. The data may originate from satellites, aeroplanes, and sonar systems, among other sources. Satellite-based instruments are commonly used to monitor various parameters of the Earth’s surface, such as temperatures in various infrared frequencies, and to take images. Global coverage is offered with frequency being as low as once per day. Stationary satellites are even capable of providing continuous monitoring of specific locations. The main problems associated with satellite data include their heterogeneity and large volumes. The data need to be analyzed on a daily basis, sometimes in real time, if timely response is needed.
In general, multisensor data are voluminous, heterogeneous, and sometimes incomplete. Prior to being analyzed, they need to be processed and represented in a specific form. They may be represented as discrete data by temporal series, i.e., by sequences of data usually taken in equal time intervals, or as continuous time-dependent functions and stochastic processes. Numerous algorithms have been developed, for example, concerning volcano monitoring (cf., e.g., [
1]).
In the literature, heterogeneous means are used, such as means diagrams, functions, tables, and textual descriptions, to specify those heterogeneous types of phenomena. Dependencies between factors of different kind, such as similarities, regularities, and periodicity, are quite often described with the use of text only, meaning that various problems stemming from such an approach, such as imprecision and the lack of a unifying framework for specification, validation, and reasoning need to be dealt with. Textual specifications are inherently imprecise, as this is the feature of natural languages. They do not rely on formal semantics in the form of comprehensive mathematical models. Consequently, they do not allow for precise evaluation or a formal reasoning, and thus the reasoning lacks due formality.
Logics provide uniform formal languages, models, and reasoning methods, and thus provide a solution enabling to address above mentioned problems. Logics are associated with well-defined classes of models constituting their semantics. They provide precise formal languages for the description of the models and capabilities for correct reasoning abut properties of the models. In general, a logic is a system consisting of a formal language for specification, a class of models corresponding to the language, and a set of sound rules for a correct reasoning about the about the models.
Temporal logics are used to express change over time, properties of behaviors, and sequences of actions. Their languages provide temporal modalities for specifying future or past events. They are used also to define and synthesize system controllers. On the other hand, they provide rules for correct reasoning about temporal properties. Various kinds of temporal logics exist (see [
2] for an overview).
Interval temporal logics (cf., e.g., [
2,
3]) are used for specifying time-dependent processes relative to time intervals. Duration calculus (DC) is an interval logic that is widely used for the specifying, modeling, and reasoning about discrete and continuous processes. It allows to specify propositional functions with Boolean values changing over time. There is an operator corresponding to the integral (cf., e.g., [
4]) that measures how long such a propositional function remains true. It may be used to study periodicity of system states.
In this paper, we present duration calculus for functions (DC4F), a natural extension of the duration calculus for dealing with general integrable functions, not only Boolean-valued ones, as is the case of DC. The idea is simple; we take the integral operator on Riemann integrable functions and use it within a frame of an interval logic such as DC. Thus, the integral operator is used within a well-suited logical framework. DC4F, in addition to the expressive capabilities of DC, allows one to characterize the behavior of functions over time intervals in terms of their integrals. Consequently, we can characterize not only the duration of a certain property, as in the case of DC, but characterize the behavior of functions over time intervals in terms of integrals. The proposed extension is conservative in the logical sense: the DC part is unchanged meaning that its valid formulas remain valid and all its invalid formulas remain invalid. Even though the extension of DC is natural, we are, to our best knowledge, the first ones who propose it.
To evaluate the proposed logic, we investigated various phenomena, multisensor data, and facts concerning volcano monitoring, as this is a popular research topic and the degree of complexity of data is significant. Periodic degassing and temperature increases are common characteristic of active volcanoes. Distinct periodicity patterns concerning measurable parameters of volcanoes’ activity have been widely identified. The timescales are ranging from seconds to weeks and months. The development of temperature and gas measurement techniques is aimed at enabling a robust quantification of high-frequency processes. Paper [
5] presents an overview of the current state of knowledge regarding periodic volcanic degassing and evaluates the methods aiming at detecting periodicity. It summarizes and statistically analyzes published studies. Periodicity analysis of volcano activity (cf., e.g., [
5]) is one of the challenges.
It turns out that such phenomena and their dependencies may be conveniently specified using DC4F, as it provides a expressive and uniform language for expressing various phenomena in a precise way, models for various data, and evaluation and reasoning capabilities. It provides a convenient specification language to express hypothesis concerning expected temporal properties. If data (in particular, multisensor data) are provided, then DC4F formulas may be validated for them and, thus, their truthfulness may be checked. On the other hand, the reasoning rules provide for convenient reasoning possibilities. Thus, DC4F may be perceived as a unifying logical framework for data integration and for formulating hypotheses, their evaluation, and reasoning.
However, high expressivity of languages always comes at the cost of complexity of reasoning rules. The higher the expressivity, the more difficult the reasoning. More precisely, the question if a formula follows from a set of other formulas has high computational complexity. DC4F, like DC, is an expressive language and is, therefore, undecidable, i.e., there is no general algorithm for deciding the question mentioned above. Nonetheless, this does not hinder its use for specification, nor does it hinder the validation of formulas for concrete data.
The paper is organized as follows. In
Section 2, we discuss related works.
Section 3 contains a brief presentation of duration calculus and the way it is used. In
Section 4, we present the main idea of DC4F and some examples of its application. In
Section 5, we define its formal syntax and outline its semantics in an informal and exemplary way; we also list some of its properties and show how it applies to the multidimensional case.
Section 6 is devoted to applications of DC4F in the area of volcano monitoring, in particular to the validation of the proposed concepts. In particular, we present an exemplary reasoning in DC4F. We conclude the paper with
Section 7. The paper also contains an Appendix in which we define the formal semantics of DC4F.
2. Related Works
Data fusion means the integration of data and knowledge from different sources. It is a widely studied topic (cf. the overview papers [
6,
7] and the references there). Data fusion approaches may be classified in various ways—for example, data association, state estimation, and decision fusion. Data fusion has been studied in the context of temporal series as well. In paper [
8], the authors study the fusion of long-term data in the form of dense time series from the Moderate Resolution Imaging Spectroradiometer (MODIS) and Landsat imagery. They investigate a spatiotemporal adaptive fusion algorithm in a regionalization study in which MODIS was used. They show the correlation of the time series achieved by different observation methods. In paper [
8], the authors propose a data fusion method for producing high spatiotemporal resolution values for the normalized difference vegetation index in the case of time series.
We approach this topic from the perspective of temporal logic, rather than from the practical angle, as in the papers mentioned above and various others. Temporal logic (TL) describes behavior over time (cf., e.g., [
2]). Their languages offer temporal modalities for specifying past or future events, as well as their temporal relations, such as: event A will happen or event A may happen in the future, property A has to be always present, property A holds until property B will be satisfied, and so on. They are also used to define and synthesize system controllers; the controllers are then guaranteed to monitor and control the underlying systems according to the specific requirements. On the other hand, they are used to ensure that conclusions drawn based on the assumptions made are correct, that the implementation satisfies the specifications, and so on.
There are different types of temporal logics (see [
2] for the overview). The linear temporal logic describes behaviors by referring to linear sequences of events. The computation tree logic (CTL) describes the possibilities and, more precisely, the branching time structures. The interval temporal logic (ITL) (cf., e.g., [
2,
3]) describes behaviors relative to time intervals, stating, for example, that a property is valid during the entire time period or that it occurs within a certain subperiod. It is also possible to express the fact that a property holds within a given time period, which is then followed by another time period over which another property holds.
Duration calculus (DC) is a popular and widely studied kind of interval logic (cf., e.g., [
4]). Duration calculus may be considered to be a form of ITL. In addition to the above-mentioned potential of ITL, DC contains an integral operator that allows one to express durations of properties in a quantitative manner.
In [
9], automata-based semantics of DC have been defined covering data, real-time, and communication-related aspects. A model-checking algorithm has been presented for a subset of DC that may be model-checked. The algorithm has been implemented and its use demonstrated.
DC and ITL are expressive but not decidable, i.e., no procedure exists for figuring out whether a given formula is always valid or not. Thus, as usual, expensivity is at the cost of complexity—in this case, decidability. A restricted form of DC, known as RDC, has decidable inference relation [
10]. In fact, formulas of this type may be reduced to the regular expressions. The problem of DC model checking is a topic of ongoing research (cf. e.g., [
11] and the references there). In [
12], the authors propose a method for solving minimal and maximal reachability problems for the multipriced timed automata. The automata are an extension of timed automata with multiple cost variables that may evolve according to specified rates.
The range of applications varies from the above-mentioned real-time systems (cf. e.g., [
13] and the references there) to air traffic control (cf. e.g., [
14]) and hybrid systems [
15].
Duration calculus is often used for the synthesis of real-time system controllers (see, for example, [
16]) and the references there). It is capable of specifying time constraints of dynamic systems. Various methods are used—for example, integer linear problem-solving and optimization problem-solving methods (cf. [
16]). DC is used also for traffic system specification (cf., e.g., [
13]). There is also a wide range of tools supporting DC (see, for example, [
10] and the references there).
In [
17], a variant of duration calculus was presented for system discounting, i.e., the idea that something happening earlier is more important than similar events happening later. The idea was introduced earlier into temporal logics such as LTL and CTL. The authors demonstrated decidability of the model-checking for timed automata and a dedicated fragment of discounted duration calculus.
We illustrate our ideas with data concerning volcano monitoring. In fact, volcano monitoring is one of the prime examples for the application of multisensor systems placed in satellites, aeroplanes, balloons, and on the ground. They provide huge amounts of data that have to be preprocessed, analyzed, evaluated, and reasoned about. Thus, it is a proper area to test ideas such as ours. Of particular interest are the Strombolian effects, i.e., periodic volcanic activity phenomena such as temperature picks, gas bursts, and lava eruptions (cf., e.g., [
18,
19,
20]).
In [
1], Koeppen et al. propose a fully automated interactive algorithm called MODVOLC for the analysis of thermal satellite time-series data. The algorithm is aimed at detecting and quantifying the excess energy radiated from the thermal anomalies such as active volcanoes. The algorithm enhances the previously developed approaches (see the references in [
1]). It is characterized by the law rate of false positives (see [
1,
21]). It flags thermal anomalies—in particular, volcanic eruptions. The algorithm was tested for different localizations such as the Anatahan volcano, the Kīlauea volcano, and the Cantarell oil field in the Gulf of Mexico.
In [
22], Laiolo et al. compared thermal satellite images of the Etna eruption that took place in 2018 with ground-based geophysical data of summit craters. The Moderate Resolution Imaging Spectroradiometer (MODIS) provided infrared images and helped to identify pixels including the possible hot spots.
Temporal series corresponding to Strombolian effects may be compared using similarity measures. In fact, various similarity measures are used to compare temporal series and functions in general (cf., e.g., [
23] and the references there). One can use the dynamic time warping algorithm (cf., e.g., [
24] and the references there). This algorithm is used in temporal series analysis for measuring similarity between temporal series that may vary in terms of speed. Algorithms of this type were applied to compare graphical data representing temporal series, but also in comparisons of audio and video materials (cf., e.g., [
25,
26]).
Dyea et al. [
19] proposed a method of detecting Strombolian eruptions based on training a convolutional neural network. The method automatically categorizes eruptions based on infrared images taken at the rim of a crater atop Mount Erebus. The authors show that machine learning may be effectively used to classify the characteristics of Strombolian eruptions, to facilitate the process of studying their origins, and to assess the hazards posed by volcanic eruptions.
3. Duration Calculus
In this section, we present the basic features of duration calculus (DC). We present the basic idea behind temporal specification relative to time intervals. We also list and explain the temporal modalities it provides and explain how they can be used.
3.1. Using Duration Calculus
In this subsection, we show the basic properties of duration calculus (DC) in its original form. DC allows to specify Boolean-valued functions, which play the role of time-dependent formulas, and to use the DC integral operator to measure the time, or equivalently the duration, when they hold. We also consider an operator for splitting intervals.
DC allows to treat durations of processes. More precisely, it expresses integrals of Boolean-valued functions with values 1 and 0 representing true and false, respectively. Such functions are called state functions since they indicate whether the system is in a given state or not. It should be noted that the properties are defined not in respect to specific points in time but in respect to intervals and in terms of durations.
We start with a simple example of system with states modeled by the
function and lasting for the time interval
(see
Figure 1). This function is first positive on
subinterval and then it is negative on subinterval
. The fact that the function is first positive may be characterized in DC using the Boolean-valued function
, which returns 1 if
and 0 otherwise. In the first case, the value is 1, and in the second case, it equals 0; we identify
with 1 and
with 0.
is called the characteristic function of the set
. To describe that the value of
is negative, we can use the characteristic function
. This function may be obtained from the first one using logical negation
, which swaps 1 and 0. It should be noted that DC does not allow direct access to
and its integral, but only via Boolean-valued functions characterizing its values.
Duration calculus enables one to express such properties as the duration of states—for example, the period of time over which remains positive, with the help of state function . DC formulas are evaluated always in respect to an underlying time interval. Length function ℓ returns the length of the underlying interval. It is defined using the integration operator ∫ of DC: . The semantics of this operator is defined relatively to an underlying interval —in this case, .
For a Boolean-valued function, the integral operator measures the time when the function is true. For intervals of length
, the length of subinterval where
is positive is equal to the length of the subinterval in which it is negative. This fact may be expressed using the length function
ℓ:
In this formula, we use the integral
, which, for a given interval
, returns the value
, i.e., the value of the integral for the interval, and similarly for
. The formula states that if the length of an interval equals to
, then the time when the
is true equals to the time, the duration, when
is true. We do not have to consider the fact that the function is equal 0 at the end of intervals, since the functions are characterized in terms of integrals and for subintervals of length 0, the integral value has no influence on the entire integral.
For interval , it holds that , since the duration of being true is . In fact, this expression is true for every interval of the form , where , but it is false for intervals of the form . Similarly, we can express the fact that the duration of being negative is : . This expression is true on every interval of the form where . In both cases, we have to use the auxiliary functions to characterize in terms of being positive or negative, respectively.
3.2. Temporal Modalities
In this subsection, we present three temporal modalities definable in DC: the chop, the after-modality, the sometime-modality, and the always-modality. The chop is denoted by
(cf., e.g., [
10]); it applies to two neighboring intervals. It allows one to express the fact that a property holds in the first interval and another property holds in the second interval. The last two modalities occur in all kinds of modal and temporal logics (cf., e.g., [
2]), but they have a specific meaning in DC. The sometime-modality is denoted by
◊ and requires a property to hold for some time interval. The always-modality is denoted by □ and means that a property holds for all time periods.
Formally, for two formulas , the formula means that for an interval in question, there exists a number x such that , F holds for interval and G holds for interval . For example, the fact that is initially positive and then negative (see the previous subsection) can be described in respect to two subintervals dividing the interval in question. We may express it by formula . It is true on the interval as the interval can be split into two subintervals and ; thus, we chose .
Note that the operator is associative; thus, the formula of the form is equivalent to formula , since the formulas depend on splitting an underlying interval into three parts where the subformulas have to hold, respectively. Consequently, we will drop the brackets when using chop.
Given a formula
G and an interval
, the fact that
holds for interval
means that it has a subinterval
, i.e.,
, such that
G holds in
. The may-modality may be expressed using the chop operator
, which means that we can divide the underlying interval into three subintervals and that
G holds for the middle subinterval, whereas we do not require anything from its adjacent subintervals on the left- and right-hand sides. For example, we have the following property:
This formula says that if the length of the underlying interval is larger than , then there is a subinterval of length larger than 0 where the property holds all the time, i.e., is positive.
The modal formula means that for all subintervals of an underlying interval, the formula G holds. For example, for all subintervals of interval , the function is true all the time: . Modality □ can be expressed using the sometime-modality ◊: ; the formula means that no subinterval exists where G does not hold. Vice versa, can be expressed as . Thus, both modalities are dual. Both are definable in terms of the chop operator, but we will use them as syntactic sugar to facilitate readability of the formulas.
4. The Idea of Duration Calculus for Functions
In this section, we present the idea of duration calculus for functions (DC4F). The objective is to treat Riemann integrable functions with values of type real. Such functions can represent continuous as well as discrete time series. We informally present features of the proposed calculus and illustrate them with simple examples demonstrating its basic characteristics, the way it can be used, and the capabilities of the logic.
4.1. DC Versus DC4F
DC allows one to treat Boolean-valued functions only and to reason about their durations. It does not allow to integrate general functions. In this subsection, we propose an extension of DC supporting all Riemann integrable functions. We present the basic features of the proposed logic. We also present examples that are specific to DC4F and, thus, cannot be expressed in DC.
In DC, a state function f, we can consider the time intervals, or periods, when this states holds. Such a case corresponds to considering the corresponding characteristic functions: . The integrals of the characteristic functions correspond to the duration of the characterized state or property. For example, as explained in the previous section, the fact that is first negative and then positive can be expressed in DC only via the auxiliary characteristic functions and , but not directly. In DC, we can integrate function and obtain the duration over which it remains true, but we cannot integrate or any other function that is not Boolean. Thus, for example, we can express the property that for intervals of length , is positive as equally long as it is negative .
However, not all properties of integrals can be expressed in this way. For example, we cannot express properties such as , i.e., that if the length of the underlying interval is , then the integral is equal to 0. Thus, DC is not expressive enough to treat time series in general.
DC4F is a natural extension of DC as allows all integrable functions. For example, we can consider . In logical terms, it is a conservative extension of DC. Conservativity means that a formula of DC is a tautology in the sense of DC if, and only if, it is a tautology of DC4F; equivalently, it is satisfied in a DC model if, and only if, there is a DC4F extension of the model in which the formula is satisfied. In general, DC can be expressed in DC4F by restricting the set of integrable functions so that it contains Boolean-valued functions only.
4.2. Monotonicity
We formulate the definition of monotonicity in terms of intervals and integrals. The property that a function is monotone over an interval can be expressed in DC4F by the requirement that for two neighboring subintervals of the same length, the integral over the left one has smaller or equal value to the integral over the right one. It can be expressed by formula
M of the form
. This property must hold for all subintervals of this kind; thus, we have the formula:
.
Figure 2 shows a monotonically increasing function
f.
Alternatively, we can define the monotonicity property in the following way:
This formula, though it may seem complicated, is rather simple. It states that for a given interval and for all its adjacent subintervals, the normalized value of the integral for the left subinterval is lower than or equal to the normalized value of the integral for the right subinterval. It is illustrated in
Figure 2. Intervals
and
are adjacent. The values of the integrals for these intervals have to be normalized by their length, as their lengths are different. We will abbreviate this property of function
f as
. Analogously, we can define the property of monotonic decrease,
. We use a capital letter to indicate that both functions have a functional argument instead of a time value.
Although length operator
ℓ occurs in the characteristics of the first and of the second interval, it returns independent values. Due to transitivity of smaller or equal relation ⩽, the monotonicity property holds for all subintervals, not only for the adjacent ones. Thus,
(see
Figure 2).
4.3. Limits and Amplitude
In this subsection, we show how to define lower limits of functions and their minimal amplitudes. The definitions are based on temporal modalities. Upper limits and maximal amplitudes can be defined analogously. We use those definitions below when dealing with volcanic Strombolian effects.
We start with the case when a function
is an upper approximation of a function
or, equivalently, function
is a lower approximation of function
(see
Figure 3). This property can be expressed by demanding that for each subinterval of a given interval, the integral of
f is smaller than or equal to the integral of
g:
The property that g increases c times faster than c can be expressed in DC4F in a simple way: . It should be noted that we cannot directly express it in DC.
The diagram presented in
Figure 4 shows function
h, which is above value
a. The property that, for a given interval, values of function
h are above a certain limit
a all the time can be expressed as follows:
The integral of
a is equal to the value
. Consequently, we demand that it remains smaller than the integral of
h for all subintervals of the underlying interval. The fact that function
h exceeds threshold
y for some time can be expressed as follows:
This formula states that for some subinterval of a nonzero length, the value of the integral is not lower than the value of a multiplied by the length of the subinterval. The function shown in
Figure 4 extends beyond line
y for a time period. Equivalently, the property can be expressed as follows:
The fact that an amplitude of a function is at least
d can be expressed as follows:
Formula (
8) says that there exist two subintervals of nonzero length. The integral of
h for the first one is smaller than or equal to
, as it was expressed by formula (
7). Analogously, the integral of
h for the second one is larger than or equal to
. Formula (
8) requires also that the difference
is equal to at least
d.
In
Figure 4, the intervals correspond to the parts of diagrams of
h located below the
x and above the
y line, respectively, are indicated by bold lines. Below, we abbreviate formula (
8) by
. Formula
is a shorthand of Formula (
6) and expresses the fact that function
h is above threshold
y for some time period.
5. Syntax and Semantics of DC4F
In this section, we define the formal syntax of DC4F and describe its informal semantics. We list some basic properties of the proposed logic and show how to apply it in the multidimensional case, such as those involving two-dimensional images.
5.1. Syntax of DC4F
Every logic has its proper language, which has to be formally defined. In this subsection, we define the formal syntax of DC4F. This syntax is close to that of DC. The crucial difference consists in the fact that we allow all unary integrable functions, not only the Boolean-valued ones, and their integrals. Basically, there are constants, variables, and function symbols corresponding to the integrable functions, which can be added and multiplied by constants. There is also an integral operator. Formulas are either of the atomic form, when real numbers are compared, or are composed from other formulas.
To define the syntax formally, designated sets of variables, functions, and relation symbols are required. There are global variables, which remain unchanged. We assume that
is an unbound set of real-valued variables denoted by letters
,
,
, … We also assume that there is a set
of constant symbols. The set
contains unary functions symbols
corresponding to integrable functions. The set
contains functional symbols
corresponding to functionals with arguments in the form of time intervals and values of type real
(cf.
Appendix A).
The syntax of the logic comprises three categories of expressions: functions containing integrable functions, terms containing expressions of type , and formulas containing logical formulas in the proper sense of the word. We assume in the definition below that .
, for
The first category includes constants and functions of type real; the constants are treated here as functions with arity equal to 0. The functions can be multiplied by constants and added. There is also the maximum operator ∨; it denotes their maximum and is a reminiscent of DC (cf., e.g., [
4]). In DC, there is an indefinable negation operator ¬ that applies to state functions (cf., e.g., [
4]). It is characterized by the following property:
if, and only if,
. We can define the negation ¬ on integrable functions such that it coincides with negation on state functions:
. Consequently, if
, then
. If
, then
, since ∨ is the maximum.
The second category includes constants, variables, and applications of the integral operator. There are also auxiliary operators X on intervals with values of type real. Complex terms can be constructed from simpler ones using arithmetic operations.
The third category comprises formulas as such. The atomic formulas include constants and ; the atomic formulas are also formed from relational symbol applied to terms, such as . Complex formulas are composed from other formulas by the application of logical operators: logical negation, alternative, existential quantification, concatenation, the chop operator, and iteration. Thus, if F and G are formulas, then the alternative , the chop , the iteration , and the quantification are formulas as well.
As an example of the above syntax, we can present formula , provided that belongs to . The formula is obtained by the application of chop to two atomic subformulas. The first subformula is obtained by the application of equation = to terms and 2; the first term is obtained from function by application of the integral operator. The other subformula is obtained in a similar way.
It should be noted that relations
and
can be defined in terms of < and = using logical operators ¬ and ∨, e.g., in the first case, we have
. Similarly, the conjunction
is defined as
. The implication
is defined as
. The general quantifier
is defined as
. As mentioned in
Section 3.2, modalities □,
◊ are defined using chop and negation.
is defined as
and
as
. The length operator
ℓ is defined as
(see
Section 3.1). We assume that
binds stronger than ∨ and ∧. ∨ and ∧ bind stronger than quantifiers ∃ and ∀. Quantifiers bind stronger than implication ⇒. Implication binds stronger than modalities
◊ and □.
5.2. Informal Semantics of DC4F
In this subsection, we present the intuitive semantics of the previously defined syntax. The presentation is informal, but it can be easily formalized, as shown in the
Appendix A. We discussed it for the three syntactic categories defined above:
,
and
.
We assume here that the time domain is the set of non-negative real numbers . In the case of discrete time modeled by natural numbers , for every discrete function f, we define the corresponding step function g on by extending the values of f to the corresponding unit interval: , where is the largest integer such that . Thus, we can treat discrete functions, or discrete time series, as if they were defined for continuous time.
Elements belonging to category are interpreted as time series, and more precisely as Riemann integrable functions with the domain . The functions may be multiplied by constants, added, and subtracted; thus, the set corresponds to a linear space. The function returns the maximum of and .
The category contains terms of type . The constants, e.g., 0 or 1, are interpreted as the corresponding real numbers. Variable symbols are evaluated by functions mapping them into the set . For and an interval , where and , term of the form is interpreted as the integral . This operator is linear in respect to functions: and .
A term t depends on variable if t contains it. In general, the value of terms containing variables depend on the values of its variables. Valuations are mappings of variables into . We can, for example, define valuation as mapping of the form . Let term p be of the form . The value of p for is 10; it does not depend on an underlying interval. The value of a term depends on an underling interval if contains operator ∫. In the case of term q of the form , the value depends on the underlying interval and variable z; for interval and , the value is 7.
The category of formulas plays a crucial role, as formulas are the proper objects of logical reasoning. In fact, formulas can be understood as Boolean-valued terms. The satisfaction of formulas, i.e., the fact whether formulas are true or false, is defined in respect to an underlying interval and a valuation of the variables they include. The values of these terms may be compared using <, =, given the underlying interval and the valuation. The result of this comparison is either true or false, depending on the value of the corresponding terms. For example, for interval and valuation , formula is true but formula is false.
Let be an interval and be a valuation. Formula is satisfied in interval I for valuation , if formula F is false in interval I for . Similarly, the alternative is satisfied if F or G is satisfied in I for .
In case of chop,
is satisfied in interval
I for valuation
if there exists
such that
F is satisfied in
for
and
G is satisfied in
for
. Iteration
of formula
F is satisfied in
I for
if there is an
n such that we can split the interval into subintervals
,
, …,
such that
F is satisfied in each subinterval for
(cf. [
4] Section 4). An example of the iteration formula is presented below and its formal semantics are presented in the
Appendix A.
Quantified formula is satisfied in interval I for valuation if there exists a valuation which differs from only for variable x such that F is satisfied in I for . The fact that and differ only at x means that for every variable y different from variable x, it holds that . For example, is satisfied for valuation defined above; in fact, we can define as but .
5.3. Basic Properties
In this subsection, we list some basic properties of the logical operators, such as the linearity and monotonicity of the integral. We use them in an exemplary reasoning.
The DC4F calculus has the following logical properties:
,
,
If , for every , then
,
, ,
,
If the formulas hold in an interval I for a valuation , then G holds in an interval I for a valuation .
Point (1) concerns integration of constants. Point (2) expresses the fact that integrals are linear operators, i.e., they commute with multiplication by constants and are distributive in respect to addition. Point (3) expresses the monotonicity of integral operators. If the antecedent inequality holds for all non-negative reals, then the consequent inequality holds for all time intervals. Property (4) can be expressed as
. The above-mentioned points are specific to DC4F because it allows integrable functions, not only propositional ones as in the case of DC. The following points are common with DC (cf. [
4], Section 2). Property (5) states that every interval can be split into an interval of length 0 and the rest. Point (6) specifies three exemplary propositional tautologies (cf., e.g., [
27]) that we use later in an exemplary derivation. The first tautology says that truth is implied by any formula. The second tautology expresses the monotonicity of the conjunction operator. The third one expresses the commutativity of conjunction. It should be noted that all propositional tautologies hold in case of DC as well as DC4F. Point (7) specifies the monotonicity of the chop operator. Property (8) is the modus ponens reasoning rule. It says that if the antecedent of an implication is true and the implication is true, then the consequent of the implication is true as well. The rule is used in many kinds of logic (cf., e.g., [
2,
27]).
5.4. Multidimensional Case
The functions considered in DC and DC4F are unary, i.e., they are of the form , where t is the time parameter. In this subsection, we consider integrals over multidimensional sets and spaces. Such cases may occur when, for instance, images of an area are considered and measurements are performed. The development of values over time may concern specific areas or multidimensional spaces. In this case, the integrated function has a number of variables, apart from the time variable. We show that in fact such a multidimensional case can be reduced to the case of unary functions. In fact, we utilize such integrals in the examples presented in the following section.
An image may be represented by a number of pixels with two coordinates x, y. Thus, the area of an image may be considered as a set . In the multidimensional case, it has the form . If the images are used to measure certain values, then the measurement can be modeled by a function. The measurements may be time-dependent. Thus, we consider time-dependent integrable functions of the form and multidimensional sets of the form . For an interval I, the corresponding integral has the form: , where is a product measure on the space . Equivalently, we can present the integral using the characteristic function corresponding to set A, the function has value 1 for the elements of A and 0 for all other arguments: .
This integral can be presented also in the form . Thus, the integral has the form , where . Consequently, it can be presented as an integral of unary function depending on the time parameter t only.
6. Applications and Validation
In the case of multisensor systems, the data generated may be of various types, such as temperature, pressure, or density measurements, videos, or sound, among others. It is hard to bring them into a uniform and consistent model. Thus, quite often, textual specifications and informal validations are used, which is inherently imprecise. Specification, interpretation, comparison, and reasoning about such heterogeneous data pose a nontrivial problem. In this section, we apply the DC4F calculus to express measurements and complex behaviors. The goal is to illustrate the way DC4F can be applied and to demonstrate its capabilities for specifying temporal series. We use examples from the area of volcanic activity monitoring by multisensor systems because of their complexity and, thus, the challenge that they pose. We aim to show that the extension of DC that we propose in this paper provides a uniform language, models, and a reasoning system to integrate, describe, and reason about such data. The examples are used to show the practical applicability of DC4F and to demonstrate how it can be used. However, it is not our goal to adequately describe those phenomena per se. It should be also pointed out that the units of measurement do not play an essential role for the illustration. Our concepts concern mathematical properties and, thus, are independent of the units used.
6.1. Volcano Monitoring
To demonstrate the applicability of DC4F, we looked at several examples of signal processing and the corresponding temporal series. It turned out that volcano monitoring is an interesting and challenging task due to the complexity of the process, various monitoring methods used, and the heterogeneity of the acquired data. In this subsection, we present a brief presentation of volcano monitoring processes, as they are instrumental in illustrating our ideas.
As pointed out by Corradini et al. in [
18,
28], volcano monitoring processes require the correlation of satellite data, provided that satellites are present over the specific target. Satellite data are heterogeneous in nature and relate to temperatures measured in different wavelengths, among other measurements. Several phenomena, such as temperature picks, gas explosions, and magma eruptions, need to be considered (cf., e.g., [
19,
20]).
Strombolian volcano eruptions are a widely studied topic. They are characterized by specific cyclical patterns. Therefore, their specification is a challenging task. A Strombolian volcano eruption is characterized by regular, relatively mild blasts. It consists of the ejection of fragments of solidified lava, material deposited by previous volcanic eruptions, and masses of molten rock to altitudes ranging from tens to hundreds of meters (cf., e.g., [
18,
19,
20]). Eruptions of this type are named after the Stromboli volcano. They are commonly observed in volcanoes fed by low to moderate viscosity magma. The explosions are caused by the bursting of gas slugs, which rise to the surface faster than the surrounding magma. Quakes associated with such explosions occur at low depths (cf. e.g., [
20]). Spectroscopic measurements performed on the Stromboli volcano were used to demonstrate that gas slugs originate from the volcano–crust interface at the depth of approximately 3 kilometers, which may promote slug coalescence. In the case of the Stromboli volcano, quantitative constraints for the depth of conglomerates of high-pressure gas bubbles inducing Strombolian activity can be identified on the basis of spectroscopic measurements of the magmatic gas phase driving the explosions.
6.2. Dealing with Estimates
Various parameters are considered when a volcano eruption is monitored. These parameters need to be measured and compared. In this subsection, we show that functional limits may be dealt with in CD4F.
Thermal satellite images of the Etna eruption that took place in 2018 were compared with ground-based geophysical data of summit craters using the Moderate Resolution Imaging Spectroradiometer (MODIS) (see [
22]). The technology provides infrared images and makes the identification of hot spots including pixels possible. The thermal anomalies may be quantified in terms of volcanic radiative power (VRP), expressed in watts (see [
22] and the references there). The VRP value is calculated as follows:
where
is the pixel area of approximately
m
,
is the recorded the middle infrared radiance (W m
sr
m
), and
are the background pixels. These values allow to estimate the average lava erupted volume, i.e., the time average discharge rate (TADR), through a unique parameter called radiant density
(J m
):
This parameter represents capacity of the lava body to radiate heat (see [
22] and the references there). It allows to estimate the erupted lava volume by assuming the appropriate parameter
for the lava flow in question.
The fact that the actual lava flow volume
is between limits
and
(see
Figure 5) can be specified by the following formula, which is analogous to Formula (
5) from
Section 4.3. This fact cannot be expressed in DC, since DC only allows to consider the duration properties on intervals but not the values of integrals:
The inequalities express the desired property in terms of intervals. The formula means that the inequalities hold for every subinterval of the given interval (see
Section 3.2).
6.3. Dealing with Two-Dimensional Spaces
There exist algorithms for detecting thermal anomalies in two-dimensional pictures related to volcano eruptions. In this subsection, we discuss the application of DC4F to such cases. We specify the units of measurement, but it should be noted that for the presentation, the units of measurement and exact numeric values obtained do not play a significant role in the presentation data, since in our framework we can deal with values of type independently of their interpretation.
MODVOLC is a fully automated algorithm for the analysis of thermal satellite time-series data [
1]. The algorithm is based on spectral wavelengths centred at 4 and 11–12 µm emitted by high-temperature volcanic sources and the surrounding Earth surface. In the case of a volcanic eruption, the radiance is measured from the corresponding pixel. One pixel corresponds to approximately 4 km. Images are collected every 15 min. In pixels that show a thermal anomaly, the radiance measured in the mid-infrared (4 µm) is significantly higher than in surrounding pixels where no thermal anomalies are depicted [
1,
21].
Figure 6 shows an excerpt of Figure 6 from [
1]. It contains results obtained by the MODVOLC algorithm detecting thermal anomalies for different thresholds 2.0, 2.6, and 3.0, respectively, in case of the Anatahan volcano monitoring data. The following formula presents the Local Index of Change of the Environment
representing the degree to which each pixel deviates from its normal behavior normalized by its variability (cf. [
1,
29]).
An individual observation is modeled by the function
specifying the state of a pixel
at time
t. Its normal behavior is expressed by function
and its variability by
. One can then identify pixels
for which the function exceeds a given threshold
. The number of such pixels can be then computed for the entire area of interest
A using its characteristic function
, i.e.,
in the discrete case when single pixels are considered and each pixel has measure 1. A thermal anomaly is a pixel with a temperature measurement value above a defined threshold. The total number of thermal anomalies occurring at time
t with values above threshold
is modeled by function
defined by the following integral:
Thus, the number of anomalies is the integral of the product of the characteristic function of set
A, corresponding to the area of interest, and of the characteristic function identifying points with temperature above threshold
. In the continuous case, the value of the integral is equal to the measure of the set of thermal anomalies in the area of interest.
6.4. Specification of Strombolian Effects
Strombolian effects are characterized by cyclic periods of repetitive behaviors. Repetitive gas explosions and temperature peaks occurring in a periodic manner are one of such aspects. These explosions are caused by deep slag-driven explosive activity (cf., e.g., [
20]). Those effects include also consecutive magma eruptions. In this subsection, we demonstrate that these kinds of effects can be specified in DC4F. We utilize the temporal properties formulated in the preceding section. The data that we use were obtained on 9 April 2002 during hours of passive and explosive degassing on the Stromboli volcano that was most active at the time [
20]. The spectrometer was placed at a distance of 240 m.
Repetitive temperature increases are shown in
Figure 7. It shows the diagram presented in [
20] (see Figure 1 and Figure S1 there) representing peaks in radiation corresponding to source temperature in Celsius.
Figure 7 shows a function
with cyclic periods of consecutive value peaks and drops representing the level of radiation. The peaks can reach different maximal values, the drops can reach different minimal values, and their numbers are not fixed in a cycle. The time scale of Strombolian effects is usually hours and days (cf., e.g., [
18,
20,
28]). In
Figure 7 and below, we use only numeric time values, as the exact timescale does not play any role in the demonstration and may be changed in an arbitrary manner.
Each peak formation is characterized by a monotone increase of
g followed by its monotone decrease. In consequence, a peak can be specified in DC4F by formula
(see
Section 4.2). For example,
Figure 7 shows a peak corresponding to interval
.
We can also require that the peak values be above a certain threshold
and the amplitude be above the value of
d. These requirements can be formulated as
and
, respectively. The first formula states that function
g reaches a value above threshold
, and the second one states that its amplitude is at least equal to
d (see
Section 4.3). In
Figure 7, the first peak is above 400 and the amplitude is above 100; of course, these parameters can be adjusted arbitrarily. The fact of reaching such a peak with the constraints concerning values of threshold and amplitude can be expressed by the formula
. This formula is the logical conjunction of the previous formulas; we abbreviate it as
. We use capital letters to indicate that functions have functional arguments, not only arguments of type
.
Now, the peaks and the corresponding drops form a repetitive behavior. The left-hand side of
Figure 7 shows three peaks, modeling COS bursts, which correspond to the intervals
, for
. We can write the corresponding formula as
. It means that the interval may be split into three subintervals such that each satisfies the formula
. We abbreviate the formula as
.
In general, for a formula F, n repetitions of the form are abbreviated as . This formula means that the interval may be split into n subintervals in which F is satisfied. If the number of repetitions is arbitrary, then we write . If it equals at least n, then we write , which is equivalent to .
The peaks occur in cyclic series separated by periods of low volcanic activity. Let formula
be the abbreviation of
. This means that
g is always below threshold
in a given time interval and that the length of the interval is between time bounds
and
(cf.
Section 4.3). We can compose the formulas again to specify the behavior shown in
Figure 7; for
and
, the formula has the form:
This Strombolian activity consists of three cycles; each cycle is composed of at least three bursts followed by a period of a low volcanic activity. The three cycles of repetitive bursts are indicated in
Figure 7 by brackets below the time axis. The periods of limited volcanic activity are indicated in a similar way.
Strombolian activities may be also detected using satellites equipped with thermal imaging cameras, which sometimes are the source of more reliable indicators than sensors on the ground (cf., e.g., [
18,
28]). However, if the satellites are not stationary or if visibility is reduced by clouds, then data from thermal imaging cameras cannot be used. We can specify the conditional visibility of thermal peaks by formula
. In this formula,
is a Boolean-valued time-dependent function modeling visibility; it is true if there is visibility and false otherwise. Function
f models temperature measurements at a certain frequency. Parameter
is the corresponding temperature threshold and parameter
is the corresponding amplitude. The formula is conditional and reads as follows: if there is visibility, then a thermal peak is observed. We abbreviate this formula by
. Analogously, we can define
for the case in which the thermal radiance measurement is below a certain threshold. If the peaks in thermal anomalies coincide with the peaks in COS bursts, then we can transform Formula (
14), taking the thermal anomalies into account as well:
The sometime-modality is used here to specify that we do not require anything for the time periods before and after the characteristic behavior. We may also specify here the minimum and maximum lengths of the periods of low activity, if needed.
Of course, the formulas become more and more complicated as we specify more kinds of behaviors and data, e.g., multisensor data, and constraints. However, the point is that the formulas may be specified in DC4F. Thus, we may precisely formulate fine hypotheses concerning complex behaviors. Furthermore, if implemented, the hypotheses may be verified in respect to existing data. They may also be used to automatically mine the available data, historic or incoming.
6.5. Reasoning in DC4F
In the previous subsections, we presented the capabilities of DC4F for the specification of periodic behaviors and the hypothesis framing. In this subsection, we present a relatively simple example of reasoning in DC4F utilizing the properties of DC4F defined in
Section 5.3. The example is somewhat artificial, as we did not find proper examples in the literature, and we do not claim that the formulas truly express actual volcanic behavior. We simplify the proof, as its full version would be rather long. As in the previous section, we skip units of measurement as they do not play any role for the illustration.
Suppose that we know that a Strombolian effect is characterized by two properties occurring within 24 h: a series of COS bursts and an observation of thermal anomalies. The data is on COS burst is collected by sensors on the ground and the data on thermal anomalies are collected by a sensor placed on a satellite. The behavior is specified by functions
and
representing the emission level of COS and the number of observed anomalies, respectively. We use abbreviation
for the formula obtained from formula (
14) by substituting
for
g, 1 for
n, and by setting other parameters somehow. We assume that
for a certain threshold
(cf.
Section 6.3).
Let the Strombolian effect be characterized by at least 10 COS bursts and thermal anomalies of value 5 occurring within 24 h. Moreover, let it be an established fact that 12 COS bursts are always accompanied by thermal anomalies of value 7. The following formulas specify these two properties:
Now, let the observation made by sensor on the ground be that within 24 h, there were 14 COS bursts, as specified by the following formula:
Suppose that the data on thermal anomalies are not available due to bad weather and, thus, the integral
cannot be computed. We will show that, despite that obstacle, the Strombolian behavior can be proved.
The formula
can be presented in the form
(we apply here the definition of
◊ and unfold the exponent). The formulas
and
are tautologies of the predicate calculus (see point (6) in
Section 5.3). Due to these two tautologies and the monotonicity of the chop operator (see point (7) in
Section 5.3), we deduce the implication
Further, applying the monotonicity property of conjunction (see point (6) two times in
Section 5.3) to the above implications, we derive the following implications:
Moreover,
and
. From the above implications, we derived the following implication:
Antecedent of this implication is the observation (
18). The consequent of this implication is the formula to be proved. The consequent follows from the antecedent and the implication by the application of modus ponens rule (see point (8) in
Section 5.3). Thus, observation (
18) and fact (
17) imply that the Strombolian property holds.
6.6. Detecting Similar Behaviors
In this subsection, we consider the methods used in different areas, volcano monitoring included, for detecting similarities between functions and, in particular, between temporal series. Various similarity measures can be used in this case (cf., e.g., [
23,
24,
25,
26]). Different types of volcanic activity may be characterized not only in absolute terms, i.e., by general characteristics satisfied by all behaviors, but also relative to one another, or in relation to known examples of behaviors. We consider neither the application of specific similarity measures to specific cases, nor their adequacy, as we are not aiming to identify similarities in volcanic behavior. We are not in the position to judge which similarity measures would be most suitable for the Strombolian eruption patterns. We merely intend to stress that such measures and the corresponding algorithms can be used within the framework of DC4F.
Similarity measures may be formalized by functionals
, which are interpreted as real-valued functions defined on intervals (see
Section 5.1 and also the
Appendix A). Similarity measures
m can be seen as functionals that, for a given interval
I and two functions
f and
g to be compared, return a real value being the measure of similarity. Thus, we can consider the application of a measure functional
m to the two measured functions
f and
g and an interval of interest
I:
. Consequently, the result can be presented as a real-valued function on intervals
, which returns a real value
r for the interval
I (see
Section 5.1 and also
Appendix A). The measured level of similarity can be then used in framing descriptions and hypotheses concerning the time series—in particular, concerning volcanic eruptions.
Similarly, the standard Pearson correlation coefficient used in statistics may be integrated into CD4F as well. The correlation coefficient of two functions f, g is specified by integral . The integral returns for an interval I a value of type real, provided that the denominator is different from zero. Thus, coefficient depends on the interval in question, and as was the case in the example presented above, it may be represented as a real-valued function on intervals.
In general, some of the similarity functions can be defined directly in DC4F. Others, such as algorithms computing the similarity measures, can be defined simply by the interval mapping operators corresponding to functionals
. Thus, if there is an algorithm computing a similarity measure for intervals, then we can associate interval operator
with the algorithm:
if
x is the value computed by the algorithm for interval
. The algorithm proposed in [
19] can be treated along these lines.
7. Conclusions
In this paper, we proposed a duration calculus for integrable functions (DC4F). It is a natural, conservative extension of the well-known and widely used duration calculus. DC was used in numerous applications to specify various kinds of real-time and hybrid systems, to synthesize controllers, and so on. However, it allows to deal only with state functions, i.e., functions with Boolean values only. It did not aim to specify signals and measurements. The theoretical value of the proposed extension seems to be rather modest; however, its applications appear to be interesting. We are wondering why DC was not developed in this way from the beginning, as this would naturally broaden its application range and would be a consequent thing to do.
DC4F, like DC, is a type of interval logic providing a formal language, uniform mathematical models, and a reasoning system to deal with time series. Both calculi provide axioms and inference rules allowing to reason about properties of systems and to draw conclusions. However, they differ in expressivity, the scope of their applications, and, of course, maturity. Expressivity is achieved, in most cases, at the cost of complexity of the proof system and methods. This is the case of DC and DC4F as well. As the general form of DC is not decidable, i.e., there is no algorithm that decides whether a given formula is provable or not; the extended expressivity of DC4F relative to DC does not change much in that respect. DC, in contrary to DC4F, is aimed exclusively at the specification of durations of properties. The properties are specified by Boolean-valued functions. A great amount of research has been conducted about DC. Model-checking algorithms were created for its restricted versions and various tools were developed. It has been used to synthesize controllers. DC4F allows to integrate arbitrary Riemann integrable functions. Consequently, it provides a consistent general model for general form time series. The models can be handled using integral calculus, computer algebra, and so on. It allows one to specify several factors, parameters, and signals in one coherent language, to model them in a uniform manner, and to reason about them. Consequently, DC4F extends the ideas of DC to a qualitatively new area of applications.
We demonstrated that DC4F can be used to specify various temporal properties of time series, such as monotonicity, boundedness, and periodicity. We used it to specify different aspects of volcano monitoring activities, with a particular emphasis placed on Strombolian effects. DC4F proved to be a framework that is useful for integrating various types of data expressed in terms of time series and diagrams. Usually, in other papers, such types of data are informally related and reasoned about using informal textual descriptions in natural languages, such as English, and some mathematical formulas. This way of handling data is, of course, imprecise and error-prone. Thus, DC4f provides a remedy for this problem in the form of a unifying logic.
DC4F can be used to formulate specifications and hypotheses, reason about them, and validate them. Informally speaking, the difference between validation and deduction is that validation is performed for a concrete data set, on a specific model, semantic deduction concerns all possible data sets and all concrete models, and syntactic deduction relies solely on the application of sound deduction rules. The proposed logic can be combined with other methods, such as algorithms, and in particular with trained neural networks and other methods used in artificial intelligence (cf., e.g., [
19]).