1. Introduction
Natural phenomena are intrinsically spatiotemporal and often highly dynamic [
1]. The increasing availability of simulation and observation datasets has enabled scientists and researchers to better capture and understand the complexity and dynamics of natural phenomena. With higher spatiotemporal resolution and dimensionality, a more complete view of natural phenomena can be obtained, such as detailed information of the vertical and temporal dimensions (4D: latitude, longitude, altitude, and time). GIScience methodologies and techniques have been developed to represent and examine the dynamics in geographic changes through quantitative and qualitative ways, especially assisting us in understanding “where and when natural phenomena happen”, “how long a natural phenomenon lasts”, or “what the common transport pathway is for a natural phenomenon.”
Existing spatiotemporal data models [
2,
3,
4] have the capability of representing various changes in 3D (latitude, longitude, and time), such as creation, alteration, destruction, reincarnation, split, merge, and reallocation. However, unlike those phenomena occurring on the Earth’s surface, such as flooding [
3], land use and land cover change [
5], or deforestation [
6], most geographic phenomena exist in a 4D context (latitude, longitude, altitude, and time). Geographic objects in 3D space interact more frequently than the ones in 2D space. Therefore, it is important to represent, analyze, and model the dynamics of natural phenomena in a 4D environment. In this research, volumetric components, as well as the handling of complex object interactions, are proposed in the spatiotemporal representations.
One of the challenges is to incorporate the fuzzy or non-rigid boundary of natural phenomena into the spatiotemporal framework. The intrinsic properties of the non-rigid natural phenomena include dynamics and deformation and can be constantly changing their internal structure while transport in space and time [
7]. Therefore, representing these internal structure change needs to be included in the spatiotemporal framework as well. In the object-based image analysis field, in order to describe natural variability, models need to be capable of expressing the fuzzy boundaries and being adaptable to unforeseeable changes [
8]. In addition, hierarchical relationships at different levels of abstraction in space and time need to be further explored [
9]. Inspired by the above researches, we have integrated the hierarchy theory in our non-rigid object representation to enhance the event representation.
In addition, existing spatiotemporal data frameworks have been largely focused on spatial reasoning or temporal reasoning separately, whilst spatiotemporal quantitative analysis can assist the better understanding and future predictions of natural phenomena events. Through an object-oriented method, McIntosh and Yuan [
10] quantified the characterization and similarity of geographic events regarding the static and dynamic characteristics and relationships of objects. Spatiotemporal statistical methods that quantify the changes or dynamics have also been developed, including spatiotemporal scan statistics [
11], spatial time series [
12], and SpatioTemporal Autoregressive Regression [
13]. In addition, Bothwell and Yuan [
14] quantified the movements and directions in continuous space and time based on the principles of fluid kinematics. However, a full exploration of quantifying the event evolution in a holistic data framework is still missing. Although quantifying the attributes of an event can assist with better understanding the events as a whole, it is more critical to examine the intermediate and internal dynamics of event evolution over space and time. Therefore, in this research, we propose an event-based measurement to quantify the dynamics, not just the attributes, of natural phenomena evolutions.
In this paper, our objectives are to (1) conceptually represent a natural phenomenon as an event using a graph-based approach, and (2) quantify the evolutions and dynamics of the events. Works relating to spatiotemporal representation and quantification are reviewed in
Section 2. The proposed data framework is described in
Section 3 and applied to represent and quantify the constant movements of dust events as a case study in
Section 4.
Section 5 provides a discussion on the potential applications and limitations of the proposed framework. And finally,
Section 6 concludes this research and proposes potential future work.
3. A spatiotemporal Data Framework for 4D Natural Phenomena Representation and Quantification
This research proposes a spatiotemporal data framework to represent and quantify the dynamics of natural phenomena as events. An event is defined as “something that happens at a particular time and place” by Allan et al. [
39], which is broadly accepted in the literature. There are three major entities inside the data framework for events (
Figure 1): (1)
ST-object that represents an object that evolves in space and time; (2)
ST-relation that represents seven types of relationships between
ST-objects between consecutive timestamps; (3)
ST-event that represents the lifecycle of an occurrence of natural phenomena from the start till the end, possibly involving multiple
ST-objects and the
ST-relations associated with these
ST-objects. In addition, to integrate the hierarchical theory into the object representation,
ST-objects are represented in different levels of intensity (e.g., temperature for an urban heat island, dust concentration for a dust storm, and base reflection for a thunderstorm), where a higher level of intensity occupies a smaller spatial coverage, i.e., one or multiple
ST-objects with a higher level of intensity aggregate into an ST
-object with a lower level of intensity. These intensity values are determined based on the understanding of a particular type of natural phenomena, and the number of different levels can be flexible. A particular
ST-event is then represented as a graph, consisting of
ST-objects as nodes and
ST-relations as edges. A graph-based quantification method is further proposed to numerically understand the dynamics of changes between timestamps and the whole event. Details are introduced in the following subsections.
3.1. ST-Object
A spatiotemporal object (ST-object) represents a moving object with possibly changing shape and evolving internal structure, i.e., its geometry with spatial attributes (shape and location) can change in different timestamps. Each geo-object is represented by its identity, time, thematic attributes, and geometry at time: Id(O), t, A1(id,t), …, An(id,t), Geom(id,t) respectively. In keeping with the rapid increase of 4D Big Data, such as meteorological satellite retrieval and forecasting of results, we embed the Geom(id,t) of an ST-object in a 4D context. For each ST-object, the thematic attributes and geometry are evolving over time. The geometries of geo-objects are usually indeterminate for natural phenomena, as there is no single threshold appropriate for the identification of all geographical objects for a natural phenomenon. Thus, we embed the indeterminacy of boundary in the data framework using a hierarchical approach. By modeling the natural phenomenon at several hierarchies, internal structures of the natural phenomenon of interest can show a distinct pattern at a given hierarchy. This approach can also represent the hierarchical relationships of nested objects, e.g., a high temperature zone is contained inside a low temperature zone.
3.2. ST-Relation
A spatiotemporal relation (
ST-relation) connects
ST-object at consequent timestamps, i.e., appearance, disappearance, expansion, contraction, continuation, splitting, and merging (
Figure 2). A continuation relation,
rcontinue, represents when object
o at time
ti continues at time
ti+1 and its geometry remains constant, where the
ST-object’s identifier does not change. Expansion (
rexpand) and contraction (
rcontract) are two other types of relations when the
ST-object continues to exist, but its geometry grows or decays. Splitting (
rsplit) and merging (
rmerge) are two relations when the
ST-objects cease to exist through splitting and merging. Appearance and disappearance are two relations that involve an
ST-object from only one particular timestamp, which represents the beginning or the end of the particular
ST-object. A specific
ST-relation (
R) is represented by the involved objects’ identity, time, and type of relation:
Id(
o1),
ti,
r,
Id(
o2),
ti+1; where
r ϵ [
rcontinue, rexpand, rcontract, rsplit, rmerge],
i ϵ [1,
n]. All attributes serve as the primary key of this table. Given objects
O1(
id1, t1, , g1),
O2(
id2, t2, , g2), and
O3(
id3, t3, , g3), the spatiotemporal relations can be defined as follows:
Expansion (Geometric Growth): O2 (id2, t2, , g2) continues existing and is geometrically growing from O1(id1, t1, , g1), where id1 = id2, and t1< t2.
Contraction (Geometric Decay): O2 (id2, t2, , g2) continues existing and is geometrically decaying from O1(id1, t1, , g1), where id1 = id2, and t1< t2.
Continuation (Geometry remains constant): O2 (id2, t2, , g2) continues existing and its geometry remains constant from O1(id1, t1, , g1), where id1 = id2, g1 = g2, and t1< t2.
Splitting: O1(id1, t1,, g1) splits into O2(id2, t2,, g2), and O3(id3, t2,, g3), where id1 ≠ id2 ≠ id3. Object O1 ceases to exist after the split and objects O2 and O3 never exist before. Then the graph will store the associated entries, (id1, t1, rs, id2, t2) and (id1, t1, rs, id3, t2).
Merging: O1(id1, t1,, g1) and O2(id2, t1,, g2) merge into O3(id3, t2,, g3), where id1 ≠ id2 ≠ id3. The merged objects O1 and O2 cease to exist and the new object O3 is formed by the merge and did not exist before. Then the relation table will store the associated entries, (id1, t1, rm, id3, t2) and (id2, t1, rm, id3, t2).
Appearance: O1(id1, t2,, g1) is not related to any object at t1;
Disappearance: O1(id1, t1,, g1) is not related to any object at t2.
3.3. ST-Event
A spatiotemporal event (
ST-event) is represented as a single occurrence for a definite time series which can involve one or more
ST-objects that have associated
ST-relations. A specific event
E is represented by its identity, starting time, ending time, and the corresponding list of
ST-objects and
ST-relations:
EID,
ti, tj, {
Om, …, On}, {
Rk, …, Rl}, where
ti <=
tj. AN ST
-event is represented as a graph linking the objects of timestamp
t with the objects of timestamp
t+1. Each node of the graph represents a specific object, and each edge represents the overlapping interactions between objects from consequent timestamps. Each edge corresponds with its own weight, which represents the degree of spatial overlap between two objects (Equation (1)).
where
represents an object of timestamp
t and
represents an object of timestamp
t+1. Therefore, for two objects that have continuation relation, their weight is exactly 1. Objects that have expansion, contraction, merge, or split relations have a weight larger than 0 and smaller than 1. For appearance and disappearance relations, there is no need to calculate the weights, since one of the objects (i.e.,
and
) does not exist. Each graph represents an ST
-event, and the number of levels is the number of timestamps that this
ST-event exists. For each
ST-object, the object hierarchy is stored as a sub-graph, and can be integrated with the overall time directed graph of the associated
ST-event (
Figure 3).
3.4. Graph-Based Measurements of ST-Event Evolution Dynamics
In order to quantify the dynamics based on the
ST-event graphs, we propose to measure the variations inside a
graph to quantitatively represent how an
ST-event evolves in time regarding its spatial and non-spatial characteristics. Guttler et al. [
33] proposed the quantification of the variations considering the 2D area change of the land cover, and the Euclidean distances between the changing land cover areas. Similar to Guttler et al. [
33], we represent a graph as
G, the set of objects covered by
G at the timestamp
t as
Gt, and the weight of the link between object
and object
from consecutive timestamps as
. In our approach, however, we enhance this quantification by considering the spatial distances as well as the change of 3D volume and intensity level of the natural phenomena.
The Variation (
Var) of a graph between two consecutive timestamps is calculated with the Equation (2).
There are two parts inside the Var formula, where the first part represents the importance of the object
compared to the entire set of objects at timestamp
t. A higher object importance will influence more on the total variation value. The second part investigates the changes between an object at timestamp
t and all objects at timestamp
t+1 that have linkages to it. This change is calculated as a weighted sum of the 3D Euclidian distances
and intensity differences
between the object
and
. The weight
measures the spatial overlap o between
and
. The intensity differences
are calculated based on the absolute differences between the intensity-weighted volumes of the object
and
, represented in Equation (3).
where the intensity levels of the object
(at timestamp
t) and the object
(at timestamp
t+1) are represented as
and
, and different intensity levels are taken into consideration for the overall volume calculation. This volume calculation strategy ensures that a small volume with a higher level of intensity can still have a high intensity-weighted volume value, and that it can still result in a high difference value if the intensity changes significantly while the volume does not. In addition, this intensity-weighted volume calculation naturally reflects and quantifies the internal structure of the objects.
The 3D distances and intensity differences are weighted by and respectively. The interdependence of these two variables needs to be calculated based on the univariate analysis of variance (ANOVA). If the interdependence is statistically significant, then and need to be adjusted based on the coefficient of different variables. If not, then and are assigned equal weights.
The Global Variation (GlobalVar) for a graph is calculated to summarize the variations from each pair of consecutive timestamps. The summarized GlobalVar evaluates the attributes of an event as a whole. The GlobalVar value of a particular graph reflects the extent of evolvements of the event within the spatiotemporal range. A high GlobalVar value indicates that there might be significant temporal evolutions occurred within the event’s lifecycle.
4. Dust Event Representation and Quantification
In order to evaluate the applicability of the spatiotemporal data framework for representing and quantifying the dynamics of natural phenomena, dust events in Northern Africa, the Middle East, and the Mediterranean were used to illustrate the instantiation of the proposed conceptual data framework and obtain information of dust events’ spatiotemporal evolution and transport characteristics. Dust events are extracted from 4D simulation dataset using object identification and tracking algorithms. The overall procedure, illustrated in
Figure 4, includes the following: (1) identifying static dust storm objects; (2) extracting the interactions of objects over consequent timestamps; (3) reconstructing dust storm events (ST-events); and (4) quantifying the variations of dust storm events.
4.1. Data and Implementation
Dust events are detected from the 4D simulation data generated by the operational dust model BSC-DREAM8bv2.0 [
40] maintained at the Barcelona Supercomputing Center. The spatial extent covers North Africa, the Middle East, and the Mediterranean (0.7 S–64.3 N, 25.7 W–59.3 E) with the spatial resolution of ~30 km. The vertical dimension covers 24 layers from the Earth surface to 15 km. The temporal extent covers 12 months, from December 2013 to November 2014, with a temporal resolution of one hour. The simulated variable used in this research is the dust concentration with a unit of μg/m
3, which reflects the intensity of dust particles in the atmosphere (
Figure 5).
The implementation of dust event representation and quantification (detailed in
Section 4.2,
Section 4.3,
Section 4.4 and
Section 4.5) is conducted in Python. The identified dust objects, relations, and events are stored in graph-based data structure using NetworkX (
https://networkx.github.io/), which is a Python library for the creation and manipulation of graphs. The total size of the 4D dust simulation data is 40.0 GB. The identified dust objects are recorded and indexed by their unique
Id(
O) and their associated attributes and geometries are stored in separate auxiliary files, which totals at 1.8 GB. The reconstructed dust events are stored in a graph-based data structure, which totals at 4.7 MB. The experiments were conducted on an 8-core Intel i7-9700k with 16.0 GB RAM. The execution time for each step takes 24 min 32 s for identifying dust objects, 13 min 12 s for tracking the linkages of objects over time, and 6 s for reconstructing dust events.
4.2. Identifying Dust Objects
Dust objects at each timestamp are identified using a region-grow based algorithm [
41], in which a dust object is defined as a “contiguous volume with dust concentration value larger than a threshold (
Dth), while its volume larger than a threshold (
Vth).” In this experiment, the dust concentration threshold is a set of 160, 320, 640 μg/m
3 (as intensity level 1, 2, and 3) and the volume threshold is 10 voxels. The multi-threshold of dust concentration is referred by the legend of dust concentration maps of model results from Barcelona Supercomputing Center (i.e., 20, 40, 80, 160, 320, 640, 1280, 2650 µg/m
3) (
https://ess.bsc.es/bsc-dust-daily-forecast), and is selected based on the health impact indications of the surface dust concentration to vulnerable populations as negligible, marginal, and severe dust intensity [
42]. Although the multi-thresholding approach used here and detailed in Yu and Yang [
41] is designed particularly for dust object identification, it is flexible and can be easily adapted to the identification of other atmospheric objects.
Figure 6 demonstrates the identified five objects (O1~O5) at Intensity Level 1, and the objects contained inside them. For example, Object O1_L2_1 is identified at Intensity Level 2 and is contained inside Object O1; and Object O5_L3_1 is identified at Intensity Level 3 and is contained inside Object O5_L2_1, which is inside Object O5. The relationships—
Contains—between objects identified at different intensity levels are computed through the identification algorithm [
36].
4.3. Tracking the Linkages of Objects over Consequent Timestamps
The tracking algorithm is an overlap-based method, detailed in Yu et al. [
43]. The identified dust objects from consecutive timestamps are compared regarding partial spatial overlaps to identify the potential linkages, and a best match is found from all possible combinations of objects from each pair of timestamps.
Based on the proposed framework,
ST-relations are automatically identified and recorded from the tracking results. An appearance is considered as the start of a specific
ST-object, and a disappearance is considered as the end of the object. In between, an
ST-object is temporally linked by continuation, expansion, or contraction, which are recognized as
ST-relations within the same
ST-object. Other linkages, including merging and splitting, are recognized as
ST-relations between different
ST-objects.
Figure 7 demonstrates an example of three consecutive timestamps, where
ST-objects O1, O2, and
O3 merge into
ST-object O5, and then O5 splits into
ST-objects O6, O7, and
O8; and
ST-object O4 continues to exist and merged with
ST-object O5 into
ST-object O8. 4.4. Reconstructing Dust Events
Dust events are then reconstructed by linking ST-objects that have ST-relations among each other. Each ST-event is represented by a graph connecting ST-objects with ST-relations, demonstrating how this event evolves and moves in space and time. The spatial and non-spatial attributes with each ST-object at a particular timestamp are associated with the nodes of the graph, and the weights representing spatial overlap are associated with the edges of the graph. Sample attributes are maintained in the graphs as attributes of the nodes, including centroid (i.e., the center of mass for a dust object), volume (i.e., the amount of cubic space inside of a dust object), and average concentration (i.e., the average dust concentration of all cubic space inside of a dust object). Object hierarchy subgraphs associated with individual ST-objects are integrated into the event graph.
In this experiment, there are 1796
ST-events/graphs reconstructed for the entire spatiotemporal range of the experiment data (
Table 1). The total number of nodes (
ST-objects) per graph ranges from 11 to 1959, with a mean value of 47. The total number of edges (
ST-relations) per graph ranges from 10 to 1980, with a mean value of 46. For each event, the temporal variation between each time pair and the global variation are calculated. For variance calculation,
and
are determined as independent or statistically different based on the univariate ANOVA test; thus
and
are both set to be 1. The
GlobalVar per graph ranges from 3.47 to 14,216.18, with a mean value of 363.75. The
ST-events with high GlobalVar values might reflect significant evolutions within the event’s lifecycle regarding the temporal changes in volume, intensity, and distance transported. However, an
ST-event with a high GlobalVar value does not necessarily indicate a significant evolution, as the event might be long-lasting so that the accumulated GlobalVar is higher. An
ST-event with a low GlobalVar value reflects that the event might be short-lived and has stable variations in volume, intensity, and distance transported.
4.5. Graph-Based Measurements of Dust Event Evolution Dynamics
In order to better illustrate the graph structure and content, we select three event graphs representing different levels of
GlobalVar (low, medium, and high). The first event graph with a relatively low
GlobalVar represents part of the Southern Arabian Peninsula dust event in July 2014 (
https://earthobservatory.nasa.gov/images/85370/persistent-dust-storms-on-the-southern-arabian-peninsula). Within the dust event, dust with a high concentration transported from the interior of Sudan to the Red Sea, carried by the northwest winds to the southeast. As indicated by the event graph (
Figure 8a), this
ST-event consists of an ST
-object with continuing types of
ST-relation—continuation, expansion, and contraction. The temporal variation graph (
Figure 8b) shows that this event evolves slowly in time with low variation values and a relatively low
GlobalVar value. Both the dust concentration and the volume profile graphs (
Figure 8c,d) describe the growing and decaying stages of this event. The dust transport (red trajectories in
Figure 8e–g) shows that this event travels from northwest of Saudi Arabia towards central Saudi Arabia while the dust plume covered both Saudi Arabia and United Arab Emirates.
The second event graph represents the Libya dust event in early September 2014, where a cold and windy front moving across the Libya desert caused a haboob (
https://earthobservatory.nasa.gov/images/84014/dust-plume-over-the-red-sea). A haboob is a type of intense dust event usually caused by strong winds of thunderstorm outflow. As indicated by the event graph (
Figure 9a), this
ST-event consists of seven
ST-objects with different types of
ST-relations—continuation, expansion, contraction, splitting, and merging. The temporal variation graph (
Figure 9b) shows that this event has a relatively high
GlobalVar value contributed majorly by two disturbingly high variation values at T3 and T5. The temporal profiles of dust concentration and volume (
Figure 9c,d) reveal the evolution in time that two
ST-objects merged into one, then split again and decayed gradually.
Figure 8e demonstrated the overall dust transport pathway for the September 2014 Libya dust event, where dust originating from the Libyan desert (
Figure 9f) was transported westward to Tunisia while picking up dust from other dust sources. The highlighted times are instances where the dust event evolves in a significant way, thus llustrating the changes in dust object volume and shape to further investigate at those times (
Figure 9g–j). High variation values happen when dust objects both merge and split almost equally, which is shown in
Figure 9c,d.
The third event graph represents the Middle East dust event in September 2014, where a long-term dust storm with high dust concentration and volume. It combined the characteristics of a haboob and a shamal. A haboob is usually a short-lived but dramatic dust event appearing in a wall of dust sweeping across the ground, while a shamal is a pattern of persistent northwesterly winds in the Middle East region. When dust events are associated with shamal, they can persist over a wide spatial and temporal range. This specific event is not recorded in NASA Earth Observatory, but a similar one was recorded in the same time of 2015 (
https://earthobservatory.nasa.gov/images/86539/dust-marches-across-iraq-and-iran). The event represented in
Figure 10 demonstrates that this event last for 23 days and has the one of the highest
GlobalVar values (within the experiment data) of 12784.68. The characteristics of a haboob in this event can be reflected in
Figure 10(b1–b4), where dust has a relatively low concentration but a large volume. One of the first peaks in variation is highlighted as T1, where a high wall of dust splits into two objects with almost even volume. Similarly, from the highlighted time T2, it is observed that the high variations occur when large volumes of dust split or merge. The characteristic of a shamal in this event is reflected by the overall movement of the dust transport from the northwest to the Persian Gulf, suggesting late-summer shamal winds. In addition, the long-lived dust event is sustained by the long-lasting shamal wind bringing up and maintaining dust particles in the atmosphere.
5. Discussions
The proposed framework allows the description of multiple ST-objects aggregating into an ST-event when these ST-objects have ST-relations at any time. However, the framework is limited in that an ST-object cannot be associated with several ST-events simultaneously. The reason the proposed framework is designed in this way is that multiple ST-objects (happening at the same time and having ST-relations at any stage during their lifecycles) are considered to represent the state of an event at a particular point in time. ST-events can happen simultaneously, but these ST-events do not have ST-relations at any stage during their lifecycles.
The proposed framework represents a data-driven analytical framework for understanding the evolution of natural phenomena, instead of a model based on physical laws. The representation and quantification of natural phenomena over space and time can be used in various ways to assist the understanding of natural phenomena, thus integrating or complementing physical models with data-driven analytics. The proposed framework can exploit the spatiotemporal dependencies and dynamics of natural phenomena, including the geometry, context, and spread trends. Based on these spatiotemporal dependencies and dynamics produced by the proposed framework, long-term spatiotemporal statistics and machine learning models can be built to better capture spatiotemporal patterns of a certain phenomenon. Integrating the data-driven analysis and physical models can potentially provide more accurate simulation and forecasting of the natural phenomena.
The proposed framework is applicable with minor adjustments to other weather events, including thunderstorms, jet streams, and ocean eddies. One of the examples applying the identification and tracking methods to satellite precipitation observations to generate rainfall events was reported in a previous article [
43], and it is expected that temporal variations of these rainfall events can be further calculated using the proposed framework in this study.
6. Conclusions and Future Work
This research proposes a spatiotemporal data framework to represent the characteristics and dynamics of natural phenomena in an event-oriented way. The spatiotemporal data framework consists of three major entities: ST-object, ST-relation, and ST-event. Considering the non-rigid boundaries of natural phenomena, ST-objects are represented in a hierarchical way, where lower intensity ST-objects contains higher intensity ones, and intensity values can be flexible. AN ST-event is represented as a graph, where ST-objects and ST-relations that are associated with this event are nodes and edges. Based on the graph representation, the measurements of temporal variations and the global variation are proposed to quantify the dynamics within the event’s lifecycle and the characteristic of the entire event. This graph-based quantification takes into account the following factors: spatial overlaps, object importance, distance change, and volume-intensity change; thus, it can represent a relatively objective measurement of the event variation. Such a holistic spatiotemporal data framework with event representation and evolution quantification can assist in the better understanding of the spatiotemporal dynamics of natural phenomena.
The proposed spatiotemporal data framework is applied to the one-year dust simulation dataset from the operational model maintained at the Barcelona Supercomputing Center. Dust objects are identified at each timestamp and the movements of dust events are tracked through consecutive timestamps. Dust events are then reconstructed and represented as event graphs. Different dust events along with the event graph, temporal variations, temporal profiles of dust volume and concentration, and the transport pathways are demonstrated and verified with the archived dust events at NASA Earth Observatory.
One of the future research directions is to integrate heterogenous databases for a multi-modal database to further assist the understanding of the relationship between natural phenomena with other physical and social factors, such as the relationship between dust events and disease outbreaks [
44], and the relationship between Sahara dust events and hurricanes [
45]. For example, to understand whether dust transport is a contributing factor to the outbreaks of infectious disease, we can establish a multi-modal database system that integrates a graph database for dust transport, a relational database for disease outbreak records, and a non-relational database for real-time environmental sensors for aerosols. As environmental big data evolves with real-time analytical demands, such a multi-modal database will be of significant benefit to scientific researchers and the industry.
The example framework presented in this resarch utilizes dust event identification and tracking as an example and leverages a standard desktop computinh environment to handle the sample data. Another future direction resides in utilizing advanced computing capabilities, such as Graphics Processing Unit (GPU) computing, cloud computing, and big data management systems, to accelerate the process of identifying and tracking the evolution of natural phenomena and enable the real-time analytics when integrating the increasingly available environmental big data. Such event representation and evolution quantification framework can further be integrated into early warning systems to provide dynamic and customized early warning messages to vulnerable populations with specific needs.