A Big Data Reference Architecture for Emergency Management

Iglesias, Carlos A.; Favenza, Alfredo; Carrera, Álvaro

doi:10.3390/info11120569


by Carlos A. Iglesias ¹,*, Alfredo Favenza ² and Álvaro Carrera ¹

¹ Department of Telematics Systems Engineering, Universidad Politécnica de Madrid, 28040 Madrid, Spain

² LINKS Foundation, 10138 Torino, Italy

* Author to whom correspondence should be addressed.

Information 2020, 11(12), 569; https://doi.org/10.3390/info11120569

Submission received: 16 October 2020 / Revised: 28 November 2020 / Accepted: 2 December 2020 / Published: 4 December 2020

(This article belongs to the Special Issue News Research in Social Networks and Social Media)


Abstract

Nowadays, we are witnessing a shift in the way emergencies are being managed. On the one hand, the availability of big data and the evolution of geographical information systems make it possible to manage and process large quantities of information that can hugely improve the decision-making process. On the other hand, digital humanitarianism has proven to be very beneficial for providing support during emergencies. Despite this, the potential of combining automatic big data processing and digital humanitarianism approaches has not been fully realized, though there is an initial body of research. This paper aims to provide a reference architecture for emergency management that instantiates the NIST Big Data Reference Architecture to provide a common language and enable the comparison of solutions for solving similar problems.

Keywords:

emergency management; crowdsourcing; big data; crowdworking; disaster management

1. Introduction

Having access to reliable information during emergencies is essential for effective emergency management. New technologies have mainly changed the nature and quantity of information available from different actors, such as public authorities, media, citizens, and volunteer organizations.

The growth of social media, satellite remote sensing, sensor networks, and connected devices has contributed to a data deluge beyond what can be captured, processed, and interpreted with traditional tools, which is usually known as a big data problem. According to NIST’s big data definition [1], “Big Data consists of extensive datasets—primarily in the characteristics of volume, variety, velocity, and/or variability—that require a scalable architecture for efficient storage, manipulation, and analysis”. Thus, big data technologies have been widely used to process data and improve disaster management decision-making processes [2,3].

Besides, social media and crowdsourcing have significantly impacted how information is processed and decisions are made. Consequently, emergency management has evolved from centralized top-down models managed by public authorities to collaborative approaches where citizen participation is encouraged. These two models represent the ends of a continuum of existing emergency management models [4]. At one end of the continuum lies the command and control approach [5] (also called strategic [6]), which follows an authoritarian model and divides competencies by level of command into strategic, tactical, and operational [7]. At the other end of the continuum lies the emergent human resource model [5] (also called people-centered [7] or tactical [6]), which tends to divide competencies by theme [7], such as communication, logistics, and shelter.

A common view is that traditional top-down crisis management approaches are necessary but not sufficient [8], and they should be complemented with the promotion of societal resilience. While top-down approaches can improve preparedness and planning of emergencies, an effective response during the immediate aftermath of a crisis is critically improved by citizens’ resilience.

The availability and adoption of Information and Communication Technologies (ICTs) have been among the reasons that have enabled this shift in emergency management [9]. Society has grown accustomed to immediacy and to gathering and delivering information in real time. Even when landline phone networks are unavailable or intermittently available, fiber-optic connectivity and mobile phone networks exhibit more resilient performance, especially for establishing SMS and other text-based short messaging communication. As stated by Eric Gujer [10]: “The Internet plays an increasingly important role in catastrophes and conflicts. Television fundamentally changed our perception of conflicts and disasters through live broadcasts from war zones in the nineties. The Internet, cell phones, and satellites are the next stage in the media revolution”. The effective use of social media has made possible phenomena such as the Arab Spring. In the words of the protester Fawaz Rashed: “We use Facebook to schedule the protests, Twitter to coordinate, and YouTube to tell the world”. In the emergency response domain, the effective usage of social media has also impacted emergency management. Disasters such as Haiti’s earthquake in 2010 “have represented a paradigm shift in the use of social media for disaster response, as multiple web-based platforms emerged to collect, refine, and disseminate crisis-related social media” [11].

Despite these advances, many challenges remain in leveraging the crowd’s wisdom and automatic information processing. The main identified shortfalls of crowdsourcing applications are scalability, quality control, coordination, safety, and forecasting capabilities. Several authors [12,13,14] report that crowdsourcing applications such as Ushahidi have severe scalability issues, since their inflow rate of information can reach thousands of messages per minute, surpassing crowd processing capacity and resulting in an ever-growing backlog of unprocessed requests. Another frequently discussed downside is the need for better quality control and assurance [12,13]. Quality assurance is required to improve classification and geo-location accuracy, reduce redundancy, and ensure critical control points. Concerning coordination, crowdsourcing applications have proven to be useful for gathering information during the disaster, but they do not support response coordination. Gao et al. [12] proposed to integrate groupsourcing, so that the system allows the separation of requests from crowds (end users suffering the catastrophe) and requests from groups (coordination messages of responding organizations). Besides, these platforms cannot forecast the evolution of incoming messages or emergencies in areas with limited or no communication ability [12].

In view of the above-identified shortfalls, it would be advantageous to provide methods and systems that enable us to effectively combine big data-enabled automatic processing with the power of human-centered approaches in emergency management. In this paper, we aim at providing a reference architecture that enables the combination of both approaches.

The remainder of this paper is organized as follows. Section 2 reviews existing works. Section 3 introduces the proposed Big Data Framework for Emergency Management that provides a panoramic overview of the different actors, data, tasks, and coordination means for emergency management. Section 4 presents how the reference architecture is mapped onto a case study. Section 5 analyses the results. Finally, the conclusions of the research are presented in Section 6.

2. Background

2.1. National Planning Framework for Emergency Management

Since disasters tend to be repetitive, disaster management usually defines cycles that include all the activities and measures to be taken before, during, and after the disaster to reduce its impact. The emergency management cycle usually considers four phases [15]: mitigation, preparedness, response, and recovery, although there are other proposals in the literature [16,17]. These phases are not sequential, but they overlap, interrelate, and complement each other [17]. They can be classified [18] into three stages: pre-disaster (mitigation and preparedness), during the disaster (response), and post-disaster (mitigation and recovery).

Mitigation comprises all activities aiming at reducing the impact of the disaster (public education, building codes and zones, buying flood and fire insurance, etc.). Preparedness defines plans about responding (e.g., emergency training, warning systems, evacuation plans). Response deals with all the activities to minimize the hazards created by the disaster (e.g., search and rescue, emergency relief, seeking shelter). Finally, recovery is the process of repairing damage and returning the community to a normal situation (e.g., temporary housing, restoring services, financial assistance).

Business processes involved in emergency management should be identified to analyze the applicability of big data techniques. To this end, we have adopted the United States National Planning Frameworks [19]. The National Planning Frameworks provide a governmental guide for preparing for, and delivering a unified national response to, disasters and emergencies. There are five frameworks, one for each of the five preparedness mission areas: the National Prevention Framework [20], the National Protection Framework [21], the National Mitigation Framework [22], the National Response Framework (NRF) [23], and the National Disaster Recovery Framework [24].

Each framework defines the partners involved in emergency response, and their roles and responsibilities. Additionally, it provides a shared vocabulary for defining the core capabilities and activities that must be accomplished in incident management. There are 32 core capabilities identified. Table 1 collects the 24 core capabilities related to emergency management. The core capabilities related to terrorism have been excluded, since they are outside the scope of this work. In particular, the following core capabilities have been excluded: prevention and protection (screening, search, and detection; and interdiction and disruption), prevention (forensics and attribution), and protection (access control and identity verification; cybersecurity; physical protective measures; risk management for protection programs and activities; and supply chain integrity and security). Besides, the protection and prevention phases have been merged into the preparedness phase, since this phase is more widely used in the literature and, after removing the core capabilities related to terrorism, these two phases share the same core capabilities.

The core capability of intelligence and information sharing is particularly relevant in the role of community participation. The National Prevention Framework does not formalize the tasks associated with this core capability but provides a list of critical tasks: planning and direction, collection, exploitation and processing, analysis and production, dissemination, feedback and evaluation, and assessment.

2.2. NIST Big Data Reference Architecture

Reference architectures [25] aim at providing abstract software architectures that collect architectural patterns and software elements for supporting the development of systems in specific domains. Several authors have proposed general reference architectures for big data systems [26,27,28], as well as reference architectures for big data systems in specific domains, such as security [29], industry [30], e-learning [31], cloud-based video analytics [32], and smart cities [33], to name a few.

In this section, we briefly review the NIST Big Data Reference Architecture (NBDRA), shown in Figure 1. It is the proposal that has achieved the most support from academia and industry, having been developed by a working group launched in 2013 with over six hundred participants from industry, academia, and government. It provides a vendor-neutral, technology- and infrastructure-agnostic conceptual model of a big data architecture. NBDRA defines an open reference architecture representing big data systems and is intended to support data engineers, data scientists, data architects, software developers, and decision-makers in developing interoperable big data solutions. The reference architecture is organized around five major roles and two fabric roles. The five NBDRA roles are: data provider, data consumer, big data application provider, big data framework provider, and system orchestrator. The two fabric roles are management, and security and privacy. These two fabrics provide services to the five main roles.

These actors and fabrics interact as follows. Data provider actors make data available to others. Data are then processed by a big data application provider that executes the data life cycle to meet the application requirements defined by a system orchestrator. The big data application provider uses the resources of the big data framework provider, which makes the required infrastructure available for processing and storing the data. The output of the system is received by a data consumer, which exploits the insights of big data processing. Additionally, a management fabric takes charge of maintaining the data quality while addressing management tasks such as system, data, security, and privacy considerations. Specific attention is paid to security and privacy, and special measures are taken by the security and privacy fabric so that these requirement policies are met, including auditing.

Five big data processing activities for big data application providers are defined: collection, preparation, analytics, visualization, and access. The collection activity handles the interface with the data provider. The preparation and curation task deals with data validation, cleansing, standardization, reformatting, and frequently persisting the data. The analytics activity implements techniques to extract knowledge from the data. The visualization activity deals with presenting the data to the data consumer to communicate their insights optimally. Finally, the access activity provides a service for handling the requests of the data consumer. These phases can be processed in different ways attending to the requirements and big data platform capabilities. There are three main approaches [34,35]: batch processing, stream processing, and interactive processing.
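
As an illustration, the five activities can be read as stages applied in sequence to the data received from a data provider. The following minimal Python sketch follows that reading in batch style. NBDRA does not prescribe any implementation, so the function names, the record fields, and the toy "flood" keyword rule are all hypothetical.

```python
from collections import Counter

# Hypothetical raw feed standing in for a data provider.
RAW_FEED = [
    {"text": "  Flood near the bridge ", "source": "social"},
    {"text": "", "source": "social"},
    {"text": "Road closed by FLOOD water", "source": "sensor"},
    {"text": "Concert tonight", "source": "social"},
]

def collect(feed):
    """Collection: interface with the data provider (here, copy the records)."""
    return [dict(record) for record in feed]

def prepare(records):
    """Preparation: validate, cleanse, and standardize the records."""
    cleaned = []
    for record in records:
        text = record["text"].strip().lower()
        if text:  # validation: drop empty messages
            cleaned.append({**record, "text": text})
    return cleaned

def analyze(records):
    """Analytics: a toy keyword rule standing in for real knowledge extraction."""
    return [{**r, "flood_related": "flood" in r["text"]} for r in records]

def visualize(records):
    """Visualization: a one-line textual summary instead of a chart or map."""
    counts = Counter(r["source"] for r in records if r["flood_related"])
    return ", ".join(f"{source}: {n}" for source, n in sorted(counts.items()))

def access(records, source):
    """Access: serve a data consumer's request for one source."""
    return [r for r in records if r["source"] == source and r["flood_related"]]

analyzed = analyze(prepare(collect(RAW_FEED)))
summary = visualize(analyzed)
```

Here `summary` and `access(analyzed, "sensor")` play the role of the outputs handed to a data consumer. A stream or interactive deployment would apply the same stages record by record or on demand rather than over a whole list.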

After the conceptual definition of NBDRA, the NIST Big Data Public Working Group has addressed the definitions of the interfaces between NBDRA components [36], and the elaboration of guidelines for its adoption [37].

The phases defined by the NIST framework can be easily mapped to other frameworks [26,38,39]. Some works [40] extended big data architectures in order to integrate metadata and quality management components so that the data ingested in the architecture was annotated to enable provenance and quality assessment.

2.3. Big Data Technologies in Emergency Management

Big data is “a term used to describe the large amount of data in the networked, digitized, sensor-laden, information-driven world” [34]. Big data problems have been traditionally characterized by four dimensions: volume, velocity, variety, and veracity, known as The Four Vs [41,42].

Disaster management can be characterized as a big data problem according to these dimensions. First of all, a great variety of data sources are integrated for disaster management. According to Qadir et al. [43], disaster data can be classified into data exhaust, online activity, sensors, small data, public data, and crowdsourced data. Data exhaust refers to information passively collected (e.g., mobile call detail records, banking records, credit card history, access logs). Online activity data are all the data derived from users’ social activity (e.g., SMS, emails, posts, comments, search engine activity). Sensing technologies refer to information collected by sensors, such as remote sensing (e.g., satellites and aircraft), networked sensing (e.g., networked sensor systems), and participatory sensing technologies (e.g., sensors from everyday devices such as phones or buses). Small data are data derived from individual personal traces [44] that can complement big data for providing personalized solutions. Public data are all the public data provided by official channels, such as governmental and municipal offices. Finally, crowdsourced data are the data generated actively by the population (its suppliers are frequently known as digital humanitarians [45]), who participate in a network of volunteers to support disaster management. Depending on their characteristics, spatial and non-spatial, the different data sources require different batch and stream big data processing frameworks, as detailed by Cumbane and Gidófalvi [46].

Volume and velocity dimensions are associated with the previously enumerated data sources. To name a few examples, Kwak [47] reports that the big data system for flood disaster risk assessment uses as one of its data sources the satellite Himawari-8, and it generates a file size of 329 GB per day and 930 MB per 10 min. During the 2010 Haiti earthquake, the US Geological Survey (USGS) reported that over 600,000 files representing 54 terabytes of imagery data were distributed within the first six weeks after the primary event [48]. Kryvasheyeu et al. [49] report that more than 50 million tweets were posted during Hurricane Sandy.

Data veracity should be verified, since a lack of control could lead to misleading decisions. In particular, crowdsourced data usage brings several potential issues [50], such as the inaccuracy of information and rumors and the malicious use of social media. Several social media verification approaches have been followed, based on automatic intelligent processing or crowdsourcing [50].

Several authors have summarized existing research in the application of big data systems to emergency management [2,43,46,51,52,53,54,55,56,57]. The availability of resilient communication networks is one of the challenges for the application of big data technologies during emergencies, since large-scale disasters can result in massive blackouts. Song et al. [57] reviewed the main approaches for achieving network resiliency, which can be enhanced using big data [2], such as the use of ad hoc networking, delay/disruption-tolerant networking, and smartphone-based Emergency Communication Networks (ECNs). Several surveys [51,53] analyzed the application of data management techniques in disaster situations and reviewed the main challenges and solutions that data science can face in the areas of data integration and ingestion, information extraction, information retrieval, information filtering, data mining, and decision support. Goswami et al. [58] conducted a review of the application of data mining techniques for disaster management. They concluded that the main usages are prediction (e.g., forecasting of the magnitude of an earthquake [59]), detection of natural disasters (e.g., early earthquake detection based on social sensors [60]), and disaster management strategies (e.g., understanding people’s needs and sentiment based on blogs and social networks [61]). Li et al. [53] added other application types, in particular, disaster simulations (e.g., 3D storm surge visualizations to improve situational awareness and evacuation decisions [62]), disaster visualization (e.g., visual analytics facilities to improve information sharing across stakeholders [63]), and insurance risk modeling (e.g., evaluating the economic effects of earthquakes in construction [64]). Several authors [3,55,58] reviewed the application of big data technologies depending on the disaster type and emergency phase.

The recent systematic review by Freeman et al. [56] reveals that ICT and big data technologies have been used mostly in real natural disasters in the response phase (75% of reviewed works). The ICT tools that have been used most frequently together with big data technologies are Geographical Information Systems (GIS), social media tools, patient health databases, and general disaster management software. With regard to the big data consumers, Freeman et al. identified clinical first responders, community members, national governments (military or non-military), and local Non-Governmental Organizations (NGOs). They report that most articles (64.47%) are targeted to clinical first responders or community members. In contrast, the systematic review by Akter and Wamba [65] reports that most works address the mitigation phase (36.8%), followed by the response phase (28.9%). This discrepancy may stem from the different selection criteria of the two reviews, since Freeman et al. considered only academic works dealing with real emergencies and simulations, while Akter and Wamba focused their review on works that provide theoretical insights. Finally, Yu et al. [54] conducted another systematic review that provides a detailed classification of articles according to the disaster management phase, data source, and disaster type.

2.4. Digital Humanitarianism in Disasters

The effective use of social media and crowdsourcing in the 2010 Haiti earthquake marked a turning point for leveraging public participation in disaster response [66]. It has led to the development of a new digital humanitarianism [67] that uses crowdsourcing, remote volunteer collaboration, data production and processing, social media, and crisis mapping.

Thus, in this section, we aim to review and characterize the tasks developed by humanitarian organizations in emergency management. To clarify the terminology used in our study, we define below three terms that are sometimes interchanged: social media, social networks, and crowdsourcing.

Social media are [68] “a group of Internet-based applications that build on the ideological and technological foundations of Web 2.0, and that allow the creation and exchange of User Generated Content”. Depending on the level of self-disclosure and ordered by media richness, social media can be classified [68] into high self-disclosure (e.g., blogs, Social Network Sites (SNSs), and virtual social networks) and low self-disclosure (e.g., wikis, content communities such as Flickr and YouTube, and virtual game worlds). SNSs are [69] “Web-based services that allow individuals to (1) construct a public or semi-public profile within a bounded system, (2) articulate a list of other users with whom they share a connection, and (3) view and traverse their list of connections and those made by others within the system”. Finally, crowdsourcing is [70] “the act of a company or institution taking a function once performed by employees and outsourcing it to an undefined network of people in the form of an open call”. These three systems have different uses in emergency management. While social media are used for general information and knowledge communication, SNSs are used for coordination and personal information communication, and crowdsourcing provides the ability to outsource the response to people outside the damaged area. In particular, it is important to point out that visualization (especially crisis maps) has a communicative purpose in big data platforms, but it is also an effective means for coordination among digital humanitarians.

Several works have reviewed the roles of social media [4,71,72,73,74,75] and crowdsourcing [76,77,78,79] in emergency management. Besides, some works review how digital humanitarianism can benefit from big data [67,80] and computational techniques [81,82].

According to Alexander [4], social media play three critical roles during emergencies: a listening function to understand people’s opinions and concerns; a monitoring function for improving disaster management based on people’s experiences; and a dissemination function during emergency planning and crisis management. Other works also review the role of social media during specific disaster phases, such as preparedness [74] and situation awareness [71]. Yin et al. [71] propose an architecture for emergency situation awareness whose main data processing components are burst detection, text classification for impact assessment, online clustering for topic discovery, and geotagging. Anson et al. [74] classify potential uses of social media for preparedness as (i) improving the effectiveness of preparedness communication by tailoring prepared messages to particular target audiences, increasing the reach of these messages by scheduling them properly and identifying influential users, and evaluating the effectiveness of the campaign; (ii) discovery of community networks that can be mobilized before a disaster occurs; and (iii) providing preparedness information.

Several authors have reviewed the role of crowdsourcing in emergencies. Poblet et al. [76,78] classified social media approaches into data-oriented, communication-oriented, and crowdsourcing. Data-oriented approaches analyze social media to extract relevant information that complements existing procedures, while communication-oriented approaches aim at enhancing communication between citizens and disaster managers. In contrast, crowdsourcing is between these two approaches, since it leverages people’s workforce in the disaster management cycle. Poblet et al. identified as leading roles data generation (passively, actively, or structured as reports) and microtaskers (e.g., tagging or geolocating), and reviewed the functionality provided by crowdsourcing tools. Liu [77] defined a framework for characterizing the role of crowdsourcing in emergency management. She defined six dimensions: why (types of tasks), who (types of crowds), what (types of flows), where (spatial aspects), and when (temporal aspects). This review is mainly interested in the why dimension, which identifies the following tasks: crowd sensing, crowd tagging, crowd mapping, and crowd curating. Finally, Kankanamge et al. [79] carried out a systematic review of crowdsourcing’s impact on disaster risk reduction.

Finally, another interesting perspective is the usage of computational techniques for processing social media and crowdsourcing. Imran et al. [80,81,83] reviewed the computational techniques for processing social messages. During the mitigation phase, the main activity is event detection from social data streams using topic detection, new event detection, and tracking techniques. In the response and recovery phases, several techniques are used for managing the information overload, which can complement crowdtasking processing. Information classification is usually carried out using supervised machine learning based on previously labeled examples. The purposes of classification depend on the data available as well as on the information needs. They distinguish between classification by information provided (e.g., affected people, infrastructure damaged), by information source (e.g., citizens, media, government), by information credibility factors (e.g., fake news, rumors), by temporal aspects (e.g., emergency phase), by geographical location (e.g., a specific geographical area), or by factual, subjective or emotional content (e.g., citizens’ feelings). Another technique is information clustering, an unsupervised machine learning technique whose primary usage is grouping similar messages or detecting anomalies. Other techniques frequently used are text summarization and semantic enrichment, mainly based on entity recognition and linking. Zhang et al. [30] carried out an interdisciplinary review of social media use in disasters. They point out the need to analyze temporal and spatial patterns to understand the information diffusion process using social network analysis techniques, providing valuable insights for understanding the rumor propagation process.
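
To make the idea of supervised classification from previously labeled examples concrete, the following minimal Python sketch trains a multinomial naive Bayes classifier with add-one smoothing on a handful of messages. Naive Bayes is chosen here only for brevity and is not claimed to be the technique used in the reviewed works. The training messages are invented for illustration, and the two labels merely echo the "classification by information provided" categories mentioned above.

```python
import math
from collections import Counter, defaultdict

# Hypothetical labeled examples; a real system would need a far larger annotated set.
TRAINING = [
    ("bridge collapsed and road blocked", "infrastructure_damaged"),
    ("power lines down across the road", "infrastructure_damaged"),
    ("family trapped on roof needs rescue", "affected_people"),
    ("injured people need medical help", "affected_people"),
]

def tokenize(text):
    return text.lower().split()

def train(examples):
    """Count word occurrences per label and collect the vocabulary."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    vocabulary = set()
    for text, label in examples:
        label_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocabulary.add(token)
    return word_counts, label_counts, vocabulary

def classify(text, model):
    """Return the label with the highest log-probability under naive Bayes."""
    word_counts, label_counts, vocabulary = model
    total_examples = sum(label_counts.values())
    scores = {}
    for label, n_examples in label_counts.items():
        score = math.log(n_examples / total_examples)  # log prior
        denominator = sum(word_counts[label].values()) + len(vocabulary)
        for token in tokenize(text):
            if token not in vocabulary:
                continue  # ignore words never seen during training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log((word_counts[label][token] + 1) / denominator)
        scores[label] = score
    return max(scores, key=scores.get)

model = train(TRAINING)
prediction = classify("people trapped and need rescue", model)
```

With this toy training set, `prediction` is `"affected_people"`, because four of the five words in the message were seen only under that label.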

The last two perspectives can also be combined so that automatic and human elements work together in processing data pipelines, so-called Crowdsourced Stream Processing (CSP) [84]. Data pipelines are a linear sequence of data processing steps where each step processes the previous one’s output. The data pipeline design pattern is a classical approach to data processing that has gained attention with the rise of big data, since this pattern helps manage the big data volume, thanks mainly to the use of distributed batch-processing big data platforms [85]. This adoption of big data technologies into humanitarian operations is an ongoing effort [86]. It provides many benefits, such as real-time information access and improving the decision-making process. Nevertheless, there are still some challenges [86], which include the availability of big data infrastructures and staff in marginalized regions, and the need to define suitable data policies to preserve data protection and privacy. In addition, crowdsourced big data could reinforce digital inequalities [87]. As Burns [67] discusses, big data is not only a new source of data for digital humanitarianism. Its adoption requires transforming humanitarian organizations that should adopt a new set of practices.
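
The combination of automatic and human steps in a linear data pipeline can be sketched as follows. This is a minimal illustration, assuming that the automatic step reports a confidence score and that low-confidence items are routed to volunteers. The threshold, the toy keyword tagger, and the `simulated_volunteer` function (a stand-in for a call to a crowdworking platform) are hypothetical, and the sketch processes a small list rather than a true stream.

```python
from functools import reduce

CONFIDENCE_THRESHOLD = 0.8  # hypothetical cut-off for sending items to the crowd

def automatic_tagging(messages):
    """Automatic step: attach a label and a toy confidence score to each message."""
    tagged = []
    for text in messages:
        hits = sum(word in text.lower() for word in ("help", "trapped", "injured"))
        if hits >= 2:
            label, confidence = "urgent", 0.9
        elif hits == 1:
            label, confidence = "urgent", 0.5  # ambiguous: only one cue word
        else:
            label, confidence = "other", 0.9
        tagged.append({"text": text, "label": label, "confidence": confidence})
    return tagged

def crowd_review(items, ask_crowd):
    """Human step: low-confidence items are re-labeled by volunteers."""
    reviewed = []
    for item in items:
        if item["confidence"] < CONFIDENCE_THRESHOLD:
            item = {**item, "label": ask_crowd(item["text"]),
                    "confidence": 1.0, "reviewed_by": "crowd"}
        reviewed.append(item)
    return reviewed

def keep_urgent(items):
    """Final step: keep only the messages labeled as urgent."""
    return [item for item in items if item["label"] == "urgent"]

def run_pipeline(data, steps):
    """Apply each step to the output of the previous one."""
    return reduce(lambda current, step: step(current), steps, data)

def simulated_volunteer(text):
    # Stand-in for a human judgment obtained through a crowdworking tool.
    return "other" if "drill" in text.lower() else "urgent"

messages = [
    "Help, we are trapped on the second floor",
    "Trapped-in-traffic jokes during the evacuation drill",
    "Shelter opening hours updated",
]
steps = [
    automatic_tagging,
    lambda items: crowd_review(items, simulated_volunteer),
    keep_urgent,
]
result = run_pipeline(messages, steps)
```

In this run only the first message reaches the end of the pipeline: the second is tagged with low confidence, reviewed by the simulated volunteer, and discarded, while the third is filtered out automatically.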

3. Big Data Reference Architecture for Emergency Management

This section presents the design of a big data reference architecture for emergencies based on NBDRA. The reference architecture shown in Figure 2 has been constructed inductively based on the analysis of the literature previously presented. Analytical tasks have been classified according to the CommonKADS task hierarchy [88], as explained below.

The proposed reference architecture aims at developing a shared understanding of the applications of big data for emergency management. This reference architecture can be used for knowledge management by collecting and organizing best practices and for its practical implementation.

Data providers introduce information feeds in the system. The proposed reference architecture extends previous taxonomies [89,90], and includes ICT systems that provide information to the big data system [56]. Data providers have been classified as:

Digital sensors: data collected passively through the use of digital services (e.g., mobile phones, web searches).
Physical sensors: sensors [90] (e.g., satellite [91], wireless sensor networks [90,92] and geospatial) focused on remote sensing of changes in human activity.
Social media and news media: the information published on the Internet (e.g., blogs, Twitter) can be traced as social sensors of people’s opinions and intents. Especially relevant is geolocated social media [72].
Open data: open information provided by governments (e.g., census, statistics) and organizations (e.g., Wikipedia).
Crowdsourcing: information produced actively by users in order to report information about a disaster (e.g., mobile phone reporting tool, emergency map).
Health Information Management Systems: health information for managing the disaster, mainly related to patients and hospital management systems.
GIS: geographical information provided by GIS systems.

The five processing activities within the big data application provider have been further detailed for emergency management.

The collection activity uses standard big data collection techniques for accessing data providers and persisting data in the big data framework provider. Depending on the disaster phase, the system orchestrator should configure access to data providers and the security and privacy fabric components to follow the established requirements and data policies. The main specificity for disaster management is the integration with crowdworking software.

The preparation activity comprises data cleansing, standardization, validation, and enrichment. The proposed framework includes a list of microtasks derived from the literature review: filtering [93], tagging [94], translation [94], geocoding [95], geotagging [96], validation to check the veracity or data correctness [97], correction [98], summarization [99], and comparison [100]. Many of these tasks can be done using crowdsourcing or automatic methods. For example, Imran et al. [97] use automatic techniques for filtering and classifying images, and the classification is validated using crowdsourcing.

The analytics activity aims at extracting knowledge from the ingested data. Analytic tasks have been organized based on the CommonKADS task library [88], since it provides a general framework for classifying the potential uses of big data analytics. This framework distinguishes two general task types: analytical and synthetic tasks. Analytical tasks produce a characterization of the system and are subdivided into prediction, classification, diagnosis, assessment, and monitoring. Synthetic tasks construct a description of the system and are subdivided into assignment, scheduling, planning, modeling, and design. This categorization has been used for classifying uses of big data according to NRF core capabilities in the different phases of disaster management: mitigation (Table 2), preparedness (Table 3), response (Table 4), and recovery (Table 5).
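As an illustration, the two task families and their subdivisions listed above can be encoded as a small lookup structure. The names `TaskType`, `TASK_LIBRARY`, and `task_type_of` are assumptions made for this sketch; only the task names themselves come from the text.

```python
from enum import Enum


class TaskType(Enum):
    ANALYTICAL = "analytical"  # produces a characterization of the system
    SYNTHETIC = "synthetic"    # constructs a description of the system


# Subdivision of each task type, as enumerated in the text.
TASK_LIBRARY = {
    TaskType.ANALYTICAL: (
        "prediction", "classification", "diagnosis", "assessment", "monitoring",
    ),
    TaskType.SYNTHETIC: (
        "assignment", "scheduling", "planning", "modeling", "design",
    ),
}


def task_type_of(task: str) -> TaskType:
    """Return the task family a given task name belongs to."""
    name = task.strip().lower()
    for task_type, tasks in TASK_LIBRARY.items():
        if name in tasks:
            return task_type
    raise ValueError(f"unknown task: {task!r}")
```

A catalogue of big data uses could then be keyed by disaster phase, core capability, and task name, with `task_type_of` recovering the task family when needed.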

During the pre-disaster stage, big data analytics can contribute to building resilient infrastructures and communities, both in mitigation and preparedness activities. As shown in Table 2, during the mitigation phase, big data technologies can help in reducing the impact of disasters by providing a long-term hazard data collection system. Big data analytics can be used for risk assessment, in order to understand vulnerabilities to threats and hazards, and develop plans and strategies to manage them. In addition, monitoring and prediction analytic tasks are also relevant, since they can help decision makers to prioritize risks and make informed decisions. Regarding preparedness activities, big data technologies can improve decision making in planning, coordination, and information activities, as shown in Table 3.

During the disaster stage, big data technologies can provide real-time decision support for disaster management, since they can manage the variety, volume, and velocity of the available data sources. As shown in Table 4, the main purpose of analytic tasks is to provide real-time assessment. In fact, the integration of big data has transformed a decision-making process that was previously based on historical data [86]. Organizations can now make more informed decisions and adapt their strategy when the situation changes. As illustrated in Table 4, big data analytics can provide assessments for improving decision making in a wide range of activities, such as analysis of social media for emergency planning [101], rescue team coordination [102], and triage [103]. In addition, analytic tasks can provide new insights, since they can detect hidden patterns that enable decision makers to gain a deeper understanding of the situation [86]. Monitoring activities can benefit from the integration of heterogeneous sources [104], and help in detecting trends and patterns to foresee potential issues [86,105,106]. Moreover, big data technologies not only improve situational awareness: predictive analytic tasks also enable moving from hindsight to foresight and anticipating the consequences of the current situation.

Finally, during the aftermath of the disaster, big data technologies can contribute to monitoring the recovery status and provide assessments to evaluate the socio-economic consequences and recovery efforts, as can be seen in Table 5.

The visualization activity presents processed data to data consumers. The proposed reference architecture includes crisis maps since they are among the most popular visualization mechanisms for crowd data. They provide an overview of the emergency situation and include layers for organizing the information (e.g., incidents, safety, and security) [107].

The access activity manages communication and interaction with data consumers. For disaster management, specific attention should be paid to the communication with crowdsourcing tools, and with visual analytics tools such as crisis mapping ones.

Finally, data consumers use the output of the big data system for managing the disaster. Data consumers of the Big Data System for Emergency Management are:

Government: governmental partners responsible for disaster management.
Media: mass media that contribute to information distribution and sharing during the emergency cycle.
NGOs: participating in the emergency as first responders.
Citizens: citizens affected or non-affected by the emergency.
Crowdsourcing: digital humanitarian organizations participating proactively in emergency management.
Health information management systems: health systems that can use the big data insights for their decision making processes.
GIS: GIS systems that can aggregate information from the big data system.
Social media management: social media management tools that can use big data insights for improving information sharing impact.

The proposed reference architecture enables the integration of automatic (big data-based) and crowdsourcing resources as follows.

Regarding big data processing, data pipelines correspond to the processing tasks carried out by big data application providers in NBDRA according to the requirements specified in the system orchestrator. The execution of data pipelines usually requires the system orchestrator’s interaction with other systems that play the role of big data application provider, management fabric, and security and privacy fabric. As Imran et al. [84] point out, crowdsourcing systems are more suitable for data entry, binary classification, and n-ary classification microtasks. The use of automatic or human processing for these tasks depends on disaster requirements and resource availability.
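The choice between automatic and human processing could be expressed as a simple routing rule. The sketch below is only one possible policy under stated assumptions: the confidence score of the automatic method and the `threshold` parameter are hypothetical, and only the three crowd-suitable microtask kinds come from the text.

```python
# Microtask kinds reported as more suitable for crowdsourcing.
CROWD_SUITABLE = {"data entry", "binary classification", "n-ary classification"}


def route_microtask(
    kind: str,
    crowd_available: bool,
    automatic_confidence: float,
    threshold: float = 0.8,  # hypothetical cut-off, to be tuned per disaster
) -> str:
    """Return "crowd" or "automatic" for a single microtask.

    A task goes to the crowd only if its kind suits crowdsourcing,
    volunteers are available, and the automatic method is not
    confident enough on its own.
    """
    if (
        kind in CROWD_SUITABLE
        and crowd_available
        and automatic_confidence < threshold
    ):
        return "crowd"
    return "automatic"
```

In this policy, resource availability is captured by `crowd_available`, while disaster requirements would be reflected in the chosen threshold.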

With reference to the integration of crowdsourcing resources, digital sensors and social media have been identified as data providers, which corresponds to the crowdsourcing roles “crowd as a reporter”, “crowd as a sensor”, and “crowd as a social computer” according to the crowdsourcing role taxonomy defined by Poblet et al. [76]. Besides, the activities defined in the big data application provider can be executed automatically by the big data system or orchestrated as microtasks, which corresponds to the crowdsourcing role “crowd as a microtasker” of the previously mentioned taxonomy. The access activity also considers the integration of interfaces with the crowdsourcing tools [78], including the popular crisis mapping system [108]. Finally, crowdsourcing also plays the role of data consumer. Digital humanitarians can benefit from the use of big data systems to optimize their performance.

4. Case Study

This section describes a case study to show how the defined reference architecture can be mapped onto published disaster management architectures.

Kabir et al. [136] proposed the system STIMULATE for coordinating rescue operations based on the information published by affected people in the social network Twitter. The system is deployed in a cloud environment using Hadoop and comprises three components: the tweet fetcher, tweet processing, and rescue scheduling.

The tweet fetcher component collects tweets using the Twitter streaming API. A Web interface allows filtering tweets using multiple keywords and locations. The location area can be selected on a map. Then tweets are preprocessed, replacing emojis, jargon, slang, and contractions with more common wordings. The result is stored in a MongoDB dataset [137].
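The preprocessing step described above can be illustrated with a short sketch. It is not the STIMULATE implementation: the three lookup tables are tiny hypothetical examples, and the tokenization is deliberately simple (it lowercases the text and discards punctuation).

```python
import re

# Hypothetical lookup tables; real ones would be far larger.
CONTRACTIONS = {"can't": "cannot", "i'm": "i am", "we're": "we are"}
SLANG = {"pls": "please", "asap": "as soon as possible", "u": "you"}
EMOJIS = {"🆘": "sos", "🌊": "water"}


def normalize_tweet(text: str) -> str:
    """Replace emojis, contractions, and slang with more common wordings."""
    text = text.lower()
    # Replace each known emoji with a word, padded so it becomes its own token.
    for emoji, word in EMOJIS.items():
        text = text.replace(emoji, f" {word} ")
    # Keep word characters and ASCII apostrophes; punctuation is dropped.
    tokens = re.findall(r"[\w']+", text)
    expanded = []
    for token in tokens:
        token = CONTRACTIONS.get(token, token)
        token = SLANG.get(token, token)
        expanded.append(token)
    return " ".join(expanded)
```

For example, `normalize_tweet("Pls help, we're stuck 🆘")` yields `"please help we are stuck sos"`. Note that this sketch only recognizes the ASCII apostrophe in contractions.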

The tweet processing component aims at detecting stranded individuals and determining their rescue needs and priority. For this purpose, the system extracts locations. Multi-label multi-class classification is then performed based on a taxonomy provided by the Federal Emergency Management Agency (FEMA) for rescuing stranded people. The categories are: rescue needed, DECW (diseased, elderly, children, and pregnant women), water needed, injured, sick, and flood. Then, rescue priority is calculated based on the aggregation of different factors, such as weather conditions obtained using Open Weather API (Open Weather Service available at https://openweathermap.org/api). The tweet classifier uses a deep neural network built with the Keras [138] and Tensorflow [139] libraries; it has been trained with the Harvey and Irma datasets and evaluated on 15 public disaster datasets.
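To make the idea of aggregating factors into a rescue priority concrete, the following sketch combines the predicted labels with a weather severity score. The weights and the multiplicative formula are assumptions chosen for illustration; the source does not specify how STIMULATE aggregates its factors.

```python
from typing import Iterable

# Hypothetical weights per category of the taxonomy described above.
LABEL_WEIGHTS = {
    "rescue needed": 3.0,
    "DECW": 2.0,
    "injured": 2.0,
    "sick": 1.5,
    "water needed": 1.0,
    "flood": 1.0,
}


def rescue_priority(labels: Iterable[str], weather_severity: float) -> float:
    """Aggregate predicted labels and weather into a single priority score.

    weather_severity is assumed to be normalized to [0, 1], for instance
    derived from a weather service; 0 means calm and 1 means extreme.
    """
    if not 0.0 <= weather_severity <= 1.0:
        raise ValueError("weather_severity must be in [0, 1]")
    # Sum the weights of distinct labels; unknown labels contribute nothing.
    base = sum(LABEL_WEIGHTS.get(label, 0.0) for label in set(labels))
    # Worse weather scales the priority up, to at most twice the base score.
    return base * (1.0 + weather_severity)
```

With these weights, a tweet labeled "rescue needed" and "injured" under moderate weather (severity 0.5) scores (3.0 + 2.0) × 1.5 = 7.5.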

The rescue scheduling component provides tools for managing the rescue operation. It provides a web interface so that rescue teams can manage their tasks, and an administrator can monitor task progress. A scheduling algorithm assigns tasks to rescue teams based on the tweet processing component’s priority and based on their capacity.
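A priority- and capacity-based assignment of this kind can be sketched with a greedy strategy. This is an assumed, simplified algorithm rather than the one used in STIMULATE: tasks are served in decreasing priority, each going to the team with the most remaining capacity, and every task is assumed to consume one unit of capacity.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass
class RescueTask:
    task_id: str
    priority: float


def schedule(
    tasks: Iterable[RescueTask], capacity: Dict[str, int]
) -> Tuple[Dict[str, List[str]], List[str]]:
    """Greedily assign tasks to teams; return (assignment, unassigned ids)."""
    remaining = dict(capacity)  # copy so the caller's dict is not modified
    assignment: Dict[str, List[str]] = {team: [] for team in capacity}
    unassigned: List[str] = []
    # Highest-priority tasks are served first.
    for task in sorted(tasks, key=lambda t: t.priority, reverse=True):
        # Pick the team with the most remaining capacity (first one on ties).
        team = max(remaining, key=remaining.get, default=None)
        if team is None or remaining[team] <= 0:
            unassigned.append(task.task_id)
            continue
        assignment[team].append(task.task_id)
        remaining[team] -= 1
    return assignment, unassigned
```

A production scheduler would also consider travel distance and team skills; the greedy rule is kept here only to show how priority and capacity interact.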

According to the NRF core capabilities taxonomy, this system is used during the response phase in mass search and rescue operations. Figure 3 describes the mapping of the use case to the reference architecture. The system uses two data providers, Twitter and Open Weather API, that expose a collection of interfaces. Data consumers are government institutions and NGOs since the system aims at coordinating institutional rescue efforts and volunteers. The big data framework provider provides data facilities (MongoDB) and task distribution (Hadoop). The case study uses neither an orchestrator component nor a management fabric.

The core of the STIMULATE system is mapped onto the big data application provider. The collection component consists of a web server that processes data requests and interacts with the data provider.

Data consumers carry out these requests through the collection interface within the access component, which is implemented as a web application. The collection component stores the information in the data facilities of the big data framework provider, in this case, the MongoDB database. The preparation component pre-processes incoming tweets by chaining geocoding and transformation tasks (i.e., management of slang, emojis, and contractions). Then the analytics component performs the tweet classification activity to determine the rescue priority that feeds the scheduling task. Results from the analytics component are shown in the visualization component, which provides two interfaces, one for administrators and one for rescue teams. The interface for rescue teams shows a route map for visiting each task location in order. Access to visualization is controlled by an authorization and authentication policy defined in the security and privacy fabric. The access component enables communication with data consumers. In this case, data consumers can configure and interact with the collection component and visualization component.
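The mapping of the case study onto the reference architecture can also be written down as data, which makes gaps easy to detect. The structure below is an illustrative sketch: the dictionary layout and the `missing_roles` helper are assumptions, while the role names and the mapped components are taken from the description above.

```python
from typing import Dict, List

# Roles and fabrics of NBDRA, as used throughout the reference architecture.
NBDRA_ROLES = (
    "system orchestrator",
    "data provider",
    "big data application provider",
    "big data framework provider",
    "data consumer",
    "management fabric",
    "security and privacy fabric",
)

# Components of the STIMULATE case study, grouped by the role they play.
STIMULATE_MAPPING: Dict[str, List[str]] = {
    "data provider": ["Twitter streaming API", "Open Weather API"],
    "big data application provider": [
        "collection", "preparation", "analytics", "visualization", "access",
    ],
    "big data framework provider": ["MongoDB", "Hadoop"],
    "data consumer": ["government institutions", "NGOs"],
    "security and privacy fabric": ["access control for visualization"],
}


def missing_roles(mapping: Dict[str, List[str]]) -> List[str]:
    """List the NBDRA roles that no component of the system fulfils."""
    return [role for role in NBDRA_ROLES if not mapping.get(role)]
```

Calling `missing_roles(STIMULATE_MAPPING)` returns `["system orchestrator", "management fabric"]`, the two roles that the case study does not cover.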

From this simple case study, some advantages of using a reference architecture can be pointed out. The proposed reference architecture can help us to evaluate the architecture, propose enhancements, and improve reusability. First, the system could benefit from using a management fabric for automating configuration, resource management, and monitoring. Second, the security and privacy fabric is only used for controlling access in the visualization component, which can be an issue since the system should preserve confidentiality, privacy, and security. Since the collection component’s functionality is not specific to this problem, the system could reuse available collection components designed with security and privacy in mind. Similarly, the preparation component is generic, and the system could benefit from a library of multi-lingual pre-processing components. Finally, the developed analytic component could be reused for other purposes. The use of well-defined interfaces would enable its reuse and improvement.

5. Discussion

This article proposes a reference architecture for big data processing in disaster management. The reference architecture has been designed inductively, based on an extensive review of the literature and the published implementation architectures in the domain. As a framework for its definition, we have chosen NBDRA, since it provides a general framework defined in a public working group, with participants from industry, academia, and government. As a result, NBDRA provides a vendor-neutral, technology-agnostic, and infrastructure-independent ecosystem.

The proposed reference architecture has identified the key components that are relevant for disaster management, and has categorized them based on NRF core capabilities [19] and the CommonKADS task hierarchy [88]. The combination of both taxonomies provides an explicit schema of knowledge reusability, and shows big data technologies’ applications for every single core capability for managing disasters. Given that many stakeholders participate in emergency management, the definition of standardized interfaces is essential for effective coordination of the efforts, the provision of access to data sources taking into account privacy and security concerns, and the customization of data consumer and data provider access. NBDRA defines functional components, and an actor can play several roles (i.e., data consumer and data provider). Since NBDRA supports the representation of stacking and chaining of big data systems [34], the cooperation of the big data systems participating in disaster management can also be represented in the proposed reference architecture. The need for cooperation is widely recognized in emergency management [140], since responses require a great diversity of skills and resources. Big data integration and Extract, Transform, and Load (ETL) technologies can be crucial for breaking down and bridging data silos [141]. Moreover, the proposed reference architecture can help in organizing and classifying existing experiences and sharing best practices.

We have detected that the component “security and privacy fabric” should receive more attention in this domain since most works do not mention how they address these concerns. As discussed in some reviews about big data technologies for disaster management [3,54,65], security and privacy issues are still a big challenge. Nevertheless, this problem is not specific to disaster management since big data introduces many privacy preservation challenges [142]. Thus, adopting a reference architecture can provide a good starting point for fostering the sharing and adoption of best practices.

A limitation of this work is that the reference architecture has been based on published research and should be complemented by consultation with domain stakeholders. Besides, the NIST Big Data Working Group has defined interfaces between the NBDRA components [36]. Nevertheless, no reference implementation of NBDRA is available yet; such an implementation could foster its adoption. Another limitation of this work is that we have focused on big data architectural aspects, but other aspects should be addressed. In particular, big data potential can only be achieved if legal, organizational, semantic, and technical interoperability is reached [143]. Some researchers report [144] that while technical interoperability has reached a high level of maturity, semantic and legal interoperability remain a significant barrier for the sector. Future work should be carried out to address semantic interoperability, taking into account existing standards, such as OASIS Emergency Data Exchange Language (EDXL) Emergency Standards [145], and semantic interoperability based on ontologies [146,147] to exploit the potential of disaster knowledge graphs [148].

6. Conclusions

This paper has focused on the definition of a Big Data Reference Architecture for Emergencies based on NBDRA. The aim of this work is to provide a common vocabulary for discussing Big Data architecture designs and implementations. Besides, reference models foster knowledge reusability. In the emergency domain, reusability can be achieved at different levels: datasets, data pipelines, and data processing and visualization software components. This research work aimed at identifying the essential components of a Big Data System for emergency management. For this, an extensive literature review has been carried out, and as a result, a reference architecture has been proposed inductively based on emergency management experiences. We have adopted NBDRA as a generic framework for describing Big Data systems adapted to a specific domain. Another aspect that we have addressed is the integration of crowdsourcing elements that enable the design and execution of hybrid data pipelines for emergency management.

We believe that Big Data analytics platforms will be integrated with crowdsourcing systems more frequently in the near future. Thus, it is essential to learn best practices and to define open models for sharing practices and components. This reference architecture is a first step towards providing a common framework for describing Big Data systems in the disaster domain. Large-scale disasters are characterized by the need for inter-organizational cooperation. When it comes to sharing data and data analytics, reference architectures can improve organizational cooperation, since standard interfaces enable the selection and combination of swappable components.

Our future work will be focused on two aspects. On the one hand, we will work on evaluating the reference architecture with disaster stakeholders. On the other hand, we are interested in the specification of components for exploiting disaster knowledge graphs and in the extension of NBDRA interfaces for interacting with these components.

Author Contributions

Conceptualization, C.A.I., A.F., and Á.C.; methodology, C.A.I. and A.F.; investigation, C.A.I., A.F., and Á.C.; resources, C.A.I., A.F., and Á.C.; data curation, C.A.I., A.F., and Á.C.; writing—original draft preparation, C.A.I.; writing—review and editing, C.A.I., A.F., and Á.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been partially funded by the Spanish Ministry of Science and Innovation (Ministerio de Ciencia e Innovación) under the R&D project COGNOS (PID2019-105484RB-I00) and by the Spanish Ministry of Education, Culture, and Sport (Ministerio de Educación, Cultura y Deporte) through the mobility research stay grant PRX16/00515.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CEOS	Committee on Earth Observation Satellites
CSP	Crowdsourced Stream Processing
CRG	Community Response Grids
ECN	Emergency Communication Network
EDXL	Emergency Data Exchange Language
ETL	Extract, Transform, and Load
FEMA	Federal Emergency Management Agency
GIS	Geographical Information System
ICT	Information and Communication Technologies
NBDRA	NIST Big Data Reference Architecture
NGO	Non-Governmental Organization
NRF	National Response Framework
OSM	OpenStreetMap
SM	Social Media
SNS	Social Network Sites
USGS	US Geological Survey
VGI	Volunteer Geographic Information

References

Chang, W.L.; Grady, N. NIST Big Data Interoperability Framework: Volume 1, Definitions; Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2019.
Wang, J.; Wu, Y.; Yen, N.; Guo, S.; Cheng, Z. Big Data Analytics for Emergency Communication Networks: A Survey. IEEE Commun. Surv. Tutor. 2016, 18, 1758–1778.
Arslan, M.; Roxin, A.M.; Cruz, C.; Ginhac, D. A review on applications of big data for disaster management. In Proceedings of the 2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), Jaipur, India, 4–7 December 2017; pp. 370–375.
Alexander, D.E. Social Media in Disaster Risk Reduction and Crisis Management. Sci. Eng. Ethics 2014, 20, 717–733.
Neal, D.M.; Phillips, B.D. Effective Emergency Management: Reconsidering the Bureaucratic Approach. Disasters 1995, 19, 327–337.
Castillo, C. Big Crisis Data; Cambridge University Press: Cambridge, UK, 2016.
Scolobig, A.; Prior, T.; Schröter, D.; Jörin, J.; Patt, A. Towards people-centred approaches for effective disaster risk management: Balancing rhetoric with reality. Int. J. Disaster Risk Reduct. 2015, 12, 202–212.
Boin, A.; McConnell, A. Preparing for Critical Infrastructure Breakdowns: The Limits of Crisis Management and the Need for Resilience. J. Contingencies Crisis Manag. 2007, 15, 50–59.
Manso, M.; Manso, B. The Role of Social Media in Crisis: A European holistic approach to the adoption of online and mobile communications in crisis response and search and rescue efforts. In Proceedings of the 17th International Command & Control Research & Technology Symposium, Fairfax, VA, USA, 19–21 June 2012.
Gujer, E.; Weekes, B.; Gasser, U.; Maclay, C.; Best, M. Intelligence of the Masses or Stupidity of the Herd? In Peacebuilding in the Information Age: Sifting Hype from Reality; The Berkman Klein Center for Internet & Society at Harvard University: Cambridge, MA, USA, 2011; pp. 23–25.
McClendon, S.; Robinson, A.C.; Currion, P.; de Silva, C.; Walle, B.V.D.; Field, K.; O’Brien, J.; Intagorn, S.; Lerman, K.; Jennex, M.; et al. Leveraging Geospatially-Oriented Social Media Communications in Disaster Response. Int. J. Inf. Syst. Crisis Response Manag. 2013, 5, 22–40.
Gao, H. Harnessing the Crowdsourcing Power of Social Media for Disaster Relief. IEEE Intell. Syst. 2011, 26, 10–14.
Morrow, N.; Mock, N.; Papendieck, A.; Kocmich, N. Independent evaluation of the Ushahidi Haiti project. Dev. Inf. Syst. Int. 2011, 8, 2011.
Imran, M.; Castillo, C.; Lucas, J.; Patrick, M.; Rogstadius, J. Coordinating human and machine intelligence to classify microblog communications in crises. In Proceedings of the 11th International ISCRAM Conference, State College, PA, USA, 18–21 May 2014.
Alexander, D.E. Principles of Emergency Planning and Management; Oxford University Press on Demand: Oxford, UK, 2002.
Coetzee, C.; Van Niekerk, D. Tracking the evolution of the disaster management cycle: A general system theory approach. Jàmbá J. Disaster Risk Stud. 2012, 4, 1–9.
Barid, M.E. The Phases of Emergency Management; University of Memphis: Memphis, TN, USA, 2014.
Khan, H.; Khan, A. Natural Hazards and Disaster Management in Pakistan. Technical Report 11052; Munich Personal RePEc Archive. 2008. Available online: https://mpra.ub.uni-muenchen.de/11052/ (accessed on 2 December 2020).
Federal Emergency Management Agency (FEMA). Overview of the National Planning Frameworks; U.S. Department of Homeland Security: Hyattsville, MD, USA, 2016.
Federal Emergency Management Agency (FEMA). National Prevention Framework; U.S. Department of Homeland Security: Hyattsville, MD, USA, 2016.
Federal Emergency Management Agency (FEMA). National Protection Framework; U.S. Department of Homeland Security: Hyattsville, MD, USA, 2016.
Federal Emergency Management Agency (FEMA). National Mitigation Framework; U.S. Department of Homeland Security: Hyattsville, MD, USA, 2016.
Federal Emergency Management Agency (FEMA). National Response Framework; U.S. Department of Homeland Security: Hyattsville, MD, USA, 2016.
Federal Emergency Management Agency (FEMA). National Disaster Recovery Framework; U.S. Department of Homeland Security: Hyattsville, MD, USA, 2016.
Nakagawa, E.Y.; Antonino, P.O.; Becker, M. Reference architecture and product line architecture: A subtle but critical difference. In Proceedings of the European Conference on Software Architecture, Essen, Germany, 13–16 September 2011; Springer: Berlin, Germany, 2011; pp. 207–211.
Pääkkönen, P.; Pakkala, D. Reference architecture and classification of technologies, products and services for big data systems. Big Data Res. 2015, 2, 166–186.
Sang, G.M.; Xu, L.; De Vrieze, P. A reference architecture for big data systems. In Proceedings of the 2016 10th International Conference on Software, Knowledge, Information Management & Applications (SKIMA), Chengdu, China, 15–17 December 2016; pp. 370–375.
Nadal, S.; Herrero, V.; Romero, O.; Abelló, A.; Franch, X.; Vansummeren, S.; Valerio, D. A software reference architecture for semantic-aware Big Data systems. Inf. Softw. Technol. 2017, 90, 75–92.
Klein, J.; Buglak, R.; Blockow, D.; Wuttke, T.; Cooper, B. A reference architecture for big data systems in the national security domain. In Proceedings of the 2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE), Austin, TX, USA, 16 May 2016; pp. 51–57.
Zhang, X.; Ming, X.; Yin, D. Reference architecture of common service platform for Industrial Big Data (I-BD) based on multi-party co-construction. Int. J. Adv. Manuf. Technol. 2019, 105, 1949–1965.
Palanivel, K.; Chithralekha, T. Big Data Reference Architecture for e-Learning Analytical Systems. Int. J. Recent Innov. Trends Comput. Commun. 2018, 6, 55–67.
Alam, A.; Ullah, I.; Lee, Y.K. Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues. IEEE Access 2020, 8, 152377–152422.
Santana, E.F.Z.; Chaves, A.P.; Gerosa, M.A.; Kon, F.; Milojicic, D.S. Software Platforms for Smart Cities: Concepts, Requirements, Challenges, and a Unified Reference Architecture. ACM Comput. Surv. 2017, 50, 78.
Chang, W.L.; Boyd, D.; Levin, O. NIST Big Data Interoperability Framework: Volume 6, Reference Architecture; Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2018.
Philip Chen, C.; Zhang, C.Y. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Inf. Sci. 2014, 275, 314–347.
Chang, W.L.; Marcus, B.; Baru, C. NIST Big Data Interoperability Framework: Volume 8, Reference Architecture Interfaces; Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2019.
Chang, W.L.; Marcus, B.; Baru, C. NIST Big Data Interoperability Framework: Volume 9, Adoption and Modernization; Technical Report; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2019.
Cavanillas, J.M.; Curry, E.; Wahlster, W. (Eds.) New Horizons for a Data-Driven Economy—A Roadmap for Big Data in Europe; Springer: Berlin, Germany, 2015; p. 303.
Tekiner, F.; Keane, J.A. Big data framework. In Proceedings of the 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013, Manchester, UK, 13–16 October 2013; pp. 1494–1499.
Immonen, A.; Ovaska, E. Evaluating the Quality of Social Media Data in Big Data Architecture. IEEE Access 2015, 3, 2028–2043.
Laney, D. 3D data management: Controlling data volume, velocity and variety. META Group Res. Note 2001, 6, 1.
Miele, S.; Shockley, R. Analytics: The Real-World Use of Big Data; IBM Institute for Business Value: Somers, NY, USA, 2013.
Qadir, J.; Ali, A.; ur Rasool, R.; Zwitter, A.; Sathiaseelan, A.; Crowcroft, J. Crisis analytics: Big data-driven crisis response. J. Int. Humanit. Action 2016, 1, 1–21.
Estrin, D. Small data, where n= me. Commun. ACM 2014, 57, 32–34.
Meier, P. Digital Humanitarians: How Big Data Is Changing the Face of Humanitarian Response; CRC Press: Boca Raton, FL, USA, 2015.
Cumbane, S.P.; Gidófalvi, G. Review of Big Data and Processing Frameworks for Disaster Response Applications. ISPRS Int. J. Geo-Inf. 2019, 8, 387.
Kwak, Y.J. Nationwide flood monitoring for disaster risk reduction using multiple satellite data. ISPRS Int. J. Geo-Inf. 2017, 6, 203.
Duda, K.A.; Jones, B.K. USGS remote sensing coordination for the 2010 Haiti earthquake. Photogramm. Eng. Remote Sens. 2011, 77, 899–907.
Kryvasheyeu, Y.; Chen, H.; Moro, E.; Van Hentenryck, P.; Cebrian, M. Performance of social network sensors during Hurricane Sandy. PLoS ONE 2015, 10, e0117288.
Conrado, S.P.; Neville, K.; Woodworth, S.; O’Riordan, S. Managing social media uncertainty to support the decision making process during emergencies. J. Decis. Syst. 2016, 25, 171–181.
Hristidis, V.; Chen, S.C.; Li, T.; Luis, S.; Deng, Y. Survey of data management and analysis in disaster situations. J. Syst. Softw. 2010, 83, 1701–1714.
Miyazaki, H.; Nagai, M.; Shibasaki, R. Reviews of Geospatial Information Technology and Collaborative Data Delivery for Disaster Risk Management. ISPRS Int. J. Geo-Inf. 2015, 4, 1936–1964.
Li, T.; Xie, N.; Zeng, C.; Zhou, W.; Zheng, L.; Jiang, Y.; Yang, Y.; Ha, H.Y.; Xue, W.; Huang, Y.; et al. Data-driven techniques in disaster information management. ACM Comput. Surv. (CSUR) 2017, 50, 1–45.
Yu, M.; Yang, C.; Li, Y. Big data in natural disaster management: A review. Geosciences 2018, 8, 165.
Joseph, J.K.; Dev, K.A.; Pradeepkumar, A.; Mohan, M. Big data analytics and social media in disaster management. In Integrating Disaster Science and Management; Elsevier: Amsterdam, The Netherlands, 2018; pp. 287–294.
Freeman, J.D.; Blacker, B.; Hatt, G.; Tan, S.; Ratcliff, J.; Woolf, T.B.; Tower, C.; Barnett, D.J. Use of big data and information and communications technology in disasters: An integrative review. Disaster Med. Public Health Prep. 2019, 13, 353–367.
Song, X.; Zhang, H.; Akerkar, R.A.; Huang, H.; Guo, S.; Zhong, L.; Ji, Y.; Opdahl, A.L.; Purohit, H.; Skupin, A.; et al. Big Data and Emergency Management: Concepts, Methodologies, and Applications. IEEE Trans. Big Data 2020.
Goswami, S.; Chakraborty, S.; Ghosh, S.; Chakrabarti, A.; Chakraborty, B. A review on application of data mining techniques to combat natural disasters. Ain Shams Eng. J. 2018, 9, 365–378. [Google Scholar] [CrossRef] [Green Version]
Zhang, X.Y.; Li, X.; Lin, X. The data mining technology of particle swarm optimization algorithm in earthquake prediction. Adv. Mater. Res. 2014, 989, 1570–1573. [Google Scholar] [CrossRef]
Sakaki, T.; Okazaki, M.; Matsuo, Y. Earthquake shakes Twitter users: Real-time event detection by social sensors. In Proceedings of the 19th International Conference on World Wide Web, Raleigh, NC, USA, 26–30 April 2010; pp. 851–860. [Google Scholar]
SHIROTA, Y. Temporal awareness of needs after east japan great earthquake using latent semantic analysis. Inf. Model. Knowl. Bases XXV 2014, 25, 200. [Google Scholar]
Zhang, K.; Chen, S.C.; Singh, P.; Saleem, K.; Zhao, N. A 3d visualization system for hurricane storm-surge flooding. IEEE Comput. Graph. Appl. 2006, 26, 18–25. [Google Scholar] [CrossRef]
Surakitbanharn, C.; Ebert, D.S. Improving the communication of emergency and disaster information using visual analytics. In Proceedings of the International Conference on Applied Human Factors and Ergonomics, Los Angeles, CA, USA, 17–21 July 2017; Springer: Berlin, Germany, 2017; pp. 143–152. [Google Scholar]
Hsu, W.K.; Chiang, W.L.; Xue, Q.; Hung, D.M.; Huang, P.C.; Chen, C.W.; Tsai, C.H. A probabilistic approach for earthquake risk assessment based on an engineering insurance portfolio. Nat. Hazards 2013, 65, 1559–1571. [Google Scholar] [CrossRef]
Akter, S.; Wamba, S.F. Big data and disaster management: A systematic review and agenda for future research. Ann. Oper. Res. 2019, 283, 939–959. [Google Scholar] [CrossRef] [Green Version]
Yates, D.; Paquette, S. Emergency knowledge management and social media technologies: A case study of the 2010 Haitian earthquake. Int. J. Inf. Manag. 2011, 31, 6–13. [Google Scholar] [CrossRef]
Burns, R. Rethinking big data in digital humanitarianism: Practices, epistemologies, and social relations. GeoJournal 2015, 80, 477–490. [Google Scholar] [CrossRef]
Kaplan, A.M.; Haenlein, M. Users of the world, unite! The challenges and opportunities of Social Media. Bus. Horiz. 2010, 53, 59–68. [Google Scholar] [CrossRef]
Boyd, D.M.; Ellison, N.B. Social Network Sites: Definition, History, and Scholarship. J. Comput. Mediat. Commun. 2007, 13, 210–230. [Google Scholar] [CrossRef] [Green Version]
Howe, J. The Rise of Crowdsourcing. Wired Mag. 2006, 14, 1–5. [Google Scholar]
Yin, J.; Lampert, A.; Cameron, M.; Robinson, B.; Power, R. Using Social Media to Enhance Emergency Situation Awareness. IEEE Intell. Syst. 2012, 27, 52–59. [Google Scholar] [CrossRef]
Simon, T.; Goldberg, A.; Adin, B. Socializing in emergencies—A review of the use of social media in emergency situations. Int. J. Inf. Manag. 2015, 35, 609–619. [Google Scholar] [CrossRef] [Green Version]
Teodorescu, H.N. Using analytics and social media for monitoring and mitigation of social disasters. Procedia Eng. 2015, 107, 325–334. [Google Scholar] [CrossRef] [Green Version]
Anson, S.; Watson, H.; Wadhwa, K.; Metz, K. Analysing social media data for disaster preparedness: Understanding the opportunities and barriers faced by humanitarian actors. Int. J. Disaster Risk Reduct. 2017, 21, 131–139. [Google Scholar] [CrossRef]
Saroj, A.; Pal, S. Use of social media in crisis management: A survey. Int. J. Disaster Risk Reduct. 2020, 48, 101584. [Google Scholar] [CrossRef]
Poblet, M.; García-Cuesta, E.; Casanovas, P. Crowdsourcing tools for disaster management: A review of platforms and methods. In Proceedings of the International Workshop on AI Approaches to the Complexity of Legal Systems, Bologna, Italy, 11 December 2013; Springer: Berlin, Germany, 2013; pp. 261–274. [Google Scholar]
Liu, S.B. Crisis Crowdsourcing Framework: Designing Strategic Configurations of Crowdsourcing for the Emergency Management Domain. Comput. Support. Coop. Work (CSCW) 2014, 23, 389–443. [Google Scholar] [CrossRef]
Poblet, M.; García-Cuesta, E.; Casanovas, P. Crowdsourcing roles, methods and tools for data-intensive disaster management. Inf. Syst. Front. 2018, 20, 1363–1379. [Google Scholar] [CrossRef]
Kankanamge, N.; Yigitcanlar, T.; Goonetilleke, A.; Kamruzzaman, M. Can volunteer crowdsourcing reduce disaster risk? A systematic review of the literature. Int. J. Disaster Risk Reduct. 2019, 35, 101097. [Google Scholar] [CrossRef]
Fernandez-Luque, L.; Imran, M. Humanitarian health computing using artificial intelligence and social media: A narrative literature review. Int. J. Med Inform. 2018, 114, 136–142. [Google Scholar] [CrossRef]
Imran, M.; Castillo, C.; Diaz, F.; Vieweg, S. Processing social media messages in mass emergency: A survey. ACM Comput. Surv. (CSUR) 2015, 47, 1–38. [Google Scholar] [CrossRef]
Nazer, T.H.; Xue, G.; Ji, Y.; Liu, H. Intelligent disaster response via social media analysis: A survey. ACM SIGKDD Explor. Newsl. 2017, 19, 46–59. [Google Scholar] [CrossRef]
Imran, M.; Castillo, C.; Diaz, F.; Vieweg, S. Processing social media messages in mass emergency: Survey summary. In Proceedings of the Companion Proceedings of the Web Conference 2018, Lyon, France, 23–27 April 2018; pp. 507–511. [Google Scholar]
Imran, M.; Lykourentzou, I.; Castillo, C. Engineering Crowdsourced Stream Processing Systems. arXiv 2013, arXiv:1310.5463. [Google Scholar]
Dennison, D.; Harvey, T. Data Processing Pipelines. Available online: https://research.google/pubs/pub45329/ (accessed on 4 December 2020).
Whipkey, K.; Verity, A. Guidance for Incorporating Big Data into Humanitarian Operations; Technical Report; Digital Humanitarian Network, 2015. Available online: https://www.digitalhumanitarians.com/ (accessed on 4 December 2020).
Mulder, F.; Ferguson, J.; Groenewegen, P.; Boersma, K.; Wolbers, J.; Avgerou, C.; Baack, S.; Brown, J.; Duguid, P.; Cooke, B.; et al. Questioning Big Data: Crowdsourcing crisis data towards an inclusive humanitarian response. Big Data Soc. 2016, 3, 133–146. [Google Scholar] [CrossRef]
Schreiber, G.; Akkermans, H.; Anjewierden, A.; de Hoog, R.; Shadbolt, N.; Van de Velde, W.; Wielinga, B. Knowledge Engineering and Management: The CommonKADS Methodology; MIT Press: Cambridge, MA, USA, 2000. [Google Scholar]
UN Global Pulse. Big Data for Development: Challenges & Opportunities; UN Global Pulse: New York, NY, USA, 2012. [Google Scholar]
Shams, F.; Cerone, A.; Nicola, R.D. On Integrating Social and Sensor Networks for Emergency Management. In Software Engineering and Formal Methods; Springer: Berlin/Heidelberg, Germany, 2016; pp. 145–160. [Google Scholar]
CEOS Disaster SBA team and DI-06-09 GEO Task Group. Use of Satellites for Risk Management. Volume I Establishing Global Requirements for Earth Observation Satellite Data to Support Multi-Hazard Disaster Management throughout the Disaster Cycle; Technical Report; CEOS: Gaithersburg, MD, USA, 2008. [Google Scholar]
Du, C.; Zhu, S. Research on urban public safety emergency management early warning system based on technologies for the Internet of Things. Procedia Eng. 2012, 45, 748–754. [Google Scholar] [CrossRef] [Green Version]
Lin, W.Y.; Wu, T.H.; Tsai, M.H.; Hsu, W.C.; Chou, Y.T.; Kang, S.C. Filtering disaster responses using crowdsourcing. Autom. Constr. 2018, 91, 182–192. [Google Scholar] [CrossRef]
Hester, V.; Shaw, A.; Biewald, L. Scalable crisis relief: Crowdsourced SMS translation and categorization with Mission 4636. In Proceedings of the First ACM Symposium on Computing for Development, London, UK, 17–18 December 2010; pp. 1–7. [Google Scholar]
Gómez, J.; Manso, M.A.; Alcarria, R. Volunteering assistance to online geocoding services through a distributed knowledge solution. In Proceedings of the RICH-VGI Workshop at 18th AGILE Conference on Geographic Information Science, Lisbon, Portugal, 9–12 June 2015. [Google Scholar]
Jonathan, C.; Mokbel, M.F. Stella: Geotagging images via crowdsourcing. In Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 6–9 November 2018; pp. 169–178. [Google Scholar]
Imran, M.; Alam, F.; Qazi, U.; Peterson, S.; Ofli, F. Rapid Damage Assessment Using Social Media Images by Combining Human and Machine Intelligence. arXiv 2020, arXiv:2004.06675. [Google Scholar]
Mirkin, S.; Venkatapathy, S.; Dymetman, M. Confidence-driven rewriting for improved translation. In Proceedings of the XIV MT Summit, Nice, France, 2–6 September 2013; pp. 257–264. [Google Scholar]
Wu, S.Y.; Thawonmas, R.; Chen, K.T. Video summarization via crowdsourcing. In Proceedings of the CHI’11 Extended Abstracts on Human Factors in Computing Systems, Vancouver, BC, Canada, 10–11 May 2011; pp. 1531–1536. [Google Scholar]
Venetis, P.; Garcia-Molina, H. Quality control for comparison microtasks. In Proceedings of the First International Workshop on Crowdsourcing and Data Mining, Beijing, China, 12 August 2012; pp. 15–21. [Google Scholar]
Wong, H.T.; Chiang, V.C.L.; Choi, K.S.; Loke, A.Y. The need for a definition of Big Data for nursing science: A case study of disaster preparedness. Int. J. Environ. Res. Public Health 2016, 13, 1015. [Google Scholar] [CrossRef]
Nagendra, N.P.; Narayanamurthy, G.; Moser, R. Management of humanitarian relief operations using satellite big data analytics: The case of Kerala floods. Ann. Oper. Res. 2020, 1–26. [Google Scholar] [CrossRef]
Bates, D.W.; Saria, S.; Ohno-Machado, L.; Shah, A.; Escobar, G. Big data in health care: Using analytics to identify and manage high-risk and high-cost patients. Health Aff. 2014, 33, 1123–1131. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Jiang, W.; Jiang, Y. Construction of water pollution monitoring model after flood disaster based on big data analysis. Ccamlr Sci. 2019, 495–506. Available online: https://go.gale.com/ps/anonymous?id=GALE%7CA634165416&sid=googleScholar&v=2.1&it=r&linkaccess=abs&issn=10234063&p=AONE&sw=w (accessed on 4 December 2020).
Rathore, M.M.; Paul, A.; Ahmad, A.; Imran, M.; Guizani, M. Big data analytics of geosocial media for planning and real-time decisions. In Proceedings of the 2017 IEEE International Conference on Communications (ICC), Paris, France, 21–25 May 2017; pp. 1–6. [Google Scholar]
Yang, T.; Xie, J.; Li, G.; Mou, N.; Li, Z.; Tian, C.; Zhao, J. Social Media Big Data Mining and Spatio-Temporal Analysis on Public Emotions for Disaster Mitigation. ISPRS Int. J. Geo-Inf. 2019, 8, 29. [Google Scholar] [CrossRef] [Green Version]
Júnior, P.S.; Novais, R.; Vieira, V.; Pedraza, L.G.; Mendonça, M.; Villela, K. Visualization mechanisms for crowdsourcing information in emergency coordination. In Proceedings of the 14th Brazilian Symposium on Human Factors in Computing Systems, Salvador, Brazil, 3–6 November 2015; pp. 1–8. [Google Scholar]
Macdonell, C. Ushahidi: A crisis mapping system. ACM SIGCAS Comput. Soc. 2015, 45, 38. [Google Scholar] [CrossRef]
Baxter, P.; Aspinall, W.; Neri, A.; Zuccaro, G.; Spence, R.; Cioni, R.; Woo, G. Emergency planning and mitigation at Vesuvius: A new evidence-based approach. J. Volcanol. Geotherm. Res. 2008, 178, 454–473. [Google Scholar] [CrossRef] [Green Version]
Zhuang, Y.; Yu, K.; Wang, D.; Ding, W. An evaluation of big data analytics in feature selection for long-lead extreme floods forecasting. In Proceedings of the 2016 IEEE 13th International Conference on Networking, Sensing, and Control (ICNSC), Mexico City, Mexico, 28–30 April 2016; pp. 1–6. [Google Scholar]
Wang, Y.; Deng, M.; Bao, Y.; Zhang, H.; Chen, J.; Qian, J.; Guo, C. Power system disaster-mitigating dispatch platform based on big data. In Proceedings of the 2014 International Conference on Power System Technology, Chengdu, China, 20–22 October 2014; pp. 1014–1019. [Google Scholar]
Kontokosta, C.E.; Malik, A. The Resilience to Emergencies and Disasters Index: Applying big data to benchmark and validate neighborhood resilience capacity. Sustain. Cities Soc. 2018, 36, 272–285. [Google Scholar] [CrossRef]
Gouveia, J.P.; Palma, P. Harvesting big data from residential building energy performance certificates: Retrofitting and climate change mitigation insights at a regional scale. Environ. Res. Lett. 2019, 14, 095007. [Google Scholar] [CrossRef]
Kim, H.S.; Sun, C.G.; Cho, H.I. Geospatial big data-based geostatistical zonation of seismic site effects in Seoul metropolitan area. ISPRS Int. J. Geo-Inf. 2017, 6, 174. [Google Scholar] [CrossRef] [Green Version]
Wang, R.Q.; Mao, H.; Wang, Y.; Rae, C.; Shaw, W. Hyper-resolution monitoring of urban flooding with social media and crowdsourcing data. Comput. Geosci. 2018, 111, 139–147. [Google Scholar] [CrossRef] [Green Version]
Merchant, R.M.; Elmer, S.; Lurie, N. Integrating social media into emergency-preparedness efforts. N. Engl. J. Med. 2011, 365, 289–291. [Google Scholar] [CrossRef] [Green Version]
Barren, D. The President’s National Security Telecommunications Advisory Committee. In Proceedings of the MILCOM 2006-2006 IEEE Military Communications Conference, Washington, DC, USA, 23–25 October 2006. [Google Scholar]
Lee, Y.; Watanabe, K.; Li, W.S. Enhancing regional digital preparedness on natural hazards to safeguard business resilience in the Asia-Pacific. In Proceedings of the International Conference on Information Technology in Disaster Risk Reduction, Sofia, Bulgaria, 16–18 November 2016; Springer: Berlin, Germany, 2016; pp. 170–182. [Google Scholar]
Fekete, A. Critical infrastructure cascading effects. Disaster resilience assessment for floods affecting city of Cologne and Rhein-Erft-Kreis. J. Flood Risk Manag. 2020, 13, e312600. [Google Scholar] [CrossRef]
Itoh, M.; Yokoyama, D.; Toyoda, M. Visual Exploration of Changes in Passenger Flows and Tweets on Mega-City Metro Network. IEEE Trans. Big Data 2016, 2, 85–99. [Google Scholar] [CrossRef]
Muhammad, A.; Goda, K. Impact of earthquake source complexity and land elevation data resolution on tsunami hazard assessment and fatality estimation. Comput. Geosci. 2018, 112, 83–100. [Google Scholar] [CrossRef]
Lian, X.; Melancon, S.; Presta, J.R.; Reevesman, A.; Spiering, B.; Woodbridge, D. Scalable Real-time Prediction and Analysis of San Francisco Fire Department Response Times. In Proceedings of the 2019 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI), Leicester, UK, 19–23 August 2019; pp. 694–699. [Google Scholar]
Mishra, S.; Singh, S.P. A stochastic disaster-resilient and sustainable reverse logistics model in big data environment. Ann. Oper. Res. 2020, 1–32. [Google Scholar] [CrossRef]
Rakes, T.R.; Deane, J.K.; Rees, L.P.; Fetter, G.M. A decision support system for post-disaster interim housing. Decis. Support Syst. 2014, 66, 160–169. [Google Scholar] [CrossRef]
Berawi, M.A.; Siahaan, S.A.O.; Miraj, P.; Leviakangas, P. Determining the Prioritized Victim of Earthquake Disaster Using Fuzzy Logic and Decision Tree Approach. Evergreen 2020, 7, 246–252. [Google Scholar] [CrossRef]
Zahra, K.; Imran, M.; Ostermann, F.O. Automatic identification of eyewitness messages on twitter during disasters. Inf. Process. Manag. 2020, 57, 102107. [Google Scholar] [CrossRef]
Zhong, L.; Takano, K.; Ji, Y.; Yamada, S. Big Data Based Service Area Estimation for Mobile Communications during Natural Disasters. In Proceedings of the 2016 30th International Conference on Advanced Information Networking and Applications Workshops (WAINA), Crans-Montana, Switzerland, 23–25 March 2016; pp. 687–692. [Google Scholar]
Caragea, C.; Squicciarini, A.C.; Stehle, S.; Neppalli, K.; Tapia, A.H. Mapping moods: Geo-mapped sentiment analysis during hurricane Sandy. In Proceedings of the ISCRAM 2014 Conference Proceedings—11th International Conference on Information Systems for Crisis Response and Management, University Park, PA, USA, 18–21 May 2014; pp. 642–651. [Google Scholar]
Patra, R. Automated Categorization and Mining Tweets for Disaster Management. In Machine Learning Algorithms for Industrial Applications; Springer: Berlin, Germany, 2020; pp. 37–51. [Google Scholar]
Román, M.O.; Stokes, E.C.; Shrestha, R.; Wang, Z.; Schultz, L.; Carlo, E.A.S.; Sun, Q.; Bell, J.; Molthan, A.; Kalb, V.; et al. Satellite-based assessment of electricity restoration efforts in Puerto Rico after Hurricane Maria. PLoS ONE 2019, 14, e0218883. [Google Scholar] [CrossRef]
Mudigonda, S.; Ozbay, K.; Bartin, B. Evaluating the resilience and recovery of public transit system using big data: Case study from New Jersey. J. Transp. Saf. Secur. 2019, 11, 491–519. [Google Scholar] [CrossRef]
Guo, J.; Wu, X.; Wei, G. A new economic loss assessment system for urban severe rainfall and flooding disasters based on big data fusion. Environ. Res. 2020, 188, 109822. [Google Scholar] [CrossRef]
Banisakher, M.; Nguyen, V.; Mohammed, D. Big Data Analysis and Simulation for Performance Measurement of Hospitals in Emergency Situations. Int. J. Simul. Syst. Sci. Technol. 2017, 18. [Google Scholar] [CrossRef]
Shibuya, Y.; Tanaka, H. Socio-economic disaster recovery captured by big housing market data. In Proceedings of the 2019 IEEE Global Humanitarian Technology Conference (GHTC), Seattle, WA, USA, 17–20 October 2019; pp. 1–8. [Google Scholar]
Contreras, D.; Wilkinson, S.; Balan, N.; Phengsuwan, J.; James, P. Assessing Post-Disaster Recovery Using Sentiment Analysis. The case of L’Aquila, Haiti, Chile and Canterbury. In Proceedings of the 17th World Conference on Earthquake Engineering, Sendai, Japan, 19–24 July 2022. [Google Scholar]
Kabir, M.Y.; Gruzdev, S.; Madria, S. STIMULATE: A System for Real-time Information Acquisition and Learning for Disaster Management. In Proceedings of the 2020 21st IEEE International Conference on Mobile Data Management (MDM), Versailles, France, 30 June–3 July 2020; pp. 186–193. [Google Scholar]
Bradshaw, S.; Brazil, E.; Chodorow, K. MongoDB: The Definitive Guide: Powerful and Scalable Data Storage; O’Reilly Media: Sebastopol, CA, USA, 2019. [Google Scholar]
Gulli, A.; Pal, S. Deep Learning with Keras; Packt Publishing Ltd.: Birmingham, UK, 2017. [Google Scholar]
Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. TensorFlow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, USA, 2–4 December 2016; pp. 265–283. [Google Scholar]
Wolf-Fordham, S. Integrating Government Silos: Local Emergency Management and Public Health Department Collaboration for Emergency Planning and Response. Am. Rev. Public Adm. 2020, 50, 560–567. [Google Scholar] [CrossRef]
Patel, J. Bridging Data Silos Using Big Data Integration. Int. J. Database Manag. Syst. 2019, 11, 1–6. [Google Scholar] [CrossRef]
Chamikara, M.A.P.; Bertók, P.; Liu, D.; Camtepe, S.; Khalil, I. Efficient privacy preservation of big data for accurate data mining. Inf. Sci. 2020, 527, 420–443. [Google Scholar] [CrossRef] [Green Version]
Scheerlinck, J.; Eeghem, F.V.; Loutas, N. Big Data Interoperability Analysis; Technical Report SC508DI07171; European Union, ISA Programme, EU: Brussels, Belgium, 2018; Available online: https://joinup.ec.europa.eu/sites/default/files/document/2018-05/SC508DI07171%20D05.02%20Big%20Data%20Interoperability%20Analysis_v1.00.pdf (accessed on 2 December 2020).
Mazimwe, A.; Hammouda, I.; Gidudu, A. An empirical evaluation of data interoperability—A case of the disaster management sector in Uganda. ISPRS Int. J. Geo-Inf. 2019, 8, 484. [Google Scholar] [CrossRef] [Green Version]
OASIS. Emergency Data Exchange Language (EDXL) Implementer’s Guide; OASIS: Burlington, MA, USA, 2005. [Google Scholar]
Gençtürk, M.; Evci, E.; Guney, A.; Kabak, Y.; Erturkmen, G.B.L. Achieving semantic interoperability in emergency management domain. In Proceedings of the International Symposium on Environmental Software Systems, Zadar, Croatia, 10–12 May 2017; Springer: Berlin, Germany, 2017; pp. 279–289. [Google Scholar]
Barros, R.; Kislansky, P.; do Nascimento Salvador, L.; Almeida, R.; Breyer, M.; Pedraza, L.G. EDXL-RESCUER Ontology: Conceptual Model for Semantic Integration. In Proceedings of the ISCRAM 2015 Conference, Kristiansand, Norway, 24–27 May 2015. [Google Scholar]
Purohit, H.; Kanagasabai, R.; Deshpande, N. Towards Next Generation Knowledge Graphs for Disaster Management. In Proceedings of the 2019 IEEE 13th International Conference on Semantic Computing (ICSC), Newport Beach, CA, USA, 30 January–1 February 2019; pp. 474–477. [Google Scholar]

Figure 1. Overview of the NIST Big Data Reference Architecture.

Figure 2. High Level Big Data Framework for Emergency Management.

Figure 3. Mapping between STIMULATE use case and the reference architecture for emergency management.

Table 1. Emergency core capabilities per emergency phase adapted from [19], where ✗ denotes that a core capability is required in the emergency phase.

Core Capability	Mitigation	Preparedness	Response	Recovery
Planning	✗	✗	✗	✗
Public information and Warning	✗	✗	✗	✗
Operational Coordination	✗	✗	✗	✗
Intelligence and Information Sharing		✗
Community Resilience	✗
Long-term vulnerability reduction	✗
Risk and Disaster Resilience Assessment	✗
Threats and Hazards Identification	✗
Infrastructure Systems			✗	✗
Critical transportation			✗
Environmental Response/Health and Safety			✗
Fatality Management Services			✗
Fire Management and Suppression			✗
Logistics and Supply Chain Management			✗
Mass Care Services			✗
Mass Search and Rescue Operations			✗
On-scene Security, Protection and Law Enforcement			✗
Operational Communications			✗
Public Health, Healthcare, and Emergency Medical Services			✗
Situational Assessment			✗
Economic recovery				✗
Health and Social Services				✗
Housing				✗
Natural and Cultural Resources				✗
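
As an illustration only, the capability-to-phase mapping of Table 1 can be held in a small lookup structure so that the capabilities relevant to a given phase can be listed programmatically. The following is a minimal sketch; the names `Phase`, `CORE_CAPABILITIES`, and `capabilities_for` are hypothetical and are not part of the reference architecture or of any library.

```python
from enum import Enum


class Phase(Enum):
    """Emergency phases used as the columns of Table 1."""
    MITIGATION = "Mitigation"
    PREPAREDNESS = "Preparedness"
    RESPONSE = "Response"
    RECOVERY = "Recovery"


# Short aliases, used only to keep the table below compact.
M, P, R, C = Phase.MITIGATION, Phase.PREPAREDNESS, Phase.RESPONSE, Phase.RECOVERY

# Rows of Table 1: core capability -> phases in which it is marked as required.
CORE_CAPABILITIES = {
    "Planning": {M, P, R, C},
    "Public Information and Warning": {M, P, R, C},
    "Operational Coordination": {M, P, R, C},
    "Intelligence and Information Sharing": {P},
    "Community Resilience": {M},
    "Long-term Vulnerability Reduction": {M},
    "Risk and Disaster Resilience Assessment": {M},
    "Threats and Hazards Identification": {M},
    "Infrastructure Systems": {R, C},
    "Critical Transportation": {R},
    "Environmental Response/Health and Safety": {R},
    "Fatality Management Services": {R},
    "Fire Management and Suppression": {R},
    "Logistics and Supply Chain Management": {R},
    "Mass Care Services": {R},
    "Mass Search and Rescue Operations": {R},
    "On-scene Security, Protection and Law Enforcement": {R},
    "Operational Communications": {R},
    "Public Health, Healthcare, and Emergency Medical Services": {R},
    "Situational Assessment": {R},
    "Economic Recovery": {C},
    "Health and Social Services": {C},
    "Housing": {C},
    "Natural and Cultural Resources": {C},
}


def capabilities_for(phase):
    """Return the core capabilities that Table 1 marks for the given phase."""
    return [name for name, phases in CORE_CAPABILITIES.items() if phase in phases]


if __name__ == "__main__":
    for phase in Phase:
        print(phase.value, len(capabilities_for(phase)))
```

Under this encoding, the number of capabilities returned per phase (7, 4, 15, and 8) corresponds to the number of rows in Tables 2 to 5, respectively.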

Table 2. Big data granular tasks for mitigation phase.

NRF Capability	Task	Example
Planning	Assessment	Simulation modelling of eruptive processes for identifying eruption scenarios for emergency planning at Vesuvius, Italy [109]
Public information and warning	Communication, Prediction	Big Data analytics for predicting extreme flood risks and creating awareness in the community to mitigate their effects [110]
Operational Coordination	Schedule	Develop scheduling plans of power supply based on disaster trends and reserves of emergency supply [111].
Community resilience	Assessment	Use of big data technologies to integrate physical, social, economic, and environmental dimensions to assess neighbourhood resilience [112]
Long-term vulnerability reduction	Assessment	Harvesting big data from residential buildings for assessment on climate change policies [113].
Risk and Disaster Resilience Assessment	Assessment	Geospatial zonation of seismic site effects in Seoul [114].
Threats and hazards identification	Monitoring	Monitoring social media and crowdsourcing data for early identification of urban flooding [115]

Table 3. Big Data granular tasks for preparedness phase.

NRF Capability	Task	Example
Planning	Prediction	Ambulance demand forecast based on weather conditions and datasets from hospitals [101]
Public information and warning	Communication	Use of social media to communicate that a vaccine against H1N1 influenza was available [116]
Operational Coordination	Assessment	Recommendation of using operational analytics to coordinate emergency response across Federal, State, and local agencies [117]
Intelligence and Information Sharing	Collection	Usage of big data and open data integration mechanisms for improving information sharing from central to local governments and NGOs during preparedness in Taiwan [118].

Table 4. Big data granular tasks for response phase.

NRF Capability	Task	Example
Planning	Assessment	Analysis of geosocial media posts for emergency planning [105]
Public information and warning	Assessment	Assessment for managing affected populations based on a spatio-temporal analysis of public emotion information [106]
Operational Coordination	Assessment	Improved coordination between rescue teams integrating geographical, satellite, census and mobile phone call reports in Kerala floods [102]
Infrastructure systems	Assessment	Spatial assessment of risk and resilience of critical infrastructures for flood disaster [119]
Critical transportation	Prediction	Description and prediction of passenger flows, and detection of unusual flows and their explanation based on Twitter content during several disasters in Japan [120]
Environmental response; Health and safety	Monitoring	Big Data system for monitoring water pollution after flood disaster [104]
Fatality management services	Assessment	Fatality estimation and tsunami hazard assessment based on big data earthquake source models [121]
Fire management and suppression	Prediction	Real-time prediction of fire department response times in San Francisco [122]
Logistics and supply chain management	Assessment	Decision support system for optimal facility location, its state of operation, and production-distribution across countries [123].
Mass care services	Assignment	Decision support system for allocation of temporary housing after the disaster [124]
Mass search and rescue operations	Assessment	Decision support system for prioritising victims to be rescued [125]
On-scene security, protection and law enforcement	Classification	Identification of eyewitness messages [126]
Operational communications	Prediction	Prediction of mobile service disruption during Tokyo earthquakes [127]
Public health, healthcare, and emergency medical services	Assessment	Triage based on big data [103]
Situational assessment	Classification	Detecting informative tweets [128]

Table 5. Big Data granular tasks for recovery phase.

NRF Capability	Task	Example
Planning	Assessment	Assessment of resilience to Emergencies and Disasters at neighbourhood level for improving planning based on big data fusion [112]
Public information and warning	Monitoring	Monitoring social media (e.g., Twitter) to classify messages per disaster phase and mine relevant information [129]
Operational Coordination	Assessment	Satellite-based assessment of electricity restoration efforts during Hurricane Maria in Puerto Rico [130]
Infrastructure systems	Assessment	Evaluation of the resilience and recovery of public transit systems based on Big Data [131]
Economic recovery	Assessment	Economic loss assessment for rainfall and flooding disasters based on Big Data fusion [132]
Health and social services		Decision support system for evaluating hospital resources during post-disaster management [133]
Housing	Assessment	Socio-economic analysis of disaster recovery based on housing market data [134]
Natural and cultural resources	Assessment	Recovery assessment of monuments based on sentiment analysis of tweets during memorial days [135]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Iglesias, C.A.; Favenza, A.; Carrera, Á. A Big Data Reference Architecture for Emergency Management. Information 2020, 11, 569. https://doi.org/10.3390/info11120569

