1. Introduction
Drought is a kind of costly natural hazard which has widespread impact on societies, economies, and the environment [
1,
2]. In most cases, the effect of drought can accumulate and linger for many years after the cessation of the event [
3]. It is imperative to build an early drought warning system to support local stakeholders to assist with effective decision making for drought-vulnerable communities [
4].
In general terms, rainfall deficiency for a long dry period leads to drought. However, there is no uniform definition of drought. Various types of droughts include meteorological (when dry conditions within an area are influenced by climactic factors), agricultural (when dry conditions have an impact on agricultural productivity and crops), etc. To measure drought severity, there are some indices which are proposed based on meteorological, hydrological, agricultural and socioeconomic information [
5,
6,
7,
8]. Aitkenhead et al. [
8] develop a region-specific drought risk index based on the integration of drought vulnerability, exposure, and hazard indices, in order to evaluate drought in the Northern Murray–Darling Basin. Vicente-Serrano et al. [
9] propose a climatic drought index, namely, the standardized precipitation evapotranspiration index (SPEI), by exploring precipitation and temperature data. Sayari et al. [
10] use three drought indices, SPI, Precipitation Index Percent of Normal (PNPI), and Agricultural Rainfall Index (ARI), which monitor drought intensity and duration in the Kashafrood basin (northeast Iran). One of the mostly commonly used indices to assess the meteorological drought is Standardized Precipitation index (SPI) [
11], which classifies precipitation as standard deviations from the long-term mean. The SPI has been considered as a universal drought index in numerous hydrological and meteorological services [
12]. From the perspective of agriculture application, the Normalized Difference Vegetation Index (NDVI) [
13] provides an indication of crop health. Chua et al. [
14] perform a case study to illustrate that the Vegetation Health Index (VHI), outgoing longwave radiation (OLR) anomaly, and the SPI can capture the spatial and temporal aspects of the severe 2015–2016 El Niño-induced drought in Papua New Guinea accurately. Lotsch et al. [
15] adopt global precipitation and satellite NDVI data in time series to study both spatial and temporal variability between terrestrial ecosystems and precipitation regimes.
In this study, we examine the relationship between the NDVI and drought category. NDVI is related to the biomass and greenness of vegetation, which serves as an indicator to understand vegetation health at a both uniform and global scale [
16,
17]. Our problem can be formulated as follows: Given the NDVI data, and the Australian Gridded Climate Dataset (AGCD), return the optimal NDVI threshold values to represent the boundary of different drought categories, namely, extreme drought, severe drought, moderate drought, mild drought, and no drought. Specifically, we aim to find the NDVI threshold value to provide justification on why a larger (or a smaller) NDVI is related to a different drought category and whether or not the NDVI varies across different climate regions.
A drought classification model can be built based on the available datasets to further explore the influenced factors on the drought category classification [
6,
18,
19]. Machine learning techniques are often adopted to perform drought classification. In particular, Santos et al. [
20] consider analysing short-, medium-, and long-term droughts in Brazil from 1998 to 2015 based on the SPI data, where four types of drought events were studied for drought categories. An et al. [
21] propose a deep convolutional neural network to classify maize drought stress, where three categories are considered. Lima et al. [
22] develop a classification system to evaluate four different drought indices: SPI, Percent of Normal Precipitation (PNP), Deciles Method (DM), and Rainfall Anomaly Index (RAI). Felsche et al. [
23] use a list of 30 atmospheric and soil variables and apply the artificial neural network to classify drought or no drought in two European domains. Then, they use the Shapely values to provide an explanation of the classification model and calculate the contribution percentage of each dataset to the classification model. Quang Tri et al. [
24] establish drought classification maps in the Ba River basin, Vietnam, based on three meteorological drought indices, namely, the SPI, the Soil and Water Assessment Tool model, and the hydrological drought index. Vidyarthi et al. [
25] aim to extract knowledge from the artificial neural network (ANN) drought classification model to improve the comprehensibility of the black-box classification model. Moreira et al. [
26] classify drought severity using the loglinear model based on the SPI data from 14 rainfall stations in 12 months from September 1932 to June 2006. Rani et al. [
27] devise a classifier model to know about the severity of drought by predicting the climate conditions with a focus on the drought-vulnerable Indian state of Andhra Pradesh. Their model adopts a artificial neural network which is coupled with a feed forward neural network to predict the rainfall in the future and fuzzy c-means for the purpose of partitioning the forecasted data into low, medium, and high rainfall. Danandeh Mehr [
28] adopt the gradient-boosting decision tree to classify the drought severity into three categories of wet, normal, and dry events. Danandeh Mehr [
7] present a fuzzy random forest model by incorporating the Standardized Precipitation Evapotranspiration Index (SPEI) from 1961 to 2015 around the Central Antalya Basin, Turkey. Won et al. [
29] study the impact of two drought indices, namely, the SPI and the Evaporative Demand Drought Index (EDDI), on future droughts in South Korea. Paulo et al. [
30] propose a Markov chain approach to characterize the stochasticity of droughts which aims to predict the transition from one class of severity to another up to three months ahead. Chiang et al. [
31] perform a comparative study by establishing four models such as support vector machine and artificial neural networks to forecast the reservoir drought status in the next few days. To predict the drought, they take four kinds of features as the input, such as reservoir storage capacity and inflows. Malik et al. [
32] propose several heuristic approaches to forecast meteorological droughts using the Effective Drought Index (EDI) in Uttarakhand State, India.
Drought classification is an active and heavily researched area of ML application. When specifying drought thresholds, prior work [
4] often relies on the human-involved decision rules by using the static thresholds. Most importantly, this requires rich domain knowledge from experts and has potential to be subjective at times. The objectives of this work are: (1) uncovering the influence of the NDVI dataset on meteorological drought category (derived from the AGCD dataset); (2) studying the application of our drought classification model in two areas with different climate zones in Australia, namely, Temperate and Grassland region; (3) pinpointing the “optimal” thresholds of NDVI leading to different drought categories.
The paper is organised as follows.
Section 2 introduces the study area, data source, and methodology in the study.
Section 3 describes the experimental setup and experimental results, while
Section 4 discusses the results of drought classification findings, as well as their geospatial and temporal impact.
Section 5 summarises the major observations and findings.