With advancements in big geospatial data and artificial intelligence, multi-source data and diverse data-driven methods have become common in dengue risk prediction. Understanding the current state of data and models in dengue risk prediction enables the implementation of efficient and accurate prediction in
[...] Read more.
With advancements in big geospatial data and artificial intelligence, multi-source data and diverse data-driven methods have become common in dengue risk prediction. Understanding the current state of data and models in dengue risk prediction enables the implementation of efficient and accurate prediction in the future. Focusing on predictors, data sources, spatial and temporal scales, data-driven methods, and model evaluation, we performed a literature review based on 53 journal and conference papers published from 2018 to the present and concluded the following. (1) The predominant predictors include local climate conditions, historical dengue cases, vegetation indices, human mobility, population, internet search indices, social media indices, landscape, time index, and extreme weather events. (2) They are mainly derived from the official meteorological agency satellite-based datasets, public websites, department of health services and national electronic diseases surveillance systems, official statistics, and public transport datasets. (3) Country-level, province/state-level, city-level, district-level, and neighborhood-level are used as spatial scales, and the city-level scale received the most attention. The temporal scales include yearly, monthly, weekly, and daily, and both monthly and weekly are the most popular options. (4) Most studies define dengue risk forecasting as a regression task, and a few studies define it as a classification task. Data-driven methods can be categorized into single models, ensemble learning, and hybrid learning, with single models being further subdivided into time series, machine learning, and deep learning models. (5) Model evaluation concentrates primarily on the quantification of the difference/correlation between time-series observations and predicted values, the ability of models to determine whether a dengue outbreak occurs or not, and model uncertainty. Finally, we highlighted the importance of big geospatial data, data cloud computing, and other deep learning models in future dengue risk forecasting.
Full article