1. Introduction
Sustainability helps society in many ways by improving wellbeing and quality of life. Sustainable societies include many various sectors that encompass businesses, government agencies, environmentalists, and civic associations. Sustainable communities always seek to be innovative to boost their local economy for a healthy ecosystem [
1]. Sustainable development goals (SGDs) play an essential role in shaping daily societal activities, especially in under-developed countries such as Nigeria. However, the impact of the COVID-19 pandemic has caused the reversal of the United Nations target of lifting millions of people out of poverty. A recent study [
2] projected that there are 99 million people already pushed into poverty in 2020. There will be an additional 44 million people who will still live in extreme poverty by 2030 due to the impact of the COVID-19 pandemic. These would bring the total number of people globally living in poverty to 905 million by 2030.
Under sustainable development goals (SDGs), there is a need for countries such as Nigeria to come up with strategies and targeted interventions to reduce the number of vulnerable people that are currently living in extreme poverty, especially in remote areas that face difficulties in their daily lives with limited resources and poor infrastructure. Most of the under-developed countries, such as the Nigerian government, usually executed their services, including humanitarian relief aid, without seeking the opinion of the public, which is one-way governance. The sustainable development goal in information and communication technology (ICT) requires two-way governance where the government brings its citizens closer to them for better decision-making, especially during disaster management such as emergency response services including COVID-19 palliatives, relief aid, and logistics packages’ distribution to vulnerable people. Management and innovation for environmental sustainability meet the needs of the present without compromising the ability of future generations to meet their own needs. Sustainable development in the context of humanitarian support helps the government to reduce the number of vulnerable people living in severe pain caused by a disaster such as the COVID-19 pandemic.
COVID-19 is a disastrous disease that started spreading among people in December 2019 in Wuhan, China [
3,
4]. The virus started increasing and spreading on 31 December 2019, which began to draw public attention. In this regard, the World Health Organization (WHO) announced COVID-19 as a pandemic in March 2020 [
5]. As of 8 January 2021, the number of new COVID-19 cases reached the total number of 88,222,239 and the total number of 1,909,910 reported by the Centre for Systems Science and Engineering, Johns Hopkins University [
6]. Although not up to 1 percent of the Nigerian populace have been infected, out of the current approximate population of 200 million [
7], 136 thousand people (approximately 0.068% of the country’s population) have been infected by the virus [
8]. Similarly, the mortality rate has been low as currently 1630 deaths have been recorded, while the active cases count is 23,949 out of the total (136 thousand) detected cases so far [
8]. Many countries forced their citizens to stay at home to curb the spread of the deadly virus, making businesses and social activities stop worldwide [
9,
10,
11]. In times of disaster such as COVID-19, vulnerable people need help from the government to survive, and Nigeria, just like other countries, claimed to have spent billions of Naira distributing relief packages and COVID-19 palliatives to vulnerable citizens. The distribution of these palliatives is part of the measures the government has put in place in order to provide basic welfare to the masses as they have been restricted from conducting their daily business and means of earning income. However, Nigerians have expressed mixed feelings via social media, specifically on the billions of Naira claimed by the government on COVID-19 relief aid and palliatives [
12]. Some of these reactions are based on citizens’ concerns about the government’s failure to enforce sustainability (human, social and fiscal) laws on shared ideas of equality and rights, which improves qualities such as transparency, accountability, equality, reciprocity, cohesion, and honesty, the promotion of relationships amongst people, and their wellbeing.
Several governance approaches include the public’s participation as customers, partners, or citizens and bringing sustainable life to the public [
13]. Good governance requires improvement overtime to address critical issues concerning existing and emerging challenges to the governed. Part of such challenges is to improve the government’s ability to manage rapidly growing information demands on accountability and transparency of public funds; this requires the use of analytics to address governance challenges [
14] proactively. Considering the current rising expectations of citizens on their government, there is a need for the government to be innovative in gathering and understanding the data about the expectations of its citizens. This means that social media should not be limited to communicating government achievements to the public but should be extended as a listening and information source for the government. Such information sources can be used to develop predictive models that enable decision support, which is a vital part of smart governance [
15,
16].
With tremendous advancements in Information and Communication Technology, Business Intelligence is an essential tool to improve business processes and an influential and successful instrument in shaping business objectives to target customer’s needs [
17]. Similarly, Opinion Mining applies to businesses and other areas such as politics and governance. Opinion mining has become highly popular and interesting to researchers because of its application to several fields [
18]. According to [
19], 3.81 billion active social media users are depositing a massive amount of data on several social media platforms such as Facebook, Twitter, Instagram, WhatsApp, and others. Currently, Nigeria has more than 85 million internet users [
20]. This study chose Twitter as the primary data source due to its popularity because, according to [
21], Twitter accounts for 50 percent of the 25 million social media users in Nigeria. It is a popular microblog with 140 million active users posting more than 400 million tweets every day world-wide. Many users post information such as disaster damage reports and disaster preparedness situations during the disaster response period, making Twitter essential for updating and accessing data. Mining sentimental data efficiently will help better understand the emergency response, timely and efficiently [
22,
23].
Data mining plays a vital role in several research domains, such as artificial intelligence, statistics, and machine learning [
24]. Data mining is used to discover important information from databases. Opinion mining and sentiment analysis in the domain of social media platforms, data mining, or computing at large, as stated by [
25], refers to the identification, assessment, generating, and summarizing of an opinion and feelings of users about a different aspect of services rendered by governmental or private organizations, socio-economic and daily life activities. Additionally, for computation, sentiment analysis might be defined as written expressions of subjective mental states. Sentiment analysis and opinion mining in the context of emergency response help governmental and humanitarian agencies to identify sentiment and opinion that could bring the public closer to them by understanding their concerns and emotional feelings in times of emergency responses; hence, that would help in making an informed decision [
26,
27].
Several sentiment analysis studies on the English language have been conducted using different standard machine learning algorithms on social media regarding governmental issues using classifiers that have proven efficient in text classification, as illustrated by [
28,
29]. Despite the achieved efficiency, such studies are yet to be conducted on the citizens’ emotions regarding accountability and transparency of the Nigerian government spending on COVID-19 relief and aid response. Besides, such studies cannot be ported to informal (Nigerian Pidgin) English texts. Nigerian Pidgin English is estimated to have between 3 and 5 million people that use it as the primary communication medium in their daily interactions, and as the second language, around 75 million Nigerians [
30]. Due to its difference in vocabulary to standard English, words such as “tank” are interpreted as an expression of gratitude, not as a container, while “ginger” is interpreted as motivation, not as a plant [
31]. Therefore, this study aims to fill the existing research gap on emergency response-related studies and address the lack of text emotion datasets by constructing a Nigerian Local English Twitter Emotion dataset (Pidgin English). The dataset is experimented with using six Machine Learning algorithms to classify aggregated citizens’ emotions on the Nigerian government’s COVID-19 palliatives’ distribution. The remainder of this paper is structured as a review of related literature discussed in
Section 2 of this paper.
Section 3 discussed the methodology in terms of materials and methods used in conducting the experiments conducted in this paper. The results obtained from the experiments conducted are presented and discussed in
Section 4 and
Section 5, respectively. Finally, the conclusion of the study is presented in
Section 6 of this paper.
4. Results
This section presents and analytically discusses the results obtained from the conducted experiments. The evaluation metrics used are Accuracy percentage as well as Precision, Recall, and F-measure. However, while experimenting on the algorithms, K-Nearest Neighbor (KNN) was tested with values ranging from 1–50 for the “K” values, and for searching the K value, a Grid Search was used. Meanwhile, 1–200 ranges of values were selected for “number of estimators” in testing the Random Forest (RF) classifier. Therefore, the best “K” value chosen for KNN was 33, which returns the best accuracy among the tested values. Additionally, 200 values for the “number of estimators” was the best among the tested values for RF.
4.1. Results on Different Training and Testing Splits
This experiment was carried out to analyze the dataset used in training and testing all six models on different dataset splits.
Figure 3 shows the results obtained from the experiment based on the algorithms’ accuracy. The algorithms are Multinomial Naïve Bayes (MNB), Logistics Regression (LR), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and K-Nearest Neighbor (KNN). The dataset split was carried out in two phases, training and testing, based on the total number of observations or instances (tweet).
The above result indicates no significant difference from the experimental results found between 50 and 90 percent training on the dataset. The percentage split’s accuracy values on the range (50 to 90 percent) yielded satisfactory performance, thus promising an acceptable result when implemented on the collected dataset. Therefore, the 80:20 training and testing split was used to create the models used in this study.
4.2. Experimental Results and Comparison
This section shows the overall comparison of the experimental results obtained on the six machine learning models shown in
Figure 4 and
Figure 5. Support Vector Machine outperforms the remaining classifiers with the highest accuracy of 88% from the experimental results. Random Forest (RF) and Logistics Regression (LR) emerge as the second-best classifiers amongst other models with 86% percent accuracy each and at 0.87 precision, 0.86 recall, and 0.86 F1-score on a weighted average. There is a slight difference in the macro average where Logistics Regression scores 0.87 precision, 0.86 recall, and 0.87 F1-score, while Random Forest scores 0.88 on precision, 0.85 recall, and 0.86 F1-score. This shows that LR outperforms RF on the macro average of the evaluation metrics. The third overall classifier with 81% accuracy is Decision Tree, while the fourth model is Multinomial Naïve Bayes with 77% accuracy. The worst performing classifier in terms of accuracy, precision, recall, and F1-score is the K-Nearest Neighbor with 70% accuracy.
These results indicate that SVM, LR, and RF achieved higher scores on weighted and average micro results from all models than MNB, DT, and KNN. Particularly on the weighted average, SVM, LR, and RF all scored the same percentage F1-measure of 86%. On the F1-measure for macro average, even though the scores are not the same for the three highest-scoring algorithms (SVM, RF, and LR), the difference is insignificant as SVM only surpasses LR with 0.02% and RF with 0.03%.
The results shown in
Figure 6 are obtained based on the five emotions across all the six models. Additionally, we utilized the F1-score to show the models’ performance on each emotion class because the F1-score is the harmonic means of precision and recall.
Based on the results from
Figure 6, the following was discovered:
“Disgust” is the emotion labeled with the highest number from the classification performed. Four out of the six models detected it as the highest emotion class: LR, SVM, NB, and KNN. DT and RF recorded a lower emotion class, “Disgust,” compared to the rest.
“Joy” is among the five emotion classes which achieved the best F1-score classification in two models, DT and RF, out of the six experimented models. This means that the two models surpass SVM, LR, NB, and KNN in classifying the emotion class “Joy.”
The three other emotions, i.e., “Anger, Sadness, and Fear,” scored varying counts that fall between that of “Disgust” and “Joy” across the six created models.
Figure 7 below indicates the average F1-score across all six models, SVM, RF, LR, DT, NB, and KNN, on each emotion class based on percentage.
The results shown in
Figure 7 above reveal that the emotion label “Disgust” surpasses the remaining four emotions with an 87% average F1-score on all six models. The second highest emotion is “Joy” with an 85% F1-score, followed by “Sadness” with 83%. The last two emotions are “Anger” and “Fear” with 81% and 73%. However, based on the results indicated above, the highest classified emotion, “Disgust”, is part of the negative emotions, which shows that the public is not happy with the COVID-19 palliative distribution in Nigeria.
4.3. Correlation Analysis
Pearson’s Product-Moment correlation analysis was conducted to determine the significance level in terms of relationship on the outcome of the different classified emotions. This is based on the outcome of the six machine learning models, as illustrated in
Section 3, i.e., methodology. The result from the correlation analysis, as shown in
Table 7, reveals a strong, positive correlation between “joy” and “fear”, which was statistically significant (r = 0.870, n = 6,
p = 0.05).
Based on the result from
Table 7, the following was discovered:
The significant correlation between “Joy” and “Fear” implies that the public is happy and afraid at the same time. This means people are excited about the news of palliatives’ distribution but fear that there would not be equity and transparency in the distribution process due to corruption, which negatively affects both human and social sustainability.
The other three classes, i.e., sadness, disgust, and anger, all have their respective scores, but no significant correlation exists between them or other classes. This is despite the majority of classification results being “disgust” (which implies that the public is not happy with the distribution process), as illustrated in
Figure 7.
4.4. Word Cloud
In this research, one of the content mining approaches was used to visualize the most frequent words used in the collected Twitter data on public opinion and sentiment on COVID-19 palliatives and relief aid packages’ distribution to citizens of Nigeria. This technique is called Word Cloud, a visual or graphical representation of the information in the dataset [
62]. Additionally, a word cloud shows words in large, medium, and small font sizes. The largest word that appears in a word cloud represents the most frequent word in the dataset, while the smallest represents the least frequent word.
Figure 8 shows the generated word cloud from the emotions of the collected dataset.
The word frequency (word cloud) of the collected tweets, as shown in
Figure 8, reveals a high occurrence of the words “suffering” and “sadness” as the major feelings of the public on the COVID-19 palliatives’ distribution.
5. Discussion
This study aims to classify public emotions regarding COVID-19 relief aid distribution to vulnerable Nigerian citizens. The classification was conducted using six machine learning algorithms on the Nigerian Local English Slangs-Pidgin (NLES-P) dataset. Data pre-processing was conducted to refine the dataset and put it in order for experimentation. Various evaluation measures, including accuracy, precision, recall, and F-measure, were employed to evaluate the algorithms’ performance. The following paragraphs answer the posed research questions and objectives of the study.
Research Question 1: How to accommodate Nigerian Pidgin English in emotion classification from social media posts?
To answer this question, the researchers mapped this question with the research objective 1 (RO1), which is “to construct and annotate the relevant Nigerian Pidgin English dataset from Twitter post”. The approach to answer the question and achieve the objective is to construct Nigerian Pidgin English Twitter posts that contain an opinion and sentiment of Nigerians regarding COVID-19 relief aid packages’ distribution. Therefore, findings from this study reveal that the dataset was constructed with five emotion classes (anger, sadness, fear, joy and disgust).
Research Question 2: How can social media data and machine learning algorithms help identify public emotions associated with the distribution of COVID-19 relief aid to vulnerable Nigerian citizens?
To answer this research question, research objective 2 (RO2) was mapped with the research question, which is “to classify the emotions of the Twitter post using Naïve Bayes, Support Vector Machine, Random Forest, Logistics Regression, K-Nearest Neighbour and Decision Tree”. In this regard, the constructed emotion dataset (NLES-P) was utilized and experimented with the above-mentioned machine learning algorithms. Additionally, the basic aim of the algorithms is to predict appropriate emotion labels, namely, Joy, Sadness, Fear, Anger and Disgust. The experimental results obtained show that all the algorithms produced acceptable results, though there are some variations amongst the algorithms. The findings from this study revealed that the research question was answered, and the research objective was also achieved.
Research Question 3: What are the performance variations of the six standard machine learning algorithms?
To answer this research question, the researchers mapped research objective 3 (RO3) with research question 3, which is “to compare the accuracy of the machine learning models using standard performance evaluation metrics (accuracy, precision, recall and f1-score)”. Despite this, the researchers conducted a number of experiments to evaluate the performance of the six machine learning classifiers with respect to the emotion classification problem, as mentioned in the previous section. The approach to answering this question is aimed at inspecting and comparing the results obtained from the previous experiments conducted across all six ML classifiers. The experimental results revealed that SVM outperforms all the remaining five models with 88% accuracy with respect to the labeled emotions. This indicated that the research question was answered, and the objective was achieved based on the results obtained from the experiment.
However, there is no available work directly related to this study, but the researchers tried to compare the results with some previous studies conducted using the International Survey on Emotion Antecedents and Reactions (ISEAR) dataset.
Table 8 below shows the comparison of this study with other studies conducted using the same emotion classes (anger, sadness, joy, fear, and disgust). The results reveal that despite the utilization of non-standard English in this study, it produced better results accuracy than other similar studies conducted with Standard English.
Varying scores on performance were recorded by the machine learning models created for the emotion classification. Based on the experiment conducted, the results clearly show that the public emotions on COVID-19 relief aid package distribution in Nigeria were not satisfactory because the negative emotions expressed by the public outnumbered the public happiness. In order to avoid the lack of accountability and transparency that caused so much disgust, sadness, and anger during the COVID-19 relief aid distribution, the following actions are suggested:
Online and physical registration centers for the needy should be opened. The online portal can be accessed by the needy that live within cities or can access the internet. Simultaneously, the physical registration centers can attend to the needy people who do not have access to the internet and live in rural communities. This would help the government know the exact number of needy based on their different locations, and thus ensure adequate provision based on numbers rather than random or equal distribution as the number of the needy varies based on regions.
Accessible distribution centers can be set up nationwide to invite registered nurses to come for their support pickup. This can be scheduled in order to maintain the COVID-19 social distancing safety protocol.
Contributions of the Study
The contributions of this study are six machine learning models for the emotion classification of informal Nigerian Local English Slangs-Pidgin (NLES-P). Although the different models achieved different performance scores, each model recorded an acceptable performance score used in emotion classification. Furthermore, the models in this study can be applied in the text classification of Pidgin in countries such as Ghana, Cameroon, and Equatorial Guinea, where the same Pidgin English is widely spoken as in Nigeria. Secondly, as this research is the first to perform emotion classification on Nigerian Pidgin English, it has contributed an emotion dataset to the Nigerian Pidgin English language resources used in performing emotion classification. Thirdly, the holdout modeling approach employed by this study in creating the models for emotion classification has proven to perform efficiently well in terms of factors such as accuracy. Even though the holdout approach was not compared with another approach, such as cross-validation, the performance recorded by the holdout approach is very satisfactory. Lastly, the text analytics results for the Nigerian government on the distribution of the COVID-19 palliatives to citizens have revealed varying emotions that inform the government about the masses’ emotions.
6. Conclusions
This research focused on the reach and perceived public sentiment on emergency responses, specifically COVID-19 palliatives and relief aid packages’ distribution to vulnerable Nigerian citizens. Twitter was selected as the source for data on public opinions and sentiments. Various machine learning algorithms were employed: Multinomial Naïve Bayes, Support Vector Machine, Random Forest, Decision Tree, K-Nearest Neighbor, and Logistics Regression. These algorithms were used to classify the aggregated opinion and sentiment of the public using the Nigerian Local English Slang-Pidgin (NLES-P) dataset. Some of this research’s contributions consist of: (1) The results from this study would help the government and other organizations in resource-oriented decisions by taking into account their citizens’ needs and opinions in times of disaster management; (2) The research produced a text emotion dataset named Nigerian Local English Slang-Pidgin (NLES-P) (containing the emotion classes: anger, sadness, joy, fear, and disgust) to further facilitate research on Nigerian Pidgin and the related aspects; (3) Performance comparison was conducted on the six standard machine learning classifiers for Twitter classification regarding COVID-19 palliatives and relief aid using the NLES-P dataset. Finally, the results from the conducted experiments reveal that Support Vector Machine outperforms the other models with the highest accuracy of 88%. Likewise, “Disgust”, as one of the five emotion classes, surpasses the other emotions with an 87% average F1-score across the six experimented models. Conclusively, the overall results of all the experiments conducted indicated that the level of unhappiness from Nigerians regarding the distribution of COVID-19 palliatives and relief aid packages by the government was very high, with little positive sentiment from the public. Furthermore, the correlation analysis conducted shows a significant correlation between “Joy” and “Fear”, implying that the public is happy and afraid at the same time, which implies that people are excited about the news of palliatives’ distribution but afraid that there would not be equity and transparency in the distribution process due to corruption.
Limitations and Future Work
This study currently has some limitations that need to be improved in future works. The suggested future works are:
Collect more data because some of the algorithms used in this study perform well on a large dataset. Other social media such as Facebook and Instagram should be used for data collection, creating a larger dataset.
The majority of Nigerians express their opinions and sentiments using their native languages. This is because the country consists of three major local languages: Hausa, Igbo, and Yoruba. Future research should use these languages to help in measuring the performance of the machine learning algorithms.
The machine learning algorithms could be used for real-time classification using Twitter social media.
The type of essentials, such as food, shelter, medical support, etc., should be used as features to classify the type of essentials needed by vulnerable people. Finally, future researchers should use different parameters and feature engineering techniques to experiment on these machine learning algorithms (MNB, RF, SVM, DT, LR, and KNN) and make comparative analysis because we noticed that some of the models have different parameters and factors to make them perform better.