Next Article in Journal
The Influence of Animal Glue as an Additive on the Properties of Lime Architectural Grouts
Previous Article in Journal
Elucidating the Potential of Biochar-Bentonite Composite and Kaolinite-Based Seed Balls for the Remediation of Coal Mining Impacted Heavy Metals Contaminated Soil
Previous Article in Special Issue
Thai Local Chicken Breeds, Chee Fah and Fah Luang, Originated from Chinese Black-Boned Chicken with Introgression of Red Junglefowl and Domestic Chicken Breeds
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Twitter Mining for Detecting Interest Trends on Biodiversity: Messages from Seven Language Communities

1
Graduate School of Engineering, Osaka University, Yamadaoka 2-1, Suita 565-0871, Osaka, Japan
2
Institute for Future Initiatives, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657, Japan
3
Department of Ecosystem Studies, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657, Japan
4
Institute for Global Environmental Strategies, 2108-11, Kamiyamaguchi, Hayama 240-0115, Kanagawa, Japan
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(17), 12893; https://doi.org/10.3390/su151712893
Submission received: 16 May 2023 / Revised: 13 August 2023 / Accepted: 16 August 2023 / Published: 25 August 2023

Abstract

:
The recent rates of global change in nature are unprecedented in human history. The Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services (IPBES) has proposed a framework to achieve transformative change. Transformative change with respect to nature will be driven by recognizing the values people have; making inclusive decisions based on these values; restructuring policies, rights, and regulations in accordance with them; and transforming social norms and goals that can drive change. Social media is a new source of information and a modern tool for monitoring public opinion on human–nature interactions. This study identified commonalities among seven language communities (the six official languages of the United Nations and the Japanese language), demonstrating the uniqueness of the Japanese community by comparing hashtags in tweets that include the term biodiversity and determining differences in interest and concern about biodiversity from the past to the present. Tweets accessible at the end of 2021 that focus on biodiversity were collected from the Twitter server and used to form a text dataset. Interest was then qualitatively and quantitatively identified using natural language processing technology. Engagements and diversity indices were found to be on the rise in all language communities. We found that the Japanese language community has a different perspective on the relationship between biodiversity and humans from the scope of the IPBES conceptual framework. Future work should examine the relationship between passion for biodiversity and the Sustainable Development Goals. In addition, collaboration with various people around the world is necessary to understand the concept of biodiversity in different traditions and cultures.

1. Introduction

The Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services (IPBES) noted that the rate of recent global change in nature is unprecedented in human history and proposed a framework for transformative change [1]. According to this framework, the direct drivers of change in nature with the greatest impact globally are changes in land and sea use, direct exploitation of organisms, climate change, pollution, and invasion of alien species. These direct factors arise from a set of underlying factors called indirect drivers that are supported by social values and behaviors, including production and consumption patterns, human population dynamics and trends, trade, technological innovations, and both local and global governance. Therefore, to achieve transformative change from the current trend to a more sustainable one, it is necessary to recognize effective points of intervention across values and behaviors (both indirect and direct factors) and governance in collaboration with various stakeholders [2].
Transformative change entails a fundamental, system-wide reorganization across technological, economic, and social factors, including paradigms, goals, and values [1]. The context of transformative change differs according to history, culture, and social institutions (Figure 1). From the socioecological perspective, transformative change depends on eight leverage points and five collaborative implementations of priority governance interventions, termed levers [3]. As demonstrated in a set of values-centered leverage points in [2], “shifting societal norms and goals” will have the most significant impact. In addition, recognizing the nature values people have; making inclusive decisions based on these values; restructuring policies, rights, and regulations; and transforming social norms and goals can drive transformative change. Goal 1 of the Aichi Targets established in 2010 by the Convention on Biological Diversity (CBD) [4,5] has as its first aim that “people are aware of the values of biodiversity and the steps they can take to conserve and use it sustainably”. In response to the National Biodiversity Strategies and Action Plans mandated by the CBD [6], the Japanese government has also set the following: “Strategy (1): Letting biodiversity sink in Japanese society” [7]. Further, the Kunming-Montreal Global Biodiversity Framework also proposed enhancing communication, education, and awareness of biodiversity and the uptake of this framework by all actors in Section K (outreach, awareness, and uptake) [8]. Determining the leverage points is crucial for considering viewpoints that will stimulate people’s interest. Thus, the Japanese government had developed the 6th National Biodiversity Strategies and Action Plan and setting “awareness and action for biodiversity in the consumption and production” as the main pillar of the strategy [9,10].
In this trend, the CBD assessed the achievement of Goal 1 of the 10-year Aichi Targets [4,6]. The Ministry of the Environment of Japan checked time-series trends in the interest of nature and engagement in biodiversity conservation from the 1990s to the 2010s [11]. However, these were limited questionnaire surveys with a limited number of respondents, and they did not aim to understand the biodiversity values people have. The CBD indicated that there was no globally consistent information identifying trends in awareness or willingness to act on biodiversity [5]. Leadley et al. (2022) indicated the importance of a monitoring system that can discover and identify the drivers that cause changes in biodiversity, the actions that lead to the expected biodiversity outcomes, and the need to establish a set of readily monitored predictive indicators to design proactive planning and actions [12].
Within this context, social media is a novel source of information and an up-to-date tool for monitoring public opinion in human–nature interactions [13,14]. Analysis of citizens’ interests through social media can be implemented through various means, including smart city [15,16,17], traffic [18,19,20], and energy [21]. Several types of social media have been used for this purpose, and one of the most common is Twitter. Twitter is a service for friends, family, and coworkers to communicate and stay connected through the exchange of quick and frequent messages called tweets, which may contain photos, videos, links, and text [22]. Monetary Daily Active Usage/Users counted 237.8 million active users in 2022 [23]. Twitter features a service called the full-archive search for use in academic research. This allows people in academia to access the full corpus of Twitter data back to the first tweet posted in March 2006 [24]. It is expected that a Twitter search will help clarify people’s interest in biodiversity.
A great deal of research on climate change has been conducted using Twitter. For instance, Pearce et al. (2014) investigated how people reacted to the Integrated Assessment Report of the IPCC [25], and Kim and Cooke (2018) observed speaking about climate change and ocean acidification when the United States withdrew from the Paris Agreement [26]. Molodtsova (2014) measured the influence of mass media on attitudes linking local extreme weather events to climate change in the United States [27]. Arlt et al. (2018) investigated the effects of media and interpersonal communication on participation in climate discourse online and revealed that receiving information on climate change from social media, including Twitter; active information seeking online; and interpersonal conversations strongly encourage participation [28]. Sanford et al. (2023) examined the influence of the emotional framing of messages posted by environmental activists on engagement and behavioral intentions on climate action [29]. Studies related to COVID-19 have included analyses of the impact of a global pandemic on mitigation behavior and information communication on climate change on Twitter [30,31,32]. As state-of-the-art research, Thakur (2023) combined Twitter mining and sentiment analysis and detected the difference in opinions via emotions [33].
Twitter has also been utilized in research on biodiversity. For example, Twitter helped in identifying endemic species [34] and invasive alien species [35], and social media data (including Twitter data) were compared to measure the popularity and number of visitors to a national park [36]. Jarić et al. (2016) assessed whether the use of Latin or vernacular names was more effective when data mining on the web [37], and Papworth et al. (2015) explored the transmission of conservation research through online news and social media [38]. Furthermore, several studies have covered people’s interest in biodiversity conservation. Hawkins and Silver (2016) and Macdonald et al. (2017) analyzed discussions of and reactions to seal and lion hunting, respectively [39,40]. Hammond et al. (2022) conducted a content analysis of tweets about elephants [41]. Kidd et al. (2018) assessed the attention to threatened species on Twitter and found that many threatened species were not mentioned at all [42]. In an important study of biodiversity mainstreaming, Cooper (2019) investigated 22 biodiversity-related keywords in 31 languages in online newspapers, social media, and internet searches to monitor Aichi Target 1 [43]. They also proposed a method to measure the progress of Aichi Target 1. Ohtani (2022) also attempted to reveal the emotional tendencies behind contexts in which biodiversity has been used on Twitter with sentiment analysis. The study presented changes in the discourse surrounding biodiversity in English tweets over the past 10 years and the possibility of developing quantitative methods such as natural language processing [44]. Barrios-O’Neill (2020), the most relevant study to our work, analyzed the advocacy of environmental NGOs on Twitter by screening advocacy relating to biodiversity and revealed that Twitter advocacy was dominated by climate change, overexploitation, and plastic pollution, but major threats to agriculture, urbanization, invasions, and pollution were rarely addressed [45].
It is important to clarify how people’s interest in biodiversity has changed and whether it is currently designing the necessary levers to transform values and behaviors related to biodiversity. Menendez et al. (2018) analyzed tweets that used the hashtag #WorldEnvironmentDay on the respective day and identified interests related to the sustainable care of the environment and public health [46]. To deepen mainstreaming biodiversity among the public, our study aimed to identify the historical trends and specific interests of communication on biodiversity. In addition, we need to deeply dive into more specific concepts of the biodiversity domain by localizing the context to reflect the diversity of nature’s values [2].
This study investigated the commonality among seven language communities (Japanese, which is the primary language of the authors, and six official languages of the United Nations) and the uniqueness of the Japanese community by comparing tweets that include the term biodiversity for 15 years. It also clarified changes in interest in and concern about biodiversity from the past to the present. This study formulated the following three research questions (RQs):
RQ1: Is the interest in biodiversity continuously activated (RQ1-1) and diversified (RQ1-2)? This result can contribute to monitoring the current progress of promoting people’s interests on biodiversity.
RQ2: What are the shared interests among the language communities and the special interest of the Japanese language community? This can help share the diversity of the interests, depending on the indigenous communities.
RQ3: What promotes the interest in biodiversity in the Japanese language community? The knowledge can be used to support the effective measure design for enriching the people’s values on biodiversity.
The detail of each RQ’s interpretation and the methodology of the quantification are described in the next section.

2. Materials and Methods

As depicted in Figure 2, tweets on biodiversity were collected from the Twitter server to create the text dataset. After preprocessing the text dataset, interest in biodiversity was qualitatively and quantitatively identified via data mining using natural language processing technology. All programs in the pseudocode were implemented with Python (ver. 3.7.7).

2.1. Twitter Data Collection

Twitter API v2 [47] was used to collect tweets related to biodiversity through a data collection program. Japanese and the six official languages of the United Nations were selected as the target language communities. The full archive search function for academic was used, and the endpoint of “GET/2/tweets/search/all” was used for the Twitter search [48]. The query parameters for the search matching in the seven language communities were as follows: Japanese (生物多様性), Arabic (التنوع البيولوجي), Chinese (生物多样性), English (biodiversity), French (biodiversité), Russian (биopaзнooбpaзиe), and Spanish (biodiversidad) [49]. The exact match was applied, and partial match and synonyms were not used for the query. The collected tweet.fields parameters were “created_at” and “public_metrics”. The max_result parameter was 500. The search period was from 21 March 2006 to 31 December 2021. The start_time and end_time parameters were dynamically set according to the request response and the rate limit. The trials to connect the endpoint were iterated at 3.5 s intervals and stopped for 600 s when the iteration was exceeded to the limit rate. The data collection was performed twice. The first search period was from 21 March 2006 to 31 March 2021, and the second was from 1 April 2021 to 31 December 2021. It took two weeks in early May 2021 for the first collection and one week in February 2022 for the second collection. This dataset was built using the available tweets at this point. Therefore, this dataset did not include tweets that were deleted in the past and the deleted tweets could not be collected anymore so we had to be careful of sampling bias. All tweets were included in the dataset regardless of the user properties. The dataset could include the tweets generated by bots. Bots represent a serious threat to users, as they can launch large-scale attacks and manipulation campaigns, and removal technologies such as machine learning can be applied to detect them [50]; however, we decided to use all the data to research the effects of the bots.

2.2. Hashtag (#) Extraction

The hashtag (#) function is widely used in tweets. Twitter users can annotate keywords with a hashtag to emphasize interest. This hashtag can help users to find conversations on certain topics and bring their own tweets greater attention [51]. Thus, users’ interests can be identified by analyzing the types of topics mentioned in the hashtags in tweets related to biodiversity. The hashtags were extracted from the original tweets using regular expression-matching operations. Regular expression is a sequence of characters that specifies a match pattern in text strings. Tokens represented #(\w+|[^ -~-]+) in regular expression matching were extracted as hashtags. Hashtags related to personal names, excluding those of celebrities, were removed to ensure privacy. To preserve the original intentions of the users, no preprocessing such as altering text casing was applied.

2.3. Evaluation of Engagements and Diversity Indices

Evaluation of engagement and diversity indices of hashtags were conducted to address RQ1 and determine whether there is continuous interest in biodiversity (RQ1-1) and whether it is diversified (RQ1-2).

2.3.1. Engagement

The engagements were evaluated according to the number of retweets, likes, and replies the tweets had [52]. On Twitter, users can retweet and like posts, denoting the endorsement of tweets [53]. Tweet i, RTi, like i, and reply i were the number of tweets retrieved, the total number of tweets retweeted, the total number of tweets liked, and the total number of tweets replied to in year i, respectively. Using these variables, RT_engagement i, like_engagement i, and reply_engagement i were calculated with Equations (1)–(3), respectively.
R T _ e n g a g e m e n t i = R T i T w e e t i
l i k e _ e n g a g e m e n t i = l i k e i T w e e t i
r e p l y _ e n g a g e m e n t i = r e p l y i T w e e t i

2.3.2. Diversity Index

Shannon entropy was used as the diversity index of interest in biodiversity to address RQ1-2. Shannon entropy is a metric used in information theory, and it is defined as the amount of information in a bit unit [54]. Thus, if various topics (hashtags) emerge in year i—making topic (hashtag) distribution more divergent—then Shannon entropy is high. However, Shannon entropy is low if the topics (hashtags) are concentrated on a few specific interests.
The amount of information obtained for event e occurring with probability p is represented by Equation (4). Therefore, Shannon entropy determines probability distribution p1, p2, p3,…, pn of the proportion of hashtag n (n = 1 to N) appearing in year i, which was used as the diversity index of year i and is quantified by Equation (5).
I e = l o g 2 p e   [ b i t ]
d i v e r s i t y   i n d e x i = n = 1 N p n l o g 2 p n   [ b i t ]

2.4. Semantic Analysis of Hashtags

The most common hashtags were visualized and compared between the Japanese community and the other six language communities to address RQ2, which investigated the shared interest among the language communities and the special interest of the Japanese language community, and RQ3, which regarded the type of matters that promote interest in biodiversity in the Japanese language community. Further, hashtags were used as nodes in the network analysis to calculate the mediating centrality of the hashtags with the highest number of hashtags in each language community. All languages were translated into English using Google Translate API (ver. 4.0.0rc1) [55] to set the operations of the hashtags. The translations to English were accepted without evaluating the accuracy of the translation due to limitations of linguistic and cultural understanding.

2.4.1. Popular Hashtags in Language Communities

The most commonly used hashtags that appeared in language community l were counted to identify interest in biodiversity during the entire sample period. The hashtags with the highest proportions were considered as the interest of the language community l.

2.4.2. Common and Unique Hashtags among Language Communities

The set operation was conducted to find hashtags used among the Japanese language community and the other six language communities to identify the commonality and uniqueness of interest in biodiversity within a language community. The set of hashtags that appeared in the Japanese language community and the other six language communities during the whole period was defined as Ujapanese and U6langages, respectively. The intersection set between Ujapanese and U6langages was Ucommon, which represented the shared interest in biodiversity among the language communities. Ujapannese_only and U6langage_only were obtained using different sets of hashtags in each language community. These are presented in Equations (6)–(8).
U c o m m o n = U J a p a n e s e U 6 l a n g u a g e s
U j a p a n e s e _ o n l y = { h a s h t a g | h a s h t a g U j a p a n e s e h a s h t a g U 6 l a n g u a g e s }
U 6 l a n g u a g e s _ o n l y = { h a s h t a g | h a s h t a g U 6 l a n g u a g e s h a s h t a g U j a p a n e s e }
These hashtags were visualized in a word cloud [56], in which the size of each word represented the popularity of a given hashtag. The sizes in the different sets were adjusted using Hl (l = ja, ar, ch, en, fr, ru, es), which was the proportion of the number of particular hashtags in each language community l. The size of the intersection set was adjusted using Itag, calculated as the harmonic mean of Hl and expressed by Equation (9).
I t a g = n 1 n ( 1 H j a + 1 H a r + + 1 H e s )

2.4.3. Comparison with IPBES Conceptual Framework (IPBES-CF)

To compare interest in biodiversity, popular hashtags were compared with the IPBES-CF, developed by Díaz et al. (2015) [57] as depicted in Figure 3. The IPBES-CF is a highly simplified model of the complex interactions between the natural world and human societies. It comprises six interlinked elements—(1) nature, (2) nature’s contributions to people (NCP), (3) anthropogenic assets, (4) institutions and governance systems and other indirect drivers of change, (5) direct drivers of change, and (6) good quality of life. The most commonly used hashtags in Japanese and English language communities were categorized according to these six elements, and their distributions were compared.

2.4.4. Network Analysis

To understand the context of the hashtags, the co-occurrence relationship of the hashtags was identified, and network graphs were created to calculate the betweenness centrality with the NetworkX (ver. 2.4) library [58], which is a metric of the importance of network nodes (Equation (10)). In addition, the betweenness centrality of the Japanese language community was calculated to identify the time-series trend of important concepts. bi denotes the betweenness centrality of node i; σst(i) is the number of the shortest passes that include node i; and σst is the number of the shortest passes between s and t (1 ≤ s, tn ∩ s, ti).
b i = s = 1 ( s i ) n t = 1 ( t i ) n σ s t ( i ) σ s t

3. Results

3.1. Tweet Statistics

Table 1 presents the number of posted tweets in each language that included the term “biodiversity” during the whole search period. The total number of related tweets in all languages was 5,140,273. There were 301,571 such tweets in Japanese, which was 6% of the total number of tweets. English (2,925,742, 53%) and Spanish (1,422,724, 26%) accounted for approximately 80% of the total tweets. Russian (3450, 0.6%) and Arabic (14,975, 0.2%) had the least proportions.
Figure 4 displays the time series of the number of tweets according to language. Overall, the number of tweets increased in all language communities. The Japanese, Arabic, Russian, and Chinese language communities experienced a burst of activity in certain years. For instance, in 2010, Japan hosted the Tenth Meeting of the Conference of the Parties (COP) of the CBD, which may have led to a surge in the data. International events may contribute to the number of tweets on biodiversity.

3.2. Quantitative Trends in Hashtags

3.2.1. Hashtag Statistics

Table 2 presents the statistics of hashtags appearing in tweets. The Japanese language community used 0.27 hashtags per tweet, while the English language community used 1.69 hashtags and tended to annotate with various hashtags in a tweet. Certain language communities, especially the English language community, repeatedly used the same hashtags an average of 12.8 times, while in other language communities (such as the Chinese, and Russian communities), hashtags were reused on an average of only twice. Regarding tweets about biodiversity, the top 10% most used hashtags took up more than 80% of the total number of hashtags used in the English, French, and Spanish language communities such that topics and words on Twitter were widely shared and repeated within the language communities. The Chinese and Russian language communities experienced an opposite tendency. Thus, the hashtags performed differently in different language communities.

3.2.2. Time series of Engagement and Diversity Index

The three engagements in each year are depicted in Figure 5a–c. The Japanese, Arabic, French, Russian, and Spanish language communities experienced increasing trends in RT_engagement from 2006 to about 2018–2020, after which it began to decline or slow. On the other hand, the Chinese and English language communities continued to see increases. Like_engagement was increasing in all language communities, but significant decreases were observed in Arabic, Chinese, and Russian language communities in 2021. Reply_engagement has generally been increasing but has been slowing for the Chinese, French, and Russian language communities in recent years. From the above, the English language community experienced a strong trend of increasing engagement as the number of tweets increased; the Japanese, French, and Spanish language communities experienced a gradual increase in the number of tweets and engagement; and the Arabic, Chinese, and Russian language communities experienced saturating or decreasing trends in engagements when the tweets increased. In contrast, the diversity index in Figure 5d depicts a monotonous increase in all language communities, indicating an increase in the diversity of interest.

3.3. Semantic Trends of Hashtags

3.3.1. Finding the Primary Interest

Table 3 reports the top 10 most frequently occurring hashtags for each language community presented in the original language and English as well as the proportion of the number to all tweets throughout the search period. All extracted hashtags from the dataset were split into seven language communities, and the most frequent 10 hashtags were extracted. The proportion of each hashtag was calculated by dividing the frequency by the total number of hashtags in each language. The hashtag #biodiversity itself ranked in the top 10 in all language communities. Between 16.02% and 9.12% of hashtags in the Arabic, English, French, Russian, and Spanish language communities included #biodiversity, implying that the word “biodiversity” itself signifies a common interest and concept perceived as a major topic for information dissemination across language communities. These language communities also tended to include the biodiversity-related concepts such as “nature”, “wildlife”, “ecology”, “agriculture”, “environment”, “climate”, and “planet”. This tendency is very important to treat the biodiversity nexus issues such as the nature–climate nexus, so we will deepen the discussion later. However, the rate of use of #biodiversity in the Japanese and Chinese language communities was less than 2.60%. The word “biodiversity” itself may not be in common use and might be replaced with other words such as “richness of nature” in the Japanese and Chinese language communities. This terminology issue is also important for developing the strategies to expand and centralize the biodiversity concept in the societies. The CBD-COP made a strong contribution to promoting interest in biodiversity, so institutions and governance play a great role in managing nature’s contribution to people, as shown in Figure 3. In addition, the interest in biodiversity was closely associated with the Sustainable Development Goals (SDGs), which include SDG Global Goals 14 (Ocean) and 15 (Land), in the Japanese language community compared with other language communities.

3.3.2. Commonalities of Interest among Language Communities and the Uniqueness of the Japanese Language Community

The Venn diagram in Figure 6 depicts the intersection and difference sets between the Japanese language community and the other language communities. The Japanese difference set includes only hashtags used in the Japanese language community. The intersection set consists of hashtags that appeared in all language communities, and the difference set of the other six languages exhibits hashtags mentioned in all six languages. The hashtags that appeared in all language communities were sorted in descending order of the harmonic mean presented in Table 4.
The highest harmonic means among the intersection hashtags were for #biodiversity and #Biodiversity. The concept of biodiversity appears to be gaining ground. Moreover, the hashtag #nature has been shared worldwide as a common concept with biodiversity. Furthermore, the hashtag #COP15, which denotes interest in biodiversity, has been increasing rapidly around the world in recent years. The hashtag #science is also popular, which may indicate the contribution of science communities such as IPBES to biodiversity.
The difference set of the six languages comprised hashtags generated only in the six languages. The concepts of ecosystem and protection were inseparable from biodiversity conservation and mainstreaming; they also appeared with hashtags related to the names of specific regions, such as Poland (the host country of the UNFCCC-COP19) and Pakistan (the Japanese language community did not use this word).
Figure 7 presents a comparison of common and unique hashtags between the Japanese and the other six language communities. In all comparison sets, #COP10, #Teddy bear, #Pounding, #ngo, and #Events appeared often in the Japanese difference set. Further, the CBD-COP10 had a large impact on the Japanese biodiversity interest. The Japanese government and the Ministry of the Environment in particular played an important role in mainstreaming biodiversity through the adoption of the Aichi Targets and the Nagoya Protocol. The original Japanese hashtag for #Teddy bear was identified as #MISIA by manually checking the original tweets (this may have been an inaccurate translation from Japanese by Google Translate). #MISIA is a well-known singer who served as a CBD-COP10 honorary ambassador [59]. This singer is an influencer committed to the SDGs and biodiversity. This suggests that influencers have a significant potential to motivate people to develop an interest in biodiversity. #Pounding refers to the Japanese word “kakiage”—a type of tempura consisting of finely chopped and deep-fried seafood and vegetables such as shrimp. Tweets including #Events were checked manually, and it was found that it was also a mistranslation of “ebi-ten”, a type of shrimp tempura. This was related to the fact that the shrimp contained in “kakiage” and “ebi-ten” represented a large proportion of imported shrimp, which led to significant concern regarding the destruction of mangrove forests entailed in their harvesting [60]. The hashtag #ngo was seen in many tweets related to event organization and participation, indicating the large role of nonprofit and nongovernmental organizations in biodiversity.
To classify in detail the hashtags observed in the Japanese language community, Table 5 lists the top 10 most prominent hashtags for every five years since 2010. We found a tweet-count explosion in tweets related to biodiversity in 2010 in the Japanese language community. The oldest hashtag trend from 2010 was related to the topics of the CBD-COP10 and specific organizations such as #mudef (a social action group that includes celebrities from various fields, such as artists, actors, and sports players [61]) and #nhk—a state-run broadcasting company (Nippon Hoso Kyokai (NHK)) [62]. Twitter-specific tokens, such as #twvt, were also created as hashtags. The hashtag #twvt stands for a Twitter invite, which is an event and participation announcement service, and #tenen was a hashtag frequently used with #cop10 and #bdjp; however, the exact meaning cannot be clearly identified at this point. The most prominent and autHoritative organizations tended to lead topics in 2010. However, in 2015, contents regarding the relationship between diet and biodiversity (#tempura and #fried shrimp), biodiversity conservation activities by nongovernmental and nonprofit organizations (#npo and #ngo), and raising funds for biodiversity conservation (#click donation) were seen more frequently. These issues appear to relate much more closely to citizens’ daily lives. As recently as 2020, biodiversity again attracted attention as part of the Global Goals in the SDGs (#SDG). Food systems were also referred to again as lifestyles (#vegan) in 2020.

3.3.3. Correspondence with IPBES-CF

The hashtags in the intersection set of the seven language communities presented in Table 4 were grouped with reference to the IPBES-CF category, as presented in Table 6. The indirect drivers covered 53% of the total hashtags. These drivers could be further categorized into international events and initiatives (#Biodiversityday, #WorldEnvironmentDay, #EarthDay, #WorldOceanDay, and #fornature), institutions and governance (#SDGs, #COP15, and #OnePlanetSummit), countries and organizations (#Japan, #Germany, #China, #FAO, #France, #Russia, and #UNEP), and current events (#news). Nature was found in 23% of the hashtags. The two categories of nature commonly observed were climate (#Climate and #climate) and ecosystems (#biodiversity, #Biodiversity, #nature, #Nature, and #ecology). The anthropogenic assets and direct drivers’ categories followed with 10% each. The rest referred to NCP and the good quality of life associated with NCP. Neither category had any hashtag within the intersection hashtag set. The hashtag #today, categorized among other views, was used to indicate biodiversity-related events and happenings at that time.
Table 7 presents the proportion of the IPBES-CF categories for the 117 hashtags in the Japanese and English language communities, corresponding to the highest 1% of hashtags used only in the Japanese language community and the highest 0.03% of hashtags used only in the English language community. In all, 67% of hashtags in the Japanese language community were grouped under the indirect drivers’ category. The common hashtags included frequently used abbreviations in Japan for the names CBD-COP10 (#BDJP), national broadcasting company (#nhk), #click donations [63], and the youth special mission to realize the Aichi Targets (#gakeiki_mission) [64].
A χ2 test detected a significant difference in the probability distribution between the language communities at the 5% level, and a residual test exhibited a difference in meaning in the probability of other views (p = 0.000) and direct drivers (p = 0.003). In the Japanese language community, 15% of the hashtags cited biodiversity from a different perspective other than IPBES-CF. Regarding the direct drivers, the English language community included all direct drivers defined by a transformative change study—“Land/sea-use change”, “Direct exploitation”, “Climate change”, “Pollution”, “Invasive species”, and “Conservation activities” [65]. By contrast, the Japanese language community included hashtags in limited categories such as invasive species (#invasive species), direct exploitation (#illegal deal), and bird conservation (#BirdBranch). The IPBES-CF was developed by international experts in an open process and widely shared, but this result suggests that there is still room for improvement.

3.3.4. Network Metrics

Table 8 presents the hashtags with the highest betweenness centrality among the hashtags of the communities with the highest number of hashtags, the hashtags translated into English by Google Translate, and the betweenness centrality values. More than three proper nouns, such as organization names and international conferences, were extracted from the Japanese, Arabic, Chinese, and Russian language communities. In addition, more than four nouns paraphrasing biodiversity were found as hashtags with high betweenness centrality in the English, French, Russian, and Spanish language communities. The Japanese language community’s uniqueness was that hashtags related to media such as the state-run broadcasting company and news had high betweenness centralities.
Table 9 presents the time series of the highest betweenness centrality hashtags in the Japanese language community. In terms of hashtag content, until 2011, hashtags regarding the CBD-COP10 were the focus. From 2012 to 2017, biodiversity (or paraphrasing of biodiversity) became the focus of interest. Education-related topics, such as #Kindle and #environmental education, had higher betweenness centralities. After 2018, sustainability topics other than biodiversity, such as #SDGs, #ESG, and #Zerohunger, were also connected to biodiversity and various topics.

4. Discussion

4.1. RQ1: Is the Interest in Biodiversity Continuously Activated (RQ1-1) and Diversified (RQ1-2)

In this study, we found that discussions on biodiversity are becoming more active in all seven language communities, as indicated by the trend of an increasing number of tweets (Figure 4) and engagements (Figure 5a–c), providing an affirmative answer to RQ1-1. In the Japanese language community, the number of tweets (Figure 4) has been increasing since 2015, and engagement (Figure 5a–c) was increasing until around 2019. Subsequently, engagement of the Japanese language community slowed after 2019, but the number of tweets increased, suggesting that Twitter became an active forum for biodiversity discussions.
The answer to RQ1-2 is also yes. The rate of emergence of unique hashtags (Table 2) increased, and the diversity of the interest increased almost monotonically (Figure 5d). This indicates that the global biodiversity conversation is becoming more active and diverse; views on the value of biodiversity are becoming more diverse; and stakeholders’ activities in biodiversity conservation are present in various fields. This results in a different understanding of the role that nature plays at the foundation of people’s lives and in their quality of life, leading to the formation of a wide diversity of values regarding nature [2].
From an international perspective, the Japanese language community has been relatively inactive in discussions that deal directly with biodiversity topics. Japan has the second-largest number of Twitter users in the world [66]; however, the number of tweets related to biodiversity was fourth among the seven language communities. This can be interpreted in two ways: the Japanese language community had a weak interest in biodiversity, or the Japanese language community does not use the word “biodiversity”, using other words to express this concept, such as “natural richness” or “gift from nature”. We will discuss this in the section below regarding limitations.

4.2. RQ2: What Are the Shared Interests among the Language Communities and the Special Interest of the Japanese Language Community?

Along with the IPBES-CF, all language communities expressed great interest in the indirect drivers’ category (Table 6). International days such as World Environment Day, Biodiversity Day, Earth Day, and World Oceans Day provided notable biodiversity content and attract attention. Zarrabeitia et al. (2022) found that these international days play a large role in positive discussions with collective sentiment [67]. Nature was the second most popular category observed with high betweenness centrality hashtags. This suggests that recapturing and disseminating nature value is effective in stimulating biodiversity discussions on Twitter. Twitter users are also interested in the relationship between the climate and ecosystems, the importance of which was strongly emphasized by IPBES and IPCC [68]. Biodiversity issues are expected to be simultaneously resolved through mitigation of and adaptation to climate change. Among anthropogenic assets, the science and technology to conserve biodiversity is universally recognized as the key in all language communities, indicating that IPBES has a significant role to play in biodiversity mainstreaming. In the direct drivers’ category, the interlinkage between food, biodiversity, and the impacts of desertification, land modification, and COVID-19 were addressed. In particular, human destruction of ecosystems and unsustainable use of NCP enlarge the risk of pandemics, as represented by the COVID-19 pandemic [69,70,71]. Hashtags such as #COVID19 and #OnePlanetSummit suggest that Twitter users focus on biodiversity conservation, which has synergistic effects on the prevention of infectious diseases.
As a feature of the uniqueness of the Japanese language community, it expressed little concern regarding the direct drivers. This result implies that the Japanese language community may have a weak perspective not only on individual species but also on conserving the ecosystem as a whole [72]. In particular, in the other-views category, the Japanese language community used hashtags that focused on biodiversity from a different perspective. For example, #Kaminoseki, the proposed construction site for a nuclear power plant, is a valuable habitat for the Branchiostoma japonicum and the black heron (Gerres equulus) [73], and opponents of nuclear power referred to biodiversity issues from the perspective of energy policy. This type of interlinkage is also important from the perspective of the biodiversity–energy nexus issues, and it will be a key issue in renewable energy mainstreaming [74]. Another hashtag, #Life After, mentioned a game with a system in which biodiversity conservation in the game process provides many benefits to its players. This suggests an interesting potential that the metaverse, including virtual reality games, can play an effective role in mainstreaming biodiversity [75,76].

4.3. RQ3: What Promotes the Interest in Biodiversity in the Japanese Language Community?

In 2010 and 2020, the main concern of the Japanese language community shifted from the hashtags of global top-down elements to local bottom-up elements (Table 5). In the early era of 2010, as individual organizations (#mudef and #nhk) and celebrities (#MISIA) appeared frequently, it indicated that media and influencers provided a considerable contribution to promoting the public conversation on biodiversity. The tweets revealed the most activity during the entire sample period (Figure 4 and Figure 5). This phenomenon, in which certain features rose sharply in frequency as the topic emerged, is called a “burst” of activity [77], which was triggered by major events such as earthquakes, elections, or fires [78]. On the other hand, the diversity index was not high around 2010 (Figure 5d), and hashtags related to the CBD-COP10 accounted for 32.6% of hashtags. In addition, the hashtags #bot and #recobot were used frequently. A “recobot” is an account that analyzes the words of a user’s reply to it to understand the user’s needs and searches for recommended products and information [79]. This type of bot may have been associated with this burst of tweets [80] and can have an impact on the formation of public opinion [81,82].
From 2015 to the present, hashtags for citizen-driven organizations, such as #npo and #ngo, have been used. These agents are innovators in increasing public awareness of biodiversity. From 2015 (oysters and fried shrimp) to 2020 (vegans), commitment to examining the relationship between food systems and biodiversity remained high. Growing biodiversity issues regarding the concept of the planetary health diet can stimulate discussions more broadly [83]. Furthermore, in 2020, the most recent year in the data, #SDG became one of the most used hashtags. SDGs may be growing as a hub of sustainability issues in the biodiversity domain. Viewing biodiversity not as a stand-alone environmental issue but as an issue connected with other environmental and socioeconomic issues inclusively is key to mainstreaming it.

4.4. Limitations

This study has five main limitations. The first relates to bias and accuracy problems that arose from the use of machine translation with Google Translate. Translations using the Google Translate API sometimes do not correctly capture the meaning of the original language. Reports of such errors are updated on the Google Help page from time to time [84]. For example, the Japanese hashtag “#kankyo”, which means “environment” written in the Roman alphabet, was translated to English as “pipe”. The hashtags also include many slang terms or informal language that is only widespread on Twitter, making the translation more complex. Previous studies have used machine translation with Google Translate or analyzed only English hashtags, but this is an important concern in global Twitter analysis that includes various local languages [85]. Another issue is determining appropriate preprocessing and morphological analysis, such as the lower method, substitution, and splitting of connective words at the point of translation into English. This kind of preprocessing can be a bias factor, so we can apply good techniques of using a multi-language tokenizer and lemmatizer coupled with multilingual emotion analysis as shown in Basile (2021) [86].
Second, interpreting the content was also an issue. Our research team understood original tweets in Japanese and English only. Therefore, tweets in other languages could not be investigated in detail because the authors were only able to interpret the texts of the tweets through machine translation. Various problems with the interpretation of social media data have been raised previously [87,88]. These obstacles are expected to be alleviated by having native speakers of each language participate and discuss the tweets together.
Third, as it was only possible to access publicly available tweets at the time of the search, the dataset changed at each search [89,90]. Social networks are dynamic by nature. Users add, change, and delete their content at any time. If a user deletes individual tweets or their entire account, tweets about biodiversity that they posted will also become inaccessible. Thus, quite unlike physical media, digital media are highly variable and fragile, which means that the results will change depending on when the data were saved. We may be able to avoid investigation bias by focusing on a shorter period and frequent collection is required. Notably, Twitter ended the services in July 2023, so the dataset may be dramatically changed before and after the change. This may be a great bias potential for the future studies.
Fourth, only tweets filtered by the query could be extracted. Tweets that did not include the query term “biodiversity” but included tokens that were equivalent to the concept of biodiversity could not be retrieved by the search. A search that uses concepts that are ontologically similar at the semantic level or machine-learning-based natural language processing technology could compensate for this [91,92]. For example, in Japanese, words such as “nature”, “creatures”, and “ecosystem” have comparable definitions.
Fifth, the topic mining and the co-occurrence of interest were not discussed in this study. In this study, we analyzed and evaluated the plain hashtags themselves for identification of the biodiversity interest. However, the detection of the topics that consisted of several hashtags would be very informative to the interest’s detection. The potential solutions for these issues are both topic-modeling technology, which was applied for biodiversity interests in Ohtani (2022) [44], and artificial intelligence-based pattern matching and community detection technology, which were applied in Cauteruccio et al. (2022) [93], Cauteruccio, F. and Terracina, G. (2023) [94], and Taecharungroj (2023) [95]. In addition, co-occurrence analysis is promising. Analyzing topics likely to be discussed at the same time can help identify new interests with strong connections to biodiversity, such as those related to food, water, health, etc. Such an approach is expected to support the analysis of biodiversity nexuses [96].
Overall, the novelty of this research is that our study conducted a quantitative analysis that did not use people’s stated preference data derived from well-structured surveys with limited participants, such as in [11], but used revealed preference data partially proxy to their interests and quantified the historical trend of biodiversity mentions. These results can support comparative analyses with other related surveys conducted in many language communities. This kind of research to monitor the interests using big data on the web is still limited at this point; however, it is a very informative way to evaluate the contributions of various interventions and countermeasures to promote mainstreaming biodiversity in societies.

5. Conclusions

This study investigated the commonalities among seven language communities (six official languages of the United Nations and the Japanese language) and the uniqueness of the Japanese language community with respect to biodiversity by comparing tweets that included the term biodiversity and clarifying changes in the interest and concern for biodiversity from the past to the present. The number of tweets and hashtag statics in each language community were compared for tweets that included “biodiversity” in the six official UN languages and the Japanese language. This study evaluated the level of engagement and diversity of each language community, and it identified perspectives on biodiversity common among the seven language communities or unique to each language community. In particular, the Japanese language community was found to view the relationship between biodiversity and humans from a different perspective than that presented in the scope of the CF. The approach of this study can be useful to identify and monitor the progress of mainstreaming biodiversity and facilitate the communication of shared interests and uniqueness, depending on the local context. It can also present effective interventions and drivers for enhancing interest in biodiversity. In this study, we focused on the uniqueness of the Japanese language community, but we can easily apply the same procedure to other languages to discover their own views on biodiversity.
The SDGs are a set of commonly agreed-upon global goals that establish targets for marine and terrestrial biodiversity in Global Goals 14 and 15, respectively. The SDGs require comprehensive solutions to all international goals in a co-operative manner. The Kunming-Montreal Global Biodiversity Framework, which follows the Aichi Targets, also proposes the importance of the inclusion of biodiversity values in all sectors [6,7]. It is necessary to group tweets by the co-occurrence of interest to clarify the inter between a passion for biodiversity and the SDGs in future work. In addition, although this analysis focused on the Japanese language community, collaborating with scholars from communities around the world is desirable for understanding the concept of biodiversity across traditions and cultures.

Author Contributions

S.I. and C.H. carried out the collection of the tweet database; S.I., C.H. and T.M. worked out almost all the technical details, developed the theoretical formalism, performed the analytic calculations, and performed the numerical simulations; K.H., S.H. and O.S. provided the latest knowledge and deepened the discussion; S.I. and T.M. conceived the original idea and created the draft manuscript; all authors joined the conceptualization and contributed to the finalization. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Environment Research and Technology Development Fund (S-21, JPMEERF23S12100, and 1-2104, JPMEERF20211004) of the Environmental Restoration and Conservation Agency of Japan and by the e-ASIA JRP (Grant Number JPMJSC20E6) of the Japan Science and Technology Agency. This manuscript has not been published elsewhere and is not under consideration by another journal.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. IPBES. Summary for Policymakers of the Global Assessment Report on Biodiversity and Ecosystem Services of the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services; Díaz, S., Settele, J., Brondízio, E.S., Ngo, H.T., Guèze, M., Agard, J., Arneth, A., Balvanera, P., Brauman, K.A., Butchart, S.H.M., et al., Eds.; IPBES Secretariat: Bonn, Germany, 2019; 56p. [Google Scholar] [CrossRef]
  2. IPBES. Summary for Policymakers of the Methodological Assessment Report on the Diverse Values and Valuation of Nature of the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services; Pascual, U., Balavanera, P., Christie, M., Baptiste, B., Gonzalez-Jimenez, D., Anderson, C.B., Athayde, S., Chaplin-Kramer, R., Jacobs, S., Kelemen, E., et al., Eds.; IPBES Secretariat: Bonn, Germany, 2022. [Google Scholar] [CrossRef]
  3. Chan, K.M.A.; Balvanera, P.; Benessaiah, K.; Chapman, M.; Díaz, S.; Gómez-Baggethun, E.; Gould, R.; Hannahs, N.; Jax, K.; Klain, S.; et al. Opinion: Why Protect Nature? Rethinking Values and the Environment. Proc. Natl. Acad. Sci. USA 2016, 113, 1462–1465. [Google Scholar] [CrossRef] [PubMed]
  4. Aichi Biodiversity Targets|Convention on Biological Diversity. Available online: https://www.cbd.int/sp/targets/ (accessed on 7 February 2023).
  5. Secretariat of the Convention on Biological Diversity. Global Biodiversity Outlook 5. Montreal. Canada. Available online: https://www.cbd.int/gbo5 (accessed on 18 August 2023).
  6. Convention on Biological Diversity. National Biodiversity Strategies and Action Plans (NBSAPs). Available online: https://www.cbd.int/nbsap/ (accessed on 7 February 2023).
  7. Ministry of the Environment. The 5th National Biodiversity Strategy and Action Plan. Available online: https://www.biodic.go.jp/biodiversity/about/initiatives5/index.html (accessed on 18 August 2023).
  8. Kunming-Montreal Global Biodiversity Framework|Convention on Biological Diversity. Kunming, China. Available online: https://www.cbd.int/gbf/ (accessed on 18 August 2023).
  9. Ministry of the Environment. The 6th National Biodiversity Strategy and Action Plan. Available online: https://www.biodic.go.jp/biodiversity/about/initiatives/index.html (accessed on 18 August 2023).
  10. Ministry of the Environment. The Plan of Next National Biodiversity Strategy and Action Plan. Available online: https://www.biodic.go.jp/biodiversity/about/initiatives6/dprocess.html (accessed on 18 August 2023).
  11. Ministry of the Environment. Japan Biodiversity Outlook 3; Japan. 2021. Available online: https://www.biodic.go.jp/biodiversity/activity/policy/jbo3/generaloutline/index.html (accessed on 18 August 2023).
  12. Leadley, P.; Gonzalez, A.; Obura, D.; Krug, C.B.; Londoño-Murcia, M.C.; Millette, K.L.; Radulovici, A.; Rankovic, A.; Shannon, L.J.; Archer, E.; et al. Achieving global biodiversity goals by 2050 requires urgent and integrated actions. One Earth 2022, 5, 597–603. [Google Scholar] [CrossRef]
  13. Toivonen, T.; Heikinheimo, V.; Fink, C.; Hausmann, A.; Hiippala, T.; Järv, O.; Tenkanen, H.; Di Minin, E. Social Media Data for Conservation Science: A Methodological Overview. Biol. Conserv. 2019, 233, 298–315. [Google Scholar] [CrossRef]
  14. Du, X.; Kowalski, M.; Varde, A.S.; de Melo, G.; Taylor, R.W. Public Opinion Matters: Mining Social Media Text for Environmental Management. SIGWEB Newsl. 2020, 5, 15. [Google Scholar] [CrossRef]
  15. Puri, M.; Du, X.; Varde, A.S.; de Melo, G. Mapping Ordinances and Tweets Using Smart City Characteristics to Aid Opinion Mining. In Companion of the Web Conference 2018 on The Web Conference 2018—WWW ’18; ACM Press: Lyon, France, 2018; pp. 1721–1728. [Google Scholar] [CrossRef]
  16. Wang, R.-Q.; Mao, H.; Wang, Y.; Rae, C.; Shaw, W. Hyper-Resolution Monitoring of Urban Flooding with Social Media and Crowdsourcing Data. Comput. Geosci. 2018, 111, 139–147. [Google Scholar] [CrossRef]
  17. Tandon, N.; Varde, A.S.; de Melo, G. Commonsense Knowledge in Machine Intelligence. SIGMOD Rec. 2018, 46, 49–52. [Google Scholar] [CrossRef]
  18. Gu, Y.; Qian, Z.; Chen, F. From Twitter to Detector: Real-Time Traffic Incident Detection Using Social Media Data. Transp. Res. Part C Emerg. Technol. 2016, 67, 321–342. [Google Scholar] [CrossRef]
  19. Zhan, X.; Ukkusuri, S.V.; Zhu, F. Inferring Urban Land Use Using Large-Scale Social Media Check-in Data. Netw. Spat. Econ. 2014, 14, 647–667. [Google Scholar] [CrossRef]
  20. Wang, S.; Zhang, X.; Cao, J.; He, L.; Stenneth, L.; Yu, P.S.; Li, Z.; Huang, Z. Computing Urban Traffic Congestions by Incorporating Sparse GPS Probe Data and Social Media Data. ACM Trans. Inf. Syst. 2017, 35, 1–30. [Google Scholar] [CrossRef]
  21. Nuortimo, K. Measuring Public Acceptance with Opinion Mining: The Case of the Energy Industry with Long-Term Coal R&D Investment Projects. JISIB 2018, 8, 6–22. [Google Scholar] [CrossRef]
  22. About Twitter|Our Company and Priorities. Available online: https://about.twitter.com/ (accessed on 7 February 2023).
  23. Twitter, Inc. Financial Information—Quarterly Results. Available online: https://investor.twitterinc.com/financial-information/quarterly-results/default.aspx (accessed on 30 June 2022).
  24. Full-Archive Search Quick Start Guide. Available online: https://developer.twitter.com/en/docs/twitter-api/tweets/search/quick-start/full-archive-search (accessed on 7 February 2023).
  25. Pearce, W.; Holmberg, K.; Hellsten, I.; Nerlich, B. Climate Change on Twitter: Topics, Communities and Conversations about the 2013 IPCC Working Group 1 Report. PLoS ONE 2014, 9, e94785. [Google Scholar] [CrossRef] [PubMed]
  26. Kim, S.C.; Cooke, S.L. Environmental Framing on Twitter: Impact of Trump’s Paris Agreement Withdrawal on Climate Change and Ocean Acidification Dialogue. Cogent Environ. Sci. 2018, 4, 1532375. [Google Scholar] [CrossRef]
  27. Molodtsova, T. Linking Local Weather to Climate Change: One Year of Twitter in the US. Theses and Dissertations. 1574. 2014. Available online: https://commons.und.edu/theses/1574 (accessed on 15 May 2023).
  28. Arlt, D.; Hoppe, I.; Schmitt, J.B.; De Silva-Schmidt, F.; Brüggemann, M. Climate Engagement in a Digital Age: Exploring the Drivers of Participation in Climate Discourse Online in the Context of COP21. Environ. Commun. 2018, 12, 84–98. [Google Scholar] [CrossRef]
  29. Sanford, M.; Witkowska, M.; Gifford, R.; Formanowicz, M. Emotional framing in online environmental activism: Pairing a Twitter study with an offline experiment. Front. Psychol. 2023, 13, 1099331. [Google Scholar] [CrossRef] [PubMed]
  30. Gaytan Camarillo, M.; Ferguson, E.; Ljevar, V.; Spence, A. Big Changes Start with Small Talk: Twitter and Climate Change in Times of Coronavirus Pandemic. Front. Psychol. 2021, 12, 661395. [Google Scholar] [CrossRef] [PubMed]
  31. Loureiro, M.L.; Alló, M. How Has the COVID-19 Pandemic Affected the Climate Change Debate on Twitter? Environ. Sci. Policy 2021, 124, 451–460. [Google Scholar] [CrossRef] [PubMed]
  32. Schuette, S.; Folk, R.A.; Cantley, J.T.; Martine, C.T. The Hidden Heuchera: How Science Twitter Uncovered a Globally Imperiled Species in Pennsylvania, USA. PhytoKeys 2018, 96, 87–97. [Google Scholar] [CrossRef]
  33. Thakur, N. Sentiment Analysis and Text Analysis of the Public Discourse on Twitter about COVID-19 and MPox. Big Data Cogn. Comput. 2023, 7, 116. [Google Scholar] [CrossRef]
  34. Vaz, A.S.; Moreno-Llorca, R.A.; Gonçalves, J.F.; Vicente, J.R.; Méndez, P.F.; Revilla, E.; Santamaria, L.; Bonet-García, F.J.; Honrado, J.P.; Alcaraz-Segura, D. Digital conservation in biosphere reserves: Earth observations, social media, and nature’s cultural contributions to people. Conserv. Lett. 2020, 13, e12704. [Google Scholar] [CrossRef]
  35. Daume, S. Mining Twitter to monitor invasive alien species—An analytical framework and sample information topologies. Ecol. Inform. 2016, 31, 70–82. [Google Scholar] [CrossRef]
  36. Tenkanen, H.; Di Minin, E.; Heikinheimo, V.; Hausmann, A.; Herbst, M.; Kajala, L.; Toivonen, T. Instagram, Flickr, or Twitter: Assessing the Usability of Social Media Data for Visitor Monitoring in Protected Areas. Sci. Rep. 2017, 7, 17615. [Google Scholar] [CrossRef]
  37. Jarić, I.; Courchamp, F.; Gessner, J.; Roberts, D.L. Data Mining in Conservation Research Using Latin and Vernacular Species Names. PeerJ 2016, 4, e2202. [Google Scholar] [CrossRef]
  38. Papworth, S.K.; Nghiem, T.P.L.; Chimalakonda, D.; Posa, M.R.C.; Wijedasa, L.S.; Bickford, D.; Carrasco, L.R. Quantifying the role of online news in linking conservation research to Facebook and Twitter. Conserv. Biol. 2015, 29, 825–833. [Google Scholar] [CrossRef] [PubMed]
  39. Hawkins, R.; Silver, J.J. From Selfie to #sealfie: Nature 2.0 and the Digital Cultural Politics of an Internationally Contested Resource. Geoforum 2017, 79, 114–123. [Google Scholar] [CrossRef]
  40. Macdonald, D.W.; Jacobsen, K.S.; Burnham, D.; Johnson, P.J.; Loveridge, A.J. Cecil: A Moment or a Movement? Analysis of Media Coverage of the Death of a Lion, Panthera Leo. Animals 2016, 6, 26. [Google Scholar] [CrossRef]
  41. Hammond, N.L.; Dickman, A.; Biggs, D. Examining attention given to threats to elephant conservation on social media. Conserv. Sci. Pract. 2022, 4, e12785. [Google Scholar] [CrossRef]
  42. Kidd, L.R.; Gregg, E.A.; Bekessy, S.A.; Robinson, J.A.; Garrard, G.E. Tweeting for their lives: Visibility of threatened species on twitter. J. Nat. Conserv. 2018, 46, 106–109. [Google Scholar] [CrossRef]
  43. Cooper, M.W. Developing a Global Indicator for Aichi Target 1 by Merging Online Data Sources to Measure Biodiversity Awareness and Engagement. Biol. Conserv. 2019, 230, 29–36. [Google Scholar] [CrossRef]
  44. Ohtani, S. How Is People’s Awareness of “Biodiversity” Measured? Using Sentiment Analysis and LDA Topic Modeling in the Twitter Discourse Space from 2010 to 2020. SN Comput. Sci. 2022, 3, 371. [Google Scholar] [CrossRef] [PubMed]
  45. Barrios-O’Neill, D. Focus and social contagion of environmental organization advocacy on Twitter. Conserv. Biol. 2021, 35, 307–315. [Google Scholar] [CrossRef]
  46. Reyes-Menendez, A.; Saura, J.R.; Alvarez-Alonso, C. Understanding #WorldEnvironmentDay User Opinions in Twitter: A Topic-Based Sentiment Analysis Approach. Int. J. Environ. Res. Public Health 2018, 15, 2537. [Google Scholar] [CrossRef] [PubMed]
  47. Twitter API. Available online: https://developer.twitter.com/en/docs/twitter-api (accessed on 10 August 2023).
  48. F Search Tweets. Available online: https://developer.twitter.com/en/docs/twitter-api/tweets/search/api-reference/get-tweets-search-all (accessed on 10 August 2023).
  49. United Nations. Official Languages. United Nations. Available online: https://www.un.org/en/our-work/official-languages (accessed on 7 February 2023).
  50. Derhab, A.; Alawwad, R.; Dehwah, K.; Tariq, N.; Khan, F.A.; Al-Muhtadi, J. Tweet-Based Bot Detection Using Big Data Analytics. IEEE Access 2021, 9, 65988–66005. [Google Scholar] [CrossRef]
  51. How to Use Hashtags. Available online: https://help.twitter.com/ja/using-twitter/how-to-use-hashtags (accessed on 7 February 2023).
  52. Springer, A.E.H. The Private Sector & UN Sustainable Development Goals: An Evaluation of Progress Using Twitter. Master’s Thesis, The University of Texas at Austin, Austin, TX, USA, 2020. [Google Scholar] [CrossRef]
  53. Patuelli, A.; Saracco, F. Sustainable Development Goals as Unifying Narratives in Large UK Firms’ Twitter Discussions. arXiv 2022, arXiv:2207.14664. [Google Scholar] [CrossRef] [PubMed]
  54. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
  55. Python Client Library|Cloud Translation. Google Cloud. Available online: https://cloud.google.com/translate/docs/reference/libraries/v2/python (accessed on 7 February 2023).
  56. Oesper, L.; Merico, D.; Isserlin, R.; Bader, G.D. WordCloud: A Cytoscape Plugin to Create a Visual Semantic Summary of Networks. Source Code Biol. Med. 2011, 6, 7. [Google Scholar] [CrossRef] [PubMed]
  57. Díaz, S.; Demissew, S.; Carabias, J.; Joly, C.; Lonsdale, M.; Ash, N.; Larigauderie, A.; Adhikari, J.R.; Arico, S.; Báldi, A.; et al. The IPBES Conceptual Framework—Connecting nature and people. Curr. Opin. Environ. Sustain. 2015, 14, 1–16. [Google Scholar] [CrossRef]
  58. Tutorial—NetworkX 2.6.2 Documentation. Available online: https://networkx.org/documentation/stable/tutorial.html (accessed on 7 February 2023).
  59. Japanese Singer MISIA Appointed Honorary Ambassador for Tenth Conference of Parties to UN Convention on Biodiversity, in Nagoya, 18–29 October|Meetings Coverage and Press Releases. Available online: https://www.un.org/press/en/2010/note6244.doc.htm (accessed on 7 February 2023).
  60. Farmed Shrimp|Industries|WWF. World Wildlife Fund. Available online: https://www.worldwildlife.org/industries/farmed-shrimp (accessed on 7 February 2023).
  61. Mudef|Music Design Foundation. Mudef. Available online: https://mudef.jp/ (accessed on 7 February 2023).
  62. NHK World-Japan. NHK World. Available online: https://www3.nhk.or.jp/nhkworld/ (accessed on 7 February 2023).
  63. Click-to-Donate Site. Wikipedia. Available online: https://en.wikipedia.org/wiki/Click-to-donate_site (accessed on 18 August 2023).
  64. The Campaign of Gakkepuchi No Seibutsutayousei. Available online: http://gakeiki.web.fc2.com/ (accessed on 13 February 2023).
  65. Chan, K.M.A.; Boyd, D.R.; Gould, R.K.; Jetzkowitz, J.; Liu, J.; Muraca, B.; Naidoo, R.; Olmsted, P.; Satterfield, T.; Selomane, O.; et al. Levers and Leverage Points for Pathways to Sustainability. People Nat. 2020, 2, 693–717. [Google Scholar] [CrossRef]
  66. Twitter: Most Users by Country. Statista. Available online: https://www.statista.com/statistics/242606/number-of-active-twitter-users-in-selected-countries/ (accessed on 7 February 2023).
  67. Zarrabeitia-Bilbao, E.; Rio-Belver, R.-M.; Alvarez-Meaza, I.; Martínez de Alegría-Mancisidor, I. World Environment Day: Understanding Environmental Programs Impact on Society Using Twitter Data Mining. Soc. Indic. Res. 2022, 164, 263–284. [Google Scholar] [CrossRef]
  68. Portner, H.O.; Scholes, R.J.; Agard, J.; Archer, E.; Arneth, A.; Bai, X.; Barnes, D.; Burrows, M.; Chan, L.; Cheung, W.L.; et al. IPBES-IPCC Co-Sponsored Workshop Report Synopsis on Biodiversity and Climate Change; Zenodo: Meyrin, Switzerland, 2021. [Google Scholar] [CrossRef]
  69. Díaz, S.; Pascual, U.; Stenseke, M.; Martín-López, B.; Watson, R.T.; Molnár, Z.; Hill, R.; Chan, K.M.A.; Baste, I.A.; Brauman, K.A.; et al. Assessing Nature’s Contributions to People. Science 2018, 359, 270–272. [Google Scholar] [CrossRef]
  70. IPBES. Workshop Report on Biodiversity and Pandemics of the Intergovernmental Platform on Biodiversity and Ecosystem Services (IPBES); Zenodo: Meyrin, Switzerland, 2020. [Google Scholar] [CrossRef]
  71. Malakar, K.; Majumder, P.; Lu, C. Twitterati on COVID-19 pandemic-environment linkage: Insights from mining one year of tweets. Environ. Dev. 2023, 46, 100835. [Google Scholar] [CrossRef]
  72. Saeki, I.; Yokogawa, M.; Sashimura, N.; Ashizawa, K.; Ohtani, M.; Kawano, N.; Akashi, K.; Furumoto, R. Endangered Ecosystems: An Approach beyond the Conservation of Species. Jpn. J. Conserv. Ecol. 2013, 18, 187–201. [Google Scholar] [CrossRef]
  73. Ankei, Y. Biodiversity of the Suo-Nada Sea and Kaminoseki Nuclear Power Plant Project. Jpn. J. Conserv. Ecol. 2003, 8, 83–86. [Google Scholar] [CrossRef]
  74. Tanaka, K.; Haga, C.; Hori, K.; Matsui, T. Renewable Energy Nexus: Interlinkages with Biodiversity and Social Issues in Japan. Energy Nexus 2022, 6, 100069. [Google Scholar] [CrossRef]
  75. Schulze, J.; Martin, R.; Finger, A.; Henzen, C.; Lindner, M.; Pietzsch, K.; Werntze, A.; Zander, U.; Seppelt, R. Design, Implementation and Test of a Serious Online Game for Exploring Complex Relationships of Sustainable Land Management and Human Well-Being. Environ. Model. Softw. 2015, 65, 58–66. [Google Scholar] [CrossRef]
  76. Lombana, D.A.B.; Ventura, R.B.; Chen, Y.T.; Porfiri, M. Educating Youth About Human Impact on Freshwater Ecosystems Using an Online Serious Game. IEEE Trans. Games 2022. [Google Scholar] [CrossRef]
  77. Kleinberg, J. Bursty and Hierarchical Structure in Streams. Data Min. Knowl. Discov. 2003, 7, 373–397. [Google Scholar] [CrossRef]
  78. Recommend BOT (β(@recobot)/Twitter. Twitter. Available online: https://twitter.com/recobot (accessed on 7 February 2023).
  79. The Information of Media Post|ALBERT Co., Ltd., ALBERT Co., Ltd. Available online: https://www.albert2005.co.jp/corporate/media/page/29 (accessed on 7 February 2023).
  80. Toriumi, F.; Sakaki, T.; Yoshida, M. Social Emotions Under the Spread of COVID-19 Using Social Media. Trans. Jpn. Soc. Artif. Intell. 2020, 35, F-K45_1-7. [Google Scholar] [CrossRef]
  81. Toriumi, F. Topic Analysis for Burst Phenomena. Inf. Process. Soc. Jpn. 2017, 58, 1287–1299. [Google Scholar]
  82. Chen, C.-F.; Shi, W.; Yang, J.; Fu, H.-H. Social Bots’ Role in Climate Change Discussion on Twitter: Measuring Standpoints, Topics, and Interaction Strategies. Adv. Clim. Chang. Res. 2021, 12, 913–923. [Google Scholar] [CrossRef]
  83. Willett, W.; Rockström, J.; Loken, B.; Springmann, M.; Lang, T.; Vermeulen, S.; Garnett, T.; Tilman, D.; DeClerck, F.; Wood, A.; et al. Food in the Anthropocene: The EAT–Lancet Commission on Healthy Diets from Sustainable Food Systems. Lancet 2019, 393, 447–492. [Google Scholar] [CrossRef]
  84. Social Gathering for Harmony with Nature|Biodiversity. Available online: https://www.biodic.go.jp/biodiversity/activity/policy/kyosei/ (accessed on 7 February 2023).
  85. Error Conversation. Available online: https://groups.google.com/g/google-translate-api/search?q=detect+&pli=1 (accessed on 30 November 2022).
  86. Basile, V.; Cauteruccio, F.; Terracina, G. How Dramatic Events Can Affect Emotionality in Social Posting: The Impact of COVID-19 on Reddit. Futur. Internet 2021, 13, 29. [Google Scholar] [CrossRef]
  87. Otero, P.; Gago, J.; Quintas, P. Twitter Data Analysis to Assess the Interest of Citizens on the Impact of Marine Plastic Pollution. Mar. Pollut. Bull. 2021, 170, 112620. [Google Scholar] [CrossRef] [PubMed]
  88. Olteanu, A.; Castillo, C.; Diaz, F.; Kıcıman, E. Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries. Front. Big Data 2019, 2, 13. [Google Scholar] [CrossRef] [PubMed]
  89. Bruns, A.; Stieglitz, S. Twitter Data: What Do They Represent? It-Inf. Technol. 2014, 56, 240–245. [Google Scholar] [CrossRef]
  90. Search Tweets Introduction. Available online: https://developer.twitter.com/en/docs/twitter-api/tweets/search/introduction (accessed on 30 November 2022).
  91. Kumazawa, T.; Saito, O.; Kozaki, K.; Matsui, T.; Mizoguchi, R. Toward Knowledge Structuring of Sustainability Science Based on Ontology Engineering. Sustain. Sci. 2009, 4, 99–116. [Google Scholar] [CrossRef]
  92. Matsui, T.; Suzuki, K.; Ando, K.; Kitai, Y.; Haga, C.; Masuhara, N.; Kawakubo, S. A Natural Language Processing Model for Supporting Sustainable Development Goals: Translating Semantics, Visualizing Nexus, and Connecting Stakeholders. Sustain. Sci. 2022, 17, 969–985. [Google Scholar] [CrossRef]
  93. Cauteruccio, F.; Corradini, E.; Terracina, G.; Ursino, D.; Virgili, L. Extraction and analysis of text patterns from NSFW adult content in Reddit. Data Knowl. Eng. 2022, 138, 101979. [Google Scholar] [CrossRef]
  94. Cauteruccio, F.; Terracina, G. Extended High-Utility Pattern Mining: An Answer Set Programming-Based Framework and Applications. Theory Pract. Log. Program. 2023, 1–31. [Google Scholar] [CrossRef]
  95. Taecharungroj, V. “What Can ChatGPT Do?” Analyzing Early Reactions to the Innovative AI Chatbot on Twitter. Big Data Cogn. Comput. 2023, 7, 35. [Google Scholar] [CrossRef]
  96. Secretariat. Nexus Assessment. IPBES Secretariat. Available online: https://ipbes.net/nexus (accessed on 30 November 2022).
Figure 1. Transformative change in global sustainability pathways (IPBES, 2019) [1]. This image depicts the connected processes from the cultural aspect, indirect drivers, human activities, and direct drivers of ecosystems, as well as eight leverage points that are essential for transformation and five levers to enable transformative change. License: Creative Commons Attribution 4.0 International (CC BY 4.0).
Figure 1. Transformative change in global sustainability pathways (IPBES, 2019) [1]. This image depicts the connected processes from the cultural aspect, indirect drivers, human activities, and direct drivers of ecosystems, as well as eight leverage points that are essential for transformation and five levers to enable transformative change. License: Creative Commons Attribution 4.0 International (CC BY 4.0).
Sustainability 15 12893 g001
Figure 2. Overall structure from collecting tweets to analysis and pseudocode. The upper part of Figure 2 is the overall analytical flow. The input is the Twitter dataset, and the outputs are quantitative and qualitative assessments of biodiversity interests on Twitter. The lower part of Figure 2 is the pseudocode of the calculation process in the analysis, which is described in detail below.
Figure 2. Overall structure from collecting tweets to analysis and pseudocode. The upper part of Figure 2 is the overall analytical flow. The input is the Twitter dataset, and the outputs are quantitative and qualitative assessments of biodiversity interests on Twitter. The lower part of Figure 2 is the pseudocode of the calculation process in the analysis, which is described in detail below.
Sustainability 15 12893 g002
Figure 3. IPBES-CF [57]. Anthropogenic assets, institutions and governance, and other indirect drivers define the dynamics of direct drivers and the system of nature, nature’s contribution to people, and good quality of life. The next institutions and governance and other indirect drivers are established based on the status of quality of life, which form multiple feedback loops of socioecological systems.
Figure 3. IPBES-CF [57]. Anthropogenic assets, institutions and governance, and other indirect drivers define the dynamics of direct drivers and the system of nature, nature’s contribution to people, and good quality of life. The next institutions and governance and other indirect drivers are established based on the status of quality of life, which form multiple feedback loops of socioecological systems.
Sustainability 15 12893 g003
Figure 4. Time series of the number of related tweets according to language. The x-axis represents the year, and the y-axis represents the proportion of the number of tweets in each year to the total number of relevant tweets in a given language over the entire sample period.
Figure 4. Time series of the number of related tweets according to language. The x-axis represents the year, and the y-axis represents the proportion of the number of tweets in each year to the total number of relevant tweets in a given language over the entire sample period.
Sustainability 15 12893 g004
Figure 5. Time series of RT, like, reply, and diversity index according to language. An increase in (ac) implies that the topics related to biodiversity were active, and an increase in (d) implies that the biodiversity interests were diversified.
Figure 5. Time series of RT, like, reply, and diversity index according to language. An increase in (ac) implies that the topics related to biodiversity were active, and an increase in (d) implies that the biodiversity interests were diversified.
Sustainability 15 12893 g005aSustainability 15 12893 g005b
Figure 6. Venn diagram of hashtags used among Japanese and the other six languages communities. The Japanese difference set consists of hashtags mentioned only in the Japanese language community; intersection refers to the set of hashtags that appeared in all language communities; and the six languages’ difference set includes hashtags that appeared in all six languages. The size of the words displayed in the intersection is proportional to the value of the harmonic mean in Table 5. The number of hashtags displayed in the word cloud was limited to 50 for visibility.
Figure 6. Venn diagram of hashtags used among Japanese and the other six languages communities. The Japanese difference set consists of hashtags mentioned only in the Japanese language community; intersection refers to the set of hashtags that appeared in all language communities; and the six languages’ difference set includes hashtags that appeared in all six languages. The size of the words displayed in the intersection is proportional to the value of the harmonic mean in Table 5. The number of hashtags displayed in the word cloud was limited to 50 for visibility.
Sustainability 15 12893 g006
Figure 7. Comparison of each of the other six languages with Japanese. In all the Venn diagrams, from (af), the Japanese difference set is visualized on the left, the intersection set between Japanese and the compared language community is in the center, and the compared language community’s difference set is on the right. The 30 most common hashtags in Table 4 were excluded from the intersection set. The number of hashtags displayed in the word cloud was limited to 50 to ensure readability.
Figure 7. Comparison of each of the other six languages with Japanese. In all the Venn diagrams, from (af), the Japanese difference set is visualized on the left, the intersection set between Japanese and the compared language community is in the center, and the compared language community’s difference set is on the right. The 30 most common hashtags in Table 4 were excluded from the intersection set. The number of hashtags displayed in the word cloud was limited to 50 to ensure readability.
Sustainability 15 12893 g007aSustainability 15 12893 g007bSustainability 15 12893 g007c
Table 1. Collected tweets according to language.
Table 1. Collected tweets according to language.
LanguageSearch WordTweets
Japanese生物多様性301,571
Arabicلتنوع البيولوجي14,975
Chinese生物多样性4971
Englishbiodiversity2,925,742
Frenchbiodiversité466,840
Russianбиopaзнooбpaзиe3450
Spanishbiodiversidad1,422,724
Note: the languages are presented in alphabetical order except for Japanese.
Table 2. Statistics of hashtag frequencies.
Table 2. Statistics of hashtag frequencies.
LanguageCollected TweetsCumulative Number of Hashtags UsedHashtags per TweetCumulative Number of Tags Used/Number of Unique HashtagsOccupied Proportion by
Top 10% HashtagsTop 1% Hashtags
Japanese301,57182,2030.275.20.750.04
Arabic14,97516,2781.093.90.660.10
Chinese497119740.402.20.470.05
English2,925,7424,953,3941.6912.80.890.15
French466,840550,5981.187.20.810.16
Russian345028600.832.40.540.13
Spanish1,422,7241,423,7351.007.40.810.09
Table 3. Top 10 hashtags according to language.
Table 3. Top 10 hashtags according to language.
OriginalTranslatedTag/Cummulative Tags (%)OriginalTranslatedTag/Cummulative Tags (%)
JapaneseArabic
1bdjpBDJP3.77التنوع_البيولوجيBiodiversity10.18
2cop10COP 102.88صديق_البيئةenvironmentally friendly1.47
3生物多様性Biological2.60موتمر_التنوع_البيولوجيConference of the biological diversity1.44
4biodiversitybiodiversity1.92السيسيSisi1.43
5SDGsSDGs1.50الإماراتUAE1.21
6エビフライFried Shrimp1.37البيئةThe environment1.16
7かきあげPounding1.33مصرEgypt1.10
8ngongo1.25أبوظبيAbu Dhabi1.10
9nponpo1.24قطرDiameter0.92
10COP 10COP 101.21من_أجل_الطبيعةFrom_Al_Al-nature0.88
ChineseEnglish
1ChinaChina4.56biodiversity 14.91
2COP 15COP 152.63Biodiversity 3.28
3生物多样性Biological diversity2.03nature 2.27
4中国China1.93conservation 1.22
5新闻news1.67environment 0.93
6世界world1.32climate change 0.89
7西藏Tibet1.17wildlife 0.88
8TibetTibet1.06photography 0.81
9钓鱼岛Diaoyu Islands0.96500pxrtg 0.77
10biodiversitybiodiversity0.91flora 0.77
FrenchRussian
1biodiversitébiodiversity16.02биoразнooбразиеbiodiversity12.66
2biodiversitebiodiversity3.07экoлoгияecology2.90
3BiodiversitéBiodiversity2.29БиoразнooбразиеBiodiversity2.69
4environnementenvironment2.02biodiversitybiodiversity1.71
5naturenature1.24ООНUN1.57
6climatclimate0.98прирoдаNature1.36
7EnvironnementEnvironment0.63радиприрoдыRadiriroda1.12
8ecologieecology0.48UNUN1.01
9agricultureagriculture0.46SocotraSocotra1.01
10NatureNature0.41SaveSocotraSavesocotra1.01
Spanish
1biodiversidadbiodiversity9.12
2BiodiversidadBiodiversity5.92
3MedioAmbienteEnvironment1.67
4medioambienteenvironment1.09
5PlanetaPlanet0.99
6naturalezanature0.67
7CambioClimáticoClimate change0.49
8planetaplanet0.49
9ColombiaColombia0.41
10MéxicoMexico0.39
Note 1: Retweets and special symbols were excluded. Note 2: The words in the translated column are hashtags as translated from each language into English by Google Translate [54]. However, the accuracies of the translations were not evaluated due to linguistic limitations.
Table 4. Common hashtags in all language communities.
Table 4. Common hashtags in all language communities.
Intersection (N = 30)
TagJapaneseArabicChineseEnglishFrenchRussianSpanishOverall
1biodiversity1.91600.36860.911914.914616.018612.65739.12141.5055
2Biodiversity0.315110.17940.15203.28032.29242.69235.91700.6287
3nature0.15940.13520.05072.26901.23920.41960.67490.1817
4COP 150.13140.47302.63420.13580.14910.17480.04540.1349
5BiodiversityDay0.06450.36860.10130.23240.13090.41960.06620.1217
6news0.62040.13521.67170.09660.01980.76920.08110.0832
7Science0.04140.04300.05070.15390.05900.13990.15280.0671
8Nature0.01090.06760.05070.45440.41061.36360.30570.0520
9WorldEnvironmentDay0.01340.18430.10130.25780.03670.24480.02250.0413
10SDGs1.50480.12900.05070.23380.01470.13990.01100.0353
11Research0.04870.00610.05070.02490.02740.03500.03310.0207
12EarthDay0.00490.01840.10130.09280.03400.10490.02990.0198
13Climate0.00490.01230.05070.13840.37920.03500.01780.0174
14Japan0.02550.03690.05070.01510.00650.03500.01380.0172
15climate0.00490.00610.05070.50060.97970.55940.06100.0171
16Germany0.04620.04910.05070.01380.00530.10490.01260.0170
17WorldOceansDay0.00970.01230.05070.03520.01830.06990.00770.0163
18ecology0.00360.00610.05070.30970.48402.90210.34960.0151
19China0.06810.50994.55930.05610.00180.06990.03580.0112
20FAO0.00240.01840.05070.01260.01250.69930.02380.0102
21Technology0.00240.00610.10130.01550.00420.03500.04470.0075
22Agriculture0.00240.00610.10130.06290.18710.20980.00240.0068
23OnePlanetSummit0.00120.03070.05070.01860.09410.17480.00890.0066
24France0.00850.22730.10130.01660.25680.03500.00110.0063
25Russia0.00240.02460.05070.00690.00440.38460.00330.0061
26Desertification0.00360.03690.05070.00450.00150.03500.00880.0051
27today0.00490.04300.05070.00270.00040.06990.00530.0020
28UNEP0.00850.11670.05070.03270.00040.17480.00140.0019
29COVID190.01820.04910.05070.11830.11970.17480.00030.0019
30fornature0.00120.01230.05070.02010.00040.06990.00090.0014
Six Languages Difference
TagJapaneseArabicChineseEnglishFrenchRussianSpanishOverall
1Environment00.19040.05070.41230.62530.03501.67210.1031
2ecosystem00.01230.05070.20590.04340.10490.08110.0398
3protection00.01230.05070.01580.07370.06990.01470.0231
4Poland00.00610.10130.00290.04010.03500.00340.0070
5IDB201800.01230.10130.00410.00200.03500.00290.0050
6Pakistan00.00610.10130.02170.00130.03500.00050.0020
Note: The numbers indicate the proportion of the cumulative number of times the hashtag was mentioned to the total number of hashtags over the sample period for a given language community. The overall value of each row is the harmonic mean of the proportions of all language communities.
Table 5. Transition of popular hashtags in the Japanese language community.
Table 5. Transition of popular hashtags in the Japanese language community.
Rank201020152020
1bdjpTemprabiodiversity
2cop10Fried shrimpSDGs
3biodiversitynponote
4COP10ngonews
5MISIAbiodiversityInternational biodiversity day
6tenenClick donationvegan
7mudefbookmetervegan
8gakeiki_missionnewsliving things
9twvtKyushu University pressvegetarian
10nhkenvironmental protectionenvironment
Note: Japanese hashtags were manually translated into English by the author.
Table 6. Comparison of hashtags and IPBES-CF.
Table 6. Comparison of hashtags and IPBES-CF.
Framework Proportion of Tags
(N = 31)
Tag Examples
1 Nature0.23biodiversity, Biodiversity, nature, Nature,
Climate, climate, ecology
2 Nature’s benefits to people0.00-
3 Anthropogenic assets0.10Science, Research, Technology
4 Indirect drivers0.53COP 15, BiodiversityDay, news, WorldEnvironmentDay,
SDGs, EarthDay, Japan, Germany, WorldOceansDay, China, FAO,
OnePlanetSummit, France, Russia, UNEP, fornature
5 Direct drivers0.10Agriculture, Desertification, COVID19
6 Good quality of life0.00-
Other views0.03today
Note: the hashtags listed in “Tag examples” are given in descending order of the overall values presented in Table 4.
Table 7. Comparison of unique hashtags of Japanese and English language communities according to the conceptual framework element.
Table 7. Comparison of unique hashtags of Japanese and English language communities according to the conceptual framework element.
FrameworkProportion of Tags
N = 117
JapaneseEnglishp Value
1. Nature0.030.090.098
2. Nature’s contributions to people0.020.030.408
3. Anthropogenic assets0.070.110.253
4.I ndirect drivers0.670.560.107
5. Direct drivers0.040.160.003
6. Good quality of life0.030.040.472
Other views0.150.000.000
Table 8. The highest betweenness centers in each language community.
Table 8. The highest betweenness centers in each language community.
JapaneseBetweenness
Centrality
ArabicBetweenness
Centrality
ChineseBetweenness
Centrality
EnglishBetweenness
Centrality
1biodiversity0.371Diameter0.141COP150.371Biodiversity0.15
2cop100.252Kuwait0.072biodiversity0.142Wildlife0.06
3Nagoya0.133Abu Dhabi0.073Biological diversity0.133Nature0.05
4surroundings0.134UAE0.064COP15 in Kunming0.094nature0.04
5Quiet life0.125Egypt0.065Conservation0.075Environment0.04
6BDJP0.116Climate change0.056SDG150.056Conservation0.02
7news0.097Today-the world_of the environment0.057Connect2Earth0.027climate change0.02
8COP100.088Sisi0.058Endangered Species0.028conservation0.01
9biology0.079France0.049Hype0.029environment0.01
10nhk0.0710Sharm El-Shaikh0.0310BRI0.0210wildlife0.01
FrenchBetweenness Centrality RussianBetweenness Centrality SpanishBetweenness Centrality
1Nature0.071ecology0.131Biodiversity0.48
2Biodiversity0.072Biodiversity0.122Environment0.05
3biodiversity0.053FAO0.053Nature0.05
4Environment0.034biodiversity0.044nature0.03
5Ecology0.025Tsur0.045app0.02
6environment0.026AND0.046Corlantatura0.02
7Hunt0.017soil0.037Space0.02
8PAC0.018Nature0.038Wetlands0.02
9Marseille0.019UN0.039environment0.02
10animals0.0110scientificrussia0.0310conservación0.01
Table 9. Time series of hashtags with the highest betweenness centralities in the Japanese language community.
Table 9. Time series of hashtags with the highest betweenness centralities in the Japanese language community.
HashtagsBetween
Centrality
HashtagsBetween
Centrality
HashtagsBetween
Centrality
2009 2010 2011
1bdjp1.000cop100.398cop100.436
2 bdjp0.291bdjp0.308
3 biodiversity0.171kaminoseki0.245
4 COP100.160biodiversity(inEnglish)0.234
5 tokyo0.089biodiversity(chinesecharacter)0.193
6 eco0.079genpatsu0.145
7 CBD0.073COP100.098
8 nagoya0.068CBD0.085
9 followdaibosyu0.053kankyo0.073
10 followmejp0.0521 year passed after COP100.056
2012 2013 2014
1biodiversity0.659nature0.446biodiversity0.349
2environment0.483living things0.382npo0.282
3yokohama0.138environment0.275environment0.266
4clam0.065biodiversity0.218ngo0.256
5yokohama0.065ngo0.217sea0.101
6kankyo0.065npo0.217livingthing0.077
7eco0.065CSR0.043ESD0.027
8living things0.039Henoko0.043science0.026
9science0.039event0.043COP120.026
10animal0.039science0.016kindle0.026
2017 2018 2019
1biology0.527biodiversity0.616biodiversity0.871
2environment0.495biology0.388My world through a viewfinder0.143
3nature0.495International day for biodiversity0.314extinction0.132
4blog0.143Okinawa0.256crowdfunding0.121
5 SDGs0.221palm oil0.120
6 environment0.129International day for biodiversity0.104
7 nature0.128Borneo0.094
8 Henoko0.124Climate Change0.070
9 ESG0.062SDGs0.064
10 What day is today0.032ZeroHunger0.059
2020 2021
1biodiversity0.862biodiversity0.636
2International day for biodiversity0.114news0.238
3animal0.093SDGs0.165
4climate crisis0.085living things0.128
5living thing0.082environment0.106
6news0.071diversity0.075
7Climate Crisis0.058International day for biodiversity0.060
8Naniwa eco meeting0.052climate change0.058
9Hanno0.052animal0.058
10Aichi0.051satoyama0.045
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ishida, S.; Matsui, T.; Haga, C.; Hori, K.; Hashimoto, S.; Saito, O. Twitter Mining for Detecting Interest Trends on Biodiversity: Messages from Seven Language Communities. Sustainability 2023, 15, 12893. https://doi.org/10.3390/su151712893

AMA Style

Ishida S, Matsui T, Haga C, Hori K, Hashimoto S, Saito O. Twitter Mining for Detecting Interest Trends on Biodiversity: Messages from Seven Language Communities. Sustainability. 2023; 15(17):12893. https://doi.org/10.3390/su151712893

Chicago/Turabian Style

Ishida, Shu, Takanori Matsui, Chihiro Haga, Keiko Hori, Shizuka Hashimoto, and Osamu Saito. 2023. "Twitter Mining for Detecting Interest Trends on Biodiversity: Messages from Seven Language Communities" Sustainability 15, no. 17: 12893. https://doi.org/10.3390/su151712893

APA Style

Ishida, S., Matsui, T., Haga, C., Hori, K., Hashimoto, S., & Saito, O. (2023). Twitter Mining for Detecting Interest Trends on Biodiversity: Messages from Seven Language Communities. Sustainability, 15(17), 12893. https://doi.org/10.3390/su151712893

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop