1. Introduction
Investor sentiment has aroused rising attention from financial investors and literature for its broad and multiplex impact on financial markets. Due to social media and market integration, investors’ emotions are easily propagated across populations and countries, which then forms a transmission network [
1,
2]. Literature has found that investor sentiment is transmissible across investors, firms, and markets [
3,
4]. As a result, local volatility of investor sentiment propagates quickly along the network and causes systemic impacts. Furthermore, since the limit to short arbitrage, the markets are dominated by optimistic investors but do not reflect sufficient pessimistic emotions, which causes the markets to become overheated. It is the trigger for most historical financial crises. In addition, once the bubbles burst, the panic quickly spreads among investors and lead to a market crash, such as the bank run and underselling [
5]. Thus, it is vital to explore the transmission patterns of investor sentiment and understand its characteristics in order to monitor the market shock.
Among research on investor sentiment transmission, the spillover effect makes investor sentiments of different firms interactive, and the effect of multiple temporal scales makes the interactions complex [
6,
7,
8]. To explore those effects, prior literature tends to use financial data to reflect the multi-scale effect [
9], and use correlation measurements or agent-model-based simulations to abstract the sentiment transmission [
10], few have constructed a composite set of methods and tools according to the complex interactiveness and multi-scale of investor sentiment transmission.
Specifically, first, the emotions and attitudes of investors toward the market prospect diffuse through online comments and analyst reports, which makes individual emotions become population sentiment. In addition, globalization, production specialization and complex competition–cooperation relationships among market participants also make firms around the world interdependent. The sentiment of a certain stock will influence its price, which is then observed by more investors and affects their attitudes toward the stock. Furthermore, the sentiment toward a certain asset will affect the investors’ attitudes towards other assets and derivatives due to interdependencies among firms, industries, and markets [
11]. Those factors jointly result in the interactiveness of investor sentiment transmission. To well represent and investigate such interactiveness, complex network analysis is useful, which admits the complex interactions among investors, firms, and markets and constructs the topological structure of sentiment transmission.
Second, investors are the unity of the “animal spirit” and “rational-economic man” [
12,
13,
14]. The characteristics of irrational emotions are often transmitted by violent fluctuations in a short time. Rational emotions represent investors’ long-term understanding of the investment objects. This rational understanding has difficulties in undergoing drastic changes, such as irrational emotions, in a short time. Instead, it evolves and propagates gradually over a long time scale. In addition, the short-term irrational emotion induced by emergencies and bad news will be gradually modified by the long-term rational emotions induced by aggregate economic fundamentals [
15]. The coexistence of irrational and rational emotions leads to different transmission modes of investor sentiment on different time scales. This gives rise to the second attribute of investor sentiment transmission, i.e., multi-scale. From the time scale, investors can be generally divided into three categories: high-frequency traders who pay attention to very short-term strategies, technical investors who make profits through medium-term trading, and long-term investors who focus on long-term value investment [
16]. Investors with different time scales will have different expectations for identical market information and reflect different emotions, which further leads to the different diffusion and transmission of investor sentiment on different time scales [
17]. Thus, exploring the multi-scale differences in investor sentiment transmission can help to build an understanding of the intensity and patterns of rational and irrational emotion transmission in the short and long term.
Therefore, considering the above attributes of investor sentiment transmission and the lack of literature addressing these attributes, our study aims at deciphering the differences in investor sentiment transmission characteristics in the short term and long term through a composite set of methods. To figure it out, we select the Baidu search index for listed companies in China to represent investor sentiment. The Baidu search index is widely accepted to measure investor sentiment in the Chinese market [
18]. First, considering the multi-scale features of investor sentiment transmission, we apply the wavelet method to decompose and extract the multi-scale information of investor sentiment for every listed company. Second, we construct the investor sentiment transmission networks (ISTNs) in the short and long term based on the transfer entropy among the investor sentiment for different listed companies. The ISTN takes into account the interactiveness of investor sentiment transmission not only from close neighbours but also from the whole network, while the transfer entropy method differentiates well between the ‘driving’ nodes and the ‘responding’ nodes, and is then able to measure the influences that one variable can have on another [
19]. Third, we analyse and discuss the investor sentiment transmission characteristics in the short and long term based on network analysis to reveal how the investor sentiment transmit among listed companies on different scales and what the differences are in transmission characteristics between the short term and long term. The results exhibit the following main findings: (1) the transmission of investor sentiment presents more active in the short term and takes place in a local range; (2) the transmission of investor sentiment presents patterns of continuity and growth from short term to long term; (3) the transmission patterns of investor sentiment will have specific evolutions from short term to long term.
Our study contributes to the investor sentiment transmission literature in the following aspects. We apply a composite set of methods combining the wavelet, transfer entropy and complex network analysis. These methods are selected and integrated based on the multi-scale, complex interactiveness and directionality of investor sentiment transmission. This composite method can help to build a realistic transmission network of investor sentiment and uncover the local and network effect at multi-frequencies. In addition, this study is one of the few exploring the transmission patterns and characteristics of firm-specific investor sentiment. As far as we know, literature mainly focuses on the relationships between investor sentiment and asset returns and ignores that investor sentiment itself is intrinsically a complex information interactive system. Our findings can provide strategic suggestions for heterogeneous investors and market regulators on paying attention to specific investor sentiment according to short-term or long-term objectives.
The rest of this paper is organized as follows:
Section 2 is the review of related works.
Section 3 introduces and describes the data and methods applied. In
Section 4, the experiment results and corresponding discussion are presented. In
Section 5, the conclusion, strategic suggestions, and research perspectives are presented.
4. Empirical Results and Discussion
Our study applied composite methods of wavelet, transfer entropy, and complex network to investigate the investor sentiment transmission at multiple temporal scales. According to the above steps of methodology and selections of network statistics, this paper obtains the daily data of the Baidu search index for 189 stocks comprising the new energy vehicle sector of China for 7 years from 2015 to 2021, and constructs a set of ISTNs. Each year, multi-scale extraction is carried out, respectively. We obtained the ISTNs at 5 different temporal scales according to the length of the time scale. In order to highlight the differences in time scales, this paper compares and shows the results on the short and long scales. The short scale represents a time scale of 1–2 days and the long scale represents a time scale of 16 days and longer. Our discussion is based on the ISTNs at high and low frequencies for each year from 2015 to 2021 and the corresponding network statistics.
4.1. Overall Network Structural Evolution
Figure 3 shows the overall network structural evolution from 2015 to 2021. We include four network statistics in the discussion, i.e., the sum of edge weights, density, average weighted clustering coefficient, and core number. Blue lines represent the structural evolution trend at high frequencies, while orange lines are at low frequencies.
Figure 3A shows the total edge weight of the ISTN at high and low frequencies. The total of edge weights represents the total amount of information transmission in the network, and the activity of transmission relationships or transmission behaviours in the ISTN. The blue line in the figure is higher than the yellow line as a whole, indicating that the investor sentiment transmission is more active at high frequencies.
Figure 3B shows the network density of the ISTN at high and low frequencies. The network density indicates the overall closeness of the nodes in the network measured by the fraction of existing edges to total potential edges. In ISTN, it indicates the overall closeness of the investor transmission relationships among firms. Generally, the overall closeness of investor transmission relationships under high frequency is higher. It further supports the results in
Figure 3A that investor sentiment propagates more actively and frequently in the short term than in the long term.
Figure 3C shows the average weighted clustering coefficient of the ISTN at high and low frequencies. The average weighted clustering coefficient indicates the degree of interconnection between the neighbours of a node, which indicates the degree of closeness of local transmission relationships in the ISTN. First of all, it shows that the local transmission is stronger at high frequencies. Although we observe that the transmission of the entire network is stronger at high frequencies, it exhibits that the closeness of local transmission is larger than that of the entire network. In other words, short-term or irrational investor sentiment is easier to spread locally (within a small range).
Figure 3D shows the core number of the ISTNs at high and low frequencies. The core number represents the depth of network space or the order of the hierarchical structure (number of hierarchical levels) and the order or depth of the transmission relationships in the ISTN. As shown in the figure, the transmission path of investor sentiment at high frequency is deeper and farther, and the hierarchical structure of transmission is more obvious, which presents a stronger ‘pyramid tower’ structure. In other words, at a high frequency, the short-term or irrational investor sentiment of a firm at the top of the ‘pyramid tower’ needs to span longer hierarchical levels in order to finally affect the bottom of the tower. At low frequencies, there are fewer hierarchical levels so it is easier for the long-term rational emotions at the top of the tower to finally be transmitted to the bottom of the tower. In other words, in the short term, investor sentiment toward a firm will not spread to the whole system but diffuse around a relatively small range of ‘neighbours’, while, in the long term, it will have a systemic influence on the whole network.
Figure 3E shows the betweenness of the ISTNs at high and low frequencies. Betweenness measures the number of shortest pathways connecting any two nodes that pass through a certain node. In other words, betweenness is a way to detect the amount of influence a node has over the flow of information or investor sentiment in the network. When a node’s betweenness is high, it means that this node is an ‘intermediary’ in the ISTN. An ‘intermediary’ in the ISTN can serve as the bridge of information and investor sentiment, which can affect and even controls the information and investor sentiment from the sender nodes. As shown in the figure, the blue curve (high frequency) is above the yellow curve (low frequency) as a whole, which implies that the number of information intermediaries is more at high frequency than that at low frequency. Investor sentiment is distributed and transmitted through those intermediaries so that a firm’s investor sentiment can have higher possibilities to be transmitted to the whole network in the short term. This result reflects similar characteristics of investor sentiment transmission to the results of total edge weights that the transmission of investor sentiment is more active at high frequency.
Figure 3F shows the closeness of the ISTNs at high and low frequencies. Closeness is estimated by the reciprocal of the total length of the shortest paths between a certain node and all other nodes in the network. The larger the closeness of a node is, the closer it is to all other nodes and the more central it is in the network. This metric reflects the central position of a node in the whole network. As shown in the figure, the yellow curve (low frequency) is above the blue curve (high frequency) as a whole, which implies that at low frequency, the entities of ISTN are closer to each other on average. In other words, in the long term, generally, a firm’s investor sentiment can influence other firms’ investor sentiment through a relatively short path. This result is similar to that of the clustering coefficient that long-term sentiment is easier to spread systemically (within a wide range).
The above results have indicated the following findings. First, the transmission of investor sentiment is more active in the short term. Second, the investor sentiment toward a firm is prone to transmit locally in the short term and systemically in the long run. As demonstrated in the study of McClure et al. [
50], passions or irrational emotions can dominate in short-run decision-making and rationality takes over in long-run decision-making. Yang and Gao [
6] also indicate that irrationality has more significant influences on short-term decision-making. With the passage of time, more information related to the price and performance of firms is disclosed, and the influence of longer-term investor sentiment will become weaker than that of shorter-term investor sentiment. It can be inferred from our results that the transmission of irrational sentiment is more active in the short term, and becomes gradually weaker in the transmission process and, thus, is limited to a relatively local range. Thus, investors should pay more attention to the irrational sentiment if they are short-term arbitrageurs and more to the rational sentiment if they pursue long-term value investment [
8].
4.2. Comparative Analysis of High-Frequency and Low-Frequency ISTNs
The above analysis has shown the network structural evolution at multi-scales (short term and long term) of investor sentiment transmission from 2015 to 2021 and has indicated the differences in sentiment transmission between the short and long term. To further explore the relations and differences between sentiment transmission in the short term and long term, we then have a comparative analysis from the level of nodes and edges, respectively.
From a node’s perspective, we can see whether the ‘celebrity’ of the sentiment transmission at high frequencies is also a ‘celebrity’ at low frequencies (shown in
Figure 4). In ISTN, ‘celebrity’ refers to the node or firm with more transmission relationships [
51].
Figure 4A identifies ‘celebrities’ by the order of centrality measured by degree among all nodes, and
Figure 4B uses the weighted centrality measured by weighted degree. In
Figure 4A, the
x-axis is the rank of the node’s degree at high-frequency. The smaller the value of rank is, the larger the degree of the node is. The
y-axis is the year from 2015 to 2021. The coloured area shows the rank of the node’s degree at low frequency. When a node with a high rank of degree at high-frequency is coloured in red, it means that this node also ranks high at low frequency. In
Figure 4B, the rank is based on the node’s weighted degree (strength). When a node with a high rank of strength at a high frequency is coloured in red, it means that this node also ranks high at a low frequency.
Figure 4A,B both reflect the comparison of the ‘celebrity effect’ between high and low frequencies, while
Figure 4A focuses on the transmission range of a celebrity’s sentiment and
Figure 4B focuses on the transmission strength of a celebrity’s sentiment. Although both reflect the characteristics of ‘celebrities’, the emphasis is different. The degree centrality reflects the number of transmission relationships or influence range of ‘celebrities’, while the weighted degree centrality reflects the intensity of transmission or influence strengths of ‘celebrities’. A ‘celebrity’ with a wide range of influence does not necessarily have a strong influence strength. For example, ‘celebrity
A’ is known to many people, i.e.,
A has a wide range of influence, but people only have an ordinary or weak impression of
A, that is,
A’s influence strength is not high.
From
Figure 4A,B, we can see that when a node is a ‘celebrity’ at high frequencies, it is also a ‘celebrity’ at low frequencies. However, the relationships reflected in
Figure 4B are more explicit and strong than that in
Figure 4A. In other words, nodes with greater transmission intensity at high frequencies also have greater influence strength at low frequencies. In the context of investor sentiment transmission, influence strength is expressed as the intensity or strength of the transmission relationship. The stronger the influence of firm
A has, the stronger the transmission relationship of firm
A is, which means the investor sentiment of firm
A has a stronger impact on other firms.
Figure 5 discusses similar phenomena from an edge’s perspective.
Figure 5A shows the ratio of overlapping investor sentiment transmission relationships at high and low frequencies, where the
x-axis is the year from 2015 to 2021. The blue curve is the fitted trend when treating the network as a directed one while the orange curve is the fitted trend when treating the network as an undirected one, and the shaded areas under curves are confidence intervals at 95%. It can be observed that whether the direction of the transmission relationship is distinguished or not, the overlapping rate of the transmission relationships is about 46% to 54%, that is, about half of the transmission relationships exist at both high-frequency and low-frequency.
Figure 5B shows whether the strong transmission relationship at high frequency is also strong at low frequency. In
Figure 5B, the rank is based on the edge’s strengths or edge weights. When an edge with a high rank of strength at a high frequency is coloured in red, it means that this edge also ranks high at a low frequency. In general, a strong transmission relationship at high frequency is also strong at low frequency. However, at high-frequency, the transmission relationships ranking in the last quarter show strong transmission strengths at low frequencies. It appears that some of the weak transmission relationships at high frequencies grow into strong relationships at low frequencies. Combining the results of
Figure 5A that half of the transmission relationships exist both at low and high frequencies, we can infer that most of the transmissions have consistent strengths between the situations at high and low frequencies, which reflects the continuity of investor sentiment transmission. In addition, some of the transmissions have increasing strengths from short term to long term, which reflects the growth of investor sentiment transmission. It is plausible that the sentiments with growing strengths of transmission from short term to long term are those reflecting rational emotions and attitudes. While irrational emotions induced by rumours and scandals have strong transmissions among firms in the short term, rational emotions can make deviated sentiments return to normal levels. This process of rationality-reversion is implicit in the short term but is explicitly observed in the long term.
4.3. Analysis of ISTN with Motifs
Network motifs, in the context of sentiment transmission, reflect the sentiment transmission activities among firms. Thus, looking into the distribution of network motifs may provide a deep insight into the function of the entire network in terms of sentiment transmission.
Figure 6 is the collection of network motifs.
Figure 7 shows the results of motif distribution at high and low frequencies.
Figure 7A shows the distribution of each type of motif at high frequency. Wherein, the main motif is type
. Considering the potential functions of motifs in the context of investor sentiment transmission, this paper believes that type
reflects a one-way (uni-directional) circulation of investor sentiment transmission. This motif shows that at high frequencies, investor sentiment transmission has strong causality and recursiveness.
Figure 7B shows the distribution of various motifs at low frequencies. Among them, the main motifs are type
,
, and
.
and
reflect a multi-directional transmission structure driven by a core node. In
, the driving node acts as the source of the sentiment transmission. These motifs show that at low frequencies, investor sentiment transmission is more flexible than that at high frequencies and exhibits some reciprocity which is driven by the transmission source. When sentiment transmission of the network is driven or initiated by several source firms, these source firms should be paid particular attention to. Thus, in the long run, we need to detect and monitor those driving firms of sentiment transmission. Their sentiments will have a larger influence on the entire network compared to other firms.
Figure 8 shows how the motifs in the short term change to another type in the long term. First, we observed that the evolution maintains high stability and consistency in different years (between 2015 and 2021), and forms several main evolution patterns from high to low frequencies. Taking 2015 as an example, it is mainly reflected in the evolution of the following motif pairs:
,
,
, and
. The former is the motif manifested at high frequencies, and the latter is at low frequencies. The pair reflects the evolution of motifs from short term to long term. These four motif pairs show the following two patterns (we do not discuss
since it only exhibits a mere reduction in random transmission with little information displayed).
(1) : From the outdegree structure at high frequency to the in-degree structure at low frequency. In the context of investor sentiment transmission, this evolution means that in the short term, the node that is easy to transmit its sentiment outward gradually becomes susceptible to other nodes in the long term. From another aspect, firms that are prone to transmit their irrational emotions to the outside in the short term will become more susceptible to the rational emotions of other firms in the long term. It is plausible that in the short term, irrational emotions toward a firm spread easily to other firms, but with the passage of time, affected firms will feed back rational emotions to the original source of irrational emotions through stable business performance and disclosure of favourable information.
(2) and : From the cyclic structure at high frequency to the core node driving structure at low frequency. This evolution means that in the short term, the transmissions are diverse and active. With the enlarging of the time scale, some active transmission relationships gradually become silent and eventually form some dominant nodes. These dominant nodes play a driving role in the sentiment transmissions, that is, the emotions of other nodes need to pass through this driving node to further affect another node. In the short term, irrational investor sentiment tends to be disorderly transmitted and diffused. With the extension of time, the market no longer pays attention to and recognizes some irrational investor sentiment transmission relationships, but forms attention to some driving firms, which makes rational investor sentiment be transmitted orderly in the long run.
5. Conclusions and Perspectives
The aim of our study is to investigate the multi-scale transmission of investor sentiment based on complex network analysis. As investor sentiment transmission is drastically influential toward financial markets, prior studies do not reach a consensus in terms of the diverse and complex transmission patterns of investor sentiment on multi-scales. In our study, based on the complex interactiveness and multiple temporal scales (referred to as term structure otherwise) of investor sentiment transmission, we proposed a composite set of methods to explore the transmission patterns and characteristics of investor sentiment among firms.
After taking the Baidu search index data as the proxy for the investor sentiment toward firms, we use the wavelet method to extract multi-scale transmission information and adopt transfer entropy to measure the directional and non-linear sentiment transmissions among firms and construct investor sentiment transmission networks at high and low frequencies for each year from 2015 to 2021. By conducting complex network analysis, we find that the transmission of investor sentiment presents as more active in the short term and takes place in a local range. As irrational emotions dominate in short-term decision-making and rationality takes over for long-term decision-making, we suggest that investors pay more attention to the irrational sentiment factors if trading as short-term arbitrageurs and focus on rational sentiment factors as long-term value investors. On the other hand, we find that the transmission of investor sentiment presents patterns of continuity and growth from short term to long term. It suggests that the sentiment of some firms will have a growing influence in the long term despite that it is less perceived in the short term. Additionally, it is plausible that the growing sentiment of those firms will be a determining factor for macro economies and long-term investors. In addition, we also find that from short term to long term, the transmission patterns of investor sentiment will have specific evolution. It suggests that a firm’s emotions spread easily to other firms in the short term, but on a longer scale, affected firms will then have feedback on the previous source of emotions through stable performance and disclosure of favourable information. In addition, in the short term, investor sentiment is prone to be disorderly or diversely transmitted and diffused. While on a long scale, the market pays less attention to some irrational investor sentiment but more to the sentiment of some driving firms, which makes investor sentiment be transmitted orderly in the long term. It suggests that when investors pursue long-term benefits, they should detect those driving firms and have particular concentration on them. As for market regulators, those driving firms play an important role in the long-term stability of markets and economies. Regulators should monitor the investor sentiment toward those driving firms, which will provide a warning signal for systemic risk.
Overall, our study explores the multi-scale sentiment transmission patterns and characteristics and proposed a composite set of methods based on the complex interactiveness, causality of transmission and multiple temporal scales. While the transmission of investor sentiment presents discrepancies in different market norms and constraints, applying the composite method in the present study to different markets and countries for comparative analysis is the subject of ongoing research. In addition, since the time range used in our study is only 7 years, from 2015 to 2021, the findings are limited. First, the findings may not be effective outside this time range. Second, the fitted trends of the network statistics may be unsolid. Thus, it is also one of the future works to incorporate a longer time range to consolidate the findings.