1. Social Impact Bonds and Social Uncertainty: Setting the Context
The purpose of this exploratory study is to realize a practical application of a model for social uncertainty evaluation in Social Impact Bonds (SIBs).
Since their introduction in the United Kingdom in 2010, SIBs have become popular in the academic, practitioner, and policy-maker world [
1,
2]. In the impact investing arena, SIBs are an interesting innovative financial mechanism [
3], that seem to promise a solution for financing complex social interventions [
4] and reallocate performance and financial risk from the public towards the private sector ([
5], p. 41). More in detail, SIBs have been defined as “
payment by results contracts that leverage private social investment to cover the up-front expenditure associated with welfare services” ([
6], p. 57). According to ([
7], p. 1), payment-by-results (PbR) is a scheme for “
delivering public services where the government (or the commissioner) pays providers for the outcome they achieve rather than the activities they deliver”.
A literature overview provided by ([
8], p. 72) discusses the main characteristics of PbR: (1) contingent payments depend on independent verification of results; (2) the PbR contracts should include both reward and penalties useful for achieving the outcomes; (3) risk transfer exists and depends on both the impact and the success of the project. SIBs are a form of PbR, but SIBs extend PbR by harnessing social investment from capital markets to cover (partly or fully) the costs of service intervention [
6].
Therefore, investor return depends on the impact of the project and on the achieving outcomes [
9]. The key point in the SIBs (and PbR) schemes is the outcome measurement, and, accordingly, the quality of the performance measure (see
Section 2.1).
On the theoretical level, several scholars note that the opportunities related to the SIB market development are vast and perhaps endless [
5], but, at the same time, challenges and obstacles are often highlighted [
10]. The complexity of the SIBs schemes is evident (see
Section 2.1): they are characterized by public–private partnerships [
1] between several counterparties, and each actor has different interests, goals, and expectations, and different risk attitudes and perceptions [
11].
On the empirical level, examining the evidence from the SIB market, it is worth noticing that the diffusion of SIBs is still modest. From the launch, in 2010, of the first pilot SIB in the UK (HMP Peterborough) to the end of October 2019, only just over 130 SIBs have been implemented worldwide [
12]. Among these, only 34 are closed. It is clear that the limited size of the SIB market clashes with the theoretically great potential of SIBs.
The risk (or uncertainty) issue represents one of the main controversial points related to the use of SIBs on a larger scale [
5,
13,
14] and, at the same time, it is still underexplored [
15,
16,
17]. It is clear that the complexity of the SIBs model increases the probability of SIBs failing. On the other hand, some scholars are underlying downsides and risk facets of SIB models (see
Section 2.1).
More generally, with regards to the impact investing field—of which SIBs are key components—some scholars are exploring, amongst other things, social risk [
18,
19,
20,
21,
22], impact risk [
23,
24,
25,
26], social uncertainty [
27,
28], and their linkages and differences [
27], by also using methods and approaches very different from those of traditional mainstream finance. Impact investing is considered a “revolutionary” way to improve sustainability. Impact investing is characterized by the explicit intention (intentionality), social purpose, and the ability to generate positive environmental and/or social impact in accordance with an appropriate risk-return investment [
11]. It is important to note that “intentionality” represents the linkage between ethics and facts, thus determining a consequentiality between (positive) values and value creation. These innovative approaches open new views for research on how finance should be “reconsidered” [
29,
30,
31], and also question the foundations of the mainstream finance [
32].
In order to shed light on these open problems, some works are focusing on SIBs’ practices to better understand risks (or uncertainty) related to SIBs’ contractual schemes and social and financial features [
1,
5,
16]. In the light of the social impact investing approaches, a constant focus on social risk (rather than on other risks) certainly favors a more effective social finance capital allocation and reminds us that the first purpose of social finance is to bring about maximum social change rather than to make money ([
33], p. 308).
To the best of our knowledge, only very few works, such as Scognamiglio et al. [
5,
22], have proposed a framework able to support the evaluation of an SIB, starting from a measurement of the social uncertainty (for a discussion of these previous works see
Section 3.1, in which we also explain several main alternative approaches to the impact evaluation). These works open new promising avenues for researches, useful to promote the development of SIB models, on large scale.
However, currently, regarding the SIBs, there are still no definite and effective models (already widely tested and used) capable of evaluating the social uncertainty. Starting from these works, we attempt to test an implemented model—which is an extension of a framework proposed by [
5]—capable of providing final scores related to social uncertainty evaluation of the overall population of SIBs closed worldwide at the end of October 2019. As far as we are aware, this is the first analysis using all of the closed extant SIBs. Our work offers some preliminary but significant results on social uncertainty, which is one of the most relevant and underexplored issues of SIBs. In addition, it proposes a quantitative measure of several qualitative key elements related to the SIB Programs, providing useful suggestions for investors and other stakeholders. Then, it realizes an in-depth comparison of several SIB programs and an insight of market practices. Finally, this pilot study is a retrospective analysis and represents a first step towards implementing a prediction model for social uncertainty evaluation.
In our work, we use the following definition of social uncertainty provided by ([
5], p. 42): “
most definitions interpret social uncertainty as the risk of not reaching the intended impact [18] or as the likelihood that a given allocation of capital will generate the expected social outcomes irrespective of any financial returns or losses [27]. Despite this heterogeneity of definitions, social uncertainty could be considered more generally as a concept providing an indication of the certainty that an output will lead to the stated impact [
26].”
The choice to focus on uncertainty (rather than risk) deserves some clarification.
Knight [
34] established a clear difference between the concepts of “risk”and “uncertainty”. The famous phrase “
uncertainty is an unknown risk, whilst risk is a measurable uncertainty” underlines the fact that risk refers to situations that can be quantitatively measured through a distinct probability distribution; on the other hand, uncertainty refers to those events for which there is not enough knowledge to identify objective probabilities. Therefore, the revolutionary works of Knight shed light on different facets of concepts of uncertainty and show the limitedness of the mechanical analogy in understanding enterprise and profit ([
35], p. 458). In this vein, Knight’s view is a challenge to neoclassical theory [
35]: uncertainty arises out of partial knowledge ([
35], p. 459) and this concept represents the turning point for advancing in psychological decision theory, such as a host of psychological insights beyond the risk–uncertainty distinction ([
36], p. 458). In more detail, Knight described features of risky choices that were to become the key components of prospect theory (Kahneman and Tversky [
37]): the reference-dependent valuation of outcomes and the non-linear weighting of probabilities.
Based on these considerations, Knight’s view is a useful starting point for our exploratory analysis: risk is objective since it represents a variable independently of the individual; uncertainty is, on the contrary, subjective, because it depends on the individual, their information, and their temperament. Hence, a decision under the conditions of uncertainty has a higher impact than one made under the conditions of risk, both for the decision maker and for the context in which they operate. The tools used to analyze risk and uncertainty are different, too. For risk statistical techniques are used, whilst for uncertainty heuristic ones are used [
38].
From this point of view, our application appears particularly interesting, because its focus is on social uncertainty (rather than on social risk), following Knight’s view: it overcomes the limitations of the traditional finance paradigm (and of its mechanistic models) by leading towards more complete foundations for building momentum for systemic change. In addition, our analysis uses a modified Analytic Hierarchic Process (AHP) that consists of combining any leaf with the hierarchy tree; typical of such a multi-criteria method, indicators with appropriate measuring scales are capable of describing the characteristics of the problem. Compared to the previous methods [
5,
39,
40], our implemented technique offers the advantage of obtaining more flexible adjustments and implementations in terms of levels of investigation, indicators, and scales. These characteristics seem to be particularly useful for better understanding SIB model dynamics and peculiarities (see
Section 2.1 and
Section 3.1).
3. Method, Model Description and Data Collection
3.1. Methodological Approach
In this work we attempt to measure the uncertainty of SIBs by combining a multi-criteria decision method with a process for constructing composite indicators. We test a model that is an extension and an improvement of [
5]. In [
5], social uncertainty evaluation in SIBs was hierarchically explained involving 3 main categories, 5 factors and 16 sub-factors. Scognamiglio et al. [
5], in turn, referred to Serrano-Cinca and Gutierrez-Nieto [
40], where the goodness of investment in Social Venture Capital (SVC) was evaluated through a model hierarchically structured in 3 principal factors, 26 criteria, and 160 indicators, as suggested by a small group of SVC’s analysts and academics. Furthermore, the same structure of model in [
5] was used in Scognamiglio et al. [
22] to explore the relationship between social risk and financial return within the context of SIBs.
Starting from these previous works, our model uses the multi-criteria decision method, the aggregation techniques, and the measuring scales in a different manner.
Among the different multi-criteria decision methods, AHP is suitable for its versatility in different fields (see Saaty [
39]) and for solving complex problems after disassembling the phenomenon into criteria (e.g., categories, factors, and sub-factors) and assigning a priority to each of them.
In order to the design the model, an SIBs uncertainty score is given using an aggregation process where the elements, similarly to [
5] in type, number, and position, are placed over a three-level tree diagram. At the first level there are three main categories; at the second level, five factors are divided into two groups, and two indicators are linked to an upper category; and at the third level, 14 indicators are grouped around the upper factors.
Figure 2 shows the diagram tree used in detail.
In our study, despite [
40], AHP is applied only for the first two levels of the diagram tree (categories and factors) and, differently from [
5], the same is separately repeated for groups of the same level. The computation of weights for the different levels allows us to obtain a final scoring of immediate interpretation without using the comparisons typical of AHP. Furthermore, the computation of weight for different groups reduces the error in terms of consistent judgments (see
Appendix B.1).
The explorative nature of our analysis and the absence of a large dataset required the use of alternative statistical techniques (with respect to the classical ones such as factorial analysis, cluster analysis, etc.). In addition, it is worth emphasizing that these techniques need strong assumptions not available in our context (see, e.g., normality conditions).
Our analysis also realized pilot interviews for scholars with extensive experience in the SIBs field (via questionnaire) in order to compare all pairwise elements belonging to the same hierarchical level and concerning the common upper aspect.
Table 3 presents the experts’ pairwise comparisons among the criteria and the weights of each criteria, in accordance with
Figure 2.
In particular, the comparisons were made by the experts for groups of the same level, using the 9-point Saaty scale (in which 1 = “two criteria are equally important”and 9 = “the first criterion is absolutely more important than the second”, 3, 5, 7 denote respectively slight, medium, and strong importance, while
,
,
, and
assume the reciprocal sense). The results are the entries of the so-called Pair Comparison Matrices (PCMs) (see the matrices in columns 3, 5, and 7 of
Table 3). Each PCM then produces the priorities of the criteria involved (see the percentages in columns 4, 6, and 8 of
Table 3). Thus, for example, if we consider how the expert #R1 evaluates the categories, then her PCM can be read: “Program Process”and “Contractual Condition” contribute equally to uncertainty (value = 1); while both “Program Process”and “Contractual Condition” are strongly more uncertain than “Evaluation” (value = 5); and by contrast, “Evaluation” is strongly more certain (value =
) both than “Program Process”and “Contractual Condition”. By this PCM, the priorities of “Program Process”, “Contractual Condition” and “Evaluation” are 45.5%, 45.5%, and 9.1%, respectively.
The composite indicators have been constructed by using the min-max normalization procedure with an objective weighting method and a weighted mean as the aggregation function.
In particular, the min-max normalization method (see [
54],
Table 3, p. 30) is a linear transformation that scales the data between 0 and 1 after subtracting the minimum value and successively scaling the data to the range of the collected data. This type of normalization allows us to free all indicators from their unit of measurement and to mark those unnecessary, which will hence be deleted from the next computation.
The weighting procedure restitutes an objective value proportional to the length of the measuring scale used and near to the ideal one. In fact, despite [
5], a wider 5-point Likert scale (1 = randomized control Trial, 2 = quasi-experimental, 3 = validate administrative data, 4 = historical comparison, and 5 = unavailable) is used to measure the indicator “Evaluation methodology”. In this case, the additional and better specified attributes (one more than [
5] and with a well-balanced scale as suggested by Go LAB) allow us partially to offset the structural gap among categories. In particular, with regard to the distribution of indicators (12.5% in “Evaluation”, 31.3% in “Contractual conditions” and 56.3% in “Program process”), “Evaluation methodology” has 66.7% priority in its category and 6.67% more than its corresponding priority in [
5]. This positive variation multiplied by the difference of the scores in “Evaluation“ contributes to improving our model. Moreover, involving the indicators with the longest scales (“Evaluation methodology” and “Number of Outcomes”), this variation is fundamental for the “Evaluation” category and, more in general, provides a better basis for carrying out the uncertainty.
In conclusion, the SIBs uncertainty is a number ranging from 0 to 1 (with 0 = absolute certainty and 1 = absolute uncertainty).
3.2. The Model
We start from the diagram tree shown in
Figure 2 to explain the model. We use the following graphic symbols to specify the position of each element involved and its relationships with others:
- -
ℓ := level;
- -
:= elements (categories, factors, indicators) in the level ℓ;
- -
:= compositions of the elements in the level ℓ;
- -
k:= label of the group of elements shared by a characteristic;
- -
:= group of elements shared by a characteristic in the level ℓ.
The final score is intended as the result of the following phases:
- Phase 1 or Preparation of data.
For every SIBs i, transform the information relative to any indicator n placed at the level ℓ inside the group , in a number taking into account the measuring scale adopted in terms of uncertainty (e.g., on a 3-point Likert scale 1 is the best value and 3 the worst value). Normalize all indicators’ scores in pure numbers over a ratio scale to homogenize the different ranges.
- Phase 2 or Priorities of categories and factors.
Compare all the elements that refer to the same aspect pairwise (categories with categories, factors with factors). Build thus the Pairwise Comparison Matrices and, after proving the sufficient consistency for each of them, estimate the weights using the principal eigenvector method. The ranking thus obtained will be used only when the procedure arrives at the level where these elements are.
- Phase 3 or Total aggregation.
Aggregate the scores of the indicators which refer to the same aspect to obtain the composite indicators:
Continue the aggregative process, going up to the next level, while taking into account the weights previously computed by the related PCM. When the first level (goal) is achieved, the process is ended and the final score thus obtained for aggregation gives us the measure of uncertainty:
3.3. Sample and Data Collection
For this pilot analysis, a practical application was tested by using a “population” of SIBs (see
Appendix A). The sample was not randomly selected. In more detail, SIBs must be closed (as of the end of October 2019), and they have to comply with the requirements of the transparency and verifiability of information.
To evaluate the model (specified in the previous sections) and its usefulness, we tested it by using data collected from 34 SIBs. More in detail, we tested the model using two types of data: the objective publicly available data (through the Government Outcomes Lab—GO Lab—database and other sources) relatively to scales and units of indicators, and the subjective data through a preliminary survey (via questionnaire) to obtain the priorities among factors and categories of our factor tree.
Two junior researchers separately selected publicly available material on online databases and publicly available material from other sources. Subsequently, senior researchers checked the information drawn up and created a protocol to use relevant information for analysis, ensuring research quality and rigor.
With regard to the survey, we selected the experts using the following criteria:
- Criterion 1:
the experts on SIBs are selected within those who authored or co-authored peer-reviewed chapters of books and books on the topic area;
- Criterion 2:
the experts have published at least one research article evaluating and/or measuring aspects of SIBs in prestigious international journals.
In August and September 2019, we found through Google Scholar (using search terms in line with Criteria 1 and 2) a total of 26 potential works in line with Criterion 1 and only 10 in line with Criterion 2. The experts were selected if they met at minimum one of the criteria. Finally, 3 experts were selected (of which 11.5 % are in line with the first criterion and 33.3% with all criteria). From a statistical point of view, the size of this sample guarantees a standard error of with a 99.5% confidence level on the reliability of the prioritizing method.
Furthermore, the experts were also consulted to refine the structure of the diagram tree.
3.4. The Sample: A Focus on Key Characteristics
We have analyzed for each of the 34 completed SIBs the following key characteristics:
Duration: length of providing services (expressed in years);
Financial resources:
- -
Max outcome payments: total outcome payments (expressed in home country currency);
- -
Capital raised: capital invested by the investors (expressed in home country currency);
Service provider: a private for-profit or not-for-profit organization to deliver social services;
Number of outcomes: number of outcomes defined by the project;
Target population: people to which an outcome is targeted;
Intermediaries: subjects involved to coordinate investor(s), service provider(s) and evaluator(s).
We assigned a number (from 1 to 34) for each of the 34 SIBs analyzed in our work in order to simplify the elaboration and presentation of our study (see
Appendix A).
These 34 completed SIBs at the end of October 2019, according to the GO Lab Database, represent about 25% of the total number of SIBs launched worldwide.
With regard to our “population”, we have analyzed in depth:
All types of SIBs structures (direct, managed, and intermediated, in accordance with the Goodall approach [
55]);
All categories of outcomes evaluation methodology (randomized control trial, quasi-experimental, validate administrative data, and historical comparison in accordance with Gustafsson-Wright et al. [
56]);
The main service providers involved in two or more SIBs, in accordance with Social Finance UK (Buzinezzclub, Thames Reach, Core Assets, Career Connect, Teens and Toddlers, Action for Children);
The most important intermediaries specialized and involved in SIBs, such as Social Finance UK and Triodos Bank UK, in accordance with Calderini et al. [
57];
All financial aspects;
All outcomes achieved.
For detailed information about these key characteristics of the SIBs analyzed in our model see
Table A2 in
Appendix A.
Despite this, our “population” shows some limitations regarding “capital raised” and “issue area”.
Analyzing the “capital raised”, our “sample” has reached about $50 million compared to $440 million. These 34 completed SIBs show, on average, a low amount of “capital raised”.
In accordance with Social Finance UK, the major investments have been made in United States since 2013, seeing that 26 projects were launched that correspond to $219 million of “capital raised”. The projects that were launched covered vast territories and/or a high number of people. These big projects are still being implemented and should be finished within the next few years. Only one of these big projects (NYC Adolescent Behavioral Learning Experience Project for Incarcerated Youth) has been concluded and was included in our “population”.
Comparing the characteristics of the “issue area” between our sample and all of the SIBs launched around the world, we noted that our “population” lacked two new important issue areas, in accordance with the categories defined by Social Finance: “health” and “poverty and environment”.
SIBs for the health sector are called Health Impact Bonds (HIBs) and they represent a new method of financing the health sector, as observed by Rizzello et al. [
8], which has always been characterized by financial deficit. These HIBs have developed in many parts of the world since 2015, not only in developing countries but also in industrialized countries. Currently, there are 22 HIBs launched in many countries all over the world. These projects are still being implemented according to the GO Lab database, and will be concluded within a few years.
There were three SIBs addressed to a “Poverty and environment” issue launched between 2016–2019. These projects were launched in the USA, The Netherlands, and in Uganda and will be concluded in the next few years. Currently, these types of SIBs are in the implementation phase, according to the GO Lab database.
In our sample, most of the SIBs analyzed were concerned with “Workforce Development” and “Homelessness” social issues (see
Table 2).
For an overview in terms of the “socio-demographic” data of the sample with a focus on the investors involved in each SIB project, see
Table A1 in
Appendix A.1.
5. Discussion of Results
The results obtained in the analysis highlight the weakness of the factors tree initially developed by Scognamiglio et al. [
5]. The analysis revealed the presence of a high number of variables that are not able to capture uncertainty in an SIB project.
Nevertheless, the comparison between our uncertainty scores and the main characteristics of failed SIBs let us detect the need for further variables. Among closed SIBs, the “NYC Adolescent Behavioral Learning Experience Project for Incarcerated Youth” has been considered by two respondents as an SIB with a lower level of uncertainty. The project was launched by Goldman Sachs Bank’s Urban Investment Group (UIG) that announced a $9.6 million loan to support the delivery of the program to a predefined target population. The program was terminated after three years due to the lack of results. Similarly, the SIB “HMP Peterborough (The One Service)” was discontinued due to the “Transforming Rehabilitation” reforms to probation in the United Kingdom.
The “NYC Adolescent Behavioral Learning Experience Project for Incarcerated Youth” was the first SIB launched in the US and was guaranteed by Bloomberg Philanthropies, substantially reducing the downside risk. Conversely, the SIB “HMP Peterborough (The One Service)” was the first SIB launched worldwide and the first attempt to improve a funding mechanism for reducing recidivism among prisoners.
The ex-post analysis of these two SIBs led us to understand that further variables are required in order to capture uncertainty. In particular, further variables should consider the presence of guarantee schemes, the type of social issue addressed, and the voluntary or involuntary engagement of the target population by focusing not only on the contractual conditions of the SIB project but also on the “financial” and “social issue” characteristics.
Moreover, our analysis starts from the definition provided by Nicholls and Tomkinson [
27], in which social uncertainty captures endogenous (e.g., organizational/responsiveness) and exogenous (e.g., political, economic, social, technological, and environmental) factors and in which both are negatively correlated to social return. In the case of SIBs, the higher the risk of change in the context in which the social program is delivered, the lower the social return that can be achieved. If the context of a social program changes—as in the case of Peterborough—it is more suitable to disconnect the entire program without further efforts by renouncing the possibility to achieve a lower level of social return (with the related reputational effects). Similarly, future projects in which there is a likelihood of contextual change will fail due to the fewer incentives related to their implementation. In the same way, in the context of endogenous factors, the likelihood that contracts or staff achieve less than optimal results will lead to changes or the termination of the contract before detrimental outcomes (including reputational ones).
In conclusion, the fact that no complete information about the remaining 32 closed SIBs is available does not help us to understand if our score is effectively able to capture their level of uncertainty or if their risk management processes have been able to capture the manifestation of their potential uncertainty factors early. Nevertheless, our analysis reveals that in order to capture the overall uncertainty related to an SIB, further risk factors should be considered by distinguishing among those strictly related to the contractual characteristics of the project (e.g., financial and social issues) and those related to endogenous/exogenous factors that have been not explored before.
These preliminary findings open new avenues for future research in the field of uncertainty and risk factors in the social impact bond landscape.
6. Concluding Remarks, Implications, and Future Research Directions
Our analysis focussed on social uncertainty, meant as the possibility that an SIB could not reach the intended impact or generate the expected social outcomes. The study was conducted in order to apply the model previously developed by Scognamiglio et al. [
5] in a practical context by trying to improve it and to understand if and how it was able to reveal uncertainty in SIB projects. By using an “ex-post”approach and the population of closed SIBs, we performed a retrospective test of social uncertainty by considering a given set of variables, factors, and indicators. The results reveal a high number of non-relevant indicators. Further analyses are required in order to transform this retrospective approach into a predictive—and thus standardized—approach useful to evaluate ex-ante the uncertainty level of each new SIB project proposed.
Moreover, while several scholars had pointed out the main characteristics of SIBs, their enabling factors and the main opportunities that come from their implementations, it had remained unclear how to evaluate their level of uncertainty. We have attempted to address this knowledge gap through this work. At the same time, this work contributes to the field of alternative methodological approaches for sustainable finance. Under this perspective, we used an alternative methodological approach that cannot be brought back to the methods of mainstream finance in order to estimate the ability of SIBs to achieve outcomes. This implies not only a methodological shift related to the use of the AHP method—typically used in the social sciences—but also in the conceptual foundation of finance research typically oriented towards a more quantitative aspect and less interested in social purposes.
From a more practical point of view, this work could potentially contribute to the development of the entire SIB market by allowing both practitioners and policy makers to understand the areas from which uncertainty arises and thus helping the development of more focused risk management practices and more standardized SIBs schemes.
Although the findings are encouraging, there are a few limitations that need to be considered when interpreting the results of this study. The first limitation is related to the number of experts interviewed. More respondents would have allowed us to get more information and to achieve more comparisons by allowing us to refine the initial factor tree further. However, considering the exploratory nature of this study and especially our initial aims—testing the uncertainty model in a retrospective manner—this could not be considered as a point of weakness: some non-relevant indicators have been detected as needing further refinements. Future development of the model should consider this aspect as a starting point by considering the opportunity to improve the sample of respondents, and mixing qualitative and quantitative comparisons would help to clarify the validity of the model, also supporting the development of a more fine-tuned weighting scale to reduce subjective biases. The second limitation refers to the fact that we did not consider any interactions between indicators. The simultaneous presence of several elements could potentially reduce or increase the overall level of uncertainty. Finally, our analysis does not consider the presence of a “residual risk” that the model could not be able to capture and the possibility that uncertainty could not be explained without considering aspects such as opportunistic behavior and information asymmetries among the involved parties.