1. Introduction
We analyze optimal risk sharing in society at large. We consider a one-period model of uncertainty where Pareto optimal risk sharing, or equivalently, optimal consumption is characterized. The article is primarily a review paper, where the possible originality is in the presentation and composition of the various subject matters, in some of the proofs, in extensions, and within various applications that are pointed out.
We discuss the no-arbitrage question and the associated state prices, both issues important in the financial literature, and also in optimal risk sharing. Here we use the most basic framework, and this material can be found in an appendix. We characterize Pareto optimality in
Section 3, with examples. A competitive equilibrium is a special Pareto optimal sharing rule, which we treat in
Section 4, also with examples.
In
Section 5, we work with syndicates, where a group of individuals must make a common decision under uncertainty that will result in a payoff to be shared jointly among the members. Here we present conditions under which the syndicate behaves as a Savage rational decision maker. This was first treated in [
1] as an extension of the general model by [
2].
In this section we review conditions under which any member of the syndicate can be relegated the task of making decisions under certainty on behalf of the group. This problem is considered both when the members of the group have homogeneous beliefs, and when the probabilities are heterogeneous. Interestingly enough, even in the latter case it is possible to find some common ground.
An illustration is given of unanimity within a group via an example of optimal diversification, where some classical results are recovered. Existence of an evaluation measure of a syndicate is of importance in this theory.
The framework may be used to study risk sharing at various levels in society, like the risk sharing problem in mutual insurance companies, reinsurance markets, or at the state level of a given nation, and also between nations via international organizations.
The paper ends with an application of the theory of syndicates to financial markets and general equilibrium.
2. The Basic Risk-Exchange Model
In this section we study the following basic one period model having two time points, 0 and 1. Let be a group of N agents having preferences over a suitable set of random variables, or gambles with realizations (outcomes) in some subset , . Here . This preference relation is a binary relation on the relevant set of probability distributions that is transitive and complete, and that satisfies the von Neumann–Morgenstern axioms (the substitution axiom and the continuity axiom).
The model is rather general, and the agents could be insurers, reinsurers, consumers, or even countries or regions. For convenience, we shall phrase the model in terms of individuals, consumers, or insurers.
The preferences are represented by expected utility, meaning that there is a set of continuous utility functions (indices) , such that if and only if , where x and y are random variables signifying final consumption, or final portfolios. The reader may recognize this as a hybrid of the Savage approach and the von Neumann–Morgenstern theory. Viewed in light of interpretations in a business world, however, it is in the spirit of the Savage theory provided all the agents have the same (subjective) probability measure P. This is illustrated later.
We assume monotonic preferences, and risk aversion, so that, granted enough smoothness, we have for all w in the relevant domains (note that the concepts of monotonicity and risk aversion make perfect sense without assuming the existence of these derivatives). Each agent is endowed with a random payoff , a random variable at time 0, called an initial endowment (or portfolio). More precisely, there exists a probability space such that agent i is entitled to payoff at time 1 if occurs, . This means that uncertainty is objective and external, and all the uncertainty is revealed at time 1. There is no informational asymmetry. All parties agree upon as the probabilistic description of the stochastic environment, the latter being unaffected by their actions. It will be convenient to posit that both expected values and variances exist for all these initial portfolios, which means that all , or just for short.
We suppose the agents can negotiate any affordable contracts among themselves, resulting in a new set of random variables , representing the possible final consumption to the different members of the group, or final portfolios (in a one-period model final consumption equals final wealth).
In the equilibrium version of our model, transactions are carried out right away at “market prices”, where represents the market price for any , i.e., it signifies the group’s valuation of the random variable c relative to the other random variables in at time 0. Notice that this trade, or risk exchange, takes place at time 0, and results in the random final consumption bundles , also at time 0. Then if happens, the consumption of agent i is , here a real number at time 1. The essential objective is then to determine:
(a) Pareto optimal sharing rules.
(b) The market price of any consumption bundle from the set of preferences of the agents and the joint cumulative probability distribution function of the random vector .
(c) For each , the final consumption bundle most preferred by agent i among those satisfying their budget constraint .
In case (a) market prices as in Equation (
1) below, are not part of the program.
2.1. Some Basic References
References to this type of problem are plentiful, both in the economic literature, in the actuarial literature, and otherwise scattered around in various journals. The model related to optimal consumption in a world of uncertainty is treated in [
3,
4,
5] in the economics literature. In the actuarial literature we have, for example, [
6] as well as the basic treatment by [
7,
8,
9], where the focus is on insurance and reinsurance. In the economics literature, we also have [
2,
10,
11,
12], here directed at the economics of insurance. These models have been reviewed and extended in, for example, [
13,
14,
15,
16,
17]. This type of model has also been extended to a dynamic setting in [
18] related to insurance, and there is a large literature in economic dynamics, where elements of this model are central, several of them treated in [
19,
20].
2.2. The No Arbitrage Requirement
Before we start, some basic facts are in order. First, observe that the possible events
is the sigma-field generated by the initial random variables
x, so that any random variable can be written in the form
for
f a suitable Borel-measurable function (this is a result that is known from measure theory, e.g., [
21]. This means that for the optimal final portfolios
for some appropriate functions
. In order to avoid trivialities, we assume that
is complete, i.e., augmented with all the sets of
P-measure zero.
Second, for (b) and (c) above we require that there is no arbitrage. By an arbitrage, or an arbitrage possibility, we mean the possibility of receiving a strictly positive amount at time 1 in some states of positive probability (), without paying anything net at time 0. Alternatively, an arbitrage would also exist if an agent obtains a strictly positive amount at time 0 with no further payments at time 1. In other words, an arbitrage possibility would yield a strictly positive amount with positive probability, either at time 0, or at time 1, or possibly at both time points, and with no payout at any time, i.e., the possibility of receiving something net with no risk. It is hardly surprising that this possibility can not be allowed in a simple and rational model of an insurance or financial market.
Let us assume that market prices exist, denote the market price of any risk y by , and consider risks in the space . Then we can show that unless the functional is linear and strictly positive, there would be arbitrage possibilities.
The following theorem can be shown:
Theorem 1. There is no arbitrage if and only if there exists a strictly positive random variable π, the state price deflator, representing prices through the relation: An analogue of the above theorem in financial markets, where only common stocks can be traded, is known as “The Fundamental Theorem of Asset Pricing”. In the finite case, the theorem is due to Steven Ross [
22] in the one period framework, and the same reasoning carry over to the dynamic case having a finite number of time periods. There is an extensive literature on the infinite dimensional case, some of which is reviewed in [
19].
Readers familiar with the economics of uncertainty will typically be acquainted with the concept of state prices; here is the Arrow–Debreu state price in units of probability.
My experience is that while this concept is well known to economists, it is lesser known or not quite appreciated by mathematicians. In order to explain its importance, we provide a basic exposition in
Appendix B.
3. Pareto Optimality
Next we introduce the concept of (strong) Pareto optimality of an allocation. This is a criterion of the outcome of a negotiations process between individuals that does not depend on the probability distribution of the initial portfolios.
We need the following definition:
Definition 1. An allocation is called feasible if: The concept of Pareto optimality offers a minimal and uncontroversial test that any social optimal economic outcome should pass. In words, an economic outcome is Pareto optimal if it is impossible to make some individuals better off without making some other individuals worse off. Formally we have:
Definition 2. A feasible allocation is called Pareto optimal if there is no feasible allocation with for all i and with for some j.
Next we give a few useful results in establishing Pareto optimality.
3.1. The Characterization of a Pareto Optimum
First we focus on the following “representative agent” result:
Theorem 2. Suppose are concave and increasing for all i. Then the allocation is Pareto optimal if and only if there exists a nonzero vector of agent weights such that solves the problem: In the above
, or equivalently
, is defined as the function, possibly depending on the agent weights
, satisfying the real optimization problem
defined by:
This problem is referred to as the sup convolution problem, and the function
is called the utility function of the representative agent in economics and finance, or the evaluation measure of the group of individuals in the reinsurance literature and in the theory of syndicates. Below we shall show how it is linked to the Lagrange multiplier associated with problem (
3).
Let us first explain the connection between the solutions of problems (
2) and (
3). In that regard we assume all the random variables are defined on the same probability space
with generic element
. A subset
,
, where
we call a
P-null set for short. Problem (
3) can be interpreted as a real-valued problem for any given state
.
Problem (
3) can be interpreted as a decision problem in which the group must share a “cake” of size
in order to maximize a weighted sum of the member’s utilities. The proof that the optimal solution
generated by the sequence of “cake-sharing” programs (
3) is the optimal solution of problem (
2) can be obtained by contradiction. The reason is the additive nature of utility. Suppose that this is not the case. Then there is a feasible allocation
other than the one generated by sequence (
3) that satisfies
for all
except for a
P-null set. But then, by Fubini’s Theorem,
, contradicting the fact that
y solves problem (
2).
We can now return to the proof of the theorem itself, which can be found in
Appendix C.
This basic result gives rise to the following characterization of a Pareto optimum. This result is known as Borch’s Theorem:
Theorem 3. A Pareto optimum y is characterized by the existence of non-negative agent weights and the real function such that: The proof of Theorem 3 can be found in
Appendix C.
First, to be noticed here is that the ’s only depend on the marginal utility functions, not on the probability distribution of the random vector of the agents’ initial holdings. Thus it is not necessary to know this probability distribution in order to characterize Pareto optimal allocations.
Second, as a by-product we have a characterization of the state price deflator
of the last sections, and this key quantity is connected to the economy via the identity:
where
is some normalizing constant. We return to this later when we consider a competitive equilibrium.
Alternatively, the above step can be further formalized by a theorem of Zahl [
23], who analyzed the part of infinite dimensional analysis where the Lagrange multiplier must be a function of the state variable. In our case, it means that the Lagrange multiplier computed at
is stochastic, since
is a random variable.
Next we illustrate by an example. For this we need the following definition: The relative risk aversion is defined by the function for any utility function u. First we consider the case of constant relative risk aversion.
Example 1. Consider the case of power utility, where for and for when , where the natural logarithm results as a limit when . This example only makes sense in the no-bankruptcy case where P-a.s. for all i. The parameters are then the relative risk aversions of the agents, which are given by positive constants for this class of preferences.
Consider first the case where . Here all the marginal utilities are given by , and using Theorem 3, the first order condition for Pareto optimality is: which implies that , a.s. Using the market clearing condition , a.s., which only says that no risk disappears during the process of risk sharing, we obtain: showing that the marginal utility of the representative agent is of the same type as that of the individual agents. The optimal sharing rules are linear, and given by: From this example, we notice that the optimal allocations depend on the initial ones only through the aggregate
. Thus we may write
for all
. This is quite general, and follows from (
2) of Theorem 2, as well as Equation (
3) and Equation (
4) of Theorem 3.
The linearity of the optimal allocations is lost when the relative risk aversions are allowed to be different for the various agents. This leads to non-linear contracts.
In addition, notice that we have dropped the -dependence in the marginal utility function of the representative agent. The explanation for this will come later, but notice that prices are determined modulo a normalization, which can here be taken to be . This means that the evaluation measure u does not depend on the sharing rule .
3.2. Risk Tolerance and Aggregation
Consider a group of individuals, let us call such a group a syndicate, where the sharing rules are Pareto optimal. Of interest now are two basic and useful results. For this we first define what we mean by the absolute risk aversion of an agent with utility function u. It is given by , and is strictly positive under our assumptions. The risk tolerance of an agent is defined by .
The first result relates the Pareto optimal allocations to a solution of a system of ordinary differential equations. It says the following:
The Pareto optimal contracts
as a real function of the real variable
, satisfies the non-linear, first-order ordinary differential equation:
where
is the absolute risk aversion function of the representative agent, and
is the absolute risk aversion of agent
i at the Pareto optimal allocation function
,
. The notations
and
have the same meaning as
in that the functions
u and
A may depend on the sharing rule
. Here
represent the initial conditions of these differential equations, where
The second result says that the risk tolerance of the representative agent is the sum or the risk tolerances of the individual members at the Pareto optimal allocations. More precisely,
as an equality between random variables. This allows us to rewrite the differential Equations (
5) as follows:
We summarize these two results as:
Theorem 4. (a) The risk tolerance of the syndicate equals the sum of the risk tolerances of the individual agents in a Pareto optimum.
(b) The real, Pareto optimal allocation functions , satisfy the first order ordinary, nonlinear differential Equations (5), or equivalently, (7). Since these two results are central in the theory, in
Appendix C we present a simple proof.
The result (
6) has several interesting interpretations. One is that the syndicate can better carry risk than the individuals can in autarky. For example, a mutual insurance company can be interpreted as a syndicate, where the syndicate members are the customers. This means that the members smooth their individual risks in a pool, and in a Pareto optimum, their individual liability is a real function of the aggregate risk.
By (
7) it follows that contracts
are all increasing in
x. This yields the mutuality principle. It means that an aggregate wealth increase will affect all members in a positive direction, and a wealth decrease will affect all the members negatively. The direction is the same for all, but how much is individual.
The economic consequences of pandemics may be observed to have these features: It affects most people negatively, but to a varying degree.
A nation, or any international organization like the EU, UN, or Red Cross, can be considered as a syndicate, as can the whole world for that matter, where the members are the inhabitants of the country, or nations of the organization, or in the world, respectively. Since a nation is, by the above result, less risk averse than the individual people that make it up, some projects are better undertaken by the state; they may simply be too grand for individual citizens. Typical infrastructural projects like roads, tunnels, bridges, railways, harbors, airports, museums, etc. are often undertaken by the state. Similar interpretations are valid for organizations, or the world. For example, the climate problems facing earth are too big for any nation to solve alone, which calls for international cooperation.
Pareto optimal contracts are characterized by a continuum of contracts along a Pareto optimal frontier. Consider for example the case
. The feasible contracts can then be thought of as a bounded section of the first quadrant by a concave curve, the Pareto frontier, from one axis to the other (see
Figure 1).
A tangent to the Pareto frontier is characterized by two numbers, the agent weights determining the slope of the tangent, which is . Imagine the line with this slope, , that cuts through the feasible region. It corresponds to a particular sharing rule. Then move the line with this slope, by varying the constant c, until it is tangency to the Pareto frontier. The point at which this happens corresponds to the Pareto optimum for this particular sharing rule. By varying the weightings, and repeating this process, the Pareto optimal frontier is spanned out.
The Pareto optimal frontier includes contracts that will not be likely outcomes of real negotiations between the parties, since they may not satisfy individual rationality. For
N individuals this means that only the section of the Pareto optimal frontier that satisfies:
will be rational for all parties. This section is called the core. For
it is indicated in
Figure 1.
3.3. HARA-Utility Functions
The class of utility functions with affine risk tolerances is called the Hyperbolic Absolute Risk Aversion (HARA) class, and plays a special role in this theory. The utility functions in this class can be given by analytic expressions. Recall that with expected utility, if the utility function represents the preference relation, then represents the same preference relation, where a and b are two scalars with .
The utility function
such that
is HARA with coefficients
if and only if there exists scalars
a and
b such that:
When the parameter , we have quadratic utility, which is also a member of the HARA class.
In the paper we will be interested in optimal risk sharing, and in order to gain some basic insight, it is an advantage to consider a class of utility functions where agents are allowed to have different preferences, and where optimal sharing rules are affine. Our next example satisfies both these criteria, and is included in the above list.
Example 2. Consider the case with negative exponential utility functions, with marginal utilities , where is the absolute risk aversion of agent i, or is the corresponding risk tolerance. Using the characterization (4), the first order conditions for Pareto optimal sharing rules are: After taking logarithms in this relation, and summing over i, market clearing implies: Furthermore, from the same first-order conditions we also obtain that the optimal sharing rules (or portfolios) can be written: The “reinsurance contracts”, if we for the moment use this interpretation, involve optimal sharing rules that are affine in . Market clearing holds here, since and .
As we will show in the next section, the result of this example is consistent with theory, since our utility functions belongs to the HARA class with identical cautiousness, or slope .
Contracts of this type are termed proportional reinsurance (the more correct term, affine, is not in industry use). The constants of proportionality are simply equal to to each agent’s risk tolerance, measured relative to the group. The more risk tolerant a member is, the larger fraction of the aggregate risk is held.
In order to compensate for the fact that the least risk-averse reinsurer will hold the larger proportion of the market, zero-sum side payments occur between the reinsurers, here represented by the terms .
Without these side payments, an agent, with a “small” initial endowment but with a large risk tolerance, would end up with a “large” final endowment, but this would not be consistent with individual rationality, or as we will see below, with the agents’ budget constraints.
This kind of treaty seems common in reinsurance practice, and is, moreover, easy to interpret and understand.
An example with the more general form of HARA utility class is the following:
Example 3. Consider the class , , when and . Using first-order characterization (4), we obtain the following: Some routine calculations show that the optimal sharing rules are given by: That is, the sharing rules are affine. Market clearing is seen to hold.
Our general risk sharing model with
N agents can also be specialized to two agents, one insurance customer and one insurer. The first to have used the model for this basic problem seems to have been [
23].
3.4. Affine Contracts
In this section we formalize what we have demonstrated so far when it comes to optimal risk sharing. Effectively we then rule out non-linear contracts.
Denote as above the absolute risk aversion function of an agent by , , and the risk tolerance function by . The relative risk aversion function is defined by . Recall, HARA utility means that the risk tolerance functions of the agents are affine, i.e., if , , where the and are all constants. Then affine sharing rules obtain provided the agents have HARA utility functions with the same cautiousness parameters, i.e., when , for all . More precisely:
Theorem 5. The Pareto optimal sharing rules are affine if and only if the risk tolerances are affine with identical cautiousness, i.e., for some constants , , , , ⇔ , for some constants and β, .
A short, and we claim original, proof of this theorem can be found in
Appendix C based on the system of differential equations in (
7).
There are several important insights when members belong to the class of utility functions considered above, which we return to later.
We next consider the pricing issue in a competitive market, where the concept of an equilibrium is central.
4. Equilibrium
The problem each agent
is supposed to solve is the following:
An important issue is, of course, existence (and uniqueness) of solutions to (
9). We shall not elaborate on this here, suffice it is to note the following. If:
is bounded (in
-norm), then existence is guaranteed (by i.a., the Banach–Alaogher theorem). In addition, a strictly concave
suffices for uniqueness.
See [
5,
6,
14,
19,
24,
25,
26,
27,
28,
29] for existence and uniqueness of equilibrium (existence of Arrow–Debreu equilibria in infinite-dimensional settings seems to have been first treated in [
25].
Definition 3. A competitive equilibrium is a collection consisting of a price functional Π and a feasible allocation such that for each i, solves the problem (9) and markets clear; (market clearing is usually defined by . Since we have strictly monotonic preferences, equality will result in equilibrium).
We close the system by assuming rational expectations. This means that the market clearing price implied by agent behavior is assumed to be the same as the price functional on which agent decisions are based. The main analytic issue is then the determination of equilibrium price behavior.
In this section we characterize a competitive equilibrium (CE) assuming that it exists. We take it that the initial portfolios are not identically equal to zero, and that a unique equilibrium exists. We also assume quite naturally that for each i. In fact, it seems reasonable that each agent is required to bring to the market an initial “endowment” of positive value (this is of course a weaker requirement than the positivity assumption P-a.s. for all i found in consumer theory).
The computation of an equilibrium requires that the joint probability distribution of the initial endowments is known. Only relative prices can be determined in equilibrium, modulo a normalizing constant.
In this case we have the following:
Theorem 6. Suppose the preferences of the agents are strictly monotonic and convex, i.e., and for all , and assume that a competitive equilibrium exists, where for each i. The equilibrium is then characterized by the existence of positive constants , , such that for the equilibrium allocation : where π is the Riesz representation of the pricing functional Π.
Comparing this result to the first order conditions of a Pareto optimum, Equation (
4) of Theorem 3, we notice that the Lagrange multipliers
are just the reciprocals of the agent weights
,
, implying that a competitive equilibrium is, if it exists, Pareto optimal.
We next explain the basics of the proof of this result, which can be found in
Appendix C.
In order to illustrate the new feature here, the pricing question, we present a simple example.
Example 4. Let us return to Example 2, where we now demonstrate how a single Pareto optimal point is picked out by a competitive equilibrium. This is done by determining the ray λ of agent weights.
In order to determine this vector , we employ the budget constraints: Hence, the optimal equilibrium allocations are completely determined in terms of the given primitives of the model. The ray λ can also be determined modulo a normalization. Letting denote this normalization, then: If we impose the normalization of the state price deflator, we obtain , in which case the constants λ are given by: Through this example, we also discover a “premium principle” in insurance, since market prices are given by: The pricing rule given by expression (11) is referred to as the “Esscher principle” in actuarial mathematics, but then with the important distinction that the aggregate market index in (11) is substituted by the risk z itself. For this latter “principle”, the pricing rule is of course no longer a linear functional, which will, unfortunately, lead to arbitrage possibilities and other anomalies. A CE picks out a point on the Pareto frontier that satisfies individual rationality, so the competitive equilibrium allocation is located in the core. In the case of the two-agent problem, this point is located in the individual rationality section of the Pareto frontier in the NE-quadrant (see
Figure 1).
Recall, that the first order conditions for optimality does not depend on probabilities, but when we employ the budget constraints, probabilities enter.
In [
30], risk sharing and contingent premiums is discussed in relation to the UK Covid-19 economic losses.
We next include some results on Pareto optimal risk sharing in groups, where there is a decision to be undertaken by the group. Here we also take a look at the situation where agents can have different probability beliefs.
5. Syndicates I
A syndicate is defined to be a group of individuals who must make a common decision under uncertainty that will result in a payoff to be shared jointly among the members. Let us call the common decision , where is the decision space. We limit ourselves to , a subset of the real numbers.
The first result we have in mind belongs to group decision theory, or the theory of syndicates. It specifies a payoff to the group , where Z is a random variable with realization z, and g is a real function: , assumed to be a smooth -function.
The syndicate is faced with an investment project with payoff function g. In this section we assume that before action a is taken, the syndicate comes together and negotiates a sharing rule, as we have described in the above. Here we only consider sharing rules that do not depend on the decision itself. This means that the members of the syndicate are motivated by income and not by the decision itself.
Individual sharing rules will accordingly all depend on the payoff and the random variable Z, but not on the decision a. Otherwise Pareto optimal sharing rules are found by the same principles as before.
We shall here allow the various members to have individual probability beliefs regarding the random variable
Z, represented by the probability density functions
,
. We assume the individuals to be Savage rational decision makers satisfying Savage’s 7 axioms ([
31]). The probability distributions are defined on the same support, and we assume them to be mutually, absolutely continuous with respect to each other. Suppose
and
represent two arbitrary, random prospects, members of some set
F of “acts”, facing agent
i, defined on the same probability space. Savage’s expected utility theorem then says:
There exist a utility index and a probability distribution such that if and only if ,
In other words, the preference relation ⪰ defined on the set of random prospects F has a numerical representation not only given by a utility index , but also by a probability distribution . In our case, the corresponds to the cumulative probability distribution function where the derivative . The strict interpretation of this representation is that all uncertainty is subjective, while in the von Neumann–Morgenstern framework, all uncertainty is objective.
It has been claimed that the Savage interpretation works better in a business world. The following simple example illustrates:
Example 5. Consider the two following lotteries, called acts by Savage:
: You win 1000 Euro if the football (soccer) team Barcelona ends among the top three teams in its division, Ecuardorian Serie A, next year, otherwise you get 0.
: You win 1000 Euro if a fair coin lands heads in 4 consecutive trials, otherwise you get 0.
Suppose you get the choice between and , and your utility function is u. You would then calculate: Here we have normalized so that and , which we can do in both frameworks. Also, p is the probability of success in lottery .
Notice that your choice does not depend on the utility function u. In the von Neumann–Morgenstern framework, this is not a bona fide decision problem, since the choice only depends on probabilities, which are objective and hence known to the decision maker. In the Savage approach however, probabilities are subjective and part of the preference representation, and is accordingly a decision problem.
5.1. Homogeneous Probability Beliefs
When probabilities are the same across the agents, the numerical representation of preferences looks the same as with the von Neumann–Morgenstern interpretation, where the probabilities are considered to be objective, so it can in principal be interpreted either way. However, recall Example 5.
If one happens to be a Bayesian however, one will argue a bit differently. According to [
32], if two people have the same priors, and their posteriors for an event
A are common knowledge, then these posteriors are equal.
The normal situation is that the weights
will affect the sharing rules
, and by this the risk tolerance of the syndicate, via the inequalities within the allocation
. This is true with or without homogeneous beliefs. However, if the risk tolerances of the individual members are of the HARA type with equal cautiousness parameter that is given by
, we obtain from the result (
6) that the risk tolerance of the syndicate is given by:
where
is a constant, and where
for all
z by market clearing. (Recall, no risk disappears after risk sharing; the total risk is, presumably now better distributed among the members so that the ones who can carry more risk, do so at the optimum.) Notice that the risk tolerance of the group does not depend on the weights
, so we have dropped the superscript on
. This also means that the evaluation measure of the syndicate, what we earlier denoted by
, does not depend on
either, so we refer to it as
when this is the case. This means that the inequalities in the wealth distribution does not affect the syndicate’s willingness to risk-taking.
Because of this fact, we limit ourselves for the moment to the case of HARA-utility functions for the members of the group, where the risk tolerances are all affine with the same cautiousness . Here for all when probability beliefs are homogeneous.
In this situation, we define the ‘derived’ utility
of member
i as follows:
This means that once the group has been established and the members have agreed upon a Pareto optimal sharing rule , we consider the individual’s utility function after this sharing rule has been implemented, as a function of the aggregate wealth x. Since there is an element of optimization behind this construction, we call this the derived utility functions of the members (or, perhaps, the indirect utility functions). We can then show the following:
Theorem 7. Let the members of the group all have affine risk tolerances with the same cautiousness. Then the risk tolerance of any member’s derived utility functions , , is the same as the risk tolerance of the syndicate.
As a consequence of this result, any member of such a group can be given the task of making decisions under uncertainty on behalf of the group. There is unanimity on the management of risk followed by the planner. Such a group is called an unanimous syndicate, which we formalize later.
Since the result is rather central, in
Appendix C we present a proof.
The attitude towards the aggregate risk of each member of the pool is identical, and equal to the one of the central planner, despite the fact that the members have different preferences to start with. These important properties hold only in the case of HARA-utility functions with identical cautiousness , as will be pointed out later.
This theorem is of course a bit special, but is nevertheless a remarkable result. If the conditions were true in practical life, there would not be much disagreement among us. However recall that the individuals are assumed to be equally well informed and, moreover, have homogeneous probability beliefs.
In other contexts than purely financial, ‘group thinking’ may imply more agreement than is desirable, in that the ability of flexibility and critical thinking may be lost.
With differential information and/or heterogeneous probabilities, results might be different, and, perhaps, more realistic. We shall return to this below.
To illustrate, we consider an example, where the individual members have different preferences of the negative exponential type described earlier, a HARA class of utilities with equal cautiousness across the population.
5.2. Example: Optimal Diversification
Suppose that a partnership (syndicate) has the opportunity to invest its capital of 1 USD in a project with an uncertain return of Z per dollar invested. The partnership can borrow, or lend capital in any amount a at interest rate r, so a payoff is available, and the decision problem is to choose an optimal amount of debt, or equivalently, an optimal amount invested in the risky project.
Suppose that all members agree that Z is normally distributed with probability density having mean m and variance v, and moreover the individuals have all negative exponential utility functions , where are the risk tolerances of the individuals.
Then we know that the syndicate has risk tolerance and evaluation measure , which does not depend on .
Before the decision is made, the members have decided on a Pareto optimal sharing rule, which under our assumptions is given by for some zero sum constants , as explained in Example 2.
The syndicates decision problem consists in finding the value of
a which maximizes:
The first order condition is both necessary and sufficient for a maximum here, so we solve
, which has solution:
We show the calculation below.
The interpretation of this expression is:
(i) More is invested in the risky project the larger the risk tolerance of the syndicate.
(ii) More is invested risky the larger the “risk premium” .
(iii) Less is invested risky the larger the variance v.
For those familiar with the optimal portfolio selection theory in finance, the above solution has the same basic features. See [
33,
34,
35,
36]. The two first consider intertemporary models in discrete time, the next uses continuous time with continuous dynamics, while the last one uses continuous time and continuous dynamics with jumps included.
Let us look at the calculations:
Utilizing the form of the normal distribution and forming a full square, this is equivalent to:
Using that the integral of a probability distribution equals 1, and the definition of expected value of a normal variate, we now have after canceling the constants:
which proves our result (
14).
Now we come to the more interesting point, the one of agreement in the syndicate. Consider any member of the group, let us say member i. This agent’s decision problem after the syndicate has been formed and a Pareto optimal sharing rule
has been established, is the following:
The latter optimization problem is seen to give the same result as the problem of the central planner, since the two different constants, versus , multiplying the exponential function, simply cancel in both cases after differentiation, equating the result to 0, and thus do not affect the optimal solution .
Hence we have an application of Theorem 7.
5.3. Heterogeneous Probability Beliefs
With different probability functions in the Savage representations, we choose to interpret these distributions as posterior probability distributions, which may be the result from different priors where information may, or may not, be different. Below we shall also make use of the dispersion functions , assuming for all z and for all i.
In a syndicate, the dispersion of the individuals result in a syndicate dispersion at a Pareto optimum, which is a mixture of individual dispersions and the derivatives of the sharing rules with respect to x. Since the latter depend on the utility functions of the members, the resulting probability distribution of the central planner is a mixture of the individual probability distributions and the corresponding utility functions via the sharing rules.
We do not obtain a simple separation between some probability distribution of the syndicate that depend solely on the members probability distributions, and a utility function which is related to a sup-convolution problem. However, we characterize conditions under which we obtain a separation between a resulting probability distribution of the syndicate,
, and its utility function
, which comprises the evaluation measure, call it
of the syndicate. Since the resulting probability density
f will also depend on the member’s utility functions through the sharing rules, this brings the theory well into the framework of [
31]. Below we follow [
1,
37,
38], where many of the proofs that we omit can be found.
Unlike our treatment in
Section 3 and
Section 4, we now have both a decision
a and a random variable
Z that affect the payoff
. In order to be precise, this calls for some definitions.
By a sharing rule, we mean a set of functions:
A contract is an ordered pair,
, of a sharing rule
y and a decision
a. A contract
Pareto dominates another contract
if, for all
i:
with strong inequality for at least one
i. The operator
denotes expectation with respect to the density
. A contract is Pareto optimal if there does not exist any Pareto dominating contract. This generates a partial order over contracts. The associated preference order is called the Pareto order of contracts.
A decision
a is Pareto-preferred to a decision
if there exists a sharing rule
y such that:
where we have dropped the possible
-dependence for notational convenience. When we wish to take account of the the dependence of the sharing rule on decision
a, we write
.
This defines a partial order on and is called the Pareto order on .
A sharing rule
y is Pareto optimal for some
provided there does not exist a
such that:
with strong inequality for at least one agent
j.
Generalizing the results of
Section 3, let
y be a Pareto optimal sharing rule for
. Then there exists a vector of agent weights
such that
maximizes:
Assume that for all z and that for all . By optimizing the Lagrange function of the above problem, we can show the following:
Theorem 8. First order necessary and sufficient conditions for Pareto optimality of the sharing rule y is that there exists non-negative weights and a function such that: We say that a complete order on
can be derived from an evaluation measure if there exists a function
such that
a is preferred to
if and only if:
We now assume as in
Section 5.1 and
Section 5.2 that a sharing rule is chosen before decision
a is considered, in which case the agent weights
do not depend on
a. We seek conditions for the existence of an evaluation measure representing group decision processes.
Towards this end we start with a sufficiently rich set and suppose that the group must choose a Pareto optimal contract. Let be such a contract. Then y must be a Pareto optimal sharing rule for a, and since this is characterized by the weights , which do not depend on a, y is also Pareto optimal for all .
Now, let
be the weights corresponding to
y. Then it must be the case that:
The reason is that the feasible set over is convex from our assumption about the structure of .
Then we can reformulate (
15) to:
We now define an order ⪰ on
by:
An order ⪰ is said to be Pareto-inclusive if a is Pareto-preferred to to imply that .
Here it can be shown that ⪰ is Pareto inclusive. The order ⪰ selects the same optimal decision as the group chooses.
We now assume that an evaluation measure
exists independent of
a. Then by the use of Theorem 8 notice that:
In other words, the Lagrange multiplier
is the marginal change in the evaluation measure with respect to
x, just as
in the proof of Theorem 3. This is also in analogy with the way the state price in
Appendix B is connected to the marginal utility of the representative agent via
, here related to equilibrium.
We have the risk tolerances
of the agents and dispersion functions
of the probability distributions,
. Since
, we can reconstruct the probability densities from the dispersion functions as follows:
exp
. We now define the corresponding concepts for the syndicate:
The following general properties hold for the functions
,
,
, and
:
Here (i) and (ii) are the extensions of (
6) and (
7) to the inhomogeneous case, while (iii) is new. It implies that the syndicate’s dispersion function depends on the individuals’ dispersion functions as well as of the sharing rule, which in turn depend on the members’ utility functions.
Returning to the question mentioned earlier: Given a Pareto optimal sharing rule y, under what conditions, if any, will the syndicate behave as a Savage-rational decision maker in the choice of a?
The answer is that this happens if and only if the evaluation measure
is separable with respect to
x and
z, that is, if and only if there exist a function
and a probability density
such that:
where
is strictly concave and strictly increasing and
is mutually absolutely continuous with respect to
for all
and possesses a dispersion function.
As a consequence of (iii) above, the syndicate’s probability density function depends on the members’ probability density functions as well as their utility functions, which is remarkable and emphasizes the Savage (or de Finetti)-style nature of the syndicate.
In this case, the complete preference order on
is generated by:
and the syndicate is said to be a Wilson syndicate (after Robert Wilson, who was awarded the Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel in 2020, together with Paul R. Milgrom).
For a Wilson syndicate we notice that:
does not depend on
z, and is the syndicates risk tolerance, while the syndicates dispersion function is given by:
We now need a formal definition: A sharing rule is said to be affine provided is independent of x for all . We can then show the following (see Amershi and Stockenius (1983)):
Theorem 9. If the members of a syndicate have the same probability assessments, or if the sharing rules are affine, then the syndicate is a Wilson syndicate.
An example may illustrate.
Example 6. Let for and , for , where . Consider the particular sharing rule where . The associated Pareto optimal contracts y satisfy: From for all , it follows that which is separable, hence this is a Wilson syndicate.
The sharing rules are given by , . The evaluation measure , from which we can take and , a trapezium distribution, which is a mixture of which is uniform, and which is triangular, but its final shape is also influenced by and .
Here and do not depend on x, so the sharing rules are here linear. According to the result above, this fact is enough to verify that we here have a Wilson syndicate as well.
6. Syndicates II
In part I, we formulated the conditions under which a group of people behave as a Savage-rational decision maker. Except from
Section 5.1 and
Section 5.2, the theory in part I is relevant for a group of people who use a given Pareto optimal sharing rule. The constructed evaluation measure is valid for this particular weighting
, with an associate Pareto-inclusive complete preference ordering on
.
In this section, we ask the following question: Under which conditions are the preference orderings generated by different -weightings equivalent?
We need the following definition: A sharing rule is said to be determinate provided does not depend on z for all .
A sharing rule is both affine and determinate if is a constant for all i.
Note that in Example 5 the sharing rule is not determinate.
For what comes next we need the following technical result:
Notice that the evaluation measure for a given weighting is determined by the relative sizes of the weights, so the partial derivative of with respect to can be considered a change in the weighting.
By this result, the following can now be shown:
Theorem 10. The evaluation measure of a syndicate is determined independent, modulo a proportional constant, of the chosen Pareto optimal sharing rule λ if and only if all the sharing rules are affine and determinate.
A natural question is now how to characterize utility functions that guarantee that the Pareto optimal sharing rules are affine and determinate. The first result in this direction is closely related to Theorem 7:
Theorem 11. Suppose that the probability assessments are all homogeneous. All the Pareto optimal sharing rules are affine and determinate if and only if the member’s utility functions all have linear risk tolerances (HARA) with an identical cautiousness parameter.
In order to connect more directly to the result in Theorem 7, we need the following two definitions:
A group of people is said to be a
-unanimous syndicate provided for all
i,
j,
a, and
we have:
is equivalent to:
where
y is the Pareto optimal sharing rule associated with
.
A syndicate which is -unanimous for every is said to be an unanimous syndicate.
In an unanimous syndicate one can delegate the decision making process to any of the syndicate’s members.
We now have two results, which we also cite without proofs:
Theorem 12. A syndicate is λ-unanimous if and only if the λ-Pareto optimal sharing rule is affine and determinate.
Corollary 1. A syndicate is unanimous if and only if all the Pareto optimal sharing rules are linear and determinate.
The following summarizes many of the above results in the case when there is agreement with regard to probability assessments:
Theorem 13. In a syndicate with agreement about the probability distributions, the following are equivalent:
(1) The syndicate is unanimous.
(2) All members of the group have HARA-class utility functions with the same degree of cautiousness.
(3) The evaluation measure of the syndicate is determined independent of λ, or equivalently, the complete preference orders on generated by the group’s different evaluation measures are all equivalent.
(4) The group’s Pareto ordering on is complete.
We notice that (1) and (2) is a sharpening of our previous Theorem 7 in
Section 5.1 in that we in fact have equivalence between the HARA-class with the same cautiousness and the property of unanimity.
The following can be summarized in the general case:
Theorem 14. For a syndicate with heterogenous probability beliefs the following are equivalent:
(1) The syndicate is λ-unanimous for some λ.
(2) The group is an unanimous syndicate.
(3) All the members of the group have constant absolute risk aversions.
(4) The preference orders generated by are identical for all λ.
(5) The group’s Pareto-order is complete.
This theorem gives an answer to our earlier discussion about agreements in a group. When probability assessments are different between the members of the group, it is more difficult for the group to be a unanimous syndicate: All the members must have negative exponential utility functions, which is more limiting than having general HARA utility with the same cautiousness parameter. Their absolute risk aversions are allowed to be different, but otherwise this class is, of course, somewhat restricted. However, this property constitutes a good example in illustrating optimal risk sharing, as we pointed out in Example 2 of
Section 3.3, and actually provides reinsurance treaties that are in daily use in the insurance industry.
Wilson in [
1] presented an example of a group decision problem of the same type as we discussed in
Section 5.2 with the difference that the members have different probability distributions for the risky asset. It is interesting to notice how the syndicate’s probability assessment is constructed in this case. We focus on this part in his example.
6.1. An Example
We consider the same problem as in
Section 5.2 but with heterogeneous probabilities. As before, we have negative exponential utility functions with risk tolerances
, which implies that the syndicate has the same type of utility function with risk tolerance
. Here member
i has a probability density according to the
distribution, so these are normally distributed with means and variances depending on the individual.
The sharing rules satisfy
,
. Here the syndicate’s probability density happens to be normal with mean
m and variance
v, where:
and
Here we have used property (iii) for dispersions in
Section 5.3.
Notice how the syndicate’s probability distribution is a mixture of the individual members’ probability distributions and the utility functions of the members, represented by the risk tolerances of the individuals and of the syndicate. This places this theory in the subjective tradition of representing preferences in a non-trivial way.
The computations from now on are similar to our previous calculations, where the optimal investment in the risky asset is given by Equation (
14), with
m and
v as given above.
We also have to show that the result is the same when the task of decision making is left in the hands of an arbitrary member of the syndicate. This goes much as before, but notice that here the sharing rules depend on z and hence on a mixture of all the parameters of the preferences and the probability distributions, but since the sharing rules are also determinate, these dependencies simplify after differentiation with respect to a, and the result becomes as before, in agreement with Property (2) in Theorem 14.
In our model the agents know their own probability distribution as well as all those of the other agents. Accordingly, the model can not be used directly to analyze asymmetric information. However, close variants of the model have been utilized for this topic as well. An example is the model of [
39], who analyzes the situation with two agents, a principal and an agent. Only the agent can make the decision, which can not be observed by the principal, which is where the asymmetric information comes in (Bengt Holmstrøm obtained the Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel in 2016, together with Oliver Hart. Holmstrøm’s Ph.D. advisor was Robert Wilson). The resulting model was used for situations with moral hazard. A model where the different agents have different priors as well as different information is presented by [
40,
41]. see also [
42].
We can use the theory of syndicates at several levels. At the first level for risk sharing between individuals, the objectives consist in maximizing individual preferences of the members, assuming these can be represented by expected utility. This leads to an evaluation measures for the mutual insurance company, the reinsurance market, the nation, or the international organization, respectively, whatever the focus of the study. This construction allows us to form several levels of syndication.
An example of this is given in [
43], where marine insurance is studied at three levels. The preferences of ship owners give rise to the evaluation measures of the various P&I-Club, and the latter give rise to the final evaluation measure of the reinsurance syndicate, where the various clubs obtain their reinsurances. From this, prices in the insurance market will be formed according to the theory.
The situation with different probability beliefs typically gives rise to betting between the agents. We next consider this phenomenon when they do not formally enter a syndicate and agree on a sharing rule, which is the case when the model is interpreted as an Arrow–Debreu equilibrium model.
7. Syndicates, Financial Markets, and General Equilibrium
Consider as in
Section 4 an exchange economy under uncertainty with one consumption good. The
N agents are characterized by utility functions and associated probability density functions as well as initial endowments, with the same assumptions as above. The uncertainty in the economy is associated with a random vector
of dimension
m say, which affect the market portfolio
, where
are the initial endowment of agent
i as before. We assume the joint probability distribution of the vector
is known, although we do not need to specify the functional relationship between
x and
Z here. We know that any competitive equilibrium is Pareto optimal, and that any Pareto optimum can be implemented as a competitive equilibrium, possibly after a redistribution of the initial endowments (by the First and Second Welfare Theorem).
The extension to a multivariate
Z will primarily affect the relationship (iii) in
Section 5.3. If we define the marginal dispersions of the individuals and the syndicate as:
respectively, then we obtain the following generalization of (iii):
By extending this to higher order partial derivatives, the joint distribution can be determined from the sharing rules and higher order dispersions.
An alternative to this, and a bit outside the theme of the paper, is to determine the univariate marginal distributions from the above equations, and employ copulas to determine the joint distribution. Correlation coefficients are good measures of dependence in the case of joint normality, but in general can copulas give a better picture?
Define the decision
a by
for any realization
z of
Z, and consider the set:
The expectation is taken with respect to the joint probability distribution , which is determined as described above.
In an exchange economy, we can interpret a decision as a budget-feasible consumption plan for a “pseudo agent” with initial allocation who is confronted with state prices in state z. Defined this way, the set is convex and satisfies our earlier conditions.
Based on the syndicate theory, in particular Theorem 8, construction prices occur in equilibrium if and only if there exists a such that is the optimal decision in based on the preference order generated by the evaluation measure . The evaluation measure depends in general on the weights , where is the optimal consumption of agent i, .
An economy populated with agents
is said to have the aggregation property if all the equilibria are characterized by the same price functional based on
(modulo a normalization), see [
37].
Suppose now that is not a constant as varies. Then we say that there is a non-trivial social risk.
Under this assumption, the following result can be proven:
Proposition 2. The following is equivalent:
(i) The economy has the aggregation property.
(ii) The economy is equivalent to an unanimous syndicate.
If
for all
, then
so the ratio between prices in different states of the world would be:
independent of the sharing rule
, which is a degenerate case, since this ratio is in general equal to:
for any states
z and
s. This follows from the first order conditions, where the the marginal rates of substitution of consumption in states
z and
s is the same for all agents
.
A market constructed this way would typically imply “betting” between agents with different probability beliefs (and has been questioned for this added complexity). To see how a sharing rule will typically look like, consider the market analogue to the syndicate in the example in
Section 6.1, where
. The sharing rules
,
are given by:
where
(see Example 2), and with the following expression for agent 2:
when
. Here we have used the system differential equations given in
Section 5.3, equation (ii). Notice that
for all
z as the case should be.
According to agent 1’s probability belief, realization z should be close to . If this happens and , then the z-dependent term (the third term on the right-hand side) in the above expression involves a net transfer from agent 2 to agent 1 at time 1, in which case agent 1 wins the bet.
On the other hand, if z falls close to and , agent 2 wins.
In the situation of Proposition 1, the optimal sharing rules are linear, in which case we may alternatively think of the model as a stock market economy (in contrast to the more general Arrow–Debreu economy), at least if probability beliefs are homogeneous. Such market sharing rules are automatically linear, since stocks can be bought and sold in proportions only (number of shares times price). Such a market is sometimes said to be effectively complete when probability beliefs are homogeneous, since non-linear contracts are not optimal.
Under certain conditions we may implement a securities market equilibrium with
N securities and associated stock prices
S at time 0, and portfolio weights
, such that
has security market equilibrium with the same consumption allocation
as in the Arrow–Debreu economy (see for example [
19] for a definition of such a security market equilibrium). This construction can be carried out when the securities market is complete, but can alternatively result with linear sharing rules as explained above.
In the one-period model, without explicit Brownian motion dynamics, a simple model obtains when the state space is finite. When the number of the risky stock-owned companies, whose vectors of state-contingent payoffs are linearly independent, is at least equal to, or larger than the number of states, the resulting market is complete, recall the last result in
Appendix B.1.1 (
Appendix B). In our setting, we have an uncountable large state space, since we have probability densities on compacts. This could, perhaps, be related to the theory of so-called “large markets”, which uses contiguity concepts and is beyond the scope of the present article, see for example [
44].
Non-linear contracts, like various derivatives and options, do exist and are traded in real life markets, and one may wonder if this can be analyzed within the framework of the present model. Consider for example a contract with payoff
at time 1,
, still with homogeneous probability beliefs. Its price at time 0 is simply:
provided the integral exists. Such a claim is, moreover, not redundant in this model, and the optimal consumption allocations
can be attained in a market of stock-owned firms only. The completeness issue is if such a security can be spanned by a portfolio of the given securities.
In general, with the existence of general Arrow–Debreu securities, the finite number of optimal contracts, interpreted as securities in a stock market, is not sufficient to determine the Arrow–Debreu state prices. However, in the present setting, the Arrow–Debreu prices are known from the structure of the model.
If we go beyond the class of HARA-utility functions, we encounter non-linear Pareto optimal contracts. However, the resulting contracts may not come close to the financial instruments that market participants want to trade.
When probability beliefs are inhomogeneous, betting occurs as we have demonstrated above. However, the bets are structured by the model. In the standard model, derivatives are considered as bets on the state of the economy, or on the price of one of more of the given securities or functions of such, but the setting only leaves the agents indifferent between status quo or actually acquiring the derivative, short or long. In the present setting, agents bet actively, more like in the real world, and one may wonder if this added feature could be useful in analyzing derivatives.
8. Conclusions
We reviewed optimal risk sharing among members of various organizations. The starting point was optimal risk sharing in an Arrow–Debreu economy, or equivalently, in a Borch-type reinsurance market. From the results of this model we inferred how risk is optimally distributed between individuals according to their preferences and initial endowments. Not surprisingly, risk aversions play an important role. In addition to employing the criterion of Pareto optimality, we also narrowed the scope to the pricing problem connected with competitive equilibrium within the same framework.
The mutuality principle basically says that when it is blowing from the north, all the trees bend southwards: This principle, we claim, can be connected to the economic consequences of a pandemic; it makes most people worse off, and when it is over, those who survived may improve their situation (although it may take some time).
From this we progressed to a review of the more general theory of syndicates, where, in addition, a group of people is to make a common decision under uncertainty. Conditions were formulated under which the syndicate behaves as a rational Savage decision maker, with an associated evaluation measure.
Depending on the structure of this measure, we can infer the degree of unanimity in the syndicate. Presumably, a theory like this can shed some light on what goes on in world meetings, like for example, the UN Climate Change Conference (COP26) held in Glasgow in November 2021. It basically says that it is hard to agree, which was experienced. Lastly, we also extended the syndicate setting to a competitive market equilibrium.