1. Introduction
In economics, consumption decisions are the result of an optimizing choice. The consumer chooses the consumption bundle that maximizes their felicity, represented by an utility function, given a budget constraint. Since Ramsey [
1], the dynamics of consumption are analyzed considering a representative consumer which maximizes the weighted sum of all their future lifetime utilities, wherein weights decrease with time. In this context, the consumer is seen as a perfectly rational individual, who can exactly predict future earnings and prices and will never change their preferences over time.
However, considering some specific goods, such as alcohol, tobacco or drugs, the phenomenon of addiction may arise and the Ramsey model could be inappropriate for a correct analysis. The state of addiction can lead the consumer to give more importance to the immediate consumption of the addictive good regardless of anything else. In general, addiction creates physical abstinence or withdrawal symptoms when the use of the substance is discontinued, and generates tolerance, which is a physiological phenomenon requiring the individual to use more and more of the substance [
2,
3]. However, addiction does not arise from one day to another: it is the result of a habitual consumption perpetuated over time. The consumption of possibly addictive goods implies a habit formation process, in which the habit is built day by day through consumption itself. Hence, a habit is formed when past and current consumption are linked by a positive relation. The higher previous consumption was, the larger the habit, and the higher the current consumption level needs to be to deliver the same utility. Pioneer studies on habit formation are due to Gorman [
4], Pollak [
5], Lluch [
6] and Boyer [
7], but the reference model for applied studies of rational addiction (RA) is proposed by Becker and Murphy [
8].
This model is characterized by a constant discount rate for future utilities that implies a constant rate of time preference. The rate of time preference is a subjective indicator of impatience representing the desire of an agent to anticipate and enjoy the benefits stemming from higher current consumption. A high rate of time preference lowers the propensity towards future utility in determining current consumption choices. This property is known as the time consistency of consumers, as once they have chosen an optimal consumption path in, e.g., time t, they will stick to it even if allowed to reconsider its consumption path in, e.g., .
Such a preference structure cannot properly describe situations such as addiction to alcohol (but also in drug use or cigarette smoking), or the existence of goods as holidays and works of art whose benefits continue over the consumption act. For these reasons, several works, starting with Strotz [
9], Blanchard and Fisher [
10], Deaton [
11] and Romer [
12], have criticized the assumption of a constant rate of time preference, as suggested more by convenience than economic rationales. Critics have also been made by empirical works, such as Bishai [
13] and Laibson et al. [
14], who rejected the hypothesis of constant discounting.
One parsimonious attempt at modeling time inconsistency is by introducing the quasi-hyperbolic discounting of future utilities that enable capturing present-biased time preferences. In the formulation proposed by [
15], the degree of present bias is modeled through an additional discount parameter
, which reduces the weight of all future utilities accordingly. As a consequence, the optimal future consumption path will never be realized because the intertemporal trade-off changes over time as each future period becomes present and the present bias kicks in.
Present-biased preferences modeled through quasi-hyperbolic discounting have been applied to a variety of choices and situations. For instance, refs. [
15,
16,
17,
18,
19] studied consumption and saving behavior; ref. [
20] studied retirement decisions; ref. [
21] applied it to economic growth; ref. [
22] analyzed caloric intake; ref. [
23] applied it to welfare program participation and labor supply; ref. [
24] applied it to job search; and ref. [
25] applied it to gym attendance). More closely related to our study, ref. [
26] introduced quasi-hyperbolic discounting in an RA setting and applied it smoking behavior (also see [
27,
28] for similar, more recent applications.
Present-biased preferences, however, are not the only way in which time inconsistency may arise, and such a formulation may not be sufficient to capture more complex forms of time inconsistency. For instance, Piccoli and Tiezzi [
29], who developed an empirical test for quasi-hyperbolic discounting within the rational addiction framework developed by [
26], found no evidence of time inconsistency of this type for Russian smokers.
Given the conspicuous critics of the constant rate of time preference of the RA model and the possibly insufficient complexity of the quasi-hyperbolic discounting extension, this paper contributes to the literature by proposing an extended theory of rational addiction (ERA) that extends the RA model proposed by Becker and Murphy [
8]. The relevance of properly introducing time-inconsistent consumers within the RA framework has also recently been highlighted by [
30]. In our model, future utilities are discounted by an endogenous discount rate that depends on the stock of habits accumulated over time and is thus capable of describing the behavior of more complex time inconsistent consumers, but remains encompassing to the RA model itself as a special case. In the present paper, we solve the ERA model, study the existence and uniqueness of a steady state in consumption and habits, and analyze its local stability properties. We then open the way for a possible explanation in the observed heterogeneity of the rate of time preference, which is through the heterogeneity of the habit-formation process: individuals that more rapidly accumulate habits will increase their discount rate faster over time and may behave very differently from other individuals. We study these heterogeneity properties of the rate of time preference by a comparative dynamics exercise.
This approach, first introduced by Epstein and Shi [
31], generates a non-constant rate of time preference which depends on habits—an index of past consumption. The consumption decision is influenced by the time at which the decision is taken, as different time periods imply different stocks of habits (which evolve over time) and therefore different discount rates. Thus, a consumer that is allowed to reconsider their consumption plan in a future period may choose to change plans, because the accumulated habits have increased their discount rate, and hence the weights given to future utilities. In contrast to Epstein and Shi [
31], we keep the general utility structure of the RA model since it allows to model the negative effects that habits can have on consumers, for example, through health problems (see [
32,
33,
34,
35,
36]). A similar approach has recently been applied by [
37,
38] to the study of health behavior and its consequences for aging and longevity.
The paper is structured as follows.
Section 2 introduces the extended rational addiction model, where the consumer maximizes a lifetime utility function with an endogenous discount rate depending on habits. Euler equations resulting from the ERA model are derived and analyzed.
Section 3 proposes a definition of the steady state and derives its properties.
Section 4 assesses the stability properties of the steady state.
Section 5 discusses heterogeneity in the habit formation process.
Section 6 presents the comparative dynamics of the rate of time preferences and
Section 7 concludes.
2. The Extended Theory Rational Addiction (ERA)
The reference model in the field of addiction is the rational addiction model proposed by Becker and Murphy [
8]. This model endows some characteristics that make it an appealing tool for applied works (see, for example, [
32,
39,
40]), being characterized by a linear Euler equation with a simple test for the presence of addiction. This comes at a cost: a constant discount rate equal to the rate of return to savings is assumed. This assumption has never been supported by theoretical reasoning or empirical evidence and has been widely criticized by the literature. We propose to relax this assumption in favor of an endogenous specification of the discount rate determined by the habits accumulated by the individuals.
The underlying assumption is that habits are likely to influence the discount rate and the rate of time preference inducing increased impatience and, eventually, addiction. In doing so, we follow the work of Epstein and Shi [
31], in which the discount rate is a function of the stock of habits, an index of past consumption (Becker and Murphy [
8] refers to this concept as the “consumption capital,” in a framework in which “past consumption of
c affects current utility through a process of learning by doing….” We prefer to use the definition proposed by Ryder and Heal [
41], which explicitly talks about habits in determining this index of past consumption.) denoted as
z. To enlighten notation, we omit the time indication for time varying variables, except when integrating over time. For instance,
would generally be noted as simply
z. An individual’s degree of habit is represented by the stock variable
z, which accumulates according to a dynamic process. Generally, this process depends on the personal characteristics of the individual and on the consumption level of the addictive good
c. The endogeneity of the discount rate implies that the rate of time preference is also endogenous, making the analysis more interesting in terms of optimizing behavior.
In this framework, consumers maximize their felicity, which depends on the consumption of two goods and the stock of habits. The first good,
g, is a composite good, which does not cause addiction. The second good,
c, is a potentially addictive good (by which we mean a good which generates addiction when consumed under certain conditions, for example, in high quantity) which generates habits in consumption. Alcoholic beverages, for example, are potentially addictive goods. A moderate consumption of alcohol does not lead to addiction, but excessive consumption does. We assume that felicity also depends on the strength of habits, synthesized by the stock of habits
z. In line with Becker and Murphy [
8], we assume that this effect is negative. Being a potentially addictive good,
c generates habits over time, which in turn causes felicity to decrease, so that a larger amount of
c needs to be consumed to maintain the same level of felicity.
To keep things simple, we assume that
g does not directly interact with
c and
z in generating felicity, so that the utility function is additively separable and defined by
. In line with Cawley and Ruhm [
42], addiction is characterized by enforcement—the marginal utility of current consumption increases with the stock of habits (
). We denote the derivative of a generic function
with respect to some variable
x (
) as
The second derivative (
) is denoted as
—tolerance–and the stock of habits lowers utility (
)—and withdrawal—as the current consumption of the addictive good increases utility (
). Other usual regularity conditions impose
,
,
, and
.
The endogenous specification of the discount rate closely follows Epstein and Shi [
31]. In each instant in time, the discount rate is defined by a discount function
which depends on the stock of habits. The discount function
is a twice continuously differentiable function assumed to be strictly positive (
), strictly increasing (
) and concave (
).
Regarding the habit formation process, the literature proposes two approaches, which we call the partial adjustment approach and adaptive approach. We propose these names to remind, respectively, the partial adjustment model and the adaptive expectation model, which have evident similarities with the habits specifications under discussion. The partial adjustment approach consists of considering the habit formation as an investment process. The stock of habits
z accumulates as if it was capital. Investment is represented by current consumption and the stock of habits depreciates at rate
. As in Becker and Murphy [
8], “the rate of habits depreciation
measures the exogenous rate of disappearance of the physical and mental effects of past consumption of
c.”The dynamic equation which describes this process is
In the adaptive approach, the habits accumulation process is due to the difference between the current consumption of the potentially addictive good
c and the current stock of habits
z, through the rate of habits adjustment
. If current consumption
c exceeds the current stock of habits
z, there will be a formation of further habits, at a rate
; otherwise, the stock of habits decreases. The dynamic equation which describes this approach is
In this work, we follow the partial adjustment approach, for consistency with the RA model. However, using the adaptive approach, as in [
31], would not substantially alter the results.
In the economic literature there is an open discussion if the endogenous discount rate should be considered as increasing or decreasing with respect to consumption
c. Koopmans [
43] suggests a decreasing rate of impatience, while Lucas and Stokey [
44] observe that an increasing rate of impatience is necessary to obtain a single, stable, non-degenerate equilibrium point into wealth distribution in a deterministic horizon with a finite number of agents. According to Blanchard and Fisher [
10], the assumption of an increasing rate of impatience is difficult to defend ex ante. On the other side, Epstein [
45,
46] argues that the more a person consumes, the more she discounts in the future. In line with Epstein, we assume that the endogenous discount rate,
, is strictly increasing with respect to the stock of habits, and hence consumption. This assumption does not imply an always increasing discount rate. The discount rate may also decrease if, for instance, the consumer quits consuming the addictive good. In this case, the stock of habits
z smoothly depreciates at rate
, and consequently also lowers the discount rate.
This condition is necessary to ensure the stability of the long-run optimal consumption plan, because it guarantees that consumption levels in different dates are substitutes. In this case, as wealth and consumption rise, the marginal private return to further savings, which depends on the marginal utility of future consumption, falls. A discount rate decreasing in consumption would cause consumption in different dates to be complements, so that an increase in present consumption rises the marginal utility of future consumption.
The implication of a discount rate strictly increasing with respect to consumption is that a higher consumption level at time t increases the discount rate applied to utilities after t. An increase in current consumption in t induces an increase in the rate of time preference: the consumer’s desire to anticipate the effects of future consumption is picked up by a larger consumption in . An increase in current consumption in rises the stock of habits in , inducing a further increase in the discount rate: the larger the previous consumption is, the larger the habit and the larger the current consumption must be to deliver the same utility. Moreover, an increase in the discount rate rises the degree of adjacent complementarity and hence strengthens the commitment to all habits.
Defining a cumulative subjective discount rate Θ as (We use this formulation because it allows for some mathematical simplifications. This specification is equivalent to using the integral, from period 0 to
t, of the discount function. In fact,
)
the consumer problem can be written as
where
and | are the instantaneous utility functions; |
g | is current consumption of the non-addictive good; |
c | is current consumption of the addictive good; |
z | is the stock of habits; |
| is the cumulative discount rate; |
a | is real wealth; |
p | is the relative price of the addictive good |
| (price of is normalized to 1); |
r | is the rate of return to savings; |
| is the rate of habits depreciation; |
and the rate of habit depreciation
is assumed to be bounded between 0 and 1.
In (
3), consumption goods
g and
c are control variables, while real wealth
a, the stock of habits
z and the cumulative subjective discount rate
are state variables.
This specification nests several models of habit formation and addiction (for a sketch of the proof, see
Appendix A), such as:
The Ramsey model [
1,
43,
47], characterized by a constant rate of time preference;
The rational addiction model of Becker and Murphy [
8], characterized by a constant discount rate and additive utility function with habits;
The multiplicative habits model proposed by Carroll [
48], characterized by a constant discount rate and a multiplicative utility function with habits;
The Uzawa [
49] or Obstfeld [
50] models, that assume an endogenous discount rate depending on current consumption, with no explicit modeling of habit formation;
The Epstein and Shi [
31] model, characterized by an endogenous discount rate depending on habits.
The optimization problem is solved according to the maximum principle of Pontriagin (optimal control theory), and the present value Hamiltonian function (note that the relation between the current value and present value Hamiltonian is
where
is the current value Hamiltonian,
H is the present value Hamiltonian and
r is the rate of return to savings) is
where
and
are the discounted costate variables.
The first-order necessary conditions for an interior solution are (to save notation, partial derivatives are denoted with a subscript. Thus, for example,
)
and
It is convenient to re-scale the co-state variables in order to eliminate
. Letting
,
and
, the first-order necessary conditions become
and, given that
, and
the remaining conditions are
Differentiating Equations (
4) and (
5) with respect to time, we obtain
and using Equation (
10) with (
6), we obtain the following Euler equation for
Note that, by equating Equations (
4) and (
5), it is possible to obtain a simple analytical expression for
which is
The differential Equation (
7) gives a continuous time specification of the recursive structure of consumer preferences for every feasible consumption path. If we solve the differential Equation (
7) (recall that the solution for a differential equation with no constant coefficients as
is
. The value that the solution approaches is referred to as the steady state so the limit for
of the solution is
.), we obtain
which is the present value of future utilities at time
t which corresponds to the shadow price of the accumulated impatience rate
.
By equating the two equations for
, (
6) and (
9), we can solve for
and find the Euler equation
where, by means of Equations (
1), (
8), (
5) and (
12), the rate of time preference for good
c is
and the elasticity of intertemporal substitution is
In summary, the resulting Euler equations for
g and
c are
In light of Equation (
13), the shadow price of the cumulative discount rate
may be seen as an index of impatience. In fact, since
is the weighted sum of all future utilities from time
t, the higher future utilities are, the higher the patience is, since the agent is willing to wait for the realization of their desires of consumption. For this reason, from now on, we will refer to
as the
rate of impatience.This formulation, which involves the non-separability of preferences, is suggested by Ryder and Heal [
41], who introduced the notion of adjacent complementarity. Adjacent complementarity occurs when the past consumption of a good raises the marginal utility of present consumption. An increase or decrease in consumption at
can induce a variation of the marginal rate of substitution of current and future consumption at
. Complementarity is represented by a utility function that depends on both current consumption
c and the stock of habits
z, which is a weighted average of past consumption levels. Weights decline exponentially in the past at the exogenous depreciation rate
, which can be interpreted as a measure of permanence of physical and mental effects of past consumption on present consumption
c. As
becomes larger, less weight is given to past consumption in determining
z.
The two Euler equations in (
15) and (
16) implicitly define the rates of time preference and the elasticities of inter-temporal substitution for the two goods. The rate of time preference for
g is simply equal to the discount function
, as for any standard inter-temporal model of consumption. However, the discount rate is not constant and depends on the stock of habits
z, implying that the rate of time preference for
g depends on the past consumption of
c. The inter-temporal elasticity of substitution for
g is completely defined by the preference structure, and in particular, by the shape of
. The rate of time preference for
c, shown in Equation (
14), embeds:
Memory of past events through the stock of habits z and the rate of habits depreciation ;
Perception of present events by the current consumption levels of c and g;
The anticipation of future events by the present-value of future utilities
Consumer behavior is non-separable over time, revealing complementarity and time inconsistency. The present consumption of the potentially addictive good c and future consumption (represented by ) depends on the past consumption of the addictive good through the rate of habits depreciation , and does not need to be valued equally along a locally constant consumption path. The rate of time preference expresses the propensity that a person reveals towards future utility in determining current choices. This depends on the ability to anticipate the benefits of future consumption and the related physical and mental consequences of present and past consumption effects.
The Euler equation for c is different from canonical expressions mainly because it endows the complementarity between past consumption z, current consumption c and g, and future consumption by the rate of impatience through the endogenous rate of time preference and the elasticity of inter-temporal substitution.
3. The Steady State
Given the Euler Equations (
15) and (
16), the system of differential equations which describes the dynamic behavior of the ERA model is
Defining the steady state as an optimal solution to program (
3) in which the stock of habits
z and wealth
a do not change over time, (
,
), a steady state for system (
17)–(
21) exists and lies on the optimal consumption path. This definition comes from the consideration that it is physically implausible that a substance continues to generate habits to the infinite, otherwise, to keep a certain level of utility, the consumer should continuously increase the consumption of the addictive good. This, in a world of finite resources, is rather unlikely, although not impossible. If an equilibrium exists where
, it brings a number of consequences. First, if the stock of habits does not grow, then the consumption of the addictive good
c also does not grow at the steady state. In fact, from Equation (
17), we find that
(with the superscript
, we denote the variables at their steady state value). Second, if
z is constant,
is also constant because of Equation (
26), and thus
is constant and equal to
r, as implied by the cumulative discount rate motion equation and
.
These considerations lead to a first important consequence. At the steady state, the stock of habits is uniquely determined by the discount function and the interest rate r. Hence, the steady state consumption level , which is determined by the stock of habits and the rate of habits adjustment , also depends on the shape of the discount factor and the interest rate, but not on the structure of preferences.
The fact that
and
are constant implies that utility
is constant at the steady state, and so must be the index of impatience
, which is the discounted value of the future streams of utilities. From Equation (
18), considering that
, we obtain
The fact that, at the steady state,
has another important consequence:
g does not grow at the steady state. A condition for this to happen is that marginal utility
is constant, whereby Equation (
5) implies that
q is also constant, i.e.,
. From condition (
4), we find that
which in turn, by condition (
5), defines the steady state value of
.
Finally, assuming that, at the steady state, real wealth
a does not grow, from Equation (
21), we obtain the steady state value of wealth
To date, we determined the steady state levels of the variable involved in the dynamic system of the ERA model. To verify the existence of the steady state and that the steady state is an optimum solution to the maximization program, we can use Equation (
19). We know that in the steady state,
must be zero. For this to be verified, the term in square brackets must also be zero, and since
, we can concentrate on the term
Considering that, from (
17),
is 0 and substituting Equation (
12), we obtain
and using Equation (
12), Equation (
8) can be set to 0, and
which implies that
Then, the steady state lies on the optimal solution to the ERA model. The steady state defined as (
,
) is unique. To verify the uniqueness of the steady state, we consider the case in which
is still 0, but assume that this hypothetical steady state level of
is such that
, i.e.,
can be any value different from
. This implies that at the steady state, the cumulative discount rate
constantly changes because
is constant. By Equation (
20),
g also constantly changes at the steady state.
Since
z is constant, by Equation (
17), the consumption of addictive good
c must also be constant. The fact that
g grows indefinitely at the steady state while
c does not suggests that this hypothetical steady state may not satisfy all optimality conditions, so we need to verify that
even when
. The steady state levels of
and
are not constant and will change at a constant rate proportional to
. From (
19), the necessary condition for
is
but since
and
, using Equation (
11), the expression can be written as
which, by Equation (
20), is 0 if and only if
Note that the right-hand side of Equation (
24) is constant, which would either imply that both
and
were constant and take some specific values, which is not the case, or that
which, again, is not the case. This implies that this alternative steady state is infeasible. It follows that, if we define the steady state as an optimal solution to the optimization problem (
3) in which the stock of habits is stationary (
), the unique feasible solution implies that the discount rate
has to be equal to the interest rate
r. Other steady states may exist when the definition of the steady state is different. However, at least we proved the existence and uniqueness of at least one steady state. This, in turns, defines the steady state value of the stock of habits
and the consumption of the addictive good
purely as a function of the habit-generating process (through
) and the discount function shape. This sounds intuitively reasonable, since a steady level of the stock of habits could be reached only if the process that generate habits stabilizes and does not affect preferences for the addictive good.
4. Local Stability Properties
The system described by (
17)–(
21) generates a five-dimensional hyperplane which is divided into a number of regions (
, to be precise), each of which is characterized by a force leading the system toward the steady state or away from it.
We can reduce the dimension of the system, noting that
a is only present in the wealth accumulation equation. Since
a does not influence other dynamic equations, we can consider the accumulation equation of wealth (
21) as exogenous to the system and drop it from the stability analysis. Because
a is recursively determined for each value of
c and
g by Equation (
21), when considered in the system, it would imply an eigenvalue equal to
r, which would not influence the stability properties of the system. The new system generates a four-dimensional hyperplane, which allows for an easier mathematical analysis.
Rewriting Equation (
19) as
where
Equation (
20) as
where
and making each dynamic equation equal to 0 at the steady state (and since every function is intended to be evaluated at the steady state), the system can be written as
To analyze local stability, we first drop the (
) equation, since the corresponding eigenvalue is trivial and equal to
r, and then we linearize the system by a first-order Taylor expansion, obtaining
where
,
,
,
and
. The resulting Jacobian matrix
J is
| z | | c | g |
z | | 0 | 1 | 0 |
| | r | | |
c | C | D | A | B |
g | | 0 | 0 | E |
The stability properties of system (
25) are determined by the eigenvalues of matrix
J, through its characteristic polynomial
where
The stability of the system is determined by the number of stable and unstable eigenvalues for matrix J, and in particular, if all eigenvalues are real and positive, the steady state is an unstable point; if they are all real and negative, the steady state is a stable point; if they are all real but some are positive and some negative, the steady state is an saddle node; if some of them are complex, the steady state is a saddle focus. The saddle node is the multidimensional equivalent of the saddle point. The system will converge towards the steady state only if the starting point lies on the stable eigenspace. In the saddle, focus convergence (if the real part of the eigenvalue is negative) or divergence (if the real part of the eigenvalue is positive) will be cyclical.
The roots of the characteristic polynomial correspond to the eigenvalues of matrix J. We know that an analytical solution exists for the roots of polynomials of the fourth degree, and that it is given by the Ferrari method. However, the analytical solution and the relative stability analysis are by far too complex to be meaningful, even if we decided to use the Routh–Hurwitz stability criterion.
It is clear that the existence and stability of the steady state depends on the parameters values. For example, if the discount function is always greater than r, a steady state is not possible because of the increasing property of . For this reason, to characterize the stability properties of the steady state, we conduct a numerical simulation. The strategy of the simulation is as follows: first, we choose a functional form for the utility functions and the discount function; second, we select plausible numerical values for the parameters of the system and compute the steady state; third, we let the parameters vary and calculate the numerical value of the eigenvalues of matrix J.
In the choice of the functional form of the utility function, we depart from Becker and Murphy [
8]. In fact, a quadratic utility function is characterized by a point of maximum and the non-satiation property, commonly assumed in consumption studies, would not hold. The model could still reach the steady state, if lower than the global maximum of utility, but this would imply an additional set of restrictions on the parameters which we prefer to avoid. Instead, we use logarithmic utility functions, and, for simplicity, a linear specification for the discount function:
The discount function parameters and are both chosen to be . Utility parameters and are set to , while is . The relative price of the addictive good p is fixed to , the rate of returns to savings r is equal to and the rate of habits depreciation is . To evaluate the stability of the steady state, we allow each parameter to vary.
The results of the simulations (The table with all eigenvalues is available upon request, together with the program that generates the results.) indicate that the system is a saddle node in the neighbor of the parameters values. The eigenvalues are all real—three are positive and one is negative. This means that the system monotonically approaches the steady state, provided that the starting point lies on the stable eigenfold. This situation is depicted in
Figure 1. This phaseplot for the variable current consumption of the addictive good
and the current value of future utilities
represents in red the stable eigenfold and black arrows represent the forces driving the system away from the steady state if out of the stable eigenfold.
The stability of the system with respect to the variations of the parameters is less trivial. For example, the steady state keeps being a saddle node for values of in the range . For values smaller than or greater than and smaller than , the equilibrium is an unstable focus, with a cyclical behavior that diverges from equilibrium. Then, it becomes a stable focus and for values greater than , again a saddle node. Aside from the numerical values, that depend on the actual choices for the parameters and the consequent steady state’s values, this means that the rate of habit depreciation is a key parameter for the behavioral interpretation of the model, and a fundamental ingredient to analyze heterogeneity in habit formation. Normal people with reasonable rates of habit depreciation can find an equilibrium in the consumption of the addictive substance and enjoy a moderate consumption. People with a small may be more likely to develop an addiction and for them, a stable equilibrium might not be possible. The cyclical fluctuations of the equilibrium for values in between 0.25 and 0.55 could be interpreted as binge drinking, while a very large rate of habits depreciation may not result in problematic behaviors.
Other parameters that could possibly be of interest from the behavioral point of view are the discount function parameters. In particular, determines the degree of dependence of the discount rate on the stock habits z. The discount rate corresponds to the rate of time preference of g and is one of the main determinants of that of c. We observe that for very small values of the parameter, the system becomes an unstable focus. This seems reasonable since one of the equations that must be verified at the steady state is . If does not grow sufficiently quickly with z, then the equilibrium could be unstable. On the other side, for any value larger than the chosen one, the system behaves as a saddle node.
Furthermore, the disutility caused by the stock of habits, through the utility parameter , plays an important role. We observe that the equilibrium becomes an unstable focus with a cyclical path if the disutility of habits is small (below ). This may indicate that people with an insufficient perception of the health issues related to the consumption of alcohol, for example, may fall into binge drinking. For larger values, the equilibrium is always a saddle node. Other parameters’ variations do not show unexpected results, and in general, result in a saddle node equilibrium for wide intervals.
A similar result was observed using isoelastic utility functions, while, as mentioned above, the case of a quadratic utility function is slightly more complicated. Our simulations suggest that the model behaves as in the previous cases in several situations, however, depending on the parameters of the utility function, the steady state may be infeasible, since it can lie above the global maximum of the utility functions.
5. Heterogeneity in the Habits Formation Process
Heterogeneity is a necessary feature for almost any work on consumer demand. In fact, it is a common opinion that each individual has their own preference structure and that a correct demand analysis should take this into account. When considering a dynamic context, heterogeneous preferences can be captured through the rate of time preference, as suggested by Fuchs [
51] and Lawrance [
52]. Following this path, in this section, we concentrate on introducing heterogeneity through the rate of time preference for
c.
The definition of an endogenous rate of time preference permits separating the effects of heterogeneity from a generic habit effect. This is achieved through the definition of a rate of habits depreciation
as a function of personal characteristics
d. This implies that each individual develops habits and eventually addiction with a different consumption–habits path and achieve a different steady state. In particular, in
Section 3, we have seen how the steady state level of the stock of habit
only depends on the discount function
and on the interest rate
r. As a consequence, the steady state level of consumption of the addictive good
, which is equal to
, depends on the individual characteristics of the consumer
d. Moreover, these characteristics may not be constant over time, as for age, which, as suggested by Bishai [
13], may influence the rate of time preference.
An example can help one understand our hypotheses. Define two categories of agents which constitute society, e.g., myopic (m) and forward-looking (f) (any resemblance between the symbols used to indicate the two kind of agents and those used to indicate males and females is purely coincidental). An agent is myopic if they are characterized by strong a preference towards actual consumption, with a large degree of impatience. The impatience is caused by a large discount rate , which in turn is caused by a large stock of habits z. A large stock of habits may be caused by a rapid growth of z induced by a small rate of habits depreciation .
An agent is forward-looking if they are characterized by a preference towards future consumption, with a small degree of impatience. The discount rate is small, due to a small stock of habits z, which remains small thanks to a relatively large rate of habits depreciation .
The likelihood that they reveal addiction is picked up by the rate of habit depreciation
and their preferences towards alcohol consumption and habits. The larger is
, and less weight is given to past habits in determining current habits
z. Therefore, even if preferences towards alcohol are strong, the large value of
and the strong adversity against health injuries caused by habits, can prevent falling into addiction. Two simulations with a plausible value of
to ensure a saddle node steady state are depicted in
Figure 2. Here, the forward-looking agent is characterized by a larger
, which ensures a lower level of consumption of the addictive good and achieves the steady state later in life.
Regarding the shape of the rate of habit depreciation , to ensure the needed regularity properties of the maximization problem, it is necessary to bound it between 0 and 1. In fact, allowing for a negative rate of habit depreciation would rule out the existence of a steady state since habits would arise and continuously accumulate over time even if the consumption of the addictive good happened just once. On the other hand, for values of greater than 1, the stock of habits z would respond more than proportionally to a change in consumption c, allowing for a negative stock of habits z which would have no sense from a behavioral point of view. The limiting case of implies that the stock of habits fully depreciates each period, thus excluding the possibility that habits accumulate and addiction ensues.
Other characteristics of , which are not relevant for obtaining an interior solution to the optimization problem, such as slope and concavity, may have important implications for the behavioral analysis.
For example, one appealing feature could be an inverse-U-shape with respect to age, which would imply that an individual is more likely to develop addiction when young or elder, rather than when they are middle-aged. Another example is provided by recent medical evidence that suggests that men and women have different a predisposition to addiction. For example, a research conducted by Mancinelli et al. [
53] links the stronger effects produced by alcohol abuse, with a scarce presence of alcohol dehydrogenase (ADH), which is an enzyme involved in the metabolization of alcohol. It is observed that, in general, women have smaller quantities of ADH than men. This may lead to a physiological differences in the process that generates addiction, in which women may be more exposed to the risk of become addicted. However, even if women need less alcohol than men to become addicted to alcohol, they may have different preferences with respect to both alcohol, which may be less valued, and to habits, regarded as the negative health effects that habits may bring, which could be greater valued (in negative). The result may be that there is a smaller probability that women become addicted, since, even if women are more likely to become addicted for a given amount of consumed alcohol, they may be much less inclined to consume than men.
6. Comparative Dynamics
The comparative dynamic analysis investigates how a variation of a variable or parameter of interest, such as c, z, and , affects the endogenous rate of time preference in the neighborhood of the steady state.
The rate of time preference
is strictly decreasing with respect to current consumption
c. In the neighborhood of the steady state
, the partial derivative of the rate of time preference with respect to current consumption is thus
The rate of time preference is decreasing with respect to an increase in the current consumption of the possibly addictive good
. This implies that the consumer is more concerned with immediate consumption rather than future consumption. In such cases, the need to “
save against a rainy day” becomes less urgent and there is higher willingness to consume today. In other words, a rise in current consumption causes an increase in the level of impatience. The rate of time preference
strictly decreases with respect to the stock of habits
z. In the neighborhood of the steady state
, the partial derivative of the rate of time preference with respect to the stock of habits is impatience
The rate of time preference
is strictly decreasing with respect to the rate of impatience
. The partial derivative of the rate of time preference with respect to the rate of impatience is
An increase in the rate of impatience
indicates that the consumer is less impatient, giving more weight to future consumption. Hence, the rate of time preference is reduced and the consumption of the addictive good grows at a smaller rate. The rate of time preference
is strictly decreasing with respect to the rate of the habit depreciation
if
. In the neighborhood of the steady state
, the partial derivative of the rate of time preference with respect to the rate of habits depreciation is thus
An increase in the rate of habits depreciation implies that the rate of time preference declines, i.e., a decrease in the habit effects of the possibly addictive good tends to reduce the growth of consumption of c. This is true provided that the marginal utility of the possibly addictive good is smaller than the marginal utility of the non-addictive good, . Hence, unless preferences are biased towards the addictive good, an increase in the rate of habit depreciation lowers the rate of time preference, reducing the growth path of c.
The analysis of the variation of the rate of time preference with respect to a variation of the stock of habits is more complex, with no meaningful results.
The analytical and behavioral properties of the rate of time preference allow us to describe the dynamic evolution of an agent from a condition of potential habit to a state of addiction.
Reconsider the case of the myopic and forward-looking agent introduced above. The degree of impatience of the myopic agent is generally higher (lower ) than the degree of the forward-looking and so will be the degree of habits (). The propensity to exchange current for future consumption becomes less and less considerable. The myopic agent reveals an increasing impatience since their stock of habits is larger.
The subjective rate of time preference of the myopic agent encloses reinforcement and tolerance, two behavioral factors that are closely related to the concept of adjacent complementarity. Reinforcement means that, to obtain the same level of utility, the consumption of the addictive good has to increase when current consumption increases, while tolerance means that given levels of consumption are less satisfying when past consumption has been greater.
The analysis reveals a patience-dependence trade-off. A patient person tends to have a lower stock of habits than an impatient person, since the desire to anticipate future consumption is lower. It is not surprising that addiction is more likely for people who discount the future heavily since they pay less attention to the adverse consequences. Becker et al. [
40] suggested that poorer and younger persons discount the future more heavily while Chaloupka [
32] found that less educated persons may have higher rates of time preference. The ability of anticipating the consequences of present and past consumption depends on income, education, rank and degree of awareness of dangers.
In line with Becker and Mulligan [
54], we find that “
… the analysis of endogenous discount rates implies that even fully rational utility-maximizing individuals who become addicted to drugs and other harmful substances or behavior are induced to place less weight on the future, even if the addiction itself does not affect the discount rate.”