We propose a mathematical model to support pricing decisions for products with multiple attributes in competitive markets, considering consumers’ willingness to pay and multiple segments. We represent the demands by modeling the consumer behavior for products and services through a multinomial logit model and then include consumers’ maximum willingness to pay through soft constraints within the demand function. The proposed model is a nonlinear profit maximization problem. We start by describing the consumer behavior model, then the mathematical optimization model, and finally, the solution strategy.
2.1. Consumer Behavior
The consumer behavior or demand function is based on the stochastic discrete choice model called constrained multinomial logit. This model is an econometric tool based on the theory of rationality where each consumer chooses a product that generates the greatest utility, among a finite set of alternatives. In mathematical terms,
is defined as the utility of product
of firm
for consumers in socioeconomic segment
s. That utility is known by the consumer. In random utility models, the probability that the consumer of segment
s chooses product
of firm
is
In these models,
is considered to be known by the consumer, but not by the modeler or data analyst. This utility is represented as the sum of two components:
A deterministic part
is known by the modeler and is a function of the vector of attributes that define the product i, and another random component
represents the error. If it is assumed that the errors
follow an independent and identically distributed (iid) Gumbel, then the probability of choice (
1) has a multinomial logit functional form:
where
and
is the non-purchase utility of consumers in segment
s.
The following formulation is proposed for
:
where
is the marginal rate of substitution of income for price in segment s. In other words, it is the importance that each segment assigns to the price of the product.
is the attractiveness of firm k for consumers in segment s. That is the preference with respect to each firm or brand.
is the total attractiveness of product i of firm k for consumers in segment s. This attractiveness is obtained as a combination of the valuation of the attributes of each product in each firm.
An assumption of (
4) is that consumers follow a compensatory behavior, that is, a level of utility of a product can be fixed, increasing its price and attractiveness simultaneously. However, in many contexts, consumers have bounds or thresholds with respect to price (or other attributes that define the product), making this compensatory assumption not feasible. In particular, consumers in segment
s cannot spend more than their maximum willingness to pay
. Hence, a feasible product for segment
s must meet
, which is the hard version of the constraint. However, these constraints are considered applying the CMNL model [
31], where the individuals impose thresholds to the attributes directly in the utility function, by penalty specifications in the binomial logit form to greatly reduce those products’ utility that do not meet any constraints. Ref. [
31] assumed that deterministic utility can be separated into a compensating term (
) and a non-compensating term (
), indicating the feasibility of product
i for
s:
where
is a cut-off or penalty function imposed by the consumer segment
s on the price of product
i. This penalty by means of a logarithm function achieves a smooth transition between the compensatory feasible space (
) and the infeasible non-compensatory space (
), allowing budget constraints to be subtly breached by the consumer.
Finally, the probability of a consumer in segment
choosing the product
of firm
,
with cut-off functions is
In particular, in (
6), applying the CMNL approach, we model a buying probability
that penalizes the consumer’s utility, which depends on the price
of the product
i of the firm
k.
where
is the inverse of the variance of consumers concerning the reservation price. The parameter
in (
8) is inversely proportional to
, and depends on
that is the proportion of the population that violates the price constraint. Considering that
, then each time the prices surpass the
,
will lower the choice probability for that product. At the limit, if
,
, and if
,
.
An alternative is to understand as soft bounds that replace the constraints that impose the price as being strictly lower than or equal to the willingness to pay (), which precludes considering prices above the willingness to pay. On the other hand, through , one can consider prices slightly higher than the willingness to pay.
The cutoffs emulate the hard constraint through their ’s’ shape. The willingness to pay splits the cutoff, the higher values of are when prices are lower than , and decreases as the price increases. Furthermore, the higher the sigma value, the more negative the slope of the ’s’.
The cutoff functions act on each exponential term by lowering the relative relevance of the alternative as the price surpasses the willingness to pay. Let us notice that it does not eliminate the option when the price is ’slightly’ above the willingness to pay, but the probability of choosing that bundle is lower than in the absence of the cutoff.
When the prices do not surpass the willingness to pay, the cutoffs values are almost one; also, when the modeler does not consider the cutoffs, it is equivalent to . The probabilities take the same (or approximately) form as the classic multinomial logit in both cases.
When Martínez et al. [
31] proposed constrained MNL (CMNL), they explained the model construction and its interpretation for endogenous and exogenous constraints. They indicated theoretical insights regarding parameter estimation but did not show details. On the other hand, Castro et al. [
68] described the estimation method specialized in CMNL; through the maximum likelihood method, they calculated the first-order optimality conditions and discussed the possibility of identifying the cutoffs’ parameters for the possible two cases: endogenous and exogenous, indicated by Martínez et al. [
31]. We explain both in the following paragraphs. The numerators in the expression are the relevant parts of the probability for estimating the parameters of the consumers’ utility. In the MNL context, it is equivalent to consider that the cutoffs are equal to one, so the numerator is
. In a CMNL, the numerator includes the cutoff and is
. We can recognize a modified deterministic utility for the CMNL case as follows:
The last term in the logarithm is quite relevant because it is added to a traditional MNL. The modeler needs to estimate more parameters, as usual, through a maximum likelihood scheme. Considering the mathematical properties of likelihood functions, the estimators obtained under general assumptions are consistent, asymptotically normal, and efficient.
The modeler can classify the bound of cutoff (in our case, this bound is the willingness to pay) as exogenous or known by the modeler, or endogenous, meaning that the bound is a new parameter to estimate. Please note that our research is independent of this aspect, which means the bound could be exogenous or endogenous. If the modeler has the data, then she/he can estimate the CMNL and then calculate prices in competition with our approach. Interested modelers can follow guidelines from [
31,
68], estimate the parameters of a CMNL, and then use our methodology to estimate prices.
When the cutoff is exogenous, the bound is known by the modeler. One can calculate the proportion
from the data (in general, from surveys or data provided for a company) not accomplishing the willingness to pay (bundles bought by consumers with a price higher than the
) and obtain
. To do this, we need
. Hence, the modeler must estimate the parameters of a traditional MNL, and the only additional required CMNL parameter is
. In an endogenous approach, the modeler does not explicitly know the bound. Therefore, one requires estimating the traditional MNL parameters and three additional sets of parameters specific for CMNL, in this case, the willingness to pay,
, and
[
68].
Regarding the possibility of identifying the base attractiveness of a firm, it is not possible to estimate it but within a point of reference, fixing one of the values to one, for example, because we are only interested in the difference between firms. Consumers prefer some companies, although the offered bundle is the same in terms of its composition and price. It could be due to the consumers’ perception of quality or personal preferences the modeler cannot understand. Nevertheless, it is impossible to estimate this parameter without a reference point unless considering a reference value. On the other hand, the attractiveness of the services within the bundles (telecom services) will depend on if the service is included in the bundle and the ’offered quantity’, in this case, velocity for internet services, minutes in telephony, and channel availability in TV [
63].
Interested readers will be able to find more details on
,
and, in general, the CMNL formulation and cutoffs in Martínez et al. [
31], Castro et al. [
68], and Pérez et al. [
15].
2.3. Equivalent Nonlinear Problem and Our PSO Approach
We first calculate the optimality first-order conditions for each firm maximization problem , and then we propose a PSO approach, in which we solve an equivalent non-linear optimization problem approach to determine the prices that accomplishes an equilibrium.
The derivative of the profit of the firm
to the product price
, with
takes the following form:
We define
in (
13), and show the derivative for the market share of
in (
14). See
Appendix A for details on determining Equation (
14).
When every firm
,
reaches the optimality conditions, there is an equilibrium for the game defined by (
11), and it is our aim to find the vector of prices accomplishing these conditions. We update the expression for
in Equation (
15).
We write this expression as a fixed point on prices in (
16).
Note that when there is only one segment, and for
, the derivative is always positive, meaning that increasing the price of product
i will increase all the other market shares. When
, we have that a price increasing logically reduces the market share of product
i. If we impose the first-order optimality conditions, we have
This particular case in Equation (
17) corresponds to a nonlinear equation system in terms of products’ prices. Note that
,
,
, and if we assume that there are no cross-subsidies among firm’s products, then
,
,
and
, and
. This simplification of our model is an extension of the case with one product, one segment and without competition (one firm) reported in [
15], where
. Our general expression in (
16) extends the case reported in [
21] by including competition, and
with the CMNL.
Any price vector
accomplishing
is an equilibrium for the game defined by (
11). Note that even a simplified version of our problem without competition (one firm) may have a non-convex profit function on
(for details see
Section 3.1). Then, to deal with our problem, in (
18), we propose an equivalent nonlinear optimization problem on
to find an equilibrium of (
11). A price vector
is an equilibrium for (
11) when the objective function
z for (
18) is zero.
Pérez et al. [
15] applied a bisection approach to solve the pricing and composition problem; therefore, they may be able to handle the problem to optimality because they considered a simplified version of the problem, with a concave profit. Li et al. [
21] did not consider competition, nor consumers’ willingness to pay; they proposed bisection search and gradient descent to solve the problem. We propose a heuristic approach based on PSO [
27,
72] to solve (
18), and hence, finding an equilibrium for (
11).
Considering that we are proposing a heuristic, we can only check if the solution for (
18) is zero, equivalent to fulfilling the first-order conditions of optimality for (
11). We run various PSO trials, implementing multiple combinations of hyper-parameters and different swarm sizes; this allows us to find one vector
accomplishing the nonlinear system of equations in (
17).
To solve (
18), let us define
price vectors, where
is the swarm individual,
is the velocity in the iteration
m of the PSO algorithm,
is the velocity inertia weight [
73], and
and
are constants associated with the particle best vector
for
w and general best
[
74]. Finally, we have the random parameters
and
, both
. Following PSO, in Equations (
19) and (
20), we update the price vectors
through velocity
balancing among
,
and
. The algorithm stops when
n consecutive solutions have differences smaller than the threshold
.
It is relevant to point out that when the optimal value of the objective function in (
18) is zero, the optimal price vector accomplishes the first-order optimality conditions. Because for every pair
the term of the sum is zero, every equation of the set of optimality conditions for the problem (
11) holds. Nevertheless, not every price vector accomplishing the first-order optimality conditions will be an equilibrium. To improve the search for equilibria (although we cannot ensure finding one), we include additional requirements for the PSO heuristic, which reports whether the Hessian of the problem in the expression (
18) evaluated in the price vector found is negative definite or not.
The procedure is:
If PSO finds a solution such that (the modeler defines as a positive small number), then the program reports the best solution (the smaller one) and indicates that this solution does not accomplish the first-order optimality conditions.
If PSO finds one or more solution vectors, then we evaluate the second-order conditions in these price points as well as the following:
- −
We report this vector as the solution if any price vector has a negative-definite Hessian.
- −
If no vector has a negative-definite Hessian, the methodology reports that there was no equilibrium and the found solutions.
Our results indicate that PSO is quite efficient in finding price vectors fulfilling (
16). We show details on PSO convergence; see
Appendix C.