Next Article in Journal
A Pretest Estimator for the Two-Way Error Component Model
Previous Article in Journal
Public Debt and Economic Growth: A Panel Kink Regression Latent Group Structures Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Biases in the Maximum Simulated Likelihood Estimation of the Mixed Logit Model

Department of Economics, University of South Florida, Tampa, FL 33620, USA
*
Author to whom correspondence should be addressed.
Econometrics 2024, 12(2), 8; https://doi.org/10.3390/econometrics12020008
Submission received: 10 February 2024 / Revised: 11 March 2024 / Accepted: 23 March 2024 / Published: 27 March 2024

Abstract

:
In a recent study, it was demonstrated that the maximum simulated likelihood (MSL) estimator produces significant biases when applied to the bivariate normal and bivariate Poisson-lognormal models. The study’s conclusion suggests that similar biases could be present in other models generated by correlated bivariate normal structures, which include several commonly used specifications of the mixed logit (MIXL) models. This paper conducts a simulation study analyzing the MSL estimation of the error components (EC) MIXL. We find that the MSL estimator produces significant biases in the estimated parameters. The problem becomes worse when the true value of the variance parameter is small and the correlation parameter is large in magnitude. In some cases, the biases in the estimated marginal effects are as large as 12% of the true values. These biases are largely invariant to increases in the number of Halton draws.

1. Introduction

This paper examines the maximum simulated likelihood (MSL) estimator of the error components (EC) mixed logit (MIXL) model. The MIXL has been preferred by applied economists due to its flexible latent structure, which allows for various specifications of behavioral patterns. Since the model does not have a closed form, its estimation relies on simulation-based methods, specifically the MSL estimator, which has been the dominant estimation strategy for more than 20 years. However, Jumamyradov and Munkin (2021) showed that the MSL estimator produces significant biases when applied to the bivariate normal and bivariate Poisson-lognormal models. Their conclusion is that similar biases could be present in other models generated by correlated bivariate normal structures, which include the most commonly used specifications of the MIXL models. Therefore, further analysis of the MSL estimator in the context of the MIXL model is necessary.
The multinomial logit (MNL) model was introduced by McFadden (1974). It has a closed-form solution due to two convenient, however, restrictive, assumptions. First, the MNL model assumes that the error terms are independently and identically distributed (i.i.d.) as a type 1 extreme value (EV1) across the individuals and alternatives. As a result, the MNL model suffers from the independence from irrelevant alternatives (IIA) property (Debreu 1960), which in the literature has been illustrated by the “red-bus, blue-bus” example (Quandt 1970). Second, the MNL model does not allow for the unobserved variation in individual tastes (i.e., taste heterogeneity) in the population, meaning that the coefficients, associated with alternative-specific variables and observable alternative attributes that vary among individuals, are fixed. Although the MNL model has become the ”workhorse” in discrete choice analysis (Hensher and Greene 2003), its inconsistencies with realistic behavioral patterns have led researchers to look for more flexible alternative models. The MIXL model was derived by relaxing these restrictive assumptions (see McFadden 2001).
The first contribution to the development of the MIXL model came with relaxing the assumption of homogeneous parameters. Specifically, Boyd and Mellman (1980) and Cardell and Dunbar (1980) analyzed market demand for automobiles by allowing the consumer taste coefficients associated with the attributes of the alternatives to vary among individuals, in the form of random variables representing random taste heterogeneity (i.e., taste patterns). This specification of the MIXL model is also known as random coefficients, with application examples including Revelt and Train (1998) and Bhat (2000). Revelt and Train (1998) analyzed households’ choices of efficiency levels for refrigerators based on rebates and loans using the panel data MIXL model. Bhat (2000) studied urban work travel mode choices by incorporating observed and unobserved individual characteristics into the panel data MIXL model. Similar to random coefficients, alternative-specific constants (ASCs) may not be homogeneous within a sample, potentially leading to substitution patterns (e.g., red bus–blue bus Quandt 1970). The challenges of accommodating such heterogeneity are well-known in choice modeling (see Hensher et al. 2005).
Next, the i.i.d. assumption of the MNL model was relaxed, allowing for non-independent and non-identical errors, leading to the EC MIXL and generalized mixed logit (GMIXL). The EC specification of the MIXL model assumes that the stochastic portion of the utility consists of two parts, the i.i.d. errors with an EV1 distribution and additional components varying among the alternatives and individuals. This specification induces various correlation structures (i.e., taste and substitution patterns) as well as heteroskedasticity through the nests or cross nests created among the alternatives as a result of shared error components. Brownstone and Train (1998) used this approach to forecast new product penetration rates by allowing for flexible substitution patterns among the alternative sources of fuel for vehicles.
Recent studies related to discrete choice modeling have recognized the necessity of the heterogeneity of the scale parameter (see Louviere et al. 1999, 2002, 2008), which led to another specification of the MIXL model that relaxed the i.i.d. assumption. The scale parameter is directly related to the variance of the EV1 error terms, and is usually restricted to one because it cannot be identified separately from the slope coefficients. However, Fiebig et al. (2010) as well as Greene and Hensher (2010) proposed the generalized mixed logit (GMIXL) model that allows for individual variation in the variance of the EV1 error terms (i.e., scale heterogeneity) along with the unobserved individual heterogeneity of the slope coefficients. Although it has been shown that the GMIXL model performed better than the standard MIXL model (Fiebig et al. 2010; Keane and Wasi 2013), Hess and Train (2017), as well as Hess and Rose (2012), raised concerns about the identifiability of the GMIXL model. It is an open research question as to what additional assumptions need to be imposed to make the GMIXL model estimable.
The flexibility of the MIXL model is achieved by introducing latent variables into the model. However, this leads to the intractability of the choice probabilities, which cannot be evaluated analytically since they do not have a closed form. Therefore, the estimation of the MIXL model relies on a numerical approximation of the choice probabilities through simulation. The MSL estimator was introduced by Lerman and Manski (1980) to replace the intractable choice probabilities of the multinomial probit (MNP) model with simulated probabilities.
A well-known limitation of the MSL estimator is that it is biased when the number of simulations is limited, as is always the case in applications (see Gourieroux and Monfort 1996; Lee 1995; Hajivassiliou et al. 1996; Train 2009). Nevertheless, the estimation of the MIXL model in the literature is based on the MSL estimator, including in studies by Ben-Akiva et al. (1993), Revelt and Train (1998), Bhat (1998), Brownstone and Train (1998), McFadden and Train (2000), and Hess et al. (2005). The usual practice is to use the MSL in combination with Halton draws to reduce the simulation bias. Bhat (2001) showed that 100 Halton draws provide better approximation results than 1000 pseudo-random draws for the mixed logit model. According to Palma et al. (2020), around 93% of over 150 papers indexed in the Research Papers in Economics (RePEc) produced during 2008–2018 used less than 1000 Halton draws in their estimations of the mixed logit model. Furthermore, 72% and 40% of these papers used less than 500 and 250 Halton draws, respectively. Czajkowski and Budziński (2019) found that more than 3000 Halton draws are necessary to achieve a Minimum Tolerance Level of 5%. However, in the RePEc database, only 5.6% of papers used more than 2000 Halton draws (Palma et al. 2020).
Jumamyradov and Munkin (2021) primarily focused their analysis on the estimation of the correlation parameter in the bivariate normal and bivariate Poisson-lognormal models. In this paper, we closely follow their strategy and allow for correlation across the utilities of different alternatives. We also utilize Halton draws and analyze two error components specifications of the MIXL model. The first specification is the MIXL model with correlated slope coefficients and fixed alternative-specific coefficients (ASCs). The second example is the MIXL model with correlated ASCs and fixed slope coefficients. Moreover, for simplicity, we assume that there is only one attribute that varies among the alternatives and individuals. It should be noted that in most specifications of the MIXL model used by practitioners, the correlation parameter is assumed to be zero for simplicity, compromising robustness to the IIA property. However, practitioners are mostly interested in the estimated mean and variance of the random parameters. In this paper, we simulate the data according to the MIXL model and assess the MSL performance based on the difference between the true and estimated parameters. Our findings confirm simulation biases even in cases with zero correlation.
There have been several studies that have compared the MIXL results produced by estimators and software packages. Huber and Train (2001), Regier et al. (2009), Haan et al. (2015), Bastin and Cirillo (2010), and Elshiewy et al. (2017) compared the MSL and Bayesian estimation of the MIXL model. The first three of these studies were based on a single panel dataset. Bastin and Cirillo (2010) estimated the simulation biases in the MSL with respect to the number of draws and sample sizes, without comparing the MSL with the Bayesian estimation. Elshiewy et al. (2017) used cross-sectional and panel data with three empirical and four simulated datasets. Although Elshiewy et al. (2017) found MSL biases in the correlation parameter of the cross-sectional MIXL model, they only tested two values (0.75 and 0.25). We analyze the MSL estimator with respect to an extensive range of values of the correlation parameter and standard deviation, as well as different numbers of Halton draws. To the best of our knowledge, an extensive Monte Carlo simulation study like this has not been conducted before.
The rest of the paper is organized as follows. Section 2 introduces the MSL estimator. Section 3 presents different logit model specifications. Section 4 presents numerical examples using MIXL data simulation and produces MSL estimation results. Section 5 discusses the results.

2. Maximum Simulated Likelihood Estimator

The maximum likelihood (ML) estimator of parameter vector θ can be utilized when f ( y i | x i , θ ) , where the density of dependent variable y i conditional on the vector of independent variables x i , has a closed form, such that
θ ^ N = arg max θ i = 1 N log f ( y i | x i , θ ) ,
where ( y i . x i ) is a set of independent observations for i = 1 , , N . However, the ML is not feasible when f ( y i | x i , θ ) does not have a tractable closed form. This could be because the density is specified only conditionally on latent variables, which cannot be integrated out. Then, the MSL estimator is a possible alternative, which we define following Gourieroux and Monfort (1990, 1996). Suppose f ~ ( y i , x i , u , θ ) is an unbiased simulator of the conditional density f ( y i | x i , θ ) , such that
f y i x i , θ = E u [ f ~ ( y i , x i , u , θ ) | y i , x i ]
where the distribution of u is known and independent of y i and x i . Then, the MSL estimator of θ is defined as
θ ^ S N = arg max θ i = 1 N log 1 S f ~ ( y i , x i , u i s , θ )
where u i s ( s = 1 , , S ) are drawn independently for each individual i from the distribution u i . The MSL estimator is obtained by replacing the intractable conditional p.d.f. f ( y i | x i , θ ) with its unbiased approximation based on the simulator f ~ ( y i , x i , u , θ ) . However, although f ~ ( y i , x i , u , θ ) is an unbiased simulator of f ( y i | x i , θ ) , its log transformation log f ~ ( y i , x i , u , θ ) is not an unbiased simulator of log f ( y i | x i , θ ) , which results in simulation biases in the MSL estimator.
The asymptotic properties of the MSL estimator are determined by the relationship between S and N . For instance, the MSL estimator is biased when S is fixed and N tends to infinity (Property 1 in Gourieroux and Monfort 1990). If S increases with N , then the MSL estimator is consistent (Property 2 in Gourieroux and Monfort 1990). If S increases faster than N ( N / S 0 ) , then the MSL estimator is also efficient and, therefore, asymptotically equivalent to the ML estimator (Property 7 in Gourieroux and Monfort 1990). In practice, neither N or S might be close enough to infinity. However, the expectation is that there are achievable levels large enough for the biases to become acceptably small.

3. Materials and Methods

In this section, we define the MNL model and two specifications of the EC MIXL model. We also provide detailed information on how to simulate the corresponding likelihood functions.

3.1. Random Utility Maximization

Discrete choice models are usually introduced based on the random utility maximization (RUM) theory (see McFadden 1974), which states that the utility of individual i = 1 , , N from the chosen alternative j = 1 ,   J can be presented as U i j = V i j + ε i j , where V i j is the observed part of the utility and ε i j is the stochastic portion, unobserved by the researcher. Individual i will choose alternative j if and only if the level of utility associated with alternative j is higher than the levels associated with the other alternatives:
P i j = P U i j > U i k   ,     k j P i j = P ( V i j + ε i j > V i k + ε i k   ,     k j ) P i j = P ( ε i k ε i j < V i j V i k   ,     k j )
Since the utilities are latent, the choice probabilities are evaluated at relative measures, where the utility of one of the alternatives is taken as the reference. In order to calculate the choice probabilities, the distributional assumptions of the stochastic utility must be made. In the logit family of models, ε i j is assumed to be independently and identically distributed (i.i.d.) across individuals and alternatives with an extreme value type 1 (EV1) distribution. As a result, the difference between two i.i.d. EV1 error terms ( ε i k ε i j ) has a logistic distribution with the cumulative distribution function
P i j = 1 1 + k = 1 J exp [ ( V i k V i j ) ] ,     k j ,
The observed utility V i j is a function of individual characteristics and alternative attributes, and usually assumed to be linear for the parameters.

3.2. Multinomial Logit (MNL) Model

The MNL model is derived under the assumption that all the coefficients are fixed, implying that all the individuals in the population have homogeneous tastes. In this paper, we consider the case of three alternatives, in which the third alternative is restricted as the referent category. Therefore, we work with two utility differences, Z i 1 = U i 1 U i 3 and Z i 2 = U i 2 U i 3 , defined as
Z i 1 = α 1 + β 1 x i 1 + ε i 1 Z i 2 = α 2 + β 2 x i 2 + ε i 2
where ε i 1 ~ L o g i s t i c ( 0,1 ) and ε i 2 ~ L o g i s t i c ( 0,1 ) are i.i.d. logistically distributed, x i 1 and x i 2 are alternative attributes, α 1 and α 2 are alternative-specific coefficients (ASC), and β 1 and β 2 are coefficients of the alternative attributes. In some specifications, these coefficients are restricted to be equal, β 1 = β 2 = β . In the numerical examples, we choose the distribution of the covariates to be standard normal, such that x i 1 ~ N ( 0,1 ) and x i 2 ~ N ( 0,1 ) . The observability conditions for the outcome variables y i 1 , y i 2 and y i 3 are defined as
y i 1 = 1   i f   a n d   o n l y   i f   Z i 1 Z i 2 ,   Z i 1 0 y i 2 = 1   i f   a n d   o n l y   i f   Z i 2 > Z i 1 ,   Z i 2 0 y i 3 = 1   i f   a n d   o n l y   i f   Z i 1 < 0 ,   Z i 2 < 0
In other words, individual i chooses the alternative with the highest utility.

3.3. Mixed Logit (MIXL) Model

The assumption of homogeneous preferences leads to computationally convenient functional forms for the choice probabilities. However, preference homogeneity is not consistent with realistic behavioral patterns. Next, we present two specifications of the EC MIXL model that allow for various taste and substitution patterns through a correlation among the utilities of the different alternatives. The first specification is the MIXL model, with correlated slope coefficients and fixed ASCs. The second example is the MIXL model, with correlated ASCs and fixed slope coefficients. We refer to these two examples as EC1 and EC2, respectively. Under the EC1 specification taste patterns, we assume that
Z i 1 = α 1 + ( β 1 + u i 1 ) x i 1 + ε i 1 Z i 2 = α 2 + ( β 2 + u i 2 ) x i 2 + ε i 2
where u i 1 and u i 2 are jointly normally distributed ( u i 1 , u i 2 ) ~ N ( ( 0,0 ) , Σ ) with covariance matrix Σ . Similarly, under the EC2 specification substitution patterns, we assume
Z i 1 = ( α 1 + u i 1 ) + β x i 1 + ε i 1 Z i 2 = ( α 2 + u i 2 ) + β x i 2 + ε i 2
where, once again, ( u i 1 , u i 2 ) ~ N ( ( 0,0 ) , Σ ) . The covariance matrix in both cases is parametrized as
Σ = σ 1 2 ρ σ 1 σ 2 ρ σ 1 σ 2 σ 2 2
where restriction σ 1 = 1 is imposed for identification, such that
Σ = 1 ρ σ 2 ρ σ 2 σ 2 2 .
We define the lower triangular matrix
A = 1 0 σ 2 ρ σ 2 1 ρ 2 .
to be the Choleski decomposition of the covariance matrix, such that Σ = A A . Then, the bivariate normal u i 1 and u i 2 can be written as
u i 1 u i 2 = A v i 1 v i 2
where v i 1 N ( 0,1 ) and v i 2 N 0,1 , which helps us to approximate the simulated likelihood function drawn from the known density.
Both the EC1 and EC2 specifications induce correlation in the utilities of the different alternatives. The EC1 specification allows for correlation through the coefficients associated with alternative attributes xi1 and xi2. This correlation is known as taste patterns, because the weights for an attribute are associated with the weights of another attribute. The EC2 specification allows for correlations through the ASCs, similar to the classic red bus–blue bus example. This is also known as the substitution patterns, because the weights of an alternative are associated with those of another (e.g., red and blue bus). Each MIXL specification relaxes the preference homogeneity assumption in a slightly different way, and may be warranted depending on the decision context.

3.4. Simulated Likelihood Function of MIXL

The MIXL choice probabilities, unconditional of the unobserved latent variables v i 1 and v i 2 , can be written as integrals over the density f ( v i 1 ,   v i 2 ) , such that
P ( y i 1 = 1 ) = v i 1 v i 2 exp V i 1 1 + exp V i 1 + exp V i 2 f v i 1 , v i 2 d v i 2 d v i 1 P ( y i 2 = 1 ) = v i 1 v i 2 exp V i 2 1 + exp V i 1 + exp V i 2 f v i 1 , v i 2 d v i 2 d v i 1 P ( y i 3 = 1 ) = v i 1 v i 2 1 1 + exp V i 1 + exp V i 2 f v i 1 , v i 2 d v i 2 d v i 1
where the form of V i 1 and V i 2 depends on the EC model. In the EC1 specification,
V i 1 = α 1 + ( β 1 + v i 1 ) x i 1 V i 2 = α 2 + ( β 2 + σ 2 ρ v i 1 + σ 2 v i 2 1 ρ 2 ) x i 2
and in the EC2 specification,
V i 1 = ( α 1 + v i 1 ) + β x i 1 V i 2 = α 2 + σ 2 ρ v i 1 + σ 2 v i 2 1 ρ 2 + β x i 2
The log-likelihood function to be maximized can be written as
L L = i = 1 N j = 1 3 I y i j = 1 ln P ( y i j = 1 )  
However, the choice probabilities in Equation (10) do not have a closed form, and the log-likelihood function cannot be calculated analytically. Therefore, we approximate the choice probabilities through simulation, and maximize the simulated log-likelihood function
S L L = i = 1 N j = 1 3 I y i j = 1 ln P ^ ( y i j = 1 )  
where the simulated choice probabilities are
P ^ ( y i 1 = 1 ) = 1 S s = 1 S exp V i 1 s 1 + exp V i 1 s + exp V i 2 s P ^ ( y i 2 = 1 ) = 1 S s = 1 S exp V i 2 s 1 + exp V i 1 s + exp V i 2 s P ^ ( y i 3 = 1 ) = 1 S s = 1 S 1 1 + exp V i 1 s + exp V i 2 s
and s = 1 , , S represents the draw for v i 1 s and v i 2 s used to evaluate V i 1 s and V i 2 s .

4. Results

We generate data according to the EC1 and EC2 models. For each data generation process, the following specifications are used: α 1 = α 2 = 0.25 , β = β 1 = β 2 = 1 , x i 1 N ( 0,1 ) , x i 2 N ( 0,1 ) , ( u i 1 , u i 2 ) N ( ( 0,0 ) ,   ( 1 , ρ σ 2 , σ 2 2 ) ) , i = 1 ,   , N , and N = 1000 . We test three different values for σ 2 = ( 0.25,0.5,1 ) and generate data sets for all values of the correlation parameter ρ , ranging from −0.95 to 0.95 with increments of 0.05, ρ = { 0.95 : 0.05 : 0.95 } . Hence, a total of 3 × 39 = 117 covariance matrices for each specification is analyzed.
To examine the performance of the MSL estimator, we estimate the MIXL model under three different sets of restrictions imposed on the covariance matrix (M0, M1, and M2). Under M0, we do not impose any restrictions on the covariance matrix and estimate all the parameters. Under M1, we restrict the correlation parameter to zero, ρ = 0 , and estimate the remaining parameters. Finally, under M2, we restrict the correlation parameter to its true value ( ρ = T V ) and estimate the remaining parameters. In each example, the number of Halton draws are chosen to be H = ( 250 , 500 , 1000 ) , which is consistent with the levels used in leading MSL applications.
In summary, new MIXL data sets are generated for all 117 values of the covariance matrices for both the EC1 and EC2 specifications. For each data set, three specifications (M0, M1, and M2) are estimated, each with three different numbers of Halton draws ( 250 ,   500 ,   a n d   1000 ) . We repeat each simulation 100 times, R = 100 , generating a new data set and collecting the MSL estimates. The reported results are based on the means and standard errors calculated for these 100 simulations (i.e., Wald test).

4.1. Taste Patterns: EC1 Simulation Evidence

Table 1 presents the MSL results (M0, M1, M2) for the EC1 simulations with a high and negative correlation value of ρ = 0.95 . This extreme case produces a few results that deserve attention. First, there are biases in all the coefficient estimates under the M0 specification. For instance, when the true values are σ 2 = 0.25 , ρ = 0.95 , and H = 250 , the estimated values for α 1 , α 2 , β 1 , and β 2 are −0.307 (0.008), −0.349 (0.010), 1.131 (0.011), and 1.248 (0.013), respectively. In other words, α ^ 1 , α ^ 2 , β ^ 1 , and β ^ 2 are separated from their true values by 7, 10, 12, and 19 standard errors, respectively, and, therefore, the null hypotheses that H o : α 1 = α 2 = 0.25 and H o : β 1 = β 2 = 1 are overwhelmingly rejected. Notice also that there are no apparent reductions in the biases for α 1 , α 2 , β 1 , and β 2 , regardless of whether we increase the true variance σ 2 or the number of Halton draws.
Second, the MSL produces biased results for σ 2 with small true values, regardless of the chosen number of Halton draws. For example, when σ 2 = 0.25 and H = 250 , the estimated value of σ 2 is 0.413 (0.026), which is separated from its true value by 6.27 standard errors; therefore, the null hypothesis H o : σ 2 = 0.25 is rejected. However, the estimated σ 2 comes closer to its true value when we increase the variance. For example, when σ 2 = 0.5 , the estimated σ 2 is 0.569 (0.023). The case when σ 2 = 1 produces a similar result. However, the biased results for small variances do not change with the number of Halton draws. For instance, when H = 500 and H = 1000 , the estimated values for the true σ 2 = 0.25 are 0.439 (0.030) and 0.405 (0.025), respectively. In both of these cases, the null hypothesis H o : σ 2 = 0.25 is rejected. It is also interesting to notice that when the true standard deviation is σ 2 = 1 , the estimated σ 2 is much smaller for M0 than for M1 and M2.
Third, for almost all the parameter sets presented in Table 1, the estimated ρ is within three standard errors from its true value. The only case where the null hypothesis, that H o : ρ = 0.95 , may be rejected is when H = 250 and σ 2 = 1 . The EC1 simulation results for all the other 38 correlation values are provided in the Supplementary Materials.
Figure 1 plots the estimated ρ ^ against its true values, which ranges from −0.95 to 0.95 with increments of 0.05, where ρ ^ is calculated as the averages of ρ MSL estimates under M0 specification, obtained based on 100 samples ( R = 100 ) generated for the same set of true values and estimated with 1000 Halton draws ( H = 1000 ). The diagonal black line represents the true value of ρ . The blue, red and green lines correspond to σ 2 = 0.25 , σ 2 = 0.5 and σ 2 = 1 , respectively. Figure 1 shows that ρ ^ is mostly biased downward for H = 1000 .
Finally, when the researcher erroneously assumes that the true correlation is zero (M1), there is no substantial worsening in the performance of the MSL estimates. Similarly, when ρ is restricted at the true values (M2), there is no substantial improvement in the estimation of the parameters. A potential explanation for this is that the biases in the MSL estimation of taste patterns are mostly caused by difficulties in estimating the correlation parameter, with the efficiency of the ρ estimates declining for smaller values of σ 2 .

4.2. Substitution Patterns: EC2 Simulation Evidence

Table 2 presents the M0, M1, and M2 results for the EC2 simulations when the true correlation is ρ = 0.95 . First, notice that increasing the true value of variance σ 2 in M0 reduces the bias in α 1 , α 2 , and β . For example, given the estimated α 1 , α 2 , and β as −0.274 (0.012), −0.314 (0.011), and 1.16 (0.009), respectively, we may reject the null hypotheses that H o : α 2 = 0.25 and H o : β = 1 . However, when we increase σ 2 to one, the estimated α 1 , α 2 , and β are within 3 standard errors of their respective true values. This conclusion is irrespective of the number of Halton draws.
Second, the standard errors for the estimated σ 2 are substantially larger than those for α 1 , α 2 , and β . As a result, the estimated σ 2 is within 3 standard errors of its true value across almost all the parameter sets presented in Table 2. Therefore, the null hypothesis that σ 2 is equal to the true value cannot be rejected for almost all the cases. The only exception is the case when σ 2 = 1 and H = 1000 , and σ ^ 2 is 0.853 (0.043) and it is separated from the true value by 3.4 standard errors. The standard errors decrease slightly when the correlation is restricted in the specifications M1 and M2.
Third, the correlation parameter ρ is estimated with substantial biases in all the M0 specifications. The estimated ρ ^ is separated from the true value by 4 ( H = 1000 ,   σ 2 = 1 ) to 6 standard errors ( H = 500 ,   σ 2 = 0.25 ), and the null hypothesis H o : ρ = 0.95 is rejected in all the cases. The EC2 results for the other 38 correlation values are provided in the Supplementary Materials.
Figure 2 plots the estimated ρ ^ against the true values, once again ranging from −0.95 to 0.95 with increments of 0.05. Although ρ ^ is close to ρ for some values, the estimated correlation parameter mostly displays biases. The biases are smaller for σ 2 = 1 relative to when σ 2 = 0.25 or σ 2 = 0.5 . This finding is consistent with that of Jumamyradov and Munkin (2021) for the bivariate normal and bivariate Poisson-lognormal models. They report larger biases for smaller standard deviations. Overall, the M0 results show biases for all five parameters α 1 , α 2 , β , σ 2 , and ρ .
It is also interesting to notice that the MSL estimates of α 1 and α 2 in M1 have larger biases than in M0 for larger variances, and this is regardless of the number of Halton draws. For example, when H = 250 and σ 2 = 1 , the estimated α 1 and α 2 are −0.095 (0.009) and −0.161 (0.013), and are separated from their true value α 1 = α 2 = 0.25 by 17 and 7 standard errors, respectively. This does not change much for larger numbers of Halton draws. Thus, misspecifying the model setting correlation to ρ = 0 results in very large biases in α 1 and α 2 . Moreover, M1 produces larger positive biases for σ 2 compared to M0. For example, when H = 1000 , the estimated σ 2 are 0.365 (0.041), 0.558 (0.050), and 1.211 (0.055), while the true σ 2 = 0.25 , σ 2 = 0.5 and σ 2 = 1 , respectively. Moreover, the estimates of α 1 , α 2 , and β improve with larger variances in M0; however, we do not observe similar patterns in the M2 estimation, although there is an improvement in the estimation of σ 2 .

4.3. Choice Probabilities and Marginal Effects

Next, we examine how these reported biases affect the estimated choice probabilities and marginal effects (see Appendix A for the formulas of the marginal effects). Figure 3 plots the true and estimated P ( y = 1 ) calculated based on the M0 estimates of the EC1 specification with 500 Halton draws. The true probability means are calculated for the true values of all the parameters. The straight lines represent the true choice probabilities and the dashed lines represent the estimated choice probabilities. Figure 4 plots the true and estimated P ( y = 1 ) based on the M0 estimates of the EC2 specification with 500 Halton draws. Even though there are significant biases in the estimated parameters, as expected, the choice probabilities are close to their true values for both the EC1 and EC2 specifications (i.e., taste and substitution patterns).
However, when comparing the true and estimated marginal effects, the differences are considerable. Figure 5 plots the true and estimated P ( y = 1 ) / x 1 for EC1 (M0, 500 Halton draws). For example, when σ 2 = 1 and ρ = 0.95 , the true P ( y = 1 ) / x 1 is 0.1509 and the estimated P ( y = 1 ) / x 1 is 0.1679. Thus, the marginal effect in this case is overestimated by 11%. Figure 6 plots the true and estimated P ( y = 1 ) / x 1 for EC2 (M0, 500 Halton draws). For example, when σ 2 = 1 and ρ = 0.95 , the true P ( y = 1 ) / x 1 is 0.164 and the estimated P ( y = 1 ) / x 1 is 0.1839, which is overestimated by 12%.

5. Discussion

In this paper, we examine the properties of the MSL estimator in the context of two MIXL model specifications, EC1 and EC2 (i.e., taste and substitution patterns), where random parameters are generated by a correlated bivariate normal structure. We find that the MSL estimator produces significant biases in the estimated parameters. The problem becomes worse when the true value of the variance parameter is small and the correlation parameter is large in magnitude. Furthermore, we find that the biases in the estimated marginal effects can be as large as 12% of the true values. These biases are largely invariant to increases in the number of Halton draws. Since the existing literature has relied heavily on the MSL estimator in the analysis of the MIXL model, our findings should be an important additional warning to researchers about potential sizable biases in the results.
We also discover that the performance of the MSL depends on other factors, such as the model specification (i.e., EC1 or EC2), distributional assumptions, exogenous variation, as well as the true values of variance and correlation parameters. Therefore, we believe that biases in empirical applications (e.g., discrete choice experiments in health preference research) are likely to be worse due to real-world complexity; however, more research is needed to address such questions. Future simulation studies may examine biases in more complex specifications, such as the generalized MIXL or EC MIXL with more than two random parameters.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/econometrics12020008/s1. Table S1: Results for all parameter combinations.

Author Contributions

Conceptualization, M.J., M.M. and B.M.C.; methodology, M.J., M.M., W.H.G. and B.M.C.; software, M.J.; validation, M.J., M.M., W.H.G. and B.M.C.; formal analysis, M.J.; investigation, M.J.; data curation, M.J.; writing—original draft preparation, M.J.; writing—review and editing, M.J., M.M., W.H.G. and B.M.C.; visualization, M.J.; supervision, M.M., W.H.G. and B.M.C.; project administration, M.M., W.H.G. and B.M.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The dataset is available on request from the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

The marginal effects taken with respect to x 1 for EC1 are presented below. The marginal effects with respect to x 2 can be derived in the same way.
P ^ y = 1 x 1 = 1 N i = 1 N exp V ^ i 1 1 + exp V ^ i 1 + exp V ^ i 2 x i 1 = 1 N i = 1 N β ^ i 1 exp V ^ i 1 1 + exp V ^ i 1 + exp V ^ i 2 1 exp V ^ i 1 1 + exp V ^ i 1 + exp V ^ i 2 P ^ y = 2 x 1 = 1 N i = 1 N exp V ^ i 2 1 + exp V ^ i 1 + exp V ^ i 2 x i 1 = 1 N i = 1 N β ^ i 1 exp V ^ i 1 1 + exp V ^ i 1 + exp V ^ i 2 exp V ^ i 2 1 + exp V ^ i 1 + exp V ^ i 2 P ^ y = 3 x 1 = 1 N i = 1 N 1 1 + exp V ^ i 1 + exp V ^ i 2 x i 1 = 1 N i = 1 N β ^ i 1 exp V ^ i 1 1 + exp V ^ i 1 + exp V ^ i 2 1 1 + exp V ^ i 1 + exp V ^ i 2
where the conditional mean β ^ i 1 is estimated by simulation methods
β ^ i 1 = E β ^ i 1 q | y , X = 1 Q q = 1 Q β ^ i 1 q exp V ^ i 1 q 1 + exp V ^ i 1 q + exp V ^ i 2 q 1 Q q = 1 Q exp V ^ i 1 q 1 + exp V ^ i 1 q + exp V ^ i 2 q        V ^ i 1 q = α ^ 1 + β ^ i 1 q x i 1        V ^ i 2 q = α ^ 2 + β ^ i 2 q x i 2        β ^ i 1 q = β ^ 1 + v i 1 q        β ^ i 2 q = β ^ 2 + σ ^ 2 ρ ^ v i 1 q + σ ^ 2 v i 2 q 1 ρ ^ 2
where v i 1 q and v i 2 q are independent standard normal random variables for individual i = 1 , , N and random draws q = 1 , , Q .

References

  1. Bastin, Fabian, and Cinzia Cirillo. 2010. Reducing simulation bias in mixed logit model estimation. Journal of Choice Modelling 3: 71–88. [Google Scholar] [CrossRef]
  2. Ben-Akiva, Moshe, Denis Bolduc, and Mark Bradley. 1993. Estimation of travel choice models with randomly distributed values of time. Transportation Research Record 1413: 88–97. [Google Scholar]
  3. Bhat, Chandra. 1998. Accommodating flexible substitution patterns in multidimensional choice modeling: Formulation and application to travel mode and departure time choice. Transportation Research Part B: Methodological 32: 455–66. [Google Scholar] [CrossRef]
  4. Bhat, Chandra. 2000. Incorporating observed and unobserved heterogeneity in urban work travel mode choice modeling. Transportation Science 34: 228–38. [Google Scholar] [CrossRef]
  5. Bhat, Chandra. 2001. Quasi-random maximum simulated likelihood estimation of the mixed multinomial logit model. Transportation Research Part B: Methodological 35: 677–93. [Google Scholar] [CrossRef]
  6. Boyd, Hayden, and Robert Mellman. 1980. The effect of fuel economy standards on the U.S. automotive market: An hedonic demand analysis. Transportation Research Part A: General 14: 367–78. [Google Scholar] [CrossRef]
  7. Brownstone, David, and Kenneth Train. 1998. Forecasting new product penetration with flexible substitution patterns. Journal of Econometrics 89: 109–29. [Google Scholar] [CrossRef]
  8. Cardell, Scott, and Frederick Dunbar. 1980. Measuring the societal impacts of automobile downsizing. Transportation Research Part A: General 14: 423–34. [Google Scholar] [CrossRef]
  9. Czajkowski, Mikołaj, and Wiktor Budziński. 2019. Simulation error in maximum likelihood estimation of discrete choice models. Journal of Choice Modelling 31: 73–85. [Google Scholar] [CrossRef]
  10. Debreu, Gerard. 1960. Review of Duncan Luce, Individual Choice Behavior: A theoretical analysis. American Economic Review 50: 186–88. [Google Scholar]
  11. Elshiewy, Ossama, German Zenetti, and Yasemin Boztug. 2017. Differences between classical and Bayesian estimates for mixed logit models: A replication study. Journal of Applied Econometrics 32: 470–76. [Google Scholar] [CrossRef]
  12. Fiebig, Denzil, Michael Keane, Jordan Louviere, and Nada Wasi. 2010. The generalized multinomial logit model: Accounting for scale and coefficient heterogeneity. Marketing Science 29: 393–421. [Google Scholar] [CrossRef]
  13. Gourieroux, Christian, and Alain Monfort. 1990. Simulation based inference in models with heterogeneity. Annales d’Economie et de Statistique 20/21: 69–107. [Google Scholar] [CrossRef]
  14. Gourieroux, Christian, and Alain Monfort. 1996. Simulation-Based Econometric Methods. New York: Oxford University Press. [Google Scholar]
  15. Greene, William, and David Hensher. 2010. Does scale heterogeneity across individuals matter? An empirical assessment of alternative logit models. Transportation 37: 413–28. [Google Scholar] [CrossRef]
  16. Haan, Peter, Daniel Kemptner, and Arne Uhlendorff. 2015. Bayesian procedures as a numerical tool for the estimation of an intertemporal discrete choice model. Empirical Economics 49: 1123–41. [Google Scholar] [CrossRef]
  17. Hajivassiliou, Vassilis, Daniel McFadden, and Paul Ruud. 1996. Simulation of multivariate normal rectangle probabilities and their derivatives theoretical and computational results. Journal of Econometrics 72: 85–134. [Google Scholar] [CrossRef]
  18. Hensher, David, and William Greene. 2003. The mixed logit model: The state of practice. Transportation 30: 133–76. [Google Scholar] [CrossRef]
  19. Hensher, David, John Rose, and William Greene. 2005. Applied Choice Analysis: A Primer. Cambridge: Cambridge University Press. [Google Scholar]
  20. Hess, Stephane, and John Rose. 2012. Can scale and coefficient heterogeneity be separated in random coefficients models? Transportation 39: 1225–39. [Google Scholar] [CrossRef]
  21. Hess, Stephane, and Kenneth Train. 2017. Correlation and scale in mixed logit models. Journal of Choice Modelling 23: 1–8. [Google Scholar] [CrossRef]
  22. Hess, Stephane, Michel Bierlaire, and John Polak. 2005. Estimation of value of travel-time savings using mixed logit models. Transportation Research Part A: Policy and Practice 39: 221–36. [Google Scholar] [CrossRef]
  23. Huber, Joel, and Kenneth Train. 2001. On the similarity of classical and Bayesian estimates of individual mean partworths. Marketing Letters 12: 259–69. [Google Scholar] [CrossRef]
  24. Jumamyradov, Maksat, and Murat Munkin. 2021. Biases in maximum simulated likelihood estimation of bivariate models. Journal of Econometric Methods 11: 55–70. [Google Scholar] [CrossRef]
  25. Keane, Michael, and Nada Wasi. 2013. Comparing alternative models of heterogeneity in consumer choice behavior. Journal of Applied Econometrics 28: 1018–45. [Google Scholar] [CrossRef]
  26. Lee, Lung-Fei. 1995. Asymptotic bias in simulated maximum likelihood estimation of discrete choice models. Econometric Theory 11: 437–83. [Google Scholar] [CrossRef]
  27. Lerman, Steven, and Charles Manski. 1980. On the use of simulated frequencies to approximate choice probabilities. In Structural Analysis of Discrete Data with Econometric Applications. Edited by Charles Manski and Daniel McFadden. Cambridge: MIT Press, pp. 305–19. [Google Scholar]
  28. Louviere, Jordan, Deborah Street, Leonie Burgess, Nada Wasi, Towhidul Islam, and Anthony Marley. 2008. Modeling the choices of individual decision-makers by combining efficient choice experiment designs with extra preference information. Journal of Choice Modelling 1: 128–64. [Google Scholar] [CrossRef]
  29. Louviere, Jordan, Deborah Street, Richard Carson, Andrew Ainslie, Jay DeShazo, Trudy Cameron, David Hensher, Robert Kohn, and Tony Marley. 2002. Dissecting the random component of utility. Marketing Letters 13: 177–93. [Google Scholar] [CrossRef]
  30. Louviere, Jordan, Robert Meyer, David Bunch, Richard Carson, Benedict Dellaert, Michael Hanemann, David Hensher, and Julie Irwin. 1999. Combining sources of preference data for modeling complex decision processes. Marketing Letters 10: 205–17. [Google Scholar] [CrossRef]
  31. McFadden, Daniel. 1974. Conditional logit analysis of qualitative choice behavior. In Frontiers in Econometrics. Edited by Paul Zarambka. New York: Academic Press, pp. 105–42. [Google Scholar]
  32. McFadden, Daniel. 2001. Economic choices. American Economic Review 91: 351–78. [Google Scholar] [CrossRef]
  33. McFadden, Daniel, and Kenneth Train. 2000. Mixed mnl models for discrete response. Journal of Applied Econometrics 15: 447–70. [Google Scholar] [CrossRef]
  34. Palma, Marco, Dmitry Vedenov, and David Bessler. 2020. The order of variables, simulation noise, and accuracy of mixed logit estimates. Empirical Economics 58: 2049–83. [Google Scholar] [CrossRef]
  35. Quandt, Richard. 1970. The Demand for Travel: Theory and Measurement. Lexington: Heath and Company. [Google Scholar]
  36. Regier, Dean A., Mandy Ryan, Euan Phimister, and Carlo A. Marra. 2009. Bayesian and classical estimation of mixed logit: An application to genetic testing. Journal of Health Economics 28: 598–610. [Google Scholar] [CrossRef] [PubMed]
  37. Revelt, David, and Kenneth Train. 1998. Mixed logit with repeated choices: Households’ choices of appliance efficiency level. The Review of Economics and Statistics 80: 647–57. [Google Scholar] [CrossRef]
  38. Train, Kenneth. 2009. Discrete Choice Methods with Simulation. New York: Cambridge University Press. [Google Scholar]
Figure 1. Plots of ρ ^ for EC1 (taste patterns) using M0 (H = 1000).
Figure 1. Plots of ρ ^ for EC1 (taste patterns) using M0 (H = 1000).
Econometrics 12 00008 g001
Figure 2. Plots of ρ ^ for EC2 (substitution patterns) using M0 (H = 1000).
Figure 2. Plots of ρ ^ for EC2 (substitution patterns) using M0 (H = 1000).
Econometrics 12 00008 g002
Figure 3. Plots of true and estimated P ( y = 1 ) for EC1 (taste patterns) using M0 (H = 500).
Figure 3. Plots of true and estimated P ( y = 1 ) for EC1 (taste patterns) using M0 (H = 500).
Econometrics 12 00008 g003
Figure 4. Plots of true and estimated P ( y = 1 ) for EC2 (substitution patterns) using M0 (H = 500).
Figure 4. Plots of true and estimated P ( y = 1 ) for EC2 (substitution patterns) using M0 (H = 500).
Econometrics 12 00008 g004
Figure 5. Plots of true and estimated P ( y = 1 ) x 1 for EC1 (taste patterns) using M0 (H = 500).
Figure 5. Plots of true and estimated P ( y = 1 ) x 1 for EC1 (taste patterns) using M0 (H = 500).
Econometrics 12 00008 g005
Figure 6. Plots of true and estimated P ( y = 1 ) x 1 for EC2 (substitution patterns) using M0 (H = 500).
Figure 6. Plots of true and estimated P ( y = 1 ) x 1 for EC2 (substitution patterns) using M0 (H = 500).
Econometrics 12 00008 g006
Table 1. MSL estimates for EC1 (taste patterns).
Table 1. MSL estimates for EC1 (taste patterns).
ρ = 0.95
250 Draws500 Draws1000 Draws
σ 2 = 0.25 σ 2 = 0.5 σ 2 = 1 σ 2 = 0.25 σ 2 = 0.5 σ 2 = 1 σ 2 = 0.25 σ 2 = 0.5 σ 2 = 1
M0 α ^ 1 −0.307−0.299−0.299−0.299−0.301−0.303−0.308−0.298−0.308
(0.008)(0.009)(0.009)(0.008)(0.009)(0.009)(0.008)(0.009)(0.009)
α ^ 2 −0.349−0.342−0.273−0.349−0.342−0.296−0.348−0.344−0.289
(0.010)(0.009)(0.011)(0.009)(0.010)(0.012)(0.010)(0.010)(0.011)
β ^ 1 1.1311.1221.1631.1331.1411.1541.1321.1371.159
(0.011)(0.011)(0.012)(0.012)(0.012)(0.012)(0.011)(0.011)(0.012)
β ^ 2 1.2481.1971.0701.2651.2411.1361.2451.2201.120
(0.013)(0.013)(0.017)(0.013)(0.019)(0.027)(0.013)(0.014)(0.025)
σ ^ 2 0.4130.5690.8850.4390.6620.9300.4050.5660.897
(0.026)(0.023)(0.038)(0.030)(0.033)(0.054)(0.025)(0.022)(0.050)
ρ ^ −0.877−0.938−0.975−0.875−0.899−0.970−0.887−0.904−0.973
(0.026)(0.017)(0.008)(0.025)(0.023)(0.009)(0.025)(0.021)(0.009)
M1 α ^ 1 −0.308−0.297−0.294−0.300−0.302−0.302−0.308−0.298−0.302
(0.008)(0.009)(0.009)(0.009)(0.009)(0.009)(0.008)(0.009)(0.009)
α ^ 2 −0.342−0.342−0.295−0.342−0.337−0.323−0.342−0.341−0.323
(0.010)(0.010)(0.011)(0.010)(0.010)(0.012)(0.010)(0.010)(0.012)
β ^ 1 1.1511.1471.1961.1511.1671.1901.1511.1621.190
(0.011)(0.011)(0.011)(0.012)(0.012)(0.012)(0.011)(0.010)(0.012)
β ^ 2 1.2501.2271.1791.2611.2541.2641.2491.2391.263
(0.014)(0.015)(0.020)(0.016)(0.019)(0.025)(0.014)(0.016)(0.025)
σ ^ 2 0.3050.5221.0130.3220.5631.1070.3200.5081.108
(0.033)(0.036)(0.042)(0.035)(0.039)(0.049)(0.031)(0.032)(0.048)
M2 α ^ 1 −0.307−0.295−0.291−0.299−0.299−0.299−0.307−0.297−0.299
(0.008)(0.009)(0.009)(0.008)(0.009)(0.009)(0.008)(0.009)(0.009)
α ^ 2 −0.348−0.354−0.315−0.348−0.349−0.341−0.348−0.352−0.340
(0.010)(0.010)(0.011)(0.010)(0.009)(0.011)(0.010)(0.010)(0.011)
β ^ 1 1.1271.1141.1481.1261.1321.1401.1271.1291.140
(0.011)(0.011)(0.011)(0.012)(0.012)(0.012)(0.011)(0.010)(0.012)
β ^ 2 1.2491.2391.1931.2621.2661.2721.2491.2501.272
(0.012)(0.014)(0.017)(0.013)(0.016)(0.024)(0.012)(0.014)(0.024)
σ ^ 2 0.4300.6771.2000.4460.7221.2710.4300.6491.271
(0.025)(0.030)(0.034)(0.028)(0.030)(0.045)(0.025)(0.025)(0.045)
Table 2. MSL estimates for EC2 (substitution patterns).
Table 2. MSL estimates for EC2 (substitution patterns).
ρ = 0.95
250 Draws500 Draws1000 Draws
σ 2 = 0.25 σ 2 = 0.5 σ 2 = 1 σ 2 = 0.25 σ 2 = 0.5 σ 2 = 1 σ 2 = 0.25 σ 2 = 0.5 σ 2 = 1
M0 α ^ 1 −0.274−0.271−0.241−0.278−0.246−0.243−0.285−0.266−0.238
(0.012)(0.012)(0.012)(0.011)(0.012)(0.012)(0.012)(0.011)(0.012)
α ^ 2 −0.341−0.299−0.227−0.321−0.289−0.233−0.349−0.301−0.216
(0.011)(0.013)(0.012)(0.014)(0.015)(0.014)(0.014)(0.013)(0.012)
β ^ 1.1601.1271.0781.1651.1261.0711.1461.1271.069
(0.009)(0.009)(0.010)(0.009)(0.010)(0.009)(0.008)(0.008)(0.009)
σ ^ 2 0.2580.4690.8920.2840.4840.8740.2520.4420.853
(0.055)(0.050)(0.044)(0.058)(0.060)(0.044)(0.053)(0.050)(0.043)
ρ ^ −0.781−0.810−0.799−0.826−0.740−0.819−0.805−0.768−0.803
(0.040)(0.034)(0.036)(0.037)(0.039)(0.033)(0.034)(0.035)(0.036)
M1 α ^ 1 −0.243−0.200−0.095−0.247−0.188−0.093−0.255−0.199−0.095
(0.009)(0.009)(0.009)(0.009)(0.009)(0.010)(0.010)(0.010)(0.009)
α ^ 2 −0.285−0.241−0.161−0.292−0.244−0.171−0.312−0.247−0.161
(0.010)(0.011)(0.013)(0.013)(0.013)(0.016)(0.011)(0.013)(0.013)
β ^ 1.1531.1111.0851.1571.1171.0801.1321.1131.084
(0.010)(0.009)(0.011)(0.009)(0.011)(0.012)(0.008)(0.010)(0.011)
σ ^ 2 0.3350.5341.1990.4520.6601.1990.3650.5581.211
(0.052)(0.053)(0.058)(0.049)(0.053)(0.060)(0.041)(0.050)(0.055)
M2 α ^ 1 −0.283−0.280−0.278−0.283−0.265−0.281−0.291−0.278−0.278
(0.012)(0.012)(0.013)(0.012)(0.013)(0.010)(0.013)(0.011)(0.013)
α ^ 2 −0.319−0.312−0.308−0.319−0.303−0.319−0.347−0.316−0.308
(0.013)(0.015)(0.016)(0.016)(0.017)(0.019)(0.015)(0.016)(0.016)
β ^ 1.1601.1341.1231.1601.1291.1171.1441.1351.123
(0.010)(0.010)(0.011)(0.010)(0.011)(0.011)(0.009)(0.010)(0.011)
σ ^ 2 0.2350.4521.0270.2230.4491.0580.2220.4511.028
(0.048)(0.047)(0.055)(0.052)(0.053)(0.044)(0.045)(0.045)(0.055)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jumamyradov, M.; Munkin, M.; Greene, W.H.; Craig, B.M. Biases in the Maximum Simulated Likelihood Estimation of the Mixed Logit Model. Econometrics 2024, 12, 8. https://doi.org/10.3390/econometrics12020008

AMA Style

Jumamyradov M, Munkin M, Greene WH, Craig BM. Biases in the Maximum Simulated Likelihood Estimation of the Mixed Logit Model. Econometrics. 2024; 12(2):8. https://doi.org/10.3390/econometrics12020008

Chicago/Turabian Style

Jumamyradov, Maksat, Murat Munkin, William H. Greene, and Benjamin M. Craig. 2024. "Biases in the Maximum Simulated Likelihood Estimation of the Mixed Logit Model" Econometrics 12, no. 2: 8. https://doi.org/10.3390/econometrics12020008

APA Style

Jumamyradov, M., Munkin, M., Greene, W. H., & Craig, B. M. (2024). Biases in the Maximum Simulated Likelihood Estimation of the Mixed Logit Model. Econometrics, 12(2), 8. https://doi.org/10.3390/econometrics12020008

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop