Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Mixed logit

From Wikipedia, the free encyclopedia
Statistical model
Part of a series on
Regression analysis
Models
Estimation
Background

Mixed logit is a fully general statistical model for examiningdiscrete choices. It overcomes three important limitations of the standardlogit model by allowing for random taste variation across choosers, unrestricted substitution patterns across choices, and correlation in unobserved factors over time.[1] Mixed logit can choose any distributionf{\displaystyle f} for the random coefficients, unlike probit which is limited to the normal distribution. It is called "mixed logit" because the choice probability is a mixture of logits, withf{\displaystyle f} as the mixing distribution.[2] It has been shown that a mixed logit model can approximate to any degree of accuracy any true random utility model of discrete choice, given appropriate specification of variables and the coefficient distribution.[3]

Random taste variation

[edit]

The standard logit model's "taste" coefficients, orβ{\displaystyle \beta }'s, are fixed, which means theβ{\displaystyle \beta }'s are the same for everyone. Mixed logit has differentβ{\displaystyle \beta }'s for each person (i.e., each decision maker.)

In the standard logit model, the utility of personn{\displaystyle n} for alternativei{\displaystyle i} is:

Uni=βxni+εni{\displaystyle U_{ni}=\beta x_{ni}+\varepsilon _{ni}}

with

εni{\displaystyle \varepsilon _{ni}} ~ iid extreme value

For the mixed logit model, this specification is generalized by allowingβn{\displaystyle \beta _{n}} to be random. The utility of personn{\displaystyle n} for alternativei{\displaystyle i} in the mixed logit model is:

Uni=βnxni+εni{\displaystyle U_{ni}=\beta _{n}x_{ni}+\varepsilon _{ni}}

with

εni{\displaystyle \varepsilon _{ni}} ~ iid extreme value
βnf(β|θ){\displaystyle \quad \beta _{n}\sim f(\beta |\theta )}

whereθ are the parameters of the distribution ofβn{\displaystyle \beta _{n}}'s over the population, such as the mean and variance ofβn{\displaystyle \beta _{n}}.

Conditional onβn{\displaystyle \beta _{n}}, the probability that personn{\displaystyle n} chooses alternativei{\displaystyle i} is the standard logit formula:

Lni(βn)=eβnXnijeβnXnj{\displaystyle L_{ni}(\beta _{n})={\frac {e^{\beta _{n}X_{ni}}}{\sum _{j}e^{\beta _{n}X_{nj}}}}}

However, sinceβn{\displaystyle \beta _{n}} is random and not known, the (unconditional) choice probability is the integral of this logit formula over the density ofβn{\displaystyle \beta _{n}}.

Pni=Lni(β)f(β|θ)dβ{\displaystyle P_{ni}=\int L_{ni}(\beta )f(\beta |\theta )d\beta }

This model is also called the random coefficient logit model sinceβn{\displaystyle \beta _{n}} is a random variable. It allows the slopes of utility (i.e., themarginal utility) to be random, which is an extension of therandom effects model where only the intercept was stochastic.

Anyprobability density function can be specified for the distribution of the coefficients in the population, i.e., forf(β|θ){\displaystyle f(\beta |\theta )}. The most widely used distribution is normal, mainly for its simplicity. For coefficients that take the same sign for all people, such as a price coefficient that is necessarily negative or the coefficient of a desirable attribute, distributions with support on only one side of zero, like the lognormal, are used.[4][5] When coefficients cannot logically be unboundedly large or small, then bounded distributions are often used, such as theSb{\displaystyle S_{b}} or triangular distributions.

Unrestricted substitution patterns

[edit]

The mixed logit model can represent general substitution pattern because it does not exhibit logit's restrictiveindependence of irrelevant alternatives (IIA) property. The percentage change in personn{\displaystyle n}'s unconditional probability of choosing alternativei{\displaystyle i} given a percentage change in themth attribute of alternativej{\displaystyle j} (theelasticity ofPni{\displaystyle P_{ni}} with respect toxnjm{\displaystyle x_{nj}^{m}}) is

ElasticityPni,xnjm=xnjmPniβmLni(β)Lnj(β)f(β)dβ=xnjmβmLnj(β)Lni(β)Pnif(β)dβ{\displaystyle {\text{Elasticity}}_{P_{ni},x_{nj}^{m}}=-{\frac {x_{nj}^{m}}{P_{ni}}}\int \beta ^{m}L_{ni}(\beta )L_{nj}(\beta )f(\beta )d\beta =-x_{nj}^{m}\int \beta ^{m}L_{nj}(\beta ){\frac {L_{ni}(\beta )}{P_{ni}}}f(\beta )d\beta }

whereβm{\displaystyle \beta ^{m}} is themth element ofβ{\displaystyle \beta }.[1][5] It can be seen from this formula that a ten-percent reduction forPni{\displaystyle P_{ni}} need not imply (as with logit) a ten-percent reduction in each other alternativePnj{\displaystyle P_{nj}}.[1] The reason is that the relative percentages depend on the correlation between the conditional likelihood that personn{\displaystyle n} will choose alternativei,Lni,{\displaystyle i,L_{ni},} and the conditional likelihood that personn{\displaystyle n} will choose alternativej,Lnj,{\displaystyle j,L_{nj},} over various draws ofβ{\displaystyle \beta }.

Correlation in unobserved factors over time

[edit]

Standard logit does not take into account any unobserved factors that persist over time for a given decision maker. This can be a problem if you are using panel data, which represent repeated choices over time. By applying a standard logit model to panel data you are making the assumption that the unobserved factors that affect a person's choice are new every time the person makes the choice. That is a very unlikely assumption. To take into account both random taste variation and correlation in unobserved factors over time, the utility for respondent n for alternative i at time t is specified as follows:

Unit=βnXnit+εnit{\displaystyle U_{nit}=\beta _{n}X_{nit}+\varepsilon _{nit}}

where the subscript t is the time dimension. We still make the logit assumption which is thatε{\displaystyle \varepsilon } is i.i.d extreme value. That means thatε{\displaystyle \varepsilon } is independent over time, people, and alternatives.ε{\displaystyle \varepsilon } is essentially just white noise. However, correlation over time and over alternatives arises from the common effect of theβ{\displaystyle \beta }'s, which enter utility in each time period and each alternative.

To examine the correlation explicitly, assume that theβ's are normally distributed with meanβ¯{\displaystyle {\bar {\beta }}} and varianceσ2{\displaystyle \sigma ^{2}}. Then theutility equation becomes:

Unit=(β¯+σηn)Xnit+εnit{\displaystyle U_{nit}=({\bar {\beta }}+\sigma \eta _{n})X_{nit}+\varepsilon _{nit}}

andη is a draw from the standard normal density. Rearranging, the equation becomes:

Unit=β¯Xnit+(σηnXnit+εnit){\displaystyle U_{nit}={\bar {\beta }}X_{nit}+(\sigma \eta _{n}X_{nit}+\varepsilon _{nit})}
Unit=β¯Xnit+enit{\displaystyle U_{nit}={\bar {\beta }}X_{nit}+e_{nit}}

where the unobserved factors are collected inenit=σηnXnit+εnit{\displaystyle e_{nit}=\sigma \eta _{n}X_{nit}+\varepsilon _{nit}}. Of the unobserved factors,εnit{\displaystyle \varepsilon _{nit}} is independent over time, andσηnXnit{\displaystyle \sigma \eta _{n}X_{nit}} is not independent over time or alternatives.

Then the covariance between alternativesi{\displaystyle i} andj{\displaystyle j} is,

Cov(enit,enjt)=σ2(XnitXnjt){\displaystyle {\text{Cov}}(e_{nit},e_{njt})=\sigma ^{2}(X_{nit}X_{njt})}

and the covariance between timet{\displaystyle t} andq{\displaystyle q} is

Cov(enit,eniq)=σ2(XnitXniq){\displaystyle {\text{Cov}}(e_{nit},e_{niq})=\sigma ^{2}(X_{nit}X_{niq})}

By specifying the X's appropriately, one can obtain any pattern of covariance over time and alternatives.

Conditional onβn{\displaystyle \beta _{n}}, the probability of the sequence of choices by a person is simply the product of the logit probability of each individual choice by that person:

Ln(βn)=teβnXnitjeβnXnjt{\displaystyle L_{n}(\beta _{n})=\prod _{t}{\frac {e^{\beta _{n}X_{nit}}}{\sum _{j}e^{\beta _{n}X_{njt}}}}}

sinceεnit{\displaystyle \varepsilon _{nit}} is independent over time. Then the (unconditional) probability of the sequence of choices is simply the integral of this product of logits over the density ofβ{\displaystyle \beta }.

Pni=Ln(β)f(β|θ)dβ{\displaystyle P_{ni}=\int L_{n}(\beta )f(\beta |\theta )d\beta }

Simulation

[edit]

Unfortunately there is no closed form for the integral that enters the choice probability, and so the researcher must simulate Pn. Fortunately for the researcher, simulating Pn can be very simple. There are four basic steps to follow

1. Take a draw from the probability density function that you specified for the 'taste' coefficients. That is, take a draw fromf(β|θ){\displaystyle f(\beta |\theta )} and label the drawβr{\displaystyle \beta ^{r}}, forr=1{\displaystyle r=1} representing the first draw.

2. CalculateLn(βr){\displaystyle L_{n}(\beta ^{r})}. (The conditional probability.)

3. Repeat many times, forr=2,...,R{\displaystyle r=2,...,R}.

4. Average the results

Then the formula for the simulation look like the following,

P~ni=rLni(βr)R{\displaystyle {\tilde {P}}_{ni}={\frac {\sum _{r}L_{ni}(\beta ^{r})}{R}}}

where R is the total number of draws taken from the distribution, and r is one draw.

Once this is done you will have a value for the probability of each alternative i for each respondent n.

See also

[edit]

Further reading

[edit]

References

[edit]
  1. ^abc"Train, K. (2003) Discrete Choice Methods with Simulation"(PDF).Econometrics Laboratory University of California at Berkeley. Retrieved2025-02-05.
  2. ^Hensher, David A. & William H. Greene (2003). "The Mixed Logit Model: The State of Practice,"Transportation, Vol. 30, pp. 133–176, at p. 135.
  3. ^McFadden, D. andTrain, K. (2000). “Mixed MNL Models for Discrete Response,”Journal of Applied Econometrics, Vol. 15, No. 5, pp. 447-470.
  4. ^David Revelt andTrain, K (1998). "Mixed Logit with Repeated Choices: Households' Choices of Appliance Efficiency Level," Review of Economics and Statistics, Vol. 80, No. 4, pp. 647-657
  5. ^abTrain, K (1998)."Recreation Demand Models with Taste Variation," Land Economics, Vol. 74, No. 2, pp. 230-239.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Mixed_logit&oldid=1274095666"
Category:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp