Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Binary regression

From Wikipedia, the free encyclopedia
Statistical estimation method
Part of a series on
Regression analysis
Models
Estimation
Background

Instatistics, specificallyregression analysis, abinary regression estimates a relationship between one or moreexplanatory variables and a single outputbinary variable. Generally the probability of the two alternatives is modeled, instead of simply outputting a single value, as inlinear regression.

Binary regression is usually analyzed as a special case ofbinomial regression, with a single outcome (n=1{\displaystyle n=1}), and one of the two alternatives considered as "success" and coded as 1: the value is thecount of successes in 1 trial, either 0 or 1. The most common binary regression models are thelogit model (logistic regression) and theprobit model (probit regression).

Applications

[edit]

Binary regression is principally applied either for prediction (binary classification), or for estimating theassociation between the explanatory variables and the output. In economics, binary regressions are used to modelbinary choice.

Interpretations

[edit]

Binary regression models can be interpreted aslatent variable models, together with a measurement model; or as probabilistic models, directly modeling the probability.

Latent variable model

[edit]

The latent variable interpretation has traditionally been used inbioassay, yielding theprobit model, where normal variance and a cutoff are assumed. The latent variable interpretation is also used initem response theory (IRT).

Formally, the latent variable interpretation posits that the outcomey is related to a vector of explanatory variablesx by

y=1[y>0]{\displaystyle y=1[y^{*}>0]}

wherey=xβ+ε{\displaystyle y^{*}=x\beta +\varepsilon } andεxG{\displaystyle \varepsilon \mid x\sim G},β is a vector ofparameters andG is aprobability distribution.

This model can be applied in many economic contexts. For instance, the outcome can be the decision of a manager whether invest to a program,y{\displaystyle y^{*}} is the expected netdiscounted cash flow andx is a vector of variables which can affect the cash flow of this program. Then the manager will invest only when she expects the net discounted cash flow to be positive.[1]

Often, theerror termε{\displaystyle \varepsilon } is assumed to follow anormal distribution conditional on the explanatory variablesx. This generates the standardprobit model.[2]

Probabilistic model

[edit]

The simplest direct probabilistic model is thelogit model, which models thelog-odds as a linear function of the explanatory variable or variables. The logit model is "simplest" in the sense ofgeneralized linear models (GLIM): the log-odds are the natural parameter for theexponential family of the Bernoulli distribution, and thus it is the simplest to use for computations.

Another direct probabilistic model is thelinear probability model, which models the probability itself as a linear function of the explanatory variables. A drawback of the linear probability model is that, for some values of the explanatory variables, the model will predict probabilities less than zero or greater than one.

See also

[edit]

References

[edit]
  1. ^For a detailed example, refer to: Tetsuo Yai, Seiji Iwakura, Shigeru Morichi, Multinomial probit with structured covariance for route choice behavior, Transportation Research Part B: Methodological, Volume 31, Issue 3, June 1997, Pages 195–207, ISSN 0191-2615
  2. ^Bliss, C. I. (1934). "The Method of Probits". Science 79 (2037): 38–39.
Stub icon

Thisstatistics-related article is astub. You can help Wikipedia byadding missing information.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Binary_regression&oldid=1322636188"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp