Logistic regression

From Wikipedia, the free encyclopedia

Statistical model for a binary dependent variable

"Logit model" redirects here; not to be confused with Logit function.

Example graph of a logistic regression curve fitted to data. The curve shows the estimated probability of passing an exam (binary dependent variable) versus hours studying (scalar independent variable). See § Example for worked details.

In statistics, a logistic model (or logit model) is a statistical model that models the log-odds of an event as a linear combination of one or more independent variables. In regression analysis, logistic regression^[1] (or logit regression) estimates the parameters of a logistic model (the coefficients in the linear or non-linear combinations). In binary logistic regression there is a single binary dependent variable, coded by an indicator variable, where the two values are labeled "0" and "1", while the independent variables can each be a binary variable (two classes, coded by an indicator variable) or a continuous variable (any real value). The corresponding probability of the value labeled "1" can vary between 0 (certainly the value "0") and 1 (certainly the value "1"), hence the labeling;^[2] the function that converts log-odds to probability is the logistic function, hence the name. The unit of measurement for the log-odds scale is called a logit, from logistic unit, hence the alternative names. See § Background and § Definition for formal mathematics, and § Example for a worked example.
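As a minimal sketch of the conversion described above, the following Python snippet maps log-odds to a probability with the logistic function and back again with its inverse. The function names `logistic` and `logit` are chosen here for illustration and are not taken from any particular library.

```python
import math


def logistic(t: float) -> float:
    """Map log-odds t (any real number) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-t))


def logit(p: float) -> float:
    """Map a probability p in (0, 1) to log-odds; the inverse of logistic()."""
    return math.log(p / (1.0 - p))


# Log-odds of 0 correspond to even odds, i.e. a probability of one half.
print(logistic(0.0))

# Converting to a probability and back recovers the original log-odds
# (up to floating-point rounding).
print(logit(logistic(1.5)))
```

The sketch is meant for moderate values of `t`; very large negative inputs would overflow `math.exp`, which a production implementation would need to guard against.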

Binary variables are widely used in statistics to model the probability of a certain class or event taking place, such as the probability of a team winning, of a patient being healthy, etc. (see § Applications), and the logistic model has been the most commonly used model for binary regression since about 1970.^[3] Binary variables can be generalized to categorical variables when there are more than two possible values (e.g. whether an image is of a cat, dog, lion, etc.), and binary logistic regression generalizes to multinomial logistic regression. If the multiple categories are ordered, one can use ordinal logistic regression (for example the proportional odds ordinal logistic model^[4]). See § Extensions for further extensions. The logistic regression model itself simply models the probability of the output in terms of the input and does not perform statistical classification (it is not a classifier), though it can be used to make a classifier, for instance by choosing a cutoff value and classifying inputs with probability greater than the cutoff as one class and those below the cutoff as the other; this is a common way to make a binary classifier.
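The cutoff construction mentioned above can be sketched as follows. The coefficient values, the function names, and the default cutoff of 0.5 are all hypothetical choices made for illustration; they are not fitted to any data in this article.

```python
import math


def predict_probability(x: float, intercept: float, slope: float) -> float:
    """Probability of class 1 under a one-variable logistic model."""
    t = intercept + slope * x          # log-odds (linear predictor)
    return 1.0 / (1.0 + math.exp(-t))  # logistic function


def classify(x: float, intercept: float, slope: float, cutoff: float = 0.5) -> int:
    """Turn the modelled probability into a class label using a cutoff."""
    return 1 if predict_probability(x, intercept, slope) > cutoff else 0


# Hypothetical coefficients, used only to demonstrate the thresholding step.
INTERCEPT, SLOPE = -4.0, 2.0

labels = [classify(x, INTERCEPT, SLOPE) for x in (1.0, 2.0, 3.0)]
print(labels)
```

The model itself only supplies the probability; the choice of cutoff is a separate decision, and lowering it (for example to 0.1) moves borderline inputs into class 1.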

Analogous linear models for binary variables with a different sigmoid function instead of the logistic function (to convert the linear combination to a probability) can also be used, most notably the probit model; see § Alternatives. The defining characteristic of the logistic model is that increasing one of the independent variables multiplicatively scales the odds of the given outcome at a constant rate, with each independent variable having its own parameter; for a binary dependent variable this generalizes the odds ratio. More abstractly, the log-odds (logit) is the natural parameter for the Bernoulli distribution, and its inverse, the logistic function, is in this sense the "simplest" way to convert a real number to a probability.
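The multiplicative scaling of the odds can be made explicit with a short derivation. It assumes a single independent variable x with intercept β₀ and slope β₁, a notation chosen here to match the coefficient table of the worked example.

```latex
\begin{aligned}
\ln\frac{p(x)}{1-p(x)} &= \beta_0 + \beta_1 x
  && \text{(log-odds are linear in } x\text{)} \\
\operatorname{odds}(x) = \frac{p(x)}{1-p(x)} &= e^{\beta_0 + \beta_1 x} \\
\frac{\operatorname{odds}(x+1)}{\operatorname{odds}(x)}
  &= \frac{e^{\beta_0 + \beta_1 (x+1)}}{e^{\beta_0 + \beta_1 x}} = e^{\beta_1}
\end{aligned}
```

Each unit increase in x therefore multiplies the odds by the same factor e^{β₁}, whatever the starting value of x.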

The parameters of a logistic regression are most commonly estimated by maximum-likelihood estimation (MLE). Unlike linear least squares, the maximum-likelihood estimate has no closed-form expression; see § Model fitting. Logistic regression by MLE plays a similarly basic role for binary or categorical responses as linear regression by ordinary least squares (OLS) plays for scalar responses: it is a simple, well-analyzed baseline model; see § Comparison with linear regression for discussion. Logistic regression as a general statistical model was originally developed and popularized primarily by Joseph Berkson,^[5] beginning in Berkson (1944), where he coined "logit"; see § History.

Hours (x_k)	0.50	0.75	1.00	1.25	1.50	1.75	1.75	2.00	2.25	2.50	2.75	3.00	3.25	3.50	4.00	4.25	4.50	4.75	5.00	5.50
Pass (y_k)	0	0	0	0	0	0	1	0	1	0	1	0	1	0	1	1	1	1	1	1
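The observations in the table above can be fitted by maximum likelihood. Because the estimate has no closed form, the sketch below uses Newton's method on the log-likelihood, written out by hand for the two-parameter case. The function name `fit_newton`, the zero starting point, and the stopping tolerance are assumptions made for illustration; this is one possible fitting procedure, not necessarily the one used to produce the article's figures.

```python
import math

# Data copied from the table above: hours studied and pass (1) / fail (0).
hours = [0.50, 0.75, 1.00, 1.25, 1.50, 1.75, 1.75, 2.00, 2.25, 2.50,
         2.75, 3.00, 3.25, 3.50, 4.00, 4.25, 4.50, 4.75, 5.00, 5.50]
passed = [0, 0, 0, 0, 0, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 1, 1, 1, 1, 1]


def logistic(t):
    return 1.0 / (1.0 + math.exp(-t))


def fit_newton(xs, ys, max_iter=50, tol=1e-10):
    """Maximise the Bernoulli log-likelihood of p = logistic(b0 + b1 * x)."""
    b0, b1 = 0.0, 0.0  # assumed starting point
    for _ in range(max_iter):
        g0 = g1 = 0.0              # gradient of the log-likelihood
        h00 = h01 = h11 = 0.0      # entries of the negative Hessian
        for x, y in zip(xs, ys):
            p = logistic(b0 + b1 * x)
            w = p * (1.0 - p)
            g0 += y - p
            g1 += (y - p) * x
            h00 += w
            h01 += w * x
            h11 += w * x * x
        # Solve the 2x2 system (negative Hessian) * delta = gradient.
        det = h00 * h11 - h01 * h01
        d0 = (h11 * g0 - h01 * g1) / det
        d1 = (h00 * g1 - h01 * g0) / det
        b0 += d0
        b1 += d1
        if abs(d0) < tol and abs(d1) < tol:
            break
    return b0, b1


intercept, slope = fit_newton(hours, passed)
print(f"intercept = {intercept:.2f}, slope = {slope:.2f}")
```

The fitted values should come out close to the rounded coefficients of −4.1 and 1.5 reported in the coefficient table of the worked example.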

Hours of study (x)	Passing exam: log-odds (t)	Passing exam: odds (e^t)	Passing exam: probability (p)
1	−2.57	0.076 ≈ 1:13.1	0.07
2	−1.07	0.34 ≈ 1:2.91	0.26
μ ≈ 2.7	0	1	1/2 = 0.50
3	0.44	1.55	0.61
4	1.94	6.96	0.87
5	3.45	31.4	0.97
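The columns of this table are tied together by the definitions of odds and of the logistic function. As a worked check using only values from the table, the row for 4 hours of study gives:

```latex
\begin{aligned}
t &= 1.94 \\
\text{odds} &= e^{t} = e^{1.94} \approx 6.96 \\
p &= \frac{\text{odds}}{1 + \text{odds}} = \frac{6.96}{7.96} \approx 0.87
\end{aligned}
```

The row μ ≈ 2.7 is the special case t = 0, where the odds are e^0 = 1 and the probability is 1/2.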

	Coefficient	Std. Error	z-value	p-value (Wald)
Intercept (β₀)	−4.1	1.8	−2.3	0.021
Hours (β₁)	1.5	0.6	2.4	0.017

	Center-right	Center-left	Secessionist
High-income	strong +	strong −	strong −
Middle-income	moderate +	weak +	none
Low-income	none	strong +	none

Applications

General

Supervised machine learning

Example

Problem

Model

Fit

Parameter estimation

Predictions

Model evaluation

Generalizations

Background

Definition of the logistic function

Definition of the inverse of the logistic function

Interpretation of these terms

Definition of the odds

The odds ratio

Multiple explanatory variables

Definition

Many explanatory variables, two categories

Multinomial logistic regression: Many explanatory variables and many categories

Interpretations

As a generalized linear model

As a latent-variable model

Two-way latent-variable model

Example

As a "log-linear" model

As a single-layer perceptron

In terms of binomial data

Model fitting

Maximum likelihood estimation (MLE)

Iteratively reweighted least squares (IRLS)

Bayesian

"Rule of ten"

Error and significance of fit

Deviance and likelihood ratio test: a simple case

Goodness of fit summary

Deviance and likelihood ratio tests

Pseudo-R-squared

Hosmer–Lemeshow test

Coefficient significance

Likelihood ratio test

Wald statistic

Case-control sampling

Discussion

Machine learning and cross-entropy loss function

Comparison with linear regression

Alternatives

History

Extensions

See also

References

Sources

External links