Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Logistic function

From Wikipedia, the free encyclopedia
S-shaped curve
For the recurrence relation, seeLogistic map.

Alogistic function orlogistic curve is a common S-shaped curve (sigmoid curve) with the equation

f(x)=L1+ek(xx0){\displaystyle f(x)={\frac {L}{1+e^{-k(x-x_{0})}}}}

where

The logistic function has domain thereal numbers, the limit asx{\displaystyle x\to -\infty } is 0, and the limit asx+{\displaystyle x\to +\infty } isL{\displaystyle L}.

Standard logistic function whereL=1,k=1,x0=0{\displaystyle L=1,k=1,x_{0}=0}

Theexponential function with negated argument(ex{\displaystyle e^{-x}}) is used to define thestandard logistic function whereL=1,k=1,x0=0{\displaystyle L=1,k=1,x_{0}=0}, which has the equationf(x)=11+ex{\displaystyle f(x)={\frac {1}{1+e^{-x}}}}and is sometimes simply called thesigmoid function.[2] It is also sometimes called theexpit, being the inverse function of thelogit.[3][4]

The logistic function finds applications in a range of fields, includingbiology (especiallyecology),biomathematics,chemistry,demography,economics,geoscience,mathematical psychology,probability,sociology,political science,linguistics,statistics, andartificial neural networks. There are variousgeneralizations, depending on the field.

History

[edit]
Original image of a logistic curve, contrasted with what Verhulst called a "logarithmic curve" (in modern terms, "exponential curve")

The logistic function was introduced in a series of three papers byPierre François Verhulst between 1838 and 1847, who devised it as a model ofpopulation growth by adjusting theexponential growth model, under the guidance ofAdolphe Quetelet.[5] Verhulst first devised the function in the mid 1830s, publishing a brief note in 1838,[1] then presented an expanded analysis and named the function in 1844 (published 1845);[a][6] the third paper adjusted the correction term in his model of Belgian population growth.[7]

The initial stage of growth is approximately exponential (geometric); then, as saturation begins, the growth slows to linear (arithmetic), and at maturity, growth approaches the limit with an exponentially decaying gap, like the initial stage in reverse.

Verhulst did not explain the choice of the term "logistic" (French:logistique), but it is presumably in contrast to thelogarithmic curve,[8][b] and by analogy with arithmetic and geometric. His growth model is preceded by a discussion ofarithmetic growth andgeometric growth (whose curve he calls alogarithmic curve, instead of the modern termexponential curve), and thus "logistic growth" is presumably named by analogy,logistic being fromAncient Greek:λογιστικός,romanizedlogistikós, a traditional division ofGreek mathematics.[c]

As a word derived from ancient Greek mathematical terms,[9]the name of this function is unrelated to the military and management termlogistics, which is instead fromFrench:logis "lodgings",[10] though some believe the Greek term also influencedlogistics;[9] seeLogistics § Origin for details.

Mathematical properties

[edit]

Thestandard logistic function is the logistic function with parametersk=1{\displaystyle k=1},x0=0{\displaystyle x_{0}=0},L=1{\displaystyle L=1}, which yields

f(x)=11+ex=exex+1=ex/2ex/2+ex/2.{\displaystyle f(x)={\frac {1}{1+e^{-x}}}={\frac {e^{x}}{e^{x}+1}}={\frac {e^{x/2}}{e^{x/2}+e^{-x/2}}}.}

In practice, due to the nature of theexponential functionex{\displaystyle e^{-x}}, it is often sufficient to compute the standard logistic function forx{\displaystyle x} over a small range of real numbers, such as a range contained in [−6, +6], as it quickly converges very close to its saturation values of 0 and 1.

Symmetries

[edit]

The logistic function has the symmetry property that

1f(x)=f(x).{\displaystyle 1-f(x)=f(-x).}

This reflects that the growth from 0 whenx{\displaystyle x} is small is symmetric with the decay of the gap to the limit (1) whenx{\displaystyle x} is large.

Further,xf(x)1/2{\displaystyle x\mapsto f(x)-1/2} is anodd function.

The sum of the logistic function and its reflection about the vertical axis,f(x){\displaystyle f(-x)}, is

11+ex+11+e(x)=exex+1+1ex+1=1.{\displaystyle {\frac {1}{1+e^{-x}}}+{\frac {1}{1+e^{-(-x)}}}={\frac {e^{x}}{e^{x}+1}}+{\frac {1}{e^{x}+1}}=1.}

The logistic function is thus rotationally symmetrical about the point (0, 1/2).[11]

Inverse function

[edit]

The logistic function is the inverse of the naturallogit function

logitp=logp1p for 0<p<1{\displaystyle \operatorname {logit} p=\log {\frac {p}{1-p}}\quad {\text{ for }}\,0<p<1}

and so converts the logarithm ofodds into aprobability. The conversion from thelog-likelihood ratio of two alternatives also takes the form of a logistic curve.

Hyperbolic tangent

[edit]

The logistic function is an offset and scaledhyperbolic tangent function:f(x)=12+12tanh(x2),{\displaystyle f(x)={\frac {1}{2}}+{\frac {1}{2}}\tanh \left({\frac {x}{2}}\right),}ortanh(x)=2f(2x)1.{\displaystyle \tanh(x)=2f(2x)-1.}

This follows fromtanh(x)=exexex+ex=ex(1e2x)ex(1+e2x)=f(2x)e2x1+e2x=f(2x)e2x+111+e2x=2f(2x)1.{\displaystyle {\begin{aligned}\tanh(x)&={\frac {e^{x}-e^{-x}}{e^{x}+e^{-x}}}\\&={\frac {e^{x}\cdot \left(1-e^{-2x}\right)}{e^{x}\cdot \left(1+e^{-2x}\right)}}\\&=f(2x)-{\frac {e^{-2x}}{1+e^{-2x}}}\\&=f(2x)-{\frac {e^{-2x}+1-1}{1+e^{-2x}}}\\&=2f(2x)-1.\end{aligned}}}

The hyperbolic-tangent relationship leads to another form for the logistic function's derivative:

ddxf(x)=14sech2(x2),{\displaystyle {\frac {d}{dx}}f(x)={\frac {1}{4}}\operatorname {sech} ^{2}\left({\frac {x}{2}}\right),}

which ties the logistic function into thelogistic distribution.

Geometrically, the hyperbolic tangent function is thehyperbolic angle on theunit hyperbolax2y2=1{\displaystyle x^{2}-y^{2}=1}, which factors as(x+y)(xy)=1{\displaystyle (x+y)(x-y)=1}, and thus has asymptotes the lines through the origin with slope1{\displaystyle -1} and with slope1{\displaystyle 1}, and vertex at(1,0){\displaystyle (1,0)} corresponding to the range and midpoint (1{\displaystyle {1}}) of tanh. Analogously, the logistic function can be viewed as the hyperbolic angle on the hyperbolaxyy2=1{\displaystyle xy-y^{2}=1}, which factors asy(xy)=1{\displaystyle y(x-y)=1}, and thus has asymptotes the lines through the origin with slope0{\displaystyle 0} and with slope1{\displaystyle 1}, and vertex at(2,1){\displaystyle (2,1)}, corresponding to the range and midpoint (1/2{\displaystyle 1/2}) of the logistic function.

Parametrically,hyperbolic cosine andhyperbolic sine give coordinates on the unit hyperbola:[d]((et+et)/2,(etet)/2){\displaystyle \left((e^{t}+e^{-t})/2,(e^{t}-e^{-t})/2\right)}, with quotient the hyperbolic tangent. Similarly,(et/2+et/2,et/2){\displaystyle {\bigl (}e^{t/2}+e^{-t/2},e^{t/2}{\bigr )}} parametrizes the hyperbolaxyy2=1{\displaystyle xy-y^{2}=1}, with quotient the logistic function. These correspond tolinear transformations (and rescaling the parametrization) ofthe hyperbolaxy=1{\displaystyle xy=1}, with parametrization(et,et){\displaystyle (e^{-t},e^{t})}: the parametrization of the hyperbola for the logistic function corresponds tot/2{\displaystyle t/2} and the linear transformation(1101){\displaystyle {\bigl (}{\begin{smallmatrix}1&1\\0&1\end{smallmatrix}}{\bigr )}}, while the parametrization of the unit hyperbola (for the hyperbolic tangent) corresponds to the linear transformation12(1111){\displaystyle {\tfrac {1}{2}}{\bigl (}{\begin{smallmatrix}1&1\\-1&1\end{smallmatrix}}{\bigr )}}.

Derivative

[edit]
The logistic function and its first 3 derivatives

The standard logistic function has an easily calculatedderivative. The derivative is known as the density of thelogistic distribution:

f(x)=11+ex=ex1+ex,{\displaystyle f(x)={\frac {1}{1+e^{-x}}}={\frac {e^{x}}{1+e^{x}}},}

ddxf(x)=ex(1+ex)exex(1+ex)2=ex(1+ex)2=(ex1+ex)(11+ex)=(ex1+ex)(1ex1+ex)=f(x)(1f(x)){\displaystyle {\begin{aligned}{\frac {d}{dx}}f(x)&={\frac {e^{x}\cdot (1+e^{x})-e^{x}\cdot e^{x}}{{\left(1+e^{x}\right)}^{2}}}\\[1ex]&={\frac {e^{x}}{{\left(1+e^{x}\right)}^{2}}}\\[1ex]&=\left({\frac {e^{x}}{1+e^{x}}}\right)\left({\frac {1}{1+e^{x}}}\right)\\[1ex]&=\left({\frac {e^{x}}{1+e^{x}}}\right)\left(1-{\frac {e^{x}}{1+e^{x}}}\right)\\[1.2ex]&=f(x)\left(1-f(x)\right)\end{aligned}}}from which all higher derivatives can be derived algebraically. For example,f=(12f)(1f)f{\displaystyle f''=(1-2f)(1-f)f}.

The logistic distribution is alocation–scale family, which corresponds to parameters of the logistic function. IfL=1{\displaystyle L=1} is fixed, then the midpointx0{\displaystyle x_{0}} is the location and the slopek{\displaystyle k} is the scale.

Integral

[edit]

Conversely, itsantiderivative can be computed by thesubstitutionu=1+ex{\displaystyle u=1+e^{x}}, since

f(x)=ex1+ex=uu,{\displaystyle f(x)={\frac {e^{x}}{1+e^{x}}}={\frac {u'}{u}},}

so (dropping theconstant of integration)

ex1+exdx=1udu=lnu=ln(1+ex).{\displaystyle \int {\frac {e^{x}}{1+e^{x}}}\,dx=\int {\frac {1}{u}}\,du=\ln u=\ln(1+e^{x}).}

Inartificial neural networks, this is known as thesoftplus function and (with scaling) is a smooth approximation of theramp function, just as the logistic function (with scaling) is a smooth approximation of theHeaviside step function.

Taylor series

[edit]

The standard logistic function isanalytic on the whole real line sincef:RR{\displaystyle f:\mathbb {R} \to \mathbb {R} },f(x)=11+ex=h(g(x)){\displaystyle f(x)={\frac {1}{1+e^{-x}}}=h(g(x))} whereg:RR{\displaystyle g:\mathbb {R} \to \mathbb {R} },g(x)=1+ex{\displaystyle g(x)=1+e^{-x}} andh:(0,)(0,){\displaystyle h:(0,\infty )\to (0,\infty )},h(x)=1x{\displaystyle h(x)={\frac {1}{x}}} are analytic on their domains, and the composition of analytic functions is again analytic.

A formula for thenth derivative of the standard logistic function is

dnfdxn=i=1n(j=1n(1)i+j(ij)jn)eix(1+ex)i+1{\displaystyle {\frac {d^{n}f}{dx^{n}}}=\sum _{i=1}^{n}{\frac {\left(\sum _{j=1}^{n}{\left(-1\right)}^{i+j}{\binom {i}{j}}j^{n}\right)e^{-ix}}{{\left(1+e^{-x}\right)}^{i+1}}}}

therefore itsTaylor series about the pointa{\displaystyle a} is

f(x)=f(a)(xa)+n=1i=1n(j=1n(1)i+j(ij)jn)eix(1+ex)i+1(xa)nn!.{\displaystyle f(x)=f(a)(x-a)+\sum _{n=1}^{\infty }\sum _{i=1}^{n}{\frac {\left(\sum _{j=1}^{n}{\left(-1\right)}^{i+j}{\binom {i}{j}}j^{n}\right)e^{-ix}}{{\left(1+e^{-x}\right)}^{i+1}}}{\frac {{\left(x-a\right)}^{n}}{n!}}.}

Logistic differential equation

[edit]

The unique standard logistic function is the solution of the simple first-order non-linearordinary differential equation

ddxf(x)=f(x)(1f(x)){\displaystyle {\frac {d}{dx}}f(x)=f(x){\big (}1-f(x){\big )}}

withboundary conditionf(0)=1/2{\displaystyle f(0)=1/2}. This equation is the continuous version of thelogistic map. Note that the reciprocal logistic function is solution to a simple first-orderlinear ordinary differential equation.[12]

The qualitative behavior is easily understood in terms of thephase line: the derivative is 0 when the function is 1; and the derivative is positive forf{\displaystyle f} between 0 and 1, and negative forf{\displaystyle f} above 1 or less than 0 (though negative populations do not generally accord with a physical model). This yields an unstable equilibrium at 0 and a stable equilibrium at 1, and thus for any function value greater than 0 and less than 1, it grows to 1.

The logistic equation is a special case of theBernoulli differential equation and has the following solution:

f(x)=exex+C.{\displaystyle f(x)={\frac {e^{x}}{e^{x}+C}}.}

Choosing the constant of integrationC=1{\displaystyle C=1} gives the other well known form of the definition of the logistic curve:

f(x)=exex+1=11+ex.{\displaystyle f(x)={\frac {e^{x}}{e^{x}+1}}={\frac {1}{1+e^{-x}}}.}

More quantitatively, as can be seen from the analytical solution, the logistic curve shows earlyexponential growth for negative argument, which reaches to linear growth of slope 1/4 for an argument near 0, then approaches 1 with an exponentially decaying gap.

The differential equation derived above is a special case of a general differential equation that only models the sigmoid function forx>0{\displaystyle x>0}. In many modeling applications, the moregeneral form[citation needed]df(x)dx=kLf(x)(Lf(x)),f(0)=L1+ekx0{\displaystyle {\frac {df(x)}{dx}}={\frac {k}{L}}f(x){\big (}L-f(x){\big )},\quad f(0)={\frac {L}{1+e^{kx_{0}}}}} can be desirable. Its solution is the shifted and scaledsigmoid functionLσ(k(xx0))=L1+ek(xx0){\displaystyle L\sigma {\big (}k(x-x_{0}){\big )}={\frac {L}{1+e^{-k(x-x_{0})}}}}.

Probabilistic interpretation

[edit]
Further information:Logistic regression

When the capacityL=1{\displaystyle L=1}, the value of the logistic function is in the range(0,1){\displaystyle (0,1)} and can be interpreted as a probabilityp.[e] In more detail,p can be interpreted as the probability of one of two alternatives (the parameter of aBernoulli distribution);[f] the two alternatives are complementary, so the probability of the other alternative isq=1p{\displaystyle q=1-p} andp+q=1{\displaystyle p+q=1}. The two alternatives are coded as 1 and 0, corresponding to the limiting values asx±{\displaystyle x\to \pm \infty }.

In this interpretation the inputx is thelog-odds for the first alternative (relative to the second, measured in "logistic units" orlogits), and soex{\displaystyle e^{x}} is theodds for the first alternative (relative to the second). Given odds for an event ofO=O:1{\displaystyle O=O:1} (O{\displaystyle O} against1), the probability is the ratio of "for" over "for plus against",O/(O+1){\displaystyle O/(O+1)}. We see that the logistic function,ex/(ex+1)=1/(1+ex)=p{\displaystyle e^{x}/(e^{x}+1)=1/(1+e^{-x})=p}, is the probability of the first alternative.

Conversely,x is the log-oddsagainst the second alternative,x{\displaystyle -x} is the log-oddsfor the second alternative,ex{\displaystyle e^{-x}} is the odds for the second alternative, andex/(ex+1)=1/(1+ex)=q=1p{\displaystyle e^{-x}/(e^{-x}+1)=1/(1+e^{x})=q=1-p} is the probability of the second alternative.

This can be framed more symmetrically in terms of two inputs,x0{\displaystyle x_{0}} andx1{\displaystyle x_{1}}, which then generalizes naturally to more than two alternatives. Given two real number inputs,x0{\displaystyle x_{0}} andx1{\displaystyle x_{1}}, interpreted as logits, theirdifferencex1x0{\displaystyle x_{1}-x_{0}} is the log-odds for option 1 (the log-oddsagainst option 0),ex1x0{\displaystyle e^{x_{1}-x_{0}}} is the odds,ex1x0/(ex1x0+1)=1/(1+e(x1x0))=ex1/(ex0+ex1){\displaystyle e^{x_{1}-x_{0}}/(e^{x_{1}-x_{0}}+1)=1/\left(1+e^{-(x_{1}-x_{0})}\right)=e^{x_{1}}/(e^{x_{0}}+e^{x_{1}})} is the probability of option 1, and similarlyex0/(ex0+ex1){\displaystyle e^{x_{0}}/(e^{x_{0}}+e^{x_{1}})} is the probability of option 0.

This form immediately generalizes to more alternatives as thesoftmax function, which is a vector-valued function whosei-th coordinate isexi/i=0nexi{\textstyle e^{x_{i}}/\sum _{i=0}^{n}e^{x_{i}}}.

More subtly, the symmetric form emphasizes interpreting the inputx asx1x0{\displaystyle x_{1}-x_{0}} and thusrelative to some reference point, implicitly tox0=0{\displaystyle x_{0}=0}. Notably, the softmax function is invariant under adding a constant to all the logitsxi{\displaystyle x_{i}}, which corresponds to the differencexjxi{\displaystyle x_{j}-x_{i}} being the log-odds for optionj against optioni, but the individual logitsxi{\displaystyle x_{i}} not being log-odds on their own. Often one of the options is used as a reference ("pivot"), and its value fixed as0, so the other logits are interpreted as odds versus this reference. This is generally done with the first alternative, hence the choice of numbering:x0=0{\displaystyle x_{0}=0}, and thenxi=xix0{\displaystyle x_{i}=x_{i}-x_{0}} is the log-odds for optioni against option0. Sincee0=1{\displaystyle e^{0}=1}, this yields the+1{\displaystyle +1} term in many expressions for the logistic function and generalizations.[g]

Generalizations

[edit]

In growth modeling, numerous generalizations exist, including thegeneralized logistic curve, theGompertz function, thecumulative distribution function of theshifted Gompertz distribution, and thehyperbolastic function of type I.

In statistics, where the logistic function is interpreted as the probability of one of two alternatives, the generalization to three or more alternatives is thesoftmax function, which is vector-valued, as it gives the probability of each alternative.

Applications

[edit]

In ecology: modeling population growth

[edit]
Pierre-François Verhulst (1804–1849)

A typical application of the logistic equation is a common model ofpopulation growth (see alsopopulation dynamics), originally due toPierre-François Verhulst in 1838, where the rate of reproduction is proportional to both the existing population and the amount of available resources, all else being equal. The Verhulst equation was published after Verhulst had readThomas Malthus'An Essay on the Principle of Population, which describes theMalthusian growth model of simple (unconstrained) exponential growth. Verhulst derived his logistic equation to describe the self-limiting growth of abiological population. The equation was rediscovered in 1911 byA. G. McKendrick for the growth of bacteria in broth and experimentally tested using a technique for nonlinear parameter estimation.[13] The equation is also sometimes called theVerhulst-Pearl equation following its rediscovery in 1920 byRaymond Pearl (1879–1940) andLowell Reed (1888–1966) of theJohns Hopkins University.[14] Another scientist,Alfred J. Lotka derived the equation again in 1925, calling it thelaw of population growth.

LettingP{\displaystyle P} represent population size (N{\displaystyle N} is often used in ecology instead) andt{\displaystyle t} represent time, this model is formalized by thedifferential equation:

dPdt=rP(1PK),{\displaystyle {\frac {dP}{dt}}=rP\left(1-{\frac {P}{K}}\right),}

where the constantr{\displaystyle r} defines thegrowth rate andK{\displaystyle K} is thecarrying capacity.

In the equation, the early, unimpeded growth rate is modeled by the first term+rP{\displaystyle +rP}. The value of the rater{\displaystyle r} represents the proportional increase of the populationP{\displaystyle P} in one unit of time. Later, as the population grows, the modulus of the second term (which multiplied out isrP2/K{\displaystyle -rP^{2}/K}) becomes almost as large as the first, as some members of the populationP{\displaystyle P} interfere with each other by competing for some critical resource, such as food or living space. This antagonistic effect is called thebottleneck, and is modeled by the value of the parameterK{\displaystyle K}. The competition diminishes the combined growth rate, until the value ofP{\displaystyle P} ceases to grow (this is calledmaturity of the population).The solution to the equation (withP0{\displaystyle P_{0}} being the initial population) is

P(t)=KP0ertK+P0(ert1)=K1+(KP0P0)ert,{\displaystyle P(t)={\frac {KP_{0}e^{rt}}{K+P_{0}\left(e^{rt}-1\right)}}={\frac {K}{1+\left({\frac {K-P_{0}}{P_{0}}}\right)e^{-rt}}},}

where

limtP(t)=K,{\displaystyle \lim _{t\to \infty }P(t)=K,}

whereK{\displaystyle K} is the limiting value ofP{\displaystyle P}, the highest value that the population can reach given infinite time (or come close to reaching in finite time). The carrying capacity is asymptotically reached independently of the initial valueP(0)>0{\displaystyle P(0)>0}, and also in the case thatP(0)>K{\displaystyle P(0)>K}.

In ecology,species are sometimes referred to asr{\displaystyle r}-strategist orK{\displaystyle K}-strategist depending upon theselective processes that have shaped theirlife history strategies.Choosing the variable dimensions so thatn{\displaystyle n} measures the population in units of carrying capacity, andτ{\displaystyle \tau } measures time in units of1/r{\displaystyle 1/r}, gives the dimensionless differential equation

dndτ=n(1n).{\displaystyle {\frac {dn}{d\tau }}=n(1-n).}

Integral

[edit]

Theantiderivative of the ecological form of the logistic function can be computed by thesubstitutionu=K+P0(ert1){\displaystyle u=K+P_{0}\left(e^{rt}-1\right)}, sincedu=rP0ertdt{\displaystyle du=rP_{0}e^{rt}dt}

KP0ertK+P0(ert1)dt=Kr1udu=Krlnu+C=Krln(K+P0(ert1))+C{\displaystyle \int {\frac {KP_{0}e^{rt}}{K+P_{0}\left(e^{rt}-1\right)}}\,dt=\int {\frac {K}{r}}{\frac {1}{u}}\,du={\frac {K}{r}}\ln u+C={\frac {K}{r}}\ln \left(K+P_{0}(e^{rt}-1)\right)+C}

Time-varying carrying capacity

[edit]

Since the environmental conditions influence the carrying capacity, as a consequence it can be time-varying, withK(t)>0{\displaystyle K(t)>0}, leading to the following mathematical model:

dPdt=rP(1PK(t)).{\displaystyle {\frac {dP}{dt}}=rP\cdot \left(1-{\frac {P}{K(t)}}\right).}

A particularly important case is that of carrying capacity that varies periodically with periodT{\displaystyle T}:

K(t+T)=K(t).{\displaystyle K(t+T)=K(t).}

It can be shown[15] that in such a case, independently from the initial valueP(0)>0{\displaystyle P(0)>0},P(t){\displaystyle P(t)} will tend to a unique periodic solutionP(t){\displaystyle P_{*}(t)}, whose period isT{\displaystyle T}.

A typical value ofT{\displaystyle T} is one year: In such caseK(t){\displaystyle K(t)} may reflect periodical variations of weather conditions.

Another interesting generalization is to consider that the carrying capacityK(t){\displaystyle K(t)} is a function of the population at an earlier time, capturing a delay in the way population modifies its environment. This leads to a logistic delay equation,[16] which has a very rich behavior, with bistability in some parameter range, as well as a monotonic decay to zero, smooth exponential growth, punctuated unlimited growth (i.e., multiple S-shapes), punctuated growth or alternation to a stationary level, oscillatory approach to a stationary level, sustainable oscillations, finite-time singularities as well as finite-time death.

In statistics and machine learning

[edit]

Logistic functions are used in several roles in statistics. For example, they are thecumulative distribution function of thelogistic family of distributions, and they are, a bit simplified, used to model the chance a chess player has to beat their opponent in theElo rating system. More specific examples now follow.

Logistic regression

[edit]
Main article:Logistic regression

Logistic functions are used inlogistic regression to model how the probabilityp{\displaystyle p} of an event may be affected by one or moreexplanatory variables: an example would be to have the model

p=f(a+bx),{\displaystyle p=f(a+bx),}

wherex{\displaystyle x} is the explanatory variable,a{\displaystyle a} andb{\displaystyle b} are model parameters to be fitted, andf{\displaystyle f} is the standard logistic function.

Logistic regression and otherlog-linear models are also commonly used inmachine learning. A generalisation of the logistic function to multiple inputs is thesoftmax activation function, used inmultinomial logistic regression.

Another application of the logistic function is in theRasch model, used initem response theory. In particular, the Rasch model forms a basis formaximum likelihood estimation of the locations of objects or persons on acontinuum, based on collections ofcategorical data, for example the abilities of persons on a continuum based on responses that have been categorized as correct and incorrect.

Neural networks

[edit]

Logistic functions are often used inartificial neural networks to introducenonlinearity in the model or to clamp signals to within a specifiedinterval. A popularneural net element computes alinear combination of its input signals, and applies a bounded logistic function as theactivation function to the result; this model can be seen as a "smoothed" variant of the classicalthreshold neuron.

A common choice for the activation or "squashing" functions, used to clip large magnitudes to keep the response of the neural network bounded,[17] is

g(h)=11+e2βh,{\displaystyle g(h)={\frac {1}{1+e^{-2\beta h}}},}

which is a logistic function.

These relationships result in simplified implementations ofartificial neural networks withartificial neurons. Practitioners caution that sigmoidal functions which areantisymmetric about the origin (e.g. thehyperbolic tangent) lead to faster convergence when training networks withbackpropagation.[18]

The logistic function is itself the derivative of another proposed activation function, thesoftplus.

In medicine: modeling of growth of tumors

[edit]
See also:Gompertz curve § Growth of tumors

Another application of logistic curve is in medicine, where the logistic differential equation can be used to model the growth oftumors. This application can be considered an extension of the above-mentioned use in the framework of ecology (see also theGeneralized logistic curve, allowing for more parameters). Denoting withX(t){\displaystyle X(t)} the size of the tumor at timet{\displaystyle t}, its dynamics are governed by

X=r(1XK)X,{\displaystyle X'=r\left(1-{\frac {X}{K}}\right)X,}

which is of the type

X=F(X)X,F(X)0,{\displaystyle X'=F(X)X,\quad F'(X)\leq 0,}

whereF(X){\displaystyle F(X)} is the proliferation rate of the tumor.

If a course ofchemotherapy is started with a log-kill effect, the equation may be revised to be

X=r(1XK)Xc(t)X,{\displaystyle X'=r\left(1-{\frac {X}{K}}\right)X-c(t)X,}

wherec(t){\displaystyle c(t)} is the therapy-induced death rate. In the idealized case of very long therapy,c(t){\displaystyle c(t)} can be modeled as a periodic function (of periodT{\displaystyle T}) or (in case of continuous infusion therapy) as a constant function, and one has that

1T0Tc(t)dt>rlimt+x(t)=0,{\displaystyle {\frac {1}{T}}\int _{0}^{T}c(t)\,dt>r\to \lim _{t\to +\infty }x(t)=0,}

i.e. if the average therapy-induced death rate is greater than the baseline proliferation rate, then there is the eradication of the disease. Of course, this is an oversimplified model of both the growth and the therapy. For example, it does not take into account the evolution of clonal resistance, or the side-effects of the therapy on the patient. These factors can result in the eventual failure of chemotherapy, or its discontinuation.[citation needed]

In medicine: modeling of a pandemic

[edit]
Main article:Compartmental models in epidemiology

A novel infectious pathogen to which a population has no immunity will generally spread exponentially in the early stages, while the supply of susceptible individuals is plentiful. The SARS-CoV-2 virus that causesCOVID-19 exhibited exponential growth early in the course of infection in several countries in early 2020.[19] Factors including a lack of susceptible hosts (through the continued spread of infection until it passes the threshold forherd immunity) or reduction in the accessibility of potential hosts through physical distancing measures, may result in exponential-looking epidemic curves first linearizing (replicating the "logarithmic" to "logistic" transition first noted byPierre-François Verhulst, as noted above) and then reaching a maximal limit.[20]

A logistic function, or related functions (e.g. theGompertz function) are usually used in a descriptive or phenomenological manner because they fit well not only to the early exponential rise, but to the eventual levelling off of the pandemic as the population develops a herd immunity. This is in contrast to actual models of pandemics which attempt to formulate a description based on the dynamics of the pandemic (e.g. contact rates, incubation times, social distancing, etc.). Some simple models have been developed, however, which yield a logistic solution.[21][22][23]

Modeling early COVID-19 cases

[edit]
Generalized logistic function (Richards growth curve) in epidemiological modeling

Ageneralized logistic function, also called the Richards growth curve, has been applied to model the early phase of theCOVID-19 outbreak.[24] The authors fit the generalized logistic function to the cumulative number of infected cases, here referred to asinfection trajectory. There are different parameterizations of thegeneralized logistic function in the literature. One frequently used forms is

f(t;θ1,θ2,θ3,ξ)=θ1[1+ξexp(θ2(tθ3))]1/ξ{\displaystyle f(t;\theta _{1},\theta _{2},\theta _{3},\xi )={\frac {\theta _{1}}{{\left[1+\xi \exp \left(-\theta _{2}\cdot (t-\theta _{3})\right)\right]}^{1/\xi }}}}

whereθ1,θ2,θ3{\displaystyle \theta _{1},\theta _{2},\theta _{3}} are real numbers, andξ{\displaystyle \xi } is a positive real number. The flexibility of the curvef{\displaystyle f} is due to the parameterξ{\displaystyle \xi }: (i) ifξ=1{\displaystyle \xi =1} then the curve reduces to the logistic function, and (ii) asξ{\displaystyle \xi } approaches zero, the curve converges to theGompertz function. In epidemiological modeling,θ1{\displaystyle \theta _{1}},θ2{\displaystyle \theta _{2}}, andθ3{\displaystyle \theta _{3}} represent the final epidemic size, infection rate, and lag phase, respectively. See the right panel for an example infection trajectory when(θ1,θ2,θ3){\displaystyle (\theta _{1},\theta _{2},\theta _{3})} is set to(10000,0.2,40){\displaystyle (10000,0.2,40)}.

Extrapolated infection trajectories of 40 countries severely affected by COVID-19 and grand (population) average through May 14th

One of the benefits of using a growth function such as thegeneralized logistic function in epidemiological modeling is its relatively easy application to themultilevel model framework, where information from different geographic regions can be pooled together.

In chemistry: reaction models

[edit]

The concentration of reactants and products inautocatalytic reactions follow the logistic function.The degradation ofPlatinum group metal-free (PGM-free) oxygen reduction reaction (ORR) catalyst in fuel cell cathodes follows the logistic decay function,[25] suggesting an autocatalytic degradation mechanism.

In physics: Fermi–Dirac distribution

[edit]

The logistic function determines the statistical distribution of fermions over the energy states of a system in thermal equilibrium. In particular, it is the distribution of the probabilities that each possible energy level is occupied by a fermion, according toFermi–Dirac statistics.

In optics: mirage

[edit]

The logistic function also finds applications in optics, particularly in modelling phenomena such asmirages. Under certain conditions, such as the presence of a temperature or concentration gradient due to diffusion and balancing with gravity, logistic curve behaviours can emerge.[26][27]

A mirage, resulting from a temperature gradient that modifies the refractive index related to the density/concentration of the material over distance, can be modelled using a fluid with a refractive index gradient due to the concentration gradient. This mechanism can be equated to a limiting population growth model, where the concentrated region attempts to diffuse into the lower concentration region, while seeking equilibrium with gravity, thus yielding a logistic function curve.[26]

In material science: phase diagrams

[edit]

SeeDiffusion bonding.

In linguistics: language change

[edit]

In linguistics, the logistic function can be used to modellanguage change:[28] an innovation that is at first marginal begins to spread more quickly with time, and then more slowly as it becomes more universally adopted.

In agriculture: modeling crop response

[edit]

The logistic S-curve can be used for modeling the crop response to changes in growth factors. There are two types of response functions:positive andnegative growth curves. For example, the crop yield mayincrease with increasing value of the growth factor up to a certain level (positive function), or it maydecrease with increasing growth factor values (negative function owing to a negative growth factor), which situation requires aninverted S-curve.

S-curve model for crop yield versus depth ofwater table[29]
Inverted S-curve model for crop yield versussoil salinity[30]

In economics and sociology: diffusion of innovations

[edit]

The logistic function can be used to illustrate the progress of thediffusion of an innovation through its life cycle.

InThe Laws of Imitation (1890),Gabriel Tarde describes the rise and spread of new ideas through imitative chains. In particular, Tarde identifies three main stages through which innovations spread: the first one corresponds to the difficult beginnings, during which the idea has to struggle within a hostile environment full of opposing habits and beliefs; the second one corresponds to the properly exponential take-off of the idea, withf(x)=2x{\displaystyle f(x)=2^{x}}; finally, the third stage is logarithmic, withf(x)=log(x){\displaystyle f(x)=\log(x)}, and corresponds to the time when the impulse of the idea gradually slows down while, simultaneously new opponent ideas appear. The ensuing situation halts or stabilizes the progress of the innovation, which approaches an asymptote.

In asovereign state, the subnational units (constituent states or cities) may use loans to finance their projects. However, this funding source is usually subject to strict legal rules as well as to economyscarcity constraints, especially the resources the banks can lend (due to theirequity orBasel limits). These restrictions, which represent a saturation level, along with an exponential rush in aneconomic competition for money, create apublic finance diffusion of credit pleas and the aggregate national response is asigmoid curve.[31]

Historically, when new products are introduced there is an intense amount ofresearch and development which leads to dramatic improvements in quality and reductions in cost. This leads to a period of rapid industry growth. Some of the more famous examples are: railroads, incandescent light bulbs,electrification, cars and air travel. Eventually, dramatic improvement and cost reduction opportunities are exhausted, the product or process are in widespread use with few remaining potential new customers, and markets become saturated.

Logistic analysis was used in papers by several researchers at the International Institute of Applied Systems Analysis (IIASA). These papers deal with the diffusion of various innovations, infrastructures and energy source substitutions and the role of work in the economy as well as with the long economic cycle. Long economic cycles were investigated by Robert Ayres (1989).[32] Cesare Marchetti published onlong economic cycles and on diffusion of innovations.[33][34] Arnulf Grübler's book (1990) gives a detailed account of the diffusion of infrastructures including canals, railroads, highways and airlines, showing that their diffusion followed logistic shaped curves.[35]

Carlota Perez used a logistic curve to illustrate the long (Kondratiev) business cycle with the following labels: beginning of a technological era asirruption, the ascent asfrenzy, the rapid build out assynergy and the completion asmaturity.[36]

Inflection Point Determination in Logistic Growth Regression

[edit]

Logistic growth regressions carry significant uncertainty when data is available only up to around the inflection point of the growth process. Under these conditions, estimating the height at which the inflection point will occur may have uncertainties comparable to the carrying capacity (K) of the system.

A method to mitigate this uncertainty involves using the carrying capacity from a surrogate logistic growth process as a reference point.[37] By incorporating this constraint, even if K is only an estimate within a factor of two, the regression is stabilized, which improves accuracy and reduces uncertainty in the prediction parameters. This approach can be applied in fields such as economics and biology, where analogous surrogate systems or populations are available to inform the analysis.

Sequential analysis

[edit]

Link[38] created an extension ofWald's theory of sequential analysis to a distribution-free accumulation of random variables until either a positive or negative bound is first equaled or exceeded. Link[39] derives the probability of first equaling or exceeding the positive boundary as1/(1+eθA){\displaystyle 1/(1+e^{-\theta A})}, the logistic function. This is the first proof that the logistic function may have a stochastic process as its basis. Link[40] provides a century of examples of "logistic" experimental results and a newly derived relation between this probability and the time of absorption at the boundaries.

See also

[edit]

Notes

[edit]
  1. ^The paper was presented in 1844, and published in 1845: "(Lu à la séance du 30 novembre 1844)." "(Read at the session of 30 November 1844).", p. 1.
  2. ^Verhulst first refers to arithmeticprogression and geometricprogression, and refers to the geometric growth curve as alogarithmic curve (confusingly, the modern term is insteadexponential curve, which is the inverse). He then calls his curvelogistic, in contrast tologarithmic, and compares the logarithmic curve and logistic curve in the figure of his paper.
  3. ^In Ancient Greece,λογιστικός referred to practical computation and accounting, in contrast toἀριθμητική (arithmētikḗ), the theoretical or philosophical study of numbers. Confusingly, in English,arithmetic refers to practical computation, even though it derives fromἀριθμητική, notλογιστικός. See for exampleLouis Charles Karpinski,Nicomachus of Gerasa: Introduction to Arithmetic (1926) p. 3: "Arithmetic is fundamentally associated by modern readers, particularly by scientists and mathematicians, with the art of computation. For the ancient Greeks afterPythagoras, however, arithmetic was primarily a philosophical study, having no necessary connection with practical affairs. Indeed the Greeks gave a separate name to the arithmetic of business,λογιστική [accounting or practical logistic] ... In general the philosophers and mathematicians of Greece undoubtedly considered it beneath their dignity to treat of this branch, which probably formed a part of the elementary instruction of children."
  4. ^Usingt{\displaystyle t} for the parameter and(x,y){\displaystyle (x,y)} for the coordinates.
  5. ^This can be extended to theExtended real number line by settingf()=0{\displaystyle f(-\infty )=0} andf(+)=1{\displaystyle f(+\infty )=1}, matching the limit values.
  6. ^In fact, the logistic function is the inverse mapping to thenatural parameter of the Bernoulli distribution, namely thelogit function, and in this sense it is the "natural parametrization" of a binary probability.
  7. ^For example, thesoftplus function (the integral of the logistic function) is a smooth version ofmax(0,x){\displaystyle \max(0,x)}, while the relative form is a smooth form ofmax(x0,x1){\displaystyle \max(x_{0},x_{1})}, specificallyLogSumExp. Softplus thus generalizes as (note the 0 and the corresponding 1 for the reference class)LSE0+(x1,,xn):=LSE(0,x1,,xn)=ln(1+ex1++exn).{\displaystyle \operatorname {LSE_{0}} ^{+}(x_{1},\dots ,x_{n}):=\operatorname {LSE} (0,x_{1},\dots ,x_{n})=\ln(1+e^{x_{1}}+\cdots +e^{x_{n}}).}

References

[edit]
  1. ^abVerhulst, Pierre-François (1838)."Notice sur la loi que la population poursuit dans son accroissement"(PDF).Correspondance Mathématique et Physique (in French).10:113–121. Retrieved3 December 2014.
  2. ^"Sigmoid".PyTorch 1.10.1 documentation. Retrieved13 October 2025.
  3. ^"expit {clusterPower}".inside-R. Revolution Analytics. Archived fromthe original on 7 May 2016.
  4. ^"scipy.special.expit".SciPy v1.7.1 Manual. Retrieved13 October 2025.
  5. ^Cramer 2002, pp. 3–5.
  6. ^Verhulst, Pierre-François (1845)."Recherches mathématiques sur la loi d'accroissement de la population" [Mathematical Researches into the Law of Population Growth Increase].Nouveaux Mémoires de l'Académie Royale des Sciences et Belles-Lettres de Bruxelles.18:8. Retrieved18 February 2013.Nous donnerons le nom delogistique à la courbe [We will give the namelogistic to the curve]
  7. ^Verhulst, Pierre-François (1847)."Deuxième mémoire sur la loi d'accroissement de la population".Mémoires de l'Académie Royale des Sciences, des Lettres et des Beaux-Arts de Belgique.20:1–32.doi:10.3406/marb.1847.3457. Retrieved18 February 2013.
  8. ^Shulman, Bonnie (1998)."Math-alive! using original sources to teach mathematics in social context".PRIMUS.8 (March):1–14.doi:10.1080/10511979808965879.The diagram clinched it for me: there two curves labeled "Logistique" and "Logarithmique" are drawn on the same axes, and one can see that there is a region where they match almost exactly, and then diverge.
    I concluded that Verhulst's intention in naming the curve was indeed to suggest this comparison, and that "logistic" was meant to convey the curve's "log-like" quality.
  9. ^abTepic, J.; Tanackov, I.; Stojić, Gordan (2011)."Ancient logistics – historical timeline and etymology"(PDF).Technical Gazette.18 (3).S2CID 42097070. Archived fromthe original(PDF) on 9 March 2019.
  10. ^Baron de Jomini (1830).Tableau Analytique des principales combinaisons De La Guerre, Et De Leurs Rapports Avec La Politique Des États: Pour Servir D'Introduction Au Traité Des Grandes Opérations Militaires. p. 74.
  11. ^Raul Rojas.Neural Networks – A Systematic Introduction(PDF). Retrieved15 October 2016.
  12. ^Kocian, Alexander; Carmassi, Giulia; Cela, Fatjon; Incrocci, Luca; Milazzo, Paolo; Chessa, Stefano (7 June 2020)."Bayesian Sigmoid-Type Time Series Forecasting with Missing Data for Greenhouse Crops".Sensors.20 (11): 3246.Bibcode:2020Senso..20.3246K.doi:10.3390/s20113246.PMC 7309099.PMID 32517314.
  13. ^A. G. McKendricka; M. Kesava Paia1 (January 1912)."XLV.—The Rate of Multiplication of Micro-organisms: A Mathematical Study".Proceedings of the Royal Society of Edinburgh.31:649–653.doi:10.1017/S0370164600025426.{{cite journal}}: CS1 maint: numeric names: authors list (link)
  14. ^Raymond Pearl &Lowell Reed (June 1920)."On the Rate of Growth of the Population of the United States"(PDF).Proceedings of the National Academy of Sciences of the United States of America. Vol. 6, no. 6. p. 275.
  15. ^Griffiths, Graham; Schiesser, William (2009)."Linear and nonlinear waves".Scholarpedia.4 (7): 4308.Bibcode:2009SchpJ...4.4308G.doi:10.4249/scholarpedia.4308.ISSN 1941-6016.
  16. ^Yukalov, V. I.; Yukalova, E. P.; Sornette, D. (2009). "Punctuated evolution due to delayed carrying capacity".Physica D: Nonlinear Phenomena.238 (17):1752–1767.arXiv:0901.4714.Bibcode:2009PhyD..238.1752Y.doi:10.1016/j.physd.2009.05.011.S2CID 14456352.
  17. ^Gershenfeld 1999, p. 150.
  18. ^LeCun, Y.; Bottou, L.; Orr, G.; Muller, K. (1998)."Efficient BackProp"(PDF). In Orr, G.; Muller, K. (eds.).Neural Networks: Tricks of the trade. Springer.ISBN 3-540-65311-2. Archived fromthe original(PDF) on 31 August 2018. Retrieved16 September 2009.
  19. ^Worldometer: COVID-19 CORONAVIRUS PANDEMIC
  20. ^Villalobos-Arias, Mario (2020). "Using generalized logistics regression to forecast population infected by Covid-19".arXiv:2004.02406 [q-bio.PE].
  21. ^{{cite journal |last1=Postnikov |first1=Eugene B. |date=June 2020 |title=Estimation of COVID-19 dynamics "on a back-of-envelope": Does the simplest SIR model provide quantitative parameters and predictions? |url= |journal=Chaos, Solitons & Fractals |volume=135 |article-number=109841 |doi=10.1016/j.chaos.2020.109841 |pmid=32501369 |pmc=7252058

External links

[edit]
Wikimedia Commons has media related toLogistic functions.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Logistic_function&oldid=1316756392"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp