Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Derivative

This is a good article. Click here for more information.
Page semi-protected
From Wikipedia, the free encyclopedia
Instantaneous rate of change (mathematics)
Not to be confused withDerivation (differential algebra).
For other uses, seeDerivative (disambiguation).

Thegraph of a function, drawn in black, and atangent line to that graph, drawn in red. Theslope of the tangent line is equal to the derivative of the function at the marked point.
The derivative at different points of a differentiable function. In this case, the derivative is equal tosin(x2)+2x2cos(x2){\displaystyle \sin \left(x^{2}\right)+2x^{2}\cos \left(x^{2}\right)}.
Part of a series of articles about
Calculus
abf(t)dt=f(b)f(a){\displaystyle \int _{a}^{b}f'(t)\,dt=f(b)-f(a)}

Inmathematics, thederivative is a fundamental tool that quantifies the sensitivity to change of afunction's output with respect to its input. The derivative of a function of a single variable at a chosen input value, when it exists, is theslope of thetangent line to thegraph of the function at that point. The tangent line is the bestlinear approximation of the function near that input value. For this reason, the derivative is often described as theinstantaneous rate of change, the ratio of the instantaneous change in the dependent variable to that of the independent variable.[1] The process of finding a derivative is calleddifferentiation.

There are multiple differentnotations for differentiation.Leibniz notation, named afterGottfried Wilhelm Leibniz, is represented as the ratio of twodifferentials, whereasprime notation is written by adding aprime mark. Higher order notations represent repeated differentiation, and they are usually denoted in Leibniz notation by adding superscripts to the differentials, and in prime notation by adding additional prime marks. Thehigher order derivatives can be applied in physics; for example, while the first derivative of the position of a moving object with respect totime is the object'svelocity, how the position changes as time advances, the second derivative is the object'sacceleration, how the velocity changes as time advances.

Derivatives can be generalized tofunctions of several real variables. In this case, the derivative is reinterpreted as alinear transformation whose graph is (after an appropriate translation) the best linear approximation to the graph of the original function. TheJacobian matrix is thematrix that represents this linear transformation with respect to the basis given by the choice of independent and dependent variables. It can be calculated in terms of thepartial derivatives with respect to the independent variables. For areal-valued function of several variables, the Jacobian matrix reduces to thegradient vector.

Definition

As a limit

Afunction of a real variablef(x){\displaystyle f(x)} isdifferentiable at a pointa{\displaystyle a} of itsdomain, if its domain contains anopen interval containinga{\displaystyle a}, and thelimitL=limh0f(a+h)f(a)h{\displaystyle L=\lim _{h\to 0}{\frac {f(a+h)-f(a)}{h}}}exists.[2] This means that, for every positivereal numberε{\displaystyle \varepsilon }, there exists a positive real numberδ{\displaystyle \delta } such that, for everyh{\displaystyle h} such that|h|<δ{\displaystyle |h|<\delta } andh0{\displaystyle h\neq 0} thenf(a+h){\displaystyle f(a+h)} is defined, and|Lf(a+h)f(a)h|<ε,{\displaystyle \left|L-{\frac {f(a+h)-f(a)}{h}}\right|<\varepsilon ,}where the vertical bars denote theabsolute value. This is an example of the(ε, δ)-definition of limit.[3]

If the functionf{\displaystyle f} is differentiable ata{\displaystyle a}, that is if the limitL{\displaystyle L} exists, then this limit is called thederivative off{\displaystyle f} ata{\displaystyle a}. Multiple notations for the derivative exist.[4] The derivative off{\displaystyle f} ata{\displaystyle a} can be denotedf(a){\displaystyle f'(a)}, read as "f{\displaystyle f} prime ofa{\displaystyle a}"; or it can be denoteddfdx(a){\displaystyle \textstyle {\frac {df}{dx}}(a)}, read as "the derivative off{\displaystyle f} with respect tox{\displaystyle x} ata{\displaystyle a}" or "df{\displaystyle df} by (or over)dx{\displaystyle dx} ata{\displaystyle a}". See§ Notation below. Iff{\displaystyle f} is a function that has a derivative at every point in itsdomain, then a function can be defined by mapping every pointx{\displaystyle x} to the value of the derivative off{\displaystyle f} atx{\displaystyle x}. This function is writtenf{\displaystyle f'} and is called thederivative function or thederivative off{\displaystyle f}. The functionf{\displaystyle f} sometimes has a derivative at most, but not all, points of its domain. The function whose value ata{\displaystyle a} equalsf(a){\displaystyle f'(a)} wheneverf(a){\displaystyle f'(a)} is defined and elsewhere is undefined is also called the derivative off{\displaystyle f}. It is still a function, but its domain may be smaller than the domain off{\displaystyle f}.[5]

For example, letf{\displaystyle f} be the squaring function:f(x)=x2{\displaystyle f(x)=x^{2}}. Then the quotient in the definition of the derivative is[6]f(a+h)f(a)h=(a+h)2a2h=a2+2ah+h2a2h=2a+h.{\displaystyle {\frac {f(a+h)-f(a)}{h}}={\frac {(a+h)^{2}-a^{2}}{h}}={\frac {a^{2}+2ah+h^{2}-a^{2}}{h}}=2a+h.}The division in the last step is valid as long ash0{\displaystyle h\neq 0}. The closerh{\displaystyle h} is to0{\displaystyle 0}, the closer this expression becomes to the value2a{\displaystyle 2a}. The limit exists, and for every inputa{\displaystyle a} the limit is2a{\displaystyle 2a}. So, the derivative of the squaring function is the doubling function:f(x)=2x{\displaystyle f'(x)=2x}.

The ratio in the definition of the derivative is the slope of the line through two points on the graph of the functionf{\displaystyle f}, specifically the points(a,f(a)){\displaystyle (a,f(a))} and(a+h,f(a+h)){\displaystyle (a+h,f(a+h))}. Ash{\displaystyle h} is made smaller, these points grow closer together, and the slope of this line approaches the limiting value, the slope of thetangent to the graph off{\displaystyle f} ata{\displaystyle a}. In other words, the derivative is the slope of the tangent.[7]

Using infinitesimals

One way to think of the derivativedfdx(a){\textstyle {\frac {df}{dx}}(a)} is as the ratio of aninfinitesimal change in the output of the functionf{\displaystyle f} to an infinitesimal change in its input.[8] In order to make this intuition rigorous, a system of rules for manipulating infinitesimal quantities is required.[9] The system ofhyperreal numbers is a way of treatinginfinite and infinitesimal quantities. The hyperreals are anextension of thereal numbers that contain numbers greater than anything of the form1+1++1{\displaystyle 1+1+\cdots +1} for any finite number of terms. Such numbers are infinite, and theirreciprocals are infinitesimals. The application of hyperreal numbers to the foundations of calculus is callednonstandard analysis. This provides a way to define the basic concepts of calculus such as the derivative and integral in terms of infinitesimals, thereby giving a precise meaning to thed{\displaystyle d} in the Leibniz notation. Thus, the derivative off(x){\displaystyle f(x)} becomesf(x)=st(f(x+dx)f(x)dx){\displaystyle f'(x)=\operatorname {st} \left({\frac {f(x+dx)-f(x)}{dx}}\right)} for an arbitrary infinitesimaldx{\displaystyle dx}, wherest{\displaystyle \operatorname {st} } denotes thestandard part function, which "rounds off" each finite hyperreal to the nearest real.[10] Taking the squaring functionf(x)=x2{\displaystyle f(x)=x^{2}} as an example again,f(x)=st(x2+2xdx+(dx)2x2dx)=st(2xdx+(dx)2dx)=st(2xdxdx+(dx)2dx)=st(2x+dx)=2x.{\displaystyle {\begin{aligned}f'(x)&=\operatorname {st} \left({\frac {x^{2}+2x\cdot dx+(dx)^{2}-x^{2}}{dx}}\right)\\&=\operatorname {st} \left({\frac {2x\cdot dx+(dx)^{2}}{dx}}\right)\\&=\operatorname {st} \left({\frac {2x\cdot dx}{dx}}+{\frac {(dx)^{2}}{dx}}\right)\\&=\operatorname {st} \left(2x+dx\right)\\&=2x.\end{aligned}}}

Continuity and differentiability

This function does not have a derivative at the marked point, as the function is not continuous there (specifically, it has ajump discontinuity).
The absolute value function is continuous but fails to be differentiable atx = 0 since the tangent slopes do not approach the same value from the left as they do from the right.

Iff{\displaystyle f} isdifferentiable ata{\displaystyle a}, thenf{\displaystyle f} must also becontinuous ata{\displaystyle a}.[11] As an example, choose a pointa{\displaystyle a} and letf{\displaystyle f} be thestep function that returns the value 1 for allx{\displaystyle x} less thana{\displaystyle a}, and returns a different value 10 for allx{\displaystyle x} greater than or equal toa{\displaystyle a}. The functionf{\displaystyle f} cannot have a derivative ata{\displaystyle a}. Ifh{\displaystyle h} is negative, thena+h{\displaystyle a+h} is on the low part of the step, so the secant line froma{\displaystyle a} toa+h{\displaystyle a+h} is very steep; ash{\displaystyle h} tends to zero, the slope tends to infinity. Ifh{\displaystyle h} is positive, thena+h{\displaystyle a+h} is on the high part of the step, so the secant line froma{\displaystyle a} toa+h{\displaystyle a+h} has slope zero. Consequently, the secant lines do not approach any single slope, so the limit of the difference quotient does not exist. However, even if a function is continuous at a point, it may not be differentiable there. For example, theabsolute value function given byf(x)=|x|{\displaystyle f(x)=|x|} is continuous atx=0{\displaystyle x=0}, but it is not differentiable there. Ifh{\displaystyle h} is positive, then the slope of the secant line from 0 toh{\displaystyle h} is one; ifh{\displaystyle h} is negative, then the slope of the secant line from0{\displaystyle 0} toh{\displaystyle h} is1{\displaystyle -1}.[12] This can be seen graphically as a "kink" or a "cusp" in the graph atx=0{\displaystyle x=0}. Even a function with a smooth graph is not differentiable at a point where itstangent is vertical: For instance, the function given byf(x)=x1/3{\displaystyle f(x)=x^{1/3}} is not differentiable atx=0{\displaystyle x=0}. In summary, a function that has a derivative is continuous, but there are continuous functions that do not have a derivative.[13]

Most functions that occur in practice have derivatives at all points oralmost every point. Early in thehistory of calculus, many mathematicians assumed that a continuous function was differentiable at most points.[14] Under mild conditions (for example, if the function is amonotone or aLipschitz function), this is true. However, in 1872, Weierstrass found the first example of a function that is continuous everywhere but differentiable nowhere. This example is now known as theWeierstrass function.[15] In 1931,Stefan Banach proved that the set of functions that have a derivative at some point is ameager set in the space of all continuous functions. Informally, this means that hardly any random continuous functions have a derivative at even one point.[16]

Notation

Main article:Notation for differentiation

One common way of writing the derivative of a function isLeibniz notation, introduced byGottfried Wilhelm Leibniz in 1675, which denotes a derivative as the quotient of twodifferentials, such asdy{\displaystyle dy} anddx{\displaystyle dx}.[17] It is still commonly used when the equationy=f(x){\displaystyle y=f(x)} is viewed as a functional relationship betweendependent and independent variables. The first derivative is denoted bydydx{\displaystyle \textstyle {\frac {dy}{dx}}}, read as "the derivative ofy{\displaystyle y} with respect tox{\displaystyle x}".[18] This derivative can alternately be treated as the application of adifferential operator to a function,dydx=ddxf(x).{\textstyle {\frac {dy}{dx}}={\frac {d}{dx}}f(x).} Higher derivatives are expressed using the notationdnydxn{\textstyle {\frac {d^{n}y}{dx^{n}}}} for then{\displaystyle n}-th derivative ofy=f(x){\displaystyle y=f(x)}. These are abbreviations for multiple applications of the derivative operator; for example,d2ydx2=ddx(ddxf(x)).{\textstyle {\frac {d^{2}y}{dx^{2}}}={\frac {d}{dx}}{\Bigl (}{\frac {d}{dx}}f(x){\Bigr )}.}[19] Unlike some alternatives, Leibniz notation involves explicit specification of the variable for differentiation, in the denominator, which removes ambiguity when working with multiple interrelated quantities. The derivative of acomposed function can be expressed using thechain rule: ifu=g(x){\displaystyle u=g(x)} andy=f(g(x)){\displaystyle y=f(g(x))} thendydx=dydududx.{\textstyle {\frac {dy}{dx}}={\frac {dy}{du}}\cdot {\frac {du}{dx}}.}[20]

Another common notation for differentiation is by using theprime mark in the symbol of a functionf(x){\displaystyle f(x)}. This notation, due toJoseph-Louis Lagrange, is now known asprime notation.[21] The first derivative is written asf(x){\displaystyle f'(x)}, read as "f{\displaystyle f} prime ofx{\displaystyle x}", ory{\displaystyle y'}, read as "y{\displaystyle y} prime".[22] Similarly, the second and the third derivatives can be written asf{\displaystyle f''} andf{\displaystyle f'''}, respectively.[23] For denoting the number of higher derivatives beyond this point, some authors use Roman numerals insuperscript, whereas others place the number in parentheses, such asfiv{\displaystyle f^{\mathrm {iv} }} orf(4){\displaystyle f^{(4)}}.[24] The latter notation generalizes to yield the notationf(n){\displaystyle f^{(n)}} for then{\displaystyle n}th derivative off{\displaystyle f}.[19]

InNewton's notation or thedot notation, a dot is placed over a symbol to represent a time derivative. Ify{\displaystyle y} is a function oft{\displaystyle t}, then the first and second derivatives can be written asy˙{\displaystyle {\dot {y}}} andy¨{\displaystyle {\ddot {y}}}, respectively. This notation is used exclusively for derivatives with respect to time orarc length. It is typically used indifferential equations inphysics anddifferential geometry.[25] However, the dot notation becomes unmanageable for high-order derivatives (of order 4 or more) and cannot deal with multiple independent variables.

Another notation isD-notation, which represents the differential operator by the symbolD{\displaystyle D}.[19] The first derivative is writtenDf(x){\displaystyle Df(x)} and higher derivatives are written with a superscript, so then{\displaystyle n}-th derivative isDnf(x){\displaystyle D^{n}f(x)}. This notation is sometimes calledEuler notation, although it seems thatLeonhard Euler did not use it, and the notation was introduced byLouis François Antoine Arbogast.[26] To indicate a partial derivative, the variable differentiated by is indicated with a subscript, for example given the functionu=f(x,y){\displaystyle u=f(x,y)}, its partial derivative with respect tox{\displaystyle x} can be writtenDxu{\displaystyle D_{x}u} orDxf(x,y){\displaystyle D_{x}f(x,y)}. Higher partial derivatives can be indicated by superscripts or multiple subscripts, e.g.Dxyf(x,y)=y(xf(x,y)){\textstyle D_{xy}f(x,y)={\frac {\partial }{\partial y}}{\Bigl (}{\frac {\partial }{\partial x}}f(x,y){\Bigr )}} andDx2f(x,y)=x(xf(x,y)){\displaystyle \textstyle D_{x}^{2}f(x,y)={\frac {\partial }{\partial x}}{\Bigl (}{\frac {\partial }{\partial x}}f(x,y){\Bigr )}}.[27]

Rules of computation

Main article:Differentiation rules

In principle, the derivative of a function can be computed from the definition by considering the difference quotient and computing its limit. Once the derivatives of a few simple functions are known, the derivatives of other functions are more easily computed usingrules for obtaining derivatives of more complicated functions from simpler ones. This process of finding a derivative is known asdifferentiation.[28]

Rules for basic functions

The following are the rules for the derivatives of the most common basic functions. Here,a{\displaystyle a} is a real number, ande{\displaystyle e} isthe base of the natural logarithm, approximately2.71828.[29]

Rules for combined functions

Given that thef{\displaystyle f} andg{\displaystyle g} are the functions. The following are some of the most basic rules for deducing the derivative of functions from derivatives of basic functions.[30]

Computation example

The derivative of the function given byf(x)=x4+sin(x2)ln(x)ex+7{\displaystyle f(x)=x^{4}+\sin \left(x^{2}\right)-\ln(x)e^{x}+7} isf(x)=4x(41)+d(x2)dxcos(x2)d(lnx)dxexln(x)d(ex)dx+0=4x3+2xcos(x2)1xexln(x)ex.{\displaystyle {\begin{aligned}f'(x)&=4x^{(4-1)}+{\frac {d\left(x^{2}\right)}{dx}}\cos \left(x^{2}\right)-{\frac {d\left(\ln {x}\right)}{dx}}e^{x}-\ln(x){\frac {d\left(e^{x}\right)}{dx}}+0\\&=4x^{3}+2x\cos \left(x^{2}\right)-{\frac {1}{x}}e^{x}-\ln(x)e^{x}.\end{aligned}}}Here the second term was computed using thechain rule and the third term using theproduct rule. The known derivatives of the elementary functionsx2{\displaystyle x^{2}},x4{\displaystyle x^{4}},sin(x){\displaystyle \sin(x)},ln(x){\displaystyle \ln(x)}, andexp(x)=ex{\displaystyle \exp(x)=e^{x}}, as well as the constant7{\displaystyle 7}, were also used.

Antidifferentiation

Main article:Antiderivative

Anantiderivative of a functionf{\displaystyle f} is a function whose derivative isf{\displaystyle f}. Antiderivatives are not unique: ifA{\displaystyle A} is an antiderivative off{\displaystyle f}, then so isA+c{\displaystyle A+c}, wherec{\displaystyle c} is any constant, because the derivative of a constant is zero.[31] Thefundamental theorem of calculus shows that finding an antiderivative of a function gives a way to compute the areas of shapes bounded by that function. More precisely, theintegral of a function over aclosed interval is equal to the difference between the values of an antiderivative evaluated at the endpoints of that interval.[32]

Higher-order derivatives

Higher order derivatives are the result of differentiating a function repeatedly. Given thatf{\displaystyle f} is a differentiable function, the derivative off{\displaystyle f} is the first derivative, denoted asf{\displaystyle f'}. The derivative off{\displaystyle f'} is thesecond derivative, denoted asf{\displaystyle f''}, and the derivative off{\displaystyle f''} is thethird derivative, denoted asf{\displaystyle f'''}. By continuing this process, if it exists, then{\displaystyle n}th derivative is the derivative of the(n1){\displaystyle (n-1)}th derivative or thederivative of ordern{\displaystyle n}. As has beendiscussed above, the generalization of derivative of a functionf{\displaystyle f} may be denoted asf(n){\displaystyle f^{(n)}}.[33] A function that hask{\displaystyle k} successive derivatives is calledk{\displaystyle k} times differentiable. If thek{\displaystyle k}-th derivative is continuous, then the function is said to be ofdifferentiability classCk{\displaystyle C^{k}}.[34] A function that has infinitely many derivatives is calledinfinitely differentiable orsmooth.[35] Anypolynomial function is infinitely differentiable; taking derivatives repeatedly will eventually result in aconstant function, and all subsequent derivatives of that function are zero.[36]

Oneapplication of higher-order derivatives is inphysics. Suppose that a function represents the position of an object at the time. The first derivative of that function is thevelocity of an object with respect to time, the second derivative of the function is theacceleration of an object with respect to time,[28] and the third derivative is thejerk.[37]

In other dimensions

See also:Vector calculus andMultivariable calculus

Vector-valued functions

Avector-valued functiony{\displaystyle \mathbf {y} } of a real variable sends real numbers to vectors in somevector spaceRn{\displaystyle \mathbb {R} ^{n}}. A vector-valued function can be split up into its coordinate functionsy1(t),y2(t),,yn(t){\displaystyle y_{1}(t),y_{2}(t),\dots ,y_{n}(t)}, meaning thaty=(y1(t),y2(t),,yn(t)){\displaystyle \mathbf {y} =(y_{1}(t),y_{2}(t),\dots ,y_{n}(t))}. This includes, for example,parametric curves inR2{\displaystyle \mathbb {R} ^{2}} orR3{\displaystyle \mathbb {R} ^{3}}. The coordinate functions are real-valued functions, so the above definition of derivative applies to them. The derivative ofy(t){\displaystyle \mathbf {y} (t)} is defined to be thevector, called thetangent vector, whose coordinates are the derivatives of the coordinate functions. That is,[38]y(t)=limh0y(t+h)y(t)h,{\displaystyle \mathbf {y} '(t)=\lim _{h\to 0}{\frac {\mathbf {y} (t+h)-\mathbf {y} (t)}{h}},}if the limit exists. The subtraction in the numerator is the subtraction of vectors, not scalars. If the derivative ofy{\displaystyle \mathbf {y} } exists for every value oft{\displaystyle t}, theny{\displaystyle \mathbf {y} '} is another vector-valued function.[38]

Partial derivatives

Main article:Partial derivative

Functions can depend uponmore than one variable. Apartial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant. Partial derivatives are used invector calculus anddifferential geometry. As with ordinary derivatives, multiple notations exist: the partial derivative of a functionf(x,y,){\displaystyle f(x,y,\dots )} with respect to the variablex{\displaystyle x} is variously denoted by

fx{\displaystyle f_{x}},fx{\displaystyle f'_{x}},xf{\displaystyle \partial _{x}f},xf{\displaystyle {\frac {\partial }{\partial x}}f}, orfx{\displaystyle {\frac {\partial f}{\partial x}}},

among other possibilities.[39] It can be thought of as the rate of change of the function in thex{\displaystyle x}-direction.[40] Here is a roundedd called thepartial derivative symbol. To distinguish it from the letterd, ∂ is sometimes pronounced "der", "del", or "partial" instead of "dee".[41] For example, letf(x,y)=x2+xy+y2{\displaystyle f(x,y)=x^{2}+xy+y^{2}}, then the partial derivative of functionf{\displaystyle f} with respect to both variablesx{\displaystyle x} andy{\displaystyle y} are, respectively:fx=2x+y,fy=x+2y.{\displaystyle {\frac {\partial f}{\partial x}}=2x+y,\qquad {\frac {\partial f}{\partial y}}=x+2y.}In general, the partial derivative of a functionf(x1,,xn){\displaystyle f(x_{1},\dots ,x_{n})} in the directionxi{\displaystyle x_{i}} at the point(a1,,an){\displaystyle (a_{1},\dots ,a_{n})} is defined to be:[42]fxi(a1,,an)=limh0f(a1,,ai+h,,an)f(a1,,ai,,an)h.{\displaystyle {\frac {\partial f}{\partial x_{i}}}(a_{1},\ldots ,a_{n})=\lim _{h\to 0}{\frac {f(a_{1},\ldots ,a_{i}+h,\ldots ,a_{n})-f(a_{1},\ldots ,a_{i},\ldots ,a_{n})}{h}}.}

This is fundamental for the study of thefunctions of several real variables. Letf(x1,,xn){\displaystyle f(x_{1},\dots ,x_{n})} be such areal-valued function. If all partial derivativesf{\displaystyle f} with respect toxj{\displaystyle x_{j}} are defined at the point(a1,,an){\displaystyle (a_{1},\dots ,a_{n})}, these partial derivatives define the vectorf(a1,,an)=(fx1(a1,,an),,fxn(a1,,an)),{\displaystyle \nabla f(a_{1},\ldots ,a_{n})=\left({\frac {\partial f}{\partial x_{1}}}(a_{1},\ldots ,a_{n}),\ldots ,{\frac {\partial f}{\partial x_{n}}}(a_{1},\ldots ,a_{n})\right),}which is called thegradient off{\displaystyle f} ata{\displaystyle a}. Iff{\displaystyle f} is differentiable at every point in some domain, then the gradient is avector-valued functionf{\displaystyle \nabla f} that maps the point(a1,,an){\displaystyle (a_{1},\dots ,a_{n})} to the vectorf(a1,,an){\displaystyle \nabla f(a_{1},\dots ,a_{n})}. Consequently, the gradient determines avector field.[43]

Directional derivatives

Main article:Directional derivative

Iff{\displaystyle f} is a real-valued function onRn{\displaystyle \mathbb {R} ^{n}}, then the partial derivatives off{\displaystyle f} measure its variation in the direction of the coordinate axes. For example, iff{\displaystyle f} is a function ofx{\displaystyle x} andy{\displaystyle y}, then its partial derivatives measure the variation inf{\displaystyle f} in thex{\displaystyle x} andy{\displaystyle y} direction. However, they do not directly measure the variation off{\displaystyle f} in any other direction, such as along the diagonal liney=x{\displaystyle y=x}. These are measured using directional derivatives. Given a vectorv=(v1,,vn){\displaystyle \mathbf {v} =(v_{1},\ldots ,v_{n})}, then thedirectional derivative off{\displaystyle f} in the direction ofv{\displaystyle \mathbf {v} } at the pointx{\displaystyle \mathbf {x} } is:[44]Dvf(x)=limh0f(x+hv)f(x)h.{\displaystyle D_{\mathbf {v} }{f}(\mathbf {x} )=\lim _{h\rightarrow 0}{\frac {f(\mathbf {x} +h\mathbf {v} )-f(\mathbf {x} )}{h}}.}

If all the partial derivatives off{\displaystyle f} exist and are continuous atx{\displaystyle \mathbf {x} }, then they determine the directional derivative off{\displaystyle f} in the directionv{\displaystyle \mathbf {v} } by the formula:[45]Dvf(x)=j=1nvjfxj.{\displaystyle D_{\mathbf {v} }{f}(\mathbf {x} )=\sum _{j=1}^{n}v_{j}{\frac {\partial f}{\partial x_{j}}}.}

Total derivative and Jacobian matrix

Main article:Total derivative

Whenf{\displaystyle f} is a function from an open subset ofRn{\displaystyle \mathbb {R} ^{n}} toRm{\displaystyle \mathbb {R} ^{m}}, then the directional derivative off{\displaystyle f} in a chosen direction is the best linear approximation tof{\displaystyle f} at that point and in that direction. However, whenn>1{\displaystyle n>1}, no single directional derivative can give a complete picture of the behavior off{\displaystyle f}. The total derivative gives a complete picture by considering all directions at once. That is, for any vectorv{\displaystyle \mathbf {v} } starting ata{\displaystyle \mathbf {a} }, the linear approximation formula holds:[46]f(a+v)f(a)+f(a)v.{\displaystyle f(\mathbf {a} +\mathbf {v} )\approx f(\mathbf {a} )+f'(\mathbf {a} )\mathbf {v} .}Similarly with the single-variable derivative,f(a){\displaystyle f'(\mathbf {a} )} is chosen so that the error in this approximation is as small as possible. The total derivative off{\displaystyle f} ata{\displaystyle \mathbf {a} } is the unique linear transformationf(a):RnRm{\displaystyle f'(\mathbf {a} )\colon \mathbb {R} ^{n}\to \mathbb {R} ^{m}} such that[46]limh0f(a+h)(f(a)+f(a)h)h=0.{\displaystyle \lim _{\mathbf {h} \to 0}{\frac {\lVert f(\mathbf {a} +\mathbf {h} )-(f(\mathbf {a} )+f'(\mathbf {a} )\mathbf {h} )\rVert }{\lVert \mathbf {h} \rVert }}=0.}Hereh{\displaystyle \mathbf {h} } is a vector inRn{\displaystyle \mathbb {R} ^{n}}, so the norm in the denominator is the standard length onRn{\displaystyle \mathbb {R} ^{n}}. However,f(a)h{\displaystyle f'(\mathbf {a} )\mathbf {h} } is a vector inRm{\displaystyle \mathbb {R} ^{m}}, and the norm in the numerator is the standard length onRm{\displaystyle \mathbb {R} ^{m}}.[46] Ifv{\displaystyle v} is a vector starting ata{\displaystyle a}, thenf(a)v{\displaystyle f'(\mathbf {a} )\mathbf {v} } is called thepushforward ofv{\displaystyle \mathbf {v} } byf{\displaystyle f}.[47]

If the total derivative exists ata{\displaystyle \mathbf {a} }, then all the partial derivatives and directional derivatives off{\displaystyle f} exist ata{\displaystyle \mathbf {a} }, and for allv{\displaystyle \mathbf {v} },f(a)v{\displaystyle f'(\mathbf {a} )\mathbf {v} } is the directional derivative off{\displaystyle f} in the directionv{\displaystyle \mathbf {v} }. Iff{\displaystyle f} is written using coordinate functions, so thatf=(f1,f2,,fm){\displaystyle f=(f_{1},f_{2},\dots ,f_{m})}, then the total derivative can be expressed using the partial derivatives as amatrix. This matrix is called theJacobian matrix off{\displaystyle f} ata{\displaystyle \mathbf {a} }:[48]f(a)=Jaca=(fixj)ij.{\displaystyle f'(\mathbf {a} )=\operatorname {Jac} _{\mathbf {a} }=\left({\frac {\partial f_{i}}{\partial x_{j}}}\right)_{ij}.}

Generalizations

Main article:Generalizations of the derivative

The concept of a derivative can be extended to many other settings. The common thread is that the derivative of a function at a point serves as alinear approximation of the function at that point.

See also

Notes

  1. ^Apostol 1967, p. 160;Stewart 2002, pp. 129–130;Strang et al. 2023, p. 224.
  2. ^Apostol 1967, p. 160;Stewart 2002, p. 127;Strang et al. 2023, p. 220.
  3. ^Gonick 2012, p. 83;Thomas et al. 2014, p. 60.
  4. ^Gonick 2012, p. 88;Strang et al. 2023, p. 234.
  5. ^Gonick 2012, p. 83;Strang et al. 2023, p. 232.
  6. ^Gonick 2012, pp. 77–80.
  7. ^Thompson 1998, pp. 34, 104;Stewart 2002, p. 128.
  8. ^Thompson 1998, pp. 84–85.
  9. ^Keisler 2012, pp. 902–904.
  10. ^Keisler 2012, p. 45;Henle & Kleinberg 2003, p. 66.
  11. ^Gonick 2012, p. 156;Thomas et al. 2014, p. 114;Strang et al. 2023, p. 237.
  12. ^Gonick 2012, p. 149;Thomas et al. 2014, p. 113;Strang et al. 2023, p. 237.
  13. ^Gonick 2012, p. 156;Thomas et al. 2014, p. 114;Strang et al. 2023, pp. 237–238.
  14. ^Jašek 1922;Jarník 1922;Rychlík 1923.
  15. ^David 2018.
  16. ^Banach 1931, cited inHewitt & Stromberg 1965.
  17. ^Apostol 1967, p. 172;Cajori 2007, p. 204.
  18. ^Moore & Siegel 2013, p. 110.
  19. ^abcVarberg, Purcell & Rigdon 2007, pp. 125–126.
  20. ^In the formulation of calculus in terms of limits, various authors have assigned thedu{\displaystyle du} symbol various meanings. Some authors such asVarberg, Purcell & Rigdon 2007, p. 119 andStewart 2002, p. 177 do not assign a meaning todu{\displaystyle du} by itself, but only as part of the symboldudx{\textstyle {\frac {du}{dx}}}. Others definedx{\displaystyle dx} as an independent variable, and definedu{\displaystyle du} bydu=dxf(x){\displaystyle \textstyle du=dxf'(x)}.Innon-standard analysisdu{\displaystyle du} is defined as an infinitesimal. It is also interpreted as theexterior derivative of a functionu{\displaystyle u}. Seedifferential (infinitesimal) for further information.
  21. ^Schwartzman 1994, p. 171;Cajori 1923, pp. 6–7, 10–12, 21–24.
  22. ^Moore & Siegel 2013, p. 110;Goodman 1963, pp. 78–79.
  23. ^Varberg, Purcell & Rigdon 2007, pp. 125–126;Cajori 2007, p. 228.
  24. ^Choudary & Niculescu 2014, p. 222;Apostol 1967, p. 171.
  25. ^Evans 1999, p. 63;Kreyszig 1991, p. 1.
  26. ^Cajori 1923.
  27. ^Apostol 1967, p. 172;Varberg, Purcell & Rigdon 2007, pp. 125–126.
  28. ^abApostol 1967, p. 160.
  29. ^Varberg, Purcell & Rigdon 2007. See p. 133 for the power rule, pp. 115–116 for the trigonometric functions, p. 326 for the natural logarithm, pp. 338–339 for exponential with basee{\displaystyle e}, p. 343 for the exponential with basea{\displaystyle a}, p. 344 for the logarithm with basea{\displaystyle a}, and p. 369 for the inverse of trigonometric functions.
  30. ^ For constant rule and sum rule, seeApostol 1967, pp. 161, 164, respectively. For the product rule, quotient rule, and chain rule, seeVarberg, Purcell & Rigdon 2007, pp. 111–112, 119, respectively. For the special case of the product rule, that is, the product of a constant and a function, seeVarberg, Purcell & Rigdon 2007, pp. 108–109.
  31. ^Strang et al. 2023, pp. 485–486.
  32. ^Strang et al. 2023, pp. 552–559.
  33. ^Apostol 1967, p. 160;Varberg, Purcell & Rigdon 2007, pp. 125–126.
  34. ^Warner 1983, p. 5.
  35. ^Debnath & Shah 2015, p. 40.
  36. ^Carothers 2000, p. 176.
  37. ^Stewart 2002, p. 193.
  38. ^abStewart 2002, p. 893.
  39. ^Stewart 2002, p. 947;Christopher 2013, p. 682.
  40. ^Stewart 2002, p. 949.
  41. ^Silverman 1989, p. 216;Bhardwaj 2005, Seep. 6.4.
  42. ^Mathai & Haubold 2017, p. 52.
  43. ^Gbur 2011, pp. 36–37.
  44. ^Varberg, Purcell & Rigdon 2007, p. 642.
  45. ^Guzman 2003, p. 35.
  46. ^abcDavvaz 2023, p. 266.
  47. ^Lee 2013, p. 72.
  48. ^Davvaz 2023, p. 267.
  49. ^Roussos 2014, p. 303.
  50. ^Gbur 2011, pp. 261–264.
  51. ^Gray, Abbena & Salamon 2006, p. 826.
  52. ^Azegami 2020. See p.209 for the Gateaux derivative, and p.211 for the Fréchet derivative.
  53. ^Funaro 1992, pp. 84–85.
  54. ^Kolchin 1973, pp. 58,126.
  55. ^Georgiev 2018, p. 8.
  56. ^Barbeau 1961.

References

External links

Differentiation at Wikipedia'ssister projects
Precalculus
Limits
Differential calculus
Integral calculus
Vector calculus
Multivariable calculus
Sequences and series
Special functions
and numbers
History of calculus
Lists
Integrals
Miscellaneous topics
Major topics inmathematical analysis
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Derivative&oldid=1298519545"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp