
Gradient

From Wikipedia, the free encyclopedia

Multivariate derivative (mathematics)

This article is about a generalized derivative of a multivariate function. For another use in mathematics, see Slope. For a similarly spelled unit of angle, see Gradian. For other uses, see Gradient (disambiguation).


The gradient, represented by the blue arrows, denotes the direction of greatest change of a scalar function. The values of the function are represented in greyscale and increase in value from white (low) to dark (high).


In vector calculus, the gradient of a scalar-valued differentiable function $f$ of several variables is the vector field (or vector-valued function) $\nabla f$ whose value at a point $p$ gives the direction and the rate of fastest increase. The gradient transforms like a vector under change of basis of the space of variables of $f$. If the gradient of a function is non-zero at a point $p$, the direction of the gradient is the direction in which the function increases most quickly from $p$, and the magnitude of the gradient is the rate of increase in that direction, the greatest absolute directional derivative.^[1] Further, a point where the gradient is the zero vector is known as a stationary point. The gradient thus plays a fundamental role in optimization theory, machine learning, and artificial intelligence, where it is used to minimize a function by gradient descent. In coordinate-free terms, the gradient of a function $f(\mathbf {r} )$ may be defined by:

$df=\nabla f\cdot d\mathbf {r}$

where $df$ is the total infinitesimal change in $f$ for an infinitesimal displacement $d\mathbf {r}$, and is seen to be maximal when $d\mathbf {r}$ is in the direction of the gradient $\nabla f$. The nabla symbol $\nabla$, written as an upside-down triangle and pronounced "del", denotes the vector differential operator.
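The relation $df=\nabla f\cdot d\mathbf {r}$ can be checked numerically. The following is a minimal sketch, assuming an illustrative field $f(x,y)=x^{2}+3y$ and hypothetical helper names (`first_order_change`, `best_direction`) that are not part of the article: it compares the first-order change with the actual change for a small displacement, then scans displacements of fixed length to find the direction in which $df$ is largest.

```python
import math

def f(x, y):
    # Illustrative scalar field (an assumption, not taken from the article).
    return x**2 + 3.0 * y

def grad_f(x, y):
    # Analytic gradient of the field above: (2x, 3).
    return (2.0 * x, 3.0)

def first_order_change(p, dr):
    # df = grad f . dr, the first-order change for a small displacement dr.
    g = grad_f(*p)
    return g[0] * dr[0] + g[1] * dr[1]

def best_direction(p, step=1e-3, samples=360):
    # Scan displacements of equal length, one per degree, and return the
    # angle (in radians) whose displacement gives the largest df.
    best_angle, best_df = 0.0, -math.inf
    for k in range(samples):
        angle = 2.0 * math.pi * k / samples
        dr = (step * math.cos(angle), step * math.sin(angle))
        df = first_order_change(p, dr)
        if df > best_df:
            best_angle, best_df = angle, df
    return best_angle

p = (1.0, 2.0)
dr = (1e-3, 0.0)
actual_change = f(p[0] + dr[0], p[1] + dr[1]) - f(*p)
predicted_change = first_order_change(p, dr)

g = grad_f(*p)
gradient_angle = math.atan2(g[1], g[0])
scanned_angle = best_direction(p)
```

Under these assumptions, `predicted_change` agrees with `actual_change` up to a term of second order in the step, and `scanned_angle` lands within one scan step (one degree) of `gradient_angle`.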

When a coordinate system is used in which the basis vectors are not functions of position, the gradient is given by the vector^[a] whose components are the partial derivatives of $f$ at $p$.^[2] That is, for $f\colon \mathbb {R} ^{n}\to \mathbb {R}$, its gradient $\nabla f\colon \mathbb {R} ^{n}\to \mathbb {R} ^{n}$ is defined at the point $p=(x_{1},\ldots ,x_{n})$ in $n$-dimensional space as the vector^[b]

$\nabla f(p)={\begin{bmatrix}{\frac {\partial f}{\partial x_{1}}}(p)\\\vdots \\{\frac {\partial f}{\partial x_{n}}}(p)\end{bmatrix}}$ .
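The component formula suggests a direct numerical approximation: estimate each partial derivative with a central difference. The sketch below is one way to do this, not a definitive implementation; the function name `numerical_gradient`, the step size `h`, and the test function `example` are assumptions chosen for illustration.

```python
import math

def numerical_gradient(f, p, h=1e-6):
    # Approximate each partial derivative of f at p with a central difference:
    # (f(p + h e_i) - f(p - h e_i)) / (2h), where e_i is the i-th basis vector.
    grad = []
    for i in range(len(p)):
        forward = list(p)
        backward = list(p)
        forward[i] += h
        backward[i] -= h
        grad.append((f(forward) - f(backward)) / (2.0 * h))
    return grad

def example(p):
    # Illustrative function f(x, y, z) = x^2 y + sin z,
    # whose exact gradient is (2xy, x^2, cos z).
    x, y, z = p
    return x**2 * y + math.sin(z)

approx = numerical_gradient(example, [1.0, 2.0, 0.0])
exact = [4.0, 1.0, 1.0]
```

The approximation is meaningful only where the function is differentiable; the step size trades truncation error against floating-point round-off.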

The gradient as defined above exists only if $f$ is differentiable at $p$. There are functions whose partial derivatives exist in every direction at a point but which fail to be differentiable there. Furthermore, the definition as the vector of partial derivatives is valid only when the basis of the coordinate system is orthonormal. For any other basis, the metric tensor at that point must be taken into account.
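As a standard illustration of the role of the basis (a worked formula added here, with the symbols $r$, $\theta$, $\hat {\mathbf {r} }$ and $\hat {\boldsymbol {\theta }}$ introduced for the purpose), consider plane polar coordinates. The coordinate basis is orthogonal but not normalized, so the gradient expressed in the unit vectors is not simply the vector of partial derivatives; the angular component acquires a factor $1/r$:

$\nabla f={\frac {\partial f}{\partial r}}\,{\hat {\mathbf {r} }}+{\frac {1}{r}}{\frac {\partial f}{\partial \theta }}\,{\hat {\boldsymbol {\theta }}}$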

For example, the function $f(x,y)={\frac {x^{2}y}{x^{2}+y^{2}}}$ for $(x,y)\neq (0,0)$, with $f(0,0)=0$, is not differentiable at the origin: it has no well-defined tangent plane there, despite having well-defined partial derivatives in every direction at the origin.^[3] In this particular example, under rotation of the x-y coordinate system, the above formula for the gradient fails to transform like a vector (the gradient becomes dependent on the choice of basis for the coordinate system) and also fails to point towards the 'steepest ascent' in some orientations. For differentiable functions, where the formula for the gradient holds, it can be shown to always transform as a vector under a transformation of the basis, so as to always point towards the fastest increase.
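The failure in this example can be seen numerically. The sketch below, with the hypothetical helper `directional_derivative_at_origin` and an arbitrarily chosen step `t`, estimates directional derivatives at the origin with a one-sided difference quotient. Both partial derivatives come out as zero, so the vector of partial derivatives predicts a zero rate of change in every direction, yet the rate along the diagonal is not zero.

```python
import math

def f(x, y):
    # The function from the text, with the value 0 assigned at the origin.
    if x == 0.0 and y == 0.0:
        return 0.0
    return x**2 * y / (x**2 + y**2)

def directional_derivative_at_origin(vx, vy, t=1e-6):
    # One-sided difference quotient (f(t v) - f(0, 0)) / t, with f(0, 0) = 0.
    return f(t * vx, t * vy) / t

# Partial derivatives at the origin (directions along the x and y axes).
partial_x = directional_derivative_at_origin(1.0, 0.0)
partial_y = directional_derivative_at_origin(0.0, 1.0)

# Unit vector along the diagonal y = x.
s = 1.0 / math.sqrt(2.0)
along_diagonal = directional_derivative_at_origin(s, s)

# What the "vector of partial derivatives" formula would predict.
predicted = partial_x * s + partial_y * s
```

Here `predicted` is zero, while `along_diagonal` is close to $1/(2{\sqrt {2}})$, since $f(ts,ts)=ts/2$ along the diagonal. The mismatch reflects the lack of differentiability at the origin.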

The gradient is dual to the total derivative $df$: the value of the gradient at a point is a tangent vector (a vector at each point), while the value of the derivative at a point is a cotangent vector (a linear functional on vectors).^[c] They are related in that the dot product of the gradient of $f$ at a point $p$ with another tangent vector $\mathbf {v}$ equals the directional derivative of $f$ at $p$ along $\mathbf {v}$; that is, ${\textstyle \nabla f(p)\cdot \mathbf {v} ={\frac {\partial f}{\partial \mathbf {v} }}(p)=df_{p}(\mathbf {v} )}$. The gradient admits multiple generalizations to more general functions on manifolds; see § Generalizations.
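For a differentiable function, the identity $\nabla f(p)\cdot \mathbf {v} =df_{p}(\mathbf {v} )$ can be checked with a short computation. The following sketch assumes an illustrative function $f(x,y)=e^{x}\sin y$, an arbitrary point and vector, and a hypothetical helper `directional_derivative` based on a central difference along $\mathbf {v}$.

```python
import math

def f(x, y):
    # Illustrative differentiable function (an assumption, not from the article).
    return math.exp(x) * math.sin(y)

def grad_f(x, y):
    # Analytic gradient: (e^x sin y, e^x cos y).
    return (math.exp(x) * math.sin(y), math.exp(x) * math.cos(y))

def directional_derivative(func, p, v, h=1e-6):
    # Central difference of func along v at p; v need not be a unit vector.
    forward = func(p[0] + h * v[0], p[1] + h * v[1])
    backward = func(p[0] - h * v[0], p[1] - h * v[1])
    return (forward - backward) / (2.0 * h)

p = (0.5, 1.0)
v = (3.0, -4.0)

g = grad_f(*p)
dot_product = g[0] * v[0] + g[1] * v[1]
numeric = directional_derivative(f, p, v)
```

The two values `dot_product` and `numeric` agree up to the discretization error of the difference quotient.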


Motivation

Notation

Definition

Cartesian coordinates

Cylindrical and spherical coordinates

General coordinates

Relationship with derivative

Relationship with total derivative

Differential or (exterior) derivative

Linear approximation to a function

Relationship with Fréchet derivative

Further properties and applications

Level sets

Conservative vector fields and the gradient theorem

Gradient is direction of steepest ascent

Generalizations

Jacobian

Gradient of a vector field

Riemannian manifolds

See also

Notes

References

Further reading

External links