The partial derivative of a function with respect to the variable is variously denoted by
,,,,,, or.
It can be thought of as the rate of change of the function in the-direction.
Sometimes, for, the partial derivative of with respect to is denoted as Since a partial derivative generally has the same arguments as the original function, its functional dependence is sometimes explicitly signified by the notation, such as in:
Like ordinary derivatives, the partial derivative is defined as alimit. LetU be anopen subset of and a function. The partial derivative off at the point with respect to thei-th variablexi is defined as
Where is theunit vector ofi-th variablexi. Even if all partial derivatives exist at a given pointa, the function need not becontinuous there. However, if all partial derivatives exist in aneighborhood ofa and are continuous there, thenf istotally differentiable in that neighborhood and the total derivative is continuous. In this case, it is said thatf is aC1 function. This can be used to generalize for vector valued functions,, by carefully using a componentwise argument.
The partial derivative can be seen as another function defined onU and can again be partially differentiated. If the direction of derivative isnot repeated, it is called amixed partial derivative. If all mixed second order partial derivatives are continuous at a point (or on a set),f is termed aC2 function at that point (or on that set); in this case, the partial derivatives can be exchanged byClairaut's theorem:
When dealing with functions of multiple variables, some of these variables may be related to each other, thus it may be necessary to specify explicitly which variables are being held constant to avoid ambiguity. In fields such asstatistical mechanics, the partial derivative off with respect tox, holdingy andz constant, is often expressed as
Conventionally, for clarity and simplicity of notation, the partial derivativefunction and thevalue of the function at a specific point areconflated by including the function arguments when the partial derivative symbol (Leibniz notation) is used. Thus, an expression like
is used for the function, while
might be used for the value of the function at the point. However, this convention breaks down when we want to evaluate the partial derivative at a point like. In such a case, evaluation of the function must be expressed in an unwieldy manner as
or
in order to use the Leibniz notation. Thus, in these cases, it may be preferable to use the Euler differential operator notation with as the partial derivative symbol with respect to thei-th variable. For instance, one would write for the example described above, while the expression represents the partial derivativefunction with respect to the first variable.[3]
For higher order partial derivatives, the partial derivative (function) of with respect to thej-th variable is denoted. That is,, so that the variables are listed in the order in which the derivatives are taken, and thus, in reverse order of how the composition of operators is usually notated. Of course,Clairaut's theorem implies that as long as comparatively mild regularity conditions onf are satisfied.
An important example of a function of several variables is the case of ascalar-valued function on a domain in Euclidean space (e.g., on or). In this casef has a partial derivative with respect to each variablexj. At the pointa, these partial derivatives define the vector
This vector is called thegradient off ata. Iff is differentiable at every point in some domain, then the gradient is a vector-valued function∇f which takes the pointa to the vector∇f(a). Consequently, the gradient produces avector field.
Acontour plot of, showing the gradient vector in black, and the unit vector scaled by the directional derivative in the direction of in orange. The gradient vector is longer because the gradient points in the direction of greatest rate of increase of a function.
Suppose thatf is a function of more than one variable. For instance,
A graph ofz =x2 +xy +y2. For the partial derivative at(1, 1) that leavesy constant, the correspondingtangent line is parallel to thexz-plane.
A slice of the graph above showing the function in thexz-plane aty = 1. The two axes are shown here with different scales. The slope of the tangent line is 3.
Thegraph of this function defines asurface inEuclidean space. To every point on this surface, there are an infinite number oftangent lines. Partial differentiation is the act of choosing one of these lines and finding itsslope. Usually, the lines of most interest are those that are parallel to thexz-plane, and those that are parallel to theyz-plane (which result from holding eithery orx constant, respectively).
To find the slope of the line tangent to the function atP(1, 1) and parallel to thexz-plane, we treaty as a constant. The graph and this plane are shown on the right. Below, we see how the function looks on the planey = 1. By finding thederivative of the equation while assuming thaty is a constant, we find that the slope off at the point(x,y) is:
So at(1, 1), by substitution, the slope is3. Therefore,
at the point(1, 1). That is, the partial derivative ofz with respect tox at(1, 1) is3, as shown in the graph.
The functionf can be reinterpreted as a family of functions of one variable indexed by the other variables:
In other words, every value ofy defines a function, denotedfy, which is a function of one variablex.[6] That is,
In this section the subscript notationfy denotes a function contingent on a fixed value ofy, and not a partial derivative.
Once a value ofy is chosen, saya, thenf(x,y) determines a functionfa which traces a curvex2 +ax +a2 on thexz-plane:
In this expression,a is aconstant, not avariable, sofa is a function of only one real variable, that beingx. Consequently, the definition of the derivative for a function of one variable applies:
The above procedure can be performed for any choice ofa. Assembling the derivatives together into a function gives a function which describes the variation off in thex direction:
This is the partial derivative off with respect tox. Here '∂' is a rounded 'd' called thepartial derivative symbol; to distinguish it from the letter 'd', '∂' is sometimes pronounced "partial".
Second and higher order partial derivatives are defined analogously to the higher order derivatives of univariate functions. For the function the "own" second partial derivative with respect tox is simply the partial derivative of the partial derivative (both with respect tox):[7]: 316–318
The cross partial derivative with respect tox andy is obtained by taking the partial derivative off with respect tox, and then taking the partial derivative of the result with respect toy, to obtain
Schwarz's theorem states that if the second derivatives are continuous, the expression for the cross partial derivative is unaffected by which variable the partial derivative is taken with respect to first and which is taken second. That is,
or equivalently
Own and cross partial derivatives appear in theHessian matrix which is used in thesecond order conditions inoptimization problems.The higher order partial derivatives can be obtained by successive differentiation
There is a concept for partial derivatives that is analogous toantiderivatives for regular derivatives. Given a partial derivative, it allows for the partial recovery of the original function.
Consider the example of
The so-called partial integral can be taken with respect tox (treatingy as constant, in a similar manner to partial differentiation):
Here, theconstant of integration is no longer a constant, but instead a function of all the variables of the original function exceptx. The reason for this is that all the other variables are treated as constant when taking the partial derivative, so any function which does not involvex will disappear when taking the partial derivative, and we have to account for this when we take the antiderivative. The most general way to represent this is to have the constant represent an unknown function of all the other variables.
Thus the set of functions, whereg is any one-argument function, represents the entire set of functions in variablesx,y that could have produced thex-partial derivative.
If all the partial derivatives of a function are known (for example, with thegradient), then the antiderivatives can be matched via the above process to reconstruct the original function up to a constant. Unlike in the single-variable case, however, not every set of functions can be the set of all (first) partial derivatives of a single function. In other words, not every vector field isconservative.
ThevolumeV of acone depends on the cone'sheighth and itsradiusr according to the formula
The partial derivative ofV with respect tor is
which represents the rate with which a cone's volume changes if its radius is varied and its height is kept constant. The partial derivative with respect toh equals, which represents the rate with which the volume changes if its height is varied and its radius is kept constant.
By contrast, thetotal derivative ofV with respect tor andh are respectively
The difference between the total and partial derivative is the elimination of indirect dependencies between variables in partial derivatives.
If (for some arbitrary reason) the cone's proportions have to stay the same, and the height and radius are in a fixed ratiok,
This gives the total derivative with respect tor,
which simplifies to
Similarly, the total derivative with respect toh is
The total derivative with respect tobothr andh of the volume intended as scalar function of these two variables is given by thegradient vector
Partial derivatives appear in any calculus-basedoptimization problem with more than one choice variable. For example, ineconomics a firm may wish to maximizeprofitπ(x,y) with respect to the choice of the quantitiesx andy of two different types of output. Thefirst order conditions for this optimization areπx = 0 = πy. Since both partial derivativesπx andπy will generally themselves be functions of both argumentsx andy, these two first order conditions form asystem of two equations in two unknowns.
Thermodynamics, quantum mechanics and mathematical physics
Partial derivatives appear in thermodynamic equations likeGibbs-Duhem equation, in quantum mechanics as inSchrödinger wave equation, as well as in other equations frommathematical physics. The variables being held constant in partial derivatives here can be ratios of simple variables likemole fractionsxi in the following example involving the Gibbs energies in a ternary mixture system:
Expressmole fractions of a component as functions of other components' mole fraction and binary mole ratios:
Differential quotients can be formed at constant ratios like those above:
Ratios X, Y, Z of mole fractions can be written for ternary and multicomponent systems:
Partial derivatives are key to target-aware image resizing algorithms. Widely known asseam carving, these algorithms require eachpixel in an image to be assigned a numerical 'energy' to describe their dissimilarity against orthogonal adjacent pixels. Thealgorithm then progressively removes rows or columns with the lowest energy. The formula established to determine a pixel's energy (magnitude ofgradient at a pixel) depends heavily on the constructs of partial derivatives.
Partial derivatives play a prominent role ineconomics, in which most functions describing economic behaviour posit that the behaviour depends on more than one variable. For example, a societalconsumption function may describe the amount spent on consumer goods as depending on both income and wealth; themarginal propensity to consume is then the partial derivative of the consumption function with respect to income.