This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Heat equation" – news ·newspapers ·books ·scholar ·JSTOR(August 2024) (Learn how and when to remove this message) |

Inmathematics andphysics (more specificallythermodynamics), theheat equation is aparabolic partial differential equation. The theory of the heat equation was first developed byJoseph Fourier in 1822 for the purpose of modeling how a quantity such asheat diffuses through a given region. Since then, the heat equation and its variants have been found to be fundamental in many parts of both pure and applied mathematics.
Given an open subsetU ofRn and a subintervalI ofR, one says that a functionu :U ×I →R is a solution of theheat equation if
where(x1, ...,xn,t) denotes a general point of the domain.[1] It is typical to refer tot as time andx1, ...,xn as spatial variables, even in abstract contexts where these phrases fail to have their intuitive meaning. The collection of spatial variables is often referred to simply asx. For any given value oft, the right-hand side of the equation is theLaplacian of the functionu(⋅,t) :U →R. As such, the heat equation is often written more compactly as
In physics and engineering contexts, especially in the context of diffusion through a medium, it is more common to fix aCartesian coordinate system and then to consider the specific case of afunctionu(x,y,z,t) of three spatial variables(x,y,z) andtime variablet. One then says thatu is a solution of the heat equation if
in whichα is a positivecoefficient called thethermal diffusivity of the medium. In addition to other physical phenomena, this equation describes the flow of heat in a homogeneous and isotropic medium, withu(x,y,z,t) being the temperature at the point(x,y,z) and timet. If the medium is not homogeneous and isotropic, thenα would not be a fixed coefficient, and would instead depend on(x,y,z); the equation would also have a slightly different form. In the physics and engineering literature, it is common to use∇2 to denote the Laplacian, rather than∆.
In mathematics as well as in physics and engineering, it is common to useNewton's notation for time derivatives, so that is used to denote∂u/∂t, so the equation can be written
Note also that the ability to use either∆ or∇2 to denote the Laplacian, without explicit reference to the spatial variables, is a reflection of the fact that the Laplacian is independent of the choice of coordinate system. In mathematical terms, one would say that the Laplacian is translationally and rotationally invariant. In fact, it is (loosely speaking) the simplest differential operator which has these symmetries. This can be taken as a significant (and purely mathematical) justification of the use of the Laplacian and of the heat equation in modeling any physical phenomena which are homogeneous and isotropic, of which heat diffusion is a principal example.
The diffusivity constantα is often not present in mathematical studies of the heat equation, while its value can be very important in engineering. This is not a major difference, for the following reason. Letu be a function with
Define a new function. Then, according to thechain rule, one has
| ⁎ |
Thus, there is a straightforward way of translating between solutions of the heat equation with a general value ofα and solutions of the heat equation withα = 1. As such, for the sake of mathematical analysis, it is often sufficient to only consider the caseα = 1.
Since there is another option to define a satisfying as in (⁎) above by setting. Note that the two possible means of defining the new function discussed here amount, in physical terms, to changing the unit of measure of time or the unit of measure of length.
The nonhomogeneous heat equation is
for a given function which is allowed to depend on bothx andt.[1] The inhomogeneous heat equation models thermal problems in which a heat source modeled byf is switched on. For example, it can be used to model the temperature throughout a room with a heater switched on. If is the region of the room where the heater is and the heater is constantly generatingq units of heat per unit of volume, thenf would be given by.
A solution to the heat equation is said to be a steady-state solution if it does not vary with respect to time:
Flowingu via. the heat equation causes it to become closer and closer as time increases to a steady-state solution. For very large time,u is closely approximated by a steady-state solution. A steady state solution of the heat equation is equivalently a solution ofLaplace's equation.
Similarly, a solution to the nonhomogeneous heat equation is said to be a steady-state solution if it does not vary with respect to time:
This is equivalently a solution ofPoisson's equation.
In the steady-state case, a nonzero spatial thermal gradient may (or may not) be present, but if it is, it does not change in time. The steady-state equation describes the end result in all thermal problems in which a source is switched on (for example, an engine started in an automobile), and enough time has passed for all permanent temperature gradients to establish themselves in space, after which these spatial gradients no longer change in time (as again, with an automobile in which the engine has been running for long enough). The other (trivial) solution is for all spatial temperature gradients to disappear as well, in which case the temperature become uniform in space, as well. The steady-state equations are simpler and can help to understand better the physics of the materials without focusing on the dynamics of heat transport. It is widely used for simple engineering problems assuming there is equilibrium of the temperature fields and heat transport, with time.
Informally, the Laplacian operator∆ gives the difference between the average value of a function in the neighborhood of a point, and its value at that point. Thus, ifu is the temperature,∆u conveys if (and by how much) the material surrounding each point is hotter or colder, on the average, than the material at that point.
By thesecond law of thermodynamics, heat will flow from hotter bodies to adjacent colder bodies, in proportion to the difference of temperature and of thethermal conductivity of the material between them. When heat flows into (respectively, out of) a material, its temperature increases (respectively, decreases), in proportion to the amount of heat divided by the amount (mass) of material, with aproportionality factor called thespecific heat capacity of the material.
By the combination of these observations, the heat equation says the rate at which the material at a point will heat up (or cool down) is proportional to how much hotter (or cooler) the surrounding material is. The coefficientα in the equation takes into account the thermal conductivity, specific heat, anddensity of the material.
The first half of the above physical thinking can be put into a mathematical form. The key is that, for any fixedx, one has
whereu(x)(r) is the single-variable function denoting theaverage value ofu over the surface of the sphere of radiusr centered atx; it can be defined by
in whichωn − 1 denotes thesurface area of the unit ball inn-dimensional Euclidean space. This formalizes the above statement that the value of∆u at a pointx measures the difference between the value ofu(x) and the value ofu at points nearby tox, in the sense that the latter is encoded by the values ofu(x)(r) for small positive values ofr.
Following this observation, one may interpret the heat equation as imposing aninfinitesimal averaging of a function. Given a solution of the heat equation, the value ofu(x,t + τ) for a small positive value ofτ may be approximated as1/2n times the average value of the functionu(⋅,t) over a sphere of very small radius centered atx.


The heat equation implies that peaks (local maxima) of will be gradually eroded down, while depressions (local minima) will be filled in. The value at some point will remain stable only as long as it is equal to the average value in its immediate surroundings. In particular, if the values in a neighborhood are very close to a linear function, then the value at the center of that neighborhood will not be changing at that time (that is, the derivative will be zero).
A more subtle consequence is themaximum principle, that says that the maximum value of in any region of the medium will not exceed the maximum value that previously occurred in, unless it is on the boundary of. That is, the maximum temperature in a region can increase only if heat comes in from outside. This is a property ofparabolic partial differential equations and is not difficult to prove mathematically (see below).
Another interesting property is that even if initially has a sharp jump (discontinuity) of value across some surface inside the medium, the jump is immediately smoothed out by a momentary, infinitesimally short but infinitely large rate of flow of heat through that surface. For example, if two isolated bodies, initially at uniform but different temperatures and, are made to touch each other, the temperature at the point of contact will immediately assume some intermediate value, and a zone will develop around that point where will gradually vary between and.
If a certain amount of heat is suddenly applied to a point in the medium, it will spread out in all directions in the form of adiffusion wave. Unlike theelastic andelectromagnetic waves, the speed of a diffusion wave drops with time: as it spreads over a larger region, the temperature gradient decreases, and therefore the heat flow decreases too.
For heat flow, the heat equation follows from the physical laws ofconduction of heat andconservation of energy (Cannon 1984).
ByFourier's law for an isotropic medium, the rate of flow of heat energy per unit area through a surface is proportional to the negative temperature gradient across it:
where is thethermal conductivity of the material, is the temperature, and is avector field that represents the magnitude and direction of the heat flow at the point of space and time.
If the medium is a thin rod of uniform section and material, the positionx is a single coordinate and the heat flow towards is ascalar field. The equation becomes
Let be theinternal energy (heat) per unit volume of the bar at each point and time. The rate of change in heat per unit volume in the material,, is proportional to the rate of change of its temperature,. That is,
where is the specific heat capacity (at constant pressure, in case of a gas) and is the density (mass per unit volume) of the material. This derivation assumes that the material has constant mass density and heat capacity through space as well as time.
Applying the law of conservation of energy to a small element of the medium centred at, one concludes that the rate at which heat changes at a given point is equal to the derivative of the heat flow at that point (the difference between the heat flows either side of the particle). That is,
From the above equations it follows that
which is the heat equation in one dimension, with diffusivity coefficient
This quantity is called thethermal diffusivity of the medium.
An additional term may be introduced into the equation to account for radiative loss of heat. According to theStefan–Boltzmann law, this term is, where is the temperature of the surroundings, and is a coefficient that depends on theStefan-Boltzmann constant, theemissivity of the material, and the geometry. The rate of change in internal energy becomes
and the equation for the evolution of becomes
Note that the state equation, given by thefirst law of thermodynamics (i.e. conservation of energy), is written in the following form (assuming no mass transfer or radiation). This form is more general and particularly useful to recognize which property (e.g.cp or) influences which term.
where is the volumetric heat source.
In general, the study of heat conduction is based on several principles. Heat flow is a form ofenergy flow, and as such it is meaningful to speak of the time rate of flow of heat into a region of space.
Putting these equations together gives the general equation of heat flow:
Remarks
In the special cases of propagation of heat in anisotropic andhomogeneous medium in a 3-dimensional space, this equation is
where:
The heat equation is a consequence of Fourier's law of conduction (seeheat conduction).
If the medium is not the whole space, in order to solve the heat equation uniquely we also need to specifyboundary conditions foru. To determine uniqueness of solutions in the whole space it is necessary to assume additional conditions, for example an exponential bound on the growth of solutions[2] or a sign condition (nonnegative solutions are unique by a result ofDavid Widder).[3]
Solutions of the heat equation are characterized by a gradual smoothing of the initial temperature distribution by the flow ofheat from warmer to colder areas of an object. Generally, many different states and starting conditions will tend toward the same stableequilibrium. As a consequence, to reverse the solution and conclude something about earlier times or initial conditions from the present heat distribution is very inaccurate except over the shortest of time periods.
The heat equation is the prototypical example of aparabolic partial differential equation.
Using theLaplace operator, the heat equation can be simplified, and generalized to similar equations over spaces of arbitrary number of dimensions, as
where the Laplace operator, denoted as either Δ or as ∇2 (the divergence of the gradient), is taken in the spatial variables.
The heat equation governs heat diffusion, as well as other diffusive processes, such asparticle diffusion or the propagation ofaction potential in nerve cells. Although they are not diffusive in nature, some quantum mechanics problems are also governed by a mathematical analog of the heat equation (see below). It also can be used to model some phenomena arising infinance, like theBlack–Scholes or theOrnstein-Uhlenbeck processes. The equation, and various non-linear analogues, has also been used in image analysis.
The heat equation is, technically, in violation ofspecial relativity, because its solutions involve instantaneous propagation of a disturbance. The part of the disturbance outside the forwardlight cone can usually be safely neglected, but if it is necessary to develop a reasonable speed for the transmission of heat, ahyperbolic problem should be considered instead – like a partial differential equation involving a second-ordertime derivative. Some models of nonlinear heat conduction (which are also parabolic equations) have solutions with finite heat transmission speed.[4][5]
The functionu above represents temperature of a body. Alternatively, it is sometimes convenient to change units and representu as the heat density of a medium. Since heat density is proportional to temperature in a homogeneous medium, the heat equation is still obeyed in the new units.
Suppose that a body obeys the heat equation and, in addition, generates its own heat per unit volume (e.g., in watts/litre - W/L) at a rate given by a known functionq varying in space and time.[6] Then the heat per unit volumeu satisfies an equation
For example, a tungsten light bulb filament generates heat, so it would have a positive nonzero value forq when turned on. While the light is turned off, the value ofq for the tungsten filament would be zero.

The following solution technique for the heat equation was proposed byJoseph Fourier in his treatiseThéorie analytique de la chaleur, published in 1822. Consider the heat equation for one space variable. This could be used to model heat conduction in a rod. The equation is
| 1 |
whereu =u(x,t) is a function of two variablesx andt. Here
We assume the initial condition
| 2 |
where the functionf is given, and the boundary conditions
| . | 3 |
Let us attempt to find a solution of (1) that is not identically zero satisfying the boundary conditions (3) but with the following property:u is a product in which the dependence ofu onx,t is separated, that is:
| 4 |
This solution technique is calledseparation of variables. Substitutingu back into equation (1),
Since the right hand side depends only onx and the left hand side only ont, both sides are equal to some constant value −λ. Thus:
| 5 |
and
| 6 |
We will now show that nontrivial solutions for (6) for values ofλ ≤ 0 cannot occur:
This solves the heat equation in the special case that the dependence ofu has the special form (4).
In general, the sum of solutions to (1) that satisfy the boundary conditions (3) also satisfies (1) and (3). We can show that the solution to (1), (2) and (3) is given by
where
The solution technique used above can be greatly extended to many other types of equations. The idea is that the operatoruxx with the zero boundary conditions can be represented in terms of itseigenfunctions. This leads naturally to one of the basic ideas of thespectral theory of linearself-adjoint operators.
Consider thelinear operator Δu =uxx. The infinite sequence of functions
forn ≥ 1 are eigenfunctions of Δ. Indeed,
Moreover, any eigenfunctionf of Δ with the boundary conditionsf(0) =f(L) = 0 is of the formen for somen ≥ 1. The functionsen forn ≥ 1 form anorthonormal sequence with respect to a certaininner product on the space of real-valued functions on [0,L]. This means
Finally, the sequence {en}n ∈N spans a dense linear subspace ofL2((0,L)). This shows that in effect we havediagonalized the operator Δ.
Solutions of the heat equations
satisfy a mean-value property analogous to themean-value properties of harmonic functions, solutions of
though a bit more complicated. Precisely, ifu solves
and
then
where is aheat-ball, that is a super-level set of the fundamental solution of the heat equation:
Notice that
as so the above formula holds for any in the (open) set for large enough.[7]
Afundamental solution of the heat equation is a solution that corresponds to the initial condition of an initial point source of heat at a known position. These can be used to find a general solution of the heat equation over certain domains (see, for instance,Evans 2010).
In one variable, theGreen's function is a solution of the initial value problem (byDuhamel's principle, equivalent to the definition of Green's function as one with a delta function as solution to the first equation)
where is theDirac delta function. The fundamental solution to this problem is given by theheat kernel
One can obtain the general solution of the one variable heat equation with initial conditionu(x, 0) =g(x) for −∞ <x < ∞ and 0 <t < ∞ by applying aconvolution:
In several spatial variables, the fundamental solution solves the analogous problem
Then-variable fundamental solution is the product of the fundamental solutions in each variable; i.e.,
The general solution of the heat equation onRn is then obtained by a convolution, so that to solve the initial value problem withu(x, 0) =g(x), one has
The general problem on a domain Ω inRn is
with eitherDirichlet orNeumann boundary data. AGreen's function always exists, but unless the domain Ω can be readily decomposed into one-variable problems (see below), it may not be possible to write it down explicitly. Other methods for obtaining Green's functions include themethod of images,separation of variables, andLaplace transforms (Cole, 2011).
A variety of elementary Green's function solutions in one-dimension are recorded here; many others are available elsewhere.[8] In some of these, the spatial domain is (−∞,∞). In others, it is the semi-infinite interval (0,∞) with eitherNeumann orDirichlet boundary conditions. One further variation is that some of these solve the inhomogeneous equation
wheref is some given function ofx andt.

Comment. This solution is theconvolution with respect to the variablex of the fundamental solution
and the functiong(x). (TheGreen's function number of the fundamental solution is X00.)
Therefore, according to the general properties of the convolution with respect to differentiation,u =g ∗ Φ is a solution of the same heat equation, for
Moreover,
so that, by general facts aboutapproximation to the identity, Φ(⋅,t) ∗g →g ast → 0 in various senses, according to the specificg. For instance, ifg is assumed bounded and continuous onR thenΦ(⋅,t) ∗g converges uniformly tog ast → 0, meaning thatu(x,t) is continuous onR × [0, ∞) withu(x, 0) =g(x).
Comment. This solution is obtained from the preceding formula as applied to the datag(x) suitably extended toR, so as to be anodd function, that is, lettingg(−x) := −g(x) for allx. Correspondingly, the solution of the initial value problem on (−∞,∞) is an odd function with respect to the variablex for all values oft, and in particular it satisfies the homogeneous Dirichlet boundary conditionsu(0,t) = 0.TheGreen's function number of this solution is X10.
Comment. This solution is obtained from the first solution formula as applied to the datag(x) suitably extended toR so as to be aneven function, that is, lettingg(−x) :=g(x) for allx. Correspondingly, the solution of the initial value problem onR is an even function with respect to the variablex for all values oft > 0, and in particular, being smooth, it satisfies the homogeneous Neumann boundary conditionsux(0,t) = 0. TheGreen's function number of this solution is X20.
Comment. This solution is theconvolution with respect to the variablet of
and the functionh(t). Since Φ(x,t) is the fundamental solution of
the functionψ(x,t) is also a solution of the same heat equation, and so isu :=ψ ∗h, thanks to general properties of the convolution with respect to differentiation. Moreover,
so that, by general facts aboutapproximation to the identity,ψ(x, ⋅) ∗h →h asx → 0 in various senses, according to the specifich. For instance, ifh is assumed continuous onR with support in [0, ∞) thenψ(x, ⋅) ∗h converges uniformly on compacta toh asx → 0, meaning thatu(x,t) is continuous on[0, ∞) × [0, ∞) withu(0,t) =h(t).

Comment. This solution is the convolution inR2, that is with respect to both the variablesx andt, of the fundamental solution
and the functionf(x,t), both meant as defined on the wholeR2 and identically 0 for allt → 0. One verifies that
which expressed in the language of distributions becomes
where the distribution δ is theDirac's delta function, that is the evaluation at 0.
Comment. This solution is obtained from the preceding formula as applied to the dataf(x,t) suitably extended toR × [0,∞), so as to be an odd function of the variablex, that is, lettingf(−x,t) := −f(x,t) for allx andt. Correspondingly, the solution of the inhomogeneous problem on (−∞,∞) is an odd function with respect to the variablex for all values oft, and in particular it satisfies the homogeneous Dirichlet boundary conditionsu(0,t) = 0.
Comment. This solution is obtained from the first formula as applied to the dataf(x,t) suitably extended toR × [0,∞), so as to be an even function of the variablex, that is, lettingf(−x,t) :=f(x,t) for allx andt. Correspondingly, the solution of the inhomogeneous problem on (−∞,∞) is an even function with respect to the variablex for all values oft, and in particular, being a smooth function, it satisfies the homogeneous Neumann boundary conditionsux(0,t) = 0.
Since the heat equation is linear, solutions of other combinations of boundary conditions, inhomogeneous term, and initial conditions can be found by taking an appropriatelinear combination of the above Green's function solutions.
For example, to solve
letu =w +v wherew andv solve the problems
Similarly, to solve
letu =w +v +r wherew,v, andr solve the problems
As the prototypicalparabolic partial differential equation, the heat equation is among the most widely studied topics inpure mathematics, and its analysis is regarded as fundamental to the broader field ofpartial differential equations. The heat equation can also be considered onRiemannian manifolds, leading to many geometric applications. Following work ofSubbaramiah Minakshisundaram andÅke Pleijel, the heat equation is closely related withspectral geometry. A seminalnonlinear variant of the heat equation was introduced todifferential geometry byJames Eells andJoseph Sampson in 1964, inspiring the introduction of theRicci flow byRichard Hamilton in 1982 and culminating in the proof of thePoincaré conjecture byGrigori Perelman in 2003. Certain solutions of the heat equation known asheat kernels provide subtle information about the region on which they are defined, as exemplified through their application to theAtiyah–Singer index theorem.[9]
The heat equation, along with variants thereof, is also important in many fields of science andapplied mathematics. Inprobability theory, the heat equation is connected with the study ofrandom walks andBrownian motion via theFokker–Planck equation. TheBlack–Scholes equation offinancial mathematics is a small variant of the heat equation, and theSchrödinger equation ofquantum mechanics can be regarded as a heat equation inimaginary time. Inimage analysis, the heat equation is sometimes used to resolve pixelation and toidentify edges. FollowingRobert Richtmyer andJohn von Neumann's introduction of artificial viscosity methods, solutions of heat equations have been useful in the mathematical formulation ofhydrodynamical shocks. Solutions of the heat equation have also been given much attention in thenumerical analysis literature, beginning in the 1950s with work of Jim Douglas, D.W. Peaceman, and Henry Rachford Jr.
One can model particlediffusion by an equation involving either:
In either case, one uses the heat equation
or
Bothc andP are functions of position and time.D is the diffusion coefficient that controls the speed of the diffusive process, and is typically expressed in meters squared over second. If the diffusion coefficientD is not constant, but depends on the concentrationc (orP in the second case), then one gets thenonlinear diffusion equation.
Let thestochastic process be the solution to thestochastic differential equation
where is theWiener process (standard Brownian motion). Theprobability density function of is given at any time by
which is the solution to the initial value problem
where is theDirac delta function.
With a simple division, theSchrödinger equation for a single particle ofmassm in the absence of any applied force field can be rewritten in the following way:
wherei is theimaginary unit,ħ is thereduced Planck constant, andψ is thewave function of the particle.
This equation is formally similar to the particle diffusion equation, which one obtains through the following transformation:
Applying this transformation to the expressions of the Green functions determined in the case of particle diffusion yields the Green functions of theSchrödinger equation, which in turn can be used to obtain thewave function at any time through an integral on thewave function att = 0:
with
Remark: this analogy between quantum mechanics and diffusion is a purely formal one. Physically, the evolution of thewave function satisfying theSchrödinger equation might have an origin other than diffusion[citation needed].
A direct practical application of the heat equation, in conjunction withFourier theory, in spherical coordinates, is the prediction of thermal transfer profiles and the measurement of thethermal diffusivity inpolymers (Unsworth andDuarte). This dual theoretical-experimental method is applicable to rubber, various other polymeric materials of practical interest, and microfluids. These authors derived an expression for the temperature at the center of a sphereTC
whereT0 is the initial temperature of the sphere andTS the temperature at the surface of the sphere, of radiusL. This equation has also found applications in protein energy transfer and thermal modeling in biophysics.
The heat equation arises in anumber of phenomena and is often used infinancial mathematics in themodeling ofoptions. TheBlack–Scholes option pricing model'sdifferential equation can be transformed into the heat equation allowing relatively easy solutions from a familiar body of mathematics. Many of the extensions to the simple option models do not have closed form solutions and thus must be solved numerically to obtain a modeled option price. The equation describing pressure diffusion in a porous medium is identical in form with the heat equation.Diffusion problems dealing withDirichlet,Neumann andRobin boundary conditions have closed form analytic solutions (Thambynayagam 2011).
The heat equation is also widely used in image analysis (Perona & Malik 1990) and inmachine learning as the driving theory behindscale-space orgraph Laplacian methods. The heat equation can be efficiently solved numerically using the implicitCrank–Nicolson method of (Crank & Nicolson 1947). This method can be extended to many of the models with no closed form solution, see for instance (Wilmott, Howison & Dewynne 1995).
An abstract form of heat equation onmanifolds provides a major approach to theAtiyah–Singer index theorem, and has led to much further work on heat equations inRiemannian geometry.