This article is about the family of orthogonal polynomials on the real line. For polynomial interpolation on a segment using derivatives, seeHermite interpolation. For integral transform of Hermite polynomials, seeHermite transform.
Hermite polynomials were defined byPierre-Simon Laplace in 1810,[1][2] though in scarcely recognizable form, and studied in detail byPafnuty Chebyshev in 1859.[3] Chebyshev's work was overlooked, and they were named later afterCharles Hermite, who wrote on the polynomials in 1864, describing them as new.[4] They were consequently not new, although Hermite was the first to define the multidimensional polynomials.
Like the otherclassical orthogonal polynomials, the Hermite polynomials can be defined from several different starting points. Noting from the outset that there are two different standardizations in common use, one convenient method is as follows:
The"probabilist's Hermite polynomials" are given by
while the"physicist's Hermite polynomials" are given by
These equations have the form of aRodrigues' formula and can also be written as,
The two definitions are not exactly identical; each is a rescaling of the other:
These are Hermite polynomial sequences of different variances; see the material on variances below.
Thenth-order Hermite polynomial is a polynomial of degreen. The probabilist's versionHen has leading coefficient 1, while the physicist's versionHn has leading coefficient2n.
Hn(x) andHen(x) arenth-degree polynomials forn = 0, 1, 2, 3,.... Thesepolynomials are orthogonal with respect to theweight function (measure) ori.e., we have
The Hermite polynomials (probabilist's or physicist's) form anorthogonal basis of theHilbert space of functions satisfyingin which the inner product is given by the integralincluding theGaussian weight functionw(x) defined in the preceding section.
An orthogonal basis forL2(R,w(x)dx) is acomplete orthogonal system. For an orthogonal system,completeness is equivalent to the fact that the 0 function is the only functionf ∈L2(R,w(x)dx) orthogonal toall functions in the system.
Since thelinear span of Hermite polynomials is the space of all polynomials, one has to show (in physicist case) that iff satisfiesfor everyn ≥ 0, thenf = 0.
One possible way to do this is to appreciate that theentire functionvanishes identically. The fact then thatF(it) = 0 for every realt means that theFourier transform off(x)e−x2 is 0, hencef is 0almost everywhere. Variants of the above completeness proof apply to other weights withexponential decay.
In the Hermite case, it is also possible to prove an explicit identity that implies completeness (see section on theCompleteness relation below).
An equivalent formulation of the fact that Hermite polynomials are an orthogonal basis forL2(R,w(x)dx) consists in introducing Hermitefunctions (see below), and in saying that the Hermite functions are an orthonormal basis forL2(R).
The probabilist's Hermite polynomials are solutions of thedifferential equationwhereλ is a constant. Imposing the boundary condition thatu should be polynomially bounded at infinity, the equation has solutions only ifλ is a non-negative integer, and the solution is uniquely given by, where denotes a constant.
Rewriting the differential equation as aneigenvalue problemthe Hermite polynomials may be understood aseigenfunctions of the differential operator . This eigenvalue problem is called theHermite equation, although the term is also used for the closely related equation whose solution is uniquely given in terms of physicist's Hermite polynomials in the form, where denotes a constant, after imposing the boundary condition thatu should be polynomially bounded at infinity.
The general solutions to the above second-order differential equations are in fact linear combinations of both Hermite polynomials and confluent hypergeometric functions of the first kind. For example, for the physicist's Hermite equationthe general solution takes the formwhere and are constants, are physicist's Hermite polynomials (of the first kind), and are physicist's Hermite functions (of the second kind). The latter functions are compactly represented as where areConfluent hypergeometric functions of the first kind. The conventional Hermite polynomials may also be expressed in terms of confluent hypergeometric functions, see below.
The sequence of probabilist's Hermite polynomials also satisfies therecurrence relationIndividual coefficients are related by the following recursion formula: anda0,0 = 1,a1,0 = 0,a1,1 = 1.
For the physicist's polynomials, assumingwe haveIndividual coefficients are related by the following recursion formula:anda0,0 = 1,a1,0 = 0,a1,1 = 2.
The Hermite polynomials constitute anAppell sequence, i.e., they are a polynomial sequence satisfying the identity
An integral recurrence that is deduced and demonstrated in[6] is as follows:
The physicist's Hermite polynomials can be written explicitly as
These two equations may be combined into one using thefloor function:
The probabilist's Hermite polynomialsHe have similar formulas, which may be obtained from these by replacing the power of2x with the corresponding power of√2x and multiplying the entire sum by2−n/2:
This equality is valid for allcomplex values ofx andt, and can be obtained by writing the Taylor expansion atx of the entire functionz →e−z2 (in the physicist's case). One can also derive the (physicist's) generating function by usingCauchy's integral formula to write the Hermite polynomials as
Using this in the sum one can evaluate the remaining integral using the calculus of residues and arrive at the desired generating function.
The moments of the standard normal (with expected value zero) may be read off directly from the relation for even indices:where(2n − 1)!! is thedouble factorial. Note that the above expression is a special case of the representation of the probabilist's Hermite polynomials as moments:
From the generating-function representation above, we see that the Hermite polynomials have a representation in terms of acontour integral, aswith the contour encircling the origin.
Using the Fourier transform of the gaussian, we have
Asymptotically, asn → ∞, the expansion[12] holds true. For certain cases concerning a wider range of evaluation, it is necessary to include a factor for changing amplitude:which, usingStirling's approximation, can be further simplified, in the limit, to
A better approximation, which accounts for the variation in frequency, is given by
A finer approximation,[13] which takes into account the uneven spacing of the zeros near the edges, makes use of the substitution with which one has the uniform approximation
Similar approximations hold for the monotonic and transition regions. Specifically, if thenwhile for witht complex and bounded, the approximation iswhereAi is theAiry function of the first kind.
Let be a real symmetric matrix, then theKibble–Slepian formula states that where is the-fold summation over all symmetric matrices with non-negative integer entries, is thetrace of, and is defined as. This givesMehler's formula when.
Equivalently stated, if is apositive semidefinite matrix, then set, we have, soEquivalently stated in a form closer to thebosonquantum mechanics of theharmonic oscillator:[14] where each is the-th eigenfunction of the harmonic oscillator, defined asThe Kibble–Slepian formula was proposed by Kibble in 1945[15] and proven by Slepian in 1972 using Fourier analysis.[16] Foata gave a combinatorial proof[17] while Louck gave a proof via boson quantum mechanics.[14] It has a generalization for complex-argument Hermite polynomials.[18][19]
Similar to Taylor expansion, some functions are expressible as an infinite sum of Hermite polynomials. Specifically, if, then it has an expansion in the physicist's Hermite polynomials.[22]
Given such, the partial sums of the Hermite expansion of converges to in the norm if and only if.[23]
The probabilist's Hermite polynomials satisfy the identity[24] whereD represents differentiation with respect tox, and theexponential is interpreted by expanding it as apower series. There are no delicate questions of convergence of this series when it operates on polynomials, since all but finitely many terms vanish.
Since the power-series coefficients of the exponential are well known, and higher-order derivatives of the monomialxn can be written down explicitly, this differential-operator representation gives rise to a concrete formula for the coefficients ofHn that can be used to quickly compute these polynomials.
Since the formal expression for theWeierstrass transformW iseD2, we see that the Weierstrass transform of(√2)nHen(x/√2) isxn. Essentially the Weierstrass transform thus turns a series of Hermite polynomials into a correspondingMaclaurin series.
The existence of some formal power seriesg(D) with nonzero constant coefficient, such thatHen(x) =g(D)xn, is another equivalent to the statement that these polynomials form anAppell sequence. Since they are an Appell sequence, they area fortiori aSheffer sequence.
The probabilist's Hermite polynomials defined above are orthogonal with respect to the standard normal probability distribution, whose density function iswhich has expected value 0 and variance 1.
Scaling, one may analogously speak ofgeneralized Hermite polynomials[25]of varianceα, whereα is any positive number. These are then orthogonal with respect to the normal probability distribution whose density function isThey are given by
Now, ifthen the polynomial sequence whosenth term isis called theumbral composition of the two polynomial sequences. It can be shown to satisfy the identitiesandThe last identity is expressed by saying that thisparameterized family of polynomial sequences is known as a cross-sequence. (See the above section on Appell sequences and on thedifferential-operator representation, which leads to a ready derivation of it. Thisbinomial type identity, forα =β =1/2, has already been encountered in the above section on#Recursion relations.)
Since polynomial sequences form agroup under the operation ofumbral composition, one may denote bythe sequence that is inverse to the one similarly denoted, but without the minus sign, and thus speak of Hermite polynomials of negative variance. Forα > 0, the coefficients of are just the absolute values of the corresponding coefficients of.
These arise as moments of normal probability distributions: Thenth moment of the normal distribution with expected valueμ and varianceσ2 iswhereX is a random variable with the specified normal distribution. A special case of the cross-sequence identity then says that
One can define theHermite functions (often called Hermite-Gaussian functions) from the physicist's polynomials:Thus,
Since these functions contain the square root of theweight function and have been scaled appropriately, they areorthonormal:and they form an orthonormal basis ofL2(R). This fact is equivalent to the corresponding statement for Hermite polynomials (see above).
The Hermite functions satisfy the differential equationThis equation is equivalent to theSchrödinger equation for a harmonic oscillator in quantum mechanics, so these functions are theeigenfunctions.
The Hermite functionsψn(x) are a set ofeigenfunctions of thecontinuous Fourier transformF. To see this, take the physicist's version of the generating function and multiply bye−1/2x2. This gives
The Fourier transform of the left side is given by
The Fourier transform of the right side is given by
Equating like powers oft in the transformed versions of the left and right sides finally yields
The Hermite functionsψn(x) are thus an orthonormal basis ofL2(R), whichdiagonalizes the Fourier transform operator.[29] In short, we have:
In the Hermite polynomialHen(x) of variance 1, the absolute value of the coefficient ofxk is the number of (unordered) partitions of ann-element set intok singletons andn −k/2 (unordered) pairs. Equivalently, it is the number of involutions of ann-element set with preciselyk fixed points, or in other words, the number of matchings in thecomplete graph onn vertices that leavek vertices uncovered (indeed, the Hermite polynomials are thematching polynomials of these graphs). The sum of the absolute values of the coefficients gives the total number of partitions into singletons and pairs, the so-calledtelephone numbers
Moreover, the followingcompleteness identity for the above Hermite functions holds in the sense ofdistributions:whereδ is theDirac delta function,ψn the Hermite functions, andδ(x −y) represents theLebesgue measure on the liney =x inR2, normalized so that its projection on the horizontal axis is the usual Lebesgue measure.
This distributional identity followsWiener (1958) by takingu → 1 inMehler's formula, valid when−1 <u < 1:which is often stated equivalently as a separable kernel,[35][36]
The function(x,y) →E(x,y;u) is the bivariate Gaussian probability density onR2, which is, whenu is close to 1, very concentrated around the liney =x, and very spread out on that line. It follows thatwhenf andg are continuous and compactly supported.
This yields thatf can be expressed in Hermite functions as the sum of a series of vectors inL2(R), namely,
With this representation forHn(x) andHn(y), it is evident thatand this yields the desired resolution of the identity result, using again the Fourier transform of Gaussian kernels under the substitution
^Tom H. Koornwinder, Roderick S. C. Wong, and Roelof Koekoek et al. (2010) andAbramowitz & Stegun.
^Hurtado Benavides, Miguel Ángel. (2020). De las sumas de potencias a las sucesiones de Appell y su caracterización a través de funcionales. [Tesis de maestría]. Universidad Sergio Arboleda.
^Gradshteĭn, I. S.; Zwillinger, Daniel (2015).Table of integrals, series, and products (8 ed.). Amsterdam ; Boston: Elsevier, Academic Press is an imprint of Elsevier.ISBN978-0-12-384933-5.
^abFeldheim, Ervin. "Développements en série de polynômes d’Hermite et de Laguerrea l’aide des transformations de Gauss et de Hankel." Proceedings of the Koninklijke Nederlandse Akademie van Wetenschappen 435 (1940). PartI,II,III
^In this case, we used the unitary version of the Fourier transform, so theeigenvalues are(−i)n. The ensuing resolution of the identity then serves to define powers, including fractional ones, of the Fourier transform, to wit aFractional Fourier transform generalization, in effect aMehler kernel.
^Folland, G. B. (1989),Harmonic Analysis in Phase Space, Annals of Mathematics Studies, vol. 122, Princeton University Press,ISBN978-0-691-08528-9
Laplace, P. S. (1810), "Mémoire sur les intégrales définies et leur application aux probabilités, et spécialement a la recherche du milieu qu'il faut choisir entre les résultats des observations",Mémoires de l'Académie des Sciences:279–347Oeuvres complètes 12, pp.357-412,English translationArchived 2016-03-04 at theWayback Machine.
Rainville, Earl David (1971).Special functions. Bronx, N.Y: Chelsea Pub. Co.ISBN978-0-8284-0258-3.
Shohat, J.A.; Hille, Einar; Walsh, Joseph L. (1940),A bibliography on orthogonal polynomials, Bulletin of the National Research Council, Washington D.C.: National Academy of Sciences - 2000 references of Bibliography on Hermite polynomials.
Szegő, Gábor (1955) [1939],Orthogonal Polynomials, Colloquium Publications, vol. 23 (4th ed.), American Mathematical Society,ISBN978-0-8218-1023-1{{citation}}:ISBN / Date incompatibility (help)
Temme, Nico (1996),Special Functions: An Introduction to the Classical Functions of Mathematical Physics, New York: Wiley,ISBN978-0-471-11313-3
Wiener, Norbert (1958) [1933],The Fourier Integral and Certain of its Applications (revised ed.), New York: Dover Publications,ISBN0-486-60272-9{{citation}}:ISBN / Date incompatibility (help)