Inprobability theory andstatistics, themoment-generating function of a real-valuedrandom variable is an alternative specification of itsprobability distribution. Thus, it provides the basis of an alternative route to analytical results compared with working directly withprobability density functions orcumulative distribution functions. There are particularly simple results for the moment-generating functions of distributions defined by the weighted sums of random variables. However, not all random variables have moment-generating functions.
As its name implies, the moment-generating function can be used to compute a distribution’smoments: then-th moment about 0 is then-th derivative of the moment-generating function, evaluated at 0.
In addition to real-valued distributions (univariate distributions), moment-generating functions can be defined for vector- or matrix-valued random variables, and can even be extended to more general cases.
The moment-generating function of a real-valued distribution does not always exist, unlike thecharacteristic function. There are relations between the behavior of the moment-generating function of a distribution and properties of the distribution, such as the existence of moments.
Let be arandom variable withCDF. The moment generating function (mgf) of (or), denoted by, is
provided thisexpectation exists for in some openneighborhood of 0. That is, there is an such that for all in, exists. If the expectation does not exist in an open neighborhood of 0, we say that the moment generating function does not exist.[1]
In other words, the moment-generating function ofX is theexpectation of the random variable. More generally, when, an-dimensionalrandom vector, and is a fixed vector, one uses instead of :
always exists and is equal to 1. However, a key problem with moment-generating functions is that moments and the moment-generating function may not exist, as the integrals need not converge absolutely. By contrast, thecharacteristic function or Fourier transform always exists (because it is the integral of a bounded function on a space of finitemeasure), and for some purposes may be used instead.
The moment-generating function is so named because it can be used to find the moments of the distribution.[2] The series expansion of is
Hence,
where is the-thmoment. Differentiating times with respect to and setting, we obtain the-th moment about the origin,; see§ Calculations of moments below.
If is a continuous random variable, the following relation between its moment-generating function and thetwo-sided Laplace transform of its probability density function holds:
since the PDF's two-sided Laplace transform is given as
This is consistent with the characteristic function of being aWick rotation of when the moment generating function exists, as the characteristic function of a continuous random variable is theFourier transform of its probability density function, and in general when a function is ofexponential order, the Fourier transform of is a Wick rotation of its two-sided Laplace transform in the region of convergence. Seethe relation of the Fourier and Laplace transforms for further information.
Here are some examples of the moment-generating function and the characteristic function for comparison. It can be seen that the characteristic function is aWick rotation of the moment-generating function when the latter exists.
If, where theXi are independent random variables and theai are constants, then the probability density function forSn is theconvolution of the probability density functions of each of theXi, and the moment-generating function forSn is given by
An important property of the moment-generating function is that it uniquely determines the distribution. In other words, if and are two random variables and for all values of t,
then
for all values ofx (or equivalentlyX andY have the same distribution). This statement is not equivalent to the statement "if two distributions have the same moments, then they are identical at all points." This is because in some cases, the moments exist and yet the moment-generating function does not, because the limit
Jensen's inequality provides a simple lower bound on the moment-generating function:where is the mean ofX.
The moment-generating function can be used in conjunction withMarkov's inequality to bound the upper tail of a real random variableX. This statement is also called theChernoff bound. Since is monotonically increasing for, we havefor any and anya, provided exists. For example, whenX is a standard normal distribution and, we can choose and recall that. This gives, which is within a factor of1+a of the exact value.
Various lemmas, such asHoeffding's lemma orBennett's inequality provide bounds on the moment-generating function in the case of a zero-mean, bounded random variable.
When is non-negative, the moment generating function gives a simple, useful bound on the moments:For any and.
This follows from the inequality into which we can substitute implies for any.Now, if and, this can be rearranged to.Taking the expectation on both sides gives the bound on in terms of.
As an example, consider with degrees of freedom. Then from theexamples.Picking and substituting into the bound:We know thatin this case the correct bound is.To compare the bounds, we can consider the asymptotics for large.Here the moment-generating function bound is,where the real bound is.The moment-generating function bound is thus very strong in this case.
Thecharacteristic function is related to the moment-generating function via the characteristic function is the moment-generating function ofiX or the moment generating function ofX evaluated on the imaginary axis. This function can also be viewed as theFourier transform of theprobability density function, which can therefore be deduced from it by inverse Fourier transform.
Thecumulant-generating function is defined as the logarithm of the moment-generating function; some instead define the cumulant-generating function as the logarithm of thecharacteristic function, while others call this latter thesecond cumulant-generating function.