Statistical probability Distribution for discrete event counts
Hermite
Probability mass function
The horizontal axis is the indexk, the number of occurrences. The function is only defined at integer values ofk. The connecting lines are only guides for the eye.
Cumulative distribution function
The horizontal axis is the indexk, the number of occurrences. The CDF is discontinuous at the integers ofk and flat everywhere else because a variable that is Hermite distributed only takes on integer values.
The distribution first appeared in the paperApplications of Mathematics to Medical Problems,[2] byAnderson Gray McKendrick in 1926. In this work the author explains several mathematical methods that can be applied to medical research. In one of this methods he considered thebivariate Poisson distribution and showed that the distribution of the sum of two correlated Poisson variables follow a distribution that later would be known as Hermite distribution.
As a practical application, McKendrick considered the distribution of counts ofbacteria inleucocytes. Using themethod of moments he fitted the data with the Hermite distribution and found the model more satisfactory than fitting it with aPoisson distribution.
The distribution was formally introduced and published by C. D. Kemp and Adrienne W. Kemp in 1965 in their workSome Properties of ‘Hermite’ Distribution. The work is focused on the properties of this distribution for instance a necessary condition on the parameters and theirmaximum likelihood estimators (MLE), the analysis of theprobability generating function (PGF) and how it can be expressed in terms of the coefficients of (modified)Hermite polynomials. An example they have used in this publication is the distribution of counts of bacteria in leucocytes that used McKendrick but Kemp and Kemp estimate the model using themaximum likelihood method.
The same authors published in 1966 the paperAn alternative Derivation of the Hermite Distribution.[5] In this work established that the Hermite distribution can be obtained formally bycompounding aPoisson distribution with anormal distribution.
In 1971, Y. C. Patel[6] did a comparative study of various estimation procedures for the Hermite distribution in his doctoral thesis. It included maximum likelihood, moment estimators, mean and zero frequency estimators and the method of even points.
In 1974, Gupta and Jain[7] did a research on a generalized form of Hermite distribution.
When arandom variableY =X1 + 2X2 is distributed by an Hermite distribution, whereX1 andX2 are two independent Poisson variables with parametersa1 anda2, we write
Themoment generating function of a random variableX is defined as the expected value ofet, as a function of the real parametert. For an Hermite distribution with parametersX1 andX2, the moment generating function exists and is equal to
In a discrete distribution thecharacteristic function of any real-valued random variable is defined as theexpected value of, wherei is the imaginary unit andt ∈ R
This function is related to the moment-generating function via. Hence for this distribution the characteristic function is,[1]
This distribution can have any number ofmodes. As an example, the fitted distribution for McKendrick’s[2] data has an estimated parameters of,. Therefore, the first five estimated probabilities are 0.899, 0.012, 0.084, 0.001, 0.004.
Example of a multi-modal data, Hermite distribution(0.1,1.5).
This distribution is closed under addition or closed under convolutions.[9] Like thePoisson distribution, the Hermite distribution has this property. Given two Hermite-distributed random variables and, thenY =X1 +X2 follows an Hermite distribution,.
This distribution allows a moderateoverdispersion, so it can be used when data has this property.[9] A random variable has overdispersion, or it is overdispersed with respect the Poisson distribution, when its variance is greater than its expected value. The Hermite distribution allows a moderate overdispersion because the coefficient of dispersion is always between 1 and 2,
Given a sampleX1, ...,Xm areindependent random variables each having an Hermite distribution we wish to estimate the value of the parameters and. We know that the mean and the variance of the distribution are and, respectively. Using these two equation,
We can parameterize the probability function by μ andd
It can be shown that thelog-likelihood function is strictly concave in the domain of the parameters. Consequently, the MLE is unique.
The likelihood equation does not always have a solution like as it shows the following proposition,
Proposition:[9] LetX1, ...,Xm come from a generalized Hermite distribution with fixedn. Then the MLEs of the parameters are and if only if, where indicates the empirical factorial momement of order 2.
Remark 1: The condition is equivalent to where is the empirical dispersion index
Remark 2: If the condition is not satisfied, then the MLEs of the parameters are and, that is, the data are fitted using the Poisson distribution.
A usual choice for discrete distributions is the zero relative frequency of the data set which is equated to the probability of zero under the assumed distribution. Observing that and. Following the example of Y. C. Patel (1976) the resulting system of equations,
When Hermite distribution is used to model a data sample is important to check if thePoisson distribution is enough to fit the data. Following the parametrizedprobability mass function used to calculate the maximum likelihood estimator, is important to corroborate the following hypothesis,
Where is the log-likelihood function. Asd = 1 belongs to the boundary of the domain of parameters, under the null hypothesis,W does not have an asymptotic distribution as expected. It can be established that the asymptotic distribution ofW is a 50:50 mixture of the constant 0 and the. The α upper-tail percentage points for this mixture are the same as the 2α upper-tail percentage points for a; for instance, for α = 0.01, 0.05, and 0.10 they are 5.41189, 2.70554 and 1.64237.
The asymptotic distribution of the score test statistic under the null hypothesis is a distribution. It may be convenient to use a signed version of the score test, that is,, following asymptotically a standard normal.
^Huiming, Zhang; Yunxiao Liu; Bo Li (2014). "Notes on discrete compound Poisson model with applications to risk theory".Insurance: Mathematics and Economics.59:325–336.doi:10.1016/j.insmatheco.2014.09.012.
^abcdJohnson, N.L., Kemp, A.W., and Kotz, S. (2005) Univariate Discrete Distributions, 3rd Edition, Wiley,ISBN978-0-471-27246-5.
^Kemp, ADRIENNE W.; Kemp C.D (1966). "An alternative derivation of the Hermite distribution".Biometrika.53 (3–4):627–628.doi:10.1093/biomet/53.3-4.627.
^abcPatel, Y.C (1976). "Even Point Estimation and Moment Estimation in Hermite Distribution".Biometrics.32 (4):865–873.doi:10.2307/2529270.JSTOR2529270.
^Gupta, R.P.; Jain, G.C. (1974). "A Generalized Hermite distribution and Its Properties".SIAM Journal on Applied Mathematics.27 (2):359–363.doi:10.1137/0127027.JSTOR2100572.
^abKotz, Samuel (1982–1989).Encyclopedia of statistical sciences. John Wiley.ISBN978-0471055525.
^abcdefghPuig, P. (2003). "Characterizing Additively Closed Discrete Models by a Property of Their Maximum Likelihood Estimators, with an Application to Generalized Hermite Distributions".Journal of the American Statistical Association.98 (463):687–692.doi:10.1198/016214503000000594.JSTOR30045296.S2CID120484966.