| Part of a series on |
| Regression analysis |
|---|
| Models |
| Estimation |
| Background |
Ineconometrics, arandom effects model, also called avariance components model, is astatistical model where the model effects arerandom variables. It is a kind ofhierarchical linear model, which assumes that the data being analysed are drawn from a hierarchy of different populations whose differences relate to that hierarchy. A random effects model is a special case of amixed model.
Contrast this to thebiostatistics definitions,[1][2][3][4][5] as biostatisticians use "fixed" and "random" effects to respectively refer to the population-average and subject-specific effects (and where the latter are generally assumed to be unknown,latent variables).
Random effect models assist in controlling forunobserved heterogeneity when the heterogeneity is constant over time and not correlated with independent variables. This constant can be removed from longitudinal data through differencing, since taking a first difference will remove any time invariant components of the model.[6]
Two common assumptions can be made about the individual specific effect: the random effects assumption and the fixed effects assumption. The random effects assumption is that the individual unobserved heterogeneity is uncorrelated with the independent variables. The fixed effect assumption is that the individual specific effect is correlated with the independent variables.[6]
If the random effects assumption holds, the random effects estimator is moreefficient than the fixed effects model.
Suppose large elementary schools are chosen randomly from among thousands in a large country. Suppose also that pupils of the same age are chosen randomly at each selected school. Their scores on a standard aptitude test are ascertained. Let be the score of the-th pupil at the-th school.
A simple way to model this variable is
where is the average test score for the entire population.
In this model is the school-specificrandom effect: it measures the difference between the average score at school and the average score in the entire country. The term is the individual-specific random effect, i.e., it's the deviation of the-th pupil's score from the average for the-th school.
The model can be augmented by including additional explanatory variables, which would capture differences in scores among different groups. For example:
where is a binarydummy variable andrecords, say, the average education level of a child's parents. This is amixed model, not a purely random effects model, as it introducesfixed-effects terms for Sex and Parents' Education.
The variance of is the sum of the variances and of and respectively.
Let
be the average, not of all scores at the-th school, but of those at the-th school that are included in therandom sample. Let
be thegrand average.
Let
be respectively the sum of squares due to differenceswithin groups and the sum of squares due to differencebetween groups. Then it can be shown[citation needed] that
and
These "expected mean squares" can be used as the basis forestimation of the "variance components" and.
The parameter is also called theintraclass correlation coefficient.
This article mayrequirecleanup to meet Wikipedia'squality standards. The specific problem is:need to show formulas. Please helpimprove this article if you can.(April 2024) (Learn how and when to remove this message) |
For random effects models themarginal likelihoods are important.[7]
Random effects models used in practice include theBühlmann model of insurance contracts and theFay-Herriot model used forsmall area estimation.