This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Centering matrix" – news ·newspapers ·books ·scholar ·JSTOR(August 2024) (Learn how and when to remove this message) |
Inmathematics andmultivariate statistics, thecentering matrix[1] is asymmetric andidempotent matrix, which when multiplied with a vector has the same effect as subtracting themean of the components of the vector from every component of that vector.
Thecentering matrix of sizen is defined as then-by-n matrix
where is theidentity matrix of sizen and is ann-by-nmatrix of all 1's.
For example
Given a column-vector, of sizen, thecentering property of can be expressed as
where is acolumn vector of ones and is the mean of the components of.
is symmetricpositive semi-definite.
isidempotent, so that, for. Once the mean has been removed, it is zero and removing it again has no effect.
is singular. The effects of applying the transformation cannot be reversed.
has theeigenvalue 1 of multiplicityn − 1 and eigenvalue 0 of multiplicity 1.
has anullspace of dimension 1, along the vector.
is anorthogonal projection matrix. That is, is a projection of onto the (n − 1)-dimensionalsubspace that is orthogonal to the nullspace. (This is the subspace of alln-vectors whose components sum to zero.)
The trace of is.
Although multiplication by the centering matrix is not a computationally efficient way of removing the mean from a vector, it is a convenient analytical tool. It can be used not only to remove the mean of a single vector, but also of multiple vectors stored in the rows or columns of anm-by-n matrix.
The left multiplication by subtracts a corresponding mean value from each of then columns, so that each column of the product has a zero mean. Similarly, the multiplication by on the right subtracts a corresponding mean value from each of them rows, and each row of the product has a zero mean.The multiplication on both sides creates a doubly centred matrix, whose row and column means are equal to zero.
The centering matrix provides in particular a succinct way to express thescatter matrix, of a data sample, where is thesample mean. The centering matrix allows us to express the scatter matrix more compactly as
is thecovariance matrix of themultinomial distribution, in the special case where the parameters of that distribution are, and.