Inprobability theory andmathematical physics, arandom matrix is amatrix-valuedrandom variable—that is, a matrix in which some or all of its entries aresampled randomly from aprobability distribution.Random matrix theory (RMT) is the study of properties of random matrices, often as they become large. RMT provides techniques likemean-field theory, diagrammatic methods, thecavity method, or thereplica method to compute quantities liketraces,spectral densities, or scalar products between eigenvectors. Many physical phenomena, such as thespectrum ofnuclei of heavy atoms,[1][2] thethermal conductivity of alattice, or the emergence ofquantum chaos,[3] can be modeled mathematically as problems concerning large, random matrices.
This sectionneeds expansion. You can help byadding missing information.(April 2024) |
Random matrix theory first gained attention beyond mathematics literature in the context of nuclear physics. Experiments byEnrico Fermi and others demonstrated evidence that individualnucleons cannot be approximated to move independently, leadingNiels Bohr to formulate the idea of acompound nucleus. Because there was no knowledge of directnucleon-nucleon interactions,Eugene Wigner andLeonard Eisenbud approximated that the nuclearHamiltonian could be modeled as a random matrix. For larger atoms, the distribution of theenergy eigenvalues of the Hamiltonian could be computed in order to approximatescattering cross sections by invoking theWishart distribution.[4]
Innuclear physics, random matrices were introduced byEugene Wigner to model the nuclei of heavy atoms.[1][2] Wigner postulated that the spacings between the lines in the spectrum of a heavy atom nucleus should resemble the spacings between theeigenvalues of a random matrix, and should depend only on the symmetry class of the underlying evolution.[5] Insolid-state physics, random matrices model the behaviour of large disorderedHamiltonians in themean-field approximation.
Inquantum chaos, the Bohigas–Giannoni–Schmit (BGS) conjecture asserts that the spectral statistics of quantum systems whose classical counterparts exhibit chaotic behaviour are described by random matrix theory.[3]
Inquantum optics, transformations described by random unitary matrices are crucial for demonstrating the advantage of quantum over classical computation (see, e.g., theboson sampling model).[6] Moreover, such random unitary transformations can be directly implemented in an optical circuit, by mapping their parameters to optical circuit components (that isbeam splitters and phase shifters).[7]
Inmultivariate statistics, random matrices were introduced byJohn Wishart, who sought toestimate covariance matrices of large samples.[8]Chernoff-,Bernstein-, andHoeffding-type inequalities can typically be strengthened when applied to the maximal eigenvalue (i.e. the eigenvalue of largest magnitude) of a finite sum of randomHermitian matrices.[9] Random matrix theory is used to study the spectral properties of random matrices—such as sample covariance matrices—which is of particular interest inhigh-dimensional statistics. Random matrix theory also saw applications inneural networks[10] anddeep learning, with recent work utilizing random matrices to show that hyper-parameter tunings can be cheaply transferred between large neural networks without the need for re-training.[11]
Innumerical analysis, random matrices have been used since the work ofJohn von Neumann andHerman Goldstine[12] to describe computation errors in operations such asmatrix multiplication. Although random entries are traditional "generic" inputs to an algorithm, theconcentration of measure associated with random matrix distributions implies that random matrices will not test large portions of an algorithm's input space.[13]
Innumber theory, the distribution of zeros of theRiemann zeta function (and otherL-functions) is modeled by the distribution of eigenvalues of certain random matrices.[14] The connection was first discovered byHugh Montgomery andFreeman Dyson. It is connected to theHilbert–Pólya conjecture.
The relation offree probability with random matrices[15] is a key reason for the wide use of free probability in other subjects. Voiculescu introduced the concept of freeness around 1983 in an operator algebraic context; at the beginning there was no relation at all with random matrices. This connection was only revealed later in 1991 by Voiculescu;[16] he was motivated by the fact that the limit distribution which he found in his free central limit theorem had appeared before in Wigner's semi-circle law in the random matrix context.
In the field of computational neuroscience, random matrices are increasingly used to model the network of synaptic connections between neurons in the brain. Dynamical models of neuronal networks with random connectivity matrix were shown to exhibit a phase transition to chaos[17] when the variance of the synaptic weights crosses a critical value, at the limit of infinite system size. Results on random matrices have also shown that the dynamics of random-matrix models are insensitive to mean connection strength. Instead, the stability of fluctuations depends on connection strength variation[18][19] and time to synchrony depends on network topology.[20][21]
In the analysis of massive data such asfMRI, random matrix theory has been applied in order to perform dimension reduction. When applying an algorithm such asPCA, it is important to be able to select the number of significant components. The criteria for selecting components can be multiple (based on explained variance, Kaiser's method, eigenvalue, etc.). Random matrix theory in this content has its representative theMarchenko-Pastur distribution, which guarantees the theoretical high and low limits of the eigenvalues associated with a random variable covariance matrix. This matrix calculated in this way becomes the null hypothesis that allows one to find the eigenvalues (and their eigenvectors) that deviate from the theoretical random range. The components thus excluded become the reduced dimensional space (see examples in fMRI[22][23]).
Inoptimal control theory, the evolution ofn state variables through time depends at any time on their own values and on the values ofk control variables. With linear evolution, matrices of coefficients appear in the state equation (equation of evolution). In some problems the values of the parameters in these matrices are not known with certainty, in which case there are random matrices in the state equation and the problem is known as one ofstochastic control.[24]: ch. 13 [25] A key result in the case oflinear-quadratic control with stochastic matrices is that thecertainty equivalence principle does not apply: while in the absence ofmultiplier uncertainty (that is, with only additive uncertainty) the optimal policy with a quadratic loss function coincides with what would be decided if the uncertainty were ignored, the optimal policy may differ if the state equation contains random coefficients.
Incomputational mechanics, epistemic uncertainties underlying the lack of knowledge about the physics of the modeled system give rise to mathematical operators associated with the computational model, which are deficient in a certain sense. Such operators lack certain properties linked to unmodeled physics. When such operators are discretized to perform computational simulations, their accuracy is limited by the missing physics. To compensate for this deficiency of the mathematical operator, it is not enough to make the model parameters random, it is necessary to consider a mathematical operator that is random and can thus generate families of computational models in the hope that one of these captures the missing physics. Random matrices have been used in this sense,[26] with applications in vibroacoustics, wave propagations, materials science, fluid mechanics, heat transfer, etc.
Random matrix theory can be applied to the electrical and communications engineering research efforts to study, model and develop Massive Multiple-Input Multiple-Output (MIMO) radio systems.[citation needed]
The most-commonly studied random matrixdistributions are the Gaussian ensembles: GOE, GUE and GSE. They are often denoted by theirDyson index,β = 1 for GOE,β = 2 for GUE, andβ = 4 for GSE. This index counts the number of real components per matrix element.
TheGaussian unitary ensemble is described by theGaussian measure with densityon the space ofHermitian matrices. Hereis a normalization constant, chosen so that the integral of the density is equal to one. The termunitary refers to the fact that the distribution is invariant under unitary conjugation. The Gaussian unitary ensemble modelsHamiltonians lacking time-reversal symmetry.
TheGaussian orthogonal ensemble is described by the Gaussian measure with densityon the space ofn × n real symmetric matricesH = (Hij)n
i,j=1. Its distribution is invariant under orthogonal conjugation, and it models Hamiltonians with time-reversal symmetry. Equivalently, it is generated by, where is an matrix with IID samples from the standard normal distribution.
TheGaussian symplectic ensemble is described by the Gaussian measure with densityon the space ofn × n Hermitianquaternionic matrices, e.g. symmetric square matrices composed ofquaternions,H = (Hij)n
i,j=1. Its distribution is invariant under conjugation by thesymplectic group, and it models Hamiltonians with time-reversal symmetry but no rotational symmetry.
Pointcorrelation functions The ensembles as defined here have Gaussian distributed matrix elements with mean ⟨Hij⟩ = 0, and two-point correlations given byfrom which all higher correlations follow byIsserlis' theorem.
Themoment generating function for the GOE iswhere is theFrobenius norm.

The jointprobability density for theeigenvaluesλ1,λ2, ...,λn of GUE/GOE/GSE is given by
| 1 |
whereZβ,n is a normalization constant which can be explicitly computed, seeSelberg integral. In the case of GUE (β = 2), the formula (1) describes adeterminantal point process. Eigenvalues repel as the joint probability density has a zero (ofth order) for coinciding eigenvalues, and.
More succinctly,where is theVandermonde determinant.
The distribution of the largest eigenvalue for GOE, and GUE, are explicitly solvable.[27] They converge to theTracy–Widom distribution after shifting and scaling appropriately.
The spectrum, divided by, converges in distribution to thesemicircular distribution on the interval:. Here is the variance of off-diagonal entries. The variance of the on-diagonal entries do not matter.
Wishart matrices aren × n random matrices of the formH =XX*, whereX is ann × m random matrix (m ≥ n) with independent entries, andX* is itsconjugate transpose. In the important special case considered by Wishart, the entries ofX are identically distributed Gaussian random variables (either real or complex).
Thelimit of the empirical spectral measure of Wishart matrices was found[28] byVladimir Marchenko andLeonid Pastur.
Random band matrices are random matrices with the property that all entries outside a certain band are zero.[29] They can be used to roughly model systems of interacting particles arranged roughly in a grid such that each particle is only allowed to interact with its neighbors, which is an improvement on the mean field model.[29]
In one dimension, this means that if, where W is the band width. Physically, this means that the amount by which particlesi andj interact is 0 if their separation is over W. In more than one dimension, i and j are no longer integers butnd vectors with integer components, and if, where indicates thetaxicab distance between the two locations. for all i,j and nonzero values of have variances of the same order of magnitude, normalized such that for each value of j.[29]
The spectral theory of random matrices studies the distribution of the eigenvalues as the size of the matrix goes to infinity.[30]
Theempirical spectral measure of is defined byor more succinctly, if are the eigenvalues of
Usually, the limit of is a deterministic measure; this is a particular case ofself-averaging. Thecumulative distribution function of the limiting measure is called theintegrated density of states and is denotedN(λ). If the integrated density of states is differentiable, its derivative is called thedensity of states and is denoted ρ(λ).
Given a matrix ensemble, we say that its spectral measures convergeweakly to iff for any measurable set, the ensemble-average converges:Convergenceweakly almost surely: If we sample independently from the ensemble, then with probability 1,for any measurable set.
In another sense, weak almost sure convergence means that we sample, not independently, but by "growing" (astochastic process), then with probability 1, for any measurable set.
For example, we can "grow" a sequence of matrices from the Gaussian ensemble as follows:
Note that generic matrix ensembles do not allow us to grow, but most of the common ones, such as the three Gaussian ensembles, do allow us to grow.
In theglobal regime, one is interested in the distribution of linear statistics of the form.
The limit of the empirical spectral measure for Wigner matrices was described byEugene Wigner; seeWigner semicircle distribution andWigner surmise. As far as sample covariance matrices are concerned, atheory was developed by Marčenko and Pastur.[28][31]
The limit of the empirical spectral measure of invariant matrix ensembles is described by a certain integral equation which arises frompotential theory.[32]
For the linear statisticsNf,H =n−1 Σf(λj), one is also interested in the fluctuations about ∫ f(λ) dN(λ). For many classes of random matrices, a central limit theorem of the formis known.[33][34]
Consider the measure
where is the potential of the ensemble and let be the empirical spectral measure.
We can rewrite with as
the probability measure is now of the form
where is the above functional inside the squared brackets.
Let now
be the space of one-dimensional probability measures and consider the minimizer
For there exists a unique equilibrium measure through theEuler-Lagrange variational conditions for some real constant
where is the support of the measure and define
The equilibrium measure has the following Radon–Nikodym density
[36][37] The typical statement of the Wigner semicircular law is equivalent to the following statement: For eachfixed interval centered at a point, as, the number of dimensions of the gaussian ensemble increases, the proportion of the eigenvalues falling within the interval converges to, where is the density of the semicircular distribution.
If can be allowed to decrease as increases, then we obtain strictly stronger theorems, named "local laws" or "mesoscopic regime".
The mesoscopic regime is intermediate between the local and the global. In themesoscopic regime, one is interested in the limit distribution of eigenvalues in a set that shrinks to zero, but slow enough, such that the number of eigenvalues inside.
For example, the Ginibre ensemble has a mesoscopic law: For any sequence of shrinking disks with areasinside the unite disk, if the disks have area, the conditional distribution of the spectrum inside the disks also converges to a uniform distribution. That is, if we cut the shrinking disks along with the spectrum falling inside the disks, then scale the disks up to unit area, we would see the spectra converging to a flat distribution in the disks.[37]
In thelocal regime, one is interested in the limit distribution of eigenvalues in a set that shrinks so fast that the number of eigenvalues remains.
Typically this means the study of spacings between eigenvalues, and, more generally, in the joint distribution of eigenvalues in an interval of length of order 1/n. One distinguishes betweenbulk statistics, pertaining to intervals inside the support of the limiting spectral measure, andedge statistics, pertaining to intervals near the boundary of the support.
Formally, fix in theinterior of thesupport of. Then consider thepoint processwhere are the eigenvalues of the random matrix.
The point process captures the statistical properties of eigenvalues in the vicinity of. For theGaussian ensembles, the limit of is known;[5] thus, for GUE it is adeterminantal point process with the kernel(thesine kernel).
Theuniversality principle postulates that the limit of as should depend only on the symmetry class of the random matrix (and neither on the specific model of random matrices nor on). Rigorous proofs of universality are known for invariant matrix ensembles[38][39] and Wigner matrices.[40][41]
One example of edge statistics is theTracy–Widom distribution.
As another example, consider the Ginibre ensemble. It can be real or complex. The real Ginibre ensemble has i.i.d. standard Gaussian entries, and the complex Ginibre ensemble has i.i.d. standard complex Gaussian entries.
Now let be sampled from the real or complex ensemble, and let be the absolute value of its maximal eigenvalue:We have the following theorem for the edge statistics:[42]
Edge statistics of the Ginibre ensemble—For and as above, with probability one,
Moreover, if andthen converges in distribution to theGumbel law, i.e., the probability measure on with cumulative distribution function.
This theorem refines thecircular law of the Ginibre ensemble. In words, the circular law says that the spectrum of almost surely falls uniformly on the unit disc. and the edge statistics theorem states that the radius of the almost-unit-disk is about, and fluctuates on a scale of, according to the Gumbel law.
The phenomenon ofspectral rigidity states that the eigenvalues from most commonly used matrix ensembles tend to be more uniformly distributed than they would be if they were sampled independently at random. That is, they together clump less than a purelyPoisson point process. It is also calledeigenvalue rigidity orlevel repulsion.
More quantitatively, suppose that a matrix ensemble has limit spectral density measure. Fix some subset such that. This is the proportion of eigenvalues that falls within at the limit of large, so the expected number of eigenvalues falling within is. Now, a purely Poisson point process would have meant that the actual number of, since is the standard deviation of the number of points falling within when the points are completely independent of each other. Conversely, if the points are completely rigid, then the actual number would be equal to without fluctuation. Now, it turns out that in many matrix ensembles, the number of points falling within is, i.e. not completely rigid, but very close to it.[43][44] Spectral rigidity has been numerically observed in the zeros of theRiemann zeta function.[45]
The joint probability density of the eigenvalues of random Hermitian matrices, with partition functions of the formwhereand is the standard Lebesgue measure on the space of Hermitian matrices, is given byThe-point correlation functions (ormarginal distributions) are defined aswhich are skew symmetric functions of their variables. In particular, the one-point correlation function, ordensity of states, isIts integral over a Borel set gives the expected number of eigenvalues contained in:
The following result expresses these correlation functions as determinants of the matrices formed from evaluating the appropriate integral kernel at the pairs of points appearing within the correlator.
Theorem [Dyson-Mehta] For any, the-point correlation function can be written as a determinantwhere is theth Christoffel-Darboux kernelassociated to, written in terms of the quasipolynomials where is a complete sequence of monic polynomials, of the degrees indicated, satisfying the orthogonality conditions
Wigner matrices are random Hermitian matrices such that the entriesabove the main diagonal are independent random variables with zero mean and have identical second moments.
The Gaussian ensembles can be extended for using the Dumitriu-Edelman tridiagonal trick. These are called thebeta ensembles.[46]
Invariant matrix ensembles are random Hermitian matrices with density on the space of real symmetric/Hermitian/quaternionic Hermitian matrices, which is of the form where the functionV is called the potential.
The Gaussian ensembles are the only common special cases of these two classes of random matrices. This is a consequence of a theorem by Porter and Rosenzweig.[47][48]
Heavy tailed distributions generalize to random matrices asheavy tailed matrix ensembles.[49]
{{citation}}: CS1 maint: work parameter with ISBN (link){{cite journal}}: CS1 maint: unflagged free DOI (link){{citation}}: CS1 maint: work parameter with ISBN (link){{citation}}: CS1 maint: work parameter with ISBN (link)