Inchemistry andphysics, theexchange interaction is aquantum mechanical constraint on the states ofindistinguishable particles. While sometimes called anexchange force, or, in the case of fermions,Pauli repulsion, its consequences cannot always be predicted based on classical ideas offorce.[1] Bothbosons andfermions can experience the exchange interaction.
Thewave function ofindistinguishable particles is subject toexchange symmetry: the wave function either changes sign (for fermions) or remains unchanged (for bosons) when two particles are exchanged. The exchange symmetry alters theexpectation value of the distance between two indistinguishable particles when their wave functions overlap. For fermions the expectation value of the distance increases, and for bosons it decreases (compared to distinguishable particles).[2]
The exchange interaction arises from the combination of exchange symmetry and theCoulomb interaction. For an electron in anelectron gas, the exchange symmetry creates an "exchange hole" in its vicinity, which other electrons with the samespin tend to avoid due to thePauli exclusion principle. This decreases the energy associated with theCoulomb interactions between the electrons with same spin.[3] Since two electrons with different spins are distinguishable from each other and not subject to the exchange symmetry, the effect tends to align the spins. Exchange interaction is the main physical effect responsible forferromagnetism, and has noclassical analogue.
For bosons, the exchange symmetry makes them bunch together, and the exchange interaction takes the form of an effective attraction that causes identical particles to be found closer together, as inBose–Einstein condensation.
Exchange interaction effects were discovered independently by physicistsWerner Heisenberg andPaul Dirac in 1926.[4][5]
Quantum particles are fundamentally indistinguishable.Wolfgang Pauli demonstrated that this is a type of symmetry: states of two particles must be either symmetric or antisymmetric when coordinate labels are exchanged.[6]In a simple one-dimensional system with two identical particles in two states and the system wavefunction can therefore be written two ways: Exchanging and gives either a symmetric combination of the states ("plus") or an antisymmetric combination ("minus"). Particles that give symmetric combinations are called bosons; those with antisymmetric combinations are called fermions.
The two possible combinations imply different physics. For example, the expectation value of the square of the distance between the two particles is[7]: 258 The last term reduces the expected value for bosons and increases the value for fermions but only when the states and physically overlap().
The physical effect of the exchange symmetry requirement is not aforce. Rather it is a significant geometrical constraint, increasing the curvature of wavefunctions to prevent the overlap of the states occupied by indistinguishable fermions. The terms "exchange force" and "Pauli repulsion" for fermions are sometimes used as an intuitive description of the effect but this intuition can give incorrect physical results.[1][7]: 291
Quantum mechanical particles are classified as bosons or fermions. Thespin–statistics theorem ofquantum field theory demands that all particles withhalf-integerspin behave as fermions and all particles withinteger spin behave as bosons. Multiple bosons may occupy the samequantum state; however, by thePauli exclusion principle, no two fermions can occupy the same state. Sinceelectrons have spin 1/2, they are fermions. This means that the overall wave function of a system must be antisymmetric when two electrons are exchanged, i.e. interchanged with respect to both spatial and spin coordinates. First, however, exchange will be explained with the neglect of spin.
Taking a hydrogen molecule-like system (i.e. one with two electrons), one may attempt to model the state of each electron by first assuming the electrons behave independently (that is, as if the Pauli exclusion principle did not apply), and taking wave functions in position space of for the first electron and for the second electron. The functions and are orthogonal, and each corresponds to an energy eigenstate. To enforce the indistinguishability of the two electrons, two wave functions for the overall system in position space can be constructed. One uses an antisymmetric combination of the product wave functions in position space:
| 1 |
The other uses a symmetric combination of the product wave functions in position space:
| 2 |
To treat the problem of the hydrogen moleculeperturbatively, the overallHamiltonian is decomposed into an unperturbed Hamiltonian of the non-interacting hydrogen atoms and a perturbing Hamiltonian, which accounts for interactions between the two atoms. The full Hamiltonian is then:
where and
The first two terms of denote thekinetic energy of the electrons. The remaining terms account for attraction between the electrons and their host protons (). The terms in account for thepotential energy corresponding to: proton–proton repulsion (), electron–electron repulsion (), and electron–proton attraction between the electron of one host atom and the proton of the other (). All quantities are assumed to bereal.
Two eigenvalues for the system energy are found:
| 3 |
where the is the spatially symmetric solution and is the spatially antisymmetric solution, corresponding to and respectively. A variational calculation yields similar results. can be diagonalized by using the position–space functions given by Eqs. (1) and (2). In Eq. (3), is the two-site two-electronCoulomb integral (It may be interpreted as the repulsive potential for electron-one at a particular point in anelectric field created by electron-two distributed over the space with the probability density,[a] is theoverlap integral, and is theexchange integral, which is similar to the two-site Coulomb integral but includes exchange of the two electrons. It has no simple physical interpretation, but it can be shown to arise entirely due to the anti-symmetry requirement. These integrals are given by:
| 4 |
| 5 |
| 6 |
Although in the hydrogen molecule the exchange integral, Eq. (6), is negative, Heisenberg first suggested that it changes sign at some critical ratio of internuclear distance to mean radial extension of the atomic orbital.[8][9][10] The detailed calculation, including evaluation of the above integrals with ground state hydrogen atom wave functions, and application of the variational principle to obtain the minimum energy in both cases of the hydrogen molecule for atomic orbitals and the hydrogen radical (i.e. with only one electron) for molecular orbitals can be found in the book of Müller-Kirsten pp. 272-292 (only in the second edition).[11]
The symmetric and antisymmetric combinations in Equations (1) and (2) did not include the spin variables (α = spin-up; β = spin-down); there are also antisymmetric and symmetric combinations of the spin variables:
| 7 |
To obtain the overall wave function, these spin combinations have to be coupled with Eqs. (1) and (2). The resulting overall wave functions, calledspin-orbitals, are written asSlater determinants. When the orbital wave function is symmetrical the spin one must be anti-symmetrical and vice versa. Accordingly, above corresponds to the spatially symmetric/spin-singlet solution and to the spatially antisymmetric/spin-triplet solution.
J. H. Van Vleck presented the following analysis:[12]
The potential energy of the interaction between the two electrons in orthogonal orbitals can be represented by a matrix, say. From Eq. (3), the characteristic values of this matrix are. The characteristic values of a matrix are its diagonal elements after it is converted to a diagonal matrix (that is, eigenvalues). Now, the characteristic values of the square of the magnitude of the resultant spin, is. The characteristic values of the matrices and are each and. The characteristic values of the scalar product are and, corresponding to both the spin-singlet () and spin-triplet () states, respectively.
From Eq. (3) and the aforementioned relations, the matrix is seen to have the characteristic value when has the characteristic value −3/4 (i.e. when; the spatially symmetric/spin-singlet state). Alternatively, it has the characteristic value when has the characteristic value +1/4 (i.e. when; the spatially antisymmetric/spin-triplet state). Therefore,
8 and, hence,
9 where the spin momenta are given as and.
Dirac pointed out that the critical features of the exchange interaction could be obtained in an elementary way by neglecting the first two terms on the right-hand side of Eq. (9), thereby considering the two electrons as simply having their spins coupled by a potential of the form:
| 10 |
It follows that the exchange interaction Hamiltonian between two electrons in orbitals and can be written in terms of their spin momenta and. This interaction is named theHeisenberg exchange Hamiltonian or the Heisenberg–Dirac Hamiltonian in the older literature:
| 11 |
is not the same as the quantity labeled in Eq. (6). Rather,, which is termed theexchange constant, is a function of Eqs. (4), (5), and (6), namely,
| 12 |
However, with orthogonal orbitals (in which = 0), for example with different orbitals in thesame atom,.
If is positive the exchange energy favors electrons with parallel spins; this is a primary cause offerromagnetism in materials in which the electrons are considered localized in theHeitler–London model of chemical bonding, but this model of ferromagnetism has severe limitations in solids (seebelow). If is negative, the interaction favors electrons with antiparallel spins, potentially causingantiferromagnetism. The sign of is essentially determined by the relative sizes of and the product of. This sign can be deduced from the expression for the difference between the energies of the triplet and singlet states,:
| 13 |
Although theseconsequences of the exchange interaction are magnetic in nature, thecause is not; it is due primarily to electric repulsion and the Pauli exclusion principle. In general, the direct magnetic interaction between a pair of electrons (due to theirelectron magnetic moments) is negligibly small compared to this electric interaction.
Exchange energy splittings are very elusive to calculate for molecular systems at large internuclear distances. However, analytical formulae have been worked out for thehydrogen molecular ion (see references herein).
Normally, exchange interactions are very short-ranged, confined to electrons in orbitals on the same atom (intra-atomic exchange) or nearest neighbor atoms (direct exchange) but longer-ranged interactions can occur via intermediary atoms and this is termedsuperexchange.
In a crystal, generalization of the Heisenberg Hamiltonian in which the sum is taken over the exchange Hamiltonians for all the pairs of atoms of the many-electron system gives:.
| 14 |
The 1/2 factor is introduced because the interaction between the same two atoms is counted twice in performing the sums. Note that the in Eq.(14) is the exchange constant above not the exchange integral. The exchange integral is related to yet another quantity, called theexchange stiffness constant () which serves as a characteristic of a ferromagnetic material. The relationship is dependent on thecrystal structure. For a simple cubic lattice with lattice parameter,
| 15 |
For a body-centered cubic lattice,
| 16 |
and for a face-centered cubic lattice,
| 17 |
The form of Eq. (14) corresponds identically to theIsing model of ferromagnetism except that in the Ising model, thedot product of the two spin angular momenta is replaced by the scalar product. The Ising model was invented byWilhelm Lenz in 1920 and solved for the one-dimensional case by his doctoral student Ernst Ising in 1925. The energy of the Ising model is defined to be:
| 18 |
Because the Heisenberg Hamiltonian presumes the electrons involved in the exchange coupling are localized in the context of the Heitler–London, orvalence bond (VB), theory of chemical bonding, it is an adequate model for explaining the magnetic properties of electrically insulating narrow-band ionic and covalent non-molecular solids where this picture of the bonding is reasonable. Nevertheless, theoretical evaluations of the exchange integral for non-molecular solids that display metallic conductivity in which the electrons responsible for the ferromagnetism are itinerant (e.g. iron, nickel, and cobalt) have historically been either of the wrong sign or much too small in magnitude to account for the experimentally determined exchange constant (e.g. as estimated from the Curie temperatures via where is the exchange interaction averaged over all sites).
The Heisenberg model thus cannot explain the observed ferromagnetism in these materials.[13] In these cases, a delocalized, or Hund–Mulliken–Bloch (molecular orbital/band) description, for the electron wave functions is more realistic. Accordingly, theStoner model of ferromagnetism is more applicable.
In the Stoner model, the spin-only magnetic moment (in Bohr magnetons) per atom in a ferromagnet is given by the difference between the number of electrons per atom in the majority spin and minority spin states. The Stoner model thus permits non-integral values for the spin-only magnetic moment per atom. However, with ferromagnets ( = 2.0023 ≈ 2) tends to overestimate the totalspin-only magnetic moment per atom.
For example, a net magnetic moment of 0.54 μB per atom for Nickel metal is predicted by the Stoner model, which is very close to the 0.61 Bohr magnetons calculated based on the metal's observed saturation magnetic induction, its density, and its atomic weight.[14] By contrast, an isolated Ni atom (electron configuration = 3d84s2) in a cubic crystal field will have two unpaired electrons of the same spin (hence,) and would thus be expected to have in the localized electron model a total spin magnetic moment of (but the measured spin-only magnetic moment along one axis, the physical observable, will be given by).
Generally, valences andp electrons are best considered delocalized, while 4f electrons are localized and 5f and 3d/4d electrons are intermediate, depending on the particular internuclear distances.[15] In the case of substances where both delocalized and localized electrons contribute to the magnetic properties (e.g. rare-earth systems), theRuderman–Kittel–Kasuya–Yosida (RKKY) model is the currently accepted mechanism.