Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Matrix mechanics

From Wikipedia, the free encyclopedia
Formulation of quantum mechanics
Part of a series of articles about
Quantum mechanics
iddt|Ψ=H^|Ψ{\displaystyle i\hbar {\frac {d}{dt}}|\Psi \rangle ={\hat {H}}|\Psi \rangle }

Matrix mechanics is a formulation ofquantum mechanics created byWerner Heisenberg,Max Born, andPascual Jordan in 1925. It was the first conceptually autonomous and logically consistent formulation of quantum mechanics. Its account ofquantum jumps supplanted theBohr model'selectron orbits. It did so by interpreting the physical properties of particles asmatrices that evolve in time. It is equivalent to theSchrödinger wave formulation of quantum mechanics, as manifest inDirac'sbra–ket notation.

In some contrast to the wave formulation, it produces spectra of (mostly energy) operators by purely algebraic,ladder operator methods.[1] Relying on these methods,Wolfgang Pauli derived the hydrogen atom spectrum in 1926,[2] before the development of wave mechanics.

Development of matrix mechanics

[edit]

In 1925,Werner Heisenberg,Max Born, andPascual Jordan formulated the matrix mechanics representation of quantum mechanics.

Epiphany at Heligoland

[edit]
Main article:Umdeutung paper

In 1925 Werner Heisenberg was working inGöttingen on the problem of calculating thespectral lines ofhydrogen. By May 1925 he began trying to describe atomic systems byobservables only. On June 7, after weeks of failing to alleviate hishay fever with aspirin and cocaine,[3] Heisenberg left for the pollen-freeNorth Sea island ofHeligoland. While there, in between climbing and memorizing poems fromGoethe'sWest-östlicher Diwan, he continued to ponder the spectral issue and eventually realised that adoptingnon-commuting observables might solve the problem. He later wrote:

It was about three o' clock at night when the final result of the calculation lay before me. At first I was deeply shaken. I was so excited that I could not think of sleep. So I left the house and awaited the sunrise on the top of a rock.[4]: 275 

The three fundamental papers

[edit]

After Heisenberg returned to Göttingen, he showedWolfgang Pauli his calculations, commenting at one point:

Everything is still vague and unclear to me, but it seems as if the electrons will no more move on orbits.[5]

On July 9 Heisenberg gave the same paper of his calculations to Max Born, saying that "he had written a crazy paper and did not dare to send it in for publication, and that Born should read it and advise him" prior to publication. Heisenberg then departed for a while, leaving Born to analyse the paper.[6]

In the paper, Heisenberg formulated quantum theory without sharply-defined electron orbits, directly advocating for a re-interpretation of quantum theory that only focused on experimental observables like frequencies and transition probabilities.

Before Heisenberg's paper,Hendrik Kramers had calculated the relative intensities of spectral lines in theSommerfeld model by interpreting theFourier coefficients of the orbits as intensities. But his answer, like all other calculations in theold quantum theory, was only correct forlarge orbits.

Heisenberg, after a collaboration with Kramers,[7] began to believe that the transition probabilities describing quantum transitions would need a new interpretation different from classical mechanics because Heisenberg believed that the frequencies that should appear in a series describing the position of the electron shouldonly be the ones that are experimentally observed in quantum transitions (like through spectral lines), not the complete set of spatial frequencies that come from making a traditionalFourier series of classical orbits.

The quantities in Heisenberg's original formulation involved a series that described position as a series of "virtual oscillators" with two indices, with the two indices representing the initial and final states of a quantum transition.[8] Rather than following the multiplication rule as expected from multiplying Fourier series, Heisenberg formed a non-commutative multiplication rule to ensure that multiplying position states would preserve the frequencies that are only found in the quantum transitions.

When Born read the paper, he recognized the formulation, particularly the non-commutative multiplication rule, as one which could be transcribed and extended to the systematiclanguage of matrices,[9] which he had learned from his study underJakob Rosanes[10] atBreslau University. Born, with the help of his assistant and former student Pascual Jordan, began immediately to make the transcription and extension, and they submitted their results for publication; the paper was received for publication just 60 days after Heisenberg's paper.[11]

A follow-on paper was submitted for publication before the end of the year by all three authors.[12] (A brief review of Born's role in the development of the matrix mechanics formulation of quantum mechanics along with a discussion of the key formula involving the non-commutativity of the probability amplitudes can be found in an article byJeremy Bernstein.[13] A detailed historical and technical account can be found in Mehra and Rechenberg's bookThe Historical Development of Quantum Theory. Volume 3. The Formulation of Matrix Mechanics and Its Modifications 1925–1926.[14])

The three fundamental papers:

  • W. Heisenberg,Über quantentheoretische Umdeutung kinematischer und mechanischer Beziehungen,Zeitschrift für Physik,33, 879-893, 1925 (received July 29, 1925). [English translation in: B. L. van der Waerden, editor,Sources of Quantum Mechanics (Dover Publications, 1968)ISBN 0-486-61881-1 (English title:Quantum-Theoretical Re-interpretation of Kinematic and Mechanical Relations).]
  • M. Born and P. Jordan,Zur Quantenmechanik,Zeitschrift für Physik,34, 858-888, 1925 (received September 27, 1925). [English translation in: B. L. van der Waerden, editor,Sources of Quantum Mechanics (Dover Publications, 1968)ISBN 0-486-61881-1 (English title:On Quantum Mechanics).]
  • M. Born, W. Heisenberg, and P. Jordan,Zur Quantenmechanik II,Zeitschrift für Physik,35, 557-615, 1926 (received November 16, 1925). [English translation in: B. L. van der Waerden, editor,Sources of Quantum Mechanics (Dover Publications, 1968)ISBN 0-486-61881-1 (English title:On Quantum Mechanics II).]

Up until this time, matrices were seldom used by physicists; they were considered to belong to the realm of pure mathematics, thus requiring Born and Jordan's paper to introduce matrix algebra to physicists unaware of their use.Gustav Mie had used them in a paper on electrodynamics in 1912 and Born had used them in his work on the lattices theory of crystals in 1921. While matrices were used in these cases, the algebra of matrices with their multiplication did not enter the picture as they did in the matrix formulation of quantum mechanics.[15]

Born, however, had learned matrix algebra from Rosanes, as already noted, but Born had also learned Hilbert's theory of integral equations and quadratic forms for an infinite number of variables as was apparent from a citation by Born of Hilbert's workGrundzüge einer allgemeinen Theorie der Linearen Integralgleichungen published in 1912.[16][17]

Jordan, too, was well equipped for the task. For a number of years, he had been an assistant toRichard Courant at Göttingen in the preparation of Courant andDavid Hilbert's bookMethoden der mathematischen Physik I, which was published in 1924.[18] This book, fortuitously, contained a great many of the mathematical tools necessary for the continued development of quantum mechanics.

In 1926,John von Neumann became assistant to David Hilbert, and he would coin the termHilbert space to describe the algebra and analysis which were used in the development of quantum mechanics.[19][20]

A linchpin contribution to this formulation was achieved in Dirac's reinterpretation/synthesis paper of 1925,[21] which invented the language and framework usually employed today, in full display of the noncommutative structure of the entire construction.

Heisenberg's reasoning

[edit]

Before matrix mechanics, the old quantum theory described the motion of a particle by a classical orbit, with well defined position and momentumX(t) ,P(t) , with the restriction that the time integral over one period T  of the momentum times the velocity must be a positive integer multiple of thePlanck constant(h ) as described by theSommerfeld-Wilson quantization condition 0TP dX dtdt = 0TPdX=n h .{\displaystyle \ \int _{0}^{T}P\;{\frac {\ \mathrm {d} X\ }{\mathrm {d} t}}\;\mathrm {d} t\ =\ \int _{0}^{T}P\;\mathrm {d} X=n\ h~.}While this restriction correctly selects orbits with theright energy valuesEn , the old quantum formalism did not describe time dependent processes, such as the emission or absorption of radiation.

When a classical particle is weakly coupled to a radiation field, so that the radiative damping can be neglected, it will emitradiation in a pattern that repeats itself every orbital period. The frequencies that make up the outgoing wave are then integer multiples of the orbital frequency, and this is a reflection of the fact thatX(t)  is periodic, so that itsFourier representation has frequencies 2 π n / T only.X(t) =n=ei 2πnt/TXn .{\displaystyle X(t)\ =\sum _{n=-\infty }^{\infty }e^{i\ 2\pi nt/T}X_{n}~.}The coefficients Xn arecomplex numbers. The ones with negative frequencies must be thecomplex conjugates of the ones with positive frequencies, so thatX(t)  will always be real,Xn=Xn .{\displaystyle X_{n}=X_{-n}^{*}~.}

A quantum mechanical particle, on the other hand, cannot emit radiation continuously; it can only emit photons. Under theBohr model, for a quantum particle starting in quantum number n  that then emits a photon by transitioning to orbit numberm , the energy of the photon isEn − Em , that gives a photon of frequency is En − Em/ h .

For large  n  andm , but withn − m relatively small,Bohr'scorrespondence principle expects the same classical frequenciesEnEm h (nm) T .{\displaystyle E_{n}-E_{m}\approx {\frac {\ h\ (n-m)\ }{T}}~.}In the formula above,  T  is the classical period of either orbit  n  or orbit m , since the difference between them is higher order inh . But for small  n  andm , or if n − m is large, the frequencies are not integer multiples of any single frequency.

Since in classical mechanics, the frequencies that the particle emits are the same as the frequencies in the Fourier description of its motion, Heisenberg inferred that in the time-dependent description of the particle, there should be something oscillating with frequency En − Em/ h . Heisenberg called this quantityXnm , and demanded that it should reduce to the classicalFourier coefficients in the classical limit. For large values ofn and m  but with  n − m  relatively small, Xnm is the(nm)th  Fourier coefficient of the classical motion at orbitn . Since Xnm has opposite frequency toXmn , the condition that X  is real becomesXnm=Xmn .{\displaystyle X_{nm}=X_{mn}^{*}~.}

By definition, Xnm only has the frequency En − Em/h , so its time evolution is may be described as: Xnm(t) = ei 2π(EnEm)t/h Xnm(0) = ei (EnEm)t/ Xnm(0) .{\displaystyle \ X_{nm}(t)\ =\ e^{i\ 2\pi \left(E_{n}-E_{m}\right)t/h}\ X_{nm}(0)\ =\ e^{i\ \left(E_{n}-E_{m}\right)t/\hbar }\ X_{nm}(0)~.}This is the original form of Heisenberg's equation of motion.

Given two arrays Xnm and Pnm describing two physical quantities, when modeling each as classical Fourier series, it is expected that their multiplication Xnk Pkm should also result in a new frequency as part of a new Fourier series. Whilst the Fourier coefficients of the product of two quantities is theconvolution of the Fourier coefficients of each one separately, Heisenbergchanged the multiplication rule to ensure that when multiplying each component, the new frequencies would only correspond to frequencies that already existed in the quantum orbit: (X P)mn = k=0 Xmk Pkn .{\displaystyle \ \left(X\ P\right)_{mn}\ =\ \sum _{k=0}^{\infty }\ X_{mk}\ P_{kn}~.}

Born noticed thatthis is the law of matrix multiplication, so that the position, the momentum, the energy, all the observable quantities in the theory, are interpreted as matrices. Under this multiplication rule, the product depends on the order:X P is different fromP X .

The X  matrix is a complete description of the motion of a quantum mechanical particle. Because the frequencies in the quantum motion are not multiples of a common frequency, the matrix elementscannot be interpreted as the Fourier coefficients of a sharp classical trajectory. Nevertheless, as matrices,X(t)  andP(t)  satisfy the classical equations of motion; also see Ehrenfest's theorem, below.

Matrix basics

[edit]

When it was introduced by Werner Heisenberg, Max Born and Pascual Jordan in 1925, matrix mechanics was not immediately accepted and was a source of controversy, at first. Schrödinger's later introduction ofwave mechanics was greatly favored.

Part of the reason was that Heisenberg's formulation was in an odd mathematical language, for the time, while Schrödinger's formulation was based on familiar wave equations. But there was also a deeper sociological reason. Quantum mechanics had been developing by two paths, one led by Einstein, who emphasized the wave–particle duality he proposed for photons, and the other led by Bohr, that emphasized the discrete energy states and quantum jumps that Bohr discovered. De Broglie had reproduced the discrete energy states within Einstein's framework – the quantum condition is the standing wave condition, and this gave hope to those in the Einstein school that all the discrete aspects of quantum mechanics would be subsumed into a continuous wave mechanics.

Matrix mechanics, on the other hand, came from the Bohr school, which was concerned with discrete energy states and quantum jumps. Bohr's followers did not appreciate physical models that pictured electrons as waves, or as anything at all. They preferred to focus on the quantities that were directly connected to experiments.

In atomic physics,spectroscopy gave observational data on atomic transitions arising from the interactions of atoms with lightquanta. The Bohr school required that only those quantities that were in principle measurable by spectroscopy should appear in the theory. These quantities include the energy levels and their intensities but they do not include the exact location of a particle in its Bohr orbit. It is very hard to imagine an experiment that could determine whether an electron in the ground state of a hydrogen atom is to the right or to the left of the nucleus. It was a deep conviction that such questions did not have an answer.

The matrix formulation was built on the premise that all physical observables are represented by matrices, whose elements are indexed by two different energy levels.[22] The set ofeigenvalues of the matrix were eventually understood to be the set of all possible values that the observable can have. Since Heisenberg's matrices areHermitian, the eigenvalues are real.

If an observable is measured and the result is a certain eigenvalue, the correspondingeigenvector is the state of the system immediately after the measurement. The act of measurement in matrix mechanics collapses the state of the system. If one measures two observables simultaneously, the state of the system collapses to a common eigenvector of the two observables. Since most matrices don't have any eigenvectors in common, most observables can never be measured precisely at the same time. This is theuncertainty principle.

If two matrices share their eigenvectors, they can be simultaneously diagonalized. In the basis where they are both diagonal, it is clear that their product does not depend on their order because multiplication of diagonal matrices is just multiplication of numbers. The uncertainty principle, by contrast, is an expression of the fact that often two matricesA andB do not always commute, i.e., thatABBA does not necessarily equal 0. The fundamental commutation relation of matrix mechanics,k(XnkPkmPnkXkm)=iδnm{\displaystyle \sum _{k}\left(X_{nk}P_{km}-P_{nk}X_{km}\right)=i\hbar \,\delta _{nm}}implies then thatthere are no states that simultaneously have a definite position and momentum.

This principle of uncertainty holds for many other pairs of observables as well. For example, the energy does not commute with the position either, so it is impossible to precisely determine the position and energy of an electron in an atom.

Nobel Prize

[edit]

In 1928,Albert Einstein nominated Heisenberg, Born, and Jordan for theNobel Prize in Physics.[23] The announcement of the Nobel Prize in Physics for 1932 was delayed until November 1933.[24] It was at that time that it was announced Heisenberg had won the Prize for 1932 "for the creation of quantum mechanics, the application of which has,inter alia, led to the discovery of the allotropic forms of hydrogen"[25] andErwin Schrödinger andPaul Adrien Maurice Dirac shared the 1933 Prize "for the discovery of new productive forms of atomic theory".[25]

On November 25, 1933, Born received a letter from Heisenberg in which he said he had been delayed in writing due to a "bad conscience" that he alone had received the Prize "for work done in Göttingen in collaboration – you, Jordan and I". Heisenberg went on to say that Born and Jordan's contribution to quantum mechanics cannot be changed by "a wrong decision from the outside".[26] In 1954, Heisenberg wrote an article honoringMax Planck for his insight in 1900. In the article, Heisenberg credited Born and Jordan for the final mathematical formulation of matrix mechanics and Heisenberg went on to stress how great their contributions were to quantum mechanics, which were not "adequately acknowledged in the public eye".[27]

Mathematical development

[edit]

Once Heisenberg introduced the matrices forX andP, he could find their matrix elements in special cases by guesswork, guided by the correspondence principle. Since the matrix elements are the quantum mechanical analogs of Fourier coefficients of the classical orbits, the simplest case is theharmonic oscillator, where the classical position and momentum,X(t) andP(t), are sinusoidal.

Harmonic oscillator

[edit]

In units where the mass and frequency of the oscillator are equal to one (seenondimensionalization), the energy of the oscillator isH=12( P2+X2) .{\displaystyle H={\tfrac {1}{2}}\left(\ P^{2}+X^{2}\right)~.}

Thelevel sets of H  are the clockwise orbits, and they are nested circles in phase space. The classical orbit with energy E  is

 X(t)=+2E  cos(t){\textstyle \ X(t)=+{\sqrt {2E\ }}\ \cos(t)\qquad } andP(t)=2E  sin(t) .{\textstyle \qquad P(t)=-{\sqrt {2E\ }}\ \sin(t)~.}

The old quantum condition dictates that the integral ofP dX over an orbit, which is the area of the circle in phase space, must be an integer multiple of thePlanck constant. The area of the circle of radius2E is2π E . SoE= n h 2π=n  ,{\displaystyle E={\frac {\ n\ h\ }{2\pi }}=n\ \hbar \ ,}or, innatural units whereħ ≡ 1 , the energy becomes somewhole number.

TheFourier components ofX(t )  andP(t )  are simple, and even more so when they are re-expressed as a sum and difference of position X  and momentumP :

 A(t)=X(t)+i P(t)=2E  ei t{\textstyle \ A(t)=X(t)+i\ P(t)={\sqrt {2E\ }}\ e^{-i\ t}\qquad } andA(t)=X(t)i P(t)=2E  e+i t .{\textstyle \qquad A^{\dagger }(t)=X(t)-i\ P(t)={\sqrt {2E\ }}\ e^{+i\ t}~.}

Both A  andA have only a single frequency, and X  and P  can be recovered from the similar sum and difference of A  andA .

SinceA(t )  has a classical Fourier series with only the lowest frequency, and the matrix element Amn is the(mn)th Fourier coefficient of the classical orbit, the matrix forA is nonzero only on the line just above the diagonal, where it is equal to2En . The matrix forA is likewise only nonzero on the line below the diagonal, with the same elements. Thus,from A  andA, reconstruction yields2  X(0)= [01 0001 02 0002 03 0003 04 ] ,{\displaystyle {\sqrt {2\ }}\ X(0)={\sqrt {\hbar \ }}\;{\begin{bmatrix}0&{\sqrt {1\ }}&0&0&0&\cdots \\{\sqrt {1\ }}&0&{\sqrt {2\ }}&0&0&\cdots \\0&{\sqrt {2\ }}&0&{\sqrt {3\ }}&0&\cdots \\0&0&{\sqrt {3\ }}&0&{\sqrt {4\ }}&\cdots \\\vdots &\vdots &\vdots &\vdots &\vdots &\ddots \\\end{bmatrix}}\ ,}and2  P(0)= [0i1 000i10i2 000i2 0i3 000i3 0i4 ] ,{\displaystyle {\sqrt {2\ }}\ P(0)={\sqrt {\hbar \ }}\;{\begin{bmatrix}0&-i{\sqrt {1\ }}&0&0&0&\cdots \\i{\sqrt {1}}&0&-i{\sqrt {2\ }}&0&0&\cdots \\0&i{\sqrt {2\ }}&0&-i{\sqrt {3\ }}&0&\cdots \\0&0&i{\sqrt {3\ }}&0&-i{\sqrt {4\ }}&\cdots \\\vdots &\vdots &\vdots &\vdots &\vdots &\ddots \\\end{bmatrix}}\ ,}which, up to the choice of units, are the Heisenberg matrices for the harmonic oscillator. Both matrices areHermitian, since they are constructed from the Fourier coefficients of real quantities.

FindingX(t )  andP(t )  is direct, since they are quantum Fourier coefficients so they evolve simply with time, as

 Xmn(t) = Xmn(0) ei (EmEn)t{\textstyle \ X_{mn}(t)\ =\ X_{mn}(0)\ e^{i\ (E_{m}-E_{n})t}\qquad } andPmn(t) = Pmn(0) ei (EmEn)t .{\textstyle \qquad P_{mn}(t)\ =\ P_{mn}(0)\ e^{i\ (E_{m}-E_{n})t}~.}

The matrix product of X  and P  is not hermitian, but has a real and imaginary part. The real part is one half the symmetric expression X P + P X , while the imaginary part is proportional to thecommutator, which is written as [ X , P ]  X PP X .{\displaystyle \ {\bigl [}\ X\ ,\ P\ {\bigr ]}\ \equiv \ X\ P-P\ X~.}In the special case of the harmonic oscillator, it is simple to verify explicitly that X P − P X isi ħ I ,  where I  is theidentity matrix.

It is likewise simple to verify that the matrixH=12( X2+P2){\displaystyle H={\tfrac {1}{2}}\left(\ X^{2}+P^{2}\right)}is adiagonal matrix, witheigenvalues Ei .

Further information:Quantum harmonic oscillator § Ladder operator method

Conservation of energy

[edit]
Main article:Conservation of energy

The harmonic oscillator is an important case. Finding the matrices is easier than determining the general conditions from these special forms. For this reason, Heisenberg investigated theanharmonic oscillator, withHamiltonianH=12P2+12X2+εX3 .{\displaystyle H={\tfrac {1}{2}}P^{2}+{\tfrac {1}{2}}X^{2}+\varepsilon X^{3}~.}

In this case, theX andP matrices are no longer simple off-diagonal matrices, since the corresponding classical orbits are slightly squashed and displaced, so that they have Fourier coefficients at every classical frequency. To determine the matrix elements, Heisenberg required that the classical equations of motion be obeyed as matrix equations,dXdt=P ,dPdt=X3εX2 .{\displaystyle {\frac {dX}{dt}}=P~,\qquad {\frac {dP}{dt}}=-X-3\varepsilon X^{2}~.}

He noticed that if this could be done, thenH, considered as a matrix function ofX andP, will have zero time derivative.dHdt=PdPdt+(X+3εX2)dXdt=0 ,{\displaystyle {\frac {dH}{dt}}=P*{\frac {dP}{dt}}+\left(X+3\varepsilon X^{2}\right)*{\frac {dX}{dt}}=0~,}whereAB is theanticommutator,AB=12(AB+BA) .{\displaystyle A*B={\tfrac {1}{2}}(AB+BA)~.}

Given that all the off diagonal elements have a nonzero frequency;H being constant implies thatH is diagonal.It was clear to Heisenberg that in this system, the energy could be exactly conserved in an arbitrary quantum system, a very encouraging sign.

The process of emission and absorption of photons seemed to demand that the conservation of energy will hold at best on average. If a wave containing exactly one photon passes over some atoms, and one of them absorbs it, that atom needs to tell the others that they can't absorb the photon anymore. But if the atoms are far apart, any signal cannot reach the other atoms in time, and they might end up absorbing the same photon anyway and dissipating the energy to the environment. When the signal reached them, the other atoms would have to somehowrecall that energy. This paradox ledBohr, Kramers and Slater to abandon exact conservation of energy. Heisenberg's formalism, when extended to include the electromagnetic field, was obviously going to sidestep this problem, a hint that the interpretation of the theory will involvewavefunction collapse.

Differentiation trick — canonical commutation relations

[edit]

Demanding that the classical equations of motion are preserved is not a strong enough condition to determine the matrix elements. The Planck constant does not appear in the classical equations, so that the matrices could be constructed for many different values ofħ and still satisfy the equations of motion, but with different energy levels.

So, in order to implement his program, Heisenberg needed to use the old quantum condition to fix the energy levels, then fill in the matrices with Fourier coefficients of the classical equations, then alter the matrix coefficients and the energy levels slightly to make sure the classical equations are satisfied. This is clearly not satisfactory. The old quantum conditions refer to the area enclosed by the sharp classical orbits, which do not exist in the new formalism.

The most important thing that Heisenberg discovered is how to translate the old quantum condition into a simple statement in matrix mechanics.

To do this, he investigated the action integral as a matrix quantity,0TkPmk(t)dXkndtdt?Jmn .{\displaystyle \int _{0}^{T}\sum _{k}P_{mk}(t){\frac {dX_{kn}}{dt}}dt\,\,{\stackrel {\scriptstyle ?}{\approx }}\,\,J_{mn}~.}

There are several problems with this integral, all stemming from the incompatibility of the matrix formalism with the old picture of orbits. Which periodT should be used?Semiclassically, it should be eitherm orn, but the difference is orderħ, and an answer to orderħ is sought. Thequantum condition tells us thatJmn is2πn on the diagonal, so the fact thatJ is classically constant tells us that the off-diagonal elements are zero.

His crucial insight was to differentiate the quantum condition with respect ton. This idea only makes complete sense in the classical limit, wheren is not an integer but the continuousaction variableJ, but Heisenberg performed analogous manipulations with matrices, where the intermediate expressions are sometimes discrete differences and sometimes derivatives.

In the following discussion, for the sake of clarity, the differentiation will be performed on the classical variables, and the transition to matrix mechanics will be done afterwards, guided by the correspondence principle.

In the classical setting, the derivative is the derivative with respect toJ of the integral which definesJ, so it is tautologically equal to 1.ddJ0TPdX=1=0Tdt(dPdJdXdt+PddJdXdt)=0Tdt(dPdJdXdtdPdtdXdJ){\displaystyle {\begin{aligned}{}{\frac {d}{dJ}}\int _{0}^{T}PdX&=1\\&=\int _{0}^{T}dt\left({\frac {dP}{dJ}}{\frac {dX}{dt}}+P{\frac {d}{dJ}}{\frac {dX}{dt}}\right)\\&=\int _{0}^{T}dt\left({\frac {dP}{dJ}}{\frac {dX}{dt}}-{\frac {dP}{dt}}{\frac {dX}{dJ}}\right)\end{aligned}}}where the derivativesdP/dJ anddX/dJ should be interpreted as differences with respect toJ at corresponding times on nearby orbits, exactly what would be obtained if the Fourier coefficients of the orbital motion were differentiated. (These derivatives are symplectically orthogonal in phase space to the time derivativesdP/dt anddX/dt).

The final expression is clarified by introducing the variable canonically conjugate toJ, which is called theangle variableθ: The derivative with respect to time is a derivative with respect toθ, up to a factor of2πT,2πT0Tdt(dPdJdXdθdPdθdXdJ)=1.{\displaystyle {\frac {2\pi }{T}}\int _{0}^{T}dt\left({\frac {dP}{dJ}}{\frac {dX}{d\theta }}-{\frac {dP}{d\theta }}{\frac {dX}{dJ}}\right)=1\,.}So the quantum condition integral is the average value over one cycle of thePoisson bracket ofX andP.

An analogous differentiation of the Fourier series ofP dX demonstrates that the off-diagonal elements of the Poisson bracket are all zero. The Poisson bracket of two canonically conjugate variables, such asX andP, is the constant value 1, so this integral really is the average value of 1; so it is 1, as we knew all along, because it isdJ/dJ after all. But Heisenberg, Born and Jordan, unlike Dirac, were not familiar with the theory of Poisson brackets, so, for them, the differentiation effectively evaluated{X, P} inJ,θ coordinates.

The Poisson Bracket, unlike the action integral, does have a simple translation to matrix mechanics – it normally corresponds to the imaginary part of the product of two variables, thecommutator.

To see this, examine the (antisymmetrized) product of two matricesA andB in the correspondence limit, where the matrix elements are slowly varying functions of the index, keeping in mind that the answer is zero classically.

In the correspondence limit, when indicesm,n are large and nearby, whilek,r are small, the rate of change of the matrix elements in the diagonal direction is the matrix element of theJ derivative of the corresponding classical quantity. So it is possible to shift any matrix element diagonally through the correspondence,A(m+r)(n+r)Amnr(dAdJ)mn{\displaystyle A_{(m+r)(n+r)}-A_{mn}\approx r\;\left({\frac {dA}{dJ}}\right)_{mn}}where the right hand side is really only the(mn)th Fourier component ofdA/dJ at the orbit nearm to this semiclassical order, not a full well-defined matrix.

The semiclassical time derivative of a matrix element is obtained up to a factor ofi by multiplying by the distance from the diagonal,ikAm(m+k)(T2πdAdt)m(m+k)=(dAdθ)m(m+k).{\displaystyle ikA_{m(m+k)}\approx \left({\frac {T}{2\pi }}{\frac {dA}{dt}}\right)_{m(m+k)}=\left({\frac {dA}{d\theta }}\right)_{m(m+k)}\,.}since the coefficientAm(m+k) is semiclassically thekth Fourier coefficient of themth classical orbit.

The imaginary part of the product ofA andB can be evaluated by shifting the matrix elements around so as to reproduce the classical answer, which is zero.

The leading nonzero residual is then given entirely by the shifting. Since all the matrix elements are at indices which have a small distance from the large index position(m,m), it helps to introduce two temporary notations:A[r,k] =A(m+r)(m+k) for the matrices, anddA/dJ[r] for therth Fourier components of classical quantities,(ABBA)[0,k]=r=(A[0,r]B[r,k]A[r,k]B[0,r])=r(A[r+k,k]+(rk)dAdJ[r])(B[0,kr]+rdBdJ[rk])rA[r,k]B[0,r].{\displaystyle {\begin{aligned}(AB-BA)[0,k]&=\sum _{r=-\infty }^{\infty }{\bigl (}A[0,r]B[r,k]-A[r,k]B[0,r]{\bigr )}\\&=\sum _{r}\left(A[-r+k,k]+(r-k){\frac {dA}{dJ}}[r]\right)\left(B[0,k-r]+r{\frac {dB}{dJ}}[r-k]\right)-\sum _{r}A[r,k]B[0,r]\,.\end{aligned}}}

Flipping the summation variable in the first sum fromr tor′ =kr, the matrix element becomes,r(A[r,k]rdAdJ[kr])(B[0,r]+(kr)dBdJ[r])rA[r,k]B[0,r]{\displaystyle \sum _{r'}\left(A[r',k]-r'{\frac {dA}{dJ}}[k-r']\right)\left(B[0,r']+(k-r'){\frac {dB}{dJ}}[r']\right)-\sum _{r}A[r,k]B[0,r]}and it is clear that the principal (classical) part cancels.

The leading quantum part, neglecting the higher order product of derivatives in the residual expression, is then equal tor(dBdJ[r](kr)A[r,k]dAdJ[kr]rB[0,r]){\displaystyle \sum _{r'}\left({\frac {dB}{dJ}}[r'](k-r')A[r',k]-{\frac {dA}{dJ}}[k-r']r'B[0,r']\right)}so that, finally,(ABBA)[0,k]=r(dBdJ[r]idAdθ[kr]dAdJ[kr]idBdθ[r]){\displaystyle (AB-BA)[0,k]=\sum _{r'}\left({\frac {dB}{dJ}}[r']i{\frac {dA}{d\theta }}[k-r']-{\frac {dA}{dJ}}[k-r']i{\frac {dB}{d\theta }}[r']\right)}which can be identified withi times thekth classical Fourier component of the Poisson bracket.

Heisenberg's original differentiation trick was eventually extended to a full semiclassical derivation of the quantum condition, in collaboration with Born and Jordan.Once they were able to establish thati{X,P}PB[X,P]XPPX=i,{\displaystyle i\hbar \{X,P\}_{\mathrm {PB} }\qquad \longmapsto \qquad [X,P]\equiv XP-PX=i\hbar \,,}this condition replaced and extended the old quantization rule, allowing the matrix elements ofP andX for an arbitrary system to be determined simply from the form of the Hamiltonian.

The new quantization rule wasassumed to be universally true, even though the derivation from the old quantum theory required semiclassical reasoning.(A full quantum treatment, however, for more elaborate arguments of the brackets, was appreciated in the 1940s to amount to extending Poisson brackets toMoyal brackets.)

State vectors and the Heisenberg equation

[edit]

To make the transition to standard quantum mechanics, the most important further addition was thequantum state vector, now written|ψ,which is the vector that the matrices act on. Without the state vector, it is not clear which particular motion the Heisenberg matrices are describing, since they include all the motions somewhere.

The interpretation of the state vector, whose components are writtenψm, was furnished by Born. This interpretation is statistical: the result of a measurement of the physical quantity corresponding to the matrixA is random, with an average value equal tomnψmAmnψn.{\displaystyle \sum _{mn}\psi _{m}^{*}A_{mn}\psi _{n}\,.}Alternatively, and equivalently, the state vector gives theprobability amplitudeψn for the quantum system to be in the energy staten.

Once the state vector was introduced, matrix mechanics could be rotated toany basis, where theH matrix need no longer be diagonal. The Heisenberg equation of motion in its original form states thatAmn evolves in time like a Fourier component,Amn(t)=ei(EmEn)tAmn(0) ,{\displaystyle A_{mn}(t)=e^{i(E_{m}-E_{n})t}A_{mn}(0)~,}which can be recast in differential formdAmndt=i(EmEn)Amn ,{\displaystyle {\frac {dA_{mn}}{dt}}=i(E_{m}-E_{n})A_{mn}~,}and it can be restated so that it is true in an arbitrary basis, by noting that theH matrix is diagonal with diagonal valuesEm,dAdt=i(HAAH) .{\displaystyle {\frac {dA}{dt}}=i(HA-AH)~.}This is now a matrix equation, so it holds in any basis. This is the modern form of the Heisenberg equation of motion.

Its formal solution is:A(t)=eiHtA(0)eiHt .{\displaystyle A(t)=e^{iHt}A(0)e^{-iHt}~.}

All these forms of the equation of motion above say the same thing, thatA(t) is equivalent toA(0), through a basis rotation by theunitary matrixeiHt, a systematic picture elucidated by Dirac in his bra–ket notation.

Conversely, by rotating the basis for the state vector at each time byeiHt, the time dependence in the matrices can be undone. The matrices are now time independent, but the state vector rotates,|ψ(t)=eiHt|ψ(0),d|ψdt=iH|ψ.{\displaystyle |\psi (t)\rangle =e^{-iHt}|\psi (0)\rangle ,\qquad {\frac {d|\psi \rangle }{dt}}=-iH|\psi \rangle \,.}This is theSchrödinger equation for the state vector, and this time-dependent change of basis amounts to transformation to theSchrödinger picture, withx|ψ⟩ =ψ(x).

In quantum mechanics in theHeisenberg picture thestate vector,|ψ does not change with time, while an observableA satisfies theHeisenberg equation of motion,

dAdt=i[H,A]+At .{\displaystyle {\frac {dA}{dt}}={\frac {i}{\hbar }}[H,A]+{\frac {\partial A}{\partial t}}~.}

The extra term is for operators such asA=(X+t2P){\displaystyle A=\left(X+t^{2}P\right)}which have anexplicit time dependence, in addition to the time dependence from the unitary evolution discussed.

The Heisenberg picture does not distinguish time from space, so it is better suited torelativistic theories than the Schrödinger equation. Moreover, the similarity toclassical physics is more manifest: the Hamiltonian equations of motion for classical mechanics are recovered by replacing the commutator above by thePoisson bracket (see also below). By theStone–von Neumann theorem, the Heisenberg picture and the Schrödinger picture must be unitarily equivalent, as detailed below.

Further results

[edit]

Matrix mechanics rapidly developed into modern quantum mechanics, and gave interesting physical results on the spectra of atoms.

Wave mechanics

[edit]

Jordan noted that the commutation relations ensure thatP acts as a differential operator.

The operator identity[a,bc]=abcbca=abcbac+bacbca=[a,b]c+b[a,c]{\displaystyle [a,bc]=abc-bca=abc-bac+bac-bca=[a,b]c+b[a,c]}allows the evaluation of the commutator ofP with any power ofX, and it implies that[P,Xn]=in Xn1{\displaystyle \left[P,X^{n}\right]=-in~X^{n-1}}which, together with linearity, implies that aP-commutator effectively differentiates any analytic matrix function ofX.

Assuming limits are defined sensibly, this extends to arbitrary functions−but the extension need not be made explicit until a certain degree of mathematical rigor is required,

[P,f(X)]=if(X).{\displaystyle [P,f(X)]=-if'(X)\,.}

SinceX is a Hermitian matrix, it should be diagonalizable, and it will be clear from the eventual form ofP that every real number can be an eigenvalue. This makes some of the mathematics subtle, since there is a separate eigenvector for every point in space.

In the basis whereX is diagonal, an arbitrary state can be written as a superposition of states with eigenvaluesx,|ψ=xψ(x)|x,{\displaystyle |\psi \rangle =\int _{x}\psi (x)|x\rangle \,,}so thatψ(x) = ⟨x|ψ, and the operatorX multiplies each eigenvector byx,X|ψ=xxψ(x)|x .{\displaystyle X|\psi \rangle =\int _{x}x\psi (x)|x\rangle ~.}

Define a linear operatorD which differentiatesψ,Dxψ(x)|x=xψ(x)|x,{\displaystyle D\int _{x}\psi (x)|x\rangle =\int _{x}\psi '(x)|x\rangle \,,}and note that(DXXD)|ψ=x[(xψ(x))xψ(x)]|x=xψ(x)|x=|ψ,{\displaystyle (DX-XD)|\psi \rangle =\int _{x}\left[\left(x\psi (x)\right)'-x\psi '(x)\right]|x\rangle =\int _{x}\psi (x)|x\rangle =|\psi \rangle \,,}so that the operatoriD obeys the same commutation relation asP. Thus, the difference betweenP andiD must commute withX,[P+iD,X]=0,{\displaystyle [P+iD,X]=0\,,}so it may be simultaneously diagonalized withX: its value acting on any eigenstate ofX is some functionf of the eigenvaluex.

This function must be real, because bothP andiD are Hermitian,(P+iD)|x=f(x)|x,{\displaystyle (P+iD)|x\rangle =f(x)|x\rangle \,,}rotating each state|x⟩ by a phasef(x), that is, redefining the phase of the wavefunction:ψ(x)eif(x)ψ(x).{\displaystyle \psi (x)\rightarrow e^{-if(x)}\psi (x)\,.}The operatoriD is redefined by an amount:iDiD+f(X),{\displaystyle iD\rightarrow iD+f(X)\,,}which means that, in the rotated basis,P is equal toiD.

Hence, there is always a basis for the eigenvalues ofX where the action ofP on any wavefunction is known:Pxψ(x)|x=xiψ(x)|x,{\displaystyle P\int _{x}\psi (x)|x\rangle =\int _{x}-i\psi '(x)|x\rangle \,,}and the Hamiltonian in this basis is a linear differential operator on the state-vector components,[P22m+V(X)]xψx|x=x[12m2x2+V(x)]ψx|x{\displaystyle \left[{\frac {P^{2}}{2m}}+V(X)\right]\int _{x}\psi _{x}|x\rangle =\int _{x}\left[-{\frac {1}{2m}}{\frac {\partial ^{2}}{\partial x^{2}}}+V(x)\right]\psi _{x}|x\rangle }

Thus, the equation of motion for the state vector is but a celebrated differential equation,

itψt(x)=[12m2x2+V(x)]ψt(x).{\displaystyle i{\frac {\partial }{\partial t}}\psi _{t}(x)=\left[-{\frac {1}{2m}}{\frac {\partial ^{2}}{\partial x^{2}}}+V(x)\right]\psi _{t}(x)\,.}

See also:Schrödinger picture

SinceD is a differential operator, in order for it to be sensibly defined, there must be eigenvalues ofX which neighbors every given value. This suggests that the only possibility is that the space of all eigenvalues ofX is all real numbers, and thatP isiD, up to a phase rotation.

To make this rigorous requires a sensible discussion of the limiting space of functions, and in this space this is theStone–von Neumann theorem: any operatorsX andP which obey the commutation relations can be made to act on a space of wavefunctions, withP a derivative operator. This implies that a Schrödinger picture is always available.

Matrix mechanics easily extends to many degrees of freedom in a natural way. Each degree of freedom has a separateX operator and a separate effective differential operatorP, and the wavefunction is a function of all the possible eigenvalues of the independent commutingX variables.[Xi,Xj]=0[Pi,Pj]=0[Xi,Pj]=iδij.{\displaystyle {\begin{aligned}\left[X_{i},X_{j}\right]&=0\\[1ex]\left[P_{i},P_{j}\right]&=0\\[1ex]\left[X_{i},P_{j}\right]&=i\delta _{ij}\,.\end{aligned}}}

In particular, this means that a system ofN interacting particles in 3 dimensions is described by one vector whose components in a basis where all theX are diagonal is a mathematical function of3N-dimensional spacedescribing all their possible positions, effectively amuch bigger collection of values than the mere collection ofN three-dimensional wavefunctions in one physical space. Schrödinger came to the same conclusion independently, and eventually proved the equivalence of his own formalism to Heisenberg's.

Since the wavefunction is a property of the whole system, not of any one part, the description in quantum mechanics is not entirely local. The description of several quantum particles has them correlated, orentangled. This entanglement leads to strange correlations between distant particles which violate the classicalBell's inequality.

Even if the particles can only be in just two positions, the wavefunction forN particles requires2N complex numbers, one for each total configuration of positions. This is exponentially many numbers inN, so simulating quantum mechanics on a computer requires exponential resources. Conversely, this suggests that it might be possible to find quantum systems of sizeN which physically compute the answers to problems which classically require2N bits to solve. This is the aspiration behindquantum computing.

Ehrenfest theorem

[edit]
Main article:Ehrenfest theorem

For the time-independent operatorsX andP,A/t = 0 so the Heisenberg equation above reduces to:[28]idAdt=[A,H]=AHHA,{\displaystyle i\hbar {\frac {dA}{dt}}=[A,H]=AH-HA,}where the square brackets[ , ] denote the commutator. For a Hamiltonian which isp2/2m +V(x), theX andP operators satisfy:dXdt=Pm,dPdt=V,{\displaystyle {\frac {dX}{dt}}={\frac {P}{m}},\quad {\frac {dP}{dt}}=-\nabla V,}where the first is classically thevelocity, and second is classically theforce, orpotential gradient. These reproduce Hamilton's form ofNewton's laws of motion. In the Heisenberg picture, theX andP operators satisfy the classical equations of motion. You can take the expectation value of both sides of the equation to see that, in any state|ψ:ddtX=ddtψ|X|ψ=1mψ|P|ψ=1mPddtP=ddtψ|P|ψ=ψ|(V)|ψ=V.{\displaystyle {\begin{aligned}{\frac {d}{dt}}\langle X\rangle &={\frac {d}{dt}}\langle \psi |X|\psi \rangle ={\frac {1}{m}}\langle \psi |P|\psi \rangle ={\frac {1}{m}}\langle P\rangle \\[1.5ex]{\frac {d}{dt}}\langle P\rangle &={\frac {d}{dt}}\langle \psi |P|\psi \rangle =\langle \psi |(-\nabla V)|\psi \rangle =-\langle \nabla V\rangle \,.\end{aligned}}}

So Newton's laws are exactly obeyed by the expected values of the operators in any given state. This isEhrenfest's theorem, which is an obvious corollary of the Heisenberg equations of motion, but is less trivial in the Schrödinger picture, where Ehrenfest discovered it.

Transformation theory

[edit]
Main article:Transformation theory (quantum mechanics)

In classical mechanics, a canonical transformation of phase space coordinates is one which preserves the structure of the Poisson brackets. The new variablesx,p have the same Poisson brackets with each other as the original variablesx,p. Time evolution is a canonical transformation, since the phase space at any time is just as good a choice of variables as the phase space at any other time.

The Hamiltonian flow is thecanonical transformation:xx+dx=x+Hpdtpp+dp=pHxdt .{\displaystyle {\begin{aligned}x&\rightarrow x+dx=x+{\frac {\partial H}{\partial p}}dt\\[1ex]p&\rightarrow p+dp=p-{\frac {\partial H}{\partial x}}dt~.\end{aligned}}}

Since the Hamiltonian can be an arbitrary function ofx andp, there are such infinitesimal canonical transformations corresponding toevery classical quantityG, whereG serves as the Hamiltonian to generate a flow of points in phase space for an increment of times,dx=Gpds={G,X}dsdp=Gxds={G,P}ds.{\displaystyle {\begin{aligned}dx&={\frac {\partial G}{\partial p}}ds=\left\{G,X\right\}ds\\[1ex]dp&=-{\frac {\partial G}{\partial x}}ds=\left\{G,P\right\}ds\,.\end{aligned}}}

For a general functionA(x,p) on phase space, its infinitesimal change at every stepds under this map isdA=Axdx+Apdp={A,G}ds.{\displaystyle dA={\frac {\partial A}{\partial x}}dx+{\frac {\partial A}{\partial p}}dp=\{A,G\}ds\,.}The quantityG is called theinfinitesimal generator of the canonical transformation.

In quantum mechanics, the quantum analogG is now a Hermitian matrix, and the equations of motion are given by commutators,dA=i[G,A]ds.{\displaystyle dA=i[G,A]ds\,.}

The infinitesimal canonical motions can be formally integrated, just as the Heisenberg equation of motion were integrated,A=UAU{\displaystyle A'=U^{\dagger }AU}whereU =eiGs ands is an arbitrary parameter.

The definition of a quantum canonical transformation is thus an arbitrary unitary change of basis on the space of all state vectors.U is an arbitrary unitary matrix, a complex rotation in phase space,U=U1.{\displaystyle U^{\dagger }=U^{-1}\,.}These transformations leave the sum of the absolute square of the wavefunction componentsinvariant, while they take states which are multiples of each other (including states which are imaginary multiples of each other) to states which are thesame multiple of each other.

The interpretation of the matrices is that they act asgenerators of motions on the space of states.

For example, the motion generated byP can be found by solving the Heisenberg equation of motion usingP as a Hamiltonian,dX=i[X,P]ds=dsdP=i[P,P]ds=0.{\displaystyle {\begin{aligned}dX&=i[X,P]ds=ds\\[1ex]dP&=i[P,P]ds=0\,.\end{aligned}}}These are translations of the matrixX by a multiple of the identity matrix,XX+sI .{\displaystyle X\rightarrow X+sI~.}This is the interpretation of the derivative operatorD:eiPs =eD,the exponential of a derivative operator is a translation (so Lagrange'sshift operator).

TheX operator likewise generates translations inP. The Hamiltonian generatestranslations in time, the angular momentum generatesrotations in physical space, and the operatorX2 +P2 generatesrotations in phase space.

When a transformation, like a rotation in physical space, commutes with the Hamiltonian, the transformation is called asymmetry (behind a degeneracy) of the Hamiltonian – the Hamiltonian expressed in terms of rotated coordinates is the same as the original Hamiltonian. This means that the change in the Hamiltonian under the infinitesimal symmetry generatorL vanishes,dHds=i[L,H]=0.{\displaystyle {\frac {dH}{ds}}=i[L,H]=0\,.}

It then follows that the change in the generator undertime translation also vanishes,dLdt=i[H,L]=0{\displaystyle {\frac {dL}{dt}}=i[H,L]=0}so that the matrixL is constant in time: it is conserved.

The one-to-one association of infinitesimal symmetry generators and conservation laws was discovered byEmmy Noether for classical mechanics, where the commutators arePoisson brackets, but the quantum-mechanical reasoning is identical. In quantum mechanics, any unitary symmetry transformation yields a conservation law, since if the matrix U has the property thatU1HU=H{\displaystyle U^{-1}HU=H}so it follows thatUH=HU{\displaystyle UH=HU}and that the time derivative ofU is zero – it is conserved.

The eigenvalues of unitary matrices are pure phases, so that the value of a unitary conserved quantity is a complex number of unit magnitude, not a real number. Another way of saying this is that a unitary matrix is the exponential ofi times a Hermitian matrix, so that the additive conserved real quantity, the phase, is only well-defined up to an integer multiple of2π. Only when the unitary symmetry matrix is part of a family that comes arbitrarily close to the identity are the conserved real quantities single-valued, and then the demand that they are conserved become a much more exacting constraint.

Symmetries which can be continuously connected to the identity are calledcontinuous, and translations, rotations, and boosts are examples. Symmetries which cannot be continuously connected to the identity arediscrete, and the operation of space-inversion, orparity, andcharge conjugation are examples.

The interpretation of the matrices as generators of canonical transformations is due to Paul Dirac.[29] The correspondence between symmetries and matrices was shown byEugene Wigner to be complete, ifantiunitary matrices which describe symmetries which include time-reversal are included.

Selection rules

[edit]

It was physically clear to Heisenberg that the absolute squares of the matrix elements ofX, which are the Fourier coefficients of the oscillation, would yield the rate of emission of electromagnetic radiation.

In the classical limit of large orbits, if a charge with positionX(t) and chargeq is oscillating next to an equal and opposite charge at position 0, the instantaneous dipole moment isq X(t), and the time variation of this moment translates directly into the space-time variation of the vector potential, which yields nested outgoing spherical waves.

For atoms, the wavelength of the emitted light is about 10,000 times the atomic radius, and the dipole moment is the only contribution to the radiative field, while all other details of the atomic charge distribution can be ignored.

Ignoring back-reaction, the power radiated in each outgoing mode is a sum of separate contributions from the square of each independent time Fourier mode ofd,P(ω)=23ω4|di|2 .{\displaystyle P(\omega )={\tfrac {2}{3}}{\omega ^{4}}|d_{i}|^{2}~.}

Now, in Heisenberg's representation, the Fourier coefficients of the dipole moment are the matrix elements ofX. This correspondence allowed Heisenberg to provide the rule for the transition intensities, the fraction of the time that, starting from an initial statei, a photon is emitted and the atom jumps to a final statej,Pij=23(EiEj)4|Xij|2.{\displaystyle P_{ij}={\tfrac {2}{3}}\left(E_{i}-E_{j}\right)^{4}\left|X_{ij}\right|^{2}\,.}

This then allowed the magnitude of the matrix elements to be interpreted statistically:they give the intensity of the spectral lines, the probability for quantum jumps from the emission of dipole radiation.

Since the transition rates are given by the matrix elements ofX, whereverXij is zero, the corresponding transition should be absent. These were called theselection rules, which were a puzzle until the advent of matrix mechanics.

An arbitrary state of the hydrogen atom, ignoring spin, is labelled by|n;l,m, where the value ofl is a measure of the total orbital angular momentum andm is itsz-component, which defines the orbit orientation. The components of the angular momentumpseudovector areLi=εijkXjPk{\displaystyle L_{i}=\varepsilon _{ijk}X^{j}P^{k}}where the products in this expression are independent of order and real, because different components ofX andP commute.

The commutation relations ofL with all three coordinate matricesX,Y,Z (or with any vector) are easy to find,[Li,Xj]=iεijkXk,{\displaystyle \left[L_{i},X_{j}\right]=i\varepsilon _{ijk}X_{k}\,,}which confirms that the operatorL generates rotations between the three components of the vector of coordinate matricesX.

From this, the commutator ofLz and the coordinate matricesX,Y,Z can be read off,[Lz,X]=iY,[Lz,Y]=iX.{\displaystyle {\begin{aligned}\left[L_{z},X\right]&=iY\,,\\[1ex]\left[L_{z},Y\right]&=-iX\,.\end{aligned}}}

This means that the quantitiesX +iY andXiY have a simple commutation rule,[Lz,X+iY]=(X+iY),[Lz,XiY]=(XiY).{\displaystyle {\begin{aligned}\left[L_{z},X+iY\right]&=(X+iY)\,,\\[1ex]\left[L_{z},X-iY\right]&=-(X-iY)\,.\end{aligned}}}

Just like the matrix elements ofX +iP andXiP for the harmonic oscillator Hamiltonian, this commutation law implies that these operators only have certain off diagonal matrix elements in states of definitem,Lz((X+iY)|m)=(X+iY)Lz|m+(X+iY)|m=(m+1)(X+iY)|m{\displaystyle L_{z}{\bigl (}(X+iY)|m\rangle {\bigr )}=(X+iY)L_{z}|m\rangle +(X+iY)|m\rangle =(m+1)(X+iY)|m\rangle }meaning that the matrix(X +iY) takes an eigenvector ofLz with eigenvaluem to an eigenvector with eigenvaluem + 1. Similarly,(XiY) decreasem by one unit, whileZ does not change the value ofm.

So, in a basis of|l,m states whereL2 andLz have definite values, the matrix elements of any of the three components of the position are zero, except whenm is the same or changes by one unit.

This places a constraint on the change in total angular momentum. Any state can be rotated so that its angular momentum is in thez-direction as much as possible, wherem =l. The matrix element of the position acting on|l,m can only produce values ofm which are bigger by one unit, so that if the coordinates are rotated so that the final state is|l′,l′⟩, the value ofl can be at most one bigger than the biggest value ofl that occurs in the initial state. Sol is at mostl + 1.

The matrix elements vanish forl′ >l + 1, and the reverse matrix element is determined by Hermiticity, so these vanish also whenl′ <l − 1: Dipole transitions are forbidden with a change in angular momentum of more than one unit.

Sum rules

[edit]

The Heisenberg equation of motion determines the matrix elements ofP in the Heisenberg basis from the matrix elements ofX.Pij=mddtXij=im(EiEj)Xij,{\displaystyle P_{ij}=m{\frac {d}{dt}}X_{ij}=im\left(E_{i}-E_{j}\right)X_{ij}\,,}which turns the diagonal part of the commutation relation into a sum rule for the magnitude of the matrix elements:jPijxjiXijpji=ij2m(EiEj)|Xij|2=i.{\displaystyle \sum _{j}P_{ij}x_{ji}-X_{ij}p_{ji}=i\sum _{j}2m\left(E_{i}-E_{j}\right)\left|X_{ij}\right|^{2}=i\,.}

This yields a relation for the sum of the spectroscopic intensities to and from any given state, although to be absolutely correct, contributions from the radiative capture probability for unbound scattering states must be included in the sum:j2m(EiEj)|Xij|2=1.{\displaystyle \sum _{j}2m\left(E_{i}-E_{j}\right)\left|X_{ij}\right|^{2}=1\,.}

See also

[edit]

References

[edit]
  1. ^Herbert S. Green (1965).Matrix mechanics (P. Noordhoff Ltd, Groningen, Netherlands) ASIN : B0006BMIP8.
  2. ^Pauli, Wolfgang (1926). "Über das Wasserstoffspektrum vom Standpunkt der neuen Quantenmechanik".Zeitschrift für Physik.36 (5):336–363.Bibcode:1926ZPhy...36..336P.doi:10.1007/BF01450175.S2CID 128132824.
  3. ^Rechenberg, Helmut (2010).Werner Heisenberg – Die Sprache der Atome. Leben und Wirken. Springer. p. 322.ISBN 978-3-540-69221-8.
  4. ^Pais, Abraham (1993).Niels Bohr's times: in physics, philosophy, and polity (Repr. ed.). Oxford: Clarendon.ISBN 978-0-19-852049-8.
  5. ^"IQSA International Quantum Structures Association".www.vub.be. Archived fromthe original on 2021-04-20. Retrieved2020-11-13.
  6. ^W. Heisenberg,Über quantentheoretische Umdeutung kinematischer und mechanischer Beziehungen,Zeitschrift für Physik,33, 879-893, 1925 (received July 29, 1925). [English translation in: B. L. van der Waerden, editor,Sources of Quantum Mechanics (Dover Publications, 1968)ISBN 0-486-61881-1 (English title: "Quantum-Theoretical Re-interpretation of Kinematic and Mechanical Relations").]
  7. ^H. A. Kramers und W. Heisenberg,Über die Streuung von Strahlung durch Atome, Zeitschrift für Physik31, 681-708 (1925).
  8. ^Emilio Segrè,From X-Rays to Quarks: Modern Physicists and their Discoveries (W. H. Freeman and Company, 1980)ISBN 0-7167-1147-8, pp 153–157.
  9. ^Abraham Pais,Niels Bohr's Times in Physics, Philosophy, and Polity (Clarendon Press, 1991)ISBN 0-19-852049-2, pp 275–279.
  10. ^Max Born – Nobel Lecture (1954)
  11. ^M. Born and P. Jordan,Zur Quantenmechanik,Zeitschrift für Physik,34, 858-888, 1925 (received September 27, 1925). [English translation in: B. L. van der Waerden, editor,Sources of Quantum Mechanics (Dover Publications, 1968)ISBN 0-486-61881-1]
  12. ^M. Born, W. Heisenberg, and P. Jordan,Zur Quantenmechanik II,Zeitschrift für Physik,35, 557-615, 1925 (received November 16, 1925). [English translation in: B. L. van der Waerden, editor,Sources of Quantum Mechanics (Dover Publications, 1968)ISBN 0-486-61881-1]
  13. ^Jeremy BernsteinMax Born and the Quantum Theory,Am. J. Phys.73 (11) 999-1008 (2005)
  14. ^Mehra, Volume 3 (Springer, 2001)
  15. ^Jammer, 1966, pp. 206-207.
  16. ^van der Waerden, 1968, p. 51.
  17. ^The citation by Born was in Born and Jordan's paper, the second paper in the trilogy which launched the matrix mechanics formulation. See van der Waerden, 1968, p. 351.
  18. ^Constance RiedCourant (Springer, 1996) p. 93.
  19. ^John von NeumannAllgemeine Eigenwerttheorie Hermitescher Funktionaloperatoren,Mathematische Annalen102 49–131 (1929)
  20. ^When von Neumann left Göttingen in 1932, his book on the mathematical foundations of quantum mechanics, based on Hilbert's mathematics, was published under the titleMathematische Grundlagen der Quantenmechanik. See: Norman Macrae,John von Neumann: The Scientific Genius Who Pioneered the Modern Computer, Game Theory, Nuclear Deterrence, and Much More (Reprinted by the American Mathematical Society, 1999) and Constance Reid,Hilbert (Springer-Verlag, 1996)ISBN 0-387-94674-8.
  21. ^P.A.M. Dirac, "The fundamental equations of quantum mechanics",Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character,109 (752), 642-653 (1925),online
  22. ^Fan, Castaly; Zamick, Larry (July 2021). "Matrix model: Emergence of a quantum number in the strong coupling regime".International Journal of Modern Physics E.30 (07): 2150059.arXiv:2107.11200.doi:10.1142/S0218301321500592.
  23. ^Bernstein, 2004, p. 1004.
  24. ^Greenspan, 2005, p. 190.
  25. ^abNobel Prize in Physics and1933 – Nobel Prize Presentation Speech.
  26. ^Greenspan, 2005, p. 191.
  27. ^Greenspan, 2005, pp. 285-286.
  28. ^Quantum Mechanics, E. Abers, Pearson Ed., Addison Wesley, Prentice Hall Inc, 2004,ISBN 978-0-13-146100-0
  29. ^Dirac, P. A. M. (1981).The Principles of Quantum Mechanics (4th revised ed.). New York: Oxford University Press.ISBN 0-19-852011-5.

Further reading

[edit]

External links

[edit]
Background
Fundamentals
Formulations
Equations
Interpretations
Experiments
Science
Technology
Extensions
Related
Authority control databases: NationalEdit this at Wikidata
Retrieved from "https://en.wikipedia.org/w/index.php?title=Matrix_mechanics&oldid=1336957825"
Category:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp