Canonical commutation rule for positionq and momentump variables of a particle, 1927.pq −qp =h/(2πi). Uncertainty principle of Heisenberg, 1927.
Theuncertainty principle, also known asHeisenberg's indeterminacy principle, is a fundamental concept inquantum mechanics. It states that there is a limit to the precision with which certain pairs of physical properties, such as position andmomentum, can be simultaneously known. In other words, the more accurately one property is measured, the less accurately the other property can be known.
The quintessentially quantum mechanical uncertainty principle comes in many forms other than position–momentum. The energy–time relationship is widely used to relate quantum state lifetime to measured energy widths but its formal derivation is fraught with confusing issues about the nature of time. The basic principle has been extended in numerous directions; it must be considered in many kinds of fundamental physical measurements.
The superposition of several plane waves to form a wave packet. This wave packet becomes increasingly localized with the addition of many waves. The Fourier transform is a mathematical operation that separates a wave packet into its individual plane waves. The waves shown here arereal for illustrative purposes only; in quantum mechanics the wave function is generallycomplex.
It is vital to illustrate how the principle applies to relatively intelligible physical situations since it is indiscernible on the macroscopic[8] scales that humans experience. Two alternative frameworks for quantum physics offer different explanations for the uncertainty principle. Thewave mechanics picture of the uncertainty principle is more visually intuitive, but the more abstractmatrix mechanics picture formulates it in a way that generalizes more easily.
Mathematically, in wave mechanics, the uncertainty relation between position and momentum arises because the expressions of the wavefunction in the two correspondingorthonormalbases inHilbert space areFourier transforms of one another (i.e., position and momentum areconjugate variables). A nonzero function and its Fourier transform cannot both be sharply localized at the same time.[9] A similar tradeoff between the variances of Fourier conjugates arises in all systems underlain by Fourier analysis, for example in sound waves: A pure tone is asharp spike at a single frequency, while its Fourier transform gives the shape of the sound wave in the time domain, which is a completely delocalized sine wave. In quantum mechanics, the two key points are that the position of the particle takes the form of a matter wave, and momentum is its Fourier conjugate, assured by thede Broglie relationp =ħk, wherek is thewavenumber.
Inmatrix mechanics, themathematical formulation of quantum mechanics, any pair of non-commutingself-adjoint operators representingobservables are subject to similar uncertainty limits. An eigenstate of an observable represents the state of the wavefunction for a certain measurement value (the eigenvalue). For example, if a measurement of an observableA is performed, then the system is in a particular eigenstateΨ of that observable. However, the particular eigenstate of the observableA need not be an eigenstate of another observableB: If so, then it does not have a unique associated measurement for it, as the system is not in an eigenstate of that observable.[10]
The uncertainty principle can be visualized using the position- and momentum-space wavefunctions for one spinless particle with mass in one dimension.
The more localized the position-space wavefunction, the more likely the particle is to be found with the position coordinates in that region, and correspondingly the momentum-space wavefunction is less localized so the possible momentum components the particle could have are more widespread. Conversely, the more localized the momentum-space wavefunction, the more likely the particle is to be found with those values of momentum components in that region, and correspondingly the less localized the position-space wavefunction, so the position coordinates the particle could occupy are more widespread. These wavefunctions areFourier transforms of each other: mathematically, the uncertainty principle expresses the relationship between conjugate variables in the transform.
Positionx and momentump wavefunctions corresponding to quantum particles. The colour opacity of the particles corresponds to theprobability density of finding the particle with positionx or momentum componentp. Top: If wavelengthλ is unknown, so are momentump, wave-vectork and energyE (de Broglie relations). As the particle is more localized in position space, Δx is smaller than for Δpx. Bottom: Ifλ is known, so arep,k, andE. As the particle is more localized in momentum space, Δp is smaller than for Δx.
Propagation ofde Broglie waves in 1d—real part of thecomplex amplitude is blue, imaginary part is green. The probability (shown as the colouropacity) of finding the particle at a given pointx is spread out like a waveform, there is no definite position of the particle. As the amplitude increases above zero thecurvature reverses sign, so the amplitude begins to decrease again, and vice versa—the result is an alternating amplitude: a wave.
According to thede Broglie hypothesis, every object in the universe is associated with awave. Thus every object, from an elementary particle to atoms, molecules and on up to planets and beyond are subject to the uncertainty principle.
The time-independent wave function of a single-moded plane wave of wavenumberk0 or momentump0 is[11]
In the case of the single-mode plane wave, is1 if and0 otherwise. In other words, the particle position is extremely uncertain in the sense that it could be essentially anywhere along the wave packet.
On the other hand, consider a wave function that is asum of many waves, which we may write aswhereAn represents the relative contribution of the modepn to the overall total. The figures to the right show how with the addition of many plane waves, the wave packet can become more localized. We may take this a step further to thecontinuum limit, where the wave function is anintegral over all possible modeswith representing the amplitude of these modes and is called the wave function inmomentum space. In mathematical terms, we say that is theFourier transform of and thatx andp areconjugate variables. Adding together all of these plane waves comes at a cost, namely the momentum has become less precise, having become a mixture of waves of many different momenta.[12]
One way to quantify the precision of the position and momentum is thestandard deviationσ. Since is a probability density function for position, we calculate its standard deviation.
The precision of the position is improved, i.e. reducedσx, by using many plane waves, thereby weakening the precision of the momentum, i.e. increasedσp. Another way of stating this is thatσx andσp have aninverse relationship or are at least bounded from below. This is the uncertainty principle, the exact limit of which is the Kennard bound.
Proof of the Kennard inequality using wave mechanics
We are interested in thevariances of position and momentum, defined as
Without loss of generality, we will assume that themeans vanish, which just amounts to a shift of the origin of our coordinates. (A more general proof that does not make this assumption is given below.) This gives us the simpler form
With this inner product defined, we note that the variance for position can be written as
We can repeat this for momentum by interpreting the function as a vector, but we can also take advantage of the fact that and are Fourier transforms of each other. We evaluate the inverse Fourier transform throughintegration by parts:where in the integration by parts, the cancelled term vanishes because the wave function vanishes at both infinities and, and then use theDirac delta function which is valid because does not depend onp .
Themodulus squared of any complex numberz can be expressed aswe let and and substitute these into the equation above to get
All that remains is to evaluate these inner products.
Plugging this into the above inequalities, we getand taking the square root
with equality if and only ifp andx are linearly dependent. Note that the onlyphysics involved in this proof was that and are wave functions for position and momentum, which are Fourier transforms of each other. A similar result would hold forany pair of conjugate variables.
In matrix mechanics, observables such as position and momentum are represented by self-adjoint operators.[12] When considering pairs of observables, an important quantity is thecommutator. For a pair of operators and, one defines their commutator asIn the case of position and momentum, the commutator is thecanonical commutation relation
The physical meaning of the non-commutativity can be understood by considering the effect of the commutator on position and momentumeigenstates. Let be a right eigenstate of position with a constant eigenvaluex0. By definition, this means that Applying the commutator to yieldswhereÎ is theidentity operator.
Suppose, for the sake ofproof by contradiction, that is also a right eigenstate of momentum, with constant eigenvaluep0. If this were true, then one could writeOn the other hand, the above canonical commutation relation requires thatThis implies that no quantum state can simultaneously be both a position and a momentum eigenstate.
When a state is measured, it is projected onto an eigenstate in the basis of the relevant observable. For example, if a particle's position is measured, then the state amounts to a position eigenstate. This means that the state isnot a momentum eigenstate, however, but rather it can be represented as a sum of multiple momentum basis eigenstates. In other words, the momentum must be less precise. This precision may be quantified by the standard deviations,
As in the wave mechanics interpretation above, one sees a tradeoff between the respective precisions of the two, quantified by the uncertainty principle.
Consider a one-dimensional quantum harmonic oscillator. It is possible to express the position and momentum operators in terms of thecreation and annihilation operators:
Using the standard rules for creation and annihilation operators on the energy eigenstates,the variances may be computed directly,The product of these standard deviations is then
In particular, the above Kennard bound[6] is saturated for theground staten=0, for which the probability density is just thenormal distribution.
Quantum harmonic oscillators with Gaussian initial condition
Position (blue) and momentum (red) probability densities for an initial Gaussian distribution. From top to bottom, the animations show the casesΩ =ω,Ω = 2ω, andΩ =ω/2. Note the tradeoff between the widths of the distributions.
In a quantum harmonic oscillator of characteristic angular frequencyω, place a state that is offset from the bottom of the potential by some displacementx0 aswhere Ω describes the width of the initial state but need not be the same asω. Through integration over thepropagator, we can solve for the full time-dependent solution. After many cancelations, the probability densities reduce towhere we have used the notation to denote a normal distribution of meanμ and varianceσ2. Copying the variances above and applyingtrigonometric identities, we can write the product of the standard deviations as
From the relationswe can conclude the following (the right most equality holds only whenΩ =ω):
In the picture where the coherent state is a massive particle in a quantum harmonic oscillator, the position and momentum operators may be expressed in terms of the annihilation operators in the same formulas above and used to calculate the variances,Therefore, every coherent state saturates the Kennard boundwith position and momentum each contributing an amount in a "balanced" way. Moreover, everysqueezed coherent state also saturates the Kennard bound although the individual contributions of position and momentum need not be balanced in general.
The product of the standard deviations is thereforeFor all, the quantity is greater than 1, so the uncertainty principle is never violated. For numerical concreteness, the smallest value occurs when, in which case
Position space probability density of an initially Gaussian state moving at minimally uncertain, constant momentum in free space
Assume a particle initially has amomentum space wave function described by a normal distribution around some constant momentump0 according towhere we have introduced a reference scale, with describing the width of the distribution—cf.nondimensionalization. If the state is allowed to evolve in free space, then the time-dependent momentum and position space wave functions are
Since and, this can be interpreted as a particle moving along with constant momentum at arbitrarily high precision. On the other hand, the standard deviation of the position issuch that the uncertainty product can only increase with time as
Starting with Kennard's derivation of position-momentum uncertainty,Howard Percy Robertson developed[13][1] a formulation for arbitraryHermitian operators expressed in terms of their standard deviationwhere the brackets indicate anexpectation value of the observable represented by operator. For a pair of operators and, define their commutator asand the Robertson uncertainty relation is given by[14]
Erwin Schrödinger[15] showed how to allow for correlation between the operators, giving a stronger inequality, known as theRobertson–Schrödinger uncertainty relation,[16][1]
The derivation shown here incorporates and builds off of those shown in Robertson,[13] Schrödinger[16] and standard textbooks such as Griffiths.[17]: 138 For any Hermitian operator, based upon the definition ofvariance, we havewe let and thus
Similarly, for any other Hermitian operator in the same statefor
The product of the two deviations can thus be expressed as
1
In order to relate the two vectors and, we use theCauchy–Schwarz inequality[18] which is defined asand thus Equation (1) can be written as
2
Since is in general a complex number, we use the fact that the modulus squared of any complex number is defined as, where is the complex conjugate of. The modulus squared can also be expressed as
3
we let and and substitute these into the equation above to get
4
The inner product is written out explicitly asand using the fact that and are Hermitian operators, we find
Similarly it can be shown that
Thus, we haveand
We now substitute the above two equations above back into Eq. (4) and get
Substituting the above into Equation (2) we get the Schrödinger uncertainty relation
This proof has an issue[19] related to the domains of the operators involved. For the proof to make sense, the vector has to be in the domain of theunbounded operator, which is not always the case. In fact, the Robertson uncertainty relation is false if is an angle variable and is the derivative with respect to this variable. In this example, the commutator is a nonzero constant—just as in the Heisenberg uncertainty relation—and yet there are states where the product of the uncertainties is zero.[20] (See the counterexample section below.) This issue can be overcome by using avariational method for the proof,[21][22] or by working with an exponentiated version of the canonical commutation relations.[20]
Note that in the general form of the Robertson–Schrödinger uncertainty relation, there is no need to assume that the operators and areself-adjoint operators. It suffices to assume that they are merelysymmetric operators. (The distinction between these two notions is generally glossed over in the physics literature, where the termHermitian is used for either or both classes of operators. See Chapter 9 of Hall's book[23] for a detailed discussion of this important but technical distinction.)
In thephase space formulation of quantum mechanics, the Robertson–Schrödinger relation follows from a positivity condition on a real star-square function. Given aWigner function withstar product ★ and a functionf, the following is generally true:[24]
Choosing, we arrive at
Since this positivity condition is true foralla,b, andc, it follows that all the eigenvalues of the matrix are non-negative.
The non-negative eigenvalues then imply a corresponding non-negativity condition on thedeterminant,or, explicitly, after algebraic manipulation,
Since the Robertson and Schrödinger relations are for general operators, the relations can be applied to any two observables to obtain specific uncertainty relations. A few of the most common relations found in the literature are given below.
Position–linear momentum uncertainty relation: for the position and linear momentum operators, the canonical commutation relation implies the Kennard inequality from above:
Angular momentum uncertainty relation: For two orthogonal components of thetotal angular momentum operator of an object: wherei,j,k are distinct, andJi denotes angular momentum along thexi axis. This relation implies that unless all three components vanish together, only a single component of a system's angular momentum can be defined with arbitrary precision, normally the component parallel to an external (magnetic or electric) field. Moreover, for, a choice,, in angular momentum multiplets,ψ = |j,m⟩, bounds theCasimir invariant (angular momentum squared,) from below and thus yields useful constraints such asj(j + 1) ≥m(m + 1), and hencej ≥m, among others.
The derivation of the Robertson inequality for operators and requires and to be defined. There are quantum systems where these conditions are not valid.[27]One example is a quantumparticle on a ring, where the wave function depends on an angular variable in the interval. Define "position" and "momentum" operators and byandwith periodic boundary conditions on. The definition of depends the range from 0 to. These operators satisfy the usual commutation relations for position and momentum operators,. More precisely, whenever both and are defined, and the space of such is a dense subspace of the quantum Hilbert space.[28]
Now let be any of the eigenstates of, which are given by. These states are normalizable, unlike the eigenstates of the momentum operator on the line. Also the operator is bounded, since ranges over a bounded interval. Thus, in the state, the uncertainty of is zero and the uncertainty of is finite, so thatThe Robertson uncertainty principle does not apply in this case: is not in the domain of the operator, since multiplication by disrupts the periodic boundary conditions imposed on.[20]
For the usual position and momentum operators and on the real line, no such counterexamples can occur. As long as and are defined in the state, the Heisenberg uncertainty principle holds, even if fails to be in the domain of or of.[29]
The Robertson–Schrödinger uncertainty can be improved noting that it must hold for all components in any decomposition of thedensity matrix given asHere, for the probabilities and hold. Then, using the relationfor,it follows that[30]where the function in the bound is definedThe above relation very often has a bound larger than that of the original Robertson–Schrödinger uncertainty relation. Thus, we need to calculate the bound of the Robertson–Schrödinger uncertainty for the mixed components of the quantum state rather than for the quantum state, and compute an average of their square roots. The following expression is stronger than the Robertson–Schrödinger uncertainty relationwhere on the right-hand side there is a concave roof over the decompositions of the density matrix.The improved relation above is saturated by all single-qubit quantum states.[30]
With similar arguments, one can derive a relation with a convex roof on the right-hand side[30]where denotes thequantum Fisher information and the density matrix is decomposed to pure states asThe derivation takes advantage of the fact that thequantum Fisher information is the convex roof of the variance times four.[31][32]
A simpler inequality follows without a convex roof[33]which is stronger than the Heisenberg uncertainty relation, since for the quantum Fisher information we havewhile for pure states the equality holds.
The Robertson–Schrödinger uncertainty relation can be trivial if the state of the system is chosen to be eigenstate of one of the observable. The stronger uncertainty relations proved by Lorenzo Maccone andArun K. Pati give non-trivial bounds on the sum of the variances for two incompatible observables.[34] (Earlier works on uncertainty relations formulated as the sum of variances include, e.g., Ref.[35] due to Yichen Huang.) For two non-commuting observables and the first stronger uncertainty relation is given bywhere,, is a normalized vector that is orthogonal to the state of the system and one should choose the sign of to make this real quantity a positive number.
The second stronger uncertainty relation is given bywhere is a state orthogonal to.The form of implies that the right-hand side of the new uncertainty relation is nonzero unless is an eigenstate of. One may note that can be an eigenstate of without being an eigenstate of either or. However, when is an eigenstate of one of the two observables the Heisenberg–Schrödinger uncertainty relation becomes trivial. But the lower bound in the new relation is nonzero unless is an eigenstate of both.
An energy–time uncertainty relation like has a long, controversial history; the meaning of and varies and different formulations have different arenas of validity.[36] However, one well-known application is both well established[37][38] and experimentally verified:[39][40] the connection between the life-time of a resonance state, and its energy width:In particle-physics, widths from experimental fits to theBreit–Wigner energy distribution are used to characterize the lifetime of quasi-stable or decaying states.[41]
An informal, heuristic meaning of the principle is the following:[42] A state that only exists for a short time cannot have a definite energy. To have a definite energy, the frequency of the state must be defined accurately, and this requires the state to hang around for many cycles, the reciprocal of the required accuracy. For example, inspectroscopy, excited states have a finite lifetime. By the time–energy uncertainty principle, they do not have a definite energy, and, each time they decay, the energy they release is slightly different. The average energy of the outgoing photon has a peak at the theoretical energy of the state, but the distribution has a finite width called thenatural linewidth. Fast-decaying states have a broad linewidth, while slow-decaying states have a narrow linewidth.[43] The same linewidth effect also makes it difficult to specify therest mass of unstable, fast-decaying particles inparticle physics. The faster theparticle decays (the shorter its lifetime), the less certain is its mass (the larger the particle'swidth).
The concept of "time" in quantum mechanics offers many challenges.[44] There is no quantum theory of time measurement; relativity is both fundamental to time and difficult to include in quantum mechanics.[36] While position and momentum are associated with a single particle, time is a system property: it has no operator needed for the Robertson–Schrödinger relation.[1] The mathematical treatment of stable and unstable quantum systems differ.[45] These factors combine to make energy–time uncertainty principles controversial.
Three notions of "time" can be distinguished:[36] external, intrinsic, and observable. External or laboratory time is seen by the experimenter; intrinsic time is inferred by changes in dynamic variables, like the hands of a clock or the motion of a free particle; observable time concerns time as an observable, the measurement of time-separated events.
An external-time energy–time uncertainty principle might say that measuring the energy of a quantum system to an accuracy requires a time interval.[38] However,Yakir Aharonov andDavid Bohm[46][36] have shown that, in some quantum systems, energy can be measured accurately within an arbitrarily short time: external-time uncertainty principles are not universal.
Intrinsic time is the basis for several formulations of energy–time uncertainty relations, including the Mandelstam–Tamm relation discussed in the next section. A physical system with an intrinsic time closely matching the external laboratory time is called a "clock".[44]: 31
Observable time, measuring time between two events, remains a challenge for quantum theories; some progress has been made using positive operator-valued measure concepts.[36]
In 1945,Leonid Mandelstam andIgor Tamm derived a non-relativistictime–energy uncertainty relation as follows.[47][36] From Heisenberg mechanics, the generalizedEhrenfest theorem for an observableB without explicit time dependence, represented by a self-adjoint operator relates time dependence of the average value of to the average of its commutator with the Hamiltonian:
The value of is then substituted in theRobertson uncertainty relation for the energy operator and:giving(whenever the denominator is nonzero).While this is a universal result, it depends upon the observable chosen and that the deviations and are computed for a particular state.Identifying and the characteristic timegives an energy–time relationshipAlthough has the dimension of time, it is different from the time parametert that enters theSchrödinger equation. This can be interpreted as time for which the expectation value of the observable, changes by an amount equal to one standard deviation.[48]Examples:
The time a free quantum particle passes a point in space is more uncertain as the energy of the state is more precisely controlled: Since the time spread is related to the particle position spread and the energy spread is related to the momentum spread, this relation is directly related to position–momentum uncertainty.[17]: 144
ADelta particle, a quasistable composite of quarks related to protons and neutrons, has a lifetime of 10−23 s, so its measured mass equivalent to energy, 1232 MeV/c2, varies by ±120 MeV/c2; this variation is intrinsic and not caused by measurement errors.[17]: 144
Two energy states with energies superimposed to create a composite state
The probability amplitude of this state has a time-dependent interference term:
The oscillation period varies inversely with the energy difference:.[17]: 144
Each example has a different meaning for the time uncertainty, according to the observable and state used.
Some formulations ofquantum field theory uses temporary electron–positron pairs in its calculations calledvirtual particles. The mass-energy and lifetime of these particles are related by the energy–time uncertainty relation. The energy of a quantum systems is not known with enough precision to limit their behavior to a single, simple history. Thus the influence ofall histories must be incorporated into quantum calculations, including those with much greater or much less energy than the mean of the measured/calculated energy distribution.
The energy–time uncertainty principle does not temporarily violateconservation of energy; it does not imply that energy can be "borrowed" from the universe as long as it is "returned" within a short amount of time.[17]: 145 The energy of the universe is not an exactly known parameter at all times.[1] When events transpire at very short time intervals, there is uncertainty in the energy of these events.
In the context ofharmonic analysis the uncertainty principle implies that one cannot at the same time localize the value of a function and its Fourier transform. To wit, the following inequality holds,
Further mathematical uncertainty inequalities, including the aboveentropic uncertainty, hold between a functionf and its Fourier transform ƒ̂:[49][50][51]
In the context oftime–frequency analysis uncertainty principles are referred to as theGabor limit, afterDennis Gabor, or sometimes theHeisenberg–Gabor limit. The basic result, which follows from "Benedicks's theorem", below, is that a function cannot be bothtime limited andband limited (a function and its Fourier transform cannot both have bounded domain)—seebandlimited versus timelimited. More accurately, thetime-bandwidth orduration-bandwidth product satisfieswhere and are the standard deviations of the time and frequency energy concentrations respectively.[52] The minimum is attained for aGaussian-shaped pulse (Gabor wavelet) [For the un-squared Gaussian (i.e. signal amplitude) and its un-squared Fourier transform magnitude; squaring reduces each by a factor.] Another common measure is the product of the time and frequencyfull width at half maximum (of the power/energy), which for the Gaussian equals (seebandwidth-limited pulse).
Stated differently, one cannot simultaneously sharply localize a signalf in both thetime domain andfrequency domain.
When applied tofilters, the result implies that one cannot simultaneously achieve a high temporal resolution and high frequency resolution at the same time; a concrete example are theresolution issues of the short-time Fourier transform—if one uses a wide window, one achieves good frequency resolution at the cost of temporal resolution, while a narrow window has the opposite trade-off.
Alternate theorems give more precise quantitative results, and, in time–frequency analysis, rather than interpreting the (1-dimensional) time and frequency domains separately, one instead interprets the limit as a lower limit on the support of a function in the (2-dimensional) time–frequency plane. In practice, the Gabor limit limits thesimultaneous time–frequency resolution one can achieve without interference; it is possible to achieve higher resolution, but at the cost of different components of the signal interfering with each other.
As a result, in order to analyze signals where thetransients are important, thewavelet transform is often used instead of the Fourier.
Denote by the number of non-zero elements in the time sequence and by the number of non-zero elements in the frequency sequence. Then,
This inequality issharp, with equality achieved whenx orX is a Dirac mass, or more generally whenx is a nonzero multiple of a Dirac comb supported on a subgroup of the integers moduloN (in which caseX is also a Dirac comb supported on a complementary subgroup, and vice versa).
More generally, ifT andW are subsets of the integers moduloN, let denote the time-limiting operator andband-limiting operators, respectively. Thenwhere the norm is theoperator norm of operators on the Hilbert space of functions on the integers moduloN. This inequality has implications forsignal reconstruction.[53]
WhenN is aprime number, a stronger inequality holds:Discovered byTerence Tao, this inequality is also sharp.[54]
Amrein–Berthier[55] and Benedicks's theorem[56] intuitively says that the set of points wheref is non-zero and the set of points whereƒ̂ is non-zero cannot both be small.
Specifically, it is impossible for a functionf inL2(R) and its Fourier transformƒ̂ to both besupported on sets of finiteLebesgue measure. A more quantitative version is[57][58]
One expects that the factorCeC|S||Σ| may be replaced byCeC(|S||Σ|)1/d, which is only known if eitherS orΣ is convex.
The mathematicianG. H. Hardy formulated the following uncertainty principle:[59] it is not possible forf and ƒ̂ to both be "very rapidly decreasing". Specifically, iff in is such thatand ( an integer),then, ifab > 1,f = 0, while ifab = 1, then there is a polynomialP of degree≤N such that
This was later improved as follows: if is such thatthenwhereP is a polynomial of degree(N −d)/2 andA is a reald ×d positive definite matrix.
This result was stated in Beurling's complete works without proof and proved in Hörmander[60] (the case) and Bonami, Demange, and Jaming[61] for the general case. Note that Hörmander–Beurling's version implies the caseab > 1 in Hardy's Theorem while the version by Bonami–Demange–Jaming covers the full strength of Hardy's Theorem. A different proof of Beurling's theorem based on Liouville's theorem appeared in ref.[62]
A full description of the caseab < 1 as well as the following extension to Schwartz class distributions appears in ref.[63]
Theorem— If a tempered distribution is such thatandthenfor some convenient polynomialP and real positive definite matrixA of typed ×d.
Inquantum metrology, and especiallyinterferometry, theHeisenberg limit is the optimal rate at which the accuracy of a measurement can scale with the energy used in the measurement. Typically, this is the measurement of a phase (applied to one arm of abeam-splitter) and the energy is given by the number of photons used in aninterferometer. Although some claim to have broken the Heisenberg limit, this reflects disagreement on the definition of the scaling resource.[64] Suitably defined, the Heisenberg limit is a consequence of the basic principles of quantum mechanics and cannot be beaten, although the weak Heisenberg limit can be beaten.[65]
The inequalities above focus on thestatistical imprecision of observables as quantified by the standard deviation. Heisenberg's original version, however, was dealing with thesystematic error, a disturbance of the quantum system produced by the measuring apparatus, i.e., an observer effect.
If we let represent the error (i.e.,inaccuracy) of a measurement of an observableA and the disturbance produced on a subsequent measurement of the conjugate variableB by the former measurement ofA, then the inequality proposed by Masanao Ozawa − encompassing both systematic and statistical errors - holds:[66]
Heisenberg's uncertainty principle, as originally described in the 1927 formulation, mentions only the first term of Ozawa inequality, regarding thesystematic error. Using the notation above to describe theerror/disturbance effect ofsequential measurements (firstA, thenB), it could be written as
The formal derivation of the Heisenberg relation is possible but far from intuitive. It wasnot proposed by Heisenberg, but formulated in a mathematically consistent way only in recent years.[67][68]Also, it must be stressed that the Heisenberg formulation is not taking into account the intrinsic statistical errors and. There is increasing experimental evidence[69][70][71][72] that the total quantum uncertainty cannot be described by the Heisenberg term alone, but requires the presence of all the three terms of the Ozawa inequality.
Using the same formalism,[1] it is also possible to introduce the other kind of physical situation, often confused with the previous one, namely the case ofsimultaneous measurements (A andB at the same time):
The two simultaneous measurements onA andB are necessarily[73]unsharp orweak.
It is also possible to derive an uncertainty relation that, as the Ozawa's one, combines both the statistical and systematic error components, but keeps a form very close to the Heisenberg original inequality. By adding Robertson[1]
and Ozawa relations we obtainThe four terms can be written as:Defining:as theinaccuracy in the measured values of the variableA andas theresulting fluctuation in the conjugate variableB, Kazuo Fujikawa[74] established an uncertainty relation similar to the Heisenberg original one, but valid both forsystematic and statistical errors:
For many distributions, the standard deviation is not a particularly natural way of quantifying the structure. For example, uncertainty relations in which one of the observables is an angle has little physical meaning for fluctuations larger than one period.[22][75][76][77] Other examples include highlybimodal distributions, orunimodal distributions with divergent variance.
A solution that overcomes these issues is an uncertainty based onentropic uncertainty instead of the product of variances. While formulating themany-worlds interpretation of quantum mechanics in 1957,Hugh Everett III conjectured a stronger extension of the uncertainty principle based on entropic certainty.[78] This conjecture, also studied by I. I. Hirschman[79] and proven in 1975 by W. Beckner[80] and by Iwo Bialynicki-Birula and Jerzy Mycielski[81] is that, for two normalized, dimensionless Fourier transform pairsf(a) andg(b) where
The probability distribution functions associated with the position wave functionψ(x) and the momentum wave functionφ(x) have dimensions of inverse length and momentum respectively, but the entropies may be rendered dimensionless bywherex0 andp0 are some arbitrarily chosen length and momentum respectively, which render the arguments of the logarithms dimensionless. Note that the entropies will be functions of these chosen parameters. Due to theFourier transform relation between the position wave functionψ(x) and the momentum wavefunctionφ(p), the above constraint can be written for the corresponding entropies as
Depending on one's choice of thex0 p0 product, the expression may be written in many ways. Ifx0p0 is chosen to beh, then
If, instead,x0p0 is chosen to beħ, then
Ifx0 andp0 are chosen to be unity in whatever system of units are being used, thenwhereh is interpreted as a dimensionless number equal to the value of the Planck constant in the chosen system of units. Note that these inequalities can be extended to multimode quantum states, or wavefunctions in more than one spatial dimension.[82]
The quantum entropic uncertainty principle is more restrictive than the Heisenberg uncertainty principle. From the inverselogarithmic Sobolev inequalities[83](equivalently, from the fact that normal distributions maximize the entropy of all such with a given variance), it readily follows that this entropic uncertainty principle isstronger than the one based on standard deviations, because
In other words, the Heisenberg uncertainty principle, is a consequence of the quantum entropic uncertainty principle, but not vice versa. A few remarks on these inequalities. First, the choice ofbase e is a matter of popular convention in physics. The logarithm can alternatively be in any base, provided that it be consistent on both sides of the inequality. Second, recall theShannon entropy has been used,not the quantumvon Neumann entropy. Finally, the normal distribution saturates the inequality, and it is the only distribution with this property, because it is themaximum entropy probability distribution among those with fixed variance (cf.here for proof).
Entropic uncertainty of the normal distribution
We demonstrate this method on the ground state of the QHO, which as discussed above saturates the usual uncertainty based on standard deviations. The length scale can be set to whatever is convenient, so we assign
The probability distribution is the normal distributionwith Shannon entropy
A completely analogous calculation proceeds for the momentum distribution. Choosing a standard momentum of:
The entropic uncertainty is therefore the limiting value
A measurement apparatus will have a finite resolution set by the discretization of its possible outputs into bins, with the probability of lying within one of the bins given by the Born rule. We will consider the most common experimental situation, in which the bins are of uniform size. Letδx be a measure of the spatial resolution. We take the zeroth bin to be centered near the origin, with possibly some small constant offsetc. The probability of lying within the jth interval of widthδx is
To account for this discretization, we can define the Shannon entropy of the wave function for a given measurement apparatus as
Under the above definition, the entropic uncertainty relation is
Here we note thatδxδp/h is a typical infinitesimal phase space volume used in the calculation of apartition function. The inequality is also strict and not saturated. Efforts to improve this bound are an active area of research.
Normal distribution example
We demonstrate this method first on the ground state of the QHO, which as discussed above saturates the usual uncertainty based on standard deviations.
The probability of lying within one of these bins can be expressed in terms of theerror function.
The momentum probabilities are completely analogous.
For simplicity, we will set the resolutions toso that the probabilities reduce to
The Shannon entropy can be evaluated numerically.
The entropic uncertainty is indeed larger than the limiting value.
Note that despite being in the optimal case, the inequality is not saturated.
Sinc function example
An example of a unimodal distribution with infinite variance is thesinc function. If the wave function is the correctly normalized uniform distribution,
then its Fourier transform is the sinc function,which yields infinite momentum variance despite having a centralized shape. The entropic uncertainty, on the other hand, is finite. Suppose for simplicity that the spatial resolution is just a two-bin measurement,δx = a, and that the momentum resolution isδp = h/a.
Partitioning the uniform spatial distribution into two equal bins is straightforward. We set the offsetc = 1/2 so that the two bins span the distribution.
The bins for momentum must cover the entire real line. As done with the spatial distribution, we could apply an offset. It turns out, however, that the Shannon entropy is minimized when the zeroth bin for momentum is centered at the origin. (The reader is encouraged to try adding an offset.) The probability of lying within an arbitrary momentum bin can be expressed in terms of thesine integral.
The Shannon entropy can be evaluated numerically.
The entropic uncertainty is indeed larger than the limiting value.
Uncertainty relation with three angular momentum components
For a particle oftotal angular momentum the following uncertainty relation holdswhere are angular momentum components. The relation can be derived fromandThe relation can be strengthened as[30][84]where is the quantum Fisher information.
In 1925 Heisenberg published theUmdeutung (reinterpretation) paper where he showed that central aspect of quantum theory was the non-commutativity: the theory implied that the relative order of position and momentum measurement was significant. Working withMax Born andPascual Jordan, he continued to developmatrix mechanics, that would become the first modern quantum mechanics formulation.[85]
Werner Heisenberg and Niels Bohr
In March 1926, working in Bohr's institute, Heisenberg realized that the non-commutativity implies the uncertainty principle. Writing toWolfgang Pauli in February 1927, he worked out the basic concepts.[86]
In his celebrated 1927 paper "Über den anschaulichen Inhalt der quantentheoretischen Kinematik und Mechanik" ("On the Perceptual Content of Quantum Theoretical Kinematics and Mechanics"), Heisenberg established this expression as the minimum amount of unavoidable momentum disturbance caused by any position measurement,[2] but he did not give a precise definition for the uncertainties Δx and Δp. Instead, he gave some plausible estimates in each case separately. His paper gave an analysis in terms of a microscope that Bohr showed was incorrect; Heisenberg included an addendum to the publication.
In his 1930 Chicago lecture[87] he refined his principle:
A1
Later work broadened the concept. Any two variables that do not commute cannot be measured simultaneously—the more precisely one is known, the less precisely the other can be known. Heisenberg wrote:
It can be expressed in its simplest form as follows: One can never know with perfect accuracy both of those two important factors which determine the movement of one of the smallest particles—its position and its velocity. It is impossible to determine accuratelyboth the position and the direction and speed of a particleat the same instant.[88]
Kennard[6][1]: 204 in 1927 first proved the modern inequality:
A2
whereħ =h/2π, andσx,σp are the standard deviations of position and momentum. (Heisenberg only proved relation (A2) for the special case of Gaussian states.[87]) In 1929 Robertson generalized the inequality to all observables and in 1930 Schrödinger extended the form to allow non-zero covariance of the operators; this result is referred to as Robertson-Schrödinger inequality.[1]: 204
Throughout the main body of his original 1927 paper, written in German, Heisenberg used the word "Ungenauigkeit",[2]to describe the basic theoretical principle. Only in the endnote did he switch to the word "Unsicherheit". Later on, he always used "Unbestimmtheit". When the English-language version of Heisenberg's textbook,The Physical Principles of the Quantum Theory, was published in 1930, however, only the English word "uncertainty" was used, and it became the term in the English language.[89]
Heisenberg's gamma-ray microscope for locating an electron (shown in blue). The incoming gamma ray (shown in green) is scattered by the electron up into the microscope's aperture angleθ. The scattered gamma-ray is shown in red. Classicaloptics shows that the electron position can be resolved only up to an uncertainty Δx that depends onθ and the wavelengthλ of the incoming light.
The principle is quite counter-intuitive, so the early students of quantum theory had to be reassured that naive measurements to violate it were bound always to be unworkable. One way in which Heisenberg originally illustrated the intrinsic impossibility of violating the uncertainty principle is by using theobserver effect of an imaginary microscope as a measuring device.[87]
He imagines an experimenter trying to measure the position and momentum of anelectron by shooting aphoton at it.[90]: 49–50
Problem 1 – If the photon has a shortwavelength, and therefore, a large momentum, the position can be measured accurately. But the photon scatters in a random direction, transferring a large and uncertain amount of momentum to the electron. If the photon has a longwavelength and low momentum, the collision does not disturb the electron's momentum very much, but the scattering will reveal its position only vaguely.
Problem 2 – If a largeaperture is used for the microscope, the electron's location can be well resolved (seeRayleigh criterion); but by the principle ofconservation of momentum, the transverse momentum of the incoming photon affects the electron's beamline momentum and hence, the new momentum of the electron resolves poorly. If a small aperture is used, the accuracy of both resolutions is the other way around.
The combination of these trade-offs implies that no matter what photon wavelength and aperture size are used, the product of the uncertainty in measured position and measured momentum is greater than or equal to a lower limit, which is (up to a small numerical factor) equal to thePlanck constant.[91] Heisenberg did not care to formulate the uncertainty principle as an exact limit, and preferred to use it instead, as a heuristic quantitative statement, correct up to small numerical factors, which makes the radically new noncommutativity of quantum mechanics inevitable.
Historically, the uncertainty principle has been confused[92][66] with a related effect inphysics, called theobserver effect, which notes that measurements of certain systems cannot be made without affecting the system,[93][94] that is, without changing something in a system. Heisenberg used such an observer effect at the quantum level (see below) as a physical "explanation" of quantum uncertainty.[95] It has since become clearer, however, that the uncertainty principle is inherent in the properties of allwave-like systems,[69] and that it arises in quantum mechanics simply due to thematter wave nature of all quantum objects.[96] Thus, the uncertainty principle actually states a fundamental property of quantum systems and is not a statement about the observational success of current technology.[97]
The Copenhagen interpretation of quantum mechanics and Heisenberg's uncertainty principle were, in fact, initially seen as twin targets by detractors. According to theCopenhagen interpretation of quantum mechanics, there is no fundamental reality that thequantum state describes, just a prescription for calculating experimental results. There is no way to say what the state of a system fundamentally is, only what the result of observations might be.
Albert Einstein believed that randomness is a reflection of our ignorance of some fundamental property of reality, whileNiels Bohr believed that the probability distributions are fundamental and irreducible, and depend on which measurements we choose to perform.Einstein and Bohr debated the uncertainty principle for many years.
Wolfgang Pauli called Einstein's fundamental objection to the uncertainty principle "the ideal of the detached observer" (phrase translated from the German):
"Like the moon has a definite position," Einstein said to me last winter, "whether or not we look at the moon, the same must also hold for the atomic objects, as there is no sharp distinction possible between these and macroscopic objects. Observation cannotcreate an element of reality like a position, there must be something contained in the complete description of physical reality which corresponds to thepossibility of observing a position, already before the observation has been actually made." I hope, that I quoted Einstein correctly; it is always difficult to quote somebody out of memory with whom one does not agree. It is precisely this kind of postulate which I call the ideal of the detached observer.
— Letter from Pauli to Niels Bohr, February 15, 1955[98]
The first of Einstein'sthought experiments challenging the uncertainty principle went as follows:
Consider a particle passing through a slit of widthd. The slit introduces an uncertainty in momentum of approximatelyh/d because the particle passes through the wall. But let us determine the momentum of the particle by measuring the recoil of the wall. In doing so, we find the momentum of the particle to arbitrary accuracy by conservation of momentum.
Bohr's response was that the wall is quantum mechanical as well, and that to measure the recoil to accuracyΔp, the momentum of the wall must be known to this accuracy before the particle passes through. This introduces an uncertainty in the position of the wall and therefore the position of the slit equal toh/Δp, and if the wall's momentum is known precisely enough to measure the recoil, the slit's position is uncertain enough to disallow a position measurement.
A similar analysis with particles diffracting through multiple slits is given byRichard Feynman.[99]
Bohr was present when Einstein proposed the thought experiment which has become known asEinstein's box. Einstein argued that "Heisenberg's uncertainty equation implied that the uncertainty in time was related to the uncertainty in energy, the product of the two being related to the Planck constant."[100] Consider, he said, an ideal box, lined with mirrors so that it can contain light indefinitely. The box could be weighed before a clockwork mechanism opened an ideal shutter at a chosen instant to allow one single photon to escape. "We now know, explained Einstein, precisely the time at which the photon left the box."[101] "Now, weigh the box again. The change of mass tells the energy of the emitted light. In this manner, said Einstein, one could measure the energy emitted and the time it was released with any desired precision, in contradiction to the uncertainty principle."[100]
Bohr spent a sleepless night considering this argument, and eventually realized that it was flawed. He pointed out that if the box were to be weighed, say by a spring and a pointer on a scale, "since the box must move vertically with a change in its weight, there will be uncertainty in its vertical velocity and therefore an uncertainty in its height above the table. ... Furthermore, the uncertainty about the elevation above the Earth's surface will result in an uncertainty in the rate of the clock",[102] because of Einstein's own theory ofgravity's effect on time. "Through this chain of uncertainties, Bohr showed that Einstein's light box experiment could not simultaneously measure exactly both the energy of the photon and the time of its escape."[103]
In 1935, Einstein,Boris Podolsky andNathan Rosen published an analysis of spatially separatedentangled particles (EPR paradox).[104] According to EPR, one could measure the position of one of the entangled particles and the momentum of the second particle, and from those measurements deduce the position and momentum of both particles to any precision, violating the uncertainty principle. In order to avoid such possibility, the measurement of one particle must modify the probability distribution of the other particle instantaneously, possibly violating theprinciple of locality.[105]
In 1964,John Stewart Bell showed that this assumption can be falsified, since it would imply a certaininequality between the probabilities of different experiments.Experimental results confirm the predictions of quantum mechanics, ruling out EPR's basic assumption oflocal hidden variables.
Science philosopherKarl Popper approached the problem of indeterminacy as a logician andmetaphysical realist.[106] He disagreed with the application of the uncertainty relations to individual particles rather than toensembles of identically prepared particles, referring to them as "statistical scatter relations".[106][107] In this statistical interpretation, aparticular measurement may be made to arbitrary precision without invalidating the quantum theory.
In 1934, Popper publishedZur Kritik der Ungenauigkeitsrelationen ("Critique of the Uncertainty Relations") inNaturwissenschaften,[108] and in the same yearLogik der Forschung (translated and updated by the author asThe Logic of Scientific Discovery in 1959[106]), outlining his arguments for the statistical interpretation. In 1982, he further developed his theory inQuantum theory and the schism in Physics, writing:
[Heisenberg's] formulae are, beyond all doubt, derivablestatistical formulae of the quantum theory. But they have beenhabitually misinterpreted by those quantum theorists who said that these formulae can be interpreted as determining some upper limit to theprecision of our measurements. [original emphasis][109]
Popper proposed an experiment tofalsify the uncertainty relations, although he later withdrew his initial version after discussions withCarl Friedrich von Weizsäcker, Heisenberg, and Einstein; Popper sent his paper to Einstein and it may have influenced the formulation of the EPR paradox.[110]: 720
Some scientists, includingArthur Compton[111] andMartin Heisenberg,[112] have suggested that the uncertainty principle, or at least the general probabilistic nature of quantum mechanics, could be evidence for the two-stage model of free will. One critique, however, is that apart from the basic role of quantum mechanics as a foundation for chemistry,nontrivial biological mechanisms requiring quantum mechanics are unlikely, due to the rapiddecoherence time of quantum systems at room temperature.[113] Proponents of this theory commonly say that this decoherence is overcome by both screening and decoherence-free subspaces found in biological cells.[113]
Uncertainty principles relate quantum particles – electrons for example – to classical concepts – position and momentum. This presumes quantum particles have position and momentum.Edwin C. Kemble pointed out[115][clarification needed] in 1937 that such properties cannot be experimentally verified and assuming they exist gives rise to many contradictions; similarlyRudolf Haag notes that position in quantum mechanics is an attribute of an interaction, say between an electron and a detector, not an intrinsic property.[116][117] From this point of view the uncertainty principle is not a fundamental quantum property but a concept "carried over from the language of our ancestors", as Kemble says.
Since the uncertainty principle is such a basic result in quantum mechanics, typical experiments in quantum mechanics routinely observe aspects of it. All forms ofspectroscopy, includingparticle physics use the relationship to relate measured energy line-width to the lifetime of quantum states. Certain experiments, however, may deliberately test a particular form of the uncertainty principle as part of their main research program. These include, for example, tests of number–phase uncertainty relations insuperconducting[118] orquantum optics[119] systems. Applications dependent on the uncertainty principle for their operation include extremely low-noise technology such as that required ingravitational wave interferometers.[120]
Goodhart's law – Adage about statistical measures — when an attempt is made to use a statistical measure for purposes of control (directing), its statistical validity breaks down
^Schrödinger, E., Zum Heisenbergschen Unschärfeprinzip, Berliner Berichte, 1930, pp. 296–303.
^abSchrödinger, E. (1930), "Zum Heisenbergschen Unschärfeprinzip",Sitzungsberichte der Preussischen Akademie der Wissenschaften, Physikalisch-mathematische Klasse,14:296–303
^Anderson, P. W. (1964), "Special Effects in Superconductivity", in Caianiello, E. R. (ed.),Lectures on the Many-Body Problem, Vol. 2, New York: Academic Press
^Karplus, Martin, and Porter, Richard Needham (1970).Atoms and Molecules. California: Benjamin Cummings. p. 68ISBN978-0805352184.OCLC984466711
^The broad linewidth of fast-decaying states makes it difficult to accurately measure the energy of the state, and researchers have even used detuned microwave cavities to slow down the decay rate, to get sharper peaks.Gabrielse, Gerald; H. Dehmelt (1985). "Observation of Inhibited Spontaneous Emission".Physical Review Letters.55 (1):67–70.Bibcode:1985PhRvL..55...67G.doi:10.1103/PhysRevLett.55.67.PMID10031682.
^Amrein, W.O.; Berthier, A.M. (1977), "On support properties ofLp-functions and their Fourier transforms",Journal of Functional Analysis,24 (3):258–267,doi:10.1016/0022-1236(77)90056-8.
^Nazarov, F. (1994), "Local estimates for exponential polynomials and their applications to inequalities of the uncertainty principle type",St. Petersburg Math. J.,5:663–717
^Whittaker, Edmund T. (1989).A history of the theories of aether & electricity. Vol. II: The modern theories,1900–1926 (Repr ed.). New York: Dover Publ. p. 267.ISBN978-0-486-26126-3.
^abcHeisenberg, W. (1930),Physikalische Prinzipien der Quantentheorie (in German), Leipzig: Hirzel English translationThe Physical Principles of Quantum Theory. Chicago: University of Chicago Press, 1930.
^Heisenberg, W.,Die Physik der Atomkerne, Taylor & Francis, 1952, p. 30.
^Cassidy, David; Saperstein, Alvin M. (2009), "Beyond Uncertainty: Heisenberg, Quantum Physics, and the Bomb",Physics Today,63 (1), New York: Bellevue Literary Press: 185,Bibcode:2010PhT....63a..49C,doi:10.1063/1.3293416
^Greenstein, George;Zajonc, Arthur (2006).The Quantum Challenge: Modern Research on the Foundations of Quantum Mechanics. Jones & Bartlett Learning.ISBN978-0-7637-2470-2.
^Wheeler, John Archibald (1977), Lopes, José Leite; Paty, Michel (eds.),"Include the Observer in the Wave Function?",Quantum Mechanics, A Half Century Later: Papers of a Colloquium on Fifty Years of Quantum Mechanics, Held at the University Louis Pasteur, Strasbourg, May 2–4, 1974, Episteme, Dordrecht: Springer Netherlands, pp. 1–18,doi:10.1007/978-94-010-1196-9_1,ISBN978-94-010-1196-9,archived from the original on 2024-02-23, retrieved2023-07-19
^Werner Heisenberg,The Physical Principles of the Quantum Theory, p. 20
^Kumar, Manjit (2011).Quantum: Einstein, Bohr and the great debate about the nature of reality (1st paperback ed.). New York: Norton.ISBN978-0-393-33988-8.
^Jarvie, Ian Charles; Milford, Karl; Miller, David W. (2006).Karl Popper: a centenary assessment. Vol. 3. Ashgate.ISBN978-0-7546-5712-5.
^Popper, Karl; Carl Friedrich von Weizsäcker (1934). "Zur Kritik der Ungenauigkeitsrelationen" [Critique of the Uncertainty Relations].Naturwissenschaften (in German).22 (48):807–808.Bibcode:1934NW.....22..807P.doi:10.1007/BF01496543.S2CID40843068.
^Popper, K. (1982).Quantum theory and the schism in Physics. Unwin Hyman. pp. 53–54.
^Elion, W. J.; Matters, M.; Geigenmüller, U.; Mooij, J. E. (1994). "Direct demonstration of Heisenberg's uncertainty principle in a superconductor".Nature.371 (6498):594–595.Bibcode:1994Natur.371..594E.doi:10.1038/371594a0.S2CID4240085.