Optical heterodyne detection is a method of extracting information encoded asmodulation of thephase,frequency or both ofelectromagnetic radiation in thewavelength band of visible orinfrared light. The light signal is compared with standard or reference light from a "local oscillator" (LO) that would have a fixed offset in frequency and phase from the signal if the latter carried null information. "Heterodyne" signifies more than one frequency, in contrast to the single frequency employed inhomodyne detection.[1]
The comparison of the two light signals is typically accomplished by combining them in aphotodiode detector, which has a response that islinear inenergy, and hencequadratic inamplitude ofelectromagnetic field. Typically, the two light frequencies are similar enough that their difference orbeat frequency observed by the detector is in the radio or microwave band that can be conveniently processed by electronic means.
This technique became widely applicable totopographical andvelocity-sensitiveimaging with the invention in the 1990s of synthetic array heterodyne detection.[2] The light reflected from a target scene is focused on a relatively inexpensivephotodetector consisting of a single large physical pixel, while a different LO frequency is tightly focused on each virtual pixel of this detector (a different LO frequency signal focused on a different part of the detector), resulting in an electrical signal from the detector carrying a mixture of beat frequencies that can be electronically isolated and distributed spatially (as we know which part of the detector gives which beat frequency) to present an image of the scene.[2]
Optical heterodyne detection began to be studied at least as early as 1962, within two years of the construction of the firstlaser.[3] However, laser illumination is not the only way to produce spatially coherent light. In 1995, Guerra[4] published results in which he used a "form of optical heterodyning" to detect and image a grating with frequency many times smaller than the illuminating wavelength, and therefore smaller than the resolution, or passband, of the microscope, by beating it against a local oscillator in the form of a similar but transparent grating. A form of super-resolution microscopy, this work continues to spawn a family and generation of microscopes of particular use in the life sciences, known as "structured illumination microscopy", Polaroid Corp. patented Guerra's invention in 1997.[5]
It is instructive to contrast the practical aspects ofheterodyne detection inoptical band toradio frequency (RF) band.
Unlike RF band detection, optical frequencies oscillate too rapidly to directly measure and process theelectric field electronically (e.g., 632 nm in wavelength for a visibleHeNe laser that appears red, is 4.75×1014 Hz in frequency). Instead optical photons are (usually) detected by absorbing the photon's energy, thus only revealing the magnitude of an optical signal, not the electric field phase. Hence the primary purpose ofheterodyne mixing is to down shift the signal from the optical band to an electronically tractable frequency range.
In RF band detection, typically, the electromagnetic field drives oscillatory motion of electrons in anantenna; the capturedEMF is subsequently electronically mixed with a local oscillator (LO) by any convenient non-linear circuit element with a quadratic term (most commonly a rectifier). In optical detection, the desired non-linearity is inherent in the photon absorption process itself. Conventional light detectors—so called "Square-law detectors"—respond to thephoton energy to free bound electrons, and since the energy flux scales as the square of the electric field, so does the rate at which electrons are freed. A frequency difference between an input signal and a LO signal to a detector appears in the detector output electrical current, only when both signals illuminate the detector at the same time, causing the square of their combined fields to have a cross term or "difference" frequency modulating the average rate at which free electrons are generated.
Another point of contrast is the expected bandwidth of the input signal and local oscillator signal to the detector. Typically, an RF local oscillator is a pure frequency; pragmatically, "purity" means that a local oscillator's frequency bandwidth is much much less than the difference frequency between the input and LO signals. With optical signals, even with a laser, it is not simple to produce a reference frequency sufficiently pure to have either an instantaneous bandwidth or long term temporal stability that is less than a typical megahertz or kilohertz scale difference frequency. For this reason, the same source is often used to produce the LO and the input signals so that their difference frequency can be kept constant even if the center frequency wanders.
As a result, the mathematics of squaring the sum of two pure tones, normally invoked to explain RFheterodyne detection, is an oversimplified model of optical heterodyne detection. Nevertheless, the intuitive pure-frequency heterodyne concept still holds perfectly for thewideband caseprovided that the signal and LO are mutuallycoherent. Crucially, one can obtain narrow-band interference from coherent broadband sources: this is the basis forwhite light interferometry andoptical coherence tomography. Mutual coherence permits the rainbow inNewton's rings, andsupernumerary rainbows.
Consequently, optical heterodyne detection is usually performed asinterferometry where the LO and (input) signal share a common origin, rather than, as in radio, a transmitter sending to a remote receiver. The remote receiver geometry is uncommon because generating a local oscillator signal that is coherent with a signal of independent origin is technologically difficult at optical frequencies. However, lasers of sufficiently narrow linewidth to allow the signal and LO to originate from different lasers do exist.[6]
After optical heterodyne became an established technique, consideration was given to the conceptual basis for operation at such low signal light levels that "only a few, or even fractions of, photons enter the receiver in a characteristic time interval".[7] It was concluded that even when photons of different energies (so different frequencies as energy per photon is where is thePlanck constant and is the wave frequency) are absorbed at a countable rate by a detector at different (random) times, the detector can still produce a difference frequency. Hence light seems to have wave-like properties not only as it propagates through space, but also when it interacts with matter.[8] Progress with photon counting was such that by 2008 it was proposed that, even with larger signal strengths available, it could be advantageous to employ local oscillator power low enough to allow detection of the beat signal by photon counting. This was understood to have a main advantage of imaging with available and rapidly developing large-format multi-pixel counting photodetectors.[9]
Photon counting was applied withfrequency-modulatedcontinuous wave (FMCW) lasers.Numerical algorithms were developed to optimize the statistical performance of the analysis of the data from photon counting.[10][11][12]
The strength of the difference frequency signal can be larger than the strength of the original signal. The strength of the difference frequency is proportional to the product of theamplitudes of the LO and original signal electric fields. Thus the larger the LO amplitude, the stronger the difference frequency strength. Hence there is gain in the photon conversion process. ThePoynting vector of the sum of the LO and signal is proportional to the square of the sum:
A photodetector, like aphotodiode, is much slower than optical frequency (4.75×1014 Hz at 632 nm in wavelength as a red color), so the detector integrates the Poynting vector over a time window much longer than an optical period (2.1×10-15 second for 632 nm in wavelength). In this sense, we can only consider the average of the Poynting vector over the integration window. By usingthe product-to-sum trigonometric identity and assuming that the detector is much faster than the difference frequency, the detector output is proportional to the following. (The sum of integrations of terms with at frequencies over a time window much longer than the periods of the frequencies statistically gives zero or a noise in values.)
where. The first two terms are proportional to the average (DC) energy flux absorbed (or, equivalently, the average current in the case of photon counting). The third term is the difference frequency. In many applications the signal is weaker than the LO, thus it can be seen that gain occurs because the energy flux in the difference frequency is greater than the DC energy flux of the signal.
Suppose that a signal light in form of, where is the wavenumber in vacuum, is toward a moving target mirror which position is. The mirror receives the light at time as
where is the refractive index of the main wave propagation medium, is a generally inhomogeneous refractive index of a transmissive optical elementi with its length, and is a phase shift by a reflective optical elementj (Mirror reflection phase shift).
The target mirror immediately (ignoring thetime dilation in thespecial relativity) reflects the light, so the reflected light is
where in is dropped for simplicity, and with now that the second summation (with the indexj) includes a phase shift from the target mirror reflection.
The angular frequency of the reflected light is given by the time derivative of its phase;
where is the mirror velocity at. This is theDoppler-shifted frequency by the mirror in moving w.r.t the light source.
The reflected light is now toward a detector fixed at the location. The detector receives this light at time where is an additional delay by the light travelling through each transmissive optical elementp till the detector. So, the detected light is
where. The angular frequency of the detected signal light is given by the time derivative of its phase;
because ( by ignoring the time dilation).[Note 1] Here, constant refractive indices and reflection phase changes over a frequency range made by Doppler shift is assumed.[Note 2] Note that,, and areretarded quantities (retarded position, velocity, and time) for the light reaching the detector.
Then, with a LO light (that travels through fixed optical elements) at the detector at the time,, the heterodyne detection signal is proportional to
where where is a constant. The time derivate of the phase of the heterodyne signal is, which turns to measure the mirror velocity (at the retarded time) if the signal and LO frequencies are known, and the integration of the velocity gives the mirror displacement w.r.t a measurement starting time. Note again that the measured mirror velocity (and the mirror displacement) are retarded quantities, at.
So far, one round travel (positive integern = 1) to the moving target mirror by the signal light is considered, giving the Doppler-shifted angular frequency at the detector. For two round travels (n = 2), the Doppler-shifted light will again go to the target mirror, reflected again toward the detector, so undergo another doppler-shift. Forn round travels (e.g.,n = 2 for 2-paths interferometer) and that the mirror velocity is about the same during these travels,[Note 3] the detector receives the signal with the frequency of.
Depending on a purpose of the measurement, an additional method to measure the absolute position of the mirror (that may be with less accuracy) at the heterodyne detection measurement start time may be required to acquire absolute mirror position measurement in a given coordinates system (measured position = initial absolute position + heterodyne-measured relative position change).
The electronically measured signal beam's energy flux,, is DC and thus erases the phase associated with its optical frequency;Heterodyne detection allows this phase to be detected. If the optical phase of the signal beam shifts by an angle phi, then the phase of the electronic difference frequency shifts by exactly the same angle phi. More properly, to discuss an optical phase shift one needs to have a common time base reference. Typically the signal beam is derived from the same laser as the LO but shifted by some modulator in frequency. In other cases, the frequency shift may arise from reflection from a moving object. As long as the modulation source maintains a constant offset phase between the LO and signal source, any added optical phase shifts over time arising from external modification of the return signal are added to the phase of the difference frequency and thus are measurable.
As noted above, the difference frequency linewidth can be much smaller than the optical linewidth of the signal and LO signal, provided the two are mutually coherent. Thus small shifts in optical signal center-frequency can be measured: For example, Dopplerlidar systems can discriminate wind velocities with a resolution better than 1 meter per second, which is less than a part in a billion Doppler shift in the optical frequency. Likewise small coherent phase shifts can be measured even for nominally incoherent broadband light, allowingoptical coherence tomography to image micrometer-sized features. Because of this, an electronic filter can define an effective optical frequency bandpass that is narrower than any realizable wavelength filter operating on the light itself, and thereby enable background light rejection and hence the detection of weak signals.
As with any small signal amplification, it is most desirable to get gain as close as possible to the initial point of the signal interception: moving the gain ahead of any signal processing reduces the additive contributions of effects like resistorJohnson–Nyquist noise, or electrical noises in active circuits. In optical heterodyne detection, the mixing-gain happens directly in the physics of the initial photon absorption event, making this ideal. Additionally, to a first approximation, absorption is perfectly quadratic, in contrast to RF detection by a diode non-linearity.
One of the virtues of heterodyne detection is that the difference frequency is generally far awayspectrally from the potential noises radiated during the process of generating either the signal or the LO signal, thus the spectral region near the difference frequency may be relatively quiet. Hence, narrow electronic filtering near the difference frequency is highly effective at removing the remaining, generally broadband, noise sources.
The primary remaining source of noise is photon shot noise from the nominally constant DC level, which is typically dominated by the Local Oscillator (LO). Since theshot noise scales as theamplitude of the LO electric field level, and the heterodyne gain also scales the same way, the ratio of the shot noise to the mixed signal is constant no matter how large the LO.
Thus in practice one increases the LO level, until the gain on the signal raises it above all other additive noise sources, leaving only the shot noise. In this limit, the signal to noise ratio is affected by the shot noise of thesignal only (i.e. there is no noise contribution from the powerful LO because it divided out of the ratio). At that point there is no change in the signal to noise as the gain is raised further. (Of course, this is a highly idealized description; practical limits on the LO intensity matter in real detectors and an impure LO might carry some noise at the difference frequency)
Array detection of light, i.e. detecting light in a large number of independent detector pixels, is common in digital cameraimage sensors. However, it tends to be quite difficult in heterodyne detection, since the signal of interest is oscillating (also calledAC by analogy to circuits), often at millions of cycles per second or more. At the typical frame rates for image sensors, which are much slower, each pixel would integrate the total light received over many oscillation cycles, and this time-integration would destroy the signal of interest. Thus a heterodyne array must usually have parallel direct connections from every sensor pixel to separate electrical amplifiers, filters, and processing systems. This makes large, general purpose, heterodyne imaging systems prohibitively expensive. For example, simply attaching 1 million leads to a megapixel coherent array is a daunting challenge.
To solve this problem, synthetic array heterodyne detection (SAHD) was developed.[2] In SAHD, large imaging arrays can bemultiplexed into virtual pixels on a single element detector with single readout lead, single electrical filter, and single recording system.[13] The time domain conjugate of this approach isFourier transform heterodyne detection,[14] which also has the multiplex advantage and also allows a single element detector to act like an imaging array. SAHD has been implemented asRainbow heterodyne detection[15][16] in which instead of a single frequency LO, many narrowly spaced frequencies are spread out across the detector element surface like a rainbow. The physical position where each photon arrived is encoded in the resulting difference frequency itself, making a virtual 1D array on a single element detector. If the frequency comb is evenly spaced then, conveniently, theFourier transform of the output waveform is the image itself. Arrays in 2D can be created as well, and since the arrays are virtual, the number of pixels, their size, and their individual gains can be adapted dynamically. The multiplex disadvantage is that the shot noise from all the pixels combine since they are not physically separated.
As discussed, the LO and signal must be temporallycoherent. They also need to be spatially coherent across the face of the detector or they will destructively interfere. In many usage scenarios the signal is reflected from optically rough surfaces or passes through optically turbulent media leading towavefronts that are spatially incoherent. In laser scattering this is known asspeckle.[17]
In RF detection the antenna is rarely larger than the wavelength so all excited electrons move coherently within the antenna, whereas in optics the detector is usually much larger than the wavelength and thus can intercept a distorted phase front, resulting in destructive interference by out-of-phase photo-generated electrons within the detector.
While destructive interference dramatically reduces the signal level, the summed amplitude of a spatially incoherent mixture does not approach zero but rather the mean amplitude of a single speckle.[17] However, since the standard deviation of the coherent sum of the speckles is exactly equal to the mean speckle intensity, optical heterodyne detection of scrambled phase fronts can never measure the absolute light level with an error bar less than the size of the signal itself.This upper bound signal-to-noise ratio of unity is only for absolute magnitude measurement: it can havesignal-to-noise ratio better than unity for phase, frequency or time-varying relative-amplitude measurements in a stationary speckle field.
In RF detection, "diversity reception" is often used to mitigate low signals when the primary antenna is inadvertently located at an interference null point: by having more than one antenna one can adaptively switch to whichever antenna has the strongest signal or even incoherently add all of the antenna signals. Simply adding the antennae coherently can produce destructive interference just as happens in the optical realm.
The analogous diversity reception for optical heterodyne has been demonstrated with arrays of photon-counting detectors.[9] For incoherent addition of the multiple element detectors in a random speckle field, the ratio of the mean to the standard deviation will scale as the square root of the number of independently measured speckles. This improved signal-to-noise ratio makes absolute amplitude measurements feasible in heterodyne detection.
However, as noted above, scaling physical arrays to large element counts is challenging for heterodyne detection due to the oscillating or even multi-frequency nature of the output signal. Instead, a single-element optical detector can also act like diversity receiver via synthetic array heterodyne detection or Fourier transform heterodyne detection. With a virtual array one can then either adaptively select just one of the LO frequencies, track a slowly moving bright speckle, or add them all in post-processing by the electronics.
One can incoherently add the magnitudes of a time series ofN independent pulses to obtain a√N improvement in the signal to noise on the amplitude, but at the expense of losing the phase information. Instead coherent addition (adding the complex magnitude and phase) of multiple pulse waveforms would improve the signal to noise by a factor ofN, not its square root, and preserve the phase information. The practical limitation is adjacent pulses from typical lasers have a minute frequency drift that translates to a large random phase shift in any long distance return signal, and thus just like the case for spatially scrambled-phase pixels, destructively interfere when added coherently. However, coherent addition of multiple pulses is possible with advanced laser systems that narrow the frequency drift far below the difference frequency (intermediate frequency). This technique has been demonstrated in multi-pulse coherent DopplerLIDAR.[18]