A scanning tunneling microscope (STM) is a type of scanning probe microscope used for imaging surfaces at the atomic level. Its development in 1981 earned its inventors, Gerd Binnig and Heinrich Rohrer, then at IBM Zürich, the Nobel Prize in Physics in 1986.[1][2][3] STM senses the surface by using an extremely sharp conducting tip that can distinguish features smaller than 0.1 nm with a 0.01 nm (10 pm) depth resolution.[4] This means that individual atoms can routinely be imaged and manipulated. Most scanning tunneling microscopes are built for use in ultra-high vacuum at temperatures approaching absolute zero, but variants exist for studies in air, water and other environments, and for temperatures over 1000 °C.[5][6]
Scanning tunneling microscope operating principle
STM is based on the concept of quantum tunneling. When the tip is brought very near to the surface to be examined, a bias voltage applied between the two allows electrons to tunnel through the vacuum separating them. The resulting tunneling current is a function of the tip position, applied voltage, and the local density of states (LDOS) of the sample. Information is acquired by monitoring the current as the tip scans across the surface, and is usually displayed in image form.[5]
A refinement of the technique known as scanning tunneling spectroscopy consists of keeping the tip in a constant position above the surface, varying the bias voltage and recording the resultant change in current. Using this technique, the local density of the electronic states can be reconstructed.[7] This is sometimes performed in high magnetic fields and in the presence of impurities to infer the properties and interactions of electrons in the studied material.
Scanning tunneling microscopy can be a challenging technique, as it requires extremely clean and stable surfaces, sharp tips, excellent vibration isolation, and sophisticated electronics. Nonetheless, many hobbyists build their own microscopes.[8]
The tip is brought close to the sample by a coarse positioning mechanism that is usually monitored visually. At close range, fine control of the tip position with respect to the sample surface is achieved by piezoelectric scanner tubes whose length can be altered by a control voltage. A bias voltage is applied between the sample and the tip, and the scanner is gradually elongated until the tip starts receiving the tunneling current. The tip–sample separation w is then kept somewhere in the 4–7 Å (0.4–0.7 nm) range, slightly above the height where the tip would experience repulsive interaction (w < 3 Å), but still in the region where attractive interaction exists (3 < w < 10 Å).[5] The tunneling current, being in the sub-nanoampere range, is amplified as close to the scanner as possible. Once tunneling is established, the sample bias and tip position with respect to the sample are varied according to the requirements of the experiment.
As the tip is moved across the surface in a discrete x–y matrix, the changes in surface height and population of the electronic states cause changes in the tunneling current. Digital images of the surface are formed in one of two ways: in the constant-height mode, changes of the tunneling current are mapped directly, while in the constant-current mode, the voltage that controls the height (z) of the tip is recorded while the tunneling current is kept at a predetermined level.[5]
In constant-current mode, feedback electronics adjust the height by applying a voltage to the piezoelectric height-control mechanism. If at some point the tunneling current is below the set level, the tip is moved towards the sample, and vice versa. This mode is relatively slow, as the electronics need to check the tunneling current and adjust the height in a feedback loop at each measured point of the surface. When the surface is atomically flat, the voltage applied to the z-scanner mainly reflects variations in local charge density. But when an atomic step is encountered, or when the surface is buckled due to reconstruction, the height of the scanner also has to change because of the overall topography. The image formed of the z-scanner voltages that were needed to keep the tunneling current constant as the tip scanned the surface thus contains both topographical and electron density data. In some cases it may not be clear whether height changes came as a result of one or the other.
In constant-height mode, the z-scanner voltage is kept constant as the scanner swings back and forth across the surface, and the tunneling current, exponentially dependent on the distance, is mapped. This mode of operation is faster, but on rough surfaces, where there may be large adsorbed molecules present, or ridges and grooves, the tip will be in danger of crashing.
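The two imaging modes can be contrasted with a minimal one-dimensional sketch. It assumes a toy current model I = I0·exp(−2κ·gap); the decay constant, the zero-gap current `I0_NA`, the feedback gain and all function names are illustrative choices made here, not values or interfaces from any real controller. Feeding back on the logarithm of the current is also a choice made for this sketch, because the logarithm is linear in the gap.

```python
import math

KAPPA_PER_NM = 11.0   # assumed decay constant, typical for a ~4.5 eV barrier
I0_NA = 1000.0        # hypothetical current (nA) extrapolated to zero gap


def tunneling_current(gap_nm):
    """Toy model: current in nA falls off exponentially with the gap."""
    return I0_NA * math.exp(-2.0 * KAPPA_PER_NM * gap_nm)


def constant_height_scan(surface_nm, tip_height_nm):
    """Record the current along one scan line at a fixed tip height."""
    currents = []
    for h in surface_nm:
        gap = tip_height_nm - h
        if gap <= 0.0:
            # The failure mode that makes this mode risky on rough surfaces.
            raise ValueError("tip crash: surface reaches the tip height")
        currents.append(tunneling_current(gap))
    return currents


def constant_current_scan(surface_nm, setpoint_na=1.0, gain=0.02,
                          iterations=200, z_start_nm=1.0):
    """Record the settled tip height along one scan line.

    At every point an integral feedback loop moves the tip until the
    current matches the setpoint; the settled height is the image value.
    """
    z = z_start_nm
    heights = []
    for h in surface_nm:
        for _ in range(iterations):
            # log(I / I_set) is linear in the gap, which keeps the loop stable.
            error = math.log(tunneling_current(z - h) / setpoint_na)
            # Current above setpoint (error > 0): retract. Below: approach.
            z += gain * error
        heights.append(z)
    return heights
```

In this model the constant-current trace reproduces a 0.2 nm step exactly, offset by the equilibrium gap, while the constant-height trace turns the same step into a large change in current, or into a crash if the step is taller than the gap.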
The raster scan of the tip is anything from a 128×128 to a 1024×1024 (or more) matrix, and for each point of the raster a single value is obtained. The images produced by STM are therefore grayscale, and color is only added in post-processing in order to visually emphasize important features.
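Because each raster point carries a single number, turning a scan into an image amounts to mapping that number to a gray level. A minimal sketch of a linear 8-bit mapping follows; the function name and the mid-gray fallback for a perfectly flat scan are choices made here for illustration.

```python
def to_grayscale(raster):
    """Linearly map a 2-D list of measured values to 8-bit gray levels (0-255)."""
    flat = [v for row in raster for v in row]
    lo, hi = min(flat), max(flat)
    span = hi - lo
    if span == 0:
        # Flat image: every pixel gets mid-gray rather than dividing by zero.
        return [[128 for _ in row] for row in raster]
    return [[round(255 * (v - lo) / span) for v in row] for row in raster]
```

A false-color palette applied afterwards would only relabel these gray levels; it adds no measured information.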
In addition to scanning across the sample, information on the electronic structure at a given location in the sample can be obtained by sweeping the bias voltage (along with a small AC modulation to directly measure the derivative) and measuring current change at a specific location.[4] This type of measurement is called scanning tunneling spectroscopy (STS) and typically results in a plot of the local density of states as a function of the electrons' energy within the sample. The advantage of STM over other measurements of the density of states lies in its ability to make extremely local measurements. This is how, for example, the density of states at an impurity site can be compared to the density of states around the impurity and elsewhere on the surface.[9]
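When only a sampled I–V curve is available, the derivative can be estimated numerically instead of being measured with an AC modulation. The sketch below uses central finite differences, which is a substitute for the lock-in measurement described above, not a model of it; the function name is hypothetical.

```python
def differential_conductance(voltages, currents):
    """Estimate dI/dV at the interior points of a sampled I-V curve
    using central finite differences."""
    if len(voltages) != len(currents) or len(voltages) < 3:
        raise ValueError("need at least three matched (V, I) samples")
    didv = []
    for k in range(1, len(voltages) - 1):
        dv = voltages[k + 1] - voltages[k - 1]
        didv.append((currents[k + 1] - currents[k - 1]) / dv)
    return didv
```

Numerical differentiation amplifies noise in the measured current, which is one practical reason the modulation technique is usually preferred.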
The main components of a scanning tunneling microscope are the scanning tip, piezoelectrically controlled height (z axis) and lateral (x and y axes) scanner, and coarse sample-to-tip approach mechanism. The microscope is controlled by dedicated electronics and a computer. The system is supported on a vibration isolation system.[5]
The tip is often made of tungsten or platinum–iridium wire, though gold is also used.[4] Tungsten tips are usually made by electrochemical etching, and platinum–iridium tips by mechanical shearing. The resolution of an image is limited by the radius of curvature of the scanning tip. Sometimes, image artefacts occur if the tip has more than one apex at the end; most frequently double-tip imaging is observed, a situation in which two apices contribute equally to the tunneling.[4] While several processes for obtaining sharp, usable tips are known, the ultimate test of quality of the tip is only possible when it is tunneling in the vacuum. Every so often the tips can be conditioned by applying high voltages when they are already in the tunneling range, or by making them pick up an atom or a molecule from the surface.
In most modern designs the scanner is a hollow tube of a radially polarized piezoelectric with metallized surfaces. The outer surface is divided into four long quadrants to serve as x and y motion electrodes, with deflection voltages of two polarities applied on the opposing sides. The tube material is a lead zirconate titanate ceramic with a piezoelectric constant of about 5 nanometres per volt. The tip is mounted at the center of the tube. Because of some crosstalk between the electrodes and inherent nonlinearities, the motion is calibrated, and the voltages needed for independent x, y and z motion are applied according to calibration tables.[5]
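To get a feel for the voltage scale implied by a constant of about 5 nm/V, a rough estimate can assume an ideal linear response. This is only a sketch: as noted above, real scanners show crosstalk and nonlinearity and are driven from calibration tables, and the function name is hypothetical.

```python
PIEZO_NM_PER_VOLT = 5.0  # figure quoted in the text; real tubes vary


def voltage_for_displacement(displacement_nm, nm_per_volt=PIEZO_NM_PER_VOLT):
    """Control voltage for a given displacement, assuming an ideal linear piezo."""
    return displacement_nm / nm_per_volt
```

Under this assumption a 0.1 nm (1 Å) height change corresponds to about 20 mV, so millivolt-level noise on the control voltage already translates into picometre-scale tip motion.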
Due to the extreme sensitivity of the tunneling current to the separation of the electrodes, proper vibration isolation or a rigid STM body is imperative for obtaining usable results. In the first STM by Binnig and Rohrer, magnetic levitation was used to keep the STM free from vibrations; now mechanical spring or gas spring systems are often employed.[5] Additionally, mechanisms for vibration damping using eddy currents are sometimes implemented. Microscopes designed for long scans in scanning tunneling spectroscopy need extreme stability and are built in anechoic chambers: dedicated concrete rooms with acoustic and electromagnetic isolation that are themselves floated on vibration isolation devices inside the laboratory.
Maintaining the tip position with respect to the sample, scanning the sample and acquiring the data are computer-controlled. Dedicated software for scanning probe microscopies is used for image processing as well as for performing quantitative measurements.[10]
Some scanning tunneling microscopes are capable of recording images at high frame rates.[11][12] Videos made of such images can show surface diffusion[13] or track adsorption and reactions on the surface. In video-rate microscopes, frame rates of 80 Hz have been achieved with fully working feedback that adjusts the height of the tip.[14]
Quantum tunneling of electrons is the operating principle of STM and arises from quantum mechanics. Classically, a particle hitting an impenetrable barrier will not pass through. If the barrier is described by a potential acting along the $z$ direction, in which an electron of mass $m_e$ acquires the potential energy $U(z)$, the electron's trajectory will be deterministic and such that the sum $E$ of its kinetic and potential energies is at all times conserved:
$$E = \frac{p^2}{2m_e} + U(z).$$
The electron will have a defined, non-zero momentum $p$ only in regions where the initial energy $E$ is greater than $U(z)$. In quantum physics, however, the electron can pass through classically forbidden regions. This is referred to as tunneling.[5]
The real and imaginary parts of the wave function in a rectangular potential barrier model of the scanning tunneling microscope
The simplest model of tunneling between the sample and the tip of a scanning tunneling microscope is that of a rectangular potential barrier.[15][5] An electron of energy $E$ is incident upon an energy barrier of height $U$, in the region of space of width $w$. An electron's behavior in the presence of a potential $U(z)$, assuming the one-dimensional case, is described by wave functions $\psi(z)$ that satisfy Schrödinger's equation
$$-\frac{\hbar^2}{2m_e}\frac{\partial^2\psi(z)}{\partial z^2} + U(z)\,\psi(z) = E\,\psi(z),$$
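where $\hbar$ is the reduced Planck constant, $z$ is the position, and $m_e$ is the electron mass. In the zero-potential regions on the two sides of the barrier, the wave function takes on the forms
$$\psi_L(z) = e^{ikz} + r\,e^{-ikz} \quad \text{for } z < 0,$$
$$\psi_R(z) = t\,e^{ikz} \quad \text{for } z > w,$$
where $k = \tfrac{1}{\hbar}\sqrt{2m_eE}$. Inside the barrier, where $E < U$, the wave function is a superposition of two terms, each decaying from one side of the barrier:
$$\psi_B(z) = \xi\,e^{-\kappa z} + \zeta\,e^{\kappa z} \quad \text{for } 0 < z < w,$$
where $\kappa = \tfrac{1}{\hbar}\sqrt{2m_e(U - E)}$.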
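The coefficients $r$ and $t$ measure how much of the incident electron's wave is reflected or transmitted through the barrier. Namely, of the whole impinging particle current only $|t|^2$ is transmitted, as can be seen from the probability current expression
$$j_t = \frac{\hbar}{2im_e}\left(\psi_R^{*}\,\frac{\partial\psi_R}{\partial z} - \psi_R\,\frac{\partial\psi_R^{*}}{\partial z}\right),$$
which evaluates to $j_t = \tfrac{\hbar k}{m_e}|t|^2$. The transmission coefficient is obtained from the continuity condition on the three parts of the wave function and their derivatives at $z = 0$ and $z = w$ (detailed derivation is in the article Rectangular potential barrier). This gives $|t|^2 = \left[1 + \tfrac{1}{4}\,\varepsilon^{-1}(1 - \varepsilon)^{-1}\sinh^2\kappa w\right]^{-1}$, where $\varepsilon = E/U$. The expression can be further simplified, as follows.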
In STM experiments, the typical barrier height is of the order of the material's surface work function $W$, which for most metals has a value between 4 and 6 eV.[15] The work function is the minimum energy needed to bring an electron from an occupied level, the highest of which is the Fermi level (for metals at $T = 0$ K), to the vacuum level. The electrons can tunnel between two metals only from occupied states on one side into the unoccupied states of the other side of the barrier. Without bias, the Fermi levels of the two electrodes are aligned, and there is no tunneling. Bias shifts electron energies in one of the electrodes higher, and those electrons that have no match at the same energy on the other side will tunnel. In experiments, bias voltages of a fraction of 1 V are used, so $\kappa$ is of the order of 10 to 12 nm$^{-1}$, while $w$ is a few tenths of a nanometre. The barrier is strongly attenuating. The expression for the transmission probability reduces to $|t|^2 = 16\,\varepsilon(1 - \varepsilon)\,e^{-2\kappa w}$, with $\varepsilon = E/U$. The tunneling current from a single level is therefore[15]
$$j_t = \left[\frac{4k\kappa}{k^2 + \kappa^2}\right]^2 \frac{\hbar k}{m_e}\,e^{-2\kappa w},$$
where both wave vectors depend on the level's energy $E$: $k = \tfrac{1}{\hbar}\sqrt{2m_eE}$ and $\kappa = \tfrac{1}{\hbar}\sqrt{2m_e(U - E)}$.
Tunneling current is exponentially dependent on the separation of the sample and the tip, typically reducing by an order of magnitude when the separation is increased by 1 Å (0.1 nm).[5] Because of this, even when tunneling occurs from a non-ideally sharp tip, the dominant contribution to the current is from its most protruding atom or orbital.[15]
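The order-of-magnitude-per-ångström rule can be checked numerically with the rectangular-barrier formulas. The sketch below compares the exact transmission $[1 + \sinh^2(\kappa w)/(4\varepsilon(1-\varepsilon))]^{-1}$ with its thick-barrier limit $16\,\varepsilon(1-\varepsilon)e^{-2\kappa w}$. The function names and the example values (E = 5 eV, U = 9.5 eV, so that the barrier stands 4.5 eV above the electron) are illustrative assumptions, not figures from the cited sources.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J*s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # one electronvolt in joules


def wave_vectors(energy_ev, barrier_ev):
    """Return (k, kappa) in 1/nm for an electron of energy E below a barrier U."""
    k = math.sqrt(2.0 * M_E * energy_ev * EV) / HBAR * 1e-9
    kappa = math.sqrt(2.0 * M_E * (barrier_ev - energy_ev) * EV) / HBAR * 1e-9
    return k, kappa


def transmission_exact(energy_ev, barrier_ev, width_nm):
    """Exact |t|^2 for a rectangular barrier with E < U."""
    eps = energy_ev / barrier_ev
    _, kappa = wave_vectors(energy_ev, barrier_ev)
    return 1.0 / (1.0 + math.sinh(kappa * width_nm) ** 2
                  / (4.0 * eps * (1.0 - eps)))


def transmission_thick_barrier(energy_ev, barrier_ev, width_nm):
    """Strongly attenuating limit: 16 eps (1 - eps) exp(-2 kappa w)."""
    eps = energy_ev / barrier_ev
    _, kappa = wave_vectors(energy_ev, barrier_ev)
    return 16.0 * eps * (1.0 - eps) * math.exp(-2.0 * kappa * width_nm)


def current_ratio(kappa_per_nm, delta_w_nm):
    """Factor by which the current changes when the gap grows by delta_w."""
    return math.exp(-2.0 * kappa_per_nm * delta_w_nm)
```

With these example values κ comes out near 10.9 nm⁻¹, `current_ratio(kappa, 0.1)` is roughly 0.11, and at w = 0.5 nm the two transmission formulas agree to much better than one part in a thousand.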
Negative sample bias $V$ raises its electronic levels by $e \cdot V$. Only electrons that populate states between the Fermi levels of the sample and the tip are allowed to tunnel.
As a result of the restriction that the tunneling from an occupied energy level on one side of the barrier requires an empty level of the same energy on the other side of the barrier, tunneling occurs mainly with electrons near the Fermi level. The tunneling current can be related to the density of available or filled states in the sample. The current due to an applied voltage $V$ (assume tunneling occurs from the sample to the tip) depends on two factors: 1) the number of electrons between the Fermi level $E_F$ and $E_F - eV$ in the sample, and 2) the number among them which have corresponding free states to tunnel into on the other side of the barrier at the tip.[5] The higher the density of available states in the tunneling region, the greater the tunneling current. By convention, a positive $V$ means that electrons in the tip tunnel into empty states in the sample; for a negative bias, electrons tunnel out of occupied states in the sample into the tip.[5]
For small biases and temperatures near absolute zero, the number of electrons in a given volume (the electron concentration) that are available for tunneling is the product of the density of the electronic states $\rho(E_F)$ and the energy interval between the two Fermi levels, $eV$.[5] Half of these electrons will be travelling away from the barrier. The other half will represent the electric current impinging on the barrier, which is given by the product of the electron concentration, charge, and velocity $v$ ($I_i = nev$),[5]
$$I_i = \tfrac{1}{2}\,e^2 v\,\rho(E_F)\,V.$$
The tunneling electric current will be a small fraction of the impinging current. The proportion is determined by the transmission probability $T$,[5] so
$$I_t = \tfrac{1}{2}\,e^2 v\,\rho(E_F)\,V\,T.$$
In the simplest model of a rectangular potential barrier the transmission probability coefficient $T$ equals $|t|^2$.
Tip, barrier and sample wave functions in a model of the scanning tunneling microscope. Barrier width is $w$. Tip bias is $V$. Surface work functions are $\phi$.
A model that is based on more realistic wave functions for the two electrodes was devised by John Bardeen in a study of the metal–insulator–metal junction.[16] His model takes two separate orthonormal sets of wave functions for the two electrodes and examines their time evolution as the systems are put close together.[5][15] Bardeen's novel method, ingenious in itself,[5] solves a time-dependent perturbative problem in which the perturbation emerges from the interaction of the two subsystems rather than from an external potential, as in the standard Rayleigh–Schrödinger perturbation theory.
Each of the wave functions for the electrons of the sample (S) and the tip (T) decays into the vacuum after hitting the surface potential barrier, which is roughly of the size of the surface work function. The wave functions are the solutions of two separate Schrödinger equations for electrons in potentials $U_S$ and $U_T$. When the time dependence of the states of known energies $E_\mu^S$ and $E_\nu^T$ is factored out, the wave functions have the following general form
$$\psi_\mu^S(t) = \psi_\mu^S \exp\!\left(-\tfrac{i}{\hbar}E_\mu^S t\right),$$
$$\psi_\nu^T(t) = \psi_\nu^T \exp\!\left(-\tfrac{i}{\hbar}E_\nu^T t\right).$$
If the two systems are put closer together, but are still separated by a thin vacuum region, the potential acting on an electron in the combined system is $U_T + U_S$. Here, each of the potentials is spatially limited to its own side of the barrier. Only because the tail of a wave function of one electrode is in the range of the potential of the other is there a finite probability for any state to evolve over time into the states of the other electrode.[5] The future of the sample's state $\mu$ can be written as a linear combination with time-dependent coefficients of $\psi_\mu^S(t)$ and all $\psi_\nu^T(t)$:
$$\psi(t) = \psi_\mu^S(t) + \sum_\nu c_\nu(t)\,\psi_\nu^T(t),$$
with the initial condition $c_\nu(0) = 0$.[5] When the new wave function is inserted into the Schrödinger equation for the potential $U_T + U_S$, the obtained equation is projected onto each separate $\psi_\nu^T$ (that is, the equation is multiplied by a ${\psi_\nu^T}^{*}$ and integrated over the whole volume) to single out the coefficients $c_\nu$. All $\psi_\mu^S$ are taken to be nearly orthogonal to all $\psi_\nu^T$ (their overlap is a small fraction of the total wave functions), and only first-order quantities are retained. Consequently, the time evolution of the coefficients is given by
$$\frac{d}{dt}c_\nu(t) = -\frac{i}{\hbar}\int \psi_\mu^S\,U_T\,{\psi_\nu^T}^{*}\,dx\,dy\,dz\;\exp\!\left[-\tfrac{i}{\hbar}\left(E_\mu^S - E_\nu^T\right)t\right].$$
Because the potential $U_T$ is zero at the distance of a few atomic diameters away from the surface of the electrode, the integration over $z$ can be done from a point $z_0$ somewhere inside the barrier and into the volume of the tip ($z > z_0$).
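If the tunneling matrix element is defined as
$$M_{\mu\nu} = \int_{z > z_0} \psi_\mu^S\,U_T\,{\psi_\nu^T}^{*}\,dx\,dy\,dz,$$
the probability of the sample's state $\mu$ evolving in time $t$ into the state of the tip $\nu$ is
$$|c_\nu(t)|^2 = |M_{\mu\nu}|^2\,\frac{4\sin^2\!\left[\tfrac{1}{2\hbar}\left(E_\mu^S - E_\nu^T\right)t\right]}{\left(E_\mu^S - E_\nu^T\right)^2}.$$
In a system with many electrons impinging on the barrier, this probability will give the proportion of those that successfully tunnel. If at a time $t$ this fraction was $|c_\nu(t)|^2$, at a later time $t + dt$ the total fraction of $|c_\nu(t + dt)|^2$ would have tunneled. The current of tunneling electrons at each instant is therefore proportional to $|c_\nu(t + dt)|^2 - |c_\nu(t)|^2$ divided by $dt$, which is the time derivative of $|c_\nu(t)|^2$,[15]
$$\Gamma_{\mu\to\nu} \equiv \frac{d}{dt}|c_\nu(t)|^2 = \frac{2\pi}{\hbar}\,|M_{\mu\nu}|^2\,\frac{\sin\!\left[\left(E_\mu^S - E_\nu^T\right)\tfrac{t}{\hbar}\right]}{\pi\left(E_\mu^S - E_\nu^T\right)}.$$
The time scale of the measurement in STM is many orders of magnitude larger than the typical femtosecond time scale of electron processes in materials, and $t/\hbar$ is large. The fraction part of the formula is a fast-oscillating function of $\left(E_\mu^S - E_\nu^T\right)$ that rapidly decays away from the central peak, where $E_\mu^S = E_\nu^T$. In other words, the most probable tunneling process, by far, is the elastic one, in which the electron's energy is conserved. The fraction, as written above, is a representation of the delta function, so
$$\Gamma_{\mu\to\nu} = \frac{2\pi}{\hbar}\,|M_{\mu\nu}|^2\,\delta\!\left(E_\mu^S - E_\nu^T\right).$$
Solid-state systems are commonly described in terms of continuous rather than discrete energy levels. The term $\delta\!\left(E_\mu^S - E_\nu^T\right)$ can be thought of as the density of states of the tip at energy $E_\mu^S$, giving
$$\Gamma_{\mu\to\nu} = \frac{2\pi}{\hbar}\,|M_{\mu\nu}|^2\,\rho_T\!\left(E_\mu^S\right).$$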
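The number of energy levels in the sample between the energies $\varepsilon$ and $\varepsilon + d\varepsilon$ is $\rho_S(\varepsilon)\,d\varepsilon$. When occupied, these levels are spin-degenerate (except in a few special classes of materials) and contain charge $2e \cdot \rho_S(\varepsilon)\,d\varepsilon$ of either spin. With the sample biased to voltage $V$, tunneling can occur only between states whose occupancies, given for each electrode by the Fermi–Dirac distribution $f$, are not the same, that is, when either one or the other is occupied, but not both. That will be for all energies $\varepsilon$ for which $f(E_F - eV + \varepsilon) - f(E_F + \varepsilon)$ is not zero. For example, an electron will tunnel from energy level $E_F - eV$ in the sample into energy level $E_F$ in the tip ($\varepsilon = 0$), an electron at $E_F$ in the sample will find unoccupied states in the tip at $E_F + eV$ ($\varepsilon = eV$), and so it will be for all energies in between. The tunneling current is therefore the sum of little contributions over all these energies of the product of three factors: $2e \cdot \rho_S(E_F - eV + \varepsilon)\,d\varepsilon$ representing available electrons, $f(E_F - eV + \varepsilon) - f(E_F + \varepsilon)$ for those that are allowed to tunnel, and the probability factor $\Gamma$ for those that will actually tunnel:
$$I_t = \frac{4\pi e}{\hbar}\int_{-\infty}^{+\infty}\left[f(E_F - eV + \varepsilon) - f(E_F + \varepsilon)\right]\rho_S(E_F - eV + \varepsilon)\,\rho_T(E_F + \varepsilon)\,|M|^2\,d\varepsilon.$$
Typical experiments are run at a liquid-helium temperature (around 4 K), at which the Fermi-level cut-off of the electron population is less than a millielectronvolt wide. The allowed energies are only those between the two step-like Fermi levels, and the integral becomes
$$I_t = \frac{4\pi e}{\hbar}\int_0^{eV}\rho_S(E_F - eV + \varepsilon)\,\rho_T(E_F + \varepsilon)\,|M|^2\,d\varepsilon.$$
When the bias is small, it is reasonable to assume that the electron wave functions and, consequently, the tunneling matrix element do not change significantly in the narrow range of energies. Then the tunneling current is simply the convolution of the densities of states of the sample surface and the tip:
$$I_t \propto \int_0^{eV}\rho_S(E_F - eV + \varepsilon)\,\rho_T(E_F + \varepsilon)\,d\varepsilon.$$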
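How the tunneling current depends on distance between the two electrodes is contained in the tunneling matrix element
$$M_{\mu\nu} = \int_{z > z_0} \psi_\mu^S\,U_T\,{\psi_\nu^T}^{*}\,dx\,dy\,dz.$$
This formula can be transformed so that no explicit dependence on the potential remains. First, the $U_T\,{\psi_\nu^T}^{*}$ part is taken out from the Schrödinger equation for the tip, and the elastic tunneling condition $E_\nu^T = E_\mu^S \equiv E_\mu$ is used so that
$$M_{\mu\nu} = \int_{z > z_0}\left({\psi_\nu^T}^{*}\,E_\mu\,\psi_\mu^S + \psi_\mu^S\,\frac{\hbar^2}{2m_e}\frac{\partial^2}{\partial z^2}{\psi_\nu^T}^{*}\right)dx\,dy\,dz.$$
Now $E_\mu\,\psi_\mu^S$ is present in the Schrödinger equation for the sample and equals the kinetic plus the potential operator acting on $\psi_\mu^S$. However, the potential part containing $U_S$ is nearly zero on the tip side of the barrier. What remains,
$$M_{\mu\nu} = -\frac{\hbar^2}{2m_e}\int_{z > z_0}\left({\psi_\nu^T}^{*}\,\frac{\partial^2}{\partial z^2}\psi_\mu^S - \psi_\mu^S\,\frac{\partial^2}{\partial z^2}{\psi_\nu^T}^{*}\right)dx\,dy\,dz,$$
can be integrated over $z$ because the integrand in the parentheses equals
$$\frac{\partial}{\partial z}\left({\psi_\nu^T}^{*}\,\frac{\partial}{\partial z}\psi_\mu^S - \psi_\mu^S\,\frac{\partial}{\partial z}{\psi_\nu^T}^{*}\right).$$
Bardeen's tunneling matrix element is an integral of the wave functions and their gradients over a surface separating the two planar electrodes. With the sample wave function vanishing deep inside the tip, only the boundary term at $z = z_0$ survives:
$$M_{\mu\nu} = \frac{\hbar^2}{2m_e}\int_{z = z_0}\left({\psi_\nu^T}^{*}\,\frac{\partial}{\partial z}\psi_\mu^S - \psi_\mu^S\,\frac{\partial}{\partial z}{\psi_\nu^T}^{*}\right)dx\,dy.$$
The overall sign of $M_{\mu\nu}$ is immaterial, since only $|M_{\mu\nu}|^2$ enters the tunneling current.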
The exponential dependence of the tunneling current on the separation of the electrodes comes from the very wave functions that leak through the potential step at the surface and exhibit exponential decay into the classically forbidden region outside of the material.
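As a rough illustration of how the exponential arises, consider a one-dimensional sketch in which both tails are pure exponentials with the same decay constant $\kappa$. This is an idealization: the amplitudes $A$ and $B$ and the electrode separation $w$ are symbols introduced only for this example. Evaluating the antisymmetric combination of wave functions and gradients that appears in Bardeen's surface integral, taken per unit area at any plane $z_0$ inside the gap, gives a result that does not depend on $z_0$:

```latex
\begin{align}
\psi_\mu^S(z) &= A\,e^{-\kappa z}, \qquad
\psi_\nu^T(z) = B\,e^{\kappa (z - w)}, \qquad 0 < z_0 < w, \\
\left.\left({\psi_\nu^T}^{*}\,\frac{\partial \psi_\mu^S}{\partial z}
  - \psi_\mu^S\,\frac{\partial {\psi_\nu^T}^{*}}{\partial z}\right)\right|_{z = z_0}
  &= -2\kappa\,A B^{*}\,e^{-\kappa w}, \\
|M_{\mu\nu}|^2 &= \left(\frac{\hbar^2 \kappa}{m_e}\right)^{2} |A|^2\,|B|^2\,e^{-2\kappa w}.
\end{align}
```

The factor $e^{-2\kappa w}$ is the same one obtained from the rectangular-barrier model, which suggests the exponential distance dependence does not hinge on the details of either model.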
The tunneling matrix elements show appreciable energy dependence, which is such that tunneling from the upper end of the $eV$ interval is nearly an order of magnitude more likely than tunneling from the states at its bottom. When the sample is biased positively, its unoccupied levels are probed as if the density of states of the tip is concentrated at its Fermi level. Conversely, when the sample is biased negatively, its occupied electronic states are probed, but the spectrum of the electronic states of the tip dominates. In this case it is important that the density of states of the tip is as flat as possible.[5]
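The role of a flat tip density of states can be explored with a small numerical sketch of the small-bias convolution of the two densities of states over the window from 0 to $eV$. It deliberately drops the matrix element and the prefactor, so it ignores the energy dependence discussed above and returns arbitrary units. The function name and the callable-based interface are assumptions made for this example; energies are in eV relative to the Fermi level.

```python
def convolution_current(bias_v, rho_sample, rho_tip, n_steps=1000):
    """Trapezoid-rule estimate of the integral over eps in [0, eV] of
    rho_sample(eps - eV) * rho_tip(eps), with eV equal to bias_v in eV.

    rho_sample and rho_tip are callables taking an energy relative to
    the Fermi level. A negative bias yields a negative current.
    """
    h = bias_v / n_steps
    total = 0.0
    for k in range(n_steps + 1):
        eps = k * h
        # End points carry half weight in the trapezoid rule.
        weight = 0.5 if k in (0, n_steps) else 1.0
        total += weight * rho_sample(eps - bias_v) * rho_tip(eps)
    return total * h
```

With both densities constant the result is proportional to the bias, that is, an ohmic junction. With a flat tip and a structured sample, the bias dependence of the result reflects the sample alone, which is the situation spectroscopy aims for.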
Results identical to Bardeen's can be obtained by considering the adiabatic approach of the two electrodes and using standard time-dependent perturbation theory.[15] This leads to Fermi's golden rule for the transition probability in the form given above.
Bardeen's model is for tunneling between two planar electrodes and does not explain the scanning tunneling microscope's lateral resolution. Tersoff and Hamann[17][18][19] used Bardeen's theory and modeled the tip as a structureless geometric point.[5] This helped them disentangle the properties of the tip, which are hard to model, from the properties of the sample surface. The main result was that the tunneling current is proportional to the local density of states of the sample at the Fermi level taken at the position of the center of curvature of a spherically symmetric tip (s-wave tip model). With such a simplification, their model proved valuable for interpreting images of surface features bigger than a nanometre, even though it predicted atomic-scale corrugations of less than a picometre. These are well below the microscope's detection limit and below the values actually observed in experiments.
In sub-nanometre-resolution experiments, the convolution of the tip and sample surface states will always be important, to the extent of the apparent inversion of the atomic corrugations that may be observed within the same scan. Such effects can only be explained by modeling of the surface and tip electronic states and the ways the two electrodes interact from first principles.
One-atom-thick silver islands grown on terraces of the (111) surface of palladium. Image size is 250 nm by 250 nm.
The characteristic reconstruction fringes on the (100) surface of gold are 1.44 nm wide[20] and consist of six atomic rows that sit on top of five rows of the crystal bulk. Image size is approximately 10 nm by 10 nm.
An earlier invention similar to Binnig and Rohrer's, the Topografiner of R. Young, J. Ward, and F. Scire from NIST, relied on field emission.[21] However, Young is credited by the Nobel Committee as the person who realized that it should be possible to achieve better resolution by using the tunnel effect.[22]
STM can be used to manipulate atoms and change the topography of the sample. This is attractive for several reasons. Firstly, the STM has an atomically precise positioning system, which enables very accurate atomic-scale manipulation. Furthermore, after the surface is modified by the tip, the same instrument can be used to image the resulting structures. IBM researchers famously developed a way to manipulate xenon atoms adsorbed on a nickel surface.[4] This technique has been used to create electron corrals with a small number of adsorbed atoms and observe Friedel oscillations in the electron density on the surface of the substrate. Aside from modifying the actual sample surface, one can also use the STM to tunnel electrons into a layer of electron-beam photoresist on the sample, in order to do lithography. This has the advantage of offering more control of the exposure than traditional electron-beam lithography. Another practical application of STM is atomic deposition of metals (gold, silver, tungsten, etc.) with any desired (pre-programmed) pattern, which can be used as contacts to nanodevices or as nanodevices themselves.[citation needed]
Wiesendanger R, Güntherodt HJ, eds. (1996). Scanning Tunneling Microscopy III – Theory of STM and Related Scanning Probe Methods. Springer Series in Surface Sciences. Vol. 29. Springer-Verlag Berlin Heidelberg. doi:10.1007/978-3-642-80118-1. ISBN 978-3-540-60824-0.