REFERENCE TO RELATED APPLICATION This application is a continuation of U.S. patent application Ser. No. 10/728,126, filed Dec. 3, 2003, which claims the priority benefit under 35 U.S.C. §119(e) of provisional application No. 60/431,337, filed Dec. 5, 2002.
FIELD OF THE INVENTION The invention relates generally to the field of semiconductor processing and, more particularly, to an apparatus and method for the atomic layer deposition (ALD) of films onto semiconductor substrates.
BACKGROUND OF THE INVENTION Various types of reactors are used for atomic layer deposition (ALD) of thin films on semiconductor substrates. During ALD, a semiconductor substrate is exposed to alternating and sequential pulses of at least two mutually reactive reactants. The substrate temperature is chosen to be in a window above the reactants' condensation temperatures and below the reactant's thermal decomposition temperatures, so that during a reactant pulse a monolayer of the reactant adsorbs onto the substrate in a self-saturating manner without condensing or decomposing. After exposing the substrate to a first reactant, the process chamber is purged to remove any excess first reactant from the substrate and to remove any residual first reactant from the gas phase. The substrate is then exposed to a second reactant pulse, the second reactant being reactive with the first reactant.
Adequate purging and/or evacuating between the alternating reactant pulses is vital for good control of the deposition process. As purging takes time, the ALD process is generally a slow process.
Because of continual pressures within the semiconductor processing industry for increased throughput, there is a need for methods and apparatuses for ALD that allow efficient purging between subsequent reactant pulses, so that the purge time between pulses can be reduced and the overall deposition rate of the ALD process can be increased. In addition, because contaminating reactants can remain in a chamber by being deposited onto reactor walls there is also a need for reactors for ALD wherein deposition on the reactor walls is reduced or prevented.
SUMMARY OF THE INVENTION In accordance with one preferred embodiment of the invention, a film deposition station is provided for depositing a film onto a substrate. The deposition station comprises a first part and a second part for accommodating a semiconductor substrate between them. The first part and the second part are positioned opposite each other and parallel to a substrate, upon retention of the substrate between the first and second parts. The first part and the second part are also configured to be spaced less than about 2 mm from a main surface of a substrate accommodated between them. At least one of the parts is provided with a heater for heating that part and each part is provided with a set of gas supply channels connected to a source of gas. The source of gas for the first part is configured to supply mutually reactive reactants in a sequence of alternating, separated pulses for atomic layer deposition (ALD).
In accordance with another preferred embodiment, a reactor is provided for semiconductor processing. The reactor comprises an upper reactor block and a lower reactor block for accommodating a semiconductor substrate therebetween. The upper and the lower reactor blocks are configured to be less than about 2 mm from a major surface of the substrate when the substrate is retained therebetween. Also, the reactor is configured to discharge mutually reactive reactants from at least one of the reactor blocks to the substrate in sequential alternating, separated pulses. The at least one of the reactor blocks comprises a set of gas channels configured to transport and discharge the sequential alternating, separated pulses of reactant to the substrate.
In accordance with yet another preferred embodiment, a method is provided for depositing a layer on a semiconductor substrate. The method comprises providing an apparatus having a first side section and a second side section located opposite one another. The side sections each have facing planar surfaces and at least one of the side sections is heated to a temperature higher than about 200° C. The substrate is placed in the apparatus between the first and second side sections and two gas streams, in opposing directions, are applied from the first and second side sections to two opposing planar sides of the semiconductor substrate. The spacing between each of the first and second side sections and the semiconductor substrate is at most about 1 mm and the facing planar surfaces of the side sections extend completely across the opposing planar sides of the semiconductor substrate. At least one of the gas streams provides different reactants in a sequence of alternating, separated pulses for an atomic layer deposition (ALD) process.
In accordance with another preferred embodiment, a method is provided for semiconductor processing. The method comprises providing a processing apparatus having a first and a second reactor block. A substrate is positioned between the first and the second reactor blocks, wherein the substrate is less than about 2 mm from each of the first and the second reactor blocks after positioning. Mutually reactive reactants are discharged from the first reactor block in alternating, temporally separated pulses onto the substrate. The substrate is heated using the first or second reactor block to one or more desired substrate temperatures, wherein the first reactor block is at a first temperature at which condensation or decomposition of the mutually reactive reactants is substantially prevented.
In accordance with yet another preferred embodiment, a method is provided for semiconductor processing. The method comprises loading a substrate in a reaction chamber and completely supporting the substrate on a gas cushion in the chamber. While completely supporting the substrate on the gas cushion, mutually reactive reactants are discharged in alternating, temporally separated pulses onto a major surface of the substrate.
In accordance with another preferred embodiment of the invention, a film deposition station is provided for depositing a film onto a substrate. The deposition station comprises a first part and a second part for accommodating a semiconductor substrate between them. The first part and the second part are positioned opposite each other and parallel to the substrate upon retention of the substrate between the first and second parts. The first part is provided with a first set of gas supply channels connected to a source for a first reactant and a second set of gas supply channels connected to a source for a second reactant. The first and second set of gas supply channels are configured to keep the reactants separated until discharging the reactants out from the gas supply channels to the substrate, wherein the first and the second reactant are mutually reactive. The deposition station also comprises controls to supply the first and the second reactant from the source for a first reactant and from the source for a second reactant in sequential alternating separated pulses for atomic layer deposition (ALD).
BRIEF DESCRIPTION OF THE DRAWINGS The invention will be better understood from the detailed description of the preferred embodiments and from the appended drawings, which are meant to illustrate and not to limit the invention, and wherein:
FIG. 1 shows a graph of the theoretical reaction regimes of different process recipes for a particular set of ALD reactants in different temperature ranges;
FIG. 2 shows, schematically, a cross-sectional side view of a reactor having a top and a bottom part between which a substrate is accommodated, wherein one reactor part is heated and the other reactor part is cooled, in accordance with certain preferred embodiments of the invention;
FIG. 3 shows, schematically, a cross-sectional side view of the reactor ofFIG. 2, wherein both reactor parts are heated, in accordance with other preferred embodiments of the invention;
FIG. 4 shows, schematically, a cross-sectional plan view of one of the two reactor parts ofFIG. 3 as viewed along line A-A and showing gas channels, in accordance with preferred embodiments of the invention;
FIG. 5 shows, schematically, a cross-sectional side view of the reactor part ofFIG. 4 as viewed along line B-B;
FIG. 6 shows, schematically, a cross-sectional side view of a reactor in which the top reactor part comprises two separated sets of gas supply channels, in accordance with another preferred embodiment of the invention;
FIG. 7 shows, schematically, a cross-sectional side view of the reactor parts ofFIG. 6, showing a first set of the gas channels in the top reactor part;
FIG. 8 shows, schematically, a cross-sectional side view of the reactor parts ofFIG. 6, showing a second set of the gas channels in the top reactor part;
FIG. 9 shows, schematically, a cross-sectional side view of a reactor in accordance with yet another preferred embodiment of the invention; and
FIG. 10 shows, schematically, a cross-sectional side view of a more complete reactor module using the reactor ofFIG. 9, in accordance with another preferred embodiment of the invention.
DETAILED DESCRIPTION OF THE INVENTION According to preferred embodiments of the invention, a deposition station is provided for depositing films by exposing a substrate to mutually reactive reactants, preferably in alternating pulses, such as for an atomic layer deposition process. The station preferably comprises two parts which delimit a reaction chamber in which the substrate is accommodated. To prevent unintended reaction of the reactants, at least one of the parts is provided with a plurality of gas channels which allow multiple reactants to be flowed through that part without contacting one another until being discharged into the reaction chamber. Preferably, as discussed below, the two parts are positioned close to the substrate during processing to minimize the volume of the reaction chamber and to allow quicker purging and cycling of reactants into the chamber. In addition, as discussed below, at least one of the two parts delimiting the reaction chamber can be heated. Preferably, the temperatures of these two parts are different from the desired process temperature of the substrate and are out of the window of temperatures in which deposition rate is optimized, thus minimizing deposition of reactants on the deposition station surfaces. Advantageously, by minimizing the purge time and the contamination of deposition station surfaces, reactor throughput and the quality of deposited layers are improved.
A. Preferred Reactor Module
While the preferred embodiments can be applied to other reactors known to those of skill in the art, use of a floating substrate reactor is particularly advantageous. In a floating substrate reactor, a substrate can be processed without being mechanically supported; that is, the substrate can be processed without being directly contacted by a solid support. This enables very uniform and rapid heating of the substrate without the cold spots that can occur in reactors where substrates are mechanically contacted during a semiconductor fabrication process. In addition, parts of the reactor surrounding the substrate are preferably relatively massive such that each has a high heat capacity relative to the substrate, helping to stabilize the temperature of the substrate and minimizing the susceptibility of the reactor to temperature fluctuations upon loading and unloading of the substrate into the reactor. The basic configuration of such a floating substrate reactor is available commercially under the trade name Levitor® from ASM International N.V. of Bilthoven, The Netherlands. In this reactor, a substrate, such as a wafer, is typically accommodated between relatively massive upper and lower blocks. More detailed information about the Levitor® can be found in U.S. Pat. No. 6,183,565, the disclosure of which is incorporated herein by reference.
In addition, a Low Temperature (LT) module of the Levitor®, also available from ASM International N.V. of Bilthoven, The Netherlands, allows for processing at low temperatures and permits the annealing of copper (Cu) films at temperatures in the range of about 200-400° C. A short description of the Levitor® Low Temperature module is provided below. More detailed information about the Levitor® Low Temperature copper anneal module can be found in WO 01/50502 and US Patent Application Publication No. 2002/0092584, the disclosures of which are incorporated herein by reference.
In some preferred embodiments, both upper and lower reactor blocks of the reactor are provided with heaters, and the temperature of the blocks can be controlled such that the blocks have different temperatures: one block at a higher temperature and the other block at a lower temperature, wherein the desired substrate temperature lies between the high temperature and the low temperature. In other preferred embodiments, both blocks are heated to substantially the same temperature, which is preferably equal to the desired substrate temperature. The substrate temperature can also be regulated by varying the heat transference between the substrate and the reactor blocks. Methods for controlling the substrate temperature by varying heat transference are described in greater detail in co-pending U.S. application Ser. No. 10/186,269, METHOD AND APPARATUS FOR THE TREATMENT OF SUBSTRATES, filed Jun. 27, 2002 and METHOD FOR THE HEAT TREATMENT OF SUBSTRATES, filed Oct. 31, 2003, attorney docket No. ASMINT.057AUS, the disclosure of which is incorporated herein by reference.
Also, in some embodiments, to allow a substrate to be processed at a temperature within a processing window for optimal deposition while the upper and lower blocks are maintained at temperatures outside that window and to be able to quickly cool the substrate, the upper and lower blocks can be set at different temperatures, as noted above. For example, the upper block of the Levitor® Low Temperature module can be kept hot, preferably about 50° C. or more above the desired substrate treatment temperature, while the lower block is kept cold, preferably about 50° C. or more below the desired substrate treatment temperature or, more preferably, at 40° C. or less. The upper block is preferably heated by an array of heating elements, and the lower block is preferably water-cooled. The blocks are provided with a plurality (preferably about 60 to 100) of gas injection channels, spaced from each other and distributed over the surface of the block facing the semiconductor substrate. When the blocks are in a closed position with a substrate loaded between them, the gap between the substrate and each of the blocks is preferably less than about 2.0 mm, more preferably less than about 1.0 mm, and most preferably less than about 0.5 mm. In the illustrated embodiments the gap is about 0.2 mm. Typical powers required to maintain a temperature difference, ΔT, of about 400° C. between the blocks are in a range of roughly about 5-8 kW for 300 mm semiconductor wafers.
The temperature of a substrate within the reactor can be varied by switching the type of gas above and below the substrate between a high thermal conductivity gas and a low thermal conductivity gas (e.g., He or N2, respectively), and/or by varying the position of the substrate relative to the two blocks (e.g., the relative distance of the substrate from each block). When the blocks are at different temperatures, switching the composition of the gases on one or both sides can change the thermal conductivity of the medium between the substrate and the blocks. Thus, switching the gas compositions on one or both sides of the substrate can be employed to change the substrate temperature within a range between the temperatures of the two blocks. In addition, switching the relative flow rates of gases on either side of the substrate can move the substrate closer to one or the other of the blocks, thereby also changing the substrate temperature.
When the blocks are in a closed position and arranged close to a substrate, large gas flows (preferably greater than about 5 slm and, in a some embodiments, about 10 slm on each side) ensure that the substrate surfaces are effectively protected from the outside ambient. For example, this allows a Cu film to be annealed in the Levitor® Low Temperature module without the need to apply an additional gate valve to prevent oxidation of the Cu film. Because of the large gas flows, the substrate only comes into contact with outside air when the blocks are open and the substrate is relatively non-reactive (i.e., when the substrate is cold in comparison to its temperature during treatment). Preferably, to ensure that the substrate is relatively cold, the substrate temperature is switched to a low value at the end of processing, e.g., by appropriately switching the gases flowing above and below it, before opening the blocks.
In arrangements where the blocks are maintained at different temperatures, however, the substrate is susceptible to temperature non-uniformities. This is caused by the fact that temperature differences on different sides of a substrate produce a temperature gradient through the thickness of the substrate and thus can cause bowing of the substrate, which can immediately result in a temperature non-uniformity. For example, when the same gas is used on both sides of the substrate, with a ΔT between the blocks of about 400° C., and a gap of about 0.2 mm between each block and the substrate, a temperature non-uniformity across the substrate of about 1° C. per 1 μm bow can result. Thus, for a typical rest bow of about 10 μm, the difference in temperature of different parts of the substrate can be up to about 10° C. Advantageously, however, this non-uniformity can be reduced when the ΔT between the blocks is made smaller or when different gases are used on both sides of the substrate in order to reduce the differences in temperature between the two sides of the substrate.
Heat-up and cool-down with such an arrangement can advantageously be fast. Using good heat conduction in the thin gas layers results in typical heat-up and cool-down times of preferably less than about 5 seconds and, more preferably, between about 2-3 seconds.
As noted above, the blocks are preferably relatively massive compared to the substrate. The blocks thus have sufficient heat capacity such that, when heated, heat is transferred to an unheated substrate loaded between the side sections with negligible temperature loss from the blocks. The blocks preferably are formed of metal and have a preferred vertical thickness of greater than about 10 mm, more preferably greater than about 40 mm.
It will be appreciated that it is not trivial to deposit films from more than one reactant in reactors such as the Levitor® reactor, as suggested in U.S. Pat. No. 6,183,565, the disclosure of which is incorporated herein by reference. For example, with conventional CVD processes, it has been found that when deposition gases are pre-mixed, there is a tendency to cause deposition in the gas channels and in the tubing. In addition, there is a tendency toward depletion of reactants in the small volume in between the substrate and the block surface. This depletion can lead to relatively heavy deposition on the substrate immediately below the gas inlet channels, and to less deposition on areas of the substrate in between such inlet points. Consequently, processing using multiple reactants in these reactors can be problematic both in terms of adversely affecting reactor surfaces and in terms of their deposited layers on substrates, which exhibit poor uniformity of both thickness and coverage.
B. ALD in the Preferred Reactor Module
It has been found that the problems that can exist when depositing films using multiple reactants in reactors such as the Levitor® reactor can be avoided when ALD processes are used to deposit the films.
These problems can advantageously be avoided for a number of reasons. For example, in accordance with some preferred embodiments, when independent gas inlet channels are used for different reactants, there typically will be no deposition in the gas inlet channels.
Also, when a block facing the substrate device side is kept at a relatively low temperature (but high enough to avoid condensation), there will be substantially reduced ALD growth on the surfaces of that (typically colder) block facing the substrate. Alternatively, when the block facing the substrate device side is kept at a relatively high temperature (but not so high that thermal decomposition of a reactant occurs), such that the reaction conditions are outside the optimal process window for ALD, substantially reduced ALD deposition will occur on surfaces of this block due to a tendency for reactants to desorb.
Moreover, even where reactant depletion occurs, self-saturating pulses and true self-limiting reactions do not generally cause non-uniformity in ALD films. This is due to the saturation of absorption sites for reactants, which typically prevents an excess build-up of reactants. Thus, the problems with deposition on reactor surfaces and with poor quality deposited layers can be avoided.
In addition, various other features of the preferred reactor make it particularly advantageous for ALD processes. For example, the reactor volume is particularly small, due to the small spacing of the substrate from each of the upper and lower reactor blocks of about 2 mm or less, preferably about 1 mm or less and, more preferably, of about 0.2 mm or less. The small reactor volume allows a rapid switching between the alternating reactant pulses in ALD; any traces of a reactant pulse still residing in the reactor after termination of the pulse can be purged from the reactor in a very short time.
Not only is the reaction space volume small, but the gas distribution system used with the preferred reactor, described in further detail below, also has a small volume. Unlike conventional showerheads, the gas distribution system does not comprise a relatively voluminous gas distribution section to distribute gas laterally and uniformly over the substrate. Instead, the gas distribution system preferably comprises a set of horizontal channels to distribute the gas in a lateral direction over the substrate, which is typically placed in the reactor in a horizontal position. Preferably, a set of vertical channels, each vertical channel preferably in communication at one end with one of the horizontal channels and at the other end with the reaction space, discharges the gas into the reaction space. The channels are preferably drilled in the reactor blocks, which are preferably massive plates. The horizontal channels are relatively small in diameter, preferably about 3-5 mm and more preferably about 4 mm in diameter. The vertical channels are preferably about 1-3 mm in diameter and more preferably about 2 mm in diameter. Consequently, the total volume of the set of horizontal and vertical channels is small and easily purged in comparison to a conventional showerhead that comprises a relatively open gas distribution volume upstream of a perforated faceplate. In addition, the small gas volumes at both sides of the substrate further ensure that the total reactor volume is purged particularly efficiently.
Another advantage of the preferred reactor for ALD is the ability to control the temperature of the substrate at a value different from the temperature of the reactor blocks, thereby helping to minimize deposition on the reactor blocks. It will be appreciated that true self-limited atomic layer deposition will only occur in a certain temperature window. Preferably, the reactor block from which the reactants are dispersed is maintained at a temperature outside the window for optimized ALD growth, and preferably at a temperature where little or no growth occurs, so that little or no deposition-occurs on this block, while the substrate is maintained at a temperature within the window for ALD growth.
An additional advantage of the preferred reactor, e.g., the Levitor® LT module is that, during processing, the temperature of the substrate can rapidly be switched. This can beneficially be exploited in a manner as disclosed by Kurabayashi et. al. in a paper. entitled “Temperature synchronized molecular layer epitaxy,” Applied Surface Science 82/83 (1994) 97-102, the disclosure of which is incorporated herein by reference. Kurabayashi describes, among other examples, a process wherein a GaAs film is deposited on a substrate by exposing the substrate sequentially and repeatedly to pulses of AsH3and TMG (Tri Methyl Gallium), and wherein the substrate temperature is 500° C. during the AsH3pulse and 450° C. during the TMG pulse. The higher temperature during the AsH3pulse is preferred for achieving a sufficient reaction rate, whereas the lower temperature during the TMG pulse is preferred for avoiding premature thermal decomposition of the TMG. In general, this technique allows the use of temperature sensitive precursors during the low temperature part of the cycle whereas the high temperature part of the cycle ensures that a sufficient reaction rate and/or film quality is achieved. This is beneficial in the deposition of metals, metal oxides, and metal nitrides.
In some situations, cycling the temperature is a good alternative to the use of a plasma during the deposition process. Further, when depositing multicomponent materials, temperature cycling allows the use of combinations of reactants, in particular metal compounds, whose respective process windows do not match.
Advantageously, the preferred reactor is well suited for rapid switching of substrate temperature without requiring a large peak power and large variations in power for the heating system. It will be recognized that the temperatures between which processing is switched can be tailored depending on the reactants used. Also, the timing of the temperature switching can be tailored with respect to the timing of the gas pulses, to achieve optimum results. After supplying a first reactant at a first substrate temperature, the temperature of the substrate can be switched to the second temperature at the beginning, during or after the supply of the second reactant pulse. Although the moment of temperature switching can be shifted in time with respect to the moment of reactant switching, it will be appreciated that the period in which the reactor is at the first temperature should overlap with presence of the first reactant and the period in which the reactor is at the second temperature should coincide with the presence of the second reactant. Preferably, the duration of the period between which the temperature and reactants are switched is the same and switching of the temperature and reactants occurs at substantially the same frequency.
C. DETAILED DESCRIPTION TO THE FIGURES Reference will now be made to the Figures, wherein like numerals refer to like parts throughout.
With reference toFIG. 1, the temperature window for optimal ALD is explained in further detail. In this figure, the growth of the film per cycle is given as a function of temperature. The horizontal axis indicates the deposition temperature and the vertical axis indicates the amount of film growth per deposition cycle. W denotes the temperature window in which optimized atomic layer deposition occurs. In an ideal case for the temperature window W, represented in the drawings as a horizontal line across the vertical growth axis, one monolayer of film is deposited per full cycle. It will be appreciated, however, that due to surface reconstruction or steric hindrance by large surface ligands, less than a monolayer per cycle is typically deposited in an actual ALD process performed within this window, but optimal self-limited growth still obtains.
Outside the window, various factors can cause greater than or less than optimized growth per cycle. For example, at the low temperatures at the low end of the temperature window (W), depending on whether a reactant condenses at those temperatures, more or less than a monolayer can be deposited. L1denotes a region of increased growth per cycle, exhibited by reactants that condense at those temperatures. L2denotes a region of decreased growth per cycle, for other reactant combinations. The temperature dependence described by L2is indicative of a process with combinations of reactants for which the reaction is activation energy limited, where the reactivity of the reactants becomes too low. It should be noted, however, that even for chemistries exhibiting a low-temperature curve like L2, increased deposition can eventually occur at lower temperatures than those shown, since condensation usually will take place at a low enough temperature.
On the other hand, at temperatures at the high end of the temperature window, other factors can lead to deposition of greater than or less than one monolayer per cycle. HI indicates a situation where the growth per cycle is above the optimized, self-limited growth. This can occur for some process recipes when the temperature is so high that thermal decomposition of one of the reactants occurs or where non-volatile reaction by-products are formed. H2represents a situation where the growth per cycle is less than one monolayer. This may be the result of desorption or dissociation of a surface ligand that is needed to activate the surface for the next reactant. It should be noted, however, that even for chemistries exhibiting a high-temperature curve like H2, increased deposition can eventually occur at even higher temperatures than those shown, when thermal decomposition causes deposition of more than one monolayer per cycle.
As shown inFIG. 1, maintaining the substrate surface temperature within a certain window is important for achieving the desired amount of growth of a thin film on a substrate by ALD. Preferably, information about the growth curve for a particular reaction is used to select an appropriate substrate temperature. Thus, a substrate temperature is preferably chosen that falls within the window in which ALD occurs. Advantageously, the preferred embodiments also take advantage of the other information available from the growth curves to minimize deposition on the reactor walls and to provide an improved ALD method and apparatus.
Exemplary embodiments will now be described with reference toFIGS. 2-9.FIG. 2 shows schematically areactor100 having the general features of a Levitor® Low Temperature module. Thereactor100 comprises anupper reactor block120 and alower reactor block160, between which a substrate orwafer110 is accommodated. It will be appreciated that the surfaces of theblocks120 and160 facing each other delimit a reaction chamber for processing thewafer110. Theupper reactor block120 is preferably heated through resistance heating, using an array ofheating elements122, each heating element connected withwires123 to a source of electrical energy. Gas flows through theupper block120 via a gas inlet opening124, andgas inlet channels125 and126. Then the gas is dispersed over an entire upper or firstmajor surface110aof thewafer110 via a plurality ofgas dispersion channels128, in communication with thegas inlet channel126. Finally, the gas is discharged from theupper block120 towards afirst surface110athroughgas injection channels129.
Shown on the other side of thewafer110, thelower reactor block160 is provided withcooling channels162 for the passage of a cooling fluid. Gas flows through thelower block160 via a gas inlet opening164 andgas inlet channels165 and166. Then the gas is preferably dispersed over an entire wafer lower orsecond surface110bvia a plurality ofgas dispersion channels168 in communication with thegas inlet channel166. Finally, the gas is discharged from thelower block160 towards the secondmajor surface110bofwafer110, opposite thefirst surface110a, throughgas injection channels169. In the reaction space between theblocks120 and160, the gases are preferably flowed in a radial direction and are exhausted from the reaction space via aperipheral gap130 between theblocks120 and160.
It will be appreciated that thegas dispersion channels128,168 can also be provided in individual sections which do not extend over the whole of thesurfaces110aand110b, respectively. For example, while one horizontalgas inlet channel125,165 and one verticalgas inlet channel126,166 are illustrated distributing gas over theentire surface110aor110b, respectively, more than onegas inlet channel125,165 and126,166 can be provided, each providing gas to a gas dispersion channel that only extends over a limited section of thesurface110aor110b.
It will also be appreciated that a lower block need not be actively cooled. For example, in another embodiment, as shown inFIG. 3, thelower block150 is provided with theheating elements152, instead of cooling fluid channels, to maintain thelower block150 at a raised temperature. The temperature of thelower block150 can be the same as the temperature of theupper block120 or it can be a different temperature. Preferably, theheating elements152 are resistive heaters and are each connected viawires153 to a source of electrical energy. Gas is flowed through thelower block150 via a gas inlet opening154 andgas inlet channels155 and156. Then the gas is dispersed over theentire wafer surface110bvia a plurality ofgas dispersion channels158 in communication with the verticalgas inlet channel156. Finally, the gas is discharged from thelower block150 towards thesecond surface110bof thewafer110, opposite thefirst surface110a, throughgas injection channels159.
The configuration of the gas channels in the blocks is shown in greater detail inFIGS. 4 and 5.FIG. 4 shows a horizontal cross-section of ablock420 similar to theblock120 ofFIG. 3, taken across plane A-A ofFIG. 3. Like elements are referenced by reference numerals having like last two digits relative to the elements ofFIG. 3. Theblock420 is provided with thegas dispersion channels428 in a radial configuration. Eachgas dispersion channel428 is provided with a plurality ofgas injection channels429.
FIG. 5 shows a vertical cross-section taken along plane B-B inFIG. 4. Gas enters theblock420 through theinlet424, which is in communication with thegas inlet channel425, through which gas is discharged into the annularvertical inlet channel426. Thegas dispersion channels428 are in communication with thevertical channel426. The annular channel is formed by inserting aplug434 into acylindrical hole433 in theblock420. As theplug434 has, over a part of its length, a smaller diameter than thecylindrical hole433, thevertical inlet channel426 is left open. Thegas dispersion channels428 are preferably drilled in theblock420 and their radially outward ends are preferably closed by insertingplugs432.
Preferably, the cross-sectional area of each of thegas dispersion channels428, as represented by, e.g., their diameter(s), is larger than the cross-sectional area of each of the gas injection channels429 (FIG. 4), so that a roughly homogeneous distribution of gas through thegas dispersion channels428 and verticalgas injection channels429 is achieved. In a preferred embodiment, the diameter of thegas dispersion channels428 is about 3-5 mm and more preferably about 4 mm and the diameter of thegas injection channels429 are about 1-3 mm and more preferably about 2 mm.
In addition, thegas injection channels429 themselves preferably comprise a first section with a relatively small first diameter, directly connected to thegas dispersion channels428, and a second section with a wider second diameter, which opens to the reaction space occupied by the substrate110 (FIGS. 2 and 3). Preferably, the first diameter is about 1 mm or less and the second diameter is about 1 mm or more. In one preferred embodiment, the first diameter is about 0.25 mm and the second diameter is about 2 mm, the length of the first section is about 1 mm and the first section is at an upstream end intersecting with agas dispersion channel428 whereas the length of the second section is about 3 mm. The downstream end of the first section is in communication with the upstream end of the second section. The downstream end of the second section opens into the reaction space. In this way, a narrow restriction ingas injection channel429 is provided, whereas the second wider section allows expansion of the gas jet and reduction of the gas velocity before discharging the gas into the reaction space, so that the occurrence of gas jet marks on the substrate is advantageously avoided.
It will be appreciated that various other arrangements are possible for the gas channels, so long as they allow for gas to be distributed over a substrate surface. For example, thegas dispersion channels428 need not be in a radial configuration but, e.g., can be arranged according to a pattern of parallel, equidistant lines, connected at one end with a gas inlet channel oriented perpendicular to the gas dispersion channels, forming a rake type configuration. Further, the two sets of separated gas dispersion channels need not necessarily be vertically displaced but they can be, e.g., two oppositely oriented, interlaced rakes, disposed at about the same vertical level. Similarly, it will also be appreciated that various other arrangements are possible for thegas inlet424 and thegas inlet channels425 and426, so long as they allow gas to be provided to thegas dispersion channels428.
With reference toFIG. 6, a particularly advantageous embodiment is shown, wherein theupper block120 is provided with two different, separated sets of gas distribution channels, one set of channels for each one of the two mutually reactive reactants typically used during the deposition process. The first set comprises a gas inlet opening124, in communication with one end of agas inlet channel125, the other end of thegas inlet channel125 in communication with avertical inlet channel126, discharging into an annulargas distribution channel127. A plurality ofgas dispersion channels128 is in communication with the annulargas distribution channel127. Finally, gas is discharged from thegas dispersing channels128 throughgas injection channels129 towards thefirst surface110aof thewafer110.
A second set of gas distribution channels, separated from the first set, is configured similarly to the first set. The second set comprises a second gas inlet opening144, in communication with one end of a secondgas inlet channel145, the other end of the secondgas inlet channel145 being in communication with a secondvertical inlet channel146, discharging into a second annulargas distribution channel147. A plurality of secondgas dispersion channels148 are in communication with the annulargas distribution channel147. Finally, gas is discharged from thegas dispersing channels148 through the secondgas injection channels149 towards thefirst surface110aofwafer110. Preferably, other features inFIG. 6 are similar as those inFIG. 3.
It will be appreciated thatFIG. 6 is a schematic, cross-sectional side view, showing both sets of channels in theupper block120 simultaneously. Preferably, the two sets of horizontally extendinggas dispersion channels128 and148, shown with the first set at a first vertical level and the second set at a second vertical level, are not just vertically displaced with respect to each other, but are also horizontally displaced. In this way the verticalgas injection channels149 communicating with the upper set ofgas dispersion channels148 do not intersect the lower set ofgas dispersion channels128.
An exemplary such arrangement is illustrated in greater detail inFIGS. 7 and 8. InFIG. 7, which is a cross-sectional view taken through a first plane, the first set of gas channels124-129 is shown. Also shown is the annulargas distribution channel147, which is an annular ring and visible at any cross-section taken through the center axis of the reactor blocks120 and150. InFIG. 8, which is a cross-sectional view taken through a second plane, slightly rotated with respect to the first plane, the second set of gas channels144-149 is shown. Also shown is the annulargas distribution channel127. It will be appreciated that while no channels are illustrated in thelower block150 for clarity of illustration, thelower block150 can be provided with channels, as discussed herein.
It will be appreciated that the number of sets of channels such as the gas channels124-129 and144-149 is not limited to two; rather, the number of separated sets of channels is only limited by the number of set of channels that can be accommodated in theblock120. For example, in other arrangements, three or more separated sets of channels can be accommodated in theblock120. Also, thelower block150 can be provided with separated sets of channels, which can be arranged differently or can mirror those of theupper block120.
A further embodiment is shown inFIG. 9. Thegas inlets124 and144 of theupper block120 are now provided at the top and through the center of theupper block120 and thegas inlet154 of thelower block150 is provided at the bottom and through the center of thelower block150. It will be appreciated that thegas inlets124 and144 can connected togas lines224 and244, which are in turn connected to one ormore controllers250, which preferably comprise separate valves commonly controlled by a computer programmed to provide gases to thegas inlets124 and144 in sequential alternating, separated pulses. Thecontroller250 is preferably also connected toreactant sources324 and344. In addition, while illustrated only inFIG. 9 for ease of description, it will be appreciated that each of the gas inlets described above with reference to the other Figures can also be connected to gas sources and controllers for regulating the flow of gas from those gas sources. Also, the controller can be configured for inert gas valve control as described below.
FIG. 10 is a schematic cross section of an ALD module based on Levitor® principles in accordance with another preferred embodiment. The configuration of theblocks120 and150 is similar to that shown inFIG. 9. A chamber around the blocks is delimited by anupper wall190 and alower wall192. Theupper block120 is stationary and in fixed connection with theupper wall190. Thelower block150 is vertically movable to allow for loading and unloading of thewafer110 in between and out from between theblocks120 and150 through anopening195. The mechanism for vertically moving thelower block150 is schematically indicated by194. Aremovable shield193 is preferably connected to thelower block150. Reaction gases are exhausted from the reaction space between the blocks via theannular channel196.
In a preferred embodiment, theupper block120 is maintained at a relatively low temperature, below the desired substrate temperature, and thelower block150 is kept at a relatively high temperature, above the desired substrate temperature. Preferably, theremovable heat shield193 is kept at the same (relatively high) temperature as thelower block150. As theshield193 extends inwards such that its inside edge falls underneath thewafer110 and reactant not adsorbed or reacted on thewafer110 flows off thetop surface110aof the wafer and down to theshield193, atomic layer deposition on the process system surfaces is concentrated on thisshield193, and not on other parts of the system. Advantageously, by removing and replacing the shield, e.g., once every 10-20 μm of film deposition, a low particle generation level can be obtained. In addition, by making theshield193 longer, so that it extends further downwardly into theannular channel196, it can also fulfill the role of largely depleting the outgoing gas stream of reactants, in that way cleaning up the gas that is exhausted.
It will be appreciated that the system for supplying gas to thereactor100 can be configured in various ways known to those of skill in the art. Typically, the gas supply system will comprise one or more controllers which are programmed to provide separate, alternating gas pulses to thereactor100. In some arrangements, each of the blocks of a Levitor® reactor can have separate gas inlets which can be connected to separate gas feeds, as described in U.S. Pat. No. 6,183,565, the disclosure of which is incorporated herein by reference, or as shown in theFIGS. 6 and 9 discussed herein. In such a case, it will be appreciated that a gas switching mechanism is preferably provided upstream to allow for an ALD sequence of alternating and repeating reactant pulses, separated by reactant removal (e.g., purging) steps. An exemplary switching arrangement is sometimes referred to as “inert gas valving,” and is described in U.S. Patent Application Publication US 2001/0054377, the disclosure of which is incorporated herein by reference. In such an arrangement, pulses of reactant vapor are directed from the reactant source container towards the reaction chamber and, through switching of an inert gas flow, the reactant vapor flow is alternatingly: (i) directed to the reaction chamber by an inert gas flow from the source container towards the reaction chamber and then (ii) prevented from flowing from the source container to the reaction chamber by an inert gas flow in a reverse direction in a part of the conduit connecting the source container and the reaction chamber. The inert gas valving system is most particularly useful with liquid reactants that are vaporized prior to being employed in deposition. In other embodiments, the switching system can comprise conventional valves and valve control systems, particularly when more volatile gas sources are employed.
In the arrangements described above having separate gas inlets and separated sets of gas distribution channels in a block, each inlet can be connected to a different gas feed. For example, the separate channels can comprise a “spoke” arrangement, in which different spokes are connected to different reactant gas feeds.
In the preferred embodiments, deposition of a film can be performed by maintaining a ΔT of, e.g., about 100° C. or more and, more preferably, about 200° C. or more between the blocks. In thereactor100 ofFIG. 2, thelower block160 is cooled and can be at a temperature below about 40° C. Theupper block120 can be heated to a temperature of about 350° C. By the appropriate choice of the types of gases and gas flow ranges below and above thewafer110, a wafer temperature between that of the upper andlower blocks120,160, e.g., of about 300° C., can be achieved. This way of operation is beneficial for processes for which the increased temperature of about 350° C. results in desorption of the reactants from the high temperature surface but still does not result in substantial thermal decomposition of the reactants. In other words, with the block at about 350° C., the process should be in the H2regime discussed with reference toFIG. 1. In such an arrangement, it will be appreciated that the temperatures of the reactor blocks are preferably chosen based on the physical properties of the reactants, e.g., the decomposition and/or condensation points of the reactants.
In other embodiments, the lowerupper block150 can be heated, rather than cooled, but still maintained at a temperature that is lower than the desired wafer temperature, while the upper block120 (FIGS. 3 and 6-10) is maintained at a temperature that is higher than the desired wafer temperature.
In yet another embodiment, theupper block120 can be maintained at a lower temperature than the desired wafer temperature. When the desired wafer temperature is, e.g., about 300° C., one can keep theupper block120 at about 230° C. and thelower block150 at about 330° C. As noted above, this can be realized by the appropriate choice of the types of gases and flows below and above thewafer110. For example, because the temperature of thelower block150 is relatively close to the desired wafer temperature, one possibility is to physically keep thewafer110 close to thelower block150 to heat thewafer110 to the desired wafer temperature, which is closer to the lower block temperature than it is to the upper block temperature. The temperature of theupper block120 is preferably not be so low that condensation of the reactant occurs on it. In other words, for the block at about 230° C., the process is not in the L1regime, as discussed above regardingFIG. 1. Preferably, for theupper block120 at about 230° C., the process is in the L2regime.
Processing according to the embodiments of the invention offers numerous advantages. For example, for embodiments employing separated channels for each reactant, by eliminating pre-mixing of the gases in the gas introduction manifold up to the point where the gases enter the deposition volume, deposition in the gas channels leading to the reaction chamber is substantially prevented. In addition, a small nitrogen purge is preferably constantly flowed through the gas channels to ensure that the gas inlet system remains clean up to the final injection point facing the wafer.
By preferably keeping the temperature of the upper block outside the window for optimal ALD growth, but outside the range in which thermal decomposition can occur and outside the range in which condensation can occur, deposition on the upper block is also substantially prevented. To prevent thermal decomposition or condensation, the temperature of the upper block can be kept sufficiently below (e.g., about 70° C.) or above, respectively, that of the wafer. As noted above, in addition to the wafer process temperature, the selection of temperatures also depends on the reactants used. Similarly, the temperature of the lower block is preferably also outside of the window for optimal ALD growth. Note that all the embodiments described herein can simply be inverted so that the reactants flow through the lower block, or reactants can flow through both blocks for double-sided deposition.
The inert gas that preferably flows through the gap in between the wafer and the lower block substantially prevents deposition on the backside of the wafer, or on the lower block surface facing the wafer. Calculations indicate that the reactants present in the flow above the wafer typically will not penetrate the narrow space between the wafer and the lower block. This is because the flow rate below the wafer is relatively large (greater than about 5 slm) and the gap narrow (about 0.2 mm). As the Reynolds number is very small, this flow is expected to be substantially laminar.
As none of the reactants typically enter the volume between the wafer and the lower block, back feeding of reactants from this volume to the volume between the wafer and the upper block, into which reactants are introduced, can be substantially eliminated. Consequently, by preventing the different reactant gases from coming into contact with one another, CVD growth can be prevented.
The preferred embodiments advantageously allow reactants to be switched quickly. In principle, gas switching is much faster at low pressures than under atmospheric conditions. This has been a primary reason why conventional ALD reactors operate at reduced pressures. However, the extremely small reaction volume in a reactor according to the preferred embodiments, such as a Levitor® reactor, in combination with relatively large gas flows, ensures a high gas velocity, even at atmospheric pressure. For example, when a flow of, e.g., about 10 slm is distributed in the volume above the wafer, and the gap is about 0.2 mm, the gas velocity at the edge of a 300 mm wafer is about 0.9 m/s at room temperature. More generally, the gas velocity is preferably arranged to be about 0.1 m/s to 10 m/s. Under the preferred conditions, a typical distance of the wafer radius, i.e., about 0.15 m, is traversed in about 0.02 seconds to about 2 seconds, or about 0.17 seconds in this particular example. At temperatures above room temperature this time will even be shorter due to thermal expansion of the gases. Thus, the reactors can be quickly cleared of reactants for the next reactant pulse, regardless of whether separated gas channels are provided for the reactants.
In some embodiments, rotation (preferably about 20 rpm) of the wafer can be introduced by flowing gas through a number of additional gas channels. Arrangements for such rotation are known to those of skill in the art and was accomplished, for example, in the first version of the Levitor® 4000 reactor, produced by ASM International N.V. of Bilthoven, The Netherlands. Such rotation can improve uniformity by more evenly exposing the wafer to reactants. Also, the uniformity can be further improved by the fact that the wafer tends to be self-centering in such arrangements, positioning itself at the center of the reactor.
In addition, in reactors according to the preferred embodiments, the heat-up rate is fast, preferably on the order of a few seconds. In contrast, in conventional systems this can take several 10's of seconds. Thus, the fast heat-up rate minimizes the time needed for the wafer to reach the equilibrium temperature.
Also, particle generation is typically low. This is because the preferred large flow above the wafer prevents foreign material from penetrating the space above the wafer. Moreover, by preventing deposition on the blocks facing a wafer, build up of films on that surface is prevented, thus preventing formation of particles.
The cost of processing according to the preferred embodiments is advantageously low. The main reasons for this are that no vacuum is required, and, given the relatively low temperatures involved, easy to machine materials (e.g., aluminum) can be used for the reactor.
The footprint of the preferred reactors is also advantageously small. For example, the current Levitor® LT module has a width of about 45 cm and a depth of about 75 cm. In addition, two LT modules can be placed above one another, further minimizing the area occupied by the reactors.
In an advantageously simple embodiment of the invention, the apparatus described above is operated at atmospheric pressure, so that the use of vacuum pumps can be omitted. It is not necessary, however, to operate the apparatus at atmospheric pressure. A pressure of about 10 mbar to about 100 mbar would still be high enough to have a wafer supported by a gas cushion. In addition, an advantage of reducing the pressure is that it results in an expansion of the gas flows and a more efficient purging of the reaction space.
It will be appreciated that while discussed with reference to a Levitor® reactor for ease of description, the preferred embodiments are not limited to such a reactor. For example, the gas distribution system and the heating arrangement described above can readily be applied to other reactors to similar advantageous effect.
It will also be appreciated that although the invention has been described in terms of certain preferred embodiments, various other combinations, omissions, substitutions and modifications can be made to the embodiments described above without departing from the scope of the invention. All such modifications and changes are intended to fall within the scope of the invention. Accordingly, the present invention is not intended to be limited by the description of the preferred embodiments, but instead is to be defined by referenced to the appended claims.