![]() | This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(March 2014) (Learn how and when to remove this message) |
POWER,PowerPC, andPower ISA architectures |
---|
NXP (formerly Freescale and Motorola) |
IBM |
|
IBM/Nintendo |
Other |
Related links |
Cancelled in gray,historic in italic |
TheIBM RS64 is a family ofmicroprocessors introduced in the mid 1990s, and used in IBM'sRS/6000 andAS/400servers.
These microprocessors implement the "Amazon", or "PowerPC-AS",instruction set architecture (ISA). Amazon is a superset of thePowerPCinstruction set, with the addition of special features not in the PowerPC specification, mainly derived fromPOWER2[citation needed] and the original AS/400 processor, and has been64-bit from the start. The processors in this family are optimized for commercial workloads (integer performance, largecaches, frequentbranches) and do not feature the strongfloating point performance of the processors in thePOWER family, its sibling.
The RS64 family was phased out soon after the introduction of thePOWER4, which was developed to unite the RS64 and POWER families.
In 1990, theAS/400 engineering team atIBM Rochester began work on a new architecture known asC-RISC (CommercialRISC) to replace the IMPI architecture of the AS/400.[1][2] C-RISC was an evolution of the IMPI instruction set, extending theaddress space to 96 bits and adding some RISC instructions to speed up the more computationally intensive commercial applications that were being created for AS/400s. IBM presidentJack Kuehler wanted them to use PowerPC, but they resisted, arguing that the existing 32/64-bit PowerPC instruction set would not enable a viable transition for OS/400 software and that the existing instruction set required extensions for the commercial applications on the AS/400. At Kuehler's insistence, a team at Rochester led byFrank Soltis investigated the feasibility of extending the PowerPC instruction set to support the needs of the AS/400 platform. These extensions became known asAmazon and were selected by IBMexecutive management for further development over continued development of C-RISC.[3]
At the same time, theRS/6000 developers were broadly expanding their product line to include systems which spanned from low-endworkstations, tomainframe-competitor large enterprise SMP systems, to clusteredRS/6000-SP2 supercomputing systems. PowerPC processors developed in theAIM alliance suited the low-end RISC workstation and small server space well. But mainframe and large clustered supercomputing systems required more performance andreliability, availability and serviceability features than processors designed forApple Power Macs. Multiple processor designs were required to simultaneously meet the requirements of the cost-focused Apple Power Mac, high-performance and RAS RS/6000 systems, and the AS/400 transition to PowerPC.
Amazon was extended to support those features as well, so that processors could be designed for use in both high-end RS/6000 and AS/400 machines.
The project to develop the first such processor was "Bellatrix" (the name ofa star in the Orion constellation, also called the "Amazon Star"). The Bellatrix project was extremely ambitious in its pervasive use of self-timed & pulse-based circuits and the EDA tools required to support this design strategy, and was eventually terminated. To address technical workstation,supercomputer, and engineering/scientific markets, IBM Austin (the home of the RS/6000s) then started developing a time-to-market single-chip version of the Power2 (P2SC) in parallel with the development of a sophisticated 64-bit PowerPC processor with the POWER2 extensions and twin sophisticated MAFfloating-point units (the POWER3/630). To address RS/6000 commercial applications and AS/400 systems, IBM Rochester (the home of the AS/400s) started developing the first of the high-end 64-bit PowerPC processors with AS/400 extensions, and IBM Endicott started developing a low-end single-chip PowerPC processor with AS/400 extensions.
In 1995, IBM released theCobra, orA10 processor, the first full implementation of PowerPC AS, for the IBMAS/400 systems. It was a single-chip processor running at 50-77 MHz. It was designed with a semi-custom methodology, as a consequence of time-to-market constraints. The die contains 4.7 milliontransistors and measures 14.6 mm by 14.6 mm (213 mm2). It was fabricated by IBM in their CMOS 5L process, a 0.5 μm, four-layer-metal CMOS process. It used a 3.0 V power supply and dissipated 17.7 W maximum, 13.4 W minimum at 77 MHz. It was packaged in a 625-contactceramic ball grid array (CBGA) that measured 32 mm by 32 mm.
Cobra was preceded by a simplified implementation known asCobra-Lite in 1994, which was used in the firstIBM Advanced/36 systems.[1] It lacked 17 instructions from the full PowerPC AS ISA which were not needed for the Advanced/36.
In 1996, IBM released the high-end, 4-waySMP, multi-chip version calledMuskie,A25 orA30 in AS/400 systems. It ran at 125-154 MHz. It was manufactured on aBiCMOS fabrication process.
These processors were only used in AS/400 and Advanced/36 machines.
TheRS64 orApache was introduced in 1997. It was developed from "Cobra" and "Muskie" but included a more complete PowerPC ISA and was therefore set to be used inRS/6000 machines as well as in AS/400 systems. It featured 128KB total on-dieL1 cache, 4MB full speed off-chip L2 on a 128-bit bus, and a clock of 125 MHz. It scaled to a 12-processor SMP configuration in IBM's machines.
RS64 was calledA35 in AS/400 and was one time referred to as thePowerPC 625, between the defunctPowerPC 620 and the PowerPC 630 (later renamedPOWER3).
It was manufactured with a BiCMOS fabrication process.
TheRS64-II orNorthstar was introduced at 262 MHz in 1998 with 8 MB of full speed L2 on a 256 bit 6XX bus (also used inPowerPC 620 andPOWER3). Processor boards containing 4 RS64-II's could be swapped into machines designed for similar 4-way RS64 boards, avoiding a "fork lift upgrade". The RS64-II contained 12.5 million transistors, was 162 mm² large and drew 27 watts maximum power. Manufacturing changed to a 0.35 μmCMOS fabrication.
RS64-II was the first mass-market processor to implementmultithreading. Essentially, each chip stores state information for 2 threads at any given time and appears to be two processors to the OS. One logical processor runs what is called the foreground thread. When this thread encounters a high-latency event (L2 cache miss, etc.) the background thread is switched to, on the second logical processor from the OS's point of view. In the event of a "less long" latency event (L1 miss, etc.),thread switching will only occur if the background thread is ready to execute. If the background thread is also waiting for a miss, thread switching will not occur. IBM calls this scheme "coarse grained multithreading". It is not exactly the same thing assimultaneous multithreading as found on laterPentium 4 processors. An IBM paper notes that the coarse grained scheme is a better fit for an in-ordermicroarchitecture like RS64.
RS64-II was calledA50 in AS/400 systems.
TheRS64-III orPulsar was introduced in 1999 at 450 MHz. Key changes included larger 128 KiB L1 instruction and data caches, improvedbranch prediction accuracy and reduced branch misprediction penalties of zero or one cycle. The RS64-III has a five-stagepipeline and a 256-bit wide L2 cache bus, which provided the processor with 14.4 GB/s of bandwidth from the 8 MiB L2 cache, implemented with 225 MHz DDR SRAMs.
The RS64-III has 34 million transistors, a die size of 140 mm², and is manufactured on the 0.22 μm CMOS 7S process with six levels ofcopper interconnect.
In 2000, IBM launched a refined version calledIStar manufactured with aSOIfabrication process with copper interconnects, which increased the processor's clock frequency to 600 MHz. This was the first processor implemented in this process. Architecturally however, the IStar was identical to Pulsar.
TheRS64-IV orSstar was introduced in 2000 at 600 MHz, later increased to 750 MHz. Up to 16 MB DDR L2 cache was supported in the same manner as the RS64-III (19.2 GB/s bandwidth). The RS64-IV had 44 million transistors and was 128 mm² large manufactured on a 0.18 μm process. Unlike POWER, energy consumption remained low, at under 15 watts per core.
For a time, while the POWER line stagnated at half the clock speed of its competitors, the RS64 family was at the top of the IBM large SMPUNIX server line. The integer / commercial workload performance of the RS-64 IV was similar to theSun Microsystems processors with which it competed, though its floating point power was not comparable to the contemporaryPOWER3-II, which remained reasonably competitive throughout its lifecycle.