Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

NetBurst

From Wikipedia, the free encyclopedia
Intel processor microarchitecture
icon
This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "NetBurst" – news ·newspapers ·books ·scholar ·JSTOR
(November 2013) (Learn how and when to remove this message)

NetBurst
General information
LaunchedNovember 20, 2000; 25 years ago (November 20, 2000)
Performance
Max.CPUclock rate1.3 GHz to 3.8 GHz
FSB speeds100 MT/s to 1066 MT/s
Cache
L1cache8 KB to 16 KB per core
L2 cache128 KB to 4096 KB
L3 cache4 MB to 16 MB shared
Architecture and classification
MicroarchitectureNetBurst
Instruction setx86-16,IA-32,
x86-64 (some)
Extensions
Physical specifications
Transistors
Cores
  • 1-2 (2-4 threads with hyper-threading)
Sockets
Products, models, variants
Models
  • Celeron
  • Celeron D
  • Pentium 4
  • Pentium D
  • Xeon
History
PredecessorP6
SuccessorsIntel Core
IA-64

TheNetBurst microarchitecture,[1][2] calledP68 insideIntel, was the successor to theP6 microarchitecture in thex86 family ofcentral processing units (CPUs) made by Intel. The first CPU to use this architecture was theWillamette-core Pentium 4, released on November 20, 2000, and the first of thePentium 4 CPUs; all subsequent Pentium 4 andPentium D variants have also been based on NetBurst. In mid-2001, Intel released theFoster core, which was also based on NetBurst, thus switching theXeon CPUs to the new architecture as well. Pentium 4-basedCeleron CPUs also use the NetBurst architecture. It was discontinued in 2010 and replaced with theCore microarchitecture based on P6, released in July 2006.

Technology

[edit]

The NetBurst microarchitecture includes features such asHyper-threading,Hyper Pipelined Technology,Rapid Execution Engine,Execution Trace Cache, andreplay system which all were introduced for the first time in this particular microarchitecture, and some never appeared again afterwards.

Hyper-threading

[edit]
Main article:Hyper-threading

Hyper-threading is Intel's proprietarysimultaneous multithreading (SMT) implementation used to improve parallelization of computations (doing multiple tasks at once) performed on x86 processors. Intel introduced it with NetBurst processors in 2002. Later Intel reintroduced it in theNehalem microarchitecture after its absence in the Core 2.

Quad-Pumped Front-Side Bus

[edit]

The Northwood and Willamette cores feature an external Front Side Bus (FSB) that runs at 100 MHz which transfers four bits per clock cycle, thus having an effective speed of 400 MHz. Later revisions of the Northwood core, along with the Prescott core (and derivatives) have an effective 800 MHz front-side bus (200 MHz quad pumped).[1]

Hyper-Pipelined Technology

[edit]

The Willamette and Northwood cores contain a 20-stageinstruction pipeline. This is a significant increase in the number of stages compared to the Pentium III, which had only 10 stages in its pipeline. The Prescott core increased the length of the pipeline to 31 stages. A drawback of longer pipelines is the increase in the number of stages that need to be traced back in the event of a branch misprediction, increasing the penalty of said misprediction. To address this issue, Intel devised the Rapid Execution Engine and has invested a great deal into its branch prediction technology, which Intel claims reducesbranch mispredictions by 33% overPentium III.[3] In reality, the longer pipeline resulted in reduced efficiency through a lower number ofinstructions per clock (IPC) executed as high enough clock speeds were not able to be reached to offset lost performance due to larger than expected increase in power consumption and heat.

Rapid Execution Engine

[edit]

With this technology, the twoarithmetic logic units (ALUs) in the core of the CPU are double-pumped, meaning that they actually operate at twice the core clock frequency. For example, in a 3.8 GHz processor, the ALUs will effectively be operating at 7.6 GHz. The reason behind this is to generally make up for the low IPC count; additionally this considerably enhances the integer performance of the CPU. Intel also replaced the high-speedbarrel shifter with a shift/rotate execution unit that operates at the same frequency as the CPU core. The downside is that certain instructions are now much slower (relatively and absolutely)[note 1] than before, making optimization for multiple target CPUs difficult. An example is shift and rotate operations, which suffer from the lack of a barrel shifter which was present on every x86 CPU beginning with the i386, including the main competitor processor,Athlon.

Execution Trace Cache

[edit]
Main article:Trace cache

Within the L1 cache of the CPU, Intel incorporated its Execution Trace Cache. It stores decodedmicro-operations, so that when executing a new instruction, instead of fetching and decoding the instruction again, the CPU directly accesses the decoded micro-ops from the trace cache, thereby saving considerable time. Moreover, the micro-ops are cached in their predicted path of execution, which means that when instructions are fetched by the CPU from the cache, they are already present in the correct order of execution.[4] Intel later introduced a similar but simpler concept withSandy Bridge calledmicro-operation cache (UOP cache).

Replay system

[edit]
Main article:Replay system

The replay system is a subsystem within the Intel Pentium 4 processor to catch operations that have been mistakenly sent for execution by the processor's scheduler. Operations caught by the replay system are then re-executed in a loop until the conditions necessary for their proper execution have been fulfilled.

Branch prediction hints

[edit]

The Intel NetBurst architecture allowsbranch prediction hints to be inserted into the code to tell whether the static prediction should be taken or not taken, while this feature was abandoned in later Intel processors. According to Intel, NetBurst's branch prediction algorithm is 33% better than the one in P6.[5][6]

Issues in development and scaling

[edit]

Despite these enhancements, the NetBurst architecture created obstacles for engineers trying to scale up its performance. With this microarchitecture, Intel planned to attain clock speeds of 10 GHz,[7] but because of rising clock speeds, Intel faced increasing problems with keeping power/heat dissipation within normal, usable function. Intel reached a speed barrier of 3.8 GHz in November 2004 but encountered problems in the process. After these issues, Intel moved to develop a successor to NetBurst, theCore microarchitecture, in 2006 after heat and other related problems started to pose an unbreakable barrier (inspired by the P6 Core of thePentium Pro to theTualatinPentium III-S, and most directly thePentium M.)

Revisions

[edit]
Main article:Pentium 4
RevisionProcessor Brand(s)Pipeline stages
Willamette (180 nm)Celeron, Pentium 4, Xeon20
Northwood (130 nm)Celeron, Pentium 4, Pentium 4 HT, Pentium 4 HT Extreme Edition, Xeon20
Prescott (90 nm)Celeron D, Pentium 4, Pentium 4 HT,
Pentium 4 HT Extreme Edition, Xeon
31
Cedar Mill (65 nm)Celeron D, Pentium 4 HT31
Smithfield (90 nm)Pentium D, Xeon31
Presler (65 nm)Pentium D, Xeon31

Intel replaced the originalWillamette core with a redesigned version of the NetBurst microarchitecture calledNorthwood in January 2002. TheNorthwood design combined an increased cache size, a smaller 130 nm fabrication process, andHyper-threading (although initially all models but the 3.06 GHz model had this feature disabled) to produce a more modern, higher-performing version of the NetBurst microarchitecture.

In February 2004, Intel introducedPrescott, a more radical revision of the microarchitecture. ThePrescott core was produced on a 90 nm process, and included several major design changes, including the addition of an even larger cache (from 512 KB in theNorthwood to 1 MB, and 2 MB in Prescott 2M), a much deeperinstruction pipeline (31 stages as compared to 20 in theNorthwood), a heavily improvedbranch predictor, the introduction of theSSE3 instructions, and later, the implementation of Intel Extended Memory 64 Technology (EM64T), Intel's branding for their compatible implementation of thex86-64 64-bit version of thex86 microarchitecture (as with hyper-threading, allPrescott chips branded Pentium 4 HT have hardware to support this feature, but it was initially only enabled on the high-endXeon processors, before being officially introduced in processors with thePentium trademark). Power consumption and heat dissipation also became major issues withPrescott, which quickly became the hottest-running, and most power-hungry, of Intel's single-core x86 and x86-64 processors. Power and heat concerns prevented Intel from releasing a Prescott clocked above 3.8 GHz, along with a mobile version of the core clocked above 3.46 GHz.

Intel also released a dual-core processor based on the NetBurst microarchitecture branded Pentium D. The first Pentium D core was codenamedSmithfield, which is actually two Prescott cores in a single die, and laterPresler, which consists of twoCedar Mill cores on two separate dies (Cedar Mill being the 65 nm die-shrink ofPrescott).

Roadmap

[edit]
Intel CPU core roadmaps fromP6 to Panther Lake
Atom (ULV)Node namePentium/Core
Microarch.StepMicroarch.Step
600 nmP6Pentium Pro
(133 MHz)
500 nmPentium Pro
(150 MHz)
350 nmPentium Pro
(166–200 MHz)
Klamath
250 nmDeschutes
KatmaiNetBurst
180 nmCoppermineWillamette
130 nmTualatinNorthwood
Pentium MBaniasNetBurst(HT)NetBurst(×2)
90 nmDothanPrescottPrescott‑2MSmithfield
TejasCedarmill (Tejas)
65 nmYonahNehalem (NetBurst)Cedar MillPresler
CoreMerom4 cores on mainstream desktop,DDR3 introduced
BonnellBonnell45 nmPenryn
NehalemNehalemHT reintroduced, integratedMC, PCH
L3-cache introduced, 256 KB L2-cache/core
Saltwell32 nmWestmereIntroduced GPU on same package andAES-NI
Sandy BridgeSandy BridgeOn-die ring bus, no more non-UEFI motherboards
SilvermontSilvermont22 nmIvy Bridge
HaswellHaswellFully integrated voltage regulator
Airmont14 nmBroadwell
SkylakeSkylakeDDR4 introduced on mainstream desktop
GoldmontGoldmontKaby Lake
Coffee Lake6 cores on mainstream desktop
Amber LakeMobile-only
Goldmont PlusGoldmont PlusWhiskey LakeMobile-only
Coffee Lake Refresh8 cores on mainstream desktop
Comet Lake10 cores on mainstream desktop
Sunny CoveCypress Cove (Rocket Lake)Backported Sunny Cove microarchitecture for 14 nm
TremontTremont10 nmSkylakePalm Cove (Cannon Lake)Mobile-only
Sunny CoveSunny Cove (Ice Lake)512 KB L2-cache/core
Willow Cove (Tiger Lake)Xe graphics engine
GracemontGracemontIntel 7
(10 nm ESF)
Golden CoveGolden Cove (Alder Lake)Hybrid, DDR5, PCIe 5.0
Raptor Cove (Raptor Lake)
CrestmontCrestmontIntel 4Redwood CoveMeteor LakeMobile-only
NPU,chiplet architecture
Intel 3Arrow Lake-U
SkymontSkymontN3B (TSMC)Lion CoveLunar LakeLow power mobile only (9–30 W)
Arrow Lake
DarkmontDarkmontIntel 18ACougar CovePanther Lake
  • Strike-through indicates cancelled processors
  • Bold names are microarchitectures
  • Italic names are future processors

Successor

[edit]

Intel had NetBurst-based successors in development calledTejas and Jayhawk with between 40 and 50 pipeline stages, but ultimately decided to replace NetBurst with theCore microarchitecture,[8][9] released in July 2006; these successors were more directly derived from thePentium Pro (P6 microarchitecture). August 8, 2008 marked the end of Intel NetBurst-based processors.[10] The reason for NetBurst's abandonment was the severe heat problems caused by high clock speeds. While some Core- and Nehalem-based processors have higherTDPs, most processors are multi-core, so each core gives off a fraction of the maximum TDP, and the highest-clocked Core-based single-core processors give off a maximum of 27 W of heat. The fastest-clocked desktop Pentium 4 processors (single-core) had TDPs of 115 W, compared to 88 W for the fastest clocked mobile versions. Although, with the introduction of new steppings, TDPs for some models were eventually lowered.

The Nehalem microarchitecture, the successor to the Core microarchitecture, was supposed to be an evolution of NetBurst according to Intel roadmaps dating back to 2000.[citation needed] Nehalem reimplements certain features of NetBurst, including the Hyper-Threading technology first introduced in the 3.06 GHzNorthwood core, and L3 cache, first implemented on a consumer processor in theGallatin core used in the Pentium 4 Extreme Edition.

NetBurst-based chips

[edit]

See also

[edit]

Notes

[edit]
  1. ^"Relatively" as the number ofcycles needed for the instruction to complete, where "absolute" stands fortime it takes for the instruction to complete, after taking account of the clock speed. For example, an instruction taking 10 cycles on a 1 GHz processor takes 10 nanoseconds; it is thereforerelatively slower andabsolutely faster than an instruction taking 1 cycle on a 0.01 GHz (100 nanoseconds) processor.

References

[edit]
  1. ^Carmean, Doug (Spring 2002)."The Intel Pentium 4 Processor"(PDF).Intel. Archived fromthe original(PDF) on April 19, 2018.
  2. ^"Replay: Unknown Features of the NetBurst Core".XbitLabs. March 6, 2016. Archived fromthe original on March 6, 2016.
  3. ^"The Trace Cache Branch Prediction Unit".Intel's New Pentium 4 Processor.Tom's Hardware. November 20, 2000. RetrievedApril 30, 2021.
  4. ^"Entering The Execution Pipeline - Pentium 4's Trace Cache, Continued".Intel's New Pentium 4 Processor.Tom's Hardware. November 20, 2000. RetrievedApril 30, 2021.
  5. ^Fog, Agner (December 1, 2016)."The microarchitecture of Intel, AMD and VIA CPUs"(PDF). p. 36. RetrievedMarch 22, 2017.
  6. ^Milenkovic, Milena; Milenkovic, Aleksandar; Kulick, Jeffrey."Demystifying Intel Branch Predictors"(PDF).
  7. ^Shimpi, Anand Lal."The future of Intel's manufacturing processes". Archived fromthe original on June 18, 2011. RetrievedApril 4, 2018.
  8. ^"Intel says Adios to Tejas and Jayhawk chips".The Register.
  9. ^Goodwins, Rupert."Intel cancels Tejas and Jayhawk".ZDNet. RetrievedAugust 21, 2019.
  10. ^Shilov, Anton (May 21, 2007)."The Era of Intel's NetBurst Micro-Architecture Comes to End".XbitLabs. Archived fromthe original on October 17, 2015. RetrievedNovember 29, 2015.

External links

[edit]
Lists
Microarchitectures
IA-32 (32-bit x86)
x86-64 (64-bit)
x86ULV
Current products
x86-64 (64-bit)
Discontinued
BCD oriented (4-bit)
pre-x86 (8-bit)
Earlyx86 (16-bit)
x87 (externalFPUs)
8/16-bit databus
8087 (1980)
16-bit databus
80C187
80287
80387SX
32-bit databus
80387DX
80487
IA-32 (32-bit x86)
x86-64 (64-bit)
Other
Related
Retrieved from "https://en.wikipedia.org/w/index.php?title=NetBurst&oldid=1318349215"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp