Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Steamroller (microarchitecture)

From Wikipedia, the free encyclopedia
Microarchitecture by AMD
Steamroller - Family 15h (3rd-gen)
General information
LaunchedJanuary 14, 2014; 11 years ago (January 14, 2014)
Common manufacturer
Architecture and classification
Technology node28 nm SHP[1]
Instruction setAMD64 (x86-64)
Physical specifications
Sockets
Products, models, variants
Core name
History
PredecessorPiledriver - Family 15h (2nd-gen)
SuccessorExcavator - Family 15h (4th-gen)
Support status
iGPU unsupported

AMD Steamroller Family 15h is amicroarchitecture developed byAMD forAMD APUs, which succeededPiledriver in the beginning of 2014 as the third-generationBulldozer-based microarchitecture.[2] Steamroller APUs continue to use two-core modules as their predecessors, while aiming at achieving greater levels of parallelism.

Microarchitecture

[edit]

Steamroller still features two-core modules found inBulldozer andPiledriver designs calledclustered multi-thread (CMT), meaning that one module is marketed as a dual-core processor.[3] The focus ofSteamroller is for greater parallelism.[4] Improvements center on independent instruction decoders for each core within a module, 25% more of the maximum width dispatches per thread, better instruction schedulers, improved perceptron branch predictor, larger and smarter caches, up to 30% fewer instruction cache misses, branch misprediction rate reduced by 20%, dynamically resizable L2 cache, micro-operations queue,[5] more internal register resources and improved memory controller.

AMD estimated that these improvements will increaseinstructions per cycle (IPC) up to 30% compared to the first-generation Bulldozer core while maintaining Piledriver's high clock rates with decreased power consumption.[3] The final result was a 9% single-threaded IPC improvement, and 18% multi-threaded IPC improvement over Piledriver.[6]

Steamroller, the microarchitecture for CPUs, as well asGraphics Core Next, the microarchitecture for GPUs, are paired together in the APU lines to support features specified inHeterogeneous System Architecture.

History

[edit]

In 2011, AMD announced a third-generation Bulldozer-based line of processors for 2013,[7] withNext Generation Bulldozer as the working title, using the 28 nm manufacturing process.[8]

On 21 September 2011, leaked AMD slides indicated that this third generation of Bulldozer core was codenamedSteamroller.[9][10]

In January 2014, the firstKaveri APUs became available.[11]

Starting from May 2015 till March 2016 new APUs were launched as Kaveri-refresh (codenamed Godavari).[12]

Features

[edit]

The following table shows features ofAMD's processors with 3D graphics, includingAPUs (see also:List of AMD processors with 3D graphics).

[ VisualEditor ]
PlatformHigh, standard and low powerLow and ultra-low power
CodenameServerBasicToronto
MicroKyoto
DesktopPerformanceRaphaelPhoenix
MainstreamLlanoTrinityRichlandKaveriKaveri Refresh (Godavari)CarrizoBristol RidgeRaven RidgePicassoRenoirCezanne
Entry
BasicKabiniDalí
MobilePerformanceRenoirCezanneRembrandtDragon Range
MainstreamLlanoTrinityRichlandKaveriCarrizoBristol RidgeRaven RidgePicassoRenoir
Lucienne
Cezanne
Barceló
Phoenix
EntryDalíMendocino
BasicDesna, Ontario, ZacateKabini, TemashBeema, MullinsCarrizo-LStoney RidgePollock
EmbeddedTrinityBald EagleMerlin Falcon,
Brown Falcon
Great Horned OwlGrey HawkOntario, ZacateKabiniSteppe Eagle,Crowned Eagle,
LX-Family
Prairie FalconBanded KestrelRiver Hawk
ReleasedAug 2011Oct 2012Jun 2013Jan 20142015Jun 2015Jun 2016Oct 2017Jan 2019Mar 2020Jan 2021Jan 2022Sep 2022Jan 2023Jan 2011May 2013Apr 2014May 2015Feb 2016Apr 2019Jul 2020Jun 2022Nov 2022
CPUmicroarchitectureK10PiledriverSteamrollerExcavator"Excavator+"[13]ZenZen+Zen 2Zen 3Zen 3+Zen 4BobcatJaguarPumaPuma+[14]"Excavator+"ZenZen+"Zen 2+"
ISAx86-64 v1x86-64 v2x86-64 v3x86-64 v4x86-64 v1x86-64 v2x86-64 v3
SocketDesktopPerformanceAM5
MainstreamAM4
EntryFM1FM2FM2+FM2+[a],AM4AM4
BasicAM1FP5
OtherFS1FS1+,FP2FP3FP4FP5FP6FP7FL1FP7
FP7r2
FP8
FT1FT3FT3bFP4FP5FT5FP5FT6
PCI Express version2.03.04.05.04.02.03.0
CXL
Fab. (nm)GF32SHP
(HKMGSOI)
GF28SHP
(HKMG bulk)
GF14LPP
(FinFET bulk)
GF12LP
(FinFET bulk)
TSMCN7
(FinFET bulk)
TSMCN6
(FinFET bulk)
CCD: TSMCN5
(FinFET bulk)

cIOD: TSMCN6
(FinFET bulk)
TSMC4nm
(FinFET bulk)
TSMCN40
(bulk)
TSMCN28
(HKMG bulk)
GF 28SHP
(HKMG bulk)
GF14LPP
(FinFET bulk)
GF12LP
(FinFET bulk)
TSMCN6
(FinFET bulk)
Die area (mm2)228246245245250210[15]156180210CCD: (2x) 70
cIOD: 122
17875(+ 28FCH)107?125149~100
MinTDP (W)351712101565354.543.95106128
Max APUTDP (W)10095654517054182565415
Max stock APU base clock (GHz)33.84.14.13.73.83.63.73.84.03.34.74.31.752.222.23.22.61.23.352.8
Max APUs per node[b]11
Max core dies per CPU1211
Max CCX per core die1211
Max cores per CCX482424
MaxCPU[c]cores per APU481682424
Maxthreads per CPU core1212
Integer pipeline structure3+32+24+24+2+11+3+3+1+21+1+1+12+24+24+2+1
i386, i486, i586, CMOV, NOPL, i686,PAE,NX bit, CMPXCHG16B,AMD-V,RVI,ABM, and 64-bit LAHF/SAHFYesYes
IOMMU[d]v2v1v2
BMI1,AES-NI,CLMUL, andF16CYesYes
MOVBEYes
AVIC,BMI2,RDRAND, and MWAITX/MONITORXYes
SME[e],TSME[e],ADX,SHA,RDSEED,SMAP,SMEP, XSAVEC, XSAVES, XRSTORS, CLFLUSHOPT, CLZERO, and PTE CoalescingYesYes
GMET, WBNOINVD, CLWB, QOS, PQE-BW, RDPID, RDPRU, and MCOMMITYesYes
MPK,VAESYes
SGX
FPUs percore10.5110.51
Pipes per FPU22
FPU pipe width128-bit256-bit80-bit128-bit256-bit
CPUinstruction setSIMD levelSSE4a[f]AVXAVX2AVX-512SSSE3AVXAVX2
3DNow!3DNow!+
PREFETCH/PREFETCHWYesYes
GFNIYes
AMX
FMA4, LWP,TBM, andXOPYesYes
FMA3YesYes
AMD XDNAYes
L1 data cache per core (KiB)64163232
L1 data cacheassociativity (ways)2488
L1 instruction caches percore10.5110.51
Max APU total L1 instruction cache (KiB)2561281922565122566412896128
L1 instruction cacheassociativity (ways)23482348
L2 caches percore10.5110.51
Max APU total L2 cache (MiB)424161212
L2 cacheassociativity (ways)168168
Max on-dieL3 cache per CCX (MiB)416324
Max 3D V-Cache per CCD (MiB)64
Max total in-CCDL3 cache per APU (MiB)4816644
Max. total 3D V-Cache per APU (MiB)64
Max. boardL3 cache per APU (MiB)
Max totalL3 cache per APU (MiB)48161284
APU L3 cacheassociativity (ways)1616
L3 cache schemeVictimVictim
Max.L4 cache
Max stockDRAM supportDDR3-1866DDR3-2133DDR3-2133,DDR4-2400DDR4-2400DDR4-2933DDR4-3200,LPDDR4-4266DDR5-4800,LPDDR5-6400DDR5-5200DDR5-5600,LPDDR5x-7500DDR3L-1333DDR3L-1600DDR3L-1866DDR3-1866,DDR4-2400DDR4-2400DDR4-1600DDR4-3200LPDDR5-5500
MaxDRAM channels per APU21212
Max stockDRAMbandwidth (GB/s) per APU29.86634.13238.40046.93268.256102.40083.200120.00010.66612.80014.93319.20038.40012.80051.20088.000
GPUmicroarchitectureTeraScale 2 (VLIW5)TeraScale 3 (VLIW4)GCN 2nd genGCN 3rd genGCN 5th gen[16]RDNA 2RDNA 3TeraScale 2 (VLIW5)GCN 2nd genGCN 3rd gen[16]GCN 5th genRDNA 2
GPUinstruction setTeraScale instruction setGCN instruction setRDNA instruction setTeraScale instruction setGCN instruction setRDNA instruction set
Max stock GPU base clock (MHz)60080084486611081250140021002400400538600?847900120060013001900
Max stock GPU baseGFLOPS[g]480614.4648.1886.71134.517601971.22150.43686.4102.486???345.6460.8230.41331.2486.4
3D engine[h]Up to 400:20:8Up to 384:24:6Up to 512:32:8Up to 704:44:16[17]Up to 512:32:8768:48:8128:8:480:8:4128:8:4Up to 192:12:8Up to 192:12:4192:12:4Up to 512:?:?128:?:?
IOMMUv1IOMMUv2IOMMUv1?IOMMUv2
Video decoderUVD 3.0UVD 4.2UVD 6.0VCN 1.0[18]VCN 2.1[19]VCN 2.2[19]VCN 3.1?UVD 3.0UVD 4.0UVD 4.2UVD 6.2VCN 1.0VCN 3.1
Video encoderVCE 1.0VCE 2.0VCE 3.1VCE 2.0VCE 3.4
AMD Fluid MotionNoYesNoNoYesNo
GPU power savingPowerPlayPowerTunePowerPlayPowerTune[20]
TrueAudioYes[21]?Yes
FreeSync1
2
1
2
HDCP[i]?1.42.22.3?1.42.22.3
PlayReady[i]3.0 not yet3.0 not yet
Supported displays[j]2–32–433 (desktop)
4 (mobile, embedded)
42344
/drm/radeon[k][23][24]YesYes
/drm/amdgpu[k][25]Yes[26]Yes[26]
  1. ^For FM2+ Excavator models: A8-7680, A6-7480 & Athlon X4 845.
  2. ^A PC would be one node.
  3. ^An APU combines a CPU and a GPU. Both have cores.
  4. ^Requires firmware support.
  5. ^abRequires firmware support.
  6. ^No SSE4. No SSSE3.
  7. ^Single-precision performance is calculated from the base (or boost) core clock speed based on aFMA operation.
  8. ^Unified shaders :texture mapping units :render output units
  9. ^abTo play protected video content, it also requires card, operating system, driver, and application support. A compatible HDCP display is also needed for this. HDCP is mandatory for the output of certain audio formats, placing additional constraints on the multimedia setup.
  10. ^To feed more than two displays, the additional panels must have nativeDisplayPort support.[22] Alternatively active DisplayPort-to-DVI/HDMI/VGA adapters can be employed.
  11. ^abDRM (Direct Rendering Manager) is a component of the Linux kernel. Support in this table refers to the most current version.

Processors

[edit]

APU lines

[edit]
Main articles:"Kaveri" desktop processors and"Kaveri" mobile processors
  1. Kaveri A-series APU
  2. Berlin APU - canceled
    • Announced in 2013 by AMD[36] theBerlin APU were targeted at the enterprise and server markets featuring fourSteamroller cores, up to 512 stream processors and support forECC memory.

FX lines (discontinued)

[edit]
Main article:List of AMD FX microprocessors

In November 2013 AMD confirmed it would not update theFX series in 2014, neither itsSocket AM3+ version, nor will it receive aSteamroller version with a new socket.[37][38]

AMD however, released a Kaveri based FX-770K for desktop and FX-7600P for mobile which are basically APUs with their integrated graphics disabled similar to the Athlon X4 FM2+ line. Those APUs were released for OEMs only.

Server lines (canceled)

[edit]
Main article:List of AMD Opteron microprocessors

AMD's server roadmaps for 2014 showed:[39][40]

  • Berlin APU - quad-core x86 Steamroller architecture (as described above) for 1 Processor (1P) compute and media clusters
  • Berlin CPU - quad-core x86 Steamroller architecture for 1P web and enterprise services clusters
  • Seattle CPU - 4/8 coreAArch64Cortex-A57 architecture (Opteron A1100) for 1P web and enterprise services clusters[41]
  • Warsaw CPU - up to 16 corex86Piledriver (2nd gen Bulldozer) architecture (Opteron 6338P and 6370P) for 2P/4P servers[42]

However, plans forSteamroller Opteron products were cancelled, likely due to the poorenergy efficiency achieved in this generation of theBulldozer architecture. Energy efficiency was greatly increased in the following generation,Excavator, which exceededJaguar in performance per watt, and approximately doubled performance/watt overSteamroller (for example 20.74 pt/W vs 10.85 pt/W when comparing similar mobile APUs using rough arbitrary metrics).[43][44]

References

[edit]
  1. ^"Page 2 - AMD Kaveri A10-7850K and A8-7600 review: Was it worth the wait for the first true heterogeneous chip?". ExtremeTech. Retrieved2014-02-19.
  2. ^"AMD Kaveri Review: A8-7600 and A10-7850K Tested". Anandtech.com. 2014-01-14. Archived fromthe original on January 15, 2014. Retrieved2014-02-08.
  3. ^ab"AMD: We Are On Track With Steamroller Micro-Architecture in 2013". X-bit labs. 2013-03-31. Archived fromthe original on 2013-10-25. Retrieved2013-09-29.
  4. ^Su, Lisa (2012-02-02)."Consumerization, Cloud, Convergence"(PDF).AMD 2012 Financial Analyst Day. Sunnyvale, California: Advanced Micro Devices. p. 26. Retrieved2012-02-04.
  5. ^Anand Lal Shimpi (2012-08-28)."AMD's Steamroller Detailed: 3rd Generation Bulldozer Core". Archived fromthe original on August 30, 2012. Retrieved2013-11-16.
  6. ^Miller, Michael J. (2014-02-14)."Ivytown, Steamroller, 14 and 16nm Process Highlight ISSCC". Forwardthinking.pcmag.com. Retrieved2014-02-19.
  7. ^Anton Shilov (2010-11-09)."AMD Plans to Release Twenty-Core Microprocessor in 2012". X-bit labs. Archived fromthe original on 2012-02-05. Retrieved2012-01-23.
  8. ^"2012 Financial Analyst Day". 2012-02-02. Archived fromthe original on 2014-09-06. Retrieved2013-09-29.
  9. ^"Hosszútávú mobil útiterv szivárgott ki az AMD-től - PROHARDVER! Processzor hír". Prohardver.hu. 2011-09-21. Retrieved2012-01-23.
  10. ^"Nuove roadmap AMD sulle future APU in programma nel 2012 e nel 2013 per il mercato mobile". 2011-09-21. Archived fromthe original on 2013-01-11. Retrieved2012-01-23.
  11. ^Joel Hruska (2014-01-14)."AMD Kaveri A10-7850K and A8-7600 review: Was it worth the wait for the first true heterogeneous chip?". extremetech.com. Retrieved2014-01-17.
  12. ^"12 款 APU 及 CPU 準備出發,「Godavari」為 AMD 產品新代號".VR-Zone. Archived fromthe original on 11 February 2017. Retrieved8 February 2017.
  13. ^"AMD Announces the 7th Generation APU: Excavator mk2 in Bristol Ridge and Stoney Ridge for Notebooks". 31 May 2016. Retrieved3 January 2020.
  14. ^"AMD Mobile "Carrizo" Family of APUs Designed to Deliver Significant Leap in Performance, Energy Efficiency in 2015" (Press release). 20 November 2014. Retrieved16 February 2015.
  15. ^"The Mobile CPU Comparison Guide Rev. 13.0 Page 5 : AMD Mobile CPU Full List". TechARP.com. Retrieved13 December 2017.
  16. ^ab"AMD VEGA10 and VEGA11 GPUs spotted in OpenCL driver". VideoCardz.com. Retrieved6 June 2017.
  17. ^Cutress, Ian (1 February 2018)."Zen Cores and Vega: Ryzen APUs for AM4 – AMD Tech Day at CES: 2018 Roadmap Revealed, with Ryzen APUs, Zen+ on 12nm, Vega on 7nm". Anandtech. Retrieved7 February 2018.
  18. ^Larabel, Michael (17 November 2017)."Radeon VCN Encode Support Lands in Mesa 17.4 Git". Phoronix. Retrieved20 November 2017.
  19. ^ab"AMD Ryzen 5000G 'Cezanne' APU Gets First High-Res Die Shots, 10.7 Billion Transistors In A 180mm2 Package".wccftech. Aug 12, 2021. RetrievedAugust 25, 2021.
  20. ^Tony Chen; Jason Greaves,"AMD's Graphics Core Next (GCN) Architecture"(PDF),AMD, retrieved13 August 2016
  21. ^"A technical look at AMD's Kaveri architecture". Semi Accurate. Retrieved6 July 2014.
  22. ^"How do I connect three or More Monitors to an AMD Radeon™ HD 5000, HD 6000, and HD 7000 Series Graphics Card?". AMD. Retrieved8 December 2014.
  23. ^Airlie, David (26 November 2009)."DisplayPort supported by KMS driver mainlined into Linux kernel 2.6.33". Retrieved16 January 2016.
  24. ^"Radeon feature matrix".freedesktop.org. Retrieved10 January 2016.
  25. ^Deucher, Alexander (16 September 2015)."XDC2015: AMDGPU"(PDF). Retrieved16 January 2016.
  26. ^abMichel Dänzer (17 November 2016)."[ANNOUNCE] xf86-video-amdgpu 1.2.0".lists.x.org.
  27. ^"AMD Unleashes More Details About Kaveri: HSA, TrueAudio, Mantle". Archived fromthe original on 2016-08-04. Retrieved2013-11-16.
  28. ^"AMD's Next-Gen "Kaveri" APUs Will Require New Mainboards". 30 May 2013. Archived fromthe original on 7 June 2013. Retrieved9 June 2013.
  29. ^"A technical look at AMD's Kaveri architecture".SemiAccurate. 15 January 2014.
  30. ^"AMD Kaveri APU Architecture Detailed". 4 July 2013.
  31. ^"Multi-monitor: Civilization V on A10-7850K "Kaveri"".YouTube.
  32. ^"AMD A8-7600 Kaveri APU review - The Embedded GPU - HSA & hUMA". 14 January 2014.
  33. ^"AMD A10-7850K Graphics Performance". 14 February 2014. Retrieved2 April 2014.
  34. ^"AMD to add ARM processors to boost chip security". 14 June 2012. Retrieved3 September 2013.
  35. ^"AMD and ARM Fusion redefine beyond x86". Retrieved10 November 2013.
  36. ^"AMD Berlin Server APU Provides Glimpse At Upcoming Kaveri APU With 4 Steamroller Cores and 512 GCN SPs". 19 June 2013. Retrieved29 September 2013.
  37. ^Anton Shilov (2013-11-13)."AMD Cans Plans to Introduce Next-Gen FX Microprocessors Next Year". xbitlabs.com. Archived fromthe original on 2019-12-01. Retrieved2013-11-16.
  38. ^Josh Walrath (2013-09-04)."AMD's Processor Shift: The Future Really is Fusion". Retrieved2013-09-29.
  39. ^"Berlin, Warsaw are the future of AMD's x86 server lineup". The Tech Report. 2013-06-18. Retrieved2013-09-29.
  40. ^Mujtaba, Hassan (December 26, 2013)."AMD Opteron Roadmap Reveals Next Generation Toronto and Carrizo APU Details". WCCF Tech. RetrievedJanuary 15, 2015.
  41. ^Mendoza, Menchie (August 13, 2014)."AMD unwraps 64-bit ARM 'Seattle' server chip". Tech Times. RetrievedJanuary 15, 2015.
  42. ^Gasior, Geoff (January 22, 2014)."16-coreWarsaw CPUs added to Opteron lineup". The Tech Report. RetrievedJanuary 15, 2015.
  43. ^"Opteron X2150 vs A10 8700P". cpuboss.com. Archived fromthe original on November 16, 2016. RetrievedNovember 15, 2016.
  44. ^"AMD A10 8700P vs 7300". cpuboss.com. Archived fromthe original on November 27, 2016. RetrievedNovember 15, 2016.
Lists
Microarchitectures
IA-32 (32-bit)
x86-64 desktop
x86-64 low-power
ARM64
Current products
x86-64 (64-bit)
Discontinued
Early x86 (16-bit)
IA-32 (32-bit)
x86-64 (64-bit)
Other
Italics indicates an upcoming architecture.
AMD CPU core roadmaps fromK7 toZen
Turion / ULVNode range
label
x86
Microarchi.StepMicroarchi.Step
180 nmK7Athlon Classic
Thunderbird
Palomino
130 nmThoroughbred
Barton/Thorton
K8ClawHammer
Newcastle
SledgeHammer
K8LLancaster90 nmWinchesterK8(×2)K9
RichmondSan DiegoToledoGreyhound
Taylor / TrinidadWindsor
Tyler65 nmOrleansBrisbane
LionK10Phenom4 cores on mainstream desktop,DDR3 introduced
Caspian45 nmPhenom II /Athlon II6 cores on mainstream desktop
14hBobcat40 nm
32 nmK10Lynx
LlanoAPU introduced; CPU and GPU on single die
Bulldozer 15hBulldozer8 cores on mainstream desktop
Piledriver
16hJaguar28 nmSteamrollerAPU/mobile-only
PumaExcavatorAPU/mobile-only,DDR4 introduced
K12K12 (ARM64)14 nmZenZenSMT introduced
12 nmZen+
7 nmZen 212 and 16 cores on mainstream desktop,chiplet design
Zen 33D V-Cache variants introduced
6 nmZen 3+Mobile-only,DDR5 introduced
5 nm / 4 nmZen 4High core density "Cloud" (Zenxc) variants introduced
4 nm /3 nmZen 5
3 nm /2 nmZen 6
2 nmZen 7
  • Strike-through indicates cancelled processors
  • Bold names are the microarchitecture names
  • Italic names are future processors
Retrieved from "https://en.wikipedia.org/w/index.php?title=Steamroller_(microarchitecture)&oldid=1304464696"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp