Ada Lovelace (microarchitecture)

From Wikipedia, the free encyclopedia
GPU microarchitecture by Nvidia

Ada Lovelace
Launched: October 12, 2022
Designed by: Nvidia
Fabrication process: TSMC 4N
Codename(s): AD10x

Product series
Professional/workstation: RTX Ada Generation

Specifications
Clock rate: 735 MHz to 2640 MHz
L1 cache: 128 KB (per SM)
L2 cache: 32 MB to 96 MB
Memory clock rate: 21-23 Gbit/s
PCIe support: PCIe 4.0

Supported graphics APIs
DirectX: DirectX 12 Ultimate (Feature Level 12_2)
Direct3D: Direct3D 12
Shader Model: Shader Model 6.8
OpenCL: OpenCL 3.0
OpenGL: OpenGL 4.6
CUDA: Compute Capability 8.9
Vulkan: Vulkan 1.3

Supported compute APIs
CUDA: CUDA Toolkit 11.6
DirectCompute: Yes

Media engine
Color bit-depth: 8-bit, 10-bit
Encoder(s) supported: NVENC

History
Predecessor: Ampere
Variant: Hopper (datacenter)
Successor: Blackwell

Support status: Supported

Ada Lovelace, also referred to simply as Lovelace,[1] is a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to the Ampere architecture, officially announced on September 20, 2022. It is named after the 19th-century English mathematician Ada Lovelace,[2] one of the first computer programmers. Nvidia announced the architecture along with the GeForce RTX 40 series consumer GPUs[3] and the RTX 6000 Ada Generation workstation graphics card.[4] The Lovelace architecture is fabricated on TSMC's custom 4N process, which offers increased efficiency over the Samsung 8 nm and TSMC N7 processes that Nvidia used for the previous-generation Ampere architecture.[5]

Background

The Ada Lovelace architecture follows the Ampere architecture, which was released in 2020. Nvidia CEO Jensen Huang announced it during the GTC 2022 keynote on September 20, 2022, with the architecture powering Nvidia's GPUs for gaming, workstations and datacenters.[6]

Architectural details

Architectural improvements of the Ada Lovelace architecture include the following:[7]

  • CUDA Compute Capability 8.9[8]
  • TSMC 4N process (custom designed for Nvidia), not to be confused with TSMC's regular N4 node
  • 4th-generation Tensor Cores with FP8, FP16, bfloat16, TensorFloat-32 (TF32) and sparsity acceleration
  • 3rd-generation Ray Tracing Cores, plus concurrent ray tracing, shading and compute
  • Shader Execution Reordering (SER)[9]
  • Nvidia video encoder/decoder (NVENC/NVDEC) with 8K 10-bit 60 FPS AV1 fixed-function hardware encoding[10][11]
  • No NVLink support[12][13]

Streaming multiprocessors (SMs)

CUDA cores

Each SM contains 128 CUDA cores.

RT cores

Ada Lovelace features third-generation RT cores. The RTX 4090 has 128 RT cores, compared to 84 in the previous-generation RTX 3090 Ti. These 128 RT cores can provide up to 191 TFLOPS of compute, or 1.49 TFLOPS per RT core.[14] The Lovelace architecture adds a new stage to the ray tracing pipeline called Shader Execution Reordering (SER), which Nvidia claims provides a 2x performance improvement in ray tracing workloads.[6]
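
The per-core figure quoted above is simple division, and the arithmetic can be checked with a short sketch. The function and constant names here are illustrative, not Nvidia terminology, and the calculation assumes the 191 TFLOPS total is spread evenly across all RT cores.

```python
# Figures quoted in the text for the RTX 4090 and RTX 3090 Ti.
RT_CORES_RTX_4090 = 128
RT_CORES_RTX_3090_TI = 84
TOTAL_RT_TFLOPS_RTX_4090 = 191.0


def per_core_tflops(total_tflops: float, cores: int) -> float:
    """Average RT throughput per core, assuming an even split (illustrative)."""
    return total_tflops / cores


def percent_increase(old: float, new: float) -> float:
    """Relative increase from old to new, in percent."""
    return (new - old) / old * 100.0


if __name__ == "__main__":
    per_core = per_core_tflops(TOTAL_RT_TFLOPS_RTX_4090, RT_CORES_RTX_4090)
    more_cores = percent_increase(RT_CORES_RTX_3090_TI, RT_CORES_RTX_4090)
    print(f"{per_core:.2f} TFLOPS per RT core")
    print(f"{more_cores:.1f}% more RT cores than the RTX 3090 Ti")
```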

Tensor cores

Lovelace's new fourth-generation Tensor cores enable the AI technology used in DLSS 3's frame generation techniques. As in Ampere, each SM contains four Tensor cores, but Lovelace has more Tensor cores overall because it has more SMs.

Clock speeds

Clock speeds increase significantly with the Ada Lovelace architecture: the RTX 4090's base clock speed is higher than the boost clock speed of the RTX 3090 Ti.

Model                     RTX 2080 Ti   RTX 3090 Ti   RTX 4090
Architecture              Turing        Ampere        Ada Lovelace
Base clock speed (MHz)    1350          1560          2235
Boost clock speed (MHz)   1635          1860          2520
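
As a rough check on the clock comparison, the sketch below recomputes the generational increases from the base and boost clocks listed above. The dictionary layout and function name are illustrative choices, not part of any Nvidia tool.

```python
# Base and boost clocks in MHz, as listed in the comparison above.
CLOCKS_MHZ = {
    "RTX 2080 Ti": {"base": 1350, "boost": 1635},
    "RTX 3090 Ti": {"base": 1560, "boost": 1860},
    "RTX 4090": {"base": 2235, "boost": 2520},
}


def percent_increase(old: float, new: float) -> float:
    """Relative increase from old to new, in percent."""
    return (new - old) / old * 100.0


if __name__ == "__main__":
    ampere = CLOCKS_MHZ["RTX 3090 Ti"]
    ada = CLOCKS_MHZ["RTX 4090"]
    # The claim in the text: Ada's base clock exceeds Ampere's boost clock.
    print("4090 base > 3090 Ti boost:", ada["base"] > ampere["boost"])
    print(f"Base clock increase:  {percent_increase(ampere['base'], ada['base']):.1f}%")
    print(f"Boost clock increase: {percent_increase(ampere['boost'], ada['boost']):.1f}%")
```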

Cache and memory subsystem

Model           RTX 2080 Ti                RTX 3090 Ti                RTX 4090
Architecture    Turing                     Ampere                     Ada Lovelace
L1 data cache   6.375 MB (96 KB per SM)    10.5 MB (128 KB per SM)    16 MB (128 KB per SM)
L2 cache        5.5 MB                     6 MB                       72 MB

The fully enabled AD102 Lovelace die features 96 MB of L2 cache, a 16x increase from the 6 MB in the Ampere-based GA102 die.[15] Quick access to a large L2 cache benefits complex operations like ray tracing, because the GPU avoids fetching data from the slower GDDR video memory. Keeping important and frequently accessed data in cache reduces reliance on video memory, which means that a narrower memory bus can be used in tandem with a large L2 cache.

Each memory controller uses a 32-bit connection, with up to 12 controllers present for a combined memory bus width of 384 bits. The Lovelace architecture can use either GDDR6 or GDDR6X memory. GDDR6X memory features on the desktop GeForce RTX 40 series, while the more energy-efficient GDDR6 memory is used on the corresponding mobile versions and on RTX 6000 Ada Generation workstation GPUs.
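
The bus-width arithmetic above, and the peak bandwidth it implies, can be sketched as follows. The bandwidth formula (bus width times per-pin data rate, divided by 8 bits per byte) is the usual back-of-the-envelope estimate. Pairing it with the 21 to 23 Gbit/s memory data rate listed in the specifications is an assumption made here for illustration, and the function names are hypothetical.

```python
BITS_PER_CONTROLLER = 32  # per the text: each memory controller is 32 bits wide


def bus_width_bits(controllers: int) -> int:
    """Combined memory bus width for a given number of 32-bit controllers."""
    return controllers * BITS_PER_CONTROLLER


def peak_bandwidth_gb_s(bus_bits: int, gbit_s_per_pin: float) -> float:
    """Theoretical peak bandwidth in GB/s (8 bits per byte, no overheads)."""
    return bus_bits * gbit_s_per_pin / 8


if __name__ == "__main__":
    width = bus_width_bits(12)
    print(f"Bus width: {width} bits")
    for rate in (21.0, 23.0):  # assumed per-pin data rates in Gbit/s
        print(f"{rate} Gbit/s per pin: {peak_bandwidth_gb_s(width, rate):.0f} GB/s")
```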

Power efficiency and process node

The Ada Lovelace architecture is able to use lower voltages than its predecessor.[6] Nvidia claims a 2x performance increase for the RTX 4090 at the same 450 W power draw as the previous-generation flagship RTX 3090 Ti.[16]

Increased power efficiency can be attributed in part to the smaller fabrication node used by the Lovelace architecture. The Ada Lovelace architecture is fabricated on TSMC's cutting-edge 4N process, a process node custom designed for Nvidia. The previous-generation Ampere architecture used Samsung's 8 nm-based 8N process node from 2018, which was two years old by the time of Ampere's launch.[17][18] The AD102 die, with its 76.3 billion transistors, has a transistor density of 125.3 million per mm², a 178% increase in density from GA102's 45.1 million per mm².
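
The density comparison can be reproduced from the transistor count and the 609 mm² AD102 die size given in the die table of this article. This is a minimal sketch with illustrative names. The GA102 density of 45.1 million per mm² is taken directly from the paragraph above rather than recomputed.

```python
AD102_TRANSISTORS = 76.3e9   # from the text
AD102_DIE_AREA_MM2 = 609.0   # from the die table in this article
GA102_DENSITY_MTR_MM2 = 45.1  # from the text, not recomputed here


def density_mtr_per_mm2(transistors: float, area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors / area_mm2 / 1e6


def percent_increase(old: float, new: float) -> float:
    """Relative increase from old to new, in percent."""
    return (new - old) / old * 100.0


if __name__ == "__main__":
    ad102 = density_mtr_per_mm2(AD102_TRANSISTORS, AD102_DIE_AREA_MM2)
    print(f"AD102 density: {ad102:.1f} MTr/mm2")
    print(f"Increase over GA102: {percent_increase(GA102_DENSITY_MTR_MM2, ad102):.0f}%")
```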

Media engine

The Lovelace architecture uses the new 8th-generation Nvidia NVENC video encoder, while the 7th-generation NVDEC video decoder introduced by Ampere returns.[19]

NVENC AV1 hardware encoding with support for up to 8K resolution at 60 FPS in 10-bit color is added, enabling higher video fidelity at lower bit rates compared to the H.264 and H.265 codecs.[20] Nvidia claims that the NVENC AV1 encoder featured in the Lovelace architecture is 40% more efficient than the H.264 encoder in the Ampere architecture.[21]

The Lovelace architecture received criticism for not supporting DisplayPort 2.0, which offers higher display data bandwidth, and instead using the older DisplayPort 1.4a, which is limited to a peak bandwidth of 32 Gbit/s.[22] As a result, Lovelace GPUs are limited by DisplayPort 1.4a's supported refresh rates even when the GPU can render higher frame rates. Intel's Arc GPUs, also released in October 2022, included DisplayPort 2.0. AMD's competing RDNA 3 architecture, released just two months after Lovelace, included DisplayPort 2.1.[23]
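
To illustrate why link bandwidth caps refresh rate, the sketch below estimates the uncompressed data rate of a video mode as width × height × refresh rate × bits per pixel and compares it with the 32 Gbit/s figure quoted above. It is a deliberately simplified estimate: it ignores blanking intervals, link-encoding overhead and Display Stream Compression, so real limits differ. The function names and example modes are illustrative assumptions.

```python
DP_1_4A_PEAK_GBIT_S = 32.0  # peak bandwidth quoted in the text


def required_gbit_s(width: int, height: int, refresh_hz: int, bits_per_pixel: int) -> float:
    """Uncompressed active-pixel data rate in Gbit/s (no blanking, no overhead)."""
    return width * height * refresh_hz * bits_per_pixel / 1e9


def fits_uncompressed(width: int, height: int, refresh_hz: int,
                      bits_per_pixel: int, link_gbit_s: float = DP_1_4A_PEAK_GBIT_S) -> bool:
    """True if the simplified estimate is within the given link bandwidth."""
    return required_gbit_s(width, height, refresh_hz, bits_per_pixel) <= link_gbit_s


if __name__ == "__main__":
    # 10-bit color per channel, three channels: 30 bits per pixel.
    for refresh in (60, 240):
        rate = required_gbit_s(3840, 2160, refresh, 30)
        ok = fits_uncompressed(3840, 2160, refresh, 30)
        print(f"4K at {refresh} Hz, 10-bit: {rate:.2f} Gbit/s, fits: {ok}")
```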

Ada Lovelace dies

Die[24]                          AD102[25]   AD103[26]   AD104[27]   AD106[28]   AD107[29]
Die size (mm²)                   609         379         294         188         159
Transistors                      76.3B       45.9B       35.8B       22.9B       18.9B
Transistor density (MTr/mm²)     125.3       121.1       121.8       121.8       118.9
Graphics processing clusters     12          7           5           3           2
Streaming multiprocessors        144         80          60          36          24
CUDA cores                       18432       10240       7680        4608        3072
Texture mapping units            576         320         240         144         96
Render output units              192         112         80          48          32
Tensor cores                     576         320         240         144         96
RT cores                         144         80          60          36          24
L1 cache (128 KB per SM)         18 MB       10 MB       7.5 MB      4.5 MB      3 MB
L2 cache                         96 MB       64 MB       48 MB       32 MB       32 MB
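
Most rows of the die table follow directly from the SM count. The sketch below derives them using the per-SM figures stated in the article (128 CUDA cores, 4 Tensor cores and 128 KB of L1 cache per SM). The ratios of one RT core and four texture mapping units per SM are assumptions read off the table rather than stated in the text, and all names are illustrative.

```python
CUDA_CORES_PER_SM = 128   # stated in the text
TENSOR_CORES_PER_SM = 4   # stated in the text
L1_KB_PER_SM = 128        # stated in the table
RT_CORES_PER_SM = 1       # assumption inferred from the table
TMUS_PER_SM = 4           # assumption inferred from the table

# Streaming multiprocessor counts per full die, from the table above.
SM_COUNTS = {"AD102": 144, "AD103": 80, "AD104": 60, "AD106": 36, "AD107": 24}


def derive_units(sm_count: int) -> dict:
    """Derive per-die unit counts and total L1 cache from the SM count."""
    return {
        "cuda_cores": sm_count * CUDA_CORES_PER_SM,
        "tensor_cores": sm_count * TENSOR_CORES_PER_SM,
        "rt_cores": sm_count * RT_CORES_PER_SM,
        "tmus": sm_count * TMUS_PER_SM,
        "l1_mb": sm_count * L1_KB_PER_SM / 1024,
    }


if __name__ == "__main__":
    for die, sms in SM_COUNTS.items():
        print(die, derive_units(sms))
```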

Ada Lovelace-based products

Consumer

Desktop

See also: List of Nvidia graphics processing units § Desktop GPUs § RTX 40 series
  • GeForce 40 series
    • GeForce RTX 4060 (AD107)
    • GeForce RTX 4060 Ti (AD106)
    • GeForce RTX 4070 (AD104)
    • GeForce RTX 4070 SUPER (AD104)
    • GeForce RTX 4070 Ti (AD104)
    • GeForce RTX 4070 Ti SUPER (AD103)
    • GeForce RTX 4080 (AD103)
    • GeForce RTX 4080 SUPER (AD103)
    • GeForce RTX 4090 D (AD102)
    • GeForce RTX 4090 (AD102)

Mobile

See also: List of Nvidia graphics processing units § Mobile GPUs § GeForce 40 series
  • GeForce 40 series
    • GeForce RTX 4050 Laptop (AD107)
    • GeForce RTX 4060 Laptop (AD107)
    • GeForce RTX 4070 Laptop (AD106)
    • GeForce RTX 4080 Laptop (AD104)
    • GeForce RTX 4090 Laptop (AD103)

Professional

Desktop workstation

See also: List of Nvidia graphics processing units § Workstation GPUs § RTX Ada Generation
  • Nvidia Workstation GPUs (formerly Quadro)
    • Nvidia RTX 2000 Ada Generation (AD107)
    • Nvidia RTX 4000 Ada Generation (AD104)
    • Nvidia RTX 4000 SFF Ada Generation (AD104)
    • Nvidia RTX 4500 Ada Generation (AD104)
    • Nvidia RTX 5000 Ada Generation (AD102)
    • Nvidia RTX 5880 Ada Generation (AD102)
    • Nvidia RTX 6000 Ada Generation (AD102)

Mobile workstation

See also: List of Nvidia graphics processing units § Mobile Workstation GPUs § RTX Ada Generation
  • Nvidia Workstation GPUs (formerly Quadro)
    • Nvidia RTX 500 Ada Generation Laptop (AD107)
    • Nvidia RTX 1000 Ada Generation Laptop (AD107)
    • Nvidia RTX 2000 Ada Generation Laptop (AD107)
    • Nvidia RTX 3000 Ada Generation Laptop (AD106)
    • Nvidia RTX 3500 Ada Generation Laptop (AD104)
    • Nvidia RTX 4000 Ada Generation Laptop (AD104)
    • Nvidia RTX 5000 Ada Generation Laptop (AD103)

Datacenter

See also: List of Nvidia graphics processing units § Data Center GPUs § Tesla

Products using Ada Lovelace (per die)
  • GeForce 40 Series (Desktop)
    • AD107: GeForce RTX 4060
    • AD106: GeForce RTX 4060 Ti
    • AD104: GeForce RTX 4070, GeForce RTX 4070 SUPER, GeForce RTX 4070 Ti
    • AD103: GeForce RTX 4070 Ti SUPER, GeForce RTX 4080, GeForce RTX 4080 SUPER
    • AD102: GeForce RTX 4090 D, GeForce RTX 4090
  • GeForce 40 Series (Mobile)
    • AD107: GeForce RTX 4050, GeForce RTX 4060
    • AD106: GeForce RTX 4070
    • AD104: GeForce RTX 4080
    • AD103: GeForce RTX 4090
  • Nvidia Workstation GPUs (Desktop)
    • AD107: RTX 2000 Ada Generation
    • AD104: RTX 4000 Ada Generation, RTX 4000 SFF Ada Generation, RTX 4500 Ada Generation
    • AD102: RTX 5000 Ada Generation, RTX 5880 Ada Generation, RTX 6000 Ada Generation
  • Nvidia Workstation GPUs (Mobile)
    • AD107: RTX 500 Ada Generation, RTX 1000 Ada Generation, RTX 2000 Ada Generation
    • AD106: RTX 3000 Ada Generation
    • AD104: RTX 3500 Ada Generation, RTX 4000 Ada Generation
    • AD103: RTX 5000 Ada Generation
  • Nvidia Data Center GPUs
    • AD104: Nvidia L4[30]
    • AD102: Nvidia L40, Nvidia L40G, Nvidia L40CNX

References

  1. ^ Freund, Karl (September 20, 2022). "Nvidia Launches Lovelace GPU, Cloud Services, Ships H100 GPUs, New Drive Thor". Forbes. Retrieved November 18, 2022.
  2. ^ Mujtaba, Hassan (September 15, 2022). "Nvidia's Next-Gen Ada Lovelace Gaming GPU Architecture For GeForce RTX 40 Series Confirmed". Wccftech. Retrieved November 18, 2022.
  3. ^ "Nvidia Delivers Quantum Leap in Performance, Introduces New Era of Neural Rendering with GeForce RTX 40 Series". Nvidia Newsroom (Press release). September 20, 2022. Retrieved September 20, 2022.
  4. ^ "Nvidia's New Ada Lovelace RTX GPU Arrives for Designers and Creators". Nvidia Newsroom. September 20, 2022. Retrieved November 18, 2022.
  5. ^ Machkovech, Sam (September 20, 2022). "Nvidia's Ada Lovelace GPU generation: $1,599 for RTX 4090, $899 and up for 4080". Ars Technica. Retrieved November 18, 2022.
  6. ^ a b c Chiappetta, Marco (September 22, 2022). "NVIDIA GeForce RTX 40 Architecture Overview: Ada's Special Sauce Unveiled". HotHardware. Retrieved April 8, 2023.
  7. ^ "NVIDIA Ada Lovelace Architecture". NVIDIA. September 20, 2022. Retrieved September 20, 2022.
  8. ^ "CUDA C++ Programming Guide". docs.nvidia.com. Retrieved April 15, 2023.
  9. ^ "Improve Shader Performance and In-Game Frame Rates with Shader Execution Reordering". NVIDIA Technical Blog. October 13, 2022. Retrieved April 6, 2023.
  10. ^ Delgado, Gerardo (September 20, 2022). "Creativity At The Speed of Light: GeForce RTX 40 Series Graphics Cards Unleash Up To 2X Performance in 3D Rendering, AI, and Video Exports For Gamers and Creators". NVIDIA. Retrieved September 20, 2022.
  11. ^ "Nvidia Video Codec SDK". NVIDIA Developer. September 20, 2022. Retrieved November 18, 2022.
  12. ^ Chuong Nguyen (September 21, 2022). "Nvidia kills off NVLink on RTX 4090". Windows Central. Retrieved January 1, 2023.
  13. ^ btarunr (September 21, 2022). "Jensen Confirms: NVLink Support in Ada Lovelace is Gone". TechPowerUp. Retrieved November 18, 2022.
  14. ^ "Nvidia Ada Lovelace GPU Architecture: Designed to deliver outstanding gaming and creating, professional graphics, AI, and compute performance" (PDF). Nvidia. p. 30. Retrieved April 5, 2023.
  15. ^ "Nvidia Ada Lovelace GPU Architecture: Designed to deliver outstanding gaming and creating, professional graphics, AI, and compute performance" (PDF). Nvidia. p. 12. Retrieved April 6, 2023.
  16. ^ "Nvidia Ada Lovelace GPU Architecture: Designed to deliver outstanding gaming and creating, professional graphics, AI, and compute performance" (PDF). Nvidia. p. 12. Retrieved April 5, 2023.
  17. ^ James, Dave (September 1, 2020). "Nvidia confirms Samsung 8nm process for RTX 3090, RTX 3080, and RTX 3070". PC Gamer. Retrieved April 5, 2023.
  18. ^ Bosnjak, Dominik (September 1, 2020). "Samsung's old 8nm tech at the heart of NVIDIA's monstrous Ampere cards". SamMobile. Retrieved April 5, 2023.
  19. ^ "Nvidia Ada Lovelace GPU Architecture: Designed to deliver outstanding gaming and creating, professional graphics, AI, and compute performance" (PDF). Nvidia. p. 25. Retrieved April 5, 2023.
  20. ^ Muthana, Prathap; Mishra, Sampurnananda; Patait, Abhijit (January 18, 2023). "Improving Video Quality and Performance with AV1 and NVIDIA Ada Lovelace Architecture". Nvidia Developer. Retrieved April 5, 2023.
  21. ^ "Nvidia Ada Science: How Ada advances the science of graphics with DLSS 3" (PDF). Nvidia. p. 13. Retrieved April 5, 2023.
  22. ^ Garreffa, Anthony (September 25, 2022). "NVIDIA's next-gen GeForce RTX 40 series lack DP2.0 connectivity, silly". TweakTown. Retrieved April 5, 2023.
  23. ^ Judd, Will (November 3, 2022). "AMD announces 7900 XTX and 7900 XT graphics cards with FSR 3". Eurogamer. Retrieved April 5, 2023.
  24. ^ "NVIDIA confirms Ada 102/103/104 GPU specs, AD104 has more transistors than GA102". VideoCardz. September 23, 2022. Retrieved September 23, 2022.
  25. ^ "NVIDIA AD102 GPU Specs". TechPowerUp. Retrieved December 17, 2022.
  26. ^ "NVIDIA AD103 GPU Specs". TechPowerUp. Retrieved July 16, 2024.
  27. ^ "NVIDIA AD104 GPU Specs". TechPowerUp. Retrieved October 18, 2022.
  28. ^ "NVIDIA AD106 GPU Specs". TechPowerUp. Retrieved December 17, 2022.
  29. ^ "NVIDIA AD107 GPU Specs". TechPowerUp. Retrieved December 17, 2022.
  30. ^ "NVIDIA L4 Specs". TechPowerUp. March 21, 2023. Retrieved April 15, 2024.
