CDNA (microarchitecture)

From Wikipedia, the free encyclopedia
AMD compute-focused GPU microarchitecture

AMD CDNA
Release date: November 16, 2020
Designed by: AMD
Fabrication process: TSMC N7, N6, N5
History
Predecessor: AMD FirePro
Variant: RDNA (consumer, professional)

CDNA (Compute DNA) is a compute-centered graphics processing unit (GPU) microarchitecture designed by AMD for datacenters. Mostly used in the AMD Instinct line of data center graphics cards, CDNA is a successor to the Graphics Core Next (GCN) microarchitecture; the other successor being RDNA (Radeon DNA), a consumer-graphics-focused microarchitecture.

The first generation of CDNA was announced on March 5, 2020,[2] and was featured in the AMD Instinct MI100, launched November 16, 2020.[3] The MI100 is the only CDNA 1 product produced, manufactured on TSMC's N7 FinFET process.

The second iteration of the CDNA line implemented a multi-chip module (MCM) approach, differing from its predecessor's monolithic approach. Featured in the AMD Instinct MI250X and MI250, this MCM design used an elevated fanout bridge (EFB)[4] to connect the dies. These two products were announced November 8, 2021, and launched November 11, 2021. The CDNA 2 line includes an additional latecomer using a monolithic design, the MI210.[5] The MI250X and MI250 were the first AMD products to use the Open Compute Project (OCP)'s OCP Accelerator Module (OAM) socket form factor. Lower-wattage PCIe versions are available.

The third iteration of CDNA switches to an MCM design utilizing different chiplets manufactured on multiple nodes. The series, currently consisting of the MI300X and MI300A, contains 15 unique dies connected with advanced 3D packaging techniques. The MI300 series was announced on January 5, 2023, and launched in H2 2023.

CDNA 1

AMD CDNA 1
Release date: November 16, 2020
Fabrication process: TSMC N7 (FinFET)
History
Predecessor: AMD FirePro
Successor: CDNA 2

The CDNA family consists of one die, named Arcturus. The die is 750 square millimetres, contains 25.6 billion transistors and is manufactured on TSMC's N7 node.[6] The Arcturus die possesses 120 compute units and a 4096-bit memory bus, connected to four HBM2 placements, giving the die 32 GB of memory and just over 1200 GB/s of memory bandwidth. Compared to its predecessor, CDNA has removed all hardware related to graphics acceleration. This removal includes but is not limited to: graphics caches, tessellation hardware, render output units (ROPs), and the display engine. CDNA retains the VCN media engine for HEVC, H.264, and VP9 decoding.[7] CDNA has also added dedicated matrix compute hardware, similar to that added in Nvidia's Volta architecture.

Architecture


The 120 compute units (CUs) are organized into 4 asynchronous compute engines (ACEs), each ACE maintaining its own independent command execution and dispatch. At the CU level, CDNA compute units are organized similarly to GCN units. Each CU contains four SIMD16 units, each executing a 64-thread wavefront (Wave64) over four cycles.
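The issue pattern described above can be sketched as a small model (illustrative only, not AMD code): a 64-thread wavefront mapped onto a 16-lane SIMD unit needs four cycles per instruction.

```python
# Illustrative sketch (not AMD code): a Wave64 wavefront issued on a
# 16-lane SIMD16 unit takes 64 / 16 = 4 cycles per instruction.
WAVEFRONT_SIZE = 64  # threads per wavefront (Wave64)
SIMD_LANES = 16      # lanes in one SIMD16 unit

def cycles_per_instruction(wavefront: int = WAVEFRONT_SIZE,
                           lanes: int = SIMD_LANES) -> int:
    """Cycles for one SIMD unit to issue one instruction for every thread."""
    return wavefront // lanes

print(cycles_per_instruction())  # 4
```

This four-cycle cadence is the same execution model GCN used, which is why the text describes the CUs as organized similarly to GCN units.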

Memory system


CDNA has a 20% clock bump for the HBM, resulting in a roughly 200 GB/s bandwidth increase vs. Vega 20 (GCN 5.0). The die has a shared 4 MB L2 cache that puts out 2 KB per clock to the CUs. At the CU level, each CU has its own L1 cache and a local data store (LDS) with 64 KB per CU; a 4 KB global data store (GDS) is shared by all CUs. This GDS can be used to store control data, perform reduction operations, or act as a small global shared surface.[7][8]
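The bandwidth uplift follows directly from bus width and transfer rate. A quick check, using the 4096-bit bus and the widely reported 2000 vs. 2400 MT/s HBM2 rates (the specific MT/s values are taken from the product tables, not this paragraph):

```python
def hbm_bandwidth_gbs(bus_width_bits: int, transfer_rate_mts: int) -> float:
    """Peak bandwidth in GB/s: (bus width in bytes) x (transfers per second)."""
    return bus_width_bits / 8 * transfer_rate_mts / 1000

vega20 = hbm_bandwidth_gbs(4096, 2000)  # 1024.0 GB/s (GCN 5.0 baseline)
mi100 = hbm_bandwidth_gbs(4096, 2400)   # 1228.8 GB/s, "just over 1200 GB/s"
print(mi100 - vega20)                   # ~205 GB/s from the 20% clock bump
```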

Experimental PIM implementation

In October 2022, Samsung demonstrated a Processing-in-Memory (PIM) specialized version of the MI100. In December 2022, Samsung demonstrated a cluster of 96 modified MI100s, claiming large increases in processing throughput for various workloads and a significant reduction in power consumption.[9]

Changes from GCN


The individual compute units remain highly similar to GCN's, but with the addition of 4 matrix units per CU. Support for more datatypes was added, including BF16, INT8, and INT4.[7] For an extensive list of operations utilizing the matrix units and new datatypes, reference the CDNA ISA Reference Guide.

Products

AMD Instinct MI100 (Arcturus):[10][11]
- Released: Nov 16, 2020
- Architecture & fab: CDNA, TSMC N7
- Transistors & die size: 25.6×10⁹, 750 mm²
- Core config:[c] 7680:480:- (120 CU)
- Core clock:[a] 1000 MHz base (1502 MHz boost)
- Texture fillrate:[d] 480 GT/s (720.96 GT/s); pixel fillrate:[e] - (no ROPs)
- Vector processing power (TFLOPS):[a][b] FP16 ?, FP32 15.72 (23.10), FP64 7.86 (11.5)
- Matrix processing power:[a][b] INT8 122.88 (184.57) TOPS; BF16 61.44 (92.28), FP16 122.88 (184.57), FP32 30.72 (46.14), FP64 15.36 (23.07) TFLOPS
- Memory: 32 GB HBM2, 4096-bit bus, 2400 MT/s, 1228 GB/s
- TBP: 300 W
- Software interface: PCIe 4.0 ×16; physical interface: PCIe ×16

  1. ^ [a] Boost values (if available) are stated in parentheses after the base value.
  2. ^ [b] Precision performance is calculated from the base (or boost) core clock speed based on an FMA operation.
  3. ^ [c] Unified shaders : Texture mapping units : Render output units, and Compute units (CU).
  4. ^ [d] Texture fillrate is calculated as the number of texture mapping units multiplied by the base (or boost) core clock speed.
  5. ^ [e] Pixel fillrate is calculated as the number of render output units multiplied by the base (or boost) core clock speed.
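The FMA convention in footnote [b] (one fused multiply-add, i.e. two FLOPs, per shader per clock) can be checked against the MI100's 7680 shaders. This is a sanity-check sketch; it lands within rounding of AMD's quoted 23.1 TFLOPS peak FP32 figure at the 1502 MHz boost clock.

```python
def vector_tflops(shaders: int, clock_mhz: int, flops_per_clock: int = 2) -> float:
    """Peak vector TFLOPS assuming one FMA (2 FLOPs) per shader per clock."""
    return shaders * flops_per_clock * clock_mhz / 1e6

print(round(vector_tflops(7680, 1000), 2))  # 15.36 TFLOPS FP32 at base clock
print(round(vector_tflops(7680, 1502), 2))  # 23.07 TFLOPS FP32 at boost clock
```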

CDNA 2

AMD CDNA 2
Release date: November 8, 2021
Fabrication process: TSMC N6
History
Predecessor: CDNA 1
Successor: CDNA 3

Like CDNA, CDNA 2 also consists of one die, named Aldebaran. This die is estimated to be 790 square millimetres and contains 28 billion transistors while being manufactured on TSMC's N6 node.[12] The Aldebaran die contains only 112 compute units, a 6.67% decrease from Arcturus. Like the previous generation, this die contains a 4096-bit memory bus, now using HBM2e with a doubling in capacity, up to 64 GB. The largest change in CDNA 2 is the ability for two dies to be placed on the same package. The MI250X consists of 2 Aldebaran dies, 220 CUs (110 per die) and 128 GB of HBM2e. These dies are connected with 4 Infinity Fabric links and addressed as independent GPUs by the host system.[13]

Architecture


The 112 CUs are organized similarly to CDNA, into 4 asynchronous compute engines, each with 28 CUs instead of the prior generation's 30. Like CDNA, each CU contains four SIMD16 units executing a 64-thread wavefront across 4 cycles. The 4 matrix engines and vector units have added support for full-rate FP64, enabling a significant uplift over the prior generation.[14] CDNA 2 also revises multiple internal caches, doubling bandwidth across the board.

Memory system


The memory system in CDNA 2 sports across-the-board improvements, starting with the move to HBM2e, which doubles capacity to 64 GB and increases bandwidth by roughly one third (from ~1200 GB/s to ~1600 GB/s).[13] At the cache level, each GCD has a 16-way, 8 MB L2 cache that is partitioned into 32 slices. This cache puts out 4 KB per clock (128 B per clock per slice), a doubling of the bandwidth from CDNA.[13] Additionally, the 4 KB global data store was removed.[14] All caches, including the L2 and LDS, have added support for FP64 data.
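These figures are internally consistent, as a short sketch shows (the 3200 MT/s HBM2e rate is taken from the product table below; everything else comes from the paragraph above):

```python
# Per-die HBM2e: 4096-bit bus at 3200 MT/s
hbm2e_gbs = 4096 / 8 * 3200 / 1000
print(hbm2e_gbs)  # 1638.4 GB/s, i.e. the ~1600 GB/s quoted

# L2: 32 slices x 128 B per slice per clock
l2_bytes_per_clock = 32 * 128
print(l2_bytes_per_clock)  # 4096 B = 4 KB per clock, double CDNA's 2 KB
```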

Interconnect


CDNA 2 brings forth the first AMD product with multiple GPUs on the same package. The two GPU dies are connected by 4 Infinity Fabric links, with a total bidirectional bandwidth of 400 GB/s.[14] Each die contains 8 Infinity Fabric links, each physically implemented with a 16-lane Infinity Link. When paired with an AMD processor, these links act as Infinity Fabric; if paired with any other x86 processor, they fall back to 16 lanes of PCIe 4.0.[14]
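The per-link bandwidth follows by division. For scale, the sketch also compares against the PCIe 4.0 fallback path; the ~1.97 GB/s-per-lane-per-direction figure is an assumption based on PCIe 4.0's standard 16 GT/s rate with 128b/130b encoding, not a number from this article.

```python
# Die-to-die: 4 Infinity Fabric links share 400 GB/s bidirectional
per_link_bidir_gbs = 400 / 4
print(per_link_bidir_gbs)  # 100.0 GB/s bidirectional per link

# Fallback path (assumption): 16 lanes of PCIe 4.0, ~1.97 GB/s/lane/direction
pcie4_x16_gbs = 16 * 1.97
print(round(pcie4_x16_gbs, 1))  # ~31.5 GB/s per direction
```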

Changes from CDNA


The largest upfront change is the addition of full-rate FP64 support across all compute elements. This results in a 4× increase in FP64 matrix throughput, with large increases in FP64 vector performance.[13] Additionally, support for packed FP32 operations was added, with opcodes like V_PK_FMA_F32 and V_PK_MUL_F32.[15] Packed FP32 operations can enable up to 2× throughput but require code modification.[13] As with CDNA, for further information on CDNA 2 operations, reference the CDNA 2 ISA Reference Guide.
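As a toy model (not ISA-accurate semantics), a packed instruction like V_PK_FMA_F32 applies one operation to two FP32 lanes at once, which is where the up-to-2× throughput comes from: one issue slot does the work of two scalar FMAs.

```python
def pk_fma_f32(a, b, c):
    """Toy model of V_PK_FMA_F32: two fused multiply-adds per 'instruction'.

    Each operand is a pair of FP32 lane values; the result is the pair
    (a0*b0 + c0, a1*b1 + c1), computed in a single issue.
    """
    return (a[0] * b[0] + c[0], a[1] * b[1] + c[1])

print(pk_fma_f32((2.0, 3.0), (4.0, 5.0), (1.0, 1.0)))  # (9.0, 16.0)
```

The "code modification" caveat in the text reflects that compilers and kernels must arrange data into these lane pairs before the packed opcodes can be used.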

Products

AMD Instinct CDNA 2 GPU generations (MI-2xx):

MI210 (launched 2022-03-22):[16] CDNA 2, 6 nm, 104 CUs; 64 GB HBM2e at 1600 GB/s; FP16 181 TFLOPS, FP32 22.6 TFLOPS, FP32 matrix 45.3 TFLOPS, FP64 22.6 TFLOPS, FP64 matrix 45.3 TFLOPS, INT8 181 TOPS; PCIe form factor; 300 W TBP.

MI250 (launched 2021-11-08):[17] CDNA 2, 6 nm, 208 CUs; 128 GB HBM2e at 3200 GB/s; FP16 362.1 TFLOPS, FP32 45.3 TFLOPS, FP32 matrix 90.5 TFLOPS, FP64 45.3 TFLOPS, FP64 matrix 90.5 TFLOPS, INT8 362.1 TOPS; OAM form factor; 560 W TBP.

MI250X (launched 2021-11-08): CDNA 2, 6 nm, 220 CUs; 128 GB HBM2e at 3200 GB/s; FP16 383 TFLOPS, FP32 47.9 TFLOPS, FP32 matrix 95.7 TFLOPS, FP64 47.9 TFLOPS, FP64 matrix 95.7 TFLOPS, INT8 383 TOPS; OAM form factor; 560 W TBP.

CDNA 3

AMD CDNA 3
Release date: December 6, 2023
Fabrication process: TSMC N5 & N6
History
Predecessor: CDNA 2

Unlike its predecessors, CDNA 3 consists of multiple dies, used in a multi-chip system, similar to AMD's Zen 2, 3, and 4 lines of products. The MI300 package is comparatively massive, with nine chiplets produced on 5 nm placed on top of four 6 nm chiplets.[18] This is all combined with 128 GB of HBM3, using eight HBM placements.[19] The package contains an estimated 146 billion transistors. It comes in the form of the Instinct MI300X and MI300A, the latter being an APU. These products were launched on December 6, 2023.[20]

Products

AMD Instinct CDNA 3 GPU generations (MI-3xx):

MI300A (launched 2023-12-06):[21] CDNA 3, 6 & 5 nm, 228 CUs; 128 GB HBM3 at 5300 GB/s; PCIe 5.0; APU SH5 socket; FP16 980.6 TFLOPS (1961.2 TFLOPS with sparsity), FP32 122.6 TFLOPS, FP64 61.3 TFLOPS, FP64 matrix 122.6 TFLOPS, INT8 1961.2 TOPS (3922.3 TOPS with sparsity), INT4 N/A; 550 W TBP (760 W with liquid cooling).

MI300X (launched 2023-12-06): CDNA 3, 6 & 5 nm, 304 CUs; 192 GB HBM3 at 5300 GB/s; PCIe 5.0; OAM form factor; FP16 1307.4 TFLOPS (2614.9 TFLOPS with sparsity), FP32 163.4 TFLOPS, FP64 81.7 TFLOPS, FP64 matrix 163.4 TFLOPS, INT8 2614.9 TOPS (5229.8 TOPS with sparsity), INT4 N/A; 750 W TBP.

MI325X (launched 2024-10-10):[22] CDNA 3, 6 & 5 nm, 304 CUs; 256 GB HBM3E at 6000 GB/s; OAM form factor.

Product Comparisons

Tesla V100 (PCIe) (GV100):[23][24] May 10, 2017; Volta, TSMC 12 nm; 12.1×10⁹ transistors, 815 mm²; config[c] 5120:320:128:640 (80 SM); 1370 MHz;[a] texture fillrate[d] 438.4 GT/s, pixel fillrate[e] 175.36 GP/s; vector[a][b] FP16 28.06, FP32 14.03, FP64 7.01 TFLOPS; matrix[a][b] FP16 112.23 TFLOPS (INT8, BF16, FP32, FP64: N/A); 16 or 32 GB HBM2, 4096-bit, 1750 MT/s, 900 GB/s; 250 W; PCIe 3.0 ×16 (software), PCIe ×16 (physical).

Tesla V100 (SXM) (GV100):[25][26] May 10, 2017; same silicon as above; 1455 MHz; texture 465.6 GT/s, pixel 186.24 GP/s; vector FP16 29.80, FP32 14.90, FP64 7.46 TFLOPS; matrix FP16 119.19 TFLOPS; 300 W; NVLink; SXM2.

Radeon Instinct MI50 (Vega 20):[27][28][29][30][31][32] Nov 18, 2018; GCN 5, TSMC 7 nm; 13.2×10⁹ transistors, 331 mm²; config 3840:240:64 (60 CU); 1450 (1725) MHz; texture 348.0 (414.0) GT/s, pixel 92.80 (110.4) GP/s; vector FP16 22.27 (26.50), FP32 11.14 (13.25), FP64 5.568 (6.624) TFLOPS; matrix INT8 N/A, BF16 N/A, FP16 26.5, FP32 13.3, FP64 ?; 16 or 32 GB HBM2, 4096-bit, 2000 MT/s, 1024 GB/s; 300 W; PCIe 4.0 ×16 (software), PCIe ×16 (physical).

Radeon Instinct MI60 (Vega 20):[28][33][34][35] Nov 18, 2018; GCN 5, TSMC 7 nm; 13.2×10⁹ transistors, 331 mm²; config 4096:256:64 (64 CU); 1500 (1800) MHz; texture 384.0 (460.8) GT/s, pixel 96.00 (115.2) GP/s; vector FP16 24.58 (29.49), FP32 12.29 (14.75), FP64 6.144 (7.373) TFLOPS; matrix INT8 N/A, BF16 N/A, FP16 32, FP32 16, FP64 ?; 32 GB HBM2, 4096-bit, 2000 MT/s, 1024 GB/s; 300 W; PCIe 4.0 ×16.

Tesla A100 (PCIe) (GA100):[36][37] May 14, 2020; Ampere, TSMC 7 nm; 54.2×10⁹ transistors, 826 mm²; config 6912:432:-:432 (108 SM); 1065 (1410) MHz; texture 460.08 (609.12) GT/s, pixel -; vector FP16 58.89 (77.97), FP32 14.72 (19.49), FP64 7.36 (9.75) TFLOPS; matrix INT8 942.24 (1247.47) TOPS; BF16 235.56 (311.87), FP16 235.56 (311.87), FP32 117.78 (155.93), FP64 14.72 (19.49) TFLOPS; 40 or 80 GB HBM2, 5120-bit, 3186 MT/s, 2039 GB/s; 250 W; PCIe 4.0 ×16 (software), PCIe ×16 (physical).

Tesla A100 (SXM) (GA100):[38][39] May 14, 2020; same silicon as above; 1275 (1410) MHz; texture 550.80 (609.12) GT/s; vector FP16 70.50 (77.97), FP32 17.63 (19.49), FP64 8.81 (9.75) TFLOPS; matrix INT8 1128.04 (1247.47) TOPS; BF16 282.01 (311.87), FP16 282.01 (311.87), FP32 141.00 (155.93), FP64 17.63 (19.49) TFLOPS; 400 W; NVLink; SXM4.

AMD Instinct MI100 (Arcturus):[40][41] Nov 16, 2020; CDNA, TSMC 7 nm; 25.6×10⁹ transistors, 750 mm²; config 7680:480:-:480 (120 CU); 1000 (1502) MHz; texture 480 (720.96) GT/s, pixel -; vector FP16 ?, FP32 15.72 (23.10), FP64 7.86 (11.5) TFLOPS; matrix INT8 122.88 (184.57) TOPS; BF16 61.44 (92.28), FP16 122.88 (184.57), FP32 30.72 (46.14), FP64 15.36 (23.07) TFLOPS; 32 GB HBM2, 4096-bit, 2400 MT/s, 1228 GB/s; 300 W; PCIe 4.0 ×16 (software), PCIe ×16 (physical).

AMD Instinct MI250X (PCIe) (Aldebaran): Nov 8, 2021; CDNA 2, TSMC 6 nm; 58×10⁹ transistors, 1540 mm²; config 14080:880:-:880 (220 CU).

AMD Instinct MI250X (OAM) (Aldebaran): Nov 8, 2021; CDNA 2, TSMC 6 nm; 58×10⁹ transistors, 1540 mm²; config 14080:880:-:880 (220 CU).

Tesla H100 (PCIe) (GH100): Mar 22, 2022; Hopper, TSMC 4 nm; 80×10⁹ transistors, 814 mm².

Tesla H100 (SXM) (GH100): Mar 22, 2022; Hopper, TSMC 4 nm; 80×10⁹ transistors, 814 mm².

  1. ^ [a] Boost values (if available) are stated in parentheses after the base value.
  2. ^ [b] Precision performance is calculated from the base (or boost) core clock speed based on an FMA operation.
  3. ^ [c] Unified shaders : Texture mapping units : Render output units : AI accelerators, and Compute units (CU) / Streaming multiprocessors (SM).
  4. ^ [d] Texture fillrate is calculated as the number of texture mapping units multiplied by the base (or boost) core clock speed.
  5. ^ [e] Pixel fillrate is calculated as the number of render output units multiplied by the base (or boost) core clock speed.


References

  1. ^ Smith, Ryan (June 9, 2022). "AMD: Combining CDNA 3 and Zen 4 for MI300 Data Center APU in 2023". AnandTech. Retrieved December 20, 2022.
  2. ^ Smith, Ryan. "AMD Unveils CDNA GPU Architecture: A Dedicated GPU Architecture for Data Centers". www.anandtech.com. Retrieved September 20, 2022.
  3. ^ "GPU Database: AMD Radeon Instinct MI100". TechPowerUp. Retrieved September 20, 2022.
  4. ^ Smith, Ryan. "AMD Announces Instinct MI200 Accelerator Family: Taking Servers to Exascale and Beyond". www.anandtech.com. Retrieved September 21, 2022.
  5. ^ Smith, Ryan. "AMD Releases Instinct MI210 Accelerator: CDNA 2 On a PCIe Card". www.anandtech.com. Retrieved September 21, 2022.
  6. ^ Kennedy, Patrick (November 16, 2020). "AMD Instinct MI100 32GB CDNA GPU Launched". ServeTheHome. Retrieved September 22, 2022.
  7. ^ "AMD CDNA Whitepaper" (PDF). amd.com. March 5, 2020. Retrieved September 22, 2022.
  8. ^ ""AMD Instinct MI100" Instruction Set Architecture, Reference Guide" (PDF). developer.amd.com. December 14, 2020. Retrieved September 22, 2022.
  9. ^ Klotz, Aaron (December 14, 2022). "Samsung Soups Up 96 AMD MI100 GPUs With Radical Computational Memory". Tom's Hardware. Retrieved December 23, 2022.
  10. ^ "AMD Instinct MI100 Brochure" (PDF). AMD. Retrieved December 25, 2022.
  11. ^ "AMD CDNA Whitepaper" (PDF). AMD. Retrieved December 25, 2022.
  12. ^ Shilov, Anton (November 17, 2021). "AMD's Instinct MI250X OAM Card Pictured: Aldebaran's Massive Die Revealed". Tom's Hardware. Retrieved November 20, 2022.
  13. ^ "Hot Chips 34 – AMD's Instinct MI200 Architecture". Chips and Cheese. September 18, 2022. Retrieved November 10, 2022.
  14. ^ "Introducing AMD CDNA 2 Architecture" (PDF). AMD.com. Retrieved November 20, 2022.
  15. ^ ""AMD Instinct MI200" Instruction Set Architecture" (PDF). developer.amd.com. February 4, 2022. Retrieved October 11, 2022.
  16. ^ Smith, Ryan. "AMD Releases Instinct MI210 Accelerator: CDNA 2 On a PCIe Card". www.anandtech.com. Retrieved June 3, 2024.
  17. ^ Smith, Ryan. "AMD Announces Instinct MI200 Accelerator Family: Taking Servers to Exascale and Beyond". www.anandtech.com. Retrieved June 3, 2024.
  18. ^ Smith, Ryan. "CES 2023: AMD Instinct MI300 Data Center APU Silicon In Hand - 146B Transistors, Shipping H2'23". www.anandtech.com. Retrieved January 22, 2023.
  19. ^ Alcorn, Paul (January 5, 2023). "AMD Instinct MI300 Data Center APU Pictured Up Close: 13 Chiplets, 146 Billion Transistors". Tom's Hardware. Retrieved January 22, 2023.
  20. ^ Kennedy, Patrick (December 6, 2023). "AMD Instinct MI300X GPU and MI300A APUs Launched for AI Era". ServeTheHome. Retrieved April 15, 2024.
  21. ^ Smith, Ryan; Bonshor, Gavin. "The AMD Advancing AI & Instinct MI300 Launch Live Blog (Starts at 10am PT/18:00 UTC)". www.anandtech.com. Retrieved June 3, 2024.
  22. ^ Smith, Ryan. "AMD Plans Massive Memory Instinct MI325X for Q4'24, Lays Out Accelerator Roadmap to 2026". www.anandtech.com. Retrieved June 3, 2024.
  23. ^ Oh, Nate (December 16, 2022). "Nvidia Formally Announced PCIe Tesla V100". AnandTech.
  24. ^ "NVIDIA Tesla V100 PCIe 16GB". TechPowerUp.
  25. ^ Smith, Ryan (December 19, 2022). "Nvidia Volta Unveiled". AnandTech.
  26. ^ "NVIDIA Tesla V100 SXM3 32GB". TechPowerUp.
  27. ^ Walton, Jarred (January 10, 2019). "Hands on with the AMD Radeon VII". PC Gamer.
  28. ^ "Next Horizon – David Wang Presentation" (PDF). AMD.
  29. ^ "AMD Radeon Instinct MI50 Accelerator (16GB)". AMD.
  30. ^ "AMD Radeon Instinct MI50 Accelerator (32GB)". AMD.
  31. ^ "AMD Radeon Instinct MI50 Datasheet" (PDF). AMD.
  32. ^ "AMD Radeon Instinct MI50 Specs". TechPowerUp. Retrieved May 27, 2022.
  33. ^ "Radeon Instinct MI60". AMD. Archived from the original on November 22, 2018. Retrieved May 27, 2022.
  34. ^ "AMD Radeon Instinct MI60 Datasheet" (PDF). AMD.
  35. ^ "AMD Radeon Instinct MI60 Specs". TechPowerUp. Retrieved May 27, 2022.
  36. ^ "Nvidia A100 Tensor Core GPU Architecture" (PDF). Nvidia. Retrieved December 12, 2022.
  37. ^ "Nvidia A100 PCIE 80 GB Specs". TechPowerUp. Retrieved December 12, 2022.
  38. ^ "Nvidia A100 Tensor Core GPU Architecture" (PDF). Nvidia. Retrieved December 12, 2022.
  39. ^ "Nvidia A100 SXM4 80 GB Specs". TechPowerUp. Retrieved December 12, 2022.
  40. ^ "AMD Instinct MI100 Brochure" (PDF). AMD. Retrieved December 25, 2022.
  41. ^ "AMD CDNA Whitepaper" (PDF). AMD. Retrieved December 25, 2022.
