| AMD Instinct | |
|---|---|
| Release date | June 20, 2017 |
| Designed by | AMD |
| Marketed by | AMD |
| Models | MI Series |
| Cores | 36–304 compute units (CUs) |
| Predecessor | AMD FirePro S |
AMD Instinct is AMD's brand of data center GPUs.[1][2] It replaced AMD's FirePro S brand in 2016. Compared to the Radeon brand of mainstream consumer/gamer products, the Instinct product line is intended to accelerate deep learning, artificial neural network, and high-performance computing/GPGPU applications.
The AMD Instinct product line directly competes with Nvidia's Tesla and Intel's Xeon Phi and Data Center GPU lines of machine learning and GPGPU cards.
The brand was originally known as AMD Radeon Instinct, but AMD dropped the Radeon brand from the name before the AMD Instinct MI100 was introduced in November 2020.
In June 2022, supercomputers based on AMD's Epyc CPUs and Instinct GPUs took the lead on the Green500 list of the most power-efficient supercomputers, with a more than 50% lead over any other system, and held the top four spots.[3] One of them, the AMD-based Frontier, has been the fastest supercomputer in the world on the TOP500 list since June 2022 (as of 2023).[4][5]

| Accelerator | Launch date | Architecture | Lithography | Compute Units | Memory size | Memory type | Memory bandwidth (GB/s) | PCIe support | Form factor | FP16 | BF16 | FP32 | FP32 matrix | FP64 | FP64 matrix | INT8 | INT4 | TBP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MI6 | 2016-12-12[6] | GCN 4 | 14 nm | 36 | 16 GB | GDDR5 | 224 | 3.0 | PCIe | 5.7 TFLOPS | N/A | 5.7 TFLOPS | N/A | 358 GFLOPS | N/A | N/A | N/A | 150 W |
| MI8 | GCN 3 | 28 nm | 64 | 4 GB | HBM | 512 | 8.2 TFLOPS | 8.2 TFLOPS | 512 GFLOPS | 175 W | ||||||||
| MI25 | GCN 5 | 14 nm | 16 GB | HBM2 | 484 | 26.4 TFLOPS | 12.3 TFLOPS | 768 GFLOPS | 300 W | |||||||||
| MI50 | 2018-11-06[7] | 7 nm | 60 | 1024 | 4.0 | 26.5 TFLOPS | 13.3 TFLOPS | 6.6 TFLOPS | 53 TOPS | 300 W | ||||||||
| MI60 | 64 | 32 GB | 29.5 TFLOPS | 14.7 TFLOPS | 7.4 TFLOPS | 59 TOPS | 300 W | |||||||||||
| MI100 | 2020-11-16 | CDNA | 120 | 1200 | 184.6 TFLOPS | 92.3 TFLOPS | 23.1 TFLOPS | 46.1 TFLOPS | 11.5 TFLOPS | 184.6 TOPS | 300 W | |||||||
| MI210 | 2022-03-22[8] | CDNA 2 | 6 nm | 104 | 64 GB | HBM2E | 1600 | 181 TFLOPS | 22.6 TFLOPS | 45.3 TFLOPS | 22.6 TFLOPS | 45.3 TFLOPS | 181 TOPS | 300 W | ||||
| MI250 | 2021-11-08[9] | 208 | 128 GB | 3200 | OAM | 362.1 TFLOPS | 45.3 TFLOPS | 90.5 TFLOPS | 45.3 TFLOPS | 90.5 TFLOPS | 362.1 TOPS | 560 W | ||||||
| MI250X | 220 | 383 TFLOPS | 47.92 TFLOPS | 95.7 TFLOPS | 47.9 TFLOPS | 95.7 TFLOPS | 383 TOPS | 560 W | ||||||||||
| MI300A | 2023-12-06[10] | CDNA 3 | 6 & 5 nm | 228 | 128 GB | HBM3 | 5300 | 5.0 | APU SH5 socket | 980.6 TFLOPS 1961.2 TFLOPS (with Sparsity) | 122.6 TFLOPS | 61.3 TFLOPS | 122.6 TFLOPS | 1961.2 TOPS 3922.3 TOPS (with Sparsity) | N/A | 550 W 760 W (with liquid cooling) | ||
| MI300X | 304 | 192 GB | OAM | 1307.4 TFLOPS 2614.9 TFLOPS (with Sparsity) | 163.4 TFLOPS | 81.7 TFLOPS | 163.4 TFLOPS | 2614.9 TOPS 5229.8 TOPS (with Sparsity) | N/A | 750 W | ||||||||
| MI325X | 2024-10-10[11] | 256 GB | HBM3E | 6000 | ||||||||||||||
| MI350X | 2025-06-13[12] | CDNA 4 | 3 nm | 256 | 288 GB | HBM3E | 8000 | 5.0 | OAM | 2386.9 TFLOPS 4613.8 TFLOPS (with Sparsity) | 144.2 TFLOPS | 72.1 TFLOPS | 4.6137 POPS 9.2274 POPS (with Sparsity) | 1000 W | ||||
| MI355X | 2516.6 TFLOPS 5033.2 TFLOPS (with Sparsity) | 157.3 TFLOPS | 78.6 TFLOPS | 5.0332 POPS 10.066 POPS (with Sparsity) | 1400 W | |||||||||||||
The three initial Radeon Instinct products were announced on December 12, 2016, and released on June 20, 2017, with each based on a different architecture.[13][14]
The MI6 is a passively cooled, Polaris 10-based card with 16 GB of GDDR5 memory and a TDP under 150 W.[1][2] At 5.7 TFLOPS (FP16 and FP32), the MI6 is expected to be used primarily for inference rather than neural-network training. The MI6 has a peak double-precision (FP64) compute performance of 358 GFLOPS.[15]
The MI8 is a Fiji-based card, analogous to the R9 Nano, with a TDP under 175 W.[1] It has 4 GB of High Bandwidth Memory. At 8.2 TFLOPS (FP16 and FP32), the MI8 is marketed toward inference. Its peak double-precision (FP64) compute performance is 512 GFLOPS.[16]
The MI25 is a Vega-based card using HBM2 memory. Its FP32 performance is expected to be 12.3 TFLOPS. In contrast to the MI6 and MI8, the MI25 is able to increase performance when using lower-precision numbers, and accordingly is expected to reach 24.6 TFLOPS with FP16. The MI25 is rated at under 300 W TDP with passive cooling. It also provides 768 GFLOPS peak double precision (FP64) at 1/16th rate.[17]
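The headline numbers above follow directly from the CU count and clock. As a rough sketch (the 64-lanes-per-CU and 2-FLOPs-per-cycle-per-lane factors are standard for GCN fused multiply-add, but the helper below is illustrative, not an official AMD formula):

```python
# Peak vector throughput for a GCN-based card. Each compute unit (CU) has
# 64 shader lanes, and a fused multiply-add counts as 2 floating-point
# operations per cycle. Illustrative sketch, not an AMD specification.

def peak_tflops(compute_units, boost_clock_mhz, lanes_per_cu=64, ops_per_cycle=2):
    """Peak TFLOPS = CUs x lanes x FLOPs/cycle x clock (MHz) / 1e6."""
    return compute_units * lanes_per_cu * ops_per_cycle * boost_clock_mhz / 1e6

# MI25: 64 CUs at a 1500 MHz boost clock -> ~12.3 FP32 TFLOPS
mi25_fp32 = peak_tflops(64, 1500)
# Vega runs FP16 at double rate and FP64 at 1/16th rate
mi25_fp16 = mi25_fp32 * 2      # ~24.6 TFLOPS
mi25_fp64 = mi25_fp32 / 16     # ~768 GFLOPS
```

The same formula reproduces the MI6 figure (36 CUs at 1233 MHz gives about 5.7 TFLOPS), since Polaris runs FP16 at the same rate as FP32.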
The MI50 and MI60 are based on the Vega 20 variant of GCN 5. They support 1/2-rate FP64 and are the last Instinct cards to bear the Radeon branding, as well as the last with the ability to produce display output.
The CDNA 1 cards removed all rendering-related resources while adding matrix processing units.

The MI300A and MI300X are data center accelerators that use the CDNA 3 architecture, which is optimized for high-performance computing (HPC) and generative artificial intelligence (AI) workloads. The CDNA 3 architecture features a scalable chiplet design that leverages TSMC’s advanced packaging technologies, such as CoWoS (chip-on-wafer-on-substrate) and InFO (integrated fan-out), to combine multiple chiplets on a single interposer. The chiplets are interconnected by AMD’s Infinity Fabric, which enables high-speed and low-latency data transfer between the chiplets and the host system.
The MI300A is an accelerated processing unit (APU) that integrates 24 Zen 4 CPU cores with six CDNA 3 GPU chiplets (XCDs), for a total of 228 CUs in the GPU section, plus 128 GB of HBM3 memory. The Zen 4 CPU cores are built on the 5 nm process node and support the x86-64 instruction set, as well as the AVX-512 and BFloat16 extensions. They can run general-purpose applications and provide host-side computation for the GPU cores. The MI300A has a peak performance of 61.3 TFLOPS of FP64 (122.6 TFLOPS FP64 matrix) and 980.6 TFLOPS of FP16 (1961.2 TFLOPS with sparsity), as well as 5.3 TB/s of memory bandwidth. The MI300A supports PCIe 5.0 and CXL 2.0 interfaces, which allow it to communicate with other devices and accelerators in a heterogeneous system.
The MI300X is a dedicated generative AI accelerator that replaces the CPU cores with additional GPU chiplets and HBM memory, resulting in a total of 304 CUs (64 shaders per CU) and 192 GB of HBM3 memory. It is designed to accelerate generative AI applications such as natural language processing, computer vision, and deep learning. The MI300X has a peak performance of 653.7 TFLOPS of TF32 (1307.4 TFLOPS with sparsity) and 1307.4 TFLOPS of FP16 (2614.9 TFLOPS with sparsity), as well as 5.3 TB/s of memory bandwidth. The MI300X also supports PCIe 5.0 and CXL 2.0 interfaces, as well as AMD’s ROCm software stack, which provides a unified programming model and tools for developing and deploying generative AI applications on AMD hardware.[18][19][20]
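The 5.3 TB/s figure quoted for both MI300 parts follows from the width and data rate of the HBM3 interface. A minimal sketch, assuming the 8192-bit bus from the spec table and an effective pin rate of about 5.2 GT/s:

```python
# Peak HBM bandwidth = bus width (bits) x data rate (GT/s) / 8 bits per byte.
# The helper is illustrative; bus width and data rate come from the spec table.

def hbm_bandwidth_gbs(bus_width_bits, data_rate_gtps):
    """Peak memory bandwidth in GB/s."""
    return bus_width_bits * data_rate_gtps / 8

# MI300A/MI300X: 8192-bit HBM3 interface at ~5.2 GT/s -> ~5.3 TB/s
mi300_bw = hbm_bandwidth_gbs(8192, 5.2)   # 5324.8 GB/s
```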
The MI350X and MI355X are data center accelerators built on the CDNA 4 architecture, targeting advanced AI training and inference workloads. Manufactured on TSMC’s 3 nm (N3) process, they incorporate a high-performance chiplet design and feature 288 GB of HBM3E memory with 8 TB/s of bandwidth.[21] CDNA 4 introduces native support for the low-precision FP4 and FP6 formats, in addition to FP8 and FP16, boosting FP4 compute to up to 9.2 PetaFLOPS on the MI355X.[22] The architecture retains AMD’s Infinity Fabric interconnect for high-speed, low-latency data transfer between GPU chiplets and the host system. This design builds on CDNA 3, advancing both scalability and energy efficiency for large-scale AI deployments.
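The doubled "with Sparsity" figures in the tables assume structured sparsity, in which each group of four weights contains at most two non-zero values, letting the matrix engines skip half the multiplications. A minimal pruning sketch in pure Python (illustrative only; the function name is hypothetical, and real structured sparsity is applied by the hardware, not by host code):

```python
# 2:4 structured sparsity sketch: in every group of 4 weights, keep only
# the 2 largest-magnitude values and zero the rest. Hypothetical helper
# for illustration; not part of any AMD API.

def prune_2_of_4(weights):
    """Zero out the 2 smallest-magnitude values in each group of 4."""
    pruned = []
    for i in range(0, len(weights), 4):
        group = weights[i:i + 4]
        # Indices of the two largest-magnitude entries in this group
        keep = sorted(range(len(group)), key=lambda j: abs(group[j]))[-2:]
        pruned.extend(w if j in keep else 0.0 for j, w in enumerate(group))
    return pruned

dense = [0.9, -0.1, 0.05, -0.7, 0.3, 0.2, -0.8, 0.01]
sparse = prune_2_of_4(dense)
# Each 4-wide group of `sparse` now has exactly 2 non-zero entries
```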
The following software is, as of 2022, regrouped under the ROCm (Radeon Open Compute) meta-project.
The MI6, MI8, and MI25 products all support AMD's MxGPU virtualization technology, enabling sharing of GPU resources across multiple users.[1][23]
MIOpen is AMD's deep learning library to enable GPU acceleration of deep learning.[1] Much of this extends GPUOpen's Boltzmann Initiative software.[23] It is intended to compete with the deep learning portions of Nvidia's CUDA library. It supports the deep learning frameworks Theano, Caffe, TensorFlow, MXNet, Microsoft Cognitive Toolkit, Torch, and Chainer. Programming is supported in OpenCL and Python, in addition to supporting the compilation of CUDA through AMD's Heterogeneous-compute Interface for Portability (HIP) and the Heterogeneous Compute Compiler.

| Model (Code name) | Launch | Architecture & fab | LLVM target[24] | Transistors & die size | Config[e] | Clock[a] (MHz) | Texture fillrate[a][b] (GT/s) | Pixel fillrate[a][c] (GP/s) | FP16 TFLOPS[a][d] | FP32 TFLOPS | FP64 TFLOPS | Memory size (GB) | Bus type & width | Bandwidth (GB/s) | Memory clock (MT/s) | TBP | Bus interface |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Radeon Instinct MI6 (Polaris 10)[25][26][27][28][29][30] | Jun 20, 2017 | GCN 4 GloFo 14LP | gfx803 | 5.7×109 232 mm2 | 2304:144:32 36 CU | 1120 1233 | 161.3 177.6 | 35.84 39.46 | 5.161 5.682 | 5.161 5.682 | 0.323 0.355 | 16 | GDDR5 256-bit | 224 | 7000 | 150 W | PCIe 3.0 ×16 |
| Radeon Instinct MI8 (Fiji)[25][26][27][31][32][33] | GCN 3 TSMC 28 nm | gfx803 | 8.9×109 596 mm2 | 4096:256:64 64 CU | 1000 | 256.0 | 64.00 | 8.192 | 8.192 | 0.512 | 4 | HBM 4096-bit | 512 | 1000 | 175 W | ||
| Radeon Instinct MI25 (Vega 10)[25][26][27][34][35][36][37] | GCN 5 GloFo 14LP | gfx900 | 12.5×109 510 mm2 | 1400 1500 | 358.4 384.0 | 89.60 96.00 | 22.94 24.58 | 11.47 12.29 | 0.717 0.768 | 16 | HBM2 2048-bit | 484 | 1890 | 300 W | |||
| Radeon Instinct MI50 (Vega 20)[38][39][40][41][42][43] | Nov 18, 2018 | GCN 5 TSMC N7 | gfx906 | 13.2×109 331 mm2 | 3840:240:64 60 CU | 1450 1725 | 348.0 414.0 | 92.80 110.4 | 22.27 26.50 | 11.14 13.25 | 5.568 6.624 | 16 32 | HBM2 4096-bit | 1024 | 2000 | 300 W | PCIe 4.0 ×16 |
| Radeon Instinct MI60 (Vega 20)[39][44][45][46] | 4096:256:64 64 CU | 1500 1800 | 384.0 460.8 | 96.00 115.2 | 24.58 29.49 | 12.29 14.75 | 6.144 7.373 | 32 | |||||||||
| Model (Code name) | Launch | Architecture & fab | LLVM target[24] | Transistors & die size | Config[e] | Clock[a] (MHz) | INT8 TOPS[a][g] | FP16 TFLOPS[a][h] | FP32 TFLOPS | FP64 TFLOPS | FP32 matrix speedup[f] | FP64 matrix speedup[f] | Structured-sparsity speedup[f] | Memory size (GB) | Bus type & width | Bandwidth (GB/s) | Memory clock (MT/s) | TBP | Bus interface |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AMD Instinct MI100 (Arcturus)[47][48][49] | Nov 16, 2020 | CDNA 1 TSMC N7 | gfx908 | 25.6×109 750 mm2 | 7680:480:- 120 CU | 1000 1502 | 122.9 184.6 | 122.9 184.6 | 15.36 23.07 | 7.680 11.54 | 2× | 2× | 1× | 32 | HBM2 4096-bit | 1228.8 | 2400 | 300 W | PCIe 4.0 ×16 |
| AMD Instinct MI210 (Aldebaran)[50][51][52] | Mar 22, 2022 | CDNA 2 TSMC N6 | gfx90a | 28 × 109 ~770 mm2 | 6656:416:- 104 CU (1 ×GCD)[i] | 1000 1700 | 106.5 181.0 | 106.5 181.0 | 13.31 22.63 | 13.31 22.63 | 2× | 2× | 1× | 64 | HBM2E 4096-bit | 1638.4 | 3200 | 300 W | |
| AMD Instinct MI250 (Aldebaran)[53][54][55] | Nov 8, 2021 | 58 × 109 1540 mm2 | 13312:832:- 208 CU (2 ×GCD) | 213.0 362.1 | 213.0 362.1 | 26.62 45.26 | 26.62 45.26 | 2× | 2× | 1× | 2 × 64 | HBM2E 2 × 4096-bit[j] | 2 × 1638.4 | 500 W 560 W (Peak) | |||||
| AMD Instinct MI250X (Aldebaran)[56][54][57] | 14080:880:- 220 CU (2 ×GCD) | 225.3 383.0 | 225.3 383.0 | 28.16 47.87 | 28.16 47.87 | 2× | 2× | 1× | |||||||||||
| AMD Instinct MI300A (Antares)[58][59][60][61] | Dec 6, 2023 | CDNA 3 TSMC N5 &N6 | gfx942 | 146 × 109 1017 mm2 | 14592:912:- 228 CU (6 ×XCD) | 2100 | 1961.2 | 980.6 | 122.6 | 61.3 | 1× | 2× | 2× | 128 | HBM3 8192-bit | 5300 | 5200 | 550 W 760 W (Liquid Cooling) | PCIe 5.0 ×16 |
| AMD Instinct MI300X (Aqua Vanjaram)[62][63][64][65] | 153 × 109 1017 mm2 | 19456:1216:- 304 CU (8 ×XCD) | 2614.9 | 1307.4 | 163.4 | 81.7 | 1× | 2× | 2× | 192 | 750 W | ||||||||
| AMD Instinct MI350X[66][67] | CDNA 4 TSMC N3 &N6 | gfx950 | 185 × 109 1017 mm2 | 16384:1024:- 256 CU (8 ×XCD) | 2200 | 4600[k] | 144.2 | 144.2 | 72.1 | 1× | 1× | 2× | 288 | HBM3e 8192-bit | 8000 | 8000 | 1000 W | PCIe 5.0 ×16 (OAM) | |
| AMD Instinct MI355X | 2400 | 1× | 1× | 2× | 288 | 1400 W | |||||||||||||