ROCm

From Wikipedia, the free encyclopedia

Parallel computing platform: GPGPU libraries and application programming interface

ROCm

Developer	AMD
Initial release	November 14, 2016

Stable release	7.1.0 / October 30, 2025^[1]

Repository	Meta-repository github.com/ROCm/ROCm
Written in	C, C++, Python, Fortran, Julia
Middleware	HIP
Engine	AMDgpu kernel driver, HIPCC, an LLVM-based compiler
Operating system	Linux, Windows^[2]
Platform	Supported GPUs
Predecessor	Close to Metal, Stream, HSA
Size	< 2 GiB
Type	GPGPU libraries and APIs
License	MIT License
Website	www.amd.com/en/products/software/rocm.html

ROCm^[3] is an Advanced Micro Devices (AMD) software stack for graphics processing unit (GPU) programming. ROCm spans several domains, including general-purpose computing on graphics processing units (GPGPU), high-performance computing (HPC), and heterogeneous computing. It offers several programming models: HIP (GPU-kernel-based programming), OpenMP (directive-based programming), and OpenCL.

ROCm is free, libre and open-source software (except the GPU firmware blobs^[4]), and it is distributed under various licenses. ROCm initially stood for Radeon Open Compute platform; however, because Open Compute is a registered trademark, ROCm is no longer an acronym. It is simply AMD's open-source stack designed for GPU compute.

Name of GPU series	Southern Islands	Sea Islands	Volcanic Islands	Arctic Islands/Polaris	Vega	Navi 1X	Navi 2X	Navi 3X	Navi 4X
Released	Jan 2012	Sep 2013	Jun 2015	Jun 2016	Jun 2017	Jul 2019	Nov 2020	Dec 2022	Jan 2025
Marketing Name	Radeon HD 7000	Radeon Rx 200	Radeon Rx 300	Radeon RX 400/500	Radeon RX Vega/Radeon VII(7 nm)	Radeon RX 5000	Radeon RX 6000	Radeon RX 7000	Radeon RX 9000
AMD support
Instruction set	GCN instruction set					RDNA instruction set
Microarchitecture	GCN 1st gen	GCN 2nd gen	GCN 3rd gen	GCN 4th gen	GCN 5th gen	RDNA	RDNA 2	RDNA 3	RDNA 4
Type	Unified shader model
ROCm				needs env var in 5.x^[16]	dropped in 6.x^[17]	^[a]
OpenCL	1.2 (on Linux: 1.1 (no Image support) with Mesa 3D)	2.0 (Adrenalin driver on Win 7+) (on Linux: 1.1 (no Image support) with Mesa 3D, 2.0 with AMD drivers or AMD ROCm)				2.0	2.1^[20]
Vulkan	1.0 (Win 7+ or Mesa 17+)	1.2 (Adrenalin 20.1, Linux Mesa 3D 20.0)						1.3
Shader model	5.1	5.1 6.3			6.4		6.5	6.7
OpenGL	4.6 (on Linux: 4.6 (Mesa 3D 20.0))
Direct3D	11 (11_1) 12 (11_1)	11 (12_0) 12 (12_0)			11 (12_1) 12 (12_1)		11 (12_1) 12 (12_2)
`/drm/amdgpu`^[b]	Experimental^[21]


Background

Heterogeneous System Architecture Intermediate Language

Programming abilities

Hardware support

Professional-grade GPUs

Consumer-grade GPUs

Software ecosystem

Learning resources

Third-party integration

Machine learning

Supercomputing

Other acceleration & graphics interoperation

Other Languages

Julia

Software distribution

Official

Third-party

Components

Low-level

ROCk – Kernel driver

ROCm – Device libraries

ROCt – Thunk

ROCr – Runtime

ROCm – CompilerSupport

Mid-level

ROCclr Common Language Runtime

OpenCL

HIP – Heterogeneous Interface for Portability

HIPCC

HIPIFY

GPUFORT

High-level

rocBLAS / hipBLAS

rocSOLVER / hipSOLVER

Utilities

Comparison with competitors

Nvidia CUDA

Intel oneAPI

Unified Acceleration Foundation (UXL)

See also

References

External links