Stretching GPU performance for GEMMs and tensor contractions.
ROCm/Tensile
Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as a backend library for rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.
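For readers unfamiliar with the problem classes named above, the following is a minimal reference sketch of what a GEMM, a batched GEMM, and a small tensor contraction compute. It is plain, unoptimized Python written for illustration only. The function names (`gemm`, `batched_gemm`, `contract`) and the chosen contraction indices are assumptions made for this example and are not part of the Tensile or rocBLAS APIs.

```python
# Reference (unoptimized) definitions of the problem classes named above.
# Illustrative only: these hypothetical helpers are NOT the Tensile API.

def gemm(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for matrices stored as nested lists."""
    m, k, n = len(a), len(b), len(b[0])
    return [
        [alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
         for j in range(n)]
        for i in range(m)
    ]

def batched_gemm(alpha, a_batch, b_batch, beta, c_batch):
    """Apply the same GEMM independently to each (a, b, c) triple in a batch."""
    return [gemm(alpha, a, b, beta, c)
            for a, b, c in zip(a_batch, b_batch, c_batch)]

def contract(a, b):
    """Contract a[i][j][k] with b[k][j][l] over j and k, giving c[i][l]."""
    ni, nj, nk, nl = len(a), len(a[0]), len(a[0][0]), len(b[0][0])
    return [
        [sum(a[i][j][k] * b[k][j][l] for j in range(nj) for k in range(nk))
         for l in range(nl)]
        for i in range(ni)
    ]
```

A GEMM is the special case of a contraction with one summed index, and a batched GEMM adds an index that is carried through rather than summed. A tuned library computes the same results with kernels selected by benchmarking rather than with nested loops like these.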
Note
The published documentation is available at Tensile in an organized, easy-to-read format, with search and a table of contents. The documentation source files reside in the Tensile/docs/src folder of this repository. As with all ROCm projects, the documentation is open source. For more information on contributing to the documentation, see Contribute to ROCm documentation.