Tensile¶
Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as a backend library for rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.
homepage: https://github.com/ROCm/Tensile
| version | toolchain |
|---|---|
4.43.0 |
rocm-compilers/19.0.0-ROCm-6.4.1 |
(quick links: (all) - 0 - a - b - c - d - e - f - g - h - i - j - k - l - m - n - o - p - q - r - s - t - u - v - w - x - y - z)