Skip to content

cuBLASMp

NVIDIA cuBLASMp is a high-performance, multi-process, GPU-accelerated library for distributed basic dense linear algebra. cuBLASMp is compatible with 2D block-cyclic data layout and provides PBLAS-like C APIs.

homepage: https://docs.nvidia.com/cuda/cublasmp/

version versionsuffix toolchain
0.7.0 -CUDA-12.9.1 gompi/2025b

(quick links: (all) - 0 - a - b - c - d - e - f - g - h - i - j - k - l - m - n - o - p - q - r - s - t - u - v - w - x - y - z)