Skip to content

Mamba-SSM

Mamba is a new state space model architecture showing promising performance on information-dense data such as language modeling, where previous subquadratic models fall short of Transformers. It is based on the line of progress on structured state space models, with an efficient hardware-aware design and implementation in the spirit of FlashAttention.

homepage: https://github.com/state-spaces/mamba

version versionsuffix toolchain
2.2.4 -PyTorch-2.1.2-CUDA-12.1.1 foss/2023a

(quick links: (all) - 0 - a - b - c - d - e - f - g - h - i - j - k - l - m - n - o - p - q - r - s - t - u - v - w - x - y - z)