general performance is predicted to become equivalent or better than other architectures educated on related information, although not to match much larger or high-quality-tuned models.
Mamba, like Flash Attention, https://k2spiceshop.com/product/liquid-k2-on-paper-online/