Pretrained models for the paper 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy (https://arxiv.org/abs/2602.17363)
gmongaras/medium_8192sl_gpu_64bs__squared__sm_norm__A_mask_type_neg_softplus__in_conv_k_2__att2 • 3B
gmongaras/medium_8192sl_gpu_64bs__softmax • 0.7B
gmongaras/medium_8192sl_gpu_64bs__mamba • 0.7B
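
The repositories listed above can be fetched programmatically. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and the repositories are publicly readable. The helper name `fetch_checkpoint` is hypothetical and is not part of the paper's code. The sketch only downloads the files. How to load them into a model depends on the file layout of each repository, which is not described here.

```python
# Repository IDs copied verbatim from the list above.
REPO_IDS = [
    "gmongaras/medium_8192sl_gpu_64bs__squared__sm_norm__A_mask_type_neg_softplus__in_conv_k_2__att2",
    "gmongaras/medium_8192sl_gpu_64bs__softmax",
    "gmongaras/medium_8192sl_gpu_64bs__mamba",
]


def fetch_checkpoint(repo_id: str) -> str:
    """Download all files of one Hub repository and return the local directory.

    Hypothetical helper for illustration; assumes network access and a
    publicly readable repository.
    """
    # Imported lazily so REPO_IDS can be used without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id)
```

Calling `fetch_checkpoint(REPO_IDS[1])` would download the softmax baseline into the local Hugging Face cache and return the path of that snapshot.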