[LG] Routing Mamba: Scaling State Space Models with Mixture-of-Experts Projection
[Microsoft]
https://arxiv.org/abs/2506.18145