Implementation of MoE-Mamba from the paper "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in PyTorch and Zeta
#12 opened 1 month ago in kyegomez/MoE-Mamba
#9 opened 2 months ago in kyegomez/MoE-Mamba