Log in

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Issues looking for funding

#39 opened in kyegomez/Mixture-of-Depths

1
Fund
Creatorkyegomez
Stars73
LicenseMIT License
RepositoryGitHub