An open source community implementation of the model from "DIFFERENTIAL TRANSFORMER" paper by Microsoft.
#5 opened 1 month ago in kyegomez/DifferentialTransformer