Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
#68 opened 2 months ago in kyegomez/BitNet