An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"
#1 opened 1 year ago in kyegomez/GPT3