A simple PyTorch implementation of Flash MultiHead Attention
#3 opened 8 months ago in kyegomez/FlashMHA