A simple PyTorch implementation of Flash MultiHead Attention
#3 opened 6 months ago in kyegomez/FlashMHA