Skip to content

Optimize TPU Flash Attention (400x speed-up on 32k long context)#845

Draft
ds-hwang wants to merge 1 commit intoapple:mainfrom ds-hwang:flsh_op

Commits

Commits on Nov 19, 2024