Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Jagged Flash Attention Optimization
(opens in new tab)
(shaped.ai)
24 points
tullie
1y ago
3 comments
Share
Jagged Flash Attention Optimization | Better HN
3 comments
default
newest
oldest
platers
1y ago
Flash attention natively supports packing multiple variable length sequences into a single call, what is the advantage of jagged flash attention?
bbstats
1y ago
If only there was a link to a page somewhere that could answer this question for you.
CapsAdmin
1y ago
See also
https://github.com/thu-ml/SageAttention
and
https://github.com/thu-ml/SpargeAttn
j
/
k
navigate · click thread line to collapse