Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
New deepseek paper: Natively Trainable Sparse Attention mechanism
(opens in new tab)
(twitter.com)
5 points
redlock
1y ago
1 comments
Save
Share
1 comments
1 comments · 1 top-level
top
newest
oldest
eunos
1y ago
Authored and Uploaded by none others than Liang Wenfeng himself
j
/
k
navigate · click thread line to collapse