Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Why do LLMs attend to the first token?
(opens in new tab)
(arxiv.org)
2 points
adhi01
1y ago
1 comments
Save
Share
1 comments
1 comments · 1 top-level
top
newest
oldest
maytc
1y ago
Curious if the authors had a chance to look at the Softpick paper?
https://arxiv.org/abs/2504.20966
j
/
k
navigate · click thread line to collapse