pidtom | Better HN

pidtom

5 karma | Joined March 27, 2026 | 4 submissions

Recent submissions

1

Skipping 90% of KV dequant work speeds up LLM decode by 22%

(github.com)

1 | pidtom | 2mo ago | 0