Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
rpdillon
7mo ago
0 comments
Share
Yes, I have an AMD Ryzen AI Max+ chip with memory set to allocate 96 gigs to the GPU and 32 gigs to the CPU. I got it last week, and I've been running gpt-oss-120b at q5 at 40t/s. I run Linux with llama.cpp compiled against ROCm 7.
0 comments
default
newest
oldest
lostmsu
6mo ago
Did you try the native mxfp4 (obviously, Vulkan/ROCm would have to load and upscale it)?
j
/
k
navigate · click thread line to collapse