Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
csomar
1y ago
0 comments
Save
Share
Models are predictable at 0 temperatures. They might have tested the output beforehand.
0 comments
3 comments · 1 top-level
top
newest
oldest
fzzzy
1y ago
· 2 in thread
Models in practice haven't been deterministic at 0 temperature, although nobody knows exactly why. Either hardware or software bugs.
Jensson
1y ago
We know exactly why, it is because floating point operations aren't associative but the GPU scheduler assumes they are, and the scheduler isn't deterministic. Running the model strictly hurts performance so they don't do that.
fzzzy
1y ago
Cool, thanks a lot for the explanation. Makes sense.
j
/
k
navigate · click thread line to collapse