michaeljx
11mo ago
Ah OK, got it: the model computes a probability distribution deterministically, and the next token is then sampled from it, hence the random output. But why not just take the token with the highest probability/weight? Is this to avoid getting stuck in some local minimum?
minimaxir
11mo ago
It depends on your use case. Deterministic output is less "creative."
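To make the difference concrete, here is a minimal sketch of greedy picking versus sampling. The toy vocabulary, the logits, and the helper names (`softmax`, `greedy_pick`, `sample_pick`) are all made up for illustration; this is not how any particular model or library implements decoding.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def greedy_pick(logits):
    """Always return the index of the highest-scoring token (deterministic)."""
    return max(range(len(logits)), key=lambda i: logits[i])

def sample_pick(logits, temperature=1.0, rng=random):
    """Draw an index at random, weighted by the softmax probabilities."""
    probs = softmax(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]

# Hypothetical toy vocabulary and scores, invented for this example.
vocab = ["the", "a", "cat", "dog"]
logits = [2.0, 1.5, 0.5, 0.1]

rng = random.Random(0)  # seeded so a run is repeatable
greedy = [vocab[greedy_pick(logits)] for _ in range(5)]
sampled = [vocab[sample_pick(logits, temperature=1.0, rng=rng)] for _ in range(5)]

print(greedy)   # the same token every time
print(sampled)  # can vary from draw to draw
```

Greedy picking returns the top-scoring token on every call, while sampling can return any token in proportion to its probability. Lowering `temperature` toward zero pushes sampling toward the greedy choice.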