Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
whimsicalism
2y ago
0 comments
Save
Share
You don't benchmark foundation model against RLHF model, results aren't very useful.
0 comments
2 comments · 1 top-level
top
newest
oldest
moffkalast
2y ago
· 1 in thread
This does seem to be a RLHF model, not a base model. Unless 'supervised fine-tuning' and 'human preference' mean something else.
whimsicalism
OP
2y ago
Ah I see there is also a llama-2-chat model.
j
/
k
navigate · click thread line to collapse