Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
trashcan2137
2mo ago
0 comments
Save
Share
and the EOS is "<turn|>". "<|channel>thought\n" is also used for the thinking trace!
Can someone explain this to me? Why is this faux-XML important here?
0 comments
2 comments · 2 top-level
top
newest
oldest
pertymcpert
2mo ago
That’s how the model is trained to signal the end to its generation and to indicate its thinking.
sroussey
2mo ago
These are likely individual tokens. They are super common.
j
/
k
navigate · click thread line to collapse