Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Language models transmit behavioural traits through hidden signals in data
(opens in new tab)
(nature.com)
4 points
armcat
2mo ago
2 comments
Save
Share
2 comments
2 comments · 2 top-level
top
newest
oldest
zahra_lahrsson
2mo ago
Related to this:
https://www.nature.com/articles/d41586-026-00906-0
(LLMs can subliminally learn malicious behavior through distilling)
pop_mccoy
2mo ago
Explains the high performance of distilled models then (e.g. Chinese ones).
j
/
k
navigate · click thread line to collapse