Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Language models transmit behavioural traits through hidden signals in data | Better HN
Language models transmit behavioural traits through hidden signals in data
(opens in new tab)
(nature.com)
4 points
armcat
24d ago
2 comments
Share
2 comments
default
newest
oldest
zahra_lahrsson
24d ago
Related to this:
https://www.nature.com/articles/d41586-026-00906-0
(LLMs can subliminally learn malicious behavior through distilling)
pop_mccoy
24d ago
Explains the high performance of distilled models then (e.g. Chinese ones).
j
/
k
navigate · click thread line to collapse