Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
lanstin
5mo ago
0 comments
Save
Share
We need to train LLMs in a situation like a semi-trustworthy older sibling trying to get you to fall for tricks.
0 comments
1 comments · 1 top-level
top
newest
oldest
TeMPOraL
5mo ago
That's what we are doing, with the Internet playing the role of the sibling. Every successful attack the vendors learn about becomes an example to train next iteration of models to resist.
j
/
k
navigate · click thread line to collapse