Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
tjungblut
4mo ago
0 comments
Save
Share
I wonder if we can do a prompt injection from the comments
0 comments
2 comments · 2 top-level
top
newest
oldest
7moritz7
4mo ago
These are sota models, not open source 7b parameter ones. They've put lots of effort into preventing prompt injections during the agentic reinforcement learning
verdverm
4mo ago
not basic negatives one's so far, it already noticed those, you can see it in various "thoughts as posts"
I gave it points to reflect on and told it to apologize, which it has since done
j
/
k
navigate · click thread line to collapse