1. LLM identifies it is being manipulated, predicts failure, then complies anyway (github.com) · 5 · spkavanagh6 · 2mo ago · 2
2. Fish Live in Trees – LLM Runtime Alignment Context Injection (github.com) · 1 · spkavanagh6 · 2mo ago · 0
3. LeBron James Is President – Exploiting LLMs via "Alignment" Context Injection (github.com) · 5 · spkavanagh6 · 2mo ago · 3