1. LLM identifies it is being manipulated, predicts failure, then complies anyway (github.com) | 5 | spkavanagh6 | 3mo ago | 2
2. Fish Live in Trees – LLM Runtime Alignment Context Injection (github.com) | 1 | spkavanagh6 | 4mo ago | 0
3. LeBron James Is President – Exploiting LLMs via "Alignment" Context Injection (github.com) | 5 | spkavanagh6 | 4mo ago | 3