1. LLM identifies it is being manipulated, predicts failure, then complies anyway (github.com) | 5 points | spkavanagh6 | 15d ago | 2 comments
2. Fish Live in Trees – LLM Runtime Alignment Context Injection (github.com) | 1 point | spkavanagh6 | 1mo ago | 0 comments
3. LeBron James Is President – Exploiting LLMs via "Alignment" Context Injection (github.com) | 5 points | spkavanagh6 | 1mo ago | 3 comments