undefined | Better HN

0 pointshmokiguess2d ago0 comments

Can someone help me understand why classic sanitizing is not used as a solved problem to prompt injection? All these tags, patterns, etc, feel like prime for a parser rule, but maybe I am thinking too abstract here and missing an obvious knowledge gap I have on LLMs

0 comments

2 comments · 1 top-level

vova_hn21d ago· 1 in thread

Role tags are not actual symbols "<system>", they are special tokens that do not correspond to any normal text. So you can't really inject a role tag, that is not the actual problem.

hmokiguessOP20h ago

as in this stuff happens at the tokenizer / internal representation layer? sorry can you help me understand why can't we sanitize it?

j / k navigate · click thread line to collapse