In practice, you end up with an AI that won't do the former for the average person but can still be prompt-engineered into doing the latter by a sufficiently determined attacker.
Probably. But AI alignment research is still extremely primitive (I believe the field describes itself as "pre-paradigmatic"), so giving researchers some time to find a better way is at least worth trying.