You're right that current systems aren't close to that level of reasoning.
What I'm wondering is whether we can approximate some of it structurally — by defining when execution is allowed or not — even without that level of sophistication in the model itself.
Curious how far you think simple constraint systems can go before something like that kind of reasoning becomes necessary.