8dazo on Hacker News

Ask HN: Is there any tool that can stop LLM calls at runtime (not just monitor)?

I’ve been running into cases where LLM/agent systems make unexpected or repeated calls and costs spike quickly.

Most tools I’ve found focus on observability (logs, traces, dashboards), but not actual enforcement.

Is there anything that can:

- stop or cut off a call mid-execution (based on budget, tokens, or conditions)?

- enforce limits at runtime instead of just alerting after the fact?

Curious if people here are solving this in practice, or just handling it at the application level.

38dazo2mo ago0

I’ve been running into an issue with LLM/agent systems where unexpected loops or repeated calls can quickly drive up costs.

Most tools I’ve seen focus on observability (logs, traces, dashboards), but not actual enforcement at runtime.

Curious how people here are handling this in production:

- Are you enforcing hard limits (budget, rate, etc.) or just monitoring?

- Do you handle this at the app level or via some middleware/proxy?

- Have you built something in-house for this?

Feels like an unsolved problem, especially with agents.

Would love to hear how others are dealing with it.

38dazo2mo ago2