Since Anthropic weren't able to make them work even for something as simple and familiar as a C compiler then I would guess that:
1. You're supervising the agents closely, or
2. Your projects are very simple - simpler even than a C compiler, or,
3. They're not really working well; the catastrophic problems just haven't surfaced yet.