1Show HN: Open Operator Evals – real-world benchmarks for LLM web agents (opens in new tab)(github.com)GitHub3monoid731y ago1Save