Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Browser Agent Benchmark: Comparing LLM models for web automation
(opens in new tab)
(browser-use.com)
13 points
MagMueller
4mo ago
5 comments
Save
Share
5 comments
4 comments · 2 top-level
top
newest
oldest
pixel_popping
4mo ago
· 2 in thread
It's lacking the best model (Opus 4.5) on the benchmark tho.
djohnston
4mo ago
Yeah but then their own product might not score the highest.
pixel_popping
4mo ago
Exactly why I'm pointing it out, which feels a bit corrupt, but understandable.
1 more reply
wiradikusuma
4mo ago
Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?
j
/
k
navigate · click thread line to collapse