Show HN: I built an integration for RL training of browser agents for everyone (opens in new tab)

(github.com)

5 pointsfiltr1213h ago1 comments

This integration allows for scalable evals and training of browser agents with hosted Prime Intellect eval + training pipelines and headless browser infrastructure on Browserbase to RL train browser agents with LoRA.

1 comments

nithisha220111h ago

Interesting, how do you handle the observability side during training? One thing I ran into with multi-agent RL is that reward signals alone don't tell you much about why an agent is failing. Curious if you've built any tooling around that.

j / k navigate · click thread line to collapse

Show HN: I built an integration for RL training of browser agents for everyone (opens in new tab)

(github.com)

5 pointsfiltr1213h ago1 comments

1 comments

nithisha220111h ago

j / k navigate · click thread line to collapse