Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Browser Agent Benchmark: Comparing LLM models for web automation (browser-use.com)
6 points by MagMueller 4 hours ago | hide | past | favorite | 2 comments




It's lacking the best model (Opus 4.5) on the benchmark tho.

Since we're in this topic, can anyone suggest good AI-based tool for exploratory (fuzzy?) web testing?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: