If you look at Elixir keynote for Phoenix.new -- a cool agentic coding tool -- you'll see some hints about a browser control using a API tool call. It's called "web" in the video.
Cool to see lots of people independently come to "CLIs are all you need". I'm still not sure if it's a short-term bandaid because agents are so good at terminal use or if it's part of a longer term trend but it's definitely felt much more seamless to me then MCPs.
If you look at Elixir keynote for Phoenix.new -- a cool agentic coding tool -- you'll see some hints about a browser control using a API tool call. It's called "web" in the video.
Video: https://youtu.be/ojL_VHc4gLk?t=2132
More discussion: https://simonwillison.net/2025/Jun/23/phoenix-new/
Cool to see lots of people independently come to "CLIs are all you need". I'm still not sure if it's a short-term bandaid because agents are so good at terminal use or if it's part of a longer term trend but it's definitely felt much more seamless to me then MCPs.
(my one of many contribution https://github.com/caesarnine/binsmith)
A little bit different, but also allows to scrape efficiently. Json http communication rather than cli.
https://github.com/rumca-js/crawler-buddy
More like a framework for other mechanisms
This looks remarkably similar to https://github.com/vercel-labs/agent-browser
How is it different?
vibium clicker, too. https://github.com/VibiumDev/vibium/blob/main/CONTRIBUTING.m...
"browser automation for ai agents" is a popular idea these days.
is there a benchmark? there are a lot of scraping agents nowdays..