
Browser Use
The open-source default for browser-controlling AI agents.
Thesis
- 01
The browser is the universal computer-use surface. Operating systems fragment; the web unifies. Every workflow that matters — Salesforce, Google Workspace, Kayak, Notion, LinkedIn, every internal admin tool — already runs in a browser. When Anthropic shipped Computer Use and OpenAI shipped Operator within three months of each other, they were both betting on the same thesis.[3] [4] The browser is where agents go to do work because the work is already there.
- 02
OSS won this category in three months. Browser Use went from launch to 50k+ stars by being the cleanest implementation of the thing every agent dev was hacking together. 14% of YC's W25 batch was already in production at memo time.[5] OpenAI Operator is the closed, managed answer; Browser Use is the open one — and 14% of YC W25 chose open.[2]
- 03
The next agent stack is built on Browser Use the same way the last one was built on Playwright. Selenium → Playwright → Browser Use is the same arc. Each step abstracts away the layer underneath. Browser Use is model-agnostic — Claude, GPT, Gemini, DeepSeek, Qwen, and local Ollama all work the same way — so the bet is on the harness, not the model.[6] [13]
- 04
The founders are exactly the right team. Magnus Müller (ETH Zürich, ML researcher, repeat founder, web-scraping hobbyist since he learned to code) and Gregor Žunič (ETH Zürich Data Science MSc, Physics BSc) built the open-source default for what every AI lab is now scrambling to ship. Pivoted from an SEO startup the moment the bigger surface became visible. Shipped Browser Use Cloud within 72 hours of OpenAI's Operator announcement.[7] [9]
Problem
LLMs can think. The web is where they have to act. Every workflow that matters runs behind a UI built for humans.
Every AI agent that does paid work ends up needing to use a browser. Booking a flight, filing a form, reading a SaaS dashboard, posting on LinkedIn, scraping a competitor's pricing page, running a research workflow across ten sites — all of it lives behind interfaces designed for humans, not APIs.
Building that capability in-house is brutal. Selectors break every time a site ships a redesign. Bot-detection systems fingerprint your headless Chrome and block you. Authentication adds friction at every step. Rate limits, parsing errors, and API-key sprawl turn every agent into a maintenance project that nobody on the team wants to own.
The 61% of the web with no public API is exactly the surface where the highest-value agent work happens — and it's the surface that breaks the fastest. The category has been waiting for a Playwright-equivalent primitive — something built for LLM-driven control rather than scripted automation. Browser Use is that primitive.[13]
61%
of the web has no public API
The high-value workflow surface, unaddressed by API-first tools
$5B → $30B
Web RPA TAM (2024 → 2030)
Agent-driven control expands it another 10×
3 months
Browser Use launch → 50k stars
Operator's January 2025 launch added 25k stars in four days
Why Now
Anthropic shipped Computer Use in October. OpenAI shipped Operator in January. The category opened in front of our eyes.
Three preconditions converged in the same six months: frontier labs validated the browser surface, the harness caught up with the model, and OSS distribution beat closed distribution.
Operator can go to the web and do tasks for you. We think this will be a really big deal — agents that actually do things on the internet.
Sam Altman[4]
CEO · OpenAI
Instead of making specific tools to help Claude complete individual tasks, we're teaching it general computer skills — allowing it to use a wide range of standard tools and software programs designed for people.
Anthropic[3]
Claude 3.5 Sonnet Computer Use
Our vision is simple: tell your computer what to do, and it gets it done.
Magnus Müller[7]
Co-founder · Browser Use
Three preconditions converged in the same six months.
Frontier labs validated computer use. Anthropic shipped Computer Use with Claude 3.5 Sonnet in October 2024.[3] OpenAI shipped Operator on January 23, 2025.[4] Two of the three biggest labs placed the same bet inside a single quarter: the browser is where agents work. The category that didn't exist on YC application day was suddenly the most-discussed surface in the AI stack.
The harness caught up with the model. Browser Use hit 89.1% on the WebVoyager benchmark — state-of-the-art at the time, ahead of OpenAI Operator's 87%.[8] A year ago, none of this worked reliably. Now the bottleneck has moved from "can the model see the page?" to "what's the cleanest abstraction on top?" That second question is the entire venture opportunity, and it's the question Browser Use answers.
OSS distribution beat closed distribution. When Operator launched in a closed beta available only to $200/month ChatGPT Pro subscribers, Browser Use gained 25k+ stars in four days as developers searched for the open alternative.[2] [7] Magnus and Gregor were building in public on X the entire time, shipping the cloud version within 72 hours of the announcement. The closed launch became Browser Use's largest distribution event.
Instead of making specific tools to help Claude complete individual tasks, we're teaching it general computer skills — allowing it to use a wide range of standard tools and software programs designed for people.
How It Works
Three layers. One Python import. Any LLM controls a browser.
Open core, managed cloud. Same shape that won Postgres, Next.js, and Kafka their categories.
The OSS core is the install. pip install browser-use, point it at the LLM of your choice, and you're running. MIT licensed, no telemetry tax, no vendor lock-in. The reason 14% of YC W25 is already in production: the integration takes an afternoon.[2] [5]
Browser Use Cloud is the renewal. Proxy rotation, persistent sessions, parallel execution, and a managed control plane at $30/month — vs. $200/month for ChatGPT Pro (the only way to get Operator at launch).[9] The cloud absorbs the operational complexity the OSS version can't.
Model neutrality is the structural moat. Every closed competitor — Operator, Computer Use — is locked to a single model vendor. Every AI engineer building a real product needs fallbacks, cost ceilings, and the freedom to swap models when a new SOTA lands. Browser Use is the only harness that respects that.[6]
The OSS Distribution Story
OpenAI launched Operator. Browser Use gained 25k stars in four days.
The textbook example of how OSS captures demand created by a closed competitor's launch — and the reason Browser Use is now the default browser-agent harness inside the AI cohort.
The Operator launch was Browser Use's largest distribution event.
January 23, 2025 — Operator ships behind a paywall. OpenAI announces Operator: closed beta, $200/month ChatGPT Pro requirement, GPT-4o locked, no self-hosting, no enterprise API at launch.[4] Every developer who wanted to ship a browser agent against a non-OpenAI model — or didn't want to pay $200/month — went looking for an alternative.
72 hours later — Browser Use Cloud ships. Magnus and Gregor shipped the managed cloud version of Browser Use inside the news cycle. $30/month entry pricing.[9] Model neutral. Self-hostable. Every Operator limitation became a Browser Use feature on the launch page.
Four days later — 25k stars. The GitHub star curve absorbed the entire Operator news cycle.[2] The W25 batch picked Browser Use as the default browser-agent harness. Three months later: 50k stars, 5k Discord members, 500+ contributors. The OSS distribution flywheel is the same one Vercel rode with Next.js, Supabase with Postgres, and Mastra with TypeScript agents — except Browser Use compressed that arc into a quarter.[10]
Our vision is simple: tell your computer what to do, and it gets it done.
Market
The densest buyer pool is every AI startup shipping an agent.
Every AI startup shipping an agent ends up needing browser control. YC's W25 batch already adopted Browser Use as the default — 14% in production at memo time.[5] The next ring is the broader Seed-to-Series-B AI cohort: research agents, sales agents, ops agents, coding agents that need to browse. The agent framework boom is mid-cycle; Browser Use sits underneath it as the primitive everyone reaches for.
The long pole is every business workflow that runs in a browser. Web RPA sits at ~$5B today growing to ~$30B by 2030 — and that's just the existing automation pool. The agent layer expands it 10× by making automation viable for the 61% of the web with no API. When agents replace BPO, when every SaaS company ships an agent SKU, when every enterprise has a fleet of agents doing knowledge-worker tasks — Browser Use is the harness underneath.
Every AI agent project is a browser-control project in waiting. Browser Use should be the answer by default — and it is, three months into the curve.
Competitive landscape
Four categories of competition. Browser Use is positioned against all of them.
Each category has a structural limitation — model lock-in, missing harness, missing OSS, or missing AI-native abstraction. Model-agnostic + OSS + Python + Cloud is the answer to all four.
The closed labs ship the agent. The OSS default ships the harness underneath it. Browser Use is the harness — and the harness compounds across model generations the agent never will.
Founder deep dive
Two ETH Zürich engineers who built bots before it was a category — and pivoted the moment the bigger surface arrived.
Founder & team
Risks & mitigations
What we're watching
References
- [1]Browser Use — Product homepage
- [2]GitHub — browser-use/browser-use (MIT license, 50k+ stars at memo time, 98k+ today)
- [3]Anthropic — Introducing computer use, Claude 3.5 Sonnet (Oct 22, 2024)
- [4]OpenAI — Introducing Operator (Jan 23, 2025)
- [5]Y Combinator — Browser Use company profile (W25)
- [6]Browser Use — Supported models documentation
- [7]Y Combinator Launches — Browser Use: Open-Source Alternative to OpenAI Operator
- [8]Browser Use — State-of-the-art on WebVoyager (89.1% success across 586 web tasks)
- [9]Browser Use Cloud — Hosted version ($30/month vs. $200 ChatGPT Pro for Operator)
- [10]Browser Use Discord — Community (5k+ active members)
- [11]Magnus Müller on X — @mamagnus00 (building in public)
- [12]Gregor Žunič on X — @gregpr07 (building in public)
- [13]Microsoft Playwright — the previous-generation browser-automation primitive
- [14]WebVoyager — Benchmark for vision-language web agents


