
Superset
The IDE for the AI Agents Era.
7k+
Daily active users
Growing 50%+ week-over-week
10.8k
GitHub stars
From 0 in 5 months
2
Enterprise pilots
Ramp · Thrive Holdings
In use at




















Open-source · used by engineers across the world.
Thesis
Superset is the agentic IDE for the engineers already living in the future: tokenmaxxers running 10+ coding agents in parallel.[1][2] Today, it removes the pain of managing worktrees, terminal tabs, and tmux sessions. But the bigger vision is the control plane for software factories.[5]
As coding agents become developers, engineers become fleet commanders[13]: spinning up agents to refactor code, write tests, build UI, fix bugs, and open PRs in parallel. Over time, the job shifts from writing every line of code to designing, supervising, and improving the systems that write code.[5]
Superset is building the workspace for that future: a place to dispatch, isolate, monitor, review, and merge work from hundreds of agents.
If coding agents are the workers, Superset is the factory floor.
- 01
The bottleneck moved from writing code to orchestrating agents. Ramp's internal agent ships 30% of merged PRs.[7] Stripe's Minions produce 1,000+ PRs/week.[8] Spotify's Honk writes ~50% of their PRs.[9] Every major eng org is reinventing this layer. Superset is the off-the-shelf version.
- 02
Open-source + CLI-agnostic is strategic. Open-source is the wedge: ICs adopt the IDE in an hour, the community ships extensions, and the architecture is legible to enterprise security teams from day one. CLI-agnostic builds defensibility: developers run the native harness built into each CLI (Claude Code, Codex, Antigravity) and pay through their existing subscriptions — no full API rates, no capped Agent SDK credits, significant token savings vs API-wrapping competitors.[17]
- 03
Superset is building the horizontal command center. The market splits three ways: vertical agent platforms that own one model + one IDE, autonomous labor platforms that abstract the developer away, and horizontal orchestration layers that assume developers will use many agents and need a neutral cockpit. Superset leads the third strategy with an open-source, CLI-agent-agnostic approach.
- 04
Three repeat founders, all ex-CTOs at YC companies. Kiet (Onlook W25), Satya (Untether Labs W23), Avi (Adam W25). They shipped the MVP at a YC hackathon, won, and grew to 7k+ DAUs growing 50%+ week-over-week, 10.8k GitHub stars, and paid enterprise pilots (Ramp, Thrive Holdings) inside six months.[6]
Problem
The first wave of AI IDEs was built for one engineer, one agent.
Cursor, GitHub Copilot, Windsurf, and the rest were all designed around a single human paired with a single agent — one chat window, one editor, one diff at a time. That shape was necessary at the time: the underlying coding models still needed close supervision, manual edits, and frequent human overwrites for anything non-trivial. Which is exactly why these products were all built as forks of VS Code with the text editor as the primary surface and the agent as an assistant in the sidebar.
Then the models got good. Claude Sonnet 4.6 / Opus 4.7 (1M context), GPT-5 / Codex, and Antigravity can now write whole features, review their own PRs, run their own tests, and ship without a hand-holder. The 5–10× productivity gain everyone talks about only shows up when you run several of them at once — and the new top 5–10% of builders, the tokenmaxxers, have already shifted: instead of waiting idly while one agent works, they spin up many in parallel and review the outputs as they finish.
But running several at once is where the friction lives. You're juggling terminal tabs, learning tmux, setting up git worktrees by hand, switching branches, re-orienting yourself every time you come back to the agent you started twenty minutes ago, then reviewing diffs across scattered sessions. The VS-Code-fork architecture wasn't designed for this — it was designed for one human typing.
Most engineers respond by either (a) running one agent at a time and idling while it works, or (b) building bespoke internal infrastructure. The Fortune 500 dev orgs took option (b): Ramp built Inspect,[7] Stripe built Minions,[8] Spotify built Honk.[9] OpenAI coined the term harness engineering to describe the practice — and used it internally to ship a beta product of roughly 1M lines of code with zero manually-written source code.[4] Each of them is now publishing engineering posts about how essential this layer is — and how expensive it was to build.
5–10×
Code output gain
With parallel coding agents
6+ mo
In-house build time
Per Fortune 500 engineering case study
1
Click to a worktree
cmd+N in Superset
Why Now
Tokenmaxxer to Software Factory.
Andrej Karpathy has publicly described the exact product Superset is building — a proper "agent command center" IDE for teams of coding agents.[16] Alfred Lin (Sequoia) calls these builders fleet commanders and reports they're already 3–5× more productive than the median.[13] In the same six-month window, Ramp, Stripe, Spotify, and OpenAI all publicly described internal background-agent infrastructure.
The arc
Tomorrow · the team
The software factory
The org-scale version: dozens to hundreds of agents triggered upstream from Linear, GitHub, and Slack — running in sandboxed cloud workspaces, on schedules, and on event triggers. Humans move off the factory line: they architect the system and review the edge cases, but don't write or review any diff.[5] Ramp Inspect, Stripe Minions, and Spotify Honk are early implementations.[7][8][9]
The arc is the thesis. The same primitives — parallel agents, isolated worktrees, diff review, event-triggered orchestration — show up in both the tokenmaxxer's IDE today and the team's factory tomorrow. Superset's wedge is the open-source IDE an IC adopts inside an hour; the durable economic unit is the team-platform layer (Remote Workspaces, Automations, Shared Memory) that the same engineer pulls in once their team starts running parallel agents at scale. Every tokenmaxxer is a future buyer of the factory; every factory was started by a tokenmaxxer.
tmux grids are awesome, but i feel a need to have a proper "agent command center" IDE for teams of them, which I could maximize per monitor. E.g. I want to see/hide toggle them, see if any are idle, pop open related tools (e.g. terminal), stats (usage), etc.
Andrej Karpathy[16]
Founder · Eureka Labs · ex-OpenAI · ex-Tesla AI
The best builders today do not try to write every line of code. They use their taste and judgment to determine what to build and plan how to build it. Then they spin up parallel agents: one refactoring, one writing tests, one working on the UI, all operating with clear intent.
Alfred Lin[13]
Partner · Sequoia Capital
We frequently see engineers spinning up multiple minions in parallel, to enable them to parallelize the completion of many different tasks. A typical minion run starts in a Slack message and ends in a pull request which passes CI and is ready for human review, with no interaction in between.

Stripe Engineering[8]
On Minions, their internal coding agents
There's no limit to how many sessions you can have running concurrently, and your laptop doesn't need to be involved at all. When background agents are fast, they're strictly better than local: same intelligence, more power, and unlimited concurrency.

Ramp Engineering[7]
On Inspect, their internal background agent
The top 5% to 10% of builders are now 3 to 5 times more productive than they were a year ago. The median builder is up maybe 10% to 20%.
Three things converged inside a single six-month window.
Model reliability crossed the bar. The inflection point was December 2025: Claude Opus 4.7 (1M context) shipped, GPT-5/Codex matured, and Anthropic and OpenAI's coding harnesses leveled up around them — Skills, subagents, slash commands, hooks, persistent memory. The experimental pattern of one human running many agents in parallel tipped into a production-grade workflow within a quarter.
Model-provider pricing punished the API-wrappers. In a six-week stretch this spring, Anthropic cut third-party harnesses off from Claude subscriptions, then reinstated access via a capped Agent SDK credit pool billed at full API rates — a step-change cost increase for any tool that wraps the API.[17] Superset's CLI-agnostic architecture sidesteps the squeeze: agents run as subprocesses against the user's existing Claude Code, Codex, or Antigravity subscription. The same structural insulation would absorb the next pricing shift, whichever provider triggers it.
The flagship orgs went public. Ramp,[7] Stripe,[8] and Spotify[9] have all disclosed in-house builds of the software-factory model in the last six months — making the architecture legible to enterprise buyers and putting Superset in position to be the off-the-shelf fleet console.
The software development life cycle must change. To get the most out of coding agents, they need to shift not only to the background, but also to the cloud, on schedules and on event triggers, to automate the new bottlenecks springing up in the lifecycle.
How It Works
From local worktrees to the cloud command center.
v1 — the IDE for parallel agents running locally.
The MVP that came out of the YC hackathon: a desktop IDE built around cmd+N spawning isolated git worktrees, with any CLI coding agent (Claude Code, Codex CLI, OpenCode, Antigravity CLI) running inside each one. Notifications when each agent finishes. GitHub-style diff review across sessions. One-click open in VS Code, Cursor, JetBrains, or Xcode. Linear integration to fire agents at backlog tickets directly.
v1 found product-market fit fast — 7k+ DAUs growing 50%+ w/w, 10.8k GitHub stars, front-page HN in five months.[14] But worktrees don't scale: shared CPU, RAM, and disk space across many parallel agents, overlapping builds, messy envs and secrets, all bound to one machine. The worktree model worked beautifully — until it ran into the physical limits of the laptop it was running on.
v2 — the platform layer that unbinds Superset from the laptop.
The pivot: move the workloads to the cloud. v2 shipped in early May 2026, layering the missing infrastructure underneath the IDE — and notably, the team kept the v1 IDE shipping, the Discord swarming, the DAUs compounding 50%+ w/w, and the enterprise pilots closing the entire time they were building it.
Remote Workspaces. Ship a session to the cloud; continue working from anywhere; share with teammates. Solves the laptop-bound limitation that internal background agents at Ramp and Stripe both name as their core unlock.[7][8] Start on your laptop, continue on your phone, scale to as many agents as you need.
Automations. Schedule agents to pick up tasks for you later — or fire them on events: a new issue opened in Linear or GitHub, a Sentry error detected, a failed CI run, an inbound Slack message.
Superset CLI. Gives agents new superpowers and unlocks programmatic workflows — turning Superset itself into a primitive other agents can call.
MCP refresh. Works with v2 + an expanded toolset. Integrates across the apps where product and engineering work happens — Slack, Linear, GitHub, and more. Message @Superset in Slack to kick off a new task without ever opening the IDE.
Mobile · coming soon. Talk to your codebase from anywhere. Kick off a task on your phone walking to a meeting, review the diff on the train, ship from the couch. Same Remote Workspace, different surface.
Open-source, with the GitHub trajectory of a category leader
TractionFive months from launch to 7k+ daily active users growing 50%+ week-over-week, 10.8k GitHub stars, and front-page HN on launch day.[14] Paid enterprise pilots already in motion with Ramp and Thrive Holdings.
The Long Arc
The IDE is the wedge. The Software Factory is the prize.
A Software Factory is an org-scale operating model where parallel coding agents take work from anywhere it originates, run in isolated environments, and return finished work for human review. Engineers don't hand-write or hand-review every diff; they architect the system, calibrate the loops, and decide which edge cases get attention. Today's tokenmaxxer is already running a small version of this on their laptop. Tomorrow's team runs a big one. Superset's IDE is the wedge into both.
Every major eng org is converging on the same architecture.
The loop. Take a task. Spawn a sandboxed agent with the right tools and codebase context. Write code. Run CI. Open a PR. Review and iterate. Repeat in parallel — dozens, hundreds, thousands of times. The human's job has narrowed to architecting the loop and reviewing what comes out the other end, not typing the diff. Ramp's Inspect, Stripe's Minions, and Spotify's Honk are three implementations of the same loop.[7][8][9]
The pillars. Three primitives make the loop work: isolated compute (each agent's blast radius is sandboxed), event routing (work flows in and out automatically), and governance (humans review, approve, and shape what merges).[3] Superset's v2 maps cleanly to all three. Together they're what moves humans from on the line to on the loop.
The economic argument. Every team that has done this in-house has spent 6+ months and senior engineers to build it. Most companies will not. Superset's bet: nobody actually wants to build their own factory machinery — they just want the IDE that ships with it, so their engineers can step off the line on day one.
There's no limit to how many sessions you can have running concurrently, and your laptop doesn't need to be involved at all. When background agents are fast, they're strictly better than local: same intelligence, more power, and unlimited concurrency.
Market
Three concentric markets. Superset has commercial validation in all three already.
Near term — tokenmaxxers. Individual engineers paying $200+/month for Claude Code or Codex who would happily run more agents if the friction were lower. 7k+ DAUs growing 50%+ week-over-week, 10.8k stars, and front-page HN inside five months — the natural-demand signal.[6][14]
Mid term — agentic coding teams. Small-to-mid engineering orgs (5–200 engineers) standardizing on a parallel-agent workflow as a team. Multi-seat enterprise deals are already in motion with Ramp and Thrive Holdings. The revenue stack is per-seat IDE plus the org-level platform features — Remote Workspaces, Automations, shared memory, admin controls, SSO/RBAC, audit logs — that get pulled in once an IC's workflow becomes a team workflow.
Long term — Software Factories. Dozens to hundreds of agents triggered upstream from Linear, GitHub, Slack, and on-call — running in sandboxed cloud workspaces and shipping PRs continuously. Ramp Inspect, Stripe Minions, and Spotify Honk are early in-house implementations.[7][8][9] Every team that wants this without spending six months building it is a future Superset customer.
A typical minion run starts in a Slack message and ends in a pull request which passes CI and is ready for human review, with no interaction in between. Over 1,000 PRs per week are completely minion-produced.
Competitive landscape
Three strategies inside the agentic coding era.
As coding agents become the workers, three distinct strategies have emerged for how engineering teams operate them. Superset is the only one in the third bucket with category-leading OSS traction.
Superset's positioning
Superset is building the horizontal command center for multi-agent software development. As Claude Code, Codex, Cursor, OpenCode, and Antigravity become the workers, Superset is the one place to dispatch, isolate, observe, review, and merge their work.
Founder deep dive
The founding moment — and why it had to be these three.
Founders
Risks & mitigations
What we're watching
References
- [1]Superset — YC Profile
- [2]Superset — Company Website
- [3]background-agents.com — The Self-Driving Codebase
- [4]OpenAI — Harness Engineering: Leveraging Codex in an Agent-First World
- [5]Simon Willison — Software Factory
- [6]Superset on GitHub
- [7]Ramp Builders — Why We Built Our Background Agent (Inspect)
- [8]Stripe Engineering — Minions: Stripe's One-Shot End-to-End Coding Agents
- [9]Spotify Engineering — Honk: Spotify's Background Coding Agent (Part 1)
- [10]Cursor — Towards Self-Driving Codebases
- [11]Superset 2.0 launch on Product Hunt
- [12]Superset launch on Bookface (YC internal, P26)
- [13]Alfred Lin (Sequoia) — Developer to Fleet Commander
- [14]Show HN — Superset: terminal to run 10 parallel coding agents
- [15]How Superset built the IDE for AI agents on Vercel
- [16]Andrej Karpathy — "agent command center" IDE for teams of agents (X)
- [17]VentureBeat — Anthropic reinstates OpenClaw and third-party agent usage on Claude subscriptions — with a catch (May 13, 2026)
- [18]Google Developers Blog — Transitioning Gemini CLI to Antigravity CLI (May 19, 2026)



