Superset — Investment Memo · Orange Collective

Thesis

Superset is the agentic IDE for the engineers already living in the future: tokenmaxxers running 10+ coding agents in parallel.^[1]^[2] Today, it removes the pain of managing worktrees, terminal tabs, and tmux sessions. But the bigger vision is the control plane for software factories.^[5]

As coding agents become developers, engineers become fleet commanders^[13]: spinning up agents to refactor code, write tests, build UI, fix bugs, and open PRs in parallel. Over time, the job shifts from writing every line of code to designing, supervising, and improving the systems that write code.^[5]

Superset is building the workspace for that future: a place to dispatch, isolate, monitor, review, and merge work from hundreds of agents.

If coding agents are the workers, Superset is the factory floor.

01
The bottleneck moved from writing code to orchestrating agents. Ramp's internal agent ships 30% of merged PRs.^[7] Stripe's Minions produce 1,000+ PRs/week.^[8] Spotify's Honk writes ~50% of their PRs.^[9] Every major eng org is reinventing this layer. Superset is the off-the-shelf version.
02
Open-source + CLI-agnostic is strategic. Open-source is the wedge: ICs adopt the IDE in an hour, the community ships extensions, and the architecture is legible to enterprise security teams from day one. CLI-agnostic builds defensibility: developers run the native harness built into each CLI (Claude Code, Codex, Antigravity) and pay through their existing subscriptions — no full API rates, no capped Agent SDK credits, significant token savings vs API-wrapping competitors.^[17]
03
Superset is building the horizontal command center. The market splits three ways: vertical agent platforms that own one model + one IDE, autonomous labor platforms that abstract the developer away, and horizontal orchestration layers that assume developers will use many agents and need a neutral cockpit. Superset leads the third strategy with an open-source, CLI-agent-agnostic approach.
04
Three repeat founders, all ex-CTOs at YC companies. Kiet (Onlook W25), Satya (Untether Labs W23), Avi (Adam W25). They shipped the MVP at a YC hackathon, won, and grew to 7k+ DAUs growing 50%+ week-over-week, 10.8k GitHub stars, and paid enterprise pilots (Ramp, Thrive Holdings) inside six months.^[6]

Problem

The first wave of AI IDEs was built for one engineer, one agent.

Cursor, GitHub Copilot, Windsurf, and the rest were all designed around a single human paired with a single agent — one chat window, one editor, one diff at a time. That shape was necessary at the time: the underlying coding models still needed close supervision, manual edits, and frequent human overwrites for anything non-trivial. Which is exactly why these products were all built as forks of VS Code with the text editor as the primary surface and the agent as an assistant in the sidebar.

Then the models got good. Claude Sonnet 4.6 / Opus 4.7 (1M context), GPT-5 / Codex, and Antigravity can now write whole features, review their own PRs, run their own tests, and ship without a hand-holder. The 5–10× productivity gain everyone talks about only shows up when you run several of them at once — and the new top 5–10% of builders, the tokenmaxxers, have already shifted: instead of waiting idly while one agent works, they spin up many in parallel and review the outputs as they finish.

But running several at once is where the friction lives. You're juggling terminal tabs, learning tmux, setting up git worktrees by hand, switching branches, re-orienting yourself every time you come back to the agent you started twenty minutes ago, then reviewing diffs across scattered sessions. The VS-Code-fork architecture wasn't designed for this — it was designed for one human typing.

Most engineers respond by either (a) running one agent at a time and idling while it works, or (b) building bespoke internal infrastructure. The Fortune 500 dev orgs took option (b): Ramp built Inspect,^[7] Stripe built Minions,^[8] Spotify built Honk.^[9] OpenAI coined the term harness engineering to describe the practice — and used it internally to ship a beta product of roughly 1M lines of code with zero manually-written source code.^[4] Each of them is now publishing engineering posts about how essential this layer is — and how expensive it was to build.

5–10×

Code output gain

With parallel coding agents

6+ mo

In-house build time

Per Fortune 500 engineering case study

1

Click to a worktree

cmd+N in Superset

Why Now

Tokenmaxxer to Software Factory.

Andrej Karpathy has publicly described the exact product Superset is building — a proper "agent command center" IDE for teams of coding agents.^[16] Alfred Lin (Sequoia) calls these builders fleet commanders and reports they're already 3–5× more productive than the median.^[13] In the same six-month window, Ramp, Stripe, Spotify, and OpenAI all publicly described internal background-agent infrastructure.

The arc

Tomorrow · the team

The software factory

The org-scale version: dozens to hundreds of agents triggered upstream from Linear, GitHub, and Slack — running in sandboxed cloud workspaces, on schedules, and on event triggers. Humans move off the factory line: they architect the system and review the edge cases, but don't write or review any diff.^[5] Ramp Inspect, Stripe Minions, and Spotify Honk are early implementations.^[7]^[8]^[9]

The arc is the thesis. The same primitives — parallel agents, isolated worktrees, diff review, event-triggered orchestration — show up in both the tokenmaxxer's IDE today and the team's factory tomorrow. Superset's wedge is the open-source IDE an IC adopts inside an hour; the durable economic unit is the team-platform layer (Remote Workspaces, Automations, Shared Memory) that the same engineer pulls in once their team starts running parallel agents at scale. Every tokenmaxxer is a future buyer of the factory; every factory was started by a tokenmaxxer.

tmux grids are awesome, but i feel a need to have a proper "agent command center" IDE for teams of them, which I could maximize per monitor. E.g. I want to see/hide toggle them, see if any are idle, pop open related tools (e.g. terminal), stats (usage), etc.

Andrej Karpathy^[16]

Founder · Eureka Labs · ex-OpenAI · ex-Tesla AI

The best builders today do not try to write every line of code. They use their taste and judgment to determine what to build and plan how to build it. Then they spin up parallel agents: one refactoring, one writing tests, one working on the UI, all operating with clear intent.

Alfred Lin^[13]

Partner · Sequoia Capital

We frequently see engineers spinning up multiple minions in parallel, to enable them to parallelize the completion of many different tasks. A typical minion run starts in a Slack message and ends in a pull request which passes CI and is ready for human review, with no interaction in between.

Stripe Engineering^[8]

On Minions, their internal coding agents

There's no limit to how many sessions you can have running concurrently, and your laptop doesn't need to be involved at all. When background agents are fast, they're strictly better than local: same intelligence, more power, and unlimited concurrency.

Ramp Engineering^[7]

On Inspect, their internal background agent

The top 5% to 10% of builders are now 3 to 5 times more productive than they were a year ago. The median builder is up maybe 10% to 20%.

— Alfred Lin, Partner at Sequoia Capital — "Developer to Fleet Commander"^[13]

Three things converged inside a single six-month window.

Model reliability crossed the bar. The inflection point was December 2025: Claude Opus 4.7 (1M context) shipped, GPT-5/Codex matured, and Anthropic and OpenAI's coding harnesses leveled up around them — Skills, subagents, slash commands, hooks, persistent memory. The experimental pattern of one human running many agents in parallel tipped into a production-grade workflow within a quarter.

Model-provider pricing punished the API-wrappers. In a six-week stretch this spring, Anthropic cut third-party harnesses off from Claude subscriptions, then reinstated access via a capped Agent SDK credit pool billed at full API rates — a step-change cost increase for any tool that wraps the API.^[17] Superset's CLI-agnostic architecture sidesteps the squeeze: agents run as subprocesses against the user's existing Claude Code, Codex, or Antigravity subscription. The same structural insulation would absorb the next pricing shift, whichever provider triggers it.

The flagship orgs went public. Ramp,^[7] Stripe,^[8] and Spotify^[9] have all disclosed in-house builds of the software-factory model in the last six months — making the architecture legible to enterprise buyers and putting Superset in position to be the off-the-shelf fleet console.

How It Works

From local worktrees to the cloud command center.

v1 — the IDE for parallel agents running locally.

The MVP that came out of the YC hackathon: a desktop IDE built around cmd+N spawning isolated git worktrees, with any CLI coding agent (Claude Code, Codex CLI, OpenCode, Antigravity CLI) running inside each one. Notifications when each agent finishes. GitHub-style diff review across sessions. One-click open in VS Code, Cursor, JetBrains, or Xcode. Linear integration to fire agents at backlog tickets directly.

v1 found product-market fit fast — 7k+ DAUs growing 50%+ w/w, 10.8k GitHub stars, front-page HN in five months.^[14] But worktrees don't scale: shared CPU, RAM, and disk space across many parallel agents, overlapping builds, messy envs and secrets, all bound to one machine. The worktree model worked beautifully — until it ran into the physical limits of the laptop it was running on.

Superset v1 Demo^[2]

v2 — the platform layer that unbinds Superset from the laptop.

The pivot: move the workloads to the cloud. v2 shipped in early May 2026, layering the missing infrastructure underneath the IDE — and notably, the team kept the v1 IDE shipping, the Discord swarming, the DAUs compounding 50%+ w/w, and the enterprise pilots closing the entire time they were building it.

Remote Workspaces. Ship a session to the cloud; continue working from anywhere; share with teammates. Solves the laptop-bound limitation that internal background agents at Ramp and Stripe both name as their core unlock.^[7]^[8] Start on your laptop, continue on your phone, scale to as many agents as you need.

Automations. Schedule agents to pick up tasks for you later — or fire them on events: a new issue opened in Linear or GitHub, a Sentry error detected, a failed CI run, an inbound Slack message.

Superset CLI. Gives agents new superpowers and unlocks programmatic workflows — turning Superset itself into a primitive other agents can call.

MCP refresh. Works with v2 + an expanded toolset. Integrates across the apps where product and engineering work happens — Slack, Linear, GitHub, and more. Message @Superset in Slack to kick off a new task without ever opening the IDE.

Mobile · coming soon. Talk to your codebase from anywhere. Kick off a task on your phone walking to a meeting, review the diff on the train, ship from the couch. Same Remote Workspace, different surface.

Open-source, with the GitHub trajectory of a category leader

Traction

7k+ DAUs+50%/wk growth10.8k starsFront-page HNShow HN #1Used at Amazon · Cloudflare · Vercel · DoorDash · Atlassian · Intercom · OracleY Combinator

Five months from launch to 7k+ daily active users growing 50%+ week-over-week, 10.8k GitHub stars, and front-page HN on launch day.^[14] Paid enterprise pilots already in motion with Ramp and Thrive Holdings.

The Long Arc

The IDE is the wedge. The Software Factory is the prize.

A Software Factory is an org-scale operating model where parallel coding agents take work from anywhere it originates, run in isolated environments, and return finished work for human review. Engineers don't hand-write or hand-review every diff; they architect the system, calibrate the loops, and decide which edge cases get attention. Today's tokenmaxxer is already running a small version of this on their laptop. Tomorrow's team runs a big one. Superset's IDE is the wedge into both.

Every major eng org is converging on the same architecture.

The loop. Take a task. Spawn a sandboxed agent with the right tools and codebase context. Write code. Run CI. Open a PR. Review and iterate. Repeat in parallel — dozens, hundreds, thousands of times. The human's job has narrowed to architecting the loop and reviewing what comes out the other end, not typing the diff. Ramp's Inspect, Stripe's Minions, and Spotify's Honk are three implementations of the same loop.^[7]^[8]^[9]

The pillars. Three primitives make the loop work: isolated compute (each agent's blast radius is sandboxed), event routing (work flows in and out automatically), and governance (humans review, approve, and shape what merges).^[3] Superset's v2 maps cleanly to all three. Together they're what moves humans from on the line to on the loop.

The economic argument. Every team that has done this in-house has spent 6+ months and senior engineers to build it. Most companies will not. Superset's bet: nobody actually wants to build their own factory machinery — they just want the IDE that ships with it, so their engineers can step off the line on day one.

Market

Three concentric markets. Superset has commercial validation in all three already.

Near term — tokenmaxxers. Individual engineers paying $200+/month for Claude Code or Codex who would happily run more agents if the friction were lower. 7k+ DAUs growing 50%+ week-over-week, 10.8k stars, and front-page HN inside five months — the natural-demand signal.^[6]^[14]

Mid term — agentic coding teams. Small-to-mid engineering orgs (5–200 engineers) standardizing on a parallel-agent workflow as a team. Multi-seat enterprise deals are already in motion with Ramp and Thrive Holdings. The revenue stack is per-seat IDE plus the org-level platform features — Remote Workspaces, Automations, shared memory, admin controls, SSO/RBAC, audit logs — that get pulled in once an IC's workflow becomes a team workflow.

Long term — Software Factories. Dozens to hundreds of agents triggered upstream from Linear, GitHub, Slack, and on-call — running in sandboxed cloud workspaces and shipping PRs continuously. Ramp Inspect, Stripe Minions, and Spotify Honk are early in-house implementations.^[7]^[8]^[9] Every team that wants this without spending six months building it is a future Superset customer.

Competitive landscape

Three strategies inside the agentic coding era.

As coding agents become the workers, three distinct strategies have emerged for how engineering teams operate them. Superset is the only one in the third bucket with category-leading OSS traction.

2. Horizontal orchestration layers

Conductor · cmux · Warp

Assume developers will use many agents and need a neutral cockpit. Strength: agent-agnostic, workflow-centric, best-of-breed model selection. Weakness: must stay ahead of first-party apps that keep adding orchestration features. Conductor and similar API/Agent-SDK-wrapping orchestrators were caught by Anthropic's April→May 2026 policy (~12–175× cost increase). Superset is leading the category with an open-source, CLI-agent-agnostic approach — running Claude Code as a subprocess against the user's interactive subscription, outside both the original prohibition and the new credit pool. Ramp Inspect, Stripe Minions, and Spotify Honk are in-house implementations of the same category — validating both the demand and the architecture.^[17]

Superset's positioning

Superset is building the horizontal command center for multi-agent software development. As Claude Code, Codex, Cursor, OpenCode, and Antigravity become the workers, Superset is the one place to dispatch, isolate, observe, review, and merge their work.

Founder deep dive

The founding moment — and why it had to be these three.

The hackathon win. In November 2025, the three of them — all former CTOs at YC-backed companies (Onlook W25, Untether Labs W23, Adam W25) — were trying to figure out what to build next. They were all heavy Claude Code users running multiple agents in parallel, and dealing with so much friction with terminal tabs, worktrees, and tmux that the agents were faster than their workflow.^[12]

Kiet's idea: a terminal optimized for parallel coding agents. They built the MVP at a YC hackathon over a weekend and won. That was the signal. Five months later: 7k+ DAUs growing 50%+ week-over-week, 10.8k GitHub stars, paid enterprise pilots (Ramp, Thrive Holdings), and engineering users at Vercel, DoorDash, and Atlassian.^[6]

Why these three. Kiet is the open-source maintainer, originator, and product voice — the one who lived the parallel-agent pain at Onlook. Satya owns the systems work (worktree isolation, persistent daemons, the parts that have to be rock-solid). Avi is a deep-technical writer-builder who ships product and engineering content that's been picked up across the AI-tooling community. All three are repeat technical founders. None of them is figuring out how to be a CTO for the first time.

They run the YC playbook by the letter. Talk to your users: they're in their own Discord, X replies, and HN threads every day — every release announcement is a thread the founders are answering in. Launch early and often: they build in public — changelogs, weekly demos, design docs, even the bugs. Do things that don't scale: they rotate weekly GTM duties between the three of them — sales calls, content, onboarding — so all three founders keep direct contact with the customer. Move fast: from YC-hackathon MVP to 7k+ DAUs growing 50%+ w/w, 10.8k stars, and paid enterprise pilots in five months. The founder loop is tight, public, and visibly compounding.

On the long-term vision. Today's Superset is the IDE for tokenmaxxers. Tomorrow's Superset is the platform for software factories: pull work from Linear, GitHub issues, Slack, and on-call queues; spawn hundreds of parallel agents in sandboxed Remote Workspaces; review the diffs in a unified UI; ship.

On open-source as a wedge. 10.8k stars in five months at zero marketing spend. The engineering-content community already cites Avi's worktree posts as the canonical write-ups on parallel-agent infrastructure. Open-source is doing what enterprise marketing can't — proving the architecture works before the buyer ever talks to sales.

Founders

Kiet Ho

Repeat Founder

Co-founder & CEO

Repeat founder. Previously co-founder/CTO at Onlook (YC W25). Earlier at Amazon and ServiceNow. Open-source maintainer; conceived Superset at a YC hackathon after running into the friction of managing many Claude Code agents at once.

Satya Patel

Repeat Founder

Co-founder

Former CTO at Untether Labs (YC W23). Prior engineering at Scribe, Google, Amazon, and Facebook. Systems-oriented; authored Superset's worktree isolation and persistent daemon architecture.

Avi Peltz

Repeat Founder

Co-founder

Repeat founder. Previously co-founder/CTO at Adam (YC W25) and co-founder of BioGlyph. ML + computer-vision background. Writes deeply technical engineering content on parallel agents, git worktrees, and terminal daemons.

Risks & mitigations

Risk

Best engineering teams build their own background-agent infrastructure (Ramp, Stripe, Spotify, OpenAI) and never adopt a third-party platform.

Mitigation

Each of those teams spent 6+ months and senior engineers to build it. Most companies don't have that headcount or that conviction. Superset wins by selling the platform that the next thousand engineering orgs would otherwise have to build themselves — and by having an OSS-first wedge into the team via the personal IDE before the platform conversation begins.

What we're watching

Conversion of OSS users → paid IDE seats and → enterprise platform contracts (Remote Workspaces + Automations).
Cloud sandboxes / Remote Workspaces GA — does it match Modal/Daytona's speed and cost economics?
Chat 2.0, mobile, and Shared Memory ship — and whether they shift Superset from "IDE" to "platform" framing in user perception.
Reaction from Anthropic, OpenAI, and Cursor to the orchestration-layer thesis — partnership, neutral coexistence, or competitive entry.

References