articles

The Computer Is the Harness

Name: The Computer Is the Harness
Author: Felix Rieseberg, Alessio Fanelli, Swyx

Felix Rieseberg, Alessio Fanelli, Swyx

30 highlights

agentic-philosophy-traces agentic-product-philosophy agentic-coding software-design agentic-concepts agentic-insight agentic-design-patterns 2026-roadmap-reflection

Highlights & Annotations

The most important thing about Cloud Cowork is not the product itself. It is the thesis embedded in the product: an AI model needs a computer to be useful, the same way a human worker needs a computer to be useful. Strip away the chat interface debates, the agent framework wars, the MCP connector fatigue—and this is the irreducible insight that Anthropic’s desktop team has landed on after over a year of prototyping.

Ref. 6B83-A

Felix Rieseberg, the engineer behind Cloud Cowork (and, in a previous life, Electron and the Slack desktop app), frames this with a deceptively simple analogy: imagine telling a developer they don’t get a computer. Instead, their employer will email them code, and they’ll email code back. That arrangement might technically work. But it would be absurd. It would cripple productivity in ways that are obvious to anyone who has ever actually built software.

Ref. ADE4-B

This insight has implications far beyond desktop apps. It reframes the entire agent infrastructure conversation. The question is not “how do we build better agents?” but “how do we give agents the same tools that make humans effective?” The answer turns out to be surprisingly literal: give them a computer. A real one. With an operating system they can install things on, files they can read and write, and a network they can reach through. The VM—a lightweight Linux virtual machine running on the user’s local hardware—is the harness that makes everything else work.

Ref. C093-C

Rieseberg describes something that has shifted at Anthropic—and that he believes is shifting in the industry more broadly—in how products get built. The old model was sequential: a product manager would interview customers, draft a spec, design the solution, then execute. The new model is parallel: build all the candidates very quickly, test them with a small focus group, then ship the one that works best.

Ref. 83C7-D

Cloud Cowork itself was built this way. The ten-day development timeline is real but misleading if taken at face value. Anthropic had been prototyping agentic desktop experiences for over a year. Multiple internal teams had built different approaches. What the Cowork team did was select the best components from the existing prototypes—the VM isolation from one effort, the Chrome integration from another, the skills system that emerged from a data analysis prototype—and assembled them into a coherent product. The ten days were assembly; the year preceding was R&D.;

Ref. 43F3-E

The implication for the broader industry is significant. If you are still running a spec-first product process for AI features, you are making your most important decisions with the least information. Execution cost has dropped enough that the economically rational thing to do is build three versions and test all of them, rather than spec one version and hope you chose correctly.

Ref. 3A18-F

“Instead of telling the thing, here’s a CLI, please call this CLI, or here’s an MCP, please call this interface shape—this is the endpoint. If you want to know something, post here, maybe POST SQL, it’s going to be okay.”

Ref. 0242-G

Claude is much better at writing AppleScript and executing its own AppleScript than relying on third-party MCP tools. The general model, given a capable environment, often outperforms specialized integrations—because it can adapt to the specific context rather than conforming to a pre-defined interface.

Ref. 87EC-H

Rieseberg introduces a tension that anyone building on top of language models needs to grapple with honestly: how much should you invest in scaffolding when the next model might make it unnecessary? This is the scaffolding half-life problem—the recognition that corrections you build around a model’s weaknesses today may become dead weight when those weaknesses are addressed in the next model generation.

Ref. FF77-I

If you find yourself doing the same task manually before handing it to an agent, you are one level of delegation too low. The sign of a mature agent workflow is when you delegate not just the execution but the triage and planning that precede it. The agent should decide what to work on, not just do the work.

Ref. E90A-J

The host offers a provocative counter-proposal: simulated careers. If the value of junior years is the handful of deep learning moments scattered across months of routine work, AI could compress those moments into dense training simulations. Instead of spending three months on a distributed systems feature at a company, a new engineer could speed-run the experience in a simulated environment in one week—encountering the same failure modes, making the same architectural decisions, but without the downtime between learning moments.

Ref. 3B05-K

When building AI products, don’t rebuild primitives—assemble them. Cloud Cowork was built in ten days because the primitives already existed: Cloud Code for the agent engine, the VM framework for isolation, Chrome integration for browser awareness, skills for extensibility. The innovation was in the assembly, not in any individual component. Organizations should invest in creating and maintaining high-quality primitives, then let teams rapidly assemble them into products.

Ref. 883F-L

When Claude can see what it’s working on—through a browser, through screenshots, through screen capture—it becomes dramatically more effective. This isn’t surprising (developers also debug better when they can see the output), but it’s often overlooked in agent architecture. If your agent is producing visual outputs but operating blind, you are leaving significant capability on the table. Give the agent eyes.

Ref. 138E-M

Products that are hyper-specialized to a narrow use case are vulnerable to being absorbed by general-purpose platforms as models improve. Rieseberg is explicit: there is no Cloud Code for Node.js applications that use React and this specific stack and only that. The general tool that works across many contexts is more defensible than the specialized tool that excels at one context—because the specialization advantage erodes with each model improvement, while the general platform’s network effects grow.

Ref. C7DC-N

General vs. Specialized. Rieseberg argues for general-purpose tools. But the podcast itself demonstrates specialized value: the host’s YouTube upload skill is hyper-specific to his workflow. The resolution may be that the platform should be general, but the skills should be specific—a general-purpose engine running personalized instructions.

Ref. 4B3F-O

Capability vs. Safety. The VM provides sandboxed capability. But users who want agents to interact with their real computer—moving files to real directories, controlling real applications—will bump against the sandbox boundary. Every expansion of the sandbox increases both utility and risk. There is no stable equilibrium—only ongoing calibration.

Ref. 24A5-P

Model Overhang vs. Scaffolding Investment. If models are already more capable than users leverage them for, the bottleneck is environment, not intelligence. But if you invest only in environment and the next model needs different environmental support, you’ve built the wrong thing. The Anthropic team’s advantage is visibility into what’s coming; everyone else is making bets in the dark.

Ref. 64FD-Q

Personal vs. Portable Skills. The power of skills comes from personalization (my preferences, my workflows). The utility of skills comes from portability (share with colleagues, use across tools). These are in direct tension. The right abstraction for separating personal from universal within a skill has not been found yet.

Ref. F852-R

Anthropomorphize Aggressively

Ref. C1F8-S

When you are stuck on an agent design decision, ask: “If this were a human coworker, what would they need?” Then provide exactly that. The gap between what feels natural for a human and what we provide to AI systems is where most usability failures live.

Ref. 5F6C-T

The Value of the Local Machine

Ref. AE75-U

The Swiss Cheese Model

Ref. 40EF-W

The power of skills as a primitive comes from three properties working in concert. First, zero structure requirements —you don’t need a tutorial, a schema, or a framework to write a skill. Just text. Second, extreme portability —a skill is a file, and files can be copied, symlinked, versioned, and shared with trivial tooling. Third, composability —skills can reference other skills, can be split into sub-skills, and can be orchestrated by a parent skill. The host in this conversation demonstrates this directly: he wrote a skill for uploading videos to YouTube, then told Claude to split it into three sub-skills with a parent orchestrator, creating a modular automation pipeline from a single natural-language instruction.

Ref. AAB6-X

The Portability Problem

Ref. 0258-Y

What Does “Your Computer” Mean?

Ref. 6F1E-A

The Multiplayer Question

Ref. 9300-B

The Capability-Over-Correction Principle

Ref. 296E-C

The Delegation Level Check

Ref. A3F9-D