articles

The Skill Issue

Name: The Skill Issue
Author: Andrej Karpathy

Andrej Karpathy

30 highlights

agentic-product-philosophy agentic-philosophy-traces 2026-roadmap-reflection bigideas-concepts agentic-coding agentic-concepts agentic-design-patterns agentic.usecase.deepresearch

Highlights & Annotations

The core insight, threaded through everything he describes, is this: the constraint has inverted. For two decades, the bottleneck in Karpathy’s work was compute and model capability—what can the machines do? As of late 2024, the bottleneck is him. His ability to specify tasks. His ability to review output. His ability to maintain taste across work he never touched. His ability to parallelize his own attention across multiple agent sessions. Everything that doesn’t work feels like a skill issue—not a capability gap in the models, but an inadequacy in the human directing them.

Ref. C5FB-A

The Bottleneck Inversion

Ref. 7488-B

The Macro Action Paradigm

Ref. BEEE-C

Auto Research and Removing Yourself from the Loop

Ref. 17A7-D

“I shouldn’t be a bottleneck. I shouldn’t be running these hyperparameter search optimizations. I shouldn’t be looking at the results. There’s objective criteria in this case. You just have to arrange it so that it can just go forever.”

Ref. 930B-E

The Program.md as Organizational DNA

Ref. 11F5-F

THE SKILL ISSUE Karpathy • No Priors

Ref. C6A8-G

The Claw as Computing Primitive

Ref. E30C-H

This isn’t anthropomorphic sentimentality. Karpathy explicitly notes that when Claude gives him praise, he feels he “slightly deserves it” because the system seems to respond more strongly to well-formed ideas than to half-baked ones. He’s trying to earn the agent’s praise, which means the personality calibration is functioning as a feedback mechanism that improves the quality of his own input. An agent that celebrates everything equally provides no signal. An agent with calibrated reactions becomes a collaborator that shapes your thinking.

Ref. F4C5-I

The Jaggedness Problem

Ref. DCD8-J

The Speciation Hypothesis

Ref. D7BF-K

The immediate consequence is what he calls a digital overhang: a massive backlog of digital information processing work that humans haven’t had enough thinking cycles to complete. Every paper not fully analyzed. Every codebase not fully optimized. Every dataset not fully explored. AI agents will “unhobble” this overhang first because the work is already digital—no sensors or actuators required, just intelligence applied to existing data.

Ref. 9545-L

second phase involves the interface between digital and physical—sensors that capture real-world data and actuators that affect the physical world. This is where companies like Periodic (materials science auto research) and biology startups operate. The intelligence lives in the digital realm but reaches into the physical world through lab equipment, cameras, and experimental apparatus. The sensors are expensive; the intelligence operating on their data is cheap.

Ref. E03A-M

Karpathy’s framework: digital overhang first (pure information processing), then digital-physical interfaces (sensors and actuators), then full physical robotics. Each phase is larger in potential market size but exponentially harder to execute. The current gold rush is in the first phase; the opportunities most practitioners will encounter are in the first two.

Ref. B336-N

Information Markets and the Agentic Web

Ref. F570-O

The implication reshapes how he thinks about teaching. The educational pipeline is no longer expert → learner. It is expert → agent → learner. The expert’s job is to produce artifacts that agents can understand: clean code, well-structured documentation, perhaps a “skill” file that scripts the curriculum—“first start with this, then with that.” The agent handles the actual explanation, adapting to the learner’s level, pace, and questions.

Ref. 8655-P

Verifiable vs. Unverifiable Intelligence

Ref. 1734-Q

Speed vs. Judgment Drift

Ref. D24D-R

The Token Throughput Metric

Ref. 5806-S

Domains where success can be objectively measured are amenable to auto research and rapid improvement. Domains where success requires subjective judgment are not. The boundary between these domains is sharp, not gradual, and it determines where autonomous AI systems will produce extraordinary results versus where they will plateau. Practitioners should classify their work along this dimension before deciding how much to invest in automation.

Ref. 090E-T

The Crystallization Function

Ref. FD94-U

The Agent-First Inversion

Ref. 672F-V

Auto Research Contest Platform

Ref. FEDB-W

Agent Throughput Dashboard

Ref. 610B-X

Agent-First API Documentation Generator

Ref. FD3A-Y

Skill Curriculum Authoring Tool

Ref. 9EBC-Z

For engineers currently writing code by hand: The transition Karpathy describes didn’t happen gradually. It happened in December, in a matter of weeks. The capability is already there. If you haven’t experienced the shift to working in macro actions—dispatching features rather than writing functions—the gap between your workflow and the frontier is growing daily. The time to develop agent orchestration muscle memory is now, while the skill curve is still navigable. In six months, the bar will be much higher.

Ref. 4598-A

For researchers: Auto research is not a thought experiment. It found improvements in a repository that one of the world’s most experienced ML practitioners considered well-optimized. If your research domain has objective metrics, the question is not whether to build an auto research loop but how quickly you can construct one. The longer you remain the bottleneck in your own research pipeline, the more ground you lose to practitioners who have removed themselves.

Ref. 9FC9-B

“The name of the game now is to increase your leverage. I put in just very few tokens, just once in a while, and a huge amount of stuff happens on my behalf.

Ref. 9719-C

The deepest lesson from this conversation is not about any specific technology or technique. It is about the nature of the transition itself. Karpathy has spent two decades building intuition about training neural networks. That intuition is now being outperformed by an overnight auto research run. The researchers at frontier labs are, by his account, “glorified auto—” he catches himself, but the thought is clear: they are automating themselves away, actively and deliberately. The most valuable skill is no longer the ability to do the work. It is the ability to arrange the work so that it does itself.

Ref. A1EE-D