Skip to content

Code Is Cheap. Judgment Is Not.

ctx

Why AI Replaces Effort, Not Expertise

Jose Alekhinne / February 17, 2026

Are You Worried About AI Taking Your Job?

You might be confusing the thing that's cheap with the thing that's valuable.

I keep seeing the same conversation: Engineers, designers, writers: all asking the same question with the same dread:

"What happens when AI can do what I do?"

The question is wrong:

  • AI does not replace workers;
  • AI replaces unstructured effort.

The distinction matters, and everything I have learned building ctx reinforces it.


The Three Confusions

People who feel doomed by AI usually confuse three things:

People confuse... With...
Effort Value
Typing Thinking
Production Judgment
  • Effort is time spent.
  • Value is the outcome that time produces.

They are not the same; they never were.

AI just makes the gap impossible to ignore.

Typing is mechanical: Thinking is directional.

An AI can type faster than any human. Yet, it cannot decide what to type without someone framing the problem, sequencing the work, and evaluating the result.

Production is making artifacts. Judgment is knowing:

  • which artifacts to make,
  • in what order,
  • to what standard,
  • and when to stop.

AI floods the system with production capacity; it does not flood the system with judgment.


Code Is Nothing

This sounds provocative until you internalize it:

Code is cheap. Artifacts are cheap.

An AI can generate a thousand lines of working code in literal *minutes**:

It can scaffold a project, write tests, build a CI pipeline, draft documentation. The raw production of software artifacts is no longer the bottleneck.

So, what is not cheap?

  • Taste: knowing what belongs and what does not
  • Framing: turning a vague goal into a concrete problem
  • Sequencing: deciding what to build first and why
  • Fanning out: breaking work into parallel streams that converge
  • Acceptance criteria: defining what "done" looks like before starting
  • Judgment: the thousand small decisions that separate code that works from code that lasts

These are the skills that direct production: Hhuman skills.

Not because AI is incapable of learning them, but because they require something AI does not have:

temporal accountability for generated outcomes.

That is, you cannot keep AI accountable for the $#!% it generated three months ago. A human, on the other hand, will always be accountable.


The Evidence From Building ctx

I did not arrive at this conclusion theoretically.

I arrived at it by building a tool with an AI agent for three weeks and watching exactly where a human touch mattered.

YOLO Mode Proved Production Is Cheap

In Building ctx Using ctx, I documented the YOLO phase: auto-accept everything, let the AI ship features at full speed. It produced 14 commands in a week. Impressive output.

The code worked. The architecture drifted. Magic strings accumulated. Conventions diverged. The AI was producing at a pace no human could match, and every artifact it produced was a small bet that nobody was evaluating.

Production without judgment is not velocity. It is debt accumulation at breakneck speed.

The 3:1 Ratio Proved Judgment Has a Cadence

In The 3:1 Ratio, the git history told the story:

Three sessions of forward momentum followed by one session of deliberate consolidation. The consolidation session is where the human applies judgment: reviewing what the AI built, catching drift, realigning conventions.

The AI does the refactoring. The human decides what to refactor and when to stop.

Without the human, the AI will refactor forever, improving things that do not matter and missing things that do.

The Attention Budget Proved Framing Is Scarce

In The Attention Budget, I explained why more context makes AI worse, not better. Every token competes for attention: Dump everything in and the AI sees nothing clearly.

This is a framing problem: The human's job is to decide what the AI should focus on: what to include, what to exclude, what to emphasize.

ctx agent --budget 4000 is not just a CLI flag: It is a forcing function for human judgment about relevance.

The AI processes. The human curates.

Skills Design Proved Taste Is Load-Bearing

The skill trilogy (You Can't Import Expertise, The Anatomy of a Skill That Works) showed that the difference between a useful skill and a useless one is not craftsmanship:

It is taste.

A well-crafted skill with the wrong focus is worse than no skill at all: It consumes the attention budget with generic advice while the project-specific problems go unchecked.

The E/A/R framework (Expert, Activation, Redundant) is a judgment too:. The AI cannot apply it to itself. The human evaluates what the AI already knows, what it needs to be told, and what is noise.

Automation Discipline Proved Restraint Is a Skill

In Not Everything Is a Skill, the lesson was that the urge to automate is not the need to automate. A useful prompt does not automatically deserve to become a slash command.

The human applies judgment about frequency, stability, and attention cost.

The AI can build the skill. Only the human can decide whether it should exist.

Defense in Depth Proved Boundaries Require Judgment

In Defense in Depth, the entire security model for unattended AI agents came down to: markdown is not a security boundary. Telling an AI "don't do bad things" is production (of instructions). Setting up an unprivileged user in a network-isolated container is judgment (about risk).

The AI follows instructions. The human decides which instructions are enforceable and which are "wishful thinking".

Parallel Agents Proved Scale Amplifies the Gap

In Parallel Agents and Merge Debt, the lesson was that multiplying agents multiplies output. But it also multiplies the need for judgment:

Five agents running in parallel produce five sessions of drift in one clock hour. The human who can frame tasks cleanly, define narrow acceptance criteria, and evaluate results quickly becomes the limiting factor.

More agents do not reduce the need for judgment. They increase it.


The Two Reactions

When AI floods the system with cheap output, two things happen:

Those who only produce: panic. If your value proposition is "I write code," and an AI writes code faster, cheaper, and at higher volume, then the math is unfavorable. Not because AI took your job, but because your job was never the code. It was the judgment around the code, and you were not exercising it.

Those who direct: accelerate. If your value proposition is "I know what to build, in what order, to what standard," then AI is the best thing that ever happened to you: Production is no longer the bottleneck: Your ability to frame, sequence, evaluate, and course-correct is now the limiting factor on throughput.

The gap between these two is not talent: It is the awareness of where the value lives.


What This Means in Practice

If you are an engineer reading this, the actionable insight is not "learn prompt engineering" or "master AI tools." It is:

Get better at the things AI cannot do.

AI does this well You need to do this
Generate code Frame the problem
Write tests Define acceptance criteria
Scaffold projects Sequence the work
Fix bugs from stack traces Evaluate tradeoffs
Produce volume Exercise restraint
Follow instructions Decide which instructions matter

The skills on the right column are not new. They are the same skills that have always separated senior engineers from junior ones.

AI did not create the distinction; it just made it load-bearing.


If Anything, I Feel Empowered

I will end with something personal.

I am not worried: I am empowered.

Before ctx, I could think faster than I could produce:

  • Ideas sat in a queue.
  • The bottleneck was always "I know what to build, but building it takes too long."

Now the bottleneck is gone. Poof!

  • Production is cheap.
  • The queue is clearing.
  • The limiting factor is how fast I can think, not how fast I can type.

That is not a threat: That is the best force multiplier I've ever had.

The people who feel threatened are confusing the accelerator for the replacement:

*AI does not replace the conductor; it gives them a bigger orchestra.


If you remember one thing from this post...

Code is cheap. Judgment is not.

AI replaces unstructured effort, not directed expertise. The skills that matter now are the same skills that have always mattered: taste, framing, sequencing, and the discipline to stop.

The difference is that now, for the first time, those skills are the only bottleneck left.


The Arc

This post is a retrospective. It synthesizes the thread running through every previous entry in this blog:

From YOLO mode to defense in depth, the pattern is the same:

  • Production is the easy part;
  • Judgment is the hard part;
  • AI changed the ratio, not the rule.

This post synthesizes the thread running through every previous entry in this blog. The evidence is drawn from three weeks of building ctx with AI assistance, the decisions recorded in DECISIONS.md, the learnings captured in LEARNINGS.md, and the git history that tracks where the human mattered and where the AI ran unsupervised.

See also: When a System Starts Explaining Itself -- what happens after the arc: the first field notes from the moment the system starts compounding in someone else's hands.