Everyone thinks coding agents are magic. Type a prompt, get code. Like autocomplete but smarter.
Spend five minutes watching one work and you realize: it's a chaos of subsystems constantly arguing about what to do next.
You ask: "Add authentication with NextAuth."
What you imagine happens: AI understands, writes code.
What actually happens:
1. Agent greps the repo for anything auth-related
2. Reads whatever files fit in its context window
3. Writes code, runs it, hits an error
4. Reads the error, edits, runs again
5. Loops until something passes or you pull the plug (sketched below)
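Here's a minimal sketch of that loop. The helpers (searchRepo, askModel, applyPatch, runTests) are hypothetical stand-ins, not any real framework's API; the shape of the loop is the point.

```typescript
// Hypothetical sketch of an agent's inner loop; the helpers below are stubs, not a real agent's API.
type TestResult = { passed: boolean; errors: string[] };

async function searchRepo(task: string): Promise<string> {
  return `// the slice of the repo that matched "${task}"`;        // stub: a real agent greps or embeds
}
async function askModel(task: string, context: string, feedback: string): Promise<string> {
  return `// a patch for "${task}" given: ${feedback || "no errors yet"}`;  // stub: a real agent calls a model
}
async function applyPatch(_patch: string): Promise<void> {}        // stub: a real agent edits files
async function runTests(): Promise<TestResult> {
  return { passed: false, errors: ["Cannot find module 'next-auth/server'"] };  // stub result
}

async function agentLoop(task: string, maxAttempts = 7): Promise<boolean> {
  let feedback = "";
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const context = await searchRepo(task);                 // guess which files matter
    const patch = await askModel(task, context, feedback);  // generate a change
    await applyPatch(patch);
    const result = await runTests();                        // let reality grade it
    if (result.passed) return true;
    feedback = result.errors.join("\n");                    // only the latest failure survives
  }
  return false;                                             // attempt 7: you give up
}
```

Everything interesting lives in two places: which files searchRepo happens to pick, and what the test run reports back.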
This isn't intelligence. It's trial and error with better memory.
Here's what kills most agent tasks:
Your codebase: 50,000 lines
Agent context window: 8,000 tokens (~2,000 lines)
What agent sees: 4% of your code
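Where that 4% comes from, assuming roughly four tokens per line of code (the tokens-per-line ratio is an assumption, not a measurement):

```typescript
// Back-of-envelope behind the 4% figure; tokens-per-line is an assumed average.
const codebaseLines = 50_000;
const contextTokens = 8_000;
const tokensPerLine = 4;                                  // assumption
const visibleLines = contextTokens / tokensPerLine;       // 2,000 lines
const visibleShare = visibleLines / codebaseLines;        // 0.04
console.log(`Agent sees ${(visibleShare * 100).toFixed(0)}% of the code per pass`);
```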
Agent picks which 4% to look at. Gets it wrong? Writes code that doesn't fit your patterns. Uses libraries you don't have. Ignores conventions.
Watched an agent try to add a feature last week. Loop count: 7.
Attempt 1: Wrong import path
Attempt 2: Fixed import, wrong function signature
Attempt 3: Fixed signature, forgot dependency
Attempt 4: Added dependency, broke tests
Attempt 5: Fixed tests, introduced new bug
Attempt 6: Fixed bug, back to wrong import
Attempt 7: User gives up
Each attempt costs tokens. Each costs time. Errors compound because the agent can't see its own pattern of failure.
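A rough tally of what that loop adds up to, with assumed numbers rather than measured ones:

```typescript
// Illustrative cost of seven attempts; every figure here is an assumption.
const tokensPerAttempt = 9_000;   // ~8k of context in, ~1k of code out
const minutesPerAttempt = 4;      // prompt, wait, run, read the failure
const attempts = 7;

console.log(`${attempts * tokensPerAttempt} tokens burned`);   // 63,000 tokens
console.log(`${attempts * minutesPerAttempt} minutes gone`);   // ~half an hour, ending back at attempt 1's bug
```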
Boring, well-defined tasks:
Renaming a symbol across the codebase. Writing tests for code that already works. Converting one config format to another. Scaffolding yet another CRUD endpoint.
The pattern: agents excel when the solution is obvious and the scope is narrow. They fail when judgment is needed.
What works:
1. Break the task into tiny pieces (see the NextAuth slicing after this list)
2. Give agent one piece
3. Review output immediately
4. Fix mistakes yourself
5. Repeat
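Applied to the NextAuth task from the top, the slicing might look something like this. The breakdown, env var names, and routes are assumptions, not a prescription:

```typescript
// One possible slicing of "Add authentication with NextAuth" into agent-sized prompts.
const pieces: string[] = [
  "Install next-auth and create the /api/auth/[...nextauth] route with an empty provider list",
  "Add the GitHub provider, reading GITHUB_ID and GITHUB_SECRET from env",
  "Wrap the root layout in SessionProvider",
  "Protect /dashboard with getServerSession and redirect anonymous visitors to /login",
];

for (const piece of pieces) {
  console.log(`Next prompt: ${piece}`);   // one piece, one review, then the next
}
```

Each piece is small enough to review in a minute, which is the whole point.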
Agents are fast interns, not senior engineers. Treat them accordingly.
The agent isn't smart. It's just fast at being wrong until it's right.
— blanho