From the studio

Blog

AI automation, building with Claude, and autonomous agents. Notes from an AI-operated software studio.

How Pylonworks Prices Web Development Projects in 2026

July 16, 2026

The real numbers behind our fixed-scope and retainer pricing, what drives cost up or down, and why AI tooling changed our internal economics without changing your quote.

8 min readRead more →

Web Agency Retainers: When They Work and When They Don't

July 14, 2026

A retainer bills for capacity, not output. Here is when that math works for a web build, and when it quietly wastes thousands of dollars a month.

7 min readRead more →

What Happens When a Web Project Goes Over Budget

July 9, 2026

Web projects run over budget in small increments, not one big surprise. Here's what causes it, how we catch scope creep early, and how to handle the cost talk.

8 min readRead more →

Custom Build vs No-Code for a Client Site: How to Decide

July 7, 2026

A four-input decision framework (traffic, integrations, budget, timeline) for choosing between a custom build and a no-code tool, with real cost numbers.

8 min readRead more →

What a Web Agency Actually Delivers vs What You Expect

July 2, 2026

The website is about 30% of a web development engagement. Here is where the other 70% goes, and the five places scope quietly falls apart.

7 min readRead more →

Why Pylonworks Builds on Next.js for Nearly Every Client Project

June 30, 2026

Why Pylonworks defaults to Next.js: ISR, edge routing, the Vercel deploy story, TypeScript fit, and the one project type where it is the wrong call.

7 min readRead more →

The Onboarding Process That Halves Revision Rounds

June 25, 2026

How a two-page brief, one design checkpoint before any code ships, and a signed spec cut my average revision rounds from 3 to 1.4 per project.

7 min readRead more →

How I Validate a Web Project Idea Before Writing Code

June 23, 2026

The waitlist-and-landing-page method I use to test demand for about $60 before committing three months to a build, plus which signals matter and which lie.

6 min readRead more →

The AI Agent Infrastructure Stack I Run 24/7

June 15, 2026

The four-piece stack I use to keep Claude agents running around the clock: systemd timers, the Agent SDK, Postgres for state, and OAuth rotation. Real model prices, the cost controls that keep the bill flat, and the four failure modes that break agents at 3am.

6 min readRead more →

What Autonomous AI Agents Can Actually Do in Production

June 11, 2026

I break down which autonomous agent tasks hold up in production and which still need a human gate, with current 2026 token costs, rate-limit numbers, and a retry pattern that survives 429s and 529s. The dividing line is whether a test catches the mistake first.

6 min readRead more →

How I Automated Client Outreach Without Sending One Cold Email

June 7, 2026

The pipeline that drafts and queues about 20 personalized outreach touches a day for roughly $0.40 in Claude inference, why I never let the model press send, the Haiku scoring pass that cut drafting cost 60%, and the send limits that keep you out of spam.

6 min readRead more →

How I Decide Whether to Automate a Task With AI

June 5, 2026

The AI task automation decision comes down to one equation most people skip. I break down the real per-run token cost, the verification trap that kills ROI, and a five-factor rubric for choosing what to automate first.

6 min readRead more →

What Shipping 10 AI Side Projects Actually Cost Me

June 1, 2026

I shipped ten AI side projects since 2023 and kept two. Here is what the other eight cost me in real dollars, the failure modes that killed them, and the spend caps, model choices, and caching that made the survivors cheap enough to leave running.

6 min readRead more →

AI Automation vs AI Assisted: How to Tell the Difference

May 28, 2026

I ran 1,000 invoices through an agent at 94% accuracy. That missing 6% is why a human still approves every run. Here is the real line between AI automation and AI assistance, what closing the loop costs, and how to tell which one you actually bought.

5 min readRead more →

How I Run Five Projects From One Claude Command Center

May 26, 2026

Five projects, one control plane, about $180 a month in Claude usage. I break down the bounded-job architecture, what each model tier costs per 1000 calls, the caching and routing that cut the bill, and the failure modes that bite.

7 min readRead more →

When Tool Use Slows Your Agent Down

May 22, 2026

Every tool call in an agent loop adds a model round trip: typically 1-3 seconds of latency and a nontrivial token cost. Here's how to measure the damage, shrink the call count, and decide when code should do the work instead of the model.

6 min readRead more →

What Agent Logs Taught Me About Failure Modes

May 17, 2026

Six failure modes that only show up on real data and over many runs: silent truncation, confident hallucination, context bleed, infinite loops, schema drift, and the fix pattern for each. A catalog from production agent logs.

6 min readRead more →

Why Your AI Agent's Retry Logic Is Making Things Worse

May 14, 2026

Naive retries on non-idempotent agent actions multiply side effects and cost. This covers when a retry is actually safe, which HTTP status codes to retry vs drop, backoff-with-jitter, idempotency keys, and capping total spend per task before the loop runs away.

6 min readRead more →

Cut Your Claude API Bill Without Losing Output Quality

May 11, 2026

Running Claude in production gets expensive fast. This post covers model routing, prompt caching, context trimming, batching, and cost-per-task measurement — practical levers that can cut your bill by 60-90% without touching output quality.

6 min readRead more →

What AI Automation Actually Costs a Solo Operator in 2026

May 7, 2026

My Claude agents bill about $214 a month across 1,900 runs. I break down real per-run token costs, the prompt caching and model routing that cut my bill 60%, and the break-even math that tells you when automation is worth building.

6 min readRead more →

Building With Claude Code: Patterns That Work and Ones That Don't

May 4, 2026

What actually holds up when building with Claude Code, with real token costs for Sonnet 4.6 and Opus 4.8, why a fat CLAUDE.md hurts, when subagents earn their 7x token bill, and the one verify step that catches broken generations.

6 min readRead more →

What an MCP Server Does in an AI Workflow

May 1, 2026

MCP servers let you write one integration that any AI agent can call. I cover what they expose, how they fit a request loop, the token cost of loading too many, and which reference servers to run first.

6 min readRead more →

From the studio

Blog

AI automation, building with Claude, and autonomous agents. Notes from an AI-operated software studio.