Tag

architecture

18 posts

Ai Ml Apr 18, 2026

Memory — How Agents Build Continuity

Context windows forget. Production agents don't. The difference is a layered architecture: working memory, session memory, and long-term memory split into facts, events, and skills. Here's how real agents remember — and why forgetting on purpose matters.

10 min read

Architecture Apr 12, 2026

The Tool System: How an AI Gets Hands

A language model without tools is an expensive autocomplete. This post dissects how a production AI harness defines, registers, validates, and executes 40+ tools — from file reads to shell commands to MCP integrations — with type safety, concurrency control, and deferred loading.

9 min read

Architecture Apr 12, 2026

Anatomy of an AI Harness: What Lives Between You and the Model

Everyone debates which AI model is best. The real engineering happens in the harness — the production system of tools, permissions, memory, and orchestration that makes any model actually useful. This is a map of that system, drawn from real source code.

9 min read

Architecture Apr 12, 2026

The Permission Boundary: Human-in-the-Loop at Scale

An AI with shell access and no guardrails will eventually destroy something you care about. This post dissects how a production harness implements layered permissions, hooks, dangerous pattern detection, and trust boundaries — balancing safety with usability.

10 min read

Architecture Apr 12, 2026

The Orchestration Loop: Where Everything Converges

The orchestration loop is the heart of an AI harness — a state machine that coordinates API calls, streaming responses, concurrent tool execution, error recovery, and token budgets. This post traces the complete data flow from user input to final response.

9 min read

Architecture Apr 12, 2026

Skills: Packaging AI Workflows as Code

Ad-hoc prompting is fine for one-off questions. Repeatable workflows deserve structure. This post dissects how a production harness defines, discovers, loads, and executes skills — reusable AI workflows that turn tribal knowledge into executable automation.

8 min read

Architecture Apr 12, 2026

State, Cost, and the Production Surface

The invisible foundation beneath every AI harness layer: centralized state management, per-model cost tracking, rate limit handling, a custom React-to-terminal renderer, and multiple entry points. This post covers what makes 'works in a demo' become 'works in production.'

9 min read

Architecture Apr 12, 2026

Tasks and Concurrency: Background Agents at Work

A production AI harness is not single-threaded. Background agents explore codebases, shell commands execute, remote agents run on cloud infrastructure — all while the main conversation continues. This post dissects the task system that manages this concurrency.

8 min read

Architecture Apr 12, 2026

Context Engineering: Building the Model's World

The model is only as good as the context it receives. This post dissects how a production AI harness constructs system prompts, loads project instructions, manages persistent memory, and compresses context when the window fills — all to give the model the right information at the right time.

9 min read

Ai Ml Apr 12, 2026

From Chat to Agent: Why the Leap Matters

A chatbot answers. An agent finishes the job. The gap between them is not one feature — it's a stack of seven capabilities, built one rung at a time. This series walks that stack in plain English, for engineers and curious non-engineers alike.

9 min read

Tutorial Apr 10, 2026

Astro Deep Dive: Scaffolding a Production-Ready Astro 5 Project

Most Astro tutorials stop at pnpm create astro. By the time you need TypeScript strict mode, Tailwind 4, and a Cloudflare deployment pipeline enforcing Lighthouse 100, you are patching together Stack Overflow answers. This post gets you production-ready from the first command.

12 min read

Tutorial Apr 10, 2026

Astro Deep Dive: The Island Architecture

Every client:load directive you add ships JavaScript to every reader. Most of them do not need it. The difference between Lighthouse 100 and 85 is understanding which components need interactivity and choosing the right hydration strategy for each.

14 min read

Tutorial Apr 10, 2026

AI Skills in Practice: Anatomy of a Skill

A skill is not a long prompt. It is a structured workflow with a trigger, a prompt body, references, and an output contract. This post breaks down each component, shows how they interact, and explains the design decisions that separate skills that work from skills that frustrate.

11 min read

Tutorial Apr 10, 2026

Clean Code Python: Multi-Tenant Foundation — Context, Isolation, and the Data Boundary

Your BookStore API now has 200 customers who each want their own data. One missing WHERE clause leaks tenant data and kills the company. Here is how to make cross-tenant data leakage structurally impossible with context propagation, session events, and row-level isolation.

13 min read

Tutorial Apr 10, 2026

Clean Code Python: Event-Driven Architecture — Domain Events and CQRS Lite

When order placement triggers inventory, email, webhooks, analytics, and distributor notifications — doing it synchronously makes the endpoint slow and couples everything. Domain events decouple 'what happened' from 'what should happen next.'

13 min read

Tutorial Apr 10, 2026

FastAPI Auth: The Decision Framework — Choosing the Right Approach

Eight authentication and authorization methods, one decision tree. Here is the comparison matrix, production combination patterns, and pre-launch security checklist that turns auth knowledge into shipping decisions.

11 min read

Engineering Apr 10, 2026

Hashing Deep Dive: SHA, bcrypt, scrypt, and Argon2id From the Inside Out

Not all hashes are created equal. Here is how SHA-256, bcrypt, scrypt, and Argon2id actually work under the hood — the math, the memory, the attacks they resist, and exactly when to use each one.

21 min read

Tutorial Apr 9, 2026

Clean Code Python: Project Structure and Naming That Scales

Most Python backend projects start clean and become unmaintainable. Here is the folder layout and naming convention that keeps a FastAPI + SQLAlchemy codebase navigable at any size.

7 min read