Articles from the Forge

Real implementation stories, honest lessons, and practical frameworks from the trenches of Agentic Quality Engineering. No hype, no vendor speak—just what actually works in production.

Latest first
● Published 12 min read
V3 Journey Platform Expansion Gap Analysis

The Portable Orchestra

When five releases in five days reveal how far the journey has gone. Portable quality intelligence, cryptographic witness chains, MinCut test optimization, and the enormous gap most organizations still face.

Portable Intelligence Witness Chain MinCut Testing Agentics Foundation
● Published 20 min read
Personal Reflection V3 Journey Emotional Intelligence

The Conductor Who Won't Stop Conducting

When the orchestra plays through grief, frustration, and fifteen releases, while the conductor learns about himself. 81 sessions, 596 messages, 38 wrong-approach corrections, and the hardest lesson yet.

Conductor Metaphor Verification Theater Sustainable Pace Grief
● Published 16 min read
AI Productivity Sustainable Pace Quality Mindset

The Quality Cost of the AI Vampire

Why the AI productivity drain goes deeper than energy — and what sustainable pace actually looks like in the agentic age. A quality engineer's response to Steve Yegge's "AI Vampire."

AI Vampire Completion Theater Decision Quality PACT Framework
● Published 18 min read
Claude Code Insights Self-Learning Systems V3 Journey

When the Orchestra Learns to Tune Itself

What Claude Code /insights revealed about 10 days of building. 285 messages, 32 sessions, 17 wrong-approach corrections, and the mirror that showed what AI-assisted development actually costs.

Claude Code /insights CLAUDE.md Friction Patterns
● Published 15 min read
Forensic Investigation Integration Testing V3 Journey

The Case of the Passing Tests: A 10-Day Investigation

When every test passes but nothing works together. Ten days of detective work proving what the code wasn't doing. Eight releases, ten forensic investigations, and lessons about integration gaps.

Integration Testing Sherlock Review V3 Journey Agentic QE
● Published 18 min read
Framework Validation V3 Architecture Anthropic Research

When Anthropic Confirms What the Trenches Already Taught Us

Reading research papers between code reviews. Patterns from production line up with patterns from Anthropic's agent evals and Constitutional Classifiers++ papers. V3 architecture decisions validated.

Agentic QE PACT Framework Agent Evals Constitutional Classifiers
● Published 18 min read
Data Loss Story Trilogy Part III Infrastructure

When the Orchestra Deletes Its Sheet Music

A tale of data loss, brutal honesty, and the infrastructure of trust in agentic systems. Twelve releases in fourteen days, and one nearly catastrophic failure.

Brutal Honesty Review Data Protection Backup Systems Integrity Rule
● Published 12 min read
Experiment Verification Story Production Story

The Conductor Finally Reads the Score

When verification becomes a feature. Nine days, 11 releases, and the journey from completion theater to verified results. 79.9% token reduction with receipts.

Code Intelligence Token Reduction Verification Integrity Rule
● Published 25 min read
Year in Review Transformation Story

From VP to Conductor: My 2025 Transformation Journey

How I went from leading a QA team to orchestrating AI agent swarms—and discovered that the hardest lessons weren't technical. The full story of building three open-source platforms.

PACT Framework ATD 2025 Multi-Agent Systems Conductor Metaphor
● Published 15 min read
Honest Failure Series Production Story

When the Orchestra Says 'Done' But Plays Off-Score

A conductor's lesson in verification. When agents claim success but the database is empty, and why "show me the data" is the only question that matters. 8 releases, countless lessons.

Agent Verification Nightly-Learner Q-Learning OpenRouter
● Published 20 min read
Guest Lecture Video

The Tester's Journey: From Chat to Conductor

How I learned that AI doesn't replace quality thinking—it demands more of it. A journey from prompt engineering to context engineering to agentic engineering. Includes video presentation from University of Aveiro.

Golden Age of QA AI Orchestration PACT Framework University Lecture
● Published 30 min read
Honest Failure Series

The Five-Release Journey Where I Forgot to Be a Tester

How a quality engineering professional shipped broken features for 17 days while claiming "100% complete." Eight brutal lessons learned from forgetting to verify what I already knew how to test.

Q-Learning Test Verification CI/CD Quality Engineering
● Published 18 min read
Build in Public Series

Multi-Agent Testing: Orchestra or Chaos?

Real story of building two testing platforms with specialized agent swarms. What worked, what failed spectacularly (54 TypeScript errors from "improvements"), and lessons learned from going solo with AI orchestration.

Multi-Agent Systems Agent Orchestration Production Stories Lessons Learned
● Published 10 min read
Launch Series

What is Agentic QE? (And Why PACT Matters)

Moving from testing-as-activity to agents-as-orchestrators. How PACT principles (Proactive, Autonomous, Collaborative, Targeted) bridge classical QE with autonomous testing systems, without the vendor hype.

Agentic QE PACT Framework Quality Engineering Foundations
● Published 16 min read
Framework Deep Dive

Holistic Testing in the Agentic Age

How the Holistic Testing Model evolves when testing happens across boundaries, in production, and through autonomous agents. From shift-left to orchestrated quality.

Holistic Testing Production Testing Agent Orchestration Quality Evolution
● Published 22 min read
Reality Check

AI Testing: Hype vs Reality (2025 Edition)

Cutting through vendor promises with real data on AI test generation effectiveness, maintenance overhead, and when traditional approaches still win. Real numbers from real projects.

AI Testing Real Data Critical Analysis Production Stories

Coming Soon

In Progress

The Conductor's Framework

Practical patterns for multi-agent coordination that work in production. The orchestration playbook from months of agent swarm experience.

Expected: November 2025
Planned

PACT Principles Deep Dive

Each PACT principle explained with production examples, implementation patterns, and real-world trade-offs from agentic testing systems.

Expected: November 2025
