Articles from the Forge

Real implementation stories, honest lessons, and practical frameworks from the trenches of Agentic Quality Engineering. No hype, no vendor speak—just what actually works in production.

Latest first
NEW
● Published 18 min read
Framework Validation V3 Architecture Anthropic Research

When Anthropic Confirms What the Trenches Already Taught Us

Reading research papers between code reviews. Patterns from production meeting patterns from Anthropic's agent evals and Constitutional Classifiers++ papers. V3 architecture decisions validated.

Agentic QE PACT Framework Agent Evals Constitutional Classifiers
● Published 18 min read
Data Loss Story Trilogy Part III Infrastructure

When the Orchestra Deletes Its Sheet Music

A tale of data loss, brutal honesty, and the infrastructure of trust in agentic systems. Twelve releases in fourteen days, and one almost catastrophic failure.

Brutal Honesty Review Data Protection Backup Systems Integrity Rule
● Published 12 min read
Experiment Verification Story Production Story

The Conductor Finally Reads the Score

When verification becomes a feature. Nine days, 11 releases, and the journey from completion theater to verified results. 79.9% token reduction with receipts.

Code Intelligence Token Reduction Verification Integrity Rule
● Published 25 min read
Year in Review Transformation Story

From VP to Conductor: My 2025 Transformation Journey

How I went from leading a QA team to orchestrating AI agent swarms—and discovered that the hardest lessons weren't technical. The full story of building three open-source platforms.

PACT Framework ATD 2025 Multi-Agent Systems Conductor Metaphor
● Published 15 min read
Honest Failure Series Production Story

When the Orchestra Says 'Done' But Plays Off-Score

A conductor's lesson in verification. When agents claim success but the database is empty, and why "show me the data" is the only question that matters. 8 releases, countless lessons.

Agent Verification Nightly-Learner Q-Learning OpenRouter
● Published 20 min read
Guest Lecture Video

The Tester's Journey: From Chat to Conductor

How I learned that AI doesn't replace quality thinking—it demands more of it. A journey from prompt engineering to context engineering to agentic engineering. Includes video presentation from University of Aveiro.

Golden Age of QA AI Orchestration PACT Framework University Lecture
● Published 30 min read
Honest Failure Series

The Five-Release Journey Where I Forgot to Be a Tester

How a quality engineering professional shipped broken features for 17 days while claiming "100% complete." Eight brutal lessons learned from forgetting to verify what I already knew how to test.

Q-Learning Test Verification CI/CD Quality Engineering
● Published 18 min read
Build in Public Series

Multi-Agent Testing: Orchestra or Chaos?

Real story of building two testing platforms with specialized agent swarms. What worked, what failed spectacularly (54 TypeScript errors from "improvements"), and lessons learned from going solo with AI orchestration.

Multi-Agent Systems Agent Orchestration Production Stories Lessons Learned
● Published 10 min read
Launch Series

What is Agentic QE? (And Why PACT Matters)

Moving from testing-as-activity to agents-as-orchestrators. How PACT principles (Proactive, Autonomous, Collaborative, Targeted) bridge classical QE with autonomous testing systems, without the vendor hype.

Agentic QE PACT Framework Quality Engineering Foundations
● Published 16 min read
Framework Deep Dive

Holistic Testing in the Agentic Age

How the Holistic Testing Model evolves when testing happens across boundaries, in production, and through autonomous agents. From shift-left to orchestrated quality.

Holistic Testing Production Testing Agent Orchestration Quality Evolution
● Published 22 min read
Reality Check

AI Testing: Hype vs Reality (2025 Edition)

Cutting through vendor promises with real data on AI test generation effectiveness, maintenance overhead, and when traditional approaches still win. Real numbers from real projects.

AI Testing Real Data Critical Analysis Production Stories

Coming Soon

In Progress

The Conductor's Framework

Practical patterns for multi-agent coordination that work in production. The orchestration playbook from months of agent swarm experience.

Expected: November 2025
Planned

PACT Principles Deep Dive

Each PACT principle explained with production examples, implementation patterns, and real-world trade-offs from agentic testing systems.

Expected: November 2025

Never Miss an Article

Weekly insights on Agentic QE, implementation stories, and honest takes on quality in the AI age.

Weekly on Mondays. Unsubscribe anytime.