AI Architecture

Production ML systems from 30 years of UNIX infrastructure experience

AI architecture is not model design; it is systems engineering. This section documents production-grade machine learning architectures shaped by over 30 years of UNIX infrastructure experience, with a focus on reliability, observability, deterministic behaviour, and long-term operability.

AI Architecture January 26, 2026 21:15

Version Control for Deterministic Systems: Git Isn't Enough

How Merkle chains, cryptographic attestation, and reproducible builds satisfy certification evidence requirements

12 min read

AI Architecture January 24, 2026 00:30

Testing ML Systems: Beyond Unit Tests and Accuracy Metrics

A practical testing strategy for production machine learning

6 min read

AI Architecture January 24, 2026 00:05

Cost Engineering for ML Infrastructure

Where the money goes in ML infrastructure and what to optimise first

7 min read

AI Architecture January 23, 2026 22:00

State Management in ML Services: Beyond Stateless Inference

Architectural patterns for ML systems that need to remember

9 min read

AI Architecture January 23, 2026 21:14

Graceful Degradation in ML Systems

Fallback strategies for production inference that fails gracefully instead of failing loudly

9 min read

AI Architecture January 23, 2026 18:00

The Observability Blind Spot: What ML Metrics Miss

Why accuracy looks fine while your production system burns

11 min read

AI Architecture January 19, 2026 23:00

The Certifiable-* Ecosystem: One Deterministic ML Pipeline

Eight interlocking C99 projects that make every ML pipeline stage bit-identical and auditable

10 min read

AI Architecture January 19, 2026 00:15

Deterministic ML Pipeline for Safety-Critical Systems

How fixed-point arithmetic and cryptographic chaining produce bit-identical results from data to deployment

11 min read

AI Architecture January 15, 2026 22:31

WCET Analysis for Neural Network Inference

How to prove worst-case execution time for convolution, matrix multiply, and pooling operations

11 min read

10 of 17 articles