Tag
Production
9 posts
Production lessons, incidents, postmortems and reliability.

AI agents in production: lessons learned after 18 months
Patterns, anti-patterns and architecture decisions when deploying AI agents in real production environments. 18 months of learnings condensed.

Feature flags in production: risk management without slowing development
Feature flag implementation patterns, lifecycle management, stale flag debt, and LaunchDarkly alternatives for teams that deploy to production frequently.
Computer Vision in Industry: Real Cases and ROI
Computer vision in industry: construction progress tracking, shelf monitoring, package inspection. Real cases with ROI calculations.
Zero Downtime Deployment: Strategies for Never Stopping Production
Zero downtime deployment strategies: blue-green, rolling, canary. Database migrations without interruption, session handling, and rollback in production.
AI Agents in Production: Orchestration and Failure Patterns
Production orchestration patterns for AI agents. Supervisor, hierarchical, consensus. Failure handling, human-in-the-loop, and cost management.
Testing in Production: Canary Deployments, Feature Flags, and Chaos Engineering
Practical guide to testing in production: canary releases, progressive rollouts, feature flag management, and first steps with chaos engineering.
LLMs in Production: Costs, Latency, and the Metrics Nobody Talks About
The reality of operating LLMs in production: token costs at scale, latency budgets, caching, model routing, and the hidden costs of prompt engineering.
Kubernetes in Production: Lessons from 3 Years Operating Clusters
Hard-won lessons from running Kubernetes in production for 3 years. When it is worth it, cluster sizing, networking gotchas, security hardening and cost optimization.

MLOps: From Notebook to Production Pipeline
Bridging data science experimentation and production ML systems. Model registries, feature stores, monitoring, cost optimization.