Advanced
Production-grade systems and the full picture. RAG, evals, cost, security, and long-running agents.
11 min readSecure document ingestion for RAG: PDFs, OCR, metadata, and retention
Design a secure document-ingestion pipeline for RAG with permission metadata, OCR quality checks, source freshness, retention rules, deletion behavior, and ingestion tests.
9 min readAI ROI and maturity: how to measure adoption that actually works
Measure AI adoption using workflow ROI, quality, risk controls, and maturity levels instead of tool usage vanity metrics.
9 min readBuild vs buy AI systems: the practical decision framework
Decide when to buy, configure, extend, or build an AI system based on workflow fit, data control, cost, capability, and strategic value.
10 min readCompany knowledge RAG: permissions, leakage, and source boundaries
Design a company knowledge RAG with permission-aware retrieval, source ownership, leakage controls, and refusal behavior.
10 min readProduction AI failure modes: what breaks after the demo
Build a production AI failure-mode register with controls for hallucination, stale context, prompt injection, unsafe tool use, and weak fallbacks.
9 min readAI-native IDEs and repository-aware coding workflows
Design a repository-aware AI coding workflow that improves delivery speed without weakening review, security, tests, or ownership.
10 min readPrivate AI deployment patterns: local, VPC, self-hosted, and hybrid
Choose a private AI deployment pattern based on data sensitivity, capability needs, cost, latency, and operational capacity.
9 min readVoice agents for customer flows: where they work and where they fail
Decide whether a customer voice agent is appropriate and design the first rollout with disclosure, escalation, testing, and monitoring.
9 min readEU AI Act for SMEs: a practical governance plan
Create a practical AI governance baseline for an SME using AI tools, automations, or customer-facing systems in the EU.
13 min readShipping an LLM product: pricing, margins, and the anti-moat trap
Use the article as decision context for adoption, risk, governance, or investment choices.
11 min readSelf-hosted vs hosted inference: vLLM, TGI, and the break-even math
Use the article as decision context for adoption, risk, governance, or investment choices.
12 min readCost-optimizing inference: prompt caching, routing, and output control
Use the article as decision context for adoption, risk, governance, or investment choices.
14 min readPrompt injection and LLM security: threat models and defense-in-depth
Threat-model an LLM workflow and add concrete controls for untrusted content, retrieval, tool calls, authorization, monitoring, and incident response.
12 min readComputer use and browser agents in production
Evaluate the implementation pattern, failure modes, and guardrails before building.
12 min readBuilding memory for long-running agents
Evaluate the implementation pattern, failure modes, and guardrails before building.
12 min readContext engineering: managing 1M-token windows without context rot
Evaluate the implementation pattern, failure modes, and guardrails before building.
11 min readLangGraph vs CrewAI vs direct API: choosing an agent framework in 2026
Evaluate the implementation pattern, failure modes, and guardrails before building.
13 min readDesigning agents that don't loop forever
Evaluate the implementation pattern, failure modes, and guardrails before building.
13 min readFine-tuning in 2026: when LoRA beats RAG, and how to do it without a cluster
Evaluate the implementation pattern, failure modes, and guardrails before building.
12 min readChoosing between prompting, RAG, and fine-tuning (and when to combine)
Use the article as decision context for adoption, risk, governance, or investment choices.
12 min readRAG beyond chunks: graph RAG, agentic RAG, long-context RAG
Evaluate the implementation pattern, failure modes, and guardrails before building.
12 min readBuilding a production RAG: ingestion, embedding, retrieval, reranking, eval
Evaluate the implementation pattern, failure modes, and guardrails before building.
12 min readDesigning MCP tools that LLMs actually use correctly
Evaluate the implementation pattern, failure modes, and guardrails before building.
14 min readMCP from scratch: build a production-ready server in TypeScript
Evaluate the implementation pattern, failure modes, and guardrails before building.
Showing 24 of 29