Horizon LabsHorizon Labs

Insights

Product, design, AI, and engineering perspectives from our team.

Context Engineering for LLM Apps: Beyond Prompt Templates
25 June 2026

Context Engineering for LLM Apps: Beyond Prompt Templates

Prompt templates are where LLM applications start. Context engineering is what makes them work reliably in production. This article covers the four core levers — retrieval, compression, memory, and ordering — and how to build a context pipeline that produces consistent, cost-efficient model behaviour at scale.

9 min readChris Kerr
Data Contracts: Stopping Pipeline Breakage Before It Starts
25 June 2026

Data Contracts: Stopping Pipeline Breakage Before It Starts

Silent schema drift is one of the most common and costly causes of broken data pipelines and degraded AI models — and it rarely announces itself. Data contracts are the structural mechanism that catches upstream changes before they reach production, enforcing schema, quality, and freshness expectations at the producer level. This post explains what data contracts are, how to implement them with modern tooling, and why they are foundational to reliable AI and analytics infrastructure.

9 min readChris Kerr
AI Incident Response: What to Do When Your Model Fails in Production
25 June 2026

AI Incident Response: What to Do When Your Model Fails in Production

When an AI model fails in production, the failure is often silent — no error code, just degrading outputs. This guide is a practical incident response playbook for ML and LLM systems: detection, severity classification, rollback, stakeholder communication, and post-incident review, built for technical leaders who need to extend their existing incident processes to cover AI-specific failure modes.

13 min readChris Kerr
The Open Knowledge Format: AI-ready knowledge without lock-in
25 June 2026

The Open Knowledge Format: AI-ready knowledge without lock-in

The Open Knowledge Format (OKF) is a vendor-neutral way to turn scattered organisational knowledge into a portable, AI-ready asset. Here's what it is, why it matters for RAG and AI readiness, and how to start.

6 min readChris Kerr
Change Management for AI Adoption: Getting Staff to Actually Use It
24 June 2026

Change Management for AI Adoption: Getting Staff to Actually Use It

AI rollouts stall not because the technology fails, but because the people side is an afterthought. This post breaks down why adoption flatlines and provides a practical eight-step change management playbook to drive genuine, sustained AI usage across your teams.

10 min readChris Kerr
Upskilling Your Engineering Team for AI: A Practical Plan for CTOs
24 June 2026

Upskilling Your Engineering Team for AI: A Practical Plan for CTOs

Most engineering teams can integrate an API — far fewer are ready to build production AI systems that are observable, compliant, and resilient. This post gives CTOs a concrete plan: how to map current capabilities, build tiered learning paths, design hands-on projects, and decide when to train versus hire.

10 min readChris Kerr
Fine-Tuning Small Language Models for Domain-Specific Tasks
23 June 2026

Fine-Tuning Small Language Models for Domain-Specific Tasks

Fine-tuning a small language model can outperform a frontier model on narrow tasks — but only when the task, data, and economics actually justify the overhead. This article covers when fine-tuning makes sense, how to prepare data and evaluate properly, and how to honestly assess the cost trade-off against prompting a frontier model API.

10 min readChris Kerr
Caching Strategies for LLM Applications: Reducing Latency and Cost
23 June 2026

Caching Strategies for LLM Applications: Reducing Latency and Cost

Caching is one of the most underused levers for reducing cost and latency in production LLM applications. This article covers prompt caching, semantic caching, and response caching — what each layer does, when to use it, and how to think about invalidation and observability.

11 min readChris Kerr
Structured Outputs and Function Calling: Making LLMs Reliable
23 June 2026

Structured Outputs and Function Calling: Making LLMs Reliable

Structured outputs and function calling are the mechanisms that make LLMs viable in production workflows — but reliable implementation requires deliberate schema design, validation layers, and observability from the start. This guide covers how to use both patterns effectively, when to choose each, and the failure modes that catch teams off-guard at scale.

9 min readChris Kerr