Technical Debt and AI: Why You Can't Build on a Broken Foundation
Layering AI onto systems carrying significant technical debt doesn't hide the debt — it amplifies it. This post covers how to audit the specific foundations your AI use case depends on, prioritise remediation without a full rewrite, and sequence modernisation and AI adoption so your investment actually reaches production.

The hidden cost of technical debt when adopting AI
Technical debt is the accumulated cost of shortcuts, deferred maintenance, and architectural decisions that made sense at the time but now slow everything down. When organisations attempt to layer AI onto systems carrying significant technical debt, that cost doesn't disappear — it multiplies. The debt surfaces as integration failures, unreliable data pipelines, and AI outputs your team can't trust or act on.
This is one of the more honest conversations in AI adoption, and it doesn't get enough airtime. The pressure to ship AI features is real. Boards are asking about it. Competitors appear to be doing it. But the organisations that rush AI onto unstable foundations tend to end up with expensive pilots that never reach production — not because AI failed them, but because their infrastructure was never ready.
What technical debt actually looks like in an AI context
Technical debt in an AI context means your systems cannot reliably supply, transform, or serve the data and compute that AI requires. It is more specific than the general notion of "old code". The debt that blocks AI adoption tends to cluster around a few recurring patterns.

Fragmented data ownership. Data lives in siloed databases, departmental spreadsheets, and vendor-locked SaaS tools with no consistent schema or governance. An AI model is only as reliable as the data it is trained or grounded on. Fragmented ownership means inconsistent definitions, duplicate records, and no single source of truth — conditions that produce models that confidently return wrong answers.
Monolithic application architecture. A tightly coupled monolith makes it difficult to expose the clean APIs and event streams that AI features depend on. You can't easily plug a recommendation engine or an LLM-powered assistant into a codebase where business logic, data access, and presentation are entangled. Every integration becomes a high-risk surgery.
Missing observability. Systems without structured logging, metrics, and tracing make it impossible to monitor AI behaviour in production. Detecting model drift, catching edge-case failures, and debugging unexpected outputs all depend on telemetry that many legacy systems simply don't produce.
Undocumented business logic. Rules encoded in stored procedures from 2009, or institutional knowledge held by two engineers who have since left, create enormous risk when you try to automate or augment those processes with AI. You can't safely automate what you don't fully understand.
Why AI amplifies debt rather than hiding it
There is a tempting logic that says AI can work around messy data and fragile systems — that a smart enough model will compensate for poor foundations. This logic is wrong, and it is worth being direct about why.

AI systems are more sensitive to data quality issues than traditional software, not less. A conventional application with a bug produces a predictable wrong output. An AI system trained or grounded on poor data produces outputs that are plausible-sounding but wrong in ways that are harder to detect and harder to debug. The failure mode is less obvious, which makes it more dangerous.
AI also raises the stakes for system reliability. When an AI feature is making recommendations that affect customer experience, pricing, or operations, a flaky integration or an unreliable data pipeline has business consequences that a slow-loading dashboard does not. The higher the stakes of the AI application, the more damage underlying debt can do.
Finally, AI requires ongoing operations. You need to retrain or update models, monitor for drift, and manage prompt changes. Every one of those activities is harder — sometimes impossible — if the underlying infrastructure is not properly instrumented and maintainable. What looked like manageable debt before AI adoption becomes a genuine blocker once you are trying to run AI in production.
How to audit your foundations before an AI investment
An AI readiness audit is not a full technical due diligence engagement. It is a focused assessment of the specific systems, data assets, and infrastructure components that your intended AI use case depends on. A useful audit covers four domains.
Data audit. Identify the data sources your AI application will rely on. For each source, assess accessibility (can you query it reliably via API or pipeline?), quality (what is the error rate, staleness, and completeness?), and governance (who owns it, and is there a schema contract?). This alone surfaces most of the risk.
Integration architecture review. Map how your target systems communicate. If the answer is "batch file exports" or "direct database reads from production", that is a signal. AI features generally require event-driven or API-based integration patterns. Assess whether your architecture can support them without a full rebuild.
Observability assessment. Review what structured telemetry your systems currently emit. Can you trace a request end-to-end? Can you query logs by correlation ID? Do you have alerting on error rates and latency? If not, you are building AI into a black box.
Team and process review. AI systems require different operational practices than traditional software. Assess whether your team has the skills and processes to manage model versioning, data pipeline health, and production monitoring — or whether those capabilities need to be built alongside the AI work.
Our application modernisation engagements typically begin with a structured technical architecture review that covers these domains. The findings become the input to a prioritised remediation plan.
How to prioritise remediation without a full rewrite
Not all debt needs to be resolved before AI work begins. The goal of prioritisation is to identify the debt that sits on the critical path of your specific AI use case — and address that first, in a way that is incremental and doesn't require stopping product development.
Map debt to AI dependency. List the AI features or use cases you are targeting. For each one, trace back to the data sources, APIs, and infrastructure components it requires. Debt in those specific components is high priority. Debt elsewhere can wait.
Apply the strangler fig pattern to architecture. Rather than rewriting a monolith before you can ship AI, identify the seams where you can extract clean services incrementally. Build the AI feature against the new service boundary, not the legacy system directly. The legacy system continues to run while you grow the modern layer around it. This approach is well-established for monolith decomposition and applies equally when the driver is AI enablement. We have written more about this in our post on the strangler fig pattern for legacy systems.
Treat data quality as a first-class workstream. Data remediation is not glamorous work, but it is often the single highest-leverage activity before an AI investment. Establish clear data ownership, define schema contracts, and build pipeline validation checks. This work benefits every AI use case you pursue — not just the first one.
Automate observability early. Instrumentation is much cheaper to add before you build the AI layer than after. Structured logging and basic metrics should be in place before you start model development, not retrofitted during an incident.
The relationship between modernisation and AI adoption
Application modernisation and AI adoption are often treated as separate programmes. They are not. Modernisation creates the conditions — clean APIs, reliable data flows, observable systems — under which AI can actually work. AI adoption gives modernisation a compelling business driver beyond "reduce technical debt", which has historically been a hard sell at the executive level.
Organisations that sequence these well — modernise the components AI depends on, then build AI on the modernised layer — tend to move faster than those who attempt either in isolation. The AI use case forces specificity in the modernisation work: you are not modernising everything, you are modernising the parts that matter for the outcome you are targeting.
This is the framing we bring to AI product strategy engagements. The question is not just "what AI should we build?" but "what does our foundation need to look like for that AI to reach production and stay there?"
Building the data infrastructure AI actually needs
Data infrastructure is the layer most often underestimated in AI planning. A working AI system in production requires not just historical data to train or ground against, but reliable pipelines to keep that data fresh, a feature store or vector index to serve it efficiently, and monitoring to detect when data distributions shift.
Growing Australian businesses often have reasonable data volumes but inadequate infrastructure to operationalise it. Data sits in production databases, analytics is done in exports to spreadsheets, and there is no event stream or change-data-capture layer that downstream AI systems could consume.
Building that infrastructure before you invest in AI model development is not overhead — it is the work that determines whether your AI investment returns value or stalls in a pilot. Our data infrastructure practice exists specifically to help technology teams build that layer in a way that is maintainable without a large internal data engineering team.
A practical starting point
If your organisation is planning an AI initiative and you have a suspicion that the foundations aren't ready, the most valuable thing you can do is get a clear picture of where the debt sits and how much of it is actually on the critical path of what you want to build.
That assessment doesn't have to be a six-month engagement. A focused technical review — four to six weeks, scoped to your target AI use case — will surface the genuine blockers, separate them from the noise, and give you a prioritised remediation plan that is realistic about sequencing.
If you're navigating the intersection of legacy systems and AI adoption, we can help. Start with a conversation about what you're trying to build and what your current foundation looks like — that's usually enough to identify whether there's a fit.
For more thinking on AI adoption, data infrastructure, and engineering strategy, browse our insights.
Chris Kerr
Founder of Horizon Labs. Twenty years building production software for Australian mid-market businesses, the last seven focused on putting AI into systems that operate at 3am without anyone watching. Writes about strategy, fractional CTO work, and the operational discipline that separates AI demos from AI products.

