MHRA AI Airlock: NHS ambient voice procurement

The Medicines and Healthcare products Regulatory Agency has stopped treating ambient voice as a future problem. In April 2026, the Department of Health and Social Care committed £1.2 million per year to expand the MHRA AI Airlock — its regulatory sandbox for AI as a Medical Device — through 2029. Ambient voice technology, the category of AI that listens to clinician-patient conversations and drafts the note, now sits at the centre of that programme alongside cancer diagnostics and rare-disease tools. The signal to the market is unambiguous: the suppliers who clear the Airlock gates first will own the NHS framework agreements that follow.

That matters because the framework is already moving. On 16 January 2026, NHS England published a national supplier registry of 19 ambient voice technology vendors. London trusts have committed to deploying AI scribing across 20,000 clinicians. The Digital Dictation, Speech Recognition and Outsourced Transcription 2 framework extends to January 2034 and is the procurement vehicle that channels trust-level spend into national contracts. If your enterprise voice AI strategy touches healthcare — directly as a supplier, or indirectly as a buyer commissioning clinical workflows — the architecture choices you make this quarter determine whether you sit inside the framework or outside it.

This guide is shipped by the team behind Dilr Voice — enterprise voice AI live in 40+ countries. Or see voice AI agents for the architecture detail behind regulated deployments.

The Airlock criteria are not a procurement checklist invented in Whitehall. They are an architectural lens: consent capture, on-device or sovereign processing, clinician-in-the-loop review, and an immutable audit trail. Read them carefully and they map directly onto enterprise voice AI design decisions any regulated buyer should already be asking about. The difference is that in healthcare, the MHRA AI Airlock makes those decisions auditable before a single patient interaction. The parallel for finance, insurance, and law isn't hypothetical — it is the same architecture the FCA's AI governance regime in 2026 expects of voice AI deployments touching customer outcomes.

Key takeaway

The Airlock is not a feature checklist. It is a public, evidence-based gate that decides which ambient voice suppliers earn the right to be procured by 215 NHS trusts. Map your voice architecture to the four gates now, or buy a remediation programme in 2027.

The four Airlock gates — and what NHS procurement is actually testing

The Airlock programme runs phased cohorts. Phase 2, finalising in March 2026 with reporting due Summer 2026, included seven AI technologies — ambient voice tools sat alongside diagnostics for cancer, eye disease, and obesity treatment. Each candidate passes through structured evidence collection in real clinical environments before MHRA publishes findings. The findings then feed two parallel procurement gates: the NHS England AVT Supplier Registry (live since January 2026) and the Digital Dictation, Speech Recognition and Outsourced Transcription 2 framework. Suppliers who can demonstrate Airlock-grade evidence collapse a procurement cycle that would otherwise take 18 months into something a trust can act on in a quarter.

£3.6m

DHSC funding for AI Airlock through 2029

Suppliers on NHSE AVT registry (Jan 2026)

20k

London clinicians targeted for AI scribe rollout

2034

Framework horizon for NHS ambient voice procurement

The contrarian read is this: most ambient voice suppliers are optimising for accuracy benchmarks and clinician productivity claims. The Airlock does not care about either in isolation. It cares about whether the supplier can produce evidence — auditable, reproducible, and tied to a specific clinical pathway — that the system behaves as advertised under variation. That is a much harder bar to clear than a marketing whitepaper, and it is closer to medical-device validation than to SaaS QA. Voice AI buyers in regulated industries outside the NHS should pay attention: this is the regulatory direction of travel. The same logic underpins how the ICO's AI Code of Practice applies to voice AI for non-healthcare deployments, and how clinical-data architecture maps onto HIPAA-grade voice automation for cross-border operators.

What the Airlock actually demands of voice architecture

The four gates translate into concrete design decisions. Consent must be captured at the start of every clinical encounter, with the patient able to withdraw at any point and the system able to redact retrospectively. Processing must be sovereign — UK or EU residency at minimum, with on-device options for the most sensitive consultations. Clinician-in-the-loop is non-negotiable: the model drafts, the clinician signs off, and the audit trail records both. The audit trail itself must be immutable, queryable, and aligned with the Digital Technology Assessment Criteria (DTAC) that NHS England demands at the registry layer.

Most enterprise voice AI platforms were not designed for this. They were designed for contact-centre traffic, where consent is captured once at the IVR, where processing can be cross-border, and where audit trails are eventual rather than immutable. Retrofitting them for clinical use is possible but expensive, and the Airlock evidence requirement makes the retrofit visible to regulators in a way that contact-centre deployment never was. The same architecture review underpins our AI placement diagnostic — a fixed-fee assessment used before any NHS deployment commitment, and the equivalent of the pre-Airlock readiness work serious suppliers commission privately.

How the four gates map to voice AI architecture

The mapping is precise. A serious supplier should be able to walk a buyer through each gate with engineering specifics — not slide-deck claims.

Airlock criterion	What it tests	What suppliers must demonstrate	Common failure	DILR.AI default
Consent capture	Patient understands, agrees, can withdraw	Verbal consent token logged at session start; withdrawal triggers redaction within 24 hours	Consent collected by reception, not at recording boundary; no withdrawal pathway	Per-encounter consent token, retroactive redaction API, withdrawal SLA logged
On-device or sovereign processing	Data minimisation, residency	UK or EU residency end-to-end; on-device option for sensitive consultations	Cross-border inference via US LLM API; transcript egress to vendor cloud	UK-resident inference, optional on-device model for ambient capture
Clinician-in-the-loop	Liability and clinical safety	Clinician sees draft, edits, signs off; no auto-commit to EHR	Auto-commit on consultation end; clinician sees only the final note	Mandatory review state, edit-diff capture, sign-off audit row
Audit trail	Reproducibility, DTAC compliance	Immutable log of consent, transcript, model version, edits, sign-off	Logs editable; no model version capture; gaps between transcript and note	Append-only ledger, model-version pinning, DTAC-aligned export

The same approval-and-evidence pattern is what we describe in our AI voice agents product surface — and what regulated buyers in fintech, insurance, and legal services are starting to demand even before their own regulators catch up. The healthcare bar is moving the market.

Why the framework opens a procurement window now — and how to plan for it

The NHS England AVT Supplier Registry reopened for applications on 3 February 2026 and remains open. The framework horizon is 2034. The trust-level rollouts — London's 20,000-clinician programme being the largest visible signal — are happening this calendar year, not in five years. Three forces are compressing the timeline: the productivity case (clinicians lose hours per shift to documentation), the political case (the NHS 10-Year Plan demands AI-led workflow change), and the regulatory case (Airlock evidence is the cheapest path to procurement defensibility).

The implication for any enterprise voice AI vendor or buyer is structural. If you are a supplier, the Airlock is the gate. Passing it generates evidence that satisfies DTAC, satisfies the registry, and satisfies the framework — three procurement layers cleared in one move. If you are a buyer commissioning ambient voice inside a trust, your own diligence shrinks dramatically because the Airlock has already done the hard regulatory work. If you are an enterprise outside healthcare watching the UK regulatory direction, the Airlock pattern is the leading indicator for how the ICO, FCA, and EU AI Act will converge on AI tool inventories in the next 18 months.

The cautionary note: speed of registry inclusion is not the same as speed of trust deployment. A supplier can be on the registry and still take 9 months to land its first paid clinical pathway, because the trust-level governance work — DPIA, clinical safety case, change management — runs in parallel. Suppliers who treat the Airlock as a marketing milestone rather than an operational template will discover this the hard way. The same dynamic is well-documented for non-clinical deployments in our analysis of AI voice pilot purgatory — the gap between procurement readiness and production scale is where most programmes stall.

The non-consensus angle: ambient voice is the wedge, not the destination

Most coverage of ambient voice treats it as the end-state — clinicians dictate, AI scribes, productivity rises, end of story. The harder read is that ambient voice is the wedge into a much larger reorganisation of clinical voice infrastructure. Once a trust has consented voice capture, sovereign processing, clinician-in-the-loop review, and an immutable audit trail running across primary care consultations, the same infrastructure becomes the substrate for triage automation, patient follow-up, appointment reminders, and outbound recall campaigns. The supplier who lands the ambient voice contract first inherits the right of first refusal on every adjacent voice workflow the trust commissions afterward. That is why the framework horizon is 2034 and not 2027.

This is the same dynamic playing out in adjacent verticals — see our analysis of AI voice for healthcare appointment management and the underlying voice biometric data security architecture that any cross-pathway voice deployment requires. The buyers who recognise the wedge dynamic early will negotiate framework-grade pricing for ambient voice and inherit the adjacent volume at a marginal cost. The buyers who treat each workflow as a fresh procurement will pay platform pricing four times.

Suppliers also need to think hard about the orchestration question. The Airlock evaluates the system, not the components. A supplier stacking a US speech-to-text API, a US LLM, and a UK-hosted application layer will struggle to produce coherent evidence — every Airlock probe surfaces a residency, consent, or audit gap. The orchestration-versus-platform decision becomes existential, not architectural. And the underlying UK data residency obligations for enterprise voice AI are the floor — Airlock evidence sits on top.

If you are scoping an ambient voice deployment, the right next step is a structured architecture review against the four gates before you commit to a supplier. That review is what we deliver as a fixed-fee engagement under our AI operating model consulting — RACI, governance, evidence templates, and a framework-readiness scorecard the trust's clinical safety officer can sign off. The same engagement scope applies to suppliers preparing for Airlock entry, and we are happy to discuss it directly via /contact.

If you are scoping ambient voice for a UK trust, start with the architecture: try Dilr Voice for the regulated voice substrate, engage our AI execution office for trust-level rollout discipline, see our DATS methodology for the five-stage engagement model, or read about our approach to placing AI inside regulated workflows.

Service

AI Placement Diagnostic

Talk to the operators

Clear the Airlock gates before the framework window closes.

30-min scoping call · No deck · Confidential. We map your ambient voice architecture against the four MHRA gates and tell you exactly where the evidence is thin.

Book a call → See operating model →

Written by the Dilr.ai engineering team — practitioners who ship enterprise AI in production. Follow us on LinkedIn for shipping notes, or subscribe via the RSS feed.

The four Airlock gates — and what NHS procurement is actually testing

What the Airlock actually demands of voice architecture

How the four gates map to voice AI architecture

Why the framework opens a procurement window now — and how to plan for it

The non-consensus angle: ambient voice is the wedge, not the destination

Clear the Airlock gates before the framework window closes.

Related articles

FCA AI response: voice AI financial services 2026

AI voice for SaaS customer success: scale CSMs

AI voice logistics: dispatch calls without missed loads

One email, once a month. No hype. Just what we learned shipping.