BIITS AI Strategy — Target Operating Model

v1.3 — May 2026 All organisation sizes Governance · Strategy · Tactical · Operational

Target Operating Model

Click any component or handover to see its detail, dependencies, and what breaks if it is missing.

Layer component — click for detail

Handover point (bidirectional)

Layer risk

GovernanceWho decides & controls

Governance Layer — Active from Phase 0

Executive Sponsor Steering Committee CAIO / AI Lead Policy & Acceptable Use AI Inventory GRC Framework Compliance Alignment Audit & Review Cadence

↓

↕ Handover: Mandate & Boundaries

↑

StrategyWhat & why

Strategy Layer — Established Phase 0, revisited every gate

Business Objective Mapping AI Wave Placement Competitive & Reg. Context Use Case Portfolio Build / Buy / Partner Multi-Year Roadmap Investment Model Strategic KPIs

↓

↕ Handover: Prioritised Roadmap & Budget

↑

TacticalHow we execute

Tactical Layer — Phase execution & delivery

Phase Execution Plans Pilot Design & Delivery Data Readiness Programme Skills Gap Assessment Vendor & Tool Selection Change Management Kill / Scale Framework AI CoE / Intake Integration Layer Governance

↓

↕ Handover: Deployed System & Run Criteria

↑

OperationalHow it runs daily

Operational Layer — Standing processes, never switches off

MLOps & Observability Audit Trails Incident Response Bias & Explainability Access Controls Human-in-the-Loop KPI Tracking & Optimisation Vendor Governance Model Lifecycle Mgmt

⚠ Governance Risk

No AI Inventory = can't audit. No review cadence = governance only at phase gates.

⚠ Strategy Risk

No Build/Buy/Partner = ad hoc vendor decisions. No competitive scan = roadmap in a vacuum.

⚠ Tactical Risk

70% of AI failures start here — data not ready, change mgmt skipped, integration ungoverned.

⚠ Operational Risk

Models drift silently. No audit trail = no compliance defence. <1% of orgs fully operationalised responsible AI.

Governance Layer

The Governance Layer is active from Phase 0 onwards. It is not a phase deliverable — it is a standing structure that runs in parallel with all execution phases. Without it, every layer below drifts.

Single named owner of the AI mandate at board or executive level (CEO / CIO / CDO). Must be appointed before Phase 0 begins — not a committee role.

Responsibilities:
- Holds budget authority and strategic investment decisions
- Resolves cross-functional conflicts that the Steering Committee cannot
- Signs off on phase gate go/no-go decisions
- Named in all AI programme documentation

By tier: Small = Owner/CEO. Medium = CIO or COO. Enterprise = CIO / CDO.

AI Steering Committee

Formal governance body with cross-functional representation. Operates on its own calendar — not triggered only by phase gates.

Members: Legal, DPO, CISO, Business Process Owners, IT Lead, CAIO (chair).

Cadence: Monthly (Enterprise), Quarterly (Medium). Not required for Small — BIITS acts as governance support.

Responsibilities:
- Intake and prioritisation of new AI use case requests
- Phase gate go/no-go decisions (full compliance review before proceeding)
- Monthly review of KPI dashboard and incident log
- Annual governance audit scheduling and oversight

CAIO / Client-Side AI Lead

The CAIO / AI Lead is responsible for:
- Chairing the Steering Committee
- Maintaining the AI Inventory and Risk Register
- Translating governance constraints into strategic and tactical boundaries
- Day-to-day coordination between Legal, CISO, and delivery teams
- Escalation path for all governance questions

By tier: Small = not required (BIITS provides governance support). Medium = named IT Manager or senior business lead. Enterprise = dedicated CAIO or Head of AI.

Policy & Acceptable Use

Minimum policy contents:
- Data classification rules (what data can be used in which AI systems)
- Approved AI use cases (explicit list — everything not on it requires steering committee approval)
- Prohibited AI use cases (explicit list — no exceptions without executive sponsor sign-off)
- Vendor AI tool approval process (which third-party tools are permitted; how new ones are evaluated)
- Human override requirements (which decisions must have human sign-off regardless of AI confidence)
- Employee obligations (usage rules, disclosure requirements, confidentiality)

Owner: CAIO with Legal and DPO sign-off.
Review cadence: At each phase gate + annually. Updated within 30 days of any regulatory change.

AI Inventory

A living catalogue of every AI system in use — internal builds, vendor tools, embedded AI (Copilot, ChatGPT, etc.). Cannot audit what you haven't catalogued.

Minimum fields per entry:

Field	Description
System name	What it is called internally
Owner	Named individual accountable for this system
Purpose	What it does and which process it supports
Data inputs	What data it accesses and at what classification level
EU AI Act tier	Minimal / Limited / High / Unacceptable risk
Vendor / Build	Who provides or built it
Last review date	When it was last assessed
Status	Active / Pilot / Retired

Established: Phase 0 (initial catalogue). Updated: at every phase gate and when new tools are introduced.
Owner: CAIO. Reviewed by: Steering Committee quarterly.

GRC Framework — Baseline

The GRC Framework operates at two levels:

Level 1 — Baseline (active from Phase 0):
- Risk appetite statement (documented, executive-signed)
- EU AI Act risk classification applied to all use cases at scoring time
- GDPR data flows mapped for every AI system touching personal data
- Sector-specific rules applied (CMMC, HIPAA, SOX where relevant)
- Vendor AI tool compliance declaration (DPA, data residency, audit rights)

Level 2 — Advanced controls (activated at Phase 3, Agentic AI):
- Bias monitoring (detect and document bias in model outputs; cadence defined per use case)
- Model explainability (SHAP, LIME, or counterfactual explanations for regulated decisions)
- Drift detection (statistical monitoring; alert thresholds set at deployment)
- Audit trails (every agent action logged: timestamp, actor, input, output, decision rationale)
- Access controls (role-based permissions; no agent has broader data access than required)
- Incident response plan (severity tiers, escalation paths, post-incident review)

Important: Phase 3 activates Level 2 controls on top of the Level 1 baseline already in place — not instead of it. Both levels run in parallel from Phase 3 onwards.

Compliance Alignment

Standing alignment with applicable regulatory frameworks. Reviewed at each phase gate.

Framework	Scope	Trigger
EU AI Act	All AI systems — risk classification mandatory	Phase 0 onwards
GDPR	Any AI system processing personal data	Phase 0 onwards
CMMC 2.0	Defence sector clients; DoD supply chain	If applicable
SOC 2	SaaS or data processing clients	If applicable
HIPAA	Healthcare sector	If applicable
ISO 42001	AI management system standard	Enterprise; Phase 3+

Audit & Review Cadence

A governance-level review calendar that runs independently of delivery phases. Phase gates are tactical milestones; this is the governance heartbeat.

Review type	Frequency	Owner	Input
Steering Committee KPI review	Monthly (Ent) / Quarterly (Med)	CAIO	KPI dashboard + incident log
Phase gate compliance check	Per phase gate	CAIO + Legal	Phase completion report + AI Inventory
AI Inventory review	Quarterly	CAIO	All active AI systems
Policy review	Annual + post-regulatory change	CAIO + Legal + DPO	Policy document + incident history
Independent governance audit	Annual	External auditor	Full governance framework
Vendor compliance review	Quarterly	CAIO + IT	Vendor register + SLA performance

Strategy Layer

The Strategy Layer sets direction and connects AI investment to business outcomes. It is established at Phase 0 and revisited at every phase gate. Without a clear strategy layer, tactical execution becomes a list of disconnected experiments.

Business Objective Mapping

Every AI initiative must connect to a measurable P&L outcome before it enters the use case queue. This is the primary filter — not technology interest, not executive enthusiasm.

Mapping format per use case:
- AI initiative → business process it changes → KPI it moves → estimated financial impact
- Acceptable outcome categories: cost reduction, revenue growth, churn reduction, cycle time improvement, risk reduction, customer experience improvement
- Initiatives with no mappable outcome are not scored — they are returned to the business owner for reframing

Done at: Phase 0. Revisited at: each phase gate.

AI Wave Placement

Clients are placed on the AI evolution track at Phase 0. This determines the roadmap entry point and which phases are in scope.

Wave	Technology	Status (May 2026)
Traditional AI	Predictive analytics, ML models, classification, RPA, computer vision, NLP	Established
Generative AI	LLMs, multimodal generation, RAG, text/image/audio/code synthesis	Mainstream adoption
Agentic AI	Autonomous agents, multi-agent orchestration, CoT reasoning, tool use	Scaling — governance is the constraint
Physical AI	Robots, digital twins, IoT, autonomous vehicles	Emerging — manufacturing/logistics/defence
Sovereign AI	National/org AI independence, data residency, local infrastructure	Board-level priority

Wave placement is not static. Re-assess at each phase gate — clients can accelerate or pause wave progression based on data readiness and governance maturity. The most common failure is skipping to Agentic AI without completing the GenAI foundation.

Competitive & Regulatory Context

Strategy built without external visibility is strategy built in a vacuum. Two scans run in parallel at Phase 0 and refresh annually.

Regulatory scan: Maps applicable frameworks per use case — EU AI Act risk tier, GDPR exposure, sector rules (CMMC, HIPAA, SOX). Output: a constraint map that filters the use case shortlist before scoring begins. No use case enters the portfolio without a compliance ruling.

Competitive scan (added v1.3): Which competitors are deploying AI, in which functions, and at what scale? Sources: public announcements, earnings calls, job postings, vendor case studies. Output: 1-page competitive AI positioning summary per client. This surfaces urgency that internal stakeholders often underestimate — and identifies differentiation opportunities that competitors have not yet claimed.

Refreshed: Annually as a standing deliverable. Triggered immediately on a significant competitor move or regulatory change.

Use Case Portfolio

The Use Case Portfolio is the prioritised shortlist of AI initiatives approved for execution. It is not a wishlist — it is a scored, ranked, gate-controlled queue.

Scoring model (1–5 each dimension):

Dimension	What it measures	Quick-kill threshold
Business value	Financial impact mapped to a KPI	—
Data readiness	Data available, accessible, quality-validated	≤2 → deprioritise
Feasibility	Technical complexity + capability availability	—
Time-to-value	How quickly pilot results are measurable	—
Risk	Regulatory exposure + failure consequence	≤2 → deprioritise

T-shirt sizing by effort: S (1–4 weeks pilot), M (4–8 weeks), L (8–16 weeks), XL (programme-level, multi-phase).

Output: Ranked shortlist of 3–5 use cases per phase, each with Build/Buy/Partner decision, data requirements, and pilot KPI targets.

Owner: CAIO + your BIITS consultant. Approved by: Steering Committee. Reviewed at: every phase gate.

Build / Buy / Partner

Required per capability before it enters the use case execution queue. Ad hoc vendor selection without this framework is the most common source of lock-in risk.

Decision	Criteria	Lock-in risk
Build	Capability is proprietary; data is a competitive asset; long-term internal maintenance capacity exists	Low (internal control)
Buy	Commodity capability; vendor ecosystem mature; time-to-value more important than differentiation	Evaluate: exit terms, data portability, pricing model
Partner	Speed is the primary constraint; capability is adjacent to core; internal expertise gap	Medium — DPA and IP terms must be defined upfront

Sign-off required: CAIO + Executive Sponsor. Revisited: at every phase gate. Documented in: AI Inventory with rationale.

Multi-Year Roadmap

The roadmap is a 5-phase gate sequence — not a Gantt chart. Each phase has explicit go/no-go criteria that must be met before the next phase begins. Timing varies by company size tier.

Phase	Focus	Small	Medium	Enterprise
Phase 0 — Assess	Diagnostic, wave placement, use case scoring, governance baseline	2–4 wks	4–8 wks	6–12 wks
Phase 1 — Foundation	Data readiness, infrastructure, skills gap, policy, AI Inventory	4–8 wks	8–16 wks	12–24 wks
Phase 2 — GenAI Value	First production pilots, RAG, automation, measurable ROI	4–8 wks	8–20 wks	12–28 wks
Phase 3 — Agentic AI	Multi-step agents, complex automation, GRC Level 2, CoE formalised	Optional	12–24 wks	16–32 wks
Phase 4 — Scale & Optimise	Portfolio expansion, continuous optimisation, AI-native operations	Ongoing	Ongoing	Ongoing

Gate criteria — Phase 0 → Phase 1: Executive Sponsor named; minimum 3 use cases scored; GRC Level 1 baseline documented; data readiness assessment complete for top use case.

Gate criteria — Phase 2 → Phase 3: Phase 2 pilot ROI confirmed; adoption KPI above baseline; AI Inventory current; Steering Committee approved Phase 3 scope; data readiness score ≥16/24 for agentic use case.

How You Invest — Phase by Phase

AI investment is scoped from your Phase 0 diagnostic — not from a fixed price list. Every budget is allocated per phase with ROI targets agreed per use case before a single pilot begins. Three starting points, depending on where you are today.

Starting point	Profile	What's in scope	Typical duration
AI Kickstart	Smaller organisations — no formal AI programme yet, ready to move	Diagnostic + foundations + 3 quick-win GenAI pilots in production	4–6 weeks
AI Strategy Sprint	Mid-size organisations — pilots running but uncoordinated, need a framework	Phases 0–2 completed + Phase 3 roadmap + governance standing up	12–16 weeks
AI Transformation Programme	Larger organisations — strategic AI commitment, need a full operating model	Full Phase 0–4 + CoE + MLOps + continuous strategic advisory	6–18 months

The right starting point is determined at Phase 0. BIITS will not recommend a scope before the diagnostic is complete.

Strategic KPIs

Strategic KPIs are agreed at executive level at the end of Phase 0 — before any delivery begins. They belong to the Steering Committee scorecard, not the delivery team's sprint board.

Required format per KPI:

Field	Description
Outcome	What business result this measures
Baseline	Current state — measured, not estimated
12-month target	Minimum acceptable improvement
24-month target	Full-programme ambition
Owner	Named individual accountable for delivery
Data source	Where the measurement comes from and how often it updates

KPI categories: Efficiency (cost reduction, cycle time), Quality (error rate, accuracy), Revenue (new revenue, churn reduction), Risk (incidents avoided, compliance findings), Adoption (active users, workflow penetration).

Rule: If a KPI cannot be measured today (no baseline data source exists), it cannot be a strategic KPI. Fix the data source first — then set the target.

Tactical Layer

The Tactical Layer is where strategy becomes execution. It runs phase by phase — each activity has a named owner, a start dependency, and a gate-linked completion criterion. This is also where 70% of AI programme failures originate: data not ready, change management skipped, integration ungoverned.

Phase Execution Plans

Each phase has a documented execution plan before work begins. A plan without named owners and dependency maps is a timeline — not a plan.

Required elements per phase plan:
- Named owner per activity (one person, not a team or role title)
- Milestone dates tied to gate criteria — not calendar quarters
- Dependency map: what must be true before each activity can start
- Risk register: top 3 risks per phase with mitigation action, owner, and early-warning signal

Gate completion evidence: Phase plans are closed with a gate completion report — not marked complete by delivery team self-assessment. Steering Committee reviews the evidence and approves the gate.

Pilot Design & Delivery

A pilot is a functional AI system running in an actual workflow — not a sandbox, not a demo, not a proof of concept with synthetic data. The only question a pilot answers is: does this produce confirmed business value at acceptable risk in real conditions?

Pilot design requirements:
- Target duration: 4–8 weeks in production workflow
- KPIs defined before launch with baselines: accuracy, cycle time, cost saving, adoption rate
- Human review checkpoints configured for any customer-facing or regulated output
- Kill criteria agreed upfront — specific condition + time window. If the condition is met, the pilot stops. No extensions without Steering Committee approval.

What makes pilots fail: KPIs defined after launch (no baseline); data quality assumed rather than validated; kill criteria absent (pilots run indefinitely); change management treated as a communications email.

Data Readiness Programme

Data readiness runs as a parallel workstream from Phase 0. It is not a blocker that halts everything else — it is a scored, actively-managed programme that determines which use cases can proceed to production and when.

6-dimension scorecard (0–4 each, total 24 points):

Dimension	What it assesses	Score 0	Score 4
Availability	Data exists and is accessible for the use case	Data not available	Automated pipeline, real-time access
Quality & Lineage	Accuracy, completeness, provenance documented	No validation process	Validated, lineage fully documented
Integration	Connected to relevant operational systems	Manual export only	API-connected, bidirectional
Governance	Data ownership and classification defined	No owner defined	Owner named, classification enforced
Accessibility	Right people have access; wrong people don't	No access controls	Role-based, audited quarterly
Security	Data protected at rest and in transit	No encryption	Encrypted, access logged

Gate rule: Score <16/24 = do not proceed to production for this use case. Remediate failing dimensions first. Score ≥16/24 = approved to proceed, with a remediation plan for remaining gaps.

Skills Gap Assessment

The Skills Gap Assessment identifies which roles are unprepared for AI-augmented work and determines whether gaps are closed by training, hiring, or partnering. Run at Phase 0 and re-run at each phase gate.

6 skill dimensions assessed per role: AI literacy, prompt engineering, data stewardship, model monitoring, AI-assisted workflow operation, change facilitation.

Scope by tier: Small = owner + 2–3 key staff. Medium = all AI-adjacent roles. Enterprise = full role families by function.

Output per role: Gap heat map (red/amber/green per dimension) + prioritised reskilling plan with named individuals and deadlines + Buy/Train/Partner decision per gap.

Common failure: Assessment covers IT roles only. The highest-impact AI adoption failures happen in operations, finance, and customer-facing roles — not in the IT team. The Skills Gap Assessment must follow the use case portfolio, not the org chart.

Vendor & Tool Selection

Triggered at Phase 1/2 — after use cases are locked and data requirements are confirmed. No vendor is selected before the Build/Buy/Partner decision is made at the Strategy Layer.

7-criterion evaluation framework:

Criterion	What to assess
Capability fit	Does it solve the confirmed use case — not a superset of hypothetical use cases
Data handling	Where does client data go; is it used for training; data residency location
EU AI Act compliance	How does the vendor classify their system; are audit rights contractually included
Lock-in risk	Data portability, API openness, exit costs, pricing model trajectory
SLA	Uptime, latency, error rate guarantees — contractual, not marketing copy
Pricing	Total cost of ownership including token consumption, storage, support tiers
Security posture	SOC 2 / ISO 27001 certified; pen-tested; incident history disclosed on request

Minimum contract requirements before signature: DPA signed with EU data residency confirmed; audit rights included; exit terms and data deletion procedures defined; SLA with financial remedy for breach.

Change Management

Change management is a parallel workstream from Phase 0 — not a communications plan added at the end of delivery. The primary failure mode for technically successful AI pilots is adoption failure driven by absent change management.

Required workstream elements:
- Role impact map: which roles change, which are augmented, which are eliminated — updated at each phase
- Reskilling plan: output of Skills Gap Assessment, with named individuals and completion dates
- Adoption KPI: defined before pilot launch. No adoption KPI = change management has already failed
- Leadership engagement plan: Executive Sponsor and middle management alignment (AI adoption fails most often at the middle management layer, not the front line)

Adoption KPI format: % of target users actively using the AI-assisted workflow at least X times per week, measured at weeks 4, 8, and 12 post-launch.

Kill / Scale Framework

Every pilot exits through one of three gates: Scale, Iterate, or Kill. These decisions are made on evidence — not on sunk cost, executive enthusiasm, or vendor pressure.

Decision	Criteria	Next action
Scale	ROI confirmed against pre-defined targets + adoption above baseline + data quality held in production	Expand to full production; update use case portfolio and investment model
Iterate	ROI unclear but adoption strong (fix measurement) — OR — adoption low but ROI strong (fix change management)	One more defined cycle with specific hypothesis and hard kill date
Kill	Data quality failed; ROI negative after 2 iterate cycles; risk exceeded policy limits	Log in AI Inventory with rationale; reallocate resource to next use case in portfolio

Important: Kill decisions are not failures — they are the framework working. A portfolio with no kills is a portfolio where kill criteria were never enforced.

AI CoE / Intake Process

The AI Centre of Excellence manages the use case intake pipeline and ensures new requests are evaluated against the portfolio before entering delivery. Its structure scales with the client.

Tier	CoE structure	Intake process
Small	BIITS supports as external CoE; dedicated internal AI lead not required at this stage	Use case requests reviewed with your BIITS consultant + Executive Sponsor
Medium	Lightweight internal CoE at Phase 2/3 boundary: AI lead + data steward + IT representative	Standard intake form → scored against portfolio → CAIO approval
Enterprise	Formal CoE with documented intake at Phase 1 completion: dedicated AI lead, data governance, security, change management	Standard intake form → portfolio scoring → Steering Committee approval

Intake form minimum fields: Use case description; business process affected; KPI targeted; data requirements; requestor and business owner named; estimated value and urgency.

Integration Layer Governance

AI output that cannot reach the systems it needs to affect is operationally worthless. Integration without governance creates the opposite risk: AI output that incorrectly updates financial or operational records.

Write-back control tiers (ascending sensitivity):
1. Advisory — output is informational only; no system write; no approval needed
2. Workflow trigger — AI initiates a workflow (creates a task, sends a notification); rule-based conditions required
3. Record update — AI writes to a system of record (CRM, ERP, HRIS); HITL review or threshold rule required
4. Financial transaction — AI initiates or approves a payment or contract; always dual sign-off, no exceptions

Pre-production integration checklist:
- API contracts defined and version-controlled before build starts
- End-to-end integration test completed in staging environment
- Failure mode test: what happens when the AI system is unavailable?
- Rollback test: can the integration be cleanly reversed without data corruption?
- Fallback process documented: named manual process for every AI-assisted workflow

Operational Layer

The Operational Layer never switches off. Once a system is in production, these processes run continuously — independent of project phases, delivery teams, or BIITS engagement status. If any of these processes are absent post-deployment, the programme has a standing compliance and operational risk exposure.

MLOps & Observability

Model performance in production degrades silently if not actively monitored. MLOps is the discipline of treating deployed AI systems with the same operational rigour as production software infrastructure.

Standing monitoring requirements:
- Model performance: tracked continuously against the pilot baseline; deviations above threshold alert the CoE owner automatically
- Drift detection: input distribution (data pattern changes) and output distribution (model behaviour changes); alert thresholds set at deployment and reviewed quarterly
- Pipeline health: uptime, latency, processing volume anomalies; real-time alerting
- CI/CD discipline: all model updates go through staging validation before production — no ad hoc pushes to production models

Monitoring configuration is a Phase 3 handover item — it cannot be added retroactively without service disruption and creates a compliance gap in audit trails if absent at deployment.

Audit Trails

Audit trails are the primary compliance defence in a regulatory enquiry. They must be configured at deployment — they cannot be added retroactively once a system is in production.

What must be logged per event: Model output; human override of a model output; access event (who accessed what system and when); configuration change to an AI system.

Parameter	Standard	Regulated environments
Retention period	Minimum 1 year	Minimum 3 years
Deletion authority	CAIO + Legal only; no self-service deletion	Legal approval required per deletion
Read access	CAIO + Legal: read/write. Business owners: read-only	Add external auditor read-only access
Retrieval SLA	Within 48 hours for any regulatory request	Within 24 hours
Annual test	Retrieval capability tested and documented annually	Tested semi-annually

Gap discovered post-deployment: P2 incident. CAIO notified within 4 hours. Remediation plan with completion date required within 5 business days.

Incident Response

Incident response for AI systems requires defined severity tiers, response SLAs, and root cause discipline. A plan document that no one practises is not incident response.

Severity	Definition	Response SLA	Escalation
P1	Customer harm; data breach; discriminatory output at scale; regulatory breach	1 hour	CAIO + Executive Sponsor immediately; Steering Committee within 24h
P2	Significant output error affecting operations; audit trail gap discovered; access control breach	4 hours	CAIO
P3	Degraded model performance; pipeline latency above threshold; non-critical bias signal	24 hours	CoE owner
P4	Minor output quality drop; non-critical monitoring alert; data quality warning	5 business days	CoE owner

Post-incident requirement (P1/P2): Root cause analysis completed within 5 business days. Written corrective action with named owner and completion date. Filed in AI Inventory and incident register. Steering Committee notified at next scheduled review.

Bias & Explainability

Bias monitoring and explainability are not Phase 3 configuration items that run once at deployment — they are standing operational processes that continue for the life of every deployed system.

Bias monitoring cadence: Weekly for high-risk or customer-facing use cases. Monthly for standard use cases.

Checks per cycle: Output distribution across demographic groups; decision rate consistency across segments; error rate disparity by group. Threshold breach triggers P2 incident automatically. Findings are logged whether the check passes or fails — a log with only passes is a log that isn't running.

Explainability for regulated decisions: Any decision affecting credit, employment, insurance, or healthcare must be explainable to the subject on request. Approved methods: SHAP, LIME, counterfactual explanations, decision tree surrogates. Named person responsible for retrieval; maximum 48-hour response SLA.

Annual explainability audit: Verify that the explanation method still accurately reflects model behaviour — models can drift away from their explanations over time as data patterns shift.

Access Controls

AI systems introduce new access risk vectors: agents with broad tool access, LLMs that can be prompted to reveal data, and shadow AI tools that bypass approved channels entirely.

Standing access control requirements:
- Quarterly access review: who has access to which AI systems and underlying data, and is that access still required
- Offboarding: AI system access revoked within 24 hours of employee departure — separate from standard IT offboarding checklist
- Monthly privilege creep scan (automated where tooling supports)
- Least privilege for agents: AI agents are granted the minimum data access required to complete their task; no persistent elevated access between tasks
- Access outside approved matrix → immediate revocation + P2 incident logged

Shadow AI mitigation: Clear policy + enforcement mechanism + approved alternatives that are genuinely better than the shadow alternatives. Prohibition without a better approved option accelerates shadow AI adoption, not compliance.

Human-in-the-Loop (HITL)

Human-in-the-loop is a governance requirement in high-stakes domains — not a fallback for AI systems that are underperforming.

Mandatory HITL domains — no exceptions without Executive Sponsor sign-off:
- Financial decisions (payments, contracts, credit decisions)
- HR decisions (hiring, performance assessment, termination)
- Legal decisions (claims, liability determinations, regulatory filings)
- Security decisions (access grants, incident response actions)

HITL operational requirements:
- Named reviewer per use case — not a team, a named individual
- Maximum review SLA defined before deployment (e.g., 4-hour SLA for contract approval workflow)
- Fallback reviewer named: the process does not stall if the primary reviewer is unavailable
- Every human override logged in audit trail with timestamp and override rationale

HITL design test: If the human reviewer approves everything the AI produces without genuinely reading it, HITL is theatre — not governance. Fix the review design, reduce the review volume, or acknowledge the risk explicitly and document the decision.

KPI Tracking & Optimisation Loop

The KPI loop connects operational performance to strategic targets. Without it, the Steering Committee receives anecdotal updates. With it, the Executive Sponsor has a data-driven scorecard that drives decisions.

Cadence	Metrics tracked	Audience
Weekly	Adoption rate, error/hallucination count, pipeline uptime, data quality score	CoE / delivery team
Monthly	ROI progress vs. target, cost per outcome unit, model performance vs. baseline, bias check results	CAIO + Business process owners
Per gate	Full ROI assessment, strategic KPI progress update, portfolio review	Executive Sponsor + Steering Committee

Optimisation loop: Monitor → Analyse (root cause) → Remediate (tactical or operational fix) → Govern (update policy or AI Inventory if needed) → Refresh (update baselines post-remediation).

Escalation rule: Any metric below threshold for 3 consecutive reporting periods escalates to the next audience tier automatically. No silent degradation is permissible.

Vendor Governance

Vendor governance is active from the first vendor tool deployed — not from Phase 4. Vendors delivering AI capabilities carry compliance, performance, and concentration risk that must be actively managed, not periodically checked.

Quarterly review per vendor:
- SLA compliance: uptime, latency, error rates vs. contracted terms
- Capability roadmap: are planned features still on track? Any EOL notices?
- Cost vs. value: actual token/API consumption vs. budget; pricing model changes in pipeline
- Data handling compliance: DPA still current, data residency confirmed
- Security posture: any disclosed incidents or certification status changes

Annual review (full re-assessment): Full vendor risk re-assessment + renewal/exit decision + competitive market check — are better alternatives available at equivalent or lower risk?

Vendor register fields in AI Inventory: Contract expiry date, DPA status and expiry, last review date, exit complexity rating (Low / Medium / High). Concentration risk flag required if >60% of AI workload runs on a single vendor.

Model Lifecycle Management

Deployed models are not static — they age, drift, and eventually need to be retired or replaced. Without active lifecycle management, deprecated models stay in production long after their performance has degraded or their data sources have changed.

Version record required per deployed model:

Field	Description
Model name & version	Internal identifier + vendor version where applicable
Deployment date	Date entered production
Performance baseline	KPI values at deployment — the reference point for all subsequent drift measurement
Owner	Named individual accountable for this model's operational health
Next review date	Scheduled quarterly by default; monthly for high-risk use cases

Retirement triggers — any one of the following:
- Performance below defined threshold for 4 consecutive weeks
- Validated replacement available with confirmed improvement over incumbent
- Data source changed materially (distribution shift confirmed by monitoring)
- Vendor end-of-life notice received
- Business process the model supported has been discontinued

Transition process: Parallel run — replacement model runs alongside the retiring model for a minimum of 2 weeks before cutover. Rollback: previous model version retained in staging for 30 days post-cutover; 4-hour rollback execution SLA. Cutover and retirement logged in AI Inventory with rationale and owner sign-off.

AI Architecture & Technology v1.4 — In development

Planned content: LLM selection framework, RAG pipeline design, vector database strategy, AI agent architecture, automation layer patterns, scalability design and IntegrationHub/API governance. This section will cover the technical architecture decisions a CIO must lock in before scaling AI.

Data & Platform Foundation v1.4 — In development

Planned content: Data quality framework, data governance controls, data ownership model, cloud strategy for AI workloads, legacy system integration patterns and database selection (HTAP, vector, operational). Prerequisite reading for any organisation before starting Phase 1.

Economics & ROI v1.4 — In development

Planned content: Token consumption modelling, cost forecasting by phase, Opex vs Capex structuring for AI spend, licensing model comparison, productivity gain measurement, deflection metrics and value realisation timelines. This is the section that unlocks the CFO conversation.

Security & Data Protection v1.4 — In development

Planned content: Data privacy controls, PII handling within AI pipelines, model access control, prompt injection and leakage prevention, shadow AI detection, identity governance for AI systems, secure integration patterns and breach response procedures specific to AI incidents.

Organisation & Talent v1.4 — In development

Planned content: AI skills taxonomy, talent gap assessment method, training programme design, role redesign for AI-augmented work, adoption resistance management, leadership buy-in strategies, cross-functional team models and Centre of Excellence structuring.

User & Employee Experience v1.4 — In development

Planned content: Trust-in-AI design principles, transparency and explainability for end users, confidence score display, user feedback loops, workflow usability testing for AI-augmented processes, adoption tracking metrics and experience measurement frameworks.