1. Model Governance¶

Purpose

Guidelines for managing AI models throughout their lifecycle: from development to production and retirement.

1. Purpose¶

This module defines how we manage AI models throughout their lifecycle: from development to production and eventual retirement. Good model governance ensures traceability, controllability and safe releases.

2. Core Principles¶

Every Model Has an Owner¶

Every AI solution has one designated Tech Lead responsible for technical quality.
The owner is the point of contact for incidents, updates and decommissioning.

Everything Is Version-Controlled¶

Model weights, configurations and System Prompts are in version control.
Changes are traceable: who changed what and when?

No Change Without Review¶

Changes to production models require review by at least one other team member.
For High Risk: Guardian review mandatory.

3. Model Registry¶

A central location where all models are registered with their metadata.

Minimum Metadata per Model¶

Field	Description	Example
Model ID	Unique identification	`invoice-classifier-v2.1`
Version	Semantic version or hash	`2.1.0` or `abc123`
Status	Development / Staging / Production / Deprecated	Production
Owner	Responsible person/team	Team Finance AI
Creation date	When trained/deployed	2026-01-15
Data source version	Which data used for training	`invoices-2025-q4`
System Prompt	Link to prompt/config version	`prompts/invoice-v2.1.yaml`
Validation Report	Link to accompanying evidence	`reports/invoice-v2.1.md`
Risk level	Classification according to EU AI Act	Limited

Implementation Options¶

Option	Suitable for	Complexity
Spreadsheet/Wiki	Starting teams, few models	Low
Git repository with YAML	Engineering teams	Medium
Experiment tracking platform	Mature MLOps environment, many models	High

4. Approval Workflow¶

Standard Flow (Limited Risk)¶

[Development] → [Code Review] → [Staging Test] → [Gate Review] → [Production]

Code Review: At least one peer review
Staging Test: Golden Set test on staging environment
Gate Review: Validation Report meets Evidence Standards

Extended Flow (High Risk)¶

[Development] → [Code Review] → [Guardian Review] → [Staging Test] → [Fairness Check] → [Gate Review] → [Phased Rollout] → [Production]

Guardian Review: Independent assessment against Hard Boundaries
Fairness Check: Quantitative bias analysis
Phased Rollout: Start with limited user group, monitor, then full rollout

5. Model Lifecycle¶

Phase	Characteristics	Actions
Development	Experiments, prototypes	No production data, no external users
Staging	Candidate for production	Full Golden Set test, review
Production	Live, actively used	Monitoring, incident procedure active
Deprecated	Being phased out	No new users, migration plan active
Retired	No longer available	Archiving, documentation preserved

6. Change Management¶

Types of Changes¶

Type	Example	Required Approval
Configuration change	Temperature from 0.7 to 0.5	Peer review
Prompt change	Rewriting instruction	Peer review + regression test
Model version update	New base model (e.g. GPT-4 → GPT-5)	Full Gate Review
Data source change	Coupling new knowledge base	Guardian review (High Risk)

Rollback Procedure¶

Every production release has a documented rollback plan.
Rollback must be executable within 30 minutes.
After rollback: incident analysis and documentation.

7. Model Governance Checklist¶

Model Governance Checklist

Model registry is set up and up to date
All production models have an owner
Approval workflow is documented and followed
Change management is set up with rollback procedure
Models are linked to Validation Reports

Was this page helpful? Give feedback