1. Model Governance¶
Purpose
Guidelines for managing AI models throughout their lifecycle: from development to production and retirement.
1. Purpose¶
This module defines how we manage AI models throughout their lifecycle: from development to production and eventual retirement. Good model governance ensures traceability, controllability and safe releases.
2. Core Principles¶
Every Model Has an Owner¶
- Every AI solution has one designated Tech Lead responsible for technical quality.
- The owner is the point of contact for incidents, updates and decommissioning.
Everything Is Version-Controlled¶
- Model weights, configurations and System Prompts are in version control.
- Changes are traceable: who changed what and when?
No Change Without Review¶
- Changes to production models require review by at least one other team member.
- For High Risk: Guardian review mandatory.
3. Model Registry¶
A central location where all models are registered with their metadata.
Minimum Metadata per Model¶
| Field | Description | Example |
|---|---|---|
| Model ID | Unique identification | invoice-classifier-v2.1 |
| Version | Semantic version or hash | 2.1.0 or abc123 |
| Status | Development / Staging / Production / Deprecated | Production |
| Owner | Responsible person/team | Team Finance AI |
| Creation date | When trained/deployed | 2026-01-15 |
| Data source version | Which data used for training | invoices-2025-q4 |
| System Prompt | Link to prompt/config version | prompts/invoice-v2.1.yaml |
| Validation Report | Link to accompanying evidence | reports/invoice-v2.1.md |
| Risk level | Classification according to EU AI Act | Limited |
Implementation Options¶
| Option | Suitable for | Complexity |
|---|---|---|
| Spreadsheet/Wiki | Starting teams, few models | Low |
| Git repository with YAML | Engineering teams | Medium |
| Experiment tracking platform | Mature MLOps environment, many models | High |
4. Approval Workflow¶
Standard Flow (Limited Risk)¶
[Development] → [Code Review] → [Staging Test] → [Gate Review] → [Production]
- Code Review: At least one peer review
- Staging Test: Golden Set test on staging environment
- Gate Review: Validation Report meets Evidence Standards
Extended Flow (High Risk)¶
[Development] → [Code Review] → [Guardian Review] → [Staging Test] → [Fairness Check] → [Gate Review] → [Phased Rollout] → [Production]
- Guardian Review: Independent assessment against Hard Boundaries
- Fairness Check: Quantitative bias analysis
- Phased Rollout: Start with limited user group, monitor, then full rollout
5. Model Lifecycle¶
| Phase | Characteristics | Actions |
|---|---|---|
| Development | Experiments, prototypes | No production data, no external users |
| Staging | Candidate for production | Full Golden Set test, review |
| Production | Live, actively used | Monitoring, incident procedure active |
| Deprecated | Being phased out | No new users, migration plan active |
| Retired | No longer available | Archiving, documentation preserved |
6. Change Management¶
Types of Changes¶
| Type | Example | Required Approval |
|---|---|---|
| Configuration change | Temperature from 0.7 to 0.5 | Peer review |
| Prompt change | Rewriting instruction | Peer review + regression test |
| Model version update | New base model (e.g. GPT-4 → GPT-5) | Full Gate Review |
| Data source change | Coupling new knowledge base | Guardian review (High Risk) |
Rollback Procedure¶
- Every production release has a documented rollback plan.
- Rollback must be executable within 30 minutes.
- After rollback: incident analysis and documentation.
7. Model Governance Checklist¶
Model Governance Checklist
- Model registry is set up and up to date
- All production models have an owner
- Approval workflow is documented and followed
- Change management is set up with rollback procedure
- Models are linked to Validation Reports
8. Related Modules¶
Was this page helpful?
Give feedback