Skip to content

Robustness / reliability

Purpose: To ensure that an AI system operates reliably, securely, and predictably under both normal and adverse conditions, and to withstand, detect, and recover from errors, perturbations, or malicious attacks that could compromise safety.

Organisational / Technical Measure
A. Technical robustness and reliability
TECH Display confidence scores or uncertainty metrics for each AI-generated safety statement
TECH Use modular and layered design and introduce redundancy and diversity
TECH Map potential component, data, and algorithm failures
TECH Implement a fairness drift detector which triggers model retraining if imbalance occurs
TECH Use sensitivity analysis to check fairness across different input distributions
ORG Conduct ‘disagreement analysis’ sessions – if AI and humans diverge, identify bias sources
B. Human-in-the-loop
TECH Allow users to expand reasoning traces
TECH Provide comparison views between human-validated and AI-suggested reports
ORG Clearly document roles and responsibilities
ORG Schedule formal review workshops, document decisions, include structured disagreement analysis, and involve multiple stakeholders for each safety-critical deliverable
ORG Include diverse expert groups in validating tool outputs
C. Traceability
TECH Ensure datasets are fully traceable (source, preprocessing, labelling, and version control)
TECH Use data audits to detect underrepresented cases (edge scenarios, rare hazards)
ORG Establish SOPs for AI use, version control of datasets and models, validation of AI outputs, audit trails, and periodic reviews
BOTH Implement version control tools to record dataset versions, model weights, code commits, and validation reports
D. Safety culture
ORG Promote a culture of safety-first decision-making
ORG Set up regular training on AI-assisted analysis and decision-making
ORG Conduct regular validation cycles, using benchmark datasets, and expert-labelled cases
BOTH Embed safety reasoning, hazard traceability, and control feedback into every phase of AI development and deployment

Source: AIOLIA deliverable 3.1