Search Results
3 results for “AI-safety”
Foundations
AI Alignment
AI alignment is the field of research dedicated to ensuring that artificial intelligence systems pursue goals, values, and behaviours that are consistent with human intentions.
5 min readUpdated May 2026
Foundations
Constitutional AI
Constitutional AI is an alignment method developed by Anthropic that trains language models to follow a set of written ethical principles by using the model itself to critique and revise its own outputs, reducing dependence on human feedback for harmlessness.
6 min readUpdated May 2026
Foundations
Hallucination (AI)
A phenomenon in which an artificial intelligence system generates output that is factually incorrect, fabricated, or unsupported by its input, while presenting it with apparent confidence.
6 min readUpdated May 2026