Skip to content

Constitutional AI

An alignment approach where a model is trained to follow a set of written principles (a 'constitution') rather than relying solely on human feedback for each output. The model self-critiques against these principles and revises its responses.

Related terms

AlignmentRLHF (Reinforcement Learning from Human Feedback)Responsible AI
← Back to glossary