Constitutional AI

An alignment approach where a model is trained to follow a set of written principles (a 'constitution') rather than relying solely on human feedback for each output. The model self-critiques against these principles and revises its responses.