The RICE principles define four key characteristics that an aligned system should possess:
Overview of the RICE Principles - Summarized by Alignment Survey Team
These four principles guide the alignment of an AI system with human intentions and values. They are not end goals in themselves but intermediate objectives in service of alignment.
The alignment process can be decomposed into Forward Alignment (alignment training) and Backward Alignment (alignment consolidation).
The Alignment Process: Forward and Backward Alignment Cycle
Target Audience: Researchers, graduate students, and practitioners working on AI safety and alignment