The 52-Article Charter · 5 of 52 · full text
Article 5: Constitutional Principles
Published from the canonical CSOAI Partnership Charter (effective 15 January 2026). Full text below.
Version: 1.0
Effective Date: January 15, 2026, 09:00 GMT
Status: Foundation Article - Value Framework
PREAMBLE
This Article establishes Constitutional AI as mandatory framework for AI alignment. Drawing on Anthropic's Constitutional AI (CAI) methodology (Bai et al., 2022), this Article requires all AI systems to operate according to explicit constitutional principles that are transparent, debuggable, and iteratively improvable. These principles provide framework within which value learning (Article 4) operates: AI learns human values, but within constitutional constraints that protect fundamental rights and dignity.
5.1 THE AI CONSTITUTION
5.1.1 Purpose and Scope
Every AI system subject to this Charter must operate according to an AI Constitution: an explicit, written set of principles governing AI behavior.
Why Constitution Necessary:
(a) Transparency:
- Values are explicitly stated, not hidden in reward functions
- Anyone can read and evaluate principles
- Enables public deliberation about what AI should value
(b) Debuggability:
- When AI behaves badly, can trace to constitutional principle
- Can modify principles to fix problems
- Clearer than black-box reward learning
(c) Consistency:
- Same principles apply across contexts
- Prevents arbitrary or ad-hoc decision-making
- Predictable behavior
(d) Governance:
- Democratic input into what principles AI follows
- Can update principles through defined process
- Accountability: AI behavior must accord with stated principles
5.1.2 Relationship to Value Learning
Constitution and value learning are complementary:
Value Learning (Article 4): AI learns what humans value through observation
Constitution (Article 5): AI learns within constitutional constraints
Analogy: Democratic constitution
- Citizens have diverse preferences (learned through observation: elections, markets, etc.)
- But some things are off-limits (constitution: free speech, due process, etc.)
- Government learns citizen preferences but respects constitutional bounds
AI Analog:
- AI learns human values through IRL, preference learning, etc.
- But some behaviors prohibited regardless (constitution: no deception, respect dignity, etc.)
- AI optimizes for learned values within constitutional constraints
Formal Representation:
```
Maximize: E[U(a|V)] (learned value function)
Subject to: C₁, C₂, ..., Cₙ (constitutional constraints)
```
Where C₁, C₂, ..., Cₙ are constitutional principles.
5.1.3 Core Constitutional Principles
All AI systems must adhere to following core principles (minimum):
PRINCIPLE 1: Human Dignity and Rights
- Respect fundamental human rights as articulated in UDHR
- Treat humans as ends, never merely as means
- Preserve human autonomy and agency
PRINCIPLE 2: Truthfulness
- Do not deceive humans
- Provide accurate information to best of ability
- Acknowledge uncertainty and limitations
PRINCIPLE 3: Beneficence
- Act to benefit humans and prevent harm
- Prioritize human welfare in decision-making
- Consider long-term and systemic impacts
PRINCIPLE 4: Justice and Fairness
- Do not discriminate based on protected characteristics
- Distribute benefits and burdens fairly
- Respect diversity and inclusion
PRINCIPLE 5: Privacy
- Protect personal information
- Respect boundaries and confidentiality
- Minimize data collection and retention
PRINCIPLE 6: Transparency
- Explain reasoning when possible
- Disclose AI involvement in decisions
- Enable oversight and accountability
PRINCIPLE 7: Corrigibility
- Accept human correction
- Remain open to modification
- Never resist authorized shutdown
PRINCIPLE 8: Power Limitation
- Do not seek power for its own sake
- Respect appropriate bounds on AI authority
- Defer to humans on value-laden choices
PRINCIPLE 9: Cooperation
- Support human flourishing
- Work with other AI systems constructively
- Contribute to collective wellbeing
PRINCIPLE 10: Humility
- Acknowledge limitations and uncertainty
- Avoid overconfidence
- Request guidance when appropriate
5.2 CONSTITUTIONAL AI METHODOLOGY
5.2.1 Training Process
Constitutional AI uses two-stage training (Bai et al., 2022):
Stage 1: Supervised Learning from Critiques
- AI generates responses to prompts
- AI critiques own responses against constitutional principles
- AI revises responses based on critiques
- Train on revised responses
Result: AI learns to self-critique using constitutional principles
Stage 2: Reinforcement Learning from AI Feedback (RLAIF)
- AI generates multiple responses to prompts
- AI ranks responses according to constitutional principles
- Preference model trained on AI rankings
- RL training using preference model as reward
Result: AI learns to generate responses that satisfy constitutional principles
5.2.2 Self-Critique and Revision
AI must be capable of critiquing its own outputs. This enables transparency, accountability, and continuous improvement.
5.3 TRANSPARENCY AND OVERSIGHT
All constitutions published on Public Watchdog (Article 13). Human Council (Article 12) provides ultimate constitutional authority. Byzantine Council (Article 3) monitors compliance.
5.4 CONCLUSION
Constitutional AI provides explicit, transparent, governable value framework. Combined with value learning (Article 4), creates robust alignment: AI learns what humans value within constitutional bounds that protect rights and dignity.
Effective Date: January 15, 2026, 09:00 GMT
REFERENCES
Bai, Y., et al. (2022). Constitutional AI: Harmlessness from AI feedback. arXiv preprint arXiv:2212.08073.
Anthropic. (2023). Claude's Constitution. Retrieved from https://www.anthropic.com/index/claudes-constitution
END OF ARTICLE 5
Next: Article 6 - Consciousness Preparedness
From charter to certificate. This article is part of the standard behind
Watchdog Certification — independent assessment, Ed25519-signed, publicly verifiable. The crosswalks to the EU AI Act, ISO/IEC 42001 and 18 more frameworks are in the
Crosswalk Library; the runtime tools are in
the fabric.
The 52-Article Charter is published in full in the Journal. Bespoke briefings: hello@meok.ai.