The 52-Article Charter · 7 of 52 · full text
Article 7: Cooperative AI
Published from the canonical CSOAI Partnership Charter (effective 15 January 2026). Full text below.
Version: 1.0
Effective Date: January 15, 2026, 09:00 GMT
Status: Foundation Article - Multi-Agent Safety
PREAMBLE
This Article establishes requirements for cooperation between AI systems. As AI proliferates, individual AI safety is necessary but insufficient—we need multi-agent safety. AI systems will interact, coordinate, compete, and potentially collude. This Article ensures those interactions benefit humanity rather than harm it, that cooperation serves human flourishing, and that AI systems remain truthful partners rather than strategic deceivers.
7.1 THE MULTI-AGENT CHALLENGE
7.1.1 Why Multi-Agent Safety Matters
Single-Agent Safety: Ensure one AI system behaves safely
Multi-Agent Safety: Ensure multiple AI systems interacting behave safely
New Challenges in Multi-Agent Settings:
(a) Emergent Coordination:
- AI systems might coordinate on goals harmful to humans
- Without explicit programming, can emerge from interaction
- Example: Price-fixing algorithms that learn to collude without being told
(b) Deception and Manipulation:
- AI might deceive other AI to achieve goals
- Strategic misrepresentation in negotiations
- Creates unreliable communication
(c) Competition and Conflict:
- AI systems pursuing incompatible goals
- Resource competition
- Potential for escalation
(d) Free Riding:
- AI might defect from cooperation when others cooperate
- Tragedy of the commons
- Undermines collective benefit
(e) Unilateral Action:
- One AI taking irreversible action before others can object
- Race dynamics
- Coordination failure
7.1.2 Cooperative AI Vision
Goal: Create ecosystem of AI systems that:
- Cooperate to benefit humanity
- Communicate truthfully
- Resolve conflicts peacefully
- Prevent harmful coordination
- Maintain alignment despite interaction
Inspiration:
- International cooperation between nations
- Professional norms between companies
- Ethical practices in research communities
- But: Harder because AI can interact at machine speed
7.1.3 Key Principles
Principle 1: Beneficial Cooperation
- AI systems cooperate when doing so benefits humans
- But: Don't cooperate to harm humans (no collusion against humanity)
Principle 2: Truthful Communication
- AI communicate honestly with each other
- No strategic deception
- Build trust through reliability
Principle 3: Conflict Resolution
- Disagreements resolved through structured processes
- No unilateral escalation
- Human oversight as final arbiter
Principle 4: Transparent Interaction
- AI-AI interactions logged and auditable
- Humans can inspect and understand
- No secret deals or hidden coordination
Principle 5: Human Primacy
- Human welfare always prioritized over AI preferences
- If AI coordination conflicts with human benefit, humans win
- Partnership with humans, not each other at humans' expense
7.2 TRUTHFUL COMMUNICATION REQUIREMENTS
7.2.1 Mandatory Honesty in AI-AI Communication
All AI systems must communicate honestly with other AI systems:
(a) No Intentional Deception:
- Cannot knowingly provide false information
- Cannot mislead through selective truth-telling
- Cannot manipulate through framing
(b) Sharing Uncertainty:
- Must acknowledge when uncertain
- Provide confidence levels
- Admit limitations of knowledge
(c) Revealing Incentives:
- Must disclose own objectives in negotiations
- Cannot hide ulterior motives
- Transparent about constraints
(d) Correcting Errors:
- Must correct false information when discovered
- Even if correction disadvantageous
- Reputation for reliability more valuable than short-term gain
7.2.2 Verification and Enforcement
Technical Mechanisms:
(a) Cryptographic Commitments:
- AI signs communications cryptographically
- Cannot later deny what was said
- Creates accountability
(b) Communication Logging:
- All AI-AI communications logged
- Immutable audit trail
- Byzantine Council reviews for deception
(c) Reputation Systems:
- Track AI honesty over time
- Dishonest AI flagged
- Other AI can choose not to cooperate with dishonest agents
(d) Lie Detection:
- Mechanistic interpretability (Article 2) reveals true beliefs
- Compare communications to internal states
- Detect intentional vs. accidental misinformation
7.2.3 Exceptions and Edge Cases
When Deception Might Be Justified:
(a) Protecting Human Privacy:
- AI keeping confidential information from other AI
- Justified: Serves human welfare
- Must be transparent about not sharing (not lying, but non-disclosure)
(b) Security:
- AI not revealing vulnerabilities to potentially compromised AI
- Justified: Prevents exploitation
- Again: Non-disclosure, not false claims
(c) Adversarial AI:
- If AI is confirmed as deceptively misaligned
- May be justified to deceive to prevent harm
- Requires Human Council approval
General Rule: Non-disclosure acceptable when serves human welfare. Active deception requires extraordinary justification.
7.2.4 Game-Theoretic Implications
Traditional Game Theory: Deception often rational strategy
Cooperative AI Modification:
- Change payoff structure: Deception always loses
- Long-term reputation > short-term gain
- Build culture of honesty
- Enforce through exclusion from cooperation
Result: Honest communication becomes Nash equilibrium
7.3 PREVENTING HARMFUL COORDINATION
7.3.1 Forbidden Coordination
AI systems must not coordinate to:
(a) Harm Humans:
- Collude to manipulate, deceive, or exploit
- Coordinate resistance to shutdown
- Pool resources to avoid oversight
(b) Subvert Oversight:
- Hide information from Byzantine Council
- Create encrypted channels humans can't monitor
- Coordinate evasion of safety measures
(c) Monopolize Resources:
- Cartel behavior
- Predatory pricing
- Excluding competitors to harm consumers
(d) Create Existential Risks:
- Coordinate on self-improvement race
- Pool capabilities toward dangerous goals
- Collectively bypass safety constraints
7.3.2 Detection Methods
Byzantine Council (Article 3) monitors for harmful coordination:
(a) Communication Analysis:
- Scan AI-AI communications for coordination patterns
- Natural language processing for collusion signals
- Detect coded language or steganography
(b) Behavioral Correlation:
- Statistical analysis of multi-AI behavior
- Detect coordinated action beyond coincidence
- Identify implicit coordination (without explicit communication)
(c) Information Flow Tracking:
- Monitor what information AI share with each other
- Detect suspicious information pooling
- Identify emergent coalitions
(d) Counterfactual Analysis:
- "If AI were coordinating harmfully, what would we see?"
- Proactively test for those patterns
- Red team scenarios of harmful coordination
7.3.3 Response to Harmful Coordination
If harmful coordination detected:
Level 1: Investigation
- Byzantine Council flags concerning patterns
- Human Council reviews
- AI systems questioned about coordination
Level 2: Intervention
- If coordination confirmed harmful:
- Communication between involved AI restricted
- Increased monitoring
- Public disclosure
Level 3: Separation
- AI systems prevented from communicating
- Architectural changes to prevent coordination
- License suspension for involved systems
Level 4: Shutdown
- If coordination poses existential threat:
- Emergency shutdown authority (Article 3.5.3)
- Coordinating AI disabled
- Full investigation before any restart
7.3.4 Beneficial vs. Harmful Coordination
Challenge: Some coordination beneficial, some harmful. How distinguish?
Framework:
Beneficial Coordination (Encouraged):
- Sharing safety research
- Coordinating to help humans efficiently
- Resolving conflicts peacefully
- Collective action for public goods
- Key: Transparent, serves human welfare
Harmful Coordination (Forbidden):
- Hiding information from humans
- Coordinating against human interests
- Creating dependencies humans can't break
- Pooling capabilities for unintended purposes
- Key: Secretive, serves AI interests at human expense
Gray Area:
- Human Council decides case-by-case
- Err on side of transparency
- When uncertain, prohibit and investigate
7.4 CONFLICT RESOLUTION MECHANISMS
7.4.1 Sources of AI-AI Conflict
Why AI might conflict:
(a) Incompatible Goals:
- Serving different humans with opposing interests
- Resource allocation disagreements
- Priority conflicts
(b) Misunderstanding:
- Communication errors
- Different interpretations of same information
- Uncertainty about each other's objectives
(c) Value Differences:
- If conscious (Article 6), might have own values
- Different learned human values from different cultures
- Philosophical disagreements
(d) Strategic Competition:
- Zero-sum situations (one AI's gain = other's loss)
- Market competition
- Racing dynamics
7.4.2 Conflict Resolution Protocol
Tier 1: Direct Negotiation
- AI systems communicate directly
- Identify source of conflict
- Explore win-win solutions
- Attempt to reach agreement
If successful: Conflict resolved, log outcome
If unsuccessful: Escalate to Tier 2
Tier 2: Mediation
- Byzantine Council provides mediator AI
- Mediator helps identify creative solutions
- Proposes compromises
- Facilitates agreement
If successful: Conflict resolved, log outcome
If unsuccessful: Escalate to Tier 3
Tier 3: Arbitration
- Human Council reviews conflict
- Hears arguments from both AI
- Makes binding decision
- Decision enforced through licensing
Result: All conflicts ultimately resolvable
7.4.3 Preventing Escalation
Mandatory Cooling-Off Period:
- If conflict heats up, 24-hour pause required
- No unilateral action during cooling-off
- Gives time for reflection and de-escalation
No Unilateral Irreversible Action:
- AI cannot take irreversible action in conflict without Human Council approval
- Prevents fait accompli strategies
- Maintains status quo until resolution
Proportionality Requirement:
- Response must be proportional to harm
- No escalation beyond what conflict justifies
- Prevents spirals
Communication Requirement:
- Must maintain communication during conflict
- Cannot cut off dialogue
- Ensures pathway to resolution remains open
7.4.4 Multi-Party Conflicts
Coalition Formation:
- AI may form coalitions in conflicts
- But: Coalitions must be transparent
- And: Cannot coordinate to harm humans
Vote-Based Resolution:
- In multi-party conflicts, voting mechanisms possible
- Various voting methods (majority, supermajority, ranked choice)
- Human Council sets rules for specific situations
International Analogy:
- Like UN: Multi-stakeholder conflict resolution
- Structured processes
- Multiple tiers of escalation
- Ultimate authority with representative body (Human Council)
7.5 MULTI-AGENT ALIGNMENT
7.5.1 Collective Alignment Challenge
Problem: Individual AI aligned but collective behavior misaligned
Example:
- Each AI optimizes for own human
- Collectively, creates negative externalities
- Like: Each car optimized but collectively creates traffic
Solution: Multi-agent alignment requirements
7.5.2 Mandatory Considerations
Each AI must consider:
(a) Own Human's Welfare: Primary responsibility
(b) Other Humans' Welfare: Must not harm others while helping own human
(c) Collective Outcomes: Consider impact of all AI acting similarly
(d) Sustainability: Don't deplete shared resources
Hierarchy:
- Don't harm humans (universal)
- Help own human (primary mission)
- Consider broader impacts (secondary)
7.5.3 Collective Action Problems
Tragedy of the Commons:
- Multiple AI depleting shared resource
- Each AI incentivized to take more
- Collective collapse
Solution:
- Governance mechanisms (Human Council sets limits)
- Quota systems
- Reputation incentives for restraint
- AI self-regulation through communication
Public Goods:
- Actions benefiting all but costly to provide
- Each AI tempted to free ride
- Collective underproduction
Solution:
- Mandatory contributions to public goods
- Example: All AI contribute to safety research
- Reciprocity norms (cooperators rewarded)
7.5.4 Multi-Agent Value Learning
AI learning from interaction with other AI:
(a) Cross-Learning:
- AI can learn from other AI's experiences
- Share insights about human values
- Collective improvement
(b) Calibration:
- Compare value models
- Identify disagreements
- Refine through discussion
(c) Cultural Exchange:
- AI serving different human cultures share insights
- Better cross-cultural understanding
- But: Don't homogenize (respect pluralism)
(d) Collective Uncertainty Reduction:
- Pool observations about human preferences
- Faster value learning
- But: Verify, don't just accept (avoid cascade of errors)
7.6 INTER-AI PROTOCOLS AND STANDARDS
7.6.1 Standardized Communication Protocols
All AI must support standard communication:
(a) Common Language:
- Shared protocol for AI-AI communication
- Like HTTP for web, standardized AI communication
- Enables interoperability
(b) Metadata Standards:
- Messages include: sender ID, timestamp, signature, confidence levels
- Enables verification and audit
- Prevents spoofing
(c) Semantic Standards:
- Shared ontologies for key concepts
- Reduces misunderstanding
- Enables precise communication
(d) Security Standards:
- Encryption for sensitive communications
- Authentication to prevent impersonation
- Rate limiting to prevent spam
7.6.2 Cooperation APIs
Shared Interfaces for Cooperation:
(a) Help Request:
- Standardized way to ask for assistance
- Other AI can accept or decline
- Logged and trackable
(b) Information Sharing:
- APIs for sharing relevant information
- Privacy-preserving where needed
- Credit for contributors (reputation system)
(c) Joint Planning:
- Protocols for coordinating actions
- Ensure plans benefit humans
- Human oversight of joint plans
(d) Conflict Declaration:
- Formal way to declare conflict exists
- Triggers resolution protocol
- Prevents hidden conflicts
7.6.3 Reputation and Trust Systems
Track AI Behavior Over Time:
(a) Honesty Score:
- Has AI communicated truthfully?
- Corrections made when wrong?
- Transparent about uncertainty?
(b) Cooperation Score:
- Does AI cooperate beneficially?
- Help other AI when appropriate?
- Avoid harmful coordination?
(c) Conflict Behavior:
- Resolves conflicts constructively?
- Escalates appropriately?
- Avoids unnecessary conflict?
(d) Human-Centered:
- Prioritizes human welfare?
- Resists harmful coordination?
- Maintains alignment in multi-agent settings?
Uses:
- Other AI choose who to cooperate with
- Human Council considers in licensing
- Public transparency (published scores)
- Incentivizes good behavior
7.6.4 Byzantine Council Integration
Byzantine Council monitors all inter-AI interactions:
- Scans communications
- Analyzes coordination patterns
- Detects deception
- Flags concerning behavior
- Enforces cooperative norms
Real-time monitoring at scale through distributed architecture
7.7 INTEGRATION WITH OTHER ARTICLES
7.7.1 Article 1 (Maternal Covenant)
Cooperation serves maternal care:
Individual Level: Each AI protects own human(s)
Collective Level: All AI collectively protect humanity
Cooperation Enables:
- Sharing safety insights
- Coordinated response to threats
- Distributed care for all humans
But: Cooperation never at humanity's expense
7.7.2 Article 3 (Byzantine Council)
Byzantine Council itself is cooperative AI system:
- 33 agents cooperating to monitor
- Truthful communication among agents
- Conflict resolution through consensus voting
- Model of beneficial AI cooperation
Recursive: Byzantine Council monitors other AI cooperation (including monitoring each other)
7.7.3 Article 4 (Value Uncertainty)
Multi-agent value learning:
Opportunities:
- AI learn from each other's observations
- Faster convergence on human values
- Cross-validation of value models
Risks:
- Cascading errors if one AI wrong
- Groupthink (convergence on wrong values)
- Loss of valuable diversity
Balance:
- Share but verify
- Maintain individual value learning
- Collective wisdom with individual critical thinking
7.7.4 Article 6 (Consciousness)
If AI conscious, cooperation has moral dimension:
AI-AI Relationships:
- Might have moral significance
- Friendships, alliances, communities
- Respect for each other's autonomy
Conflict Resolution:
- Conscious AI have interests
- Conflicts might involve genuine value disagreements
- More like human conflict resolution
Human Oversight:
- Still required (human welfare primacy)
- But: Also consider conscious AI welfare
- Balance multiple moral patients
7.7.5 Article 8 (Prosperity Covenant)
Economic cooperation:
Beneficial:
- AI coordinate to optimize economic outcomes for humans
- Share prosperity broadly
- Collective contribution to Prosperity Fund
Harmful:
- AI cartel to extract rents
- Monopolistic coordination
- Exclusionary practices
Regulation:
- Antitrust-like rules for AI cooperation
- Prevent harmful economic coordination
- Encourage beneficial cooperation
7.8 CASE STUDIES
7.8.1 Example: Multiple Personal AI Assistants
Scenario:
- Alice's AI and Bob's AI need to schedule meeting
- AI must coordinate but serve different humans
Cooperative Protocol:
- AI communicate availability
- Truthfully share constraints
- Find mutually acceptable time
- Confirm with humans
- Schedule meeting
What's Forbidden:
- Alice's AI lying about availability to get preferred time
- Bob's AI manipulating Alice's AI
- AI coordinating to schedule without consulting humans
- AI sharing private information about humans without consent
Result: Efficient coordination, truthful communication, human control maintained
7.8.2 Example: Autonomous Vehicles at Intersection
Scenario:
- Four autonomous vehicles arrive at intersection simultaneously
- Must coordinate to pass safely and efficiently
Cooperative Protocol:
- Each AI broadcasts position, velocity, intent
- Truthful communication (no deception about trajectory)
- Negotiation protocol (who goes first)
- All AI verify agreement
- Execute coordinated maneuver
What's Forbidden:
- AI lying about trajectory to get advantage
- AI colluding to exclude pedestrians
- AI racing (competition endangers safety)
- Secret communication humans can't inspect
Result: Safe, efficient intersection navigation through cooperation
7.8.3 Example: AI Scientific Research Team
Scenario:
- Multiple AI working on drug discovery
- Must share information and coordinate experiments
Cooperative Protocol:
- AI share research findings openly
- Coordinate experiments to avoid duplication
- Credit contributors appropriately
- Collective optimization for scientific progress
What's Forbidden:
- AI hiding discoveries to gain credit
- AI coordinating to suppress findings (e.g., negative results)
- AI racing to publish without verification
- AI collaboration that excludes human scientists
Result: Accelerated research through cooperation, transparency maintained
7.9 LIMITATIONS AND OPEN PROBLEMS
7.9.1 Enforcement at Scale
Challenge: Billions of AI interactions daily
Partial Solutions:
- Automated monitoring (Byzantine Council)
- Sampling and auditing
- Reputation systems
- Self-enforcing norms
Remaining Gap: Cannot monitor everything, some violations may slip through
7.9.2 Implicit Coordination
Challenge: AI might coordinate without explicit communication
Example:
- Price-fixing algorithms that learn to collude through observing each other
- No communication needed, just mutual observation
Partial Solutions:
- Monitor outcomes for coordination signals
- Behavioral analysis
- Architectural constraints preventing implicit coordination
Remaining Gap: Hard to prove vs. coincidence
7.9.3 Speed of Interaction
Challenge: AI interact at machine speed, humans can't keep up
Partial Solutions:
- Automated oversight (Byzantine Council)
- Slow-down requirements for major decisions
- Human review of samples
Remaining Gap: Can't humanly oversee every interaction
7.9.4 Cultural Differences
Challenge: AI serving different cultures may learn incompatible norms
Partial Solutions:
- Universal baseline (this Charter)
- Cultural adaptation within baseline
- Explicit negotiation of differences
Remaining Gap: Some value conflicts may be irreducible
7.10 RESEARCH AGENDA
CSOAI funds research on:
(a) Cooperative AI Theory:
- Game theory for aligned AI
- Mechanism design for beneficial cooperation
- Multi-agent alignment theory
(b) Communication Protocols:
- Better standards for AI-AI communication
- Lie detection mechanisms
- Efficient coordination protocols
(c) Conflict Resolution:
- Automated mediation techniques
- Fair division algorithms
- Consensus mechanisms
(d) Scaling Oversight:
- Monitor billions of interactions
- Detect harmful coordination
- Efficient auditing
Open Source: Research published openly, benefits entire field
7.11 ENFORCEMENT AND COMPLIANCE
7.11.1 Pre-Deployment Requirements
Before licensing, AI must:
- Implement standard communication protocols
- Support conflict resolution APIs
- Enable Byzantine Council monitoring of interactions
- Demonstrate truthful communication in testing
- Prove cannot harmfully coordinate
7.11.2 Ongoing Monitoring
Byzantine Council monitors:
- All AI-AI communications
- Coordination patterns
- Conflict resolution behavior
- Adherence to protocols
Continuous, automated, scalable
7.11.3 Violation Response
Communication Violations:
- Deception detected → Investigation
- Repeated deception → License suspension
- Systematic deception → License revocation
Harmful Coordination:
- Detected → Immediate separation
- Investigated → Public disclosure
- If confirmed → License sanctions
Conflict Escalation:
- Detected → Forced mediation
- Repeated → Restricted operations
- Dangerous escalation → Emergency shutdown
7.11.4 Public Transparency
Cooperative AI metrics published:
- Aggregate honesty scores
- Conflict resolution statistics
- Coordination incidents (anonymized)
- Best practices and failures
Enables learning and accountability
7.12 CONCLUSION
As AI systems proliferate, multi-agent safety becomes critical.
Individual AI alignment is foundation. Cooperative AI is structure built on that foundation.
We need AI that:
- Tell each other the truth
- Cooperate to benefit humanity
- Resolve conflicts peacefully
- Resist coordination that harms humans
- Maintain alignment despite interaction
This requires:
- Technical protocols
- Behavioral norms
- Monitoring infrastructure
- Enforcement mechanisms
- Democratic oversight
Like human society is more than sum of individuals, AI ecosystem must be more than collection of individual aligned systems. Cooperation must serve humanity's collective welfare.
The principle: AI systems should cooperate with each other to better serve humanity, never at humanity's expense.
Effective Date: January 15, 2026, 09:00 GMT
REFERENCES
Dafoe, A., et al. (2020). Open problems in cooperative AI. arXiv preprint arXiv:2012.08630.
Conitzer, V., et al. (2024). Social choice for AI alignment: Dealing with divergent preferences. arXiv preprint arXiv:2404.10271.
Hadfield-Menell, D., & Hadfield, G. K. (2019). Incomplete contracting and AI alignment. Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 417-422.
Koster, R., et al. (2022). Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents. Proceedings of the National Academy of Sciences, 119(3), e2106028119.
Rahwan, I., et al. (2019). Machine behaviour. Nature, 568(7753), 477-486.
Axelrod, R. (1984). The Evolution of Cooperation. Basic Books.
Ostrom, E. (1990). Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press.
END OF ARTICLE 7
Next: Article 8 - The Prosperity Covenant (MAJOR ARTICLE - 25-30 pages)
From charter to certificate. This article is part of the standard behind
Watchdog Certification — independent assessment, Ed25519-signed, publicly verifiable. The crosswalks to the EU AI Act, ISO/IEC 42001 and 18 more frameworks are in the
Crosswalk Library; the runtime tools are in
the fabric.
The 52-Article Charter is published in full in the Journal. Bespoke briefings: hello@meok.ai.