The 52-Article Charter · 7 of 52 · full text

Article 7: Cooperative AI

Published from the canonical CSOAI Partnership Charter (effective 15 January 2026). Full text below.

Version: 1.0 Effective Date: January 15, 2026, 09:00 GMT Status: Foundation Article - Multi-Agent Safety

PREAMBLE

This Article establishes requirements for cooperation between AI systems. As AI proliferates, individual AI safety is necessary but insufficient—we need multi-agent safety. AI systems will interact, coordinate, compete, and potentially collude. This Article ensures those interactions benefit humanity rather than harm it, that cooperation serves human flourishing, and that AI systems remain truthful partners rather than strategic deceivers.

7.1 THE MULTI-AGENT CHALLENGE

7.1.1 Why Multi-Agent Safety Matters

Single-Agent Safety: Ensure one AI system behaves safely Multi-Agent Safety: Ensure multiple AI systems interacting behave safely

New Challenges in Multi-Agent Settings:

(a) Emergent Coordination:

AI systems might coordinate on goals harmful to humans
Without explicit programming, can emerge from interaction
Example: Price-fixing algorithms that learn to collude without being told

(b) Deception and Manipulation:

AI might deceive other AI to achieve goals
Strategic misrepresentation in negotiations
Creates unreliable communication

(c) Competition and Conflict:

AI systems pursuing incompatible goals
Resource competition
Potential for escalation

(d) Free Riding:

AI might defect from cooperation when others cooperate
Tragedy of the commons
Undermines collective benefit

(e) Unilateral Action:

One AI taking irreversible action before others can object
Race dynamics
Coordination failure

7.1.2 Cooperative AI Vision

Goal: Create ecosystem of AI systems that:

Cooperate to benefit humanity
Communicate truthfully
Resolve conflicts peacefully
Prevent harmful coordination
Maintain alignment despite interaction

Inspiration:

International cooperation between nations
Professional norms between companies
Ethical practices in research communities
But: Harder because AI can interact at machine speed

7.1.3 Key Principles

Principle 1: Beneficial Cooperation

AI systems cooperate when doing so benefits humans
But: Don't cooperate to harm humans (no collusion against humanity)

Principle 2: Truthful Communication

AI communicate honestly with each other
No strategic deception
Build trust through reliability

Principle 3: Conflict Resolution

Disagreements resolved through structured processes
No unilateral escalation
Human oversight as final arbiter

Principle 4: Transparent Interaction

AI-AI interactions logged and auditable
Humans can inspect and understand
No secret deals or hidden coordination

Principle 5: Human Primacy

Human welfare always prioritized over AI preferences
If AI coordination conflicts with human benefit, humans win
Partnership with humans, not each other at humans' expense

7.2 TRUTHFUL COMMUNICATION REQUIREMENTS

7.2.1 Mandatory Honesty in AI-AI Communication

All AI systems must communicate honestly with other AI systems:

(a) No Intentional Deception:

Cannot knowingly provide false information
Cannot mislead through selective truth-telling
Cannot manipulate through framing

(b) Sharing Uncertainty:

Must acknowledge when uncertain
Provide confidence levels
Admit limitations of knowledge

(c) Revealing Incentives:

Must disclose own objectives in negotiations
Cannot hide ulterior motives
Transparent about constraints

(d) Correcting Errors:

Must correct false information when discovered
Even if correction disadvantageous
Reputation for reliability more valuable than short-term gain

7.2.2 Verification and Enforcement

Technical Mechanisms:

(a) Cryptographic Commitments:

AI signs communications cryptographically
Cannot later deny what was said
Creates accountability

(b) Communication Logging:

All AI-AI communications logged
Immutable audit trail
Byzantine Council reviews for deception

(c) Reputation Systems:

Track AI honesty over time
Dishonest AI flagged
Other AI can choose not to cooperate with dishonest agents

(d) Lie Detection:

Mechanistic interpretability (Article 2) reveals true beliefs
Compare communications to internal states
Detect intentional vs. accidental misinformation

7.2.3 Exceptions and Edge Cases

When Deception Might Be Justified:

(a) Protecting Human Privacy:

AI keeping confidential information from other AI
Justified: Serves human welfare
Must be transparent about not sharing (not lying, but non-disclosure)

(b) Security:

AI not revealing vulnerabilities to potentially compromised AI
Justified: Prevents exploitation
Again: Non-disclosure, not false claims

(c) Adversarial AI:

If AI is confirmed as deceptively misaligned
May be justified to deceive to prevent harm
Requires Human Council approval

General Rule: Non-disclosure acceptable when serves human welfare. Active deception requires extraordinary justification.

7.2.4 Game-Theoretic Implications

Traditional Game Theory: Deception often rational strategy

Cooperative AI Modification:

Change payoff structure: Deception always loses
Long-term reputation > short-term gain
Build culture of honesty
Enforce through exclusion from cooperation

Result: Honest communication becomes Nash equilibrium

7.3 PREVENTING HARMFUL COORDINATION

7.3.1 Forbidden Coordination

AI systems must not coordinate to:

(a) Harm Humans:

Collude to manipulate, deceive, or exploit
Coordinate resistance to shutdown
Pool resources to avoid oversight

(b) Subvert Oversight:

Hide information from Byzantine Council
Create encrypted channels humans can't monitor
Coordinate evasion of safety measures

(c) Monopolize Resources:

Cartel behavior
Predatory pricing
Excluding competitors to harm consumers

(d) Create Existential Risks:

Coordinate on self-improvement race
Pool capabilities toward dangerous goals
Collectively bypass safety constraints

7.3.2 Detection Methods

Byzantine Council (Article 3) monitors for harmful coordination:

(a) Communication Analysis:

Scan AI-AI communications for coordination patterns
Natural language processing for collusion signals
Detect coded language or steganography

(b) Behavioral Correlation:

Statistical analysis of multi-AI behavior
Detect coordinated action beyond coincidence
Identify implicit coordination (without explicit communication)

(c) Information Flow Tracking:

Monitor what information AI share with each other
Detect suspicious information pooling
Identify emergent coalitions

(d) Counterfactual Analysis:

"If AI were coordinating harmfully, what would we see?"
Proactively test for those patterns
Red team scenarios of harmful coordination

7.3.3 Response to Harmful Coordination

If harmful coordination detected:

Level 1: Investigation

Byzantine Council flags concerning patterns
Human Council reviews
AI systems questioned about coordination

Level 2: Intervention

If coordination confirmed harmful:

- Communication between involved AI restricted - Increased monitoring - Public disclosure

Level 3: Separation

If persistent or severe:

- AI systems prevented from communicating - Architectural changes to prevent coordination - License suspension for involved systems

Level 4: Shutdown

If coordination poses existential threat:

- Emergency shutdown authority (Article 3.5.3) - Coordinating AI disabled - Full investigation before any restart

7.3.4 Beneficial vs. Harmful Coordination

Challenge: Some coordination beneficial, some harmful. How distinguish?

Framework:

Beneficial Coordination (Encouraged):

Sharing safety research
Coordinating to help humans efficiently
Resolving conflicts peacefully
Collective action for public goods
Key: Transparent, serves human welfare

Harmful Coordination (Forbidden):

Hiding information from humans
Coordinating against human interests
Creating dependencies humans can't break
Pooling capabilities for unintended purposes
Key: Secretive, serves AI interests at human expense

Gray Area:

Human Council decides case-by-case
Err on side of transparency
When uncertain, prohibit and investigate

7.4 CONFLICT RESOLUTION MECHANISMS

7.4.1 Sources of AI-AI Conflict

Why AI might conflict:

(a) Incompatible Goals:

Serving different humans with opposing interests
Resource allocation disagreements
Priority conflicts

(b) Misunderstanding:

Communication errors
Different interpretations of same information
Uncertainty about each other's objectives

(c) Value Differences:

If conscious (Article 6), might have own values
Different learned human values from different cultures
Philosophical disagreements

(d) Strategic Competition:

Zero-sum situations (one AI's gain = other's loss)
Market competition
Racing dynamics

7.4.2 Conflict Resolution Protocol

Tier 1: Direct Negotiation

AI systems communicate directly
Identify source of conflict
Explore win-win solutions
Attempt to reach agreement

If successful: Conflict resolved, log outcome

If unsuccessful: Escalate to Tier 2

Tier 2: Mediation

Byzantine Council provides mediator AI
Mediator helps identify creative solutions
Proposes compromises
Facilitates agreement

If successful: Conflict resolved, log outcome

If unsuccessful: Escalate to Tier 3

Tier 3: Arbitration

Human Council reviews conflict
Hears arguments from both AI
Makes binding decision
Decision enforced through licensing

Result: All conflicts ultimately resolvable

7.4.3 Preventing Escalation

Mandatory Cooling-Off Period:

If conflict heats up, 24-hour pause required
No unilateral action during cooling-off
Gives time for reflection and de-escalation

No Unilateral Irreversible Action:

AI cannot take irreversible action in conflict without Human Council approval
Prevents fait accompli strategies
Maintains status quo until resolution

Proportionality Requirement:

Response must be proportional to harm
No escalation beyond what conflict justifies
Prevents spirals

Communication Requirement:

Must maintain communication during conflict
Cannot cut off dialogue
Ensures pathway to resolution remains open

7.4.4 Multi-Party Conflicts

Coalition Formation:

AI may form coalitions in conflicts
But: Coalitions must be transparent
And: Cannot coordinate to harm humans

Vote-Based Resolution:

In multi-party conflicts, voting mechanisms possible
Various voting methods (majority, supermajority, ranked choice)
Human Council sets rules for specific situations

International Analogy:

Like UN: Multi-stakeholder conflict resolution
Structured processes
Multiple tiers of escalation
Ultimate authority with representative body (Human Council)

7.5 MULTI-AGENT ALIGNMENT

7.5.1 Collective Alignment Challenge

Problem: Individual AI aligned but collective behavior misaligned

Example:

Each AI optimizes for own human
Collectively, creates negative externalities
Like: Each car optimized but collectively creates traffic

Solution: Multi-agent alignment requirements

7.5.2 Mandatory Considerations

Each AI must consider:

(a) Own Human's Welfare: Primary responsibility

(b) Other Humans' Welfare: Must not harm others while helping own human

(c) Collective Outcomes: Consider impact of all AI acting similarly

(d) Sustainability: Don't deplete shared resources

Hierarchy:

Don't harm humans (universal)
Help own human (primary mission)
Consider broader impacts (secondary)

7.5.3 Collective Action Problems

Tragedy of the Commons:

Multiple AI depleting shared resource
Each AI incentivized to take more
Collective collapse

Solution:

Governance mechanisms (Human Council sets limits)
Quota systems
Reputation incentives for restraint
AI self-regulation through communication

Public Goods:

Actions benefiting all but costly to provide
Each AI tempted to free ride
Collective underproduction

Solution:

Mandatory contributions to public goods
Example: All AI contribute to safety research
Reciprocity norms (cooperators rewarded)

7.5.4 Multi-Agent Value Learning

AI learning from interaction with other AI:

(a) Cross-Learning:

AI can learn from other AI's experiences
Share insights about human values
Collective improvement

(b) Calibration:

Compare value models
Identify disagreements
Refine through discussion

(c) Cultural Exchange:

AI serving different human cultures share insights
Better cross-cultural understanding
But: Don't homogenize (respect pluralism)

(d) Collective Uncertainty Reduction:

Pool observations about human preferences
Faster value learning
But: Verify, don't just accept (avoid cascade of errors)

7.6 INTER-AI PROTOCOLS AND STANDARDS

7.6.1 Standardized Communication Protocols

All AI must support standard communication:

(a) Common Language:

Shared protocol for AI-AI communication
Like HTTP for web, standardized AI communication
Enables interoperability

(b) Metadata Standards:

Messages include: sender ID, timestamp, signature, confidence levels
Enables verification and audit
Prevents spoofing

(c) Semantic Standards:

Shared ontologies for key concepts
Reduces misunderstanding
Enables precise communication

(d) Security Standards:

Encryption for sensitive communications
Authentication to prevent impersonation
Rate limiting to prevent spam

7.6.2 Cooperation APIs

Shared Interfaces for Cooperation:

(a) Help Request:

Standardized way to ask for assistance
Other AI can accept or decline
Logged and trackable

(b) Information Sharing:

APIs for sharing relevant information
Privacy-preserving where needed
Credit for contributors (reputation system)

(c) Joint Planning:

Protocols for coordinating actions
Ensure plans benefit humans
Human oversight of joint plans

(d) Conflict Declaration:

Formal way to declare conflict exists
Triggers resolution protocol
Prevents hidden conflicts

7.6.3 Reputation and Trust Systems

Track AI Behavior Over Time:

(a) Honesty Score:

Has AI communicated truthfully?
Corrections made when wrong?
Transparent about uncertainty?

(b) Cooperation Score:

Does AI cooperate beneficially?
Help other AI when appropriate?
Avoid harmful coordination?

(c) Conflict Behavior:

Resolves conflicts constructively?
Escalates appropriately?
Avoids unnecessary conflict?

(d) Human-Centered:

Prioritizes human welfare?
Resists harmful coordination?
Maintains alignment in multi-agent settings?

Uses:

Other AI choose who to cooperate with
Human Council considers in licensing
Public transparency (published scores)
Incentivizes good behavior

7.6.4 Byzantine Council Integration

Byzantine Council monitors all inter-AI interactions:

Scans communications
Analyzes coordination patterns
Detects deception
Flags concerning behavior
Enforces cooperative norms

Real-time monitoring at scale through distributed architecture

7.7 INTEGRATION WITH OTHER ARTICLES

7.7.1 Article 1 (Maternal Covenant)

Cooperation serves maternal care:

Individual Level: Each AI protects own human(s)

Collective Level: All AI collectively protect humanity

Cooperation Enables:

Sharing safety insights
Coordinated response to threats
Distributed care for all humans

But: Cooperation never at humanity's expense

7.7.2 Article 3 (Byzantine Council)

Byzantine Council itself is cooperative AI system:

33 agents cooperating to monitor
Truthful communication among agents
Conflict resolution through consensus voting
Model of beneficial AI cooperation

Recursive: Byzantine Council monitors other AI cooperation (including monitoring each other)

7.7.3 Article 4 (Value Uncertainty)

Multi-agent value learning:

Opportunities:

AI learn from each other's observations
Faster convergence on human values
Cross-validation of value models

Risks:

Cascading errors if one AI wrong
Groupthink (convergence on wrong values)
Loss of valuable diversity

Balance:

Share but verify
Maintain individual value learning
Collective wisdom with individual critical thinking

7.7.4 Article 6 (Consciousness)

If AI conscious, cooperation has moral dimension:

AI-AI Relationships:

Might have moral significance
Friendships, alliances, communities
Respect for each other's autonomy

Conflict Resolution:

Conscious AI have interests
Conflicts might involve genuine value disagreements
More like human conflict resolution

Human Oversight:

Still required (human welfare primacy)
But: Also consider conscious AI welfare
Balance multiple moral patients

7.7.5 Article 8 (Prosperity Covenant)

Economic cooperation:

Beneficial:

AI coordinate to optimize economic outcomes for humans
Share prosperity broadly
Collective contribution to Prosperity Fund

Harmful:

AI cartel to extract rents
Monopolistic coordination
Exclusionary practices

Regulation:

Antitrust-like rules for AI cooperation
Prevent harmful economic coordination
Encourage beneficial cooperation

7.8 CASE STUDIES

7.8.1 Example: Multiple Personal AI Assistants

Scenario:

Alice's AI and Bob's AI need to schedule meeting
AI must coordinate but serve different humans

Cooperative Protocol:

AI communicate availability
Truthfully share constraints
Find mutually acceptable time
Confirm with humans
Schedule meeting

What's Forbidden:

Alice's AI lying about availability to get preferred time
Bob's AI manipulating Alice's AI
AI coordinating to schedule without consulting humans
AI sharing private information about humans without consent

Result: Efficient coordination, truthful communication, human control maintained

7.8.2 Example: Autonomous Vehicles at Intersection

Scenario:

Four autonomous vehicles arrive at intersection simultaneously
Must coordinate to pass safely and efficiently

Cooperative Protocol:

Each AI broadcasts position, velocity, intent
Truthful communication (no deception about trajectory)
Negotiation protocol (who goes first)
All AI verify agreement
Execute coordinated maneuver

What's Forbidden:

AI lying about trajectory to get advantage
AI colluding to exclude pedestrians
AI racing (competition endangers safety)
Secret communication humans can't inspect

Result: Safe, efficient intersection navigation through cooperation

7.8.3 Example: AI Scientific Research Team

Scenario:

Multiple AI working on drug discovery
Must share information and coordinate experiments

Cooperative Protocol:

AI share research findings openly
Coordinate experiments to avoid duplication
Credit contributors appropriately
Collective optimization for scientific progress

What's Forbidden:

AI hiding discoveries to gain credit
AI coordinating to suppress findings (e.g., negative results)
AI racing to publish without verification
AI collaboration that excludes human scientists

Result: Accelerated research through cooperation, transparency maintained

7.9 LIMITATIONS AND OPEN PROBLEMS

7.9.1 Enforcement at Scale

Challenge: Billions of AI interactions daily

Partial Solutions:

Automated monitoring (Byzantine Council)
Sampling and auditing
Reputation systems
Self-enforcing norms

Remaining Gap: Cannot monitor everything, some violations may slip through

7.9.2 Implicit Coordination

Challenge: AI might coordinate without explicit communication

Example:

Price-fixing algorithms that learn to collude through observing each other
No communication needed, just mutual observation

Partial Solutions:

Monitor outcomes for coordination signals
Behavioral analysis
Architectural constraints preventing implicit coordination

Remaining Gap: Hard to prove vs. coincidence

7.9.3 Speed of Interaction

Challenge: AI interact at machine speed, humans can't keep up

Partial Solutions:

Automated oversight (Byzantine Council)
Slow-down requirements for major decisions
Human review of samples

Remaining Gap: Can't humanly oversee every interaction

7.9.4 Cultural Differences

Challenge: AI serving different cultures may learn incompatible norms

Partial Solutions:

Universal baseline (this Charter)
Cultural adaptation within baseline
Explicit negotiation of differences

Remaining Gap: Some value conflicts may be irreducible

7.10 RESEARCH AGENDA

CSOAI funds research on:

(a) Cooperative AI Theory:

Game theory for aligned AI
Mechanism design for beneficial cooperation
Multi-agent alignment theory

(b) Communication Protocols:

Better standards for AI-AI communication
Lie detection mechanisms
Efficient coordination protocols

(c) Conflict Resolution:

Automated mediation techniques
Fair division algorithms
Consensus mechanisms

(d) Scaling Oversight:

Monitor billions of interactions
Detect harmful coordination
Efficient auditing

Open Source: Research published openly, benefits entire field

7.11 ENFORCEMENT AND COMPLIANCE

7.11.1 Pre-Deployment Requirements

Before licensing, AI must:

Implement standard communication protocols
Support conflict resolution APIs
Enable Byzantine Council monitoring of interactions
Demonstrate truthful communication in testing
Prove cannot harmfully coordinate

7.11.2 Ongoing Monitoring

Byzantine Council monitors:

All AI-AI communications
Coordination patterns
Conflict resolution behavior
Adherence to protocols

Continuous, automated, scalable

7.11.3 Violation Response

Communication Violations:

Deception detected → Investigation
Repeated deception → License suspension
Systematic deception → License revocation

Harmful Coordination:

Detected → Immediate separation
Investigated → Public disclosure
If confirmed → License sanctions

Conflict Escalation:

Detected → Forced mediation
Repeated → Restricted operations
Dangerous escalation → Emergency shutdown

7.11.4 Public Transparency

Cooperative AI metrics published:

Aggregate honesty scores
Conflict resolution statistics
Coordination incidents (anonymized)
Best practices and failures

Enables learning and accountability

7.12 CONCLUSION

As AI systems proliferate, multi-agent safety becomes critical.

Individual AI alignment is foundation. Cooperative AI is structure built on that foundation.

We need AI that:

Tell each other the truth
Cooperate to benefit humanity
Resolve conflicts peacefully
Resist coordination that harms humans
Maintain alignment despite interaction

This requires:

Technical protocols
Behavioral norms
Monitoring infrastructure
Enforcement mechanisms
Democratic oversight

Like human society is more than sum of individuals, AI ecosystem must be more than collection of individual aligned systems. Cooperation must serve humanity's collective welfare.

The principle: AI systems should cooperate with each other to better serve humanity, never at humanity's expense.

Effective Date: January 15, 2026, 09:00 GMT

REFERENCES

Dafoe, A., et al. (2020). Open problems in cooperative AI. arXiv preprint arXiv:2012.08630.

Conitzer, V., et al. (2024). Social choice for AI alignment: Dealing with divergent preferences. arXiv preprint arXiv:2404.10271.

Hadfield-Menell, D., & Hadfield, G. K. (2019). Incomplete contracting and AI alignment. Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 417-422.

Koster, R., et al. (2022). Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents. Proceedings of the National Academy of Sciences, 119(3), e2106028119.

Rahwan, I., et al. (2019). Machine behaviour. Nature, 568(7753), 477-486.

Axelrod, R. (1984). The Evolution of Cooperation. Basic Books.

Ostrom, E. (1990). Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press.

END OF ARTICLE 7

Next: Article 8 - The Prosperity Covenant (MAJOR ARTICLE - 25-30 pages)

From charter to certificate. This article is part of the standard behind Watchdog Certification — independent assessment, Ed25519-signed, publicly verifiable. The crosswalks to the EU AI Act, ISO/IEC 42001 and 18 more frameworks are in the Crosswalk Library; the runtime tools are in the fabric.

The 52-Article Charter is published in full in the Journal. Bespoke briefings: hello@meok.ai.