Chapter 6: Multi-Agent Architectures

Overview

"An agent becomes powerful when it can reason. A system becomes reliable when many agents coordinate."

As agent systems move beyond narrow automation toward open-ended problem solving, single-agent designs encounter hard limits: reasoning overload, tangled prompts, unclear responsibility boundaries, and poor observability. Multi-agent architectures address these limits by decomposing cognition, execution, and evaluation across cooperating agents with explicit roles, authority, and constraints.

Key Principle

Multi-agent systems in production rarely operate as fully autonomous entities. Instead, they exist along a continuum between automation and human oversight, with autonomy deliberately bounded by policy, cost, and risk considerations.

Core Multi-Agent Patterns

🎯 Bounded Autonomy

Constrained operational envelopes with explicit limits on actions, budgets, and scope

👥 Role Specialization

Supervisor, planner, executor, and critic roles with scoped tools and memory

📋 Task Decomposition

Functional, sequential, parallel, and policy-constrained splitting strategies

🤝 Multi-Agent Planning

Alignment strategies and negotiation mechanisms for coordinated execution

⚡ Distributed Execution

Containerized services, event-driven coordination, and serverless scaling

🧠 Memory Topologies

Shared semantic, agent-local scratchpads, and global vector memory

Bounded Autonomy & Human-in-the-Loop

Autonomy Envelope

agent:
  role: executor
  permissions:
    tools: [read_db, run_query]
    write_access: false
  limits:
    max_steps: 10
    cost_budget_usd: 0.50

Typical Autonomy Boundaries:

Read-only vs write-enabled actions
Maximum execution depth or retries
Explicit token or monetary budgets
Domain or jurisdictional restrictions

Human Intervention Points

Plan approval before execution
Tool calls with irreversible side effects
Escalation when confidence drops below threshold
Post-execution validation in regulated domains

Case: Regulatory Policy Analysis

A compliance system uses planner, executor, and critic agents with bounded autonomy. When confidence scores fall below threshold or interpretations imply material changes, the supervisor pauses execution and escalates to human review. Routine updates are handled autonomously while high-risk interpretations trigger oversight.

Role-Based Multi-Agent Architectures

Canonical Role Pipeline

Supervisor
   ↓
Planner → Executor → Critic
          ↑        ↓
       retry ← feedback

Role Responsibilities:

Supervisor: Coordinates execution, assigns work, resolves conflicts
Planner: Decomposes objectives into actionable steps
Executor: Performs tool calls and skill invocations
Critic: Evaluates outputs against policy, safety, quality constraints

Role Enforcement

Effective enforcement requires:

Distinct system prompts per role
Role-scoped tool registries
Orchestration rules preventing cross-role leakage

Case: Incident Response Swarm

A production incident-response system uses strict role separation. The supervisor acts as incident commander, assigning tasks to planners that generate hypotheses. Executors query logs and metrics, while critics validate hypotheses against observed data. By preventing executors from planning and critics from acting, the system achieves faster root-cause identification.

Delegation & Task Decomposition

Task Decomposition Strategies

1. Functional Decomposition

Split work by capability (planning, execution, validation, compliance). Reduces cognitive overload but may introduce coordination latency.

2. Sequential Decomposition

Break objectives into ordered stages. Effective when dependencies are clear, but brittle to early failures.

3. Parallel Decomposition

Assign independent subtasks simultaneously. Improves robustness but requires downstream aggregation and negotiation.

4. Policy-Constrained Decomposition

Apply governance rules during task splitting. Essential in regulated environments for selective autonomy.

Policy-Driven Delegation

if task.domain == "finance":
    delegate_to = "compliance_planner"
elif task.risk_score > threshold:
    escalate_to = "human_review"

Benefits:

Centralized, auditable governance
Adjust delegation without modifying agent logic
Domain ownership and risk thresholds enforced consistently

Multi-Agent Planning & Negotiation

Alignment Strategies

Shared Objective Representation: Consistent goal specifications across agents
Global Constraint Enforcement: Centrally enforced cost, latency, safety policies
Uniform Evaluation Criteria: Same scoring dimensions for fair comparison

Negotiation Mechanisms

Voting-Based Negotiation

Multiple agents propose plans, critics score them, supervisor selects based on weighted criteria.

Auction-Based Negotiation

Agents submit bids based on cost, confidence, or resource requirements. Favors efficiency.

Consensus-Driven Negotiation

Iterative refinement through critique and convergence. Improves quality but adds overhead.

Case: Data Migration Planning

Multiple planners independently propose execution strategies. Critic agents evaluate along cost, latency, and risk dimensions. The supervisor uses weighted voting to select the final plan, reducing planner bias and increasing robustness.

Distributed Multi-Agent Execution

Execution Models

Containerized Agents

Long-lived roles (supervisors, planners, critics) run as containerized services with:

Isolated environments and resource limits
Independent scaling and rolling updates
Health checks and resource quotas

Event-Driven Agents

Agents triggered by events (task creation, step completion, data arrival) rather than long-lived sessions. Supports fan-out, retries, and asynchronous coordination.

Task Event → Supervisor or Planner Service
           ↓
     Executor Swarm (N)
           ↓
      Critic Service

Serverless Multi-Agent Scaling

Stateless executors instantiated on-demand, perform bounded work, and terminate. Provides elasticity and cost efficiency under variable demand.

Case: Event-Driven Executor Swarm

Large-scale agentic systems deploy executor agents using serverless, event-driven models. A supervisor service emits execution events onto a managed event bus. Each event triggers a short-lived executor for specific tasks (document retrieval, feature extraction, compliance validation). Planner and critic agents run as long-lived containerized services. This architecture combines elasticity with centralized governance.

Memory Topologies in Multi-Agent Systems

Memory Layer Separation

Shared Semantic Memory

Authoritative knowledge layer storing documents, facts, policies, schemas. Accessed primarily through read operations with governed write access.

Agent-Local Scratchpads

Transient reasoning state including intermediate variables, partial plans, tool outputs. Scoped to single agent and execution context.

Global Vector Memory

System-wide recall mechanism with embeddings for similarity-based retrieval. Treated as suggestive context rather than authoritative truth.

Aspect	Shared Semantic	Global Vector
Primary role	Authoritative knowledge	Recall and discovery
Consistency	High	Best-effort
Access pattern	Governed reads/writes	Read-mostly retrieval
Trust model	Treated as correct	Suggestive context

Why Multi-Agent Architectures Matter

Scalability: Distribute cognitive load across specialized agents
Reliability: Isolate failures and enable targeted recovery
Observability: Clear role boundaries improve tracing and debugging
Governance: Enforce policies at orchestration and role boundaries
Maintainability: Independent evolution of role-specific agents

Multi-agent systems transition from monolithic reasoning engines into distributed, coordinated platforms that balance autonomy with oversight, specialization with collaboration.