Chapter 6: Multi-Agent Architectures

From Single Agents to Coordinated Systems

Overview

"An agent becomes powerful when it can reason. A system becomes reliable when many agents coordinate."

As agent systems move beyond narrow automation toward open-ended problem solving, single-agent designs encounter hard limits: reasoning overload, tangled prompts, unclear responsibility boundaries, and poor observability. Multi-agent architectures address these limits by decomposing cognition, execution, and evaluation across cooperating agents with explicit roles, authority, and constraints.

Key Principle

Multi-agent systems in production rarely operate as fully autonomous entities. Instead, they exist along a continuum between automation and human oversight, with autonomy deliberately bounded by policy, cost, and risk considerations.

Core Multi-Agent Patterns

🎯 Bounded Autonomy

Constrained operational envelopes with explicit limits on actions, budgets, and scope

👥 Role Specialization

Supervisor, planner, executor, and critic roles with scoped tools and memory

📋 Task Decomposition

Functional, sequential, parallel, and policy-constrained splitting strategies

🤝 Multi-Agent Planning

Alignment strategies and negotiation mechanisms for coordinated execution

⚡ Distributed Execution

Containerized services, event-driven coordination, and serverless scaling

🧠 Memory Topologies

Shared semantic, agent-local scratchpads, and global vector memory

Bounded Autonomy & Human-in-the-Loop

Autonomy Envelope

agent:
  role: executor
  permissions:
    tools: [read_db, run_query]
    write_access: false
  limits:
    max_steps: 10
    cost_budget_usd: 0.50

Typical Autonomy Boundaries:

  • Read-only vs write-enabled actions
  • Maximum execution depth or retries
  • Explicit token or monetary budgets
  • Domain or jurisdictional restrictions

Human Intervention Points

  • Plan approval before execution
  • Tool calls with irreversible side effects
  • Escalation when confidence drops below threshold
  • Post-execution validation in regulated domains

Case: Regulatory Policy Analysis

A compliance system uses planner, executor, and critic agents with bounded autonomy. When confidence scores fall below threshold or interpretations imply material changes, the supervisor pauses execution and escalates to human review. Routine updates are handled autonomously while high-risk interpretations trigger oversight.

Role-Based Multi-Agent Architectures

Canonical Role Pipeline

Supervisor
   ↓
Planner → Executor → Critic
          ↑        ↓
       retry ← feedback

Role Responsibilities:

  • Supervisor: Coordinates execution, assigns work, resolves conflicts
  • Planner: Decomposes objectives into actionable steps
  • Executor: Performs tool calls and skill invocations
  • Critic: Evaluates outputs against policy, safety, quality constraints

Role Enforcement

Effective enforcement requires:

  • Distinct system prompts per role
  • Role-scoped tool registries
  • Orchestration rules preventing cross-role leakage

Case: Incident Response Swarm

A production incident-response system uses strict role separation. The supervisor acts as incident commander, assigning tasks to planners that generate hypotheses. Executors query logs and metrics, while critics validate hypotheses against observed data. By preventing executors from planning and critics from acting, the system achieves faster root-cause identification.

Delegation & Task Decomposition

Task Decomposition Strategies

1. Functional Decomposition

Split work by capability (planning, execution, validation, compliance). Reduces cognitive overload but may introduce coordination latency.

2. Sequential Decomposition

Break objectives into ordered stages. Effective when dependencies are clear, but brittle to early failures.

3. Parallel Decomposition

Assign independent subtasks simultaneously. Improves robustness but requires downstream aggregation and negotiation.

4. Policy-Constrained Decomposition

Apply governance rules during task splitting. Essential in regulated environments for selective autonomy.

Policy-Driven Delegation

if task.domain == "finance":
    delegate_to = "compliance_planner"
elif task.risk_score > threshold:
    escalate_to = "human_review"

Benefits:

  • Centralized, auditable governance
  • Adjust delegation without modifying agent logic
  • Domain ownership and risk thresholds enforced consistently

Multi-Agent Planning & Negotiation

Alignment Strategies

  • Shared Objective Representation: Consistent goal specifications across agents
  • Global Constraint Enforcement: Centrally enforced cost, latency, safety policies
  • Uniform Evaluation Criteria: Same scoring dimensions for fair comparison

Negotiation Mechanisms

Voting-Based Negotiation

Multiple agents propose plans, critics score them, supervisor selects based on weighted criteria.

Auction-Based Negotiation

Agents submit bids based on cost, confidence, or resource requirements. Favors efficiency.

Consensus-Driven Negotiation

Iterative refinement through critique and convergence. Improves quality but adds overhead.

Case: Data Migration Planning

Multiple planners independently propose execution strategies. Critic agents evaluate along cost, latency, and risk dimensions. The supervisor uses weighted voting to select the final plan, reducing planner bias and increasing robustness.

Distributed Multi-Agent Execution

Execution Models

Containerized Agents

Long-lived roles (supervisors, planners, critics) run as containerized services with:

  • Isolated environments and resource limits
  • Independent scaling and rolling updates
  • Health checks and resource quotas

Event-Driven Agents

Agents triggered by events (task creation, step completion, data arrival) rather than long-lived sessions. Supports fan-out, retries, and asynchronous coordination.

Task Event → Supervisor or Planner Service
           ↓
     Executor Swarm (N)
           ↓
      Critic Service

Serverless Multi-Agent Scaling

Stateless executors instantiated on-demand, perform bounded work, and terminate. Provides elasticity and cost efficiency under variable demand.

Case: Event-Driven Executor Swarm

Large-scale agentic systems deploy executor agents using serverless, event-driven models. A supervisor service emits execution events onto a managed event bus. Each event triggers a short-lived executor for specific tasks (document retrieval, feature extraction, compliance validation). Planner and critic agents run as long-lived containerized services. This architecture combines elasticity with centralized governance.

Memory Topologies in Multi-Agent Systems

Memory Layer Separation

Shared Semantic Memory

Authoritative knowledge layer storing documents, facts, policies, schemas. Accessed primarily through read operations with governed write access.

Agent-Local Scratchpads

Transient reasoning state including intermediate variables, partial plans, tool outputs. Scoped to single agent and execution context.

Global Vector Memory

System-wide recall mechanism with embeddings for similarity-based retrieval. Treated as suggestive context rather than authoritative truth.

Aspect Shared Semantic Global Vector
Primary role Authoritative knowledge Recall and discovery
Consistency High Best-effort
Access pattern Governed reads/writes Read-mostly retrieval
Trust model Treated as correct Suggestive context

Why Multi-Agent Architectures Matter

Multi-agent systems transition from monolithic reasoning engines into distributed, coordinated platforms that balance autonomy with oversight, specialization with collaboration.