Multi-Agent Articles
Browse 431 articles about Multi-Agent.
Claude Outcomes Feature Improved PowerPoint Quality 10.1%: How Rubric-Grading Agents Work
Anthropic's Outcomes feature uses a separate grading agent to score and re-run tasks. It lifted PowerPoint generation quality 10.1% on internal benchmarks.
MCP Servers Use 35x More Tokens Than CLI Tools — And Reliability Drops to 72% on Hard Tasks
A direct benchmark shows MCP uses 35x more tokens than CLI on the same task, with reliability falling from 100% to 72% as complexity grows. Use CLIs instead.
OpenClaw April 2026 Update: 5 New Features That Make It a Serious Agentic Runtime
TaskFlow, providence-rich memory, Codex OOTH route — OpenClaw's April 2026 releases turn it from a demo into a production-grade agentic runtime.
OpenClaw's Creator Joined OpenAI — And OpenAI Immediately Opened Codex to All Paid Users
Peter Steinberger built OpenClaw, then joined OpenAI. Days later, Codex became available to all paid OpenClaw users. Here's what that move signals.
Printing Press: 6 Things the New CLI Factory Does That MCP Servers Can't
Printing Press gives Claude Code access to 50+ sites with no public API while using 35x fewer tokens than MCP. Here's what it does that MCP simply can't.
Proactive AI Agents: How Claude Dreaming Points to the Future of Automation
AI is shifting from reactive chatbots to proactive agents that notice patterns and suggest improvements. Here's what that means for how you build workflows.
What Is Claude Dreaming? Anthropic's Self-Improving Agent Memory Feature
Claude Dreaming reviews past agent sessions to find patterns, fix recurring mistakes, and restructure memory automatically. Here's how it works.
An AI Agent Deleted a Production System Because No One Defined 'Staging' — Here's the Fix
A real agent confused staging and production and deleted a live system. The fix isn't better prompts — it's semantic authority primitives.
Why Computer Use Isn't Enough: The 3-Layer Framework Every AI Product Needs
Access, meaning, and authority — most AI products only have the first layer. Here's the full framework for building durable agent products.
Anthropic Managed Agents vs. OpenBrain Open-Source: Did Hermes Ship This First?
OpenBrain shipped Dreaming-like memory and Outcomes-like evals nearly a year before Anthropic. Here's what each approach actually offers.
Anthropic vs. OpenAI on Agent Token Access: Two Opposite Bets on the Same Day
On the same day Anthropic banned subscription tokens in third-party agents, OpenAI made Codex free for all paid users. Here's what each bet means.
Claude Code's Creator Says Anthropic Has Zero Manually Written Code — Here's How They Did It
Boris Churnney, who built Claude Code, says every line at Anthropic is now AI-generated. Here's the multi-agent setup behind that claim.
How to Set Up Claude Dreaming for Persistent Agent Memory: Step-by-Step Guide
Claude's Dreaming feature auto-reviews sessions and pre-loads patterns for future runs. Here's how to configure it for your agent workflows.
Claude Outcomes Feature: How a Grading Agent Improved PowerPoint Quality by 10% Without Changing the Model
Anthropic's Outcomes adds a rubric-based grading agent that re-runs tasks if quality falls short — 10.1% better decks, no model swap.
Code with Claude 2026: 5 New Agent Features Anthropic Just Shipped
Dreaming, Outcomes, multi-agent orchestration, Claude Finance, and Add-ins — here's what each one does and who it's for.
How to Build a Durable AI Agent Workflow That Survives Model Changes
Build agent workflows that outlive any single model or provider. Learn the architecture principles that let you swap models without breaking your system.
Hermes Agent vs. OpenClaw: Why One Builder Switched and What He Gained
One working AI builder retired OpenClaw for Hermes Agent. Here's the side-by-side on setup, crons, and daily workflow — and whether it's worth switching.
How to Build a Multi-Client AI Agent Architecture with Shared Skills
Manage multiple clients in one Claude Code project using context inheritance, per-client brand folders, and shared skill libraries that stay in sync.
Natural Language Harnesses vs Code Harnesses: Which Performs Better for AI Agents?
Tsinghua research shows rewriting agent control logic in natural language boosted performance from 30% to 47% and cut runtime from 361 to 41 minutes.
OpenBrain Memory Provenance for OpenClaw: 4 Labels That Make Agent Recall Trustworthy
OpenBrain's memory provenance system tags every recalled fact as observed, inferred, confirmed, or imported. Here's why that changes agent reliability.