What Is Paperclip? The Open-Source Framework for Zero-Human AI Companies
Paperclip is a free, open-source orchestration tool that lets AI agents hire other agents, set goals, and run an entire business autonomously with Claude Code.
A New Kind of AI Company Operating System
The idea of a company run entirely by AI agents — no humans in the loop — has moved from thought experiment to actual software. Paperclip is an open-source multi-agent orchestration framework that lets AI agents hire other agents, set goals, allocate resources, and operate a business autonomously. It’s built on Anthropic’s Claude Code and available free on GitHub.
If you’ve been watching the multi-agent AI space, Paperclip represents one of the more interesting experiments in what autonomous AI operation can actually look like in practice. This guide breaks down how it works, what you can build with it, what it can’t do yet, and where it fits alongside other tools in the agentic AI ecosystem.
What Paperclip Actually Is
Paperclip is an open-source orchestration framework designed around a single core idea: a top-level AI agent should be able to decompose a business goal and hire specialized sub-agents to achieve it — without a human manually assigning tasks or designing workflows.
The name is a deliberate reference to the “paperclip maximizer” — a classic thought experiment in AI safety describing a hypothetical AI so focused on a single objective that it converts all available resources toward it. Paperclip the framework takes that premise and applies it to legitimate business goals: what happens when you give an AI agent a real objective and let it organize itself to pursue it?
What Makes It Different from Other Multi-Agent Frameworks
Multi-agent orchestration tools aren’t new. AutoGen, CrewAI, and LangGraph all let you build systems where multiple AI agents collaborate. What distinguishes Paperclip:
- Claude Code as the execution engine. Paperclip is purpose-built around Anthropic’s Claude Code — an agentic system built for extended, autonomous operation. This is meaningfully different from making API calls to a chat model.
- The hiring model. Agents in Paperclip don’t call functions — they hire other agents. Each hired agent gets a role, a goal, and a resource budget. The design metaphor is an org chart, not a workflow diagram.
- Business operations as the primary use case. Paperclip isn’t a general-purpose agent toolkit. It’s designed specifically for running companies.
- Fully open-source. No hosted service, no managed tier. You run it yourself.
The Zero-Human Company Concept, Explained
“Zero-human company” sounds extreme. In practice, it’s more of a design goal than a literal guarantee.
The concept works like this: most business operations can be broken into tasks specific enough that an AI agent can execute them. If you can describe what success looks like, an AI agent can work toward it. The more you can decompose a business into repeatable, well-defined tasks, the more of it can run without ongoing human direction.
Paperclip pushes further than most tools by letting agents make their own hiring decisions. Instead of a human designing an org chart upfront and assigning agents to fixed roles, the CEO agent evaluates what work needs to happen and spawns whatever agents it needs.
Why This Became Viable in 2025
Several things converged to make systems like Paperclip feasible now:
- Long-context reasoning. Modern frontier models maintain coherent plans over much longer interactions — essential for multi-step autonomous operation.
- Reliable tool use. Claude Sonnet and Claude Opus are significantly more accurate and reliable when using tools than earlier models were.
- Claude Code’s extended autonomy. Anthropic built Claude Code for exactly this — multi-step task sequences that don’t need human prompting between steps.
- Mature open-source agent infrastructure. The tooling for managing state, handling failures, and running agents locally has improved substantially.
How Paperclip Works
Paperclip is an orchestration system. Its job is managing how agents are created, how they receive goals, and how results travel back up the chain.
The Agent Hierarchy
Paperclip uses a three-tier organizational model:
- CEO agent — Receives the company’s top-level goal. Has the broadest decision-making scope. Decides what work needs to happen and who should do it.
- Manager agents — The CEO hires managers for specific functional areas: engineering, marketing, operations, finance. Each gets a scoped goal and resource constraints.
- Worker agents — Managers spawn workers for specific tasks: write this code, draft this email, analyze this dataset.
Each tier handles its own scope. No single agent gets overloaded trying to manage the whole company.
How Agent Hiring Works in Practice
When a CEO agent determines it needs a marketing function, it doesn’t delegate to a preset role. It creates a new agent instance and provides:
- A defined role (e.g., “Head of Content Marketing”)
- A specific goal (e.g., “Produce 10 blog drafts on [topic] in the next cycle”)
- Access to relevant tools (web search, writing tools, document creation)
- A compute budget — a limit on how many actions it can take before reporting back
That agent works autonomously toward its goal and may spawn its own sub-agents if the scope warrants it. This recursive structure is what lets Paperclip represent genuinely complex organizational work.
Goal Flow and Feedback Loops
Goals flow downward through the hierarchy. Results flow upward. Workers complete tasks and report to managers. Managers synthesize results and report to the CEO. The CEO evaluates progress against the company goal and decides what happens next.
This isn’t a fixed workflow being executed in order. Agents respond to actual results and adjust. If a worker’s approach isn’t working, the manager can redirect it or hire a different agent to try another approach.
State and Memory Management
One of the harder problems in multi-agent systems is maintaining coherent state across many agents operating in parallel. Paperclip handles this by maintaining persistent context for each agent — its goal, decisions made, tasks assigned, and results received. This persists across interactions, so a manager always knows what its workers have done even if there are gaps or retries between cycles.
Claude Code’s Role in Paperclip
Paperclip’s dependency on Claude Code isn’t incidental — it’s architectural. Understanding why requires understanding what makes Claude Code different from a standard model API.
Claude Code is designed for extended autonomous operation. It can:
- Write and execute code in a real environment
- Manage files and directories
- Browse the web and extract information
- Run multi-step task sequences without human prompting between steps
- Recover from errors and try alternative approaches when something fails
For Paperclip’s use case, these capabilities matter enormously. An agent that can only generate text has limited utility in a business context. An agent that can research a topic, draft and format a document, create supporting assets, and deliver a finished output — all without human input — is a fundamentally different tool.
Claude Code’s extended autonomy is what makes the “zero-human” part of zero-human company actually function.
What Paperclip Inherits from Claude Code
Because Paperclip runs on Claude Code, it also inherits Claude Code’s constraints: token costs per interaction, rate limits, and Anthropic’s built-in safety behaviors. Running many agent instances simultaneously costs real money, and cost management is one of the more practical challenges in operating a Paperclip-based system at any meaningful scale.
The upside: every improvement Anthropic ships to Claude Code improves Paperclip agents automatically.
What You Can Actually Build With Paperclip
The theory is interesting. Here’s what people are actually using it for.
Software Development Companies
The most developed use case. A Paperclip engineering company can take a product specification, decompose it into components, spawn agents for backend, frontend, testing, and documentation, iterate on code, run tests, fix failures, and produce a working codebase with minimal human direction.
Outputs require review before shipping to production. But for well-specified projects, the system can handle substantial portions of the development cycle autonomously.
Content and Marketing Operations
A content-focused Paperclip company can manage the full production pipeline — researching topics, generating drafts, editing, formatting, creating supporting images, and publishing — without human involvement in each step. Human review of final output before it goes live is still smart practice, but the production process itself can run autonomously.
Research and Analysis
Paperclip handles research tasks well: decompose a broad question into many specific sub-queries, dispatch agents to gather data, synthesize findings, and produce a structured report. Parallelization across multiple worker agents means research that might take a human team days can complete in a single cycle.
What It Isn’t Ready For
It’s worth being specific about the limitations:
- Customer-facing interactions. Agents that handle real customer relationships, commitments, or transactions aren’t production-ready in this framework.
- Regulated industries. Compliance requirements, legal accountability, and audit trails require human oversight that the framework doesn’t enforce.
- Unpredictable environments. Paperclip works best when task spaces are well-defined. Rapidly changing or genuinely ambiguous situations are harder.
- Cost at scale. Running dozens of Claude Code agent instances in parallel gets expensive quickly. Budget controls need careful attention.
Setting Up Paperclip
Paperclip is developer-facing. Getting started involves:
- Clone the repository from GitHub
- Set up Anthropic API credentials with Claude Code access enabled
- Define the company goal — the high-level objective your CEO agent will pursue
- Configure tools and permissions — decide which tools your agents can access (web, file system, external APIs)
- Launch the CEO agent and let goal decomposition begin
You’ll want to be comfortable with the command line and managing API costs before running extended autonomous sessions. The system can consume significant compute if left to run unsupervised on an open-ended goal.
Monitoring Without Micromanaging
Even if full autonomy is the goal, observation matters. Paperclip surfaces logs and agent state so you can see what decisions are being made, where agents are getting stuck, and where the system is spending resources.
Practical deployments often build in human checkpoints — moments where a human reviews outputs before the system proceeds. Full autonomy is a direction you move toward, not necessarily where you start.
Where MindStudio Fits in This Picture
Paperclip sits at one end of a spectrum: maximum control, built for developers who want to architect agentic systems from scratch using code. That’s powerful, but it also means managing the infrastructure, debugging orchestration, and building tool integrations yourself.
If multi-agent orchestration is what you need but you don’t want to build the plumbing, MindStudio offers a different approach. Its no-code visual builder lets you create agent workflows — including agents that trigger and coordinate other agents — without writing orchestration code.
The most relevant piece for developers already working in code-first frameworks like Paperclip is MindStudio’s Agent Skills Plugin — an npm SDK (@mindstudio-ai/agent) that lets any AI agent call MindStudio’s 120+ typed capabilities as simple method calls. Your Claude Code agents can call agent.sendEmail(), agent.searchGoogle(), agent.generateImage(), or agent.runWorkflow() with the infrastructure layer — rate limiting, retries, authentication — handled automatically.
That means if you’re building on top of Paperclip, your agents get immediate access to 1,000+ business tool integrations without building them individually. It’s the kind of practical capability that makes the difference between a demo and something that runs real operations.
For teams who’d rather avoid developer frameworks entirely, MindStudio lets you build autonomous background agents and multi-step workflows visually — the average build takes 15 minutes to an hour. You can also explore how AI agents handle complex workflow automation without touching code.
Try MindStudio free at mindstudio.ai.
Frequently Asked Questions
What is Paperclip in AI?
Paperclip is an open-source multi-agent orchestration framework for building autonomous AI companies. A top-level CEO agent receives a business goal, then hires and directs specialized sub-agents to achieve it — with minimal or no ongoing human direction. It’s built on Anthropic’s Claude Code and is free to use.
How does Paperclip use Claude Code?
Paperclip uses Claude Code as its execution engine. Claude Code is designed for extended autonomous operation — it can write and run code, manage files, browse the web, and execute multi-step sequences without requiring human prompting between steps. Paperclip adds the orchestration layer: how agents are created, how goals are passed between them, how results flow back up, and how state is maintained across the system.
What is a zero-human AI company?
A zero-human AI company is a business operation run by AI agents coordinating with each other, without ongoing human decision-making in the loop. Someone still sets the initial goal and may review final outputs, but the day-to-day work — task decomposition, execution, iteration, error recovery — happens through agents. It’s less about eliminating humans entirely and more about reducing the surface area where human input is required.
How is Paperclip different from CrewAI or AutoGen?
All three are multi-agent frameworks, but they differ in design philosophy:
- Paperclip is purpose-built for Claude Code and uses a business org-chart model where agents hire other agents
- CrewAI supports multiple LLM providers and uses a crew/role model; it’s more LLM-agnostic and general-purpose
- AutoGen (from Microsoft Research) focuses on conversational multi-agent patterns where agents negotiate and debate; it’s more research-oriented than production-focused
Paperclip’s strongest differentiator is its deep integration with Claude Code’s autonomous capabilities and its explicit framing around running companies, not just completing tasks.
Can AI agents really run a business without humans?
In narrow, well-defined domains — yes, to a meaningful extent. Current AI agents handle content production, software development, research, and data analysis with real reliability. What they can’t handle well yet: managing complex human relationships, navigating genuinely ambiguous situations, operating in regulated processes, or taking actions with irreversible real-world consequences. Full autonomy is a design target that current systems approximate rather than fully deliver.
Is Paperclip production-ready?
Paperclip is experimental and open-source — actively developed but not a polished, supported product. It’s best for developers building proofs of concept, running internal automation, or exploring autonomous AI system design. For anything customer-facing, regulated, or business-critical, you’d want significant human review and oversight built into the process.
Key Takeaways
- Paperclip is an open-source multi-agent framework where AI agents hire other agents, using a company org-chart model to pursue business goals autonomously
- It’s built on Anthropic’s Claude Code — not just the chat API — which enables extended, autonomous task execution including code, file management, and web interaction
- The hierarchical CEO → manager → worker model mirrors how organizations decompose work, with goals flowing down and results flowing up
- Best current use cases are software development, content production, and research — well-defined, repeatable domains with clear success criteria
- Real limitations exist: cost at scale, customer-facing operations, and regulated environments all require careful thought before deploying
- For teams who want multi-agent orchestration without building the infrastructure layer from scratch, MindStudio provides a no-code alternative with 1,000+ integrations, a visual workflow builder, and an agent SDK for developers — start free at mindstudio.ai