What Is the Anthropic Managed Agents Dashboard? How to Monitor Sessions, Vaults, and Costs
Anthropic Managed Agents includes a full dashboard for monitoring sessions, environments, credential vaults, and token costs. Here's what every tab does.
A Tour of the Anthropic Console for Agent Workloads
If you’re running Claude-powered agents at any meaningful scale, the Anthropic Managed Agents dashboard is where visibility lives. It’s the control plane for monitoring what your agents are doing, how much they’re costing, and whether your credentials and environments are set up correctly. For teams building multi-agent workflows — or anyone moving from prototype to production — understanding every tab in this dashboard is not optional.
This guide walks through the Anthropic Managed Agents dashboard section by section: what each area shows, what it’s actually useful for, and what to watch when things go wrong.
What Anthropic Managed Agents Actually Is
Anthropic Managed Agents is the hosted infrastructure layer Anthropic provides for running Claude agents in production. Rather than spinning up compute, managing session state, and wiring together credential storage yourself, you define the agent logic and Anthropic keeps that backend layer running reliably.
The dashboard at console.anthropic.com surfaces the state of all that activity. It’s organized around a few key concerns:
- Sessions — the discrete runs of your agent, from start to finish
- Environments — the configuration context agents run inside
- Vaults — secure credential storage for third-party integrations
- Usage and costs — token consumption, model breakdowns, and billing projections
Each of these maps to a tab or section in the console. Let’s go through them.
The Sessions Tab: Monitoring Agent Runs
The Sessions tab is the operational heartbeat of the dashboard. Every time an agent run starts — whether triggered by a user, an API call, a schedule, or another agent — a session is created and logged here.
What a Session Record Contains
Each session entry shows:
- Session ID — a unique identifier you can reference in logs or support tickets
- Status — active, completed, failed, or timed out
- Start and end time — with duration shown in seconds or minutes
- Model used — which Claude version handled the session
- Token count — input tokens, output tokens, and total
- Cost — the dollar figure for that session
- Triggering source — API call, direct UI interaction, scheduled trigger, or sub-agent invocation
You can filter by time range, status, model, or project. For debugging, the most useful filters are usually status (to isolate failures) and time range (to narrow down to a specific deployment or release window).
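If you pull session records out of the console (for example via an export), the same status and time-range filters are easy to reproduce locally. The field names in this sketch are assumptions based on the columns listed above, not a documented schema:

```python
from datetime import datetime, timezone

# Hypothetical session records mirroring the columns described above;
# the field names are assumptions, not a documented schema.
sessions = [
    {"id": "sess_001", "status": "completed", "start": "2025-06-02T09:15:00Z",
     "model": "claude-3-haiku", "tokens": 4_200, "cost_usd": 0.004},
    {"id": "sess_002", "status": "failed", "start": "2025-06-02T10:01:00Z",
     "model": "claude-3-opus", "tokens": 18_500, "cost_usd": 0.61},
    {"id": "sess_003", "status": "failed", "start": "2025-06-03T11:40:00Z",
     "model": "claude-3-opus", "tokens": 2_100, "cost_usd": 0.07},
]

def filter_sessions(records, status=None, start_after=None):
    """Replicate the dashboard's status and time-range filters locally."""
    out = []
    for r in records:
        if status and r["status"] != status:
            continue
        started = datetime.fromisoformat(r["start"].replace("Z", "+00:00"))
        if start_after and started < start_after:
            continue
        out.append(r)
    return out

# Isolate failures from June 3 onward: the typical debugging filter combo.
failures = filter_sessions(
    sessions,
    status="failed",
    start_after=datetime(2025, 6, 3, tzinfo=timezone.utc),
)
print([s["id"] for s in failures])  # ['sess_003']
```

Keeping a script like this next to your log tooling means you can cross-reference exported session data against the dashboard without clicking through filters each time.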
Drilling Into a Single Session
Clicking into a session opens the detail view. This is where you can trace the full conversation history the agent had, including any tool calls made and their outputs. If your agent is calling external APIs, searching the web, or reading documents, those actions appear in sequence here.
The detail view also shows the system prompt used for that session. This is worth auditing periodically — especially if you have multiple versions of an agent in flight — because a misconfigured system prompt can cause subtle, hard-to-diagnose behavior.
What to Watch for in Sessions
A few patterns are worth monitoring proactively:
- High failure rates — if more than a small percentage of sessions are failing, check whether a tool is returning errors or a prompt change broke expected behavior
- Unusually long sessions — agents that loop or get stuck can run up costs and block other work
- Token spikes — a session that consumes 10x the normal token count often means the agent is processing unexpectedly large inputs or getting into a repetitive reasoning loop
- Sub-agent invocations — in multi-agent setups, sessions spawned by other sessions are tracked separately but can be linked via parent session ID
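The token-spike check above is simple enough to automate. Here is a minimal sketch that flags sessions consuming more than 10x the median token count, the heuristic described in the list; the record shape is assumed:

```python
from statistics import median

def flag_token_spikes(sessions, multiplier=10):
    """Flag sessions whose token count exceeds `multiplier` times the
    median -- the pattern that usually signals oversized inputs or a
    repetitive reasoning loop. Field names are assumed, not a real schema."""
    baseline = median(s["tokens"] for s in sessions)
    return [s["id"] for s in sessions if s["tokens"] > multiplier * baseline]

sessions = [
    {"id": "sess_a", "tokens": 3_000},
    {"id": "sess_b", "tokens": 3_400},
    {"id": "sess_c", "tokens": 41_000},  # ~12x the median: worth investigating
]
print(flag_token_spikes(sessions))  # ['sess_c']
```

Median is a better baseline than mean here because the spike you are trying to catch would otherwise drag the average up and hide itself.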
Environments: Where Agent Configuration Lives
Environments in Anthropic Managed Agents serve a similar purpose to environments in any modern software deployment setup — they let you maintain separate configurations for development, staging, and production without having to manually swap settings.
What an Environment Contains
Each environment is a named container that holds:
- Default model selection — which Claude version agents in this environment use by default
- System prompt templates — base prompts that agents inherit unless overridden at the session level
- Tool and integration settings — which capabilities are enabled in this environment
- Rate limits and concurrency caps — how many sessions can run simultaneously, and at what token throughput
- Vault bindings — which credential vaults this environment has access to (more on vaults below)
The separation between environments means you can test a new system prompt in dev without touching production. It also means you can apply tighter rate limits in staging to simulate real-world constraints before full deployment.
Managing Multiple Environments
Most teams settle into a standard pattern: one environment per deployment stage (dev, staging, prod), plus optionally a sandbox environment for experimentation. The dashboard makes it easy to duplicate an environment configuration, which reduces the risk of manually recreating settings and introducing drift.
If you’re running agents for multiple clients or business units, you might also use environments as an isolation layer — each client gets their own environment with its own vault bindings and rate limits, so activity from one never bleeds into another.
Vaults: Secure Credential Storage for Agent Integrations
This is one of the more practically important parts of the dashboard, and also one of the easiest to overlook until something breaks.
Vaults solve a fundamental problem with production agents: your agents need credentials to do anything useful — API keys for external services, OAuth tokens, database passwords, webhook secrets — but those credentials can’t be hardcoded into your prompts or exposed through the conversation history.
How Vaults Work
A vault is an encrypted key-value store. You add credentials once (through the Vaults tab in the dashboard), and they’re stored securely. When an agent needs a credential, it references the vault key by name rather than the actual value. The credential is injected at runtime without ever appearing in session logs.
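To make the reference-by-name idea concrete, here is a sketch of runtime injection. The `{{vault:KEY}}` placeholder syntax is purely illustrative (the console's actual reference format may differ), and the plain dict stands in for the encrypted store:

```python
import re

def resolve_vault_refs(config, vault):
    """Replace placeholders like '{{vault:STRIPE_KEY}}' with values from a
    secure store at runtime. The placeholder syntax is illustrative -- the
    console's actual reference format may differ."""
    pattern = re.compile(r"\{\{vault:([A-Z0-9_]+)\}\}")

    def substitute(value):
        if not isinstance(value, str):
            return value
        return pattern.sub(lambda m: vault[m.group(1)], value)

    return {k: substitute(v) for k, v in config.items()}

# The agent configuration references the key *name*; the secret value never
# appears in the config, prompts, or session logs.
agent_config = {"stripe_auth": "Bearer {{vault:STRIPE_KEY}}", "retries": 3}
vault = {"STRIPE_KEY": "sk_live_abc123"}  # stands in for the encrypted store

resolved = resolve_vault_refs(agent_config, vault)
print(resolved["stripe_auth"])  # Bearer sk_live_abc123
```

The important property is that only the resolved runtime copy ever holds the secret; everything you commit, log, or display contains just the key name.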
The Vaults tab shows:
- Vault name — a label you assign
- Keys stored — the names of credentials in that vault (not the values themselves)
- Last modified — useful for tracking when credentials were rotated
- Environment bindings — which environments have access to this vault
- Access log — a record of when the vault was accessed and by which sessions
Creating and Managing Vault Entries
Adding a credential to a vault is straightforward: name the key, paste the value, save. The value is encrypted and the dashboard never shows it again after creation — you can only overwrite or delete it.
For credential rotation, the workflow is to add a new key with the updated value, update any agent references to the new key name, verify sessions are running correctly, then delete the old key. Doing it in that order avoids a window where agents lose access to a needed credential.
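The rotation order matters enough to be worth spelling out in code. This sketch models the vault and agent references as plain dicts (stand-ins for the real console objects) and performs the steps in the safe order, so both credentials are valid during the transition:

```python
def rotate_credential(vault, agent_refs, old_key, new_key, new_value):
    """Rotate a credential in the order described above: add the new key,
    repoint agent references, verify, then delete the old key. `vault` and
    `agent_refs` are simple dicts standing in for the real console objects."""
    # 1. Add the new key -- both old and new credentials are now valid.
    vault[new_key] = new_value
    # 2. Repoint every agent reference from the old key to the new one.
    for agent, key in agent_refs.items():
        if key == old_key:
            agent_refs[agent] = new_key
    # 3. Verify nothing dangles (in practice: also confirm sessions run cleanly).
    assert all(k in vault for k in agent_refs.values()), "dangling reference"
    # 4. Only now is it safe to delete the old key.
    del vault[old_key]
    return vault, agent_refs

vault = {"STRIPE_KEY": "sk_old"}
refs = {"billing-agent": "STRIPE_KEY"}
vault, refs = rotate_credential(vault, refs, "STRIPE_KEY", "STRIPE_KEY_2025", "sk_new")
print(refs["billing-agent"])  # STRIPE_KEY_2025
```

Reversing steps 1 and 4, deleting before adding, is exactly what creates the access gap the article warns about.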
Common Vault Pitfalls
- Binding vaults to the wrong environment — if production agents can’t find a credential, check that the correct vault is bound to the production environment
- Key name mismatches — agents reference vault keys by their exact string names; a typo in the agent configuration won’t surface as a clear error
- Not rotating credentials — vault access logs make it possible to audit whether old credentials are still being used before you retire them
Usage and Cost Monitoring
The cost section of the dashboard answers the questions every team eventually asks: how much is this actually costing, and which agents or sessions are responsible for most of that spend?
The Usage Overview
The main usage view shows:
- Total tokens consumed — broken down by input and output, since these are priced differently
- Total cost — in USD, for the selected time range
- Daily and hourly breakdowns — useful for spotting usage spikes tied to specific deployments or traffic patterns
- Model distribution — if you’re using multiple Claude versions, this shows the token and cost share per model
The model distribution view is particularly useful for cost optimization. Claude 3 Opus, Haiku, and Sonnet have meaningfully different price points. If you’re using Opus for tasks that Haiku could handle just as well, this view makes that visible.
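A quick back-of-the-envelope calculation shows why the model distribution view matters. The rates below are the per-million-token prices published at the Claude 3 family's launch; check current pricing before relying on them:

```python
# Per-million-token rates (USD) as published at Claude 3's launch --
# an assumption for illustration; verify current pricing before use.
PRICES = {
    "claude-3-opus":   {"input": 15.00, "output": 75.00},
    "claude-3-sonnet": {"input": 3.00,  "output": 15.00},
    "claude-3-haiku":  {"input": 0.25,  "output": 1.25},
}

def workload_cost(model, input_tokens, output_tokens):
    """Cost in USD for one workload on a given model."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000

# The same monthly workload priced on each model: 50M input, 10M output tokens.
for model in PRICES:
    print(f"{model}: ${workload_cost(model, 50_000_000, 10_000_000):,.2f}")
# claude-3-opus: $1,500.00
# claude-3-sonnet: $300.00
# claude-3-haiku: $25.00
```

At these rates, moving a Haiku-suitable workload off Opus is a 60x cost difference, which is why the distribution view is usually the first place to look when optimizing spend.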
Filtering by Project, Environment, and Session
You can filter the usage view by project, environment, or individual session. This lets you answer questions like:
- “How much did the customer support agent cost last month?”
- “Did the new prompt we shipped on Tuesday cause a cost increase?”
- “Which environment is consuming the most tokens?”
Filtering by environment is especially useful when you have dev and production running simultaneously — you don’t want dev experimentation distorting your production cost baseline.
Setting Usage Alerts
The dashboard lets you configure alerts that fire when token consumption or cost crosses a threshold you define. This is worth setting up before you go to production, not after. A runaway agent — one stuck in a loop or receiving unexpectedly large inputs — can generate significant cost in a short window.
Alerts can be sent via email or webhook. The webhook option is useful if you want to pipe alerts into Slack, PagerDuty, or an incident management system.
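Piping a webhook alert into Slack usually takes only a small translation layer. The alert payload's field names below are assumptions (adapt them to whatever the console actually delivers); the output is a valid Slack incoming-webhook message body:

```python
import json

def alert_to_slack(alert):
    """Convert a cost-alert webhook payload into a Slack incoming-webhook
    message body. The alert field names are assumptions -- adapt them to
    the payload the console actually delivers."""
    text = (
        f":rotating_light: *{alert['metric']}* crossed "
        f"{alert['threshold']} (current: {alert['current_value']}) "
        f"in environment `{alert['environment']}`"
    )
    return json.dumps({"text": text})

# Example assumed payload; POST the result to your Slack webhook URL.
payload = alert_to_slack({
    "metric": "daily_cost_usd",
    "threshold": 50,
    "current_value": 63.20,
    "environment": "production",
})
print(payload)
```

The same shape works for PagerDuty or any incident tool: receive the alert, reformat it, and forward it to the destination's webhook endpoint.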
Understanding Token Costs for Multi-Agent Workflows
Multi-agent setups, where one Claude instance orchestrates others, can accumulate token costs faster than single-agent workflows because each sub-agent session has its own input context. If the orchestrator passes the full conversation history to each sub-agent, that context gets re-processed (and re-billed) with every invocation.
The Sessions tab helps here — you can trace the parent-child relationships between sessions and see exactly how many tokens each tier of the hierarchy consumed. That’s the starting point for optimization decisions like context pruning or passing summarized state instead of raw history.
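If you export the session records, per-tier aggregation is a short script. This sketch follows `parent_session_id` links (a field name assumed from the parent-child linking described above) and sums tokens by depth in the hierarchy:

```python
from collections import defaultdict

def tokens_by_tier(sessions):
    """Sum token counts per tier of a session hierarchy by walking
    parent_session_id links (field names assumed, not a documented schema).
    Tier 0 is the orchestrator; tier 1 its direct sub-agents; and so on."""
    by_id = {s["id"]: s for s in sessions}

    def depth(s):
        d = 0
        while s.get("parent_session_id"):
            s = by_id[s["parent_session_id"]]
            d += 1
        return d

    totals = defaultdict(int)
    for s in sessions:
        totals[depth(s)] += s["tokens"]
    return dict(totals)

sessions = [
    {"id": "orc",  "parent_session_id": None,  "tokens": 12_000},  # orchestrator
    {"id": "sub1", "parent_session_id": "orc", "tokens": 30_000},  # sub-agents each
    {"id": "sub2", "parent_session_id": "orc", "tokens": 28_000},  # re-process context
]
print(tokens_by_tier(sessions))  # {0: 12000, 1: 58000}
```

When tier 1 dwarfs tier 0, as here, the orchestrator is probably passing full history to every sub-agent, and summarized state is the first optimization to try.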
How MindStudio Fits Into Multi-Agent Workflows
If you’re building the agents that run inside Anthropic’s infrastructure, you still need a place to design, iterate, and manage the agent logic itself. That’s where a platform like MindStudio becomes relevant.
MindStudio is a no-code builder for AI agents and automated workflows. It supports 200+ AI models — including Claude — and lets you build agents visually without writing infrastructure code. Once you’ve built an agent in MindStudio, you can connect it to external systems, schedule it, expose it as an API endpoint, or chain it together with other agents.
For teams using the Anthropic Managed Agents dashboard to monitor production sessions, MindStudio handles the upstream problem: what does the agent actually do, step by step? You can design a multi-step Claude workflow in MindStudio — with branching logic, tool calls, and conditional outputs — and then monitor the downstream cost and session activity in the Anthropic Console.
The combination is particularly useful for teams that want to iterate on agent behavior quickly (MindStudio’s visual builder) while maintaining the visibility and credential management that comes with Anthropic’s managed infrastructure.
You can try MindStudio free at mindstudio.ai — the average agent build takes 15 minutes to an hour.
For more on building Claude-powered workflows, see how multi-agent automation works in MindStudio and how to connect agents to external tools without managing credentials manually.
Common Questions About the Anthropic Managed Agents Dashboard
What’s the difference between a session and a conversation?
A session is a broader concept than a conversation turn. A session begins when an agent run is triggered and ends when the agent completes its task, errors out, or times out. Within a session, there can be multiple turns of back-and-forth with the model, plus any tool calls the agent makes. Think of a session as the full lifecycle of one unit of work.
How do I find a specific failed session for debugging?
Filter the Sessions tab by status (set to “failed”) and narrow the time range to when you believe the failure occurred. If you have the session ID from your logs, you can search directly by that. Once you’re in the detail view, look at the last tool call or model response before the failure — that’s usually where the error originated.
Can I share vault credentials across multiple environments?
Yes. A vault can be bound to multiple environments. The typical pattern is to use separate vaults for production (tighter access) and dev/staging (shared access), but the platform doesn’t enforce that — you decide the binding structure based on your security requirements.
Are token costs billed per session or aggregated monthly?
Costs accrue per token at the model's rate and are aggregated into a monthly billing cycle. The dashboard shows both the granular per-session cost and the rolling monthly total, so you can track either in real time.
How do I monitor costs for a specific agent without filtering manually each time?
Organize your agents into projects in the Anthropic Console. Once grouped by project, you can view usage filtered to that project without resetting filters each time. Projects also make it easier to set per-project budget alerts.
What happens when an agent session times out?
Timed-out sessions appear in the Sessions tab with a “timed out” status. The agent stops mid-run and the session is closed. No partial output is returned to the caller. You can configure session timeout limits at the environment level — setting them too low causes legitimate long-running tasks to fail, while setting them too high can let stuck agents run indefinitely.
Key Takeaways
- The Sessions tab tracks every agent run with status, duration, token count, and cost — drill into individual sessions to trace tool calls and debug failures.
- Environments separate your dev, staging, and production configurations, including model defaults, rate limits, and vault bindings.
- Vaults store credentials encrypted and inject them at runtime, so API keys and secrets never appear in conversation logs or prompts.
- Usage and cost monitoring lets you filter spend by project, environment, and model — set up threshold alerts before going to production, not after.
- In multi-agent workflows, token costs compound quickly — use session tracing to identify which tier of the hierarchy is responsible for the majority of spend.
- Tools like MindStudio handle agent design and workflow logic upstream, while the Anthropic Console handles runtime visibility and security downstream — the two complement each other well.