
MCP Servers Explained: What They Are and Why Every AI Agent Needs Them

MCP servers give AI agents structured access to tools, APIs, and data sources. Learn what they are, how authentication works, and when to use them.

MindStudio Team

The Problem AI Agents Have Without MCP Servers

An AI agent that can only reason is not very useful in production. It needs to read files, call APIs, query databases, send messages, and take actions in external systems. But for a long time, there was no standard way to give agents that kind of access. Every tool integration was a one-off: custom code, custom schemas, custom authentication handling.

MCP servers solve this. The Model Context Protocol (MCP) is an open standard that defines how AI agents discover and call external tools and data sources. If you’re building anything with AI agents — whether it’s an automation workflow, a coding assistant, or a multi-step research pipeline — understanding MCP servers is not optional. It’s foundational.

This article covers what MCP servers are, how they work, how authentication is handled, what the real tradeoffs are, and when you should actually use one.


What an MCP Server Actually Is

MCP stands for Model Context Protocol. It’s a specification, originally developed by Anthropic, that defines a standard interface between an AI agent (the client) and external tools or data sources (the server).

An MCP server is any process that implements that interface. It exposes a set of tools — callable functions with typed inputs and outputs — that an agent can discover at runtime and invoke during a task.

Think of it like this: REST APIs exist so HTTP clients can talk to backend services in a predictable way. MCP servers exist so AI agents can talk to tools and data in a predictable way.

The key difference from a plain API is that MCP servers are specifically designed to be agent-readable. The tool definitions include names, descriptions, and parameter schemas that a language model can understand and reason about. The agent doesn’t need hardcoded knowledge of what tools exist — it can ask the server, read the tool list, and decide which one to call.

For a deeper look at the architecture, what MCP servers are and how they work is worth reading before you go much further.


How MCP Servers Work: The Core Architecture

The Three Components

Every MCP interaction involves three things:

  1. The client — the AI agent (or the host application running it). This is what makes requests.
  2. The MCP server — the process that exposes tools and handles requests. This is what the agent calls.
  3. The underlying service — the actual thing the MCP server wraps: a database, a third-party API, a filesystem, an internal service.

The agent never talks directly to the underlying service. It talks to the MCP server, which then handles the actual interaction.

The Request Cycle

Here’s what happens when an agent uses an MCP server:

  1. Discovery — The agent connects to an MCP server and calls tools/list. The server returns a list of available tools, each with a name, description, and JSON Schema for inputs and outputs.
  2. Selection — The agent (guided by the LLM) reads the tool descriptions and decides which one to call based on the task at hand.
  3. Invocation — The agent calls tools/call with the tool name and a JSON payload matching the schema.
  4. Execution — The MCP server receives the call, validates the inputs, executes the underlying action, and returns a structured result.
  5. Continuation — The agent gets the result back in its context and decides what to do next.

This cycle is what enables multi-step reasoning in AI agents. The agent isn’t just generating text — it’s taking actions, observing results, and adjusting its next steps based on what it learns.
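The discovery and invocation steps above can be sketched as plain JSON-RPC messages. A minimal Python sketch: the `tools/list` and `tools/call` method names come from the MCP spec, but the `search_orders` tool and its schema are hypothetical, made up for illustration.

```python
import json

def jsonrpc(method, params, id_):
    """Build a JSON-RPC 2.0 request envelope, as MCP uses for every call."""
    return {"jsonrpc": "2.0", "id": id_, "method": method, "params": params}

# Step 1: discovery. The client asks the server what tools exist.
list_req = jsonrpc("tools/list", {}, 1)

# A server might answer with something like this (hypothetical tool):
list_resp = {
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "tools": [{
            "name": "search_orders",
            "description": "Search the orders database by customer email.",
            "inputSchema": {
                "type": "object",
                "properties": {"email": {"type": "string"}},
                "required": ["email"],
            },
        }]
    },
}

# Step 3: invocation. The agent calls the chosen tool with arguments
# that match the advertised schema.
call_req = jsonrpc(
    "tools/call",
    {"name": "search_orders", "arguments": {"email": "ada@example.com"}},
    2,
)

print(json.dumps(call_req, indent=2))
```

The server's structured result then lands back in the agent's context (step 5), and the loop repeats.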

Transport Methods

MCP servers communicate over one of two transports:

  • stdio — The client spawns the server as a subprocess and communicates over standard input/output. Common for local development and tool-specific servers (like a filesystem MCP).
  • Streamable HTTP — The server runs as a remote process, and the client connects over HTTP, with Server-Sent Events used for streaming responses (earlier spec revisions called this transport “HTTP with SSE”). Better for hosted, multi-tenant, or production deployments.

The protocol itself is JSON-RPC 2.0 in both cases, so the message format is the same regardless of transport.
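Over stdio, messages are typically framed as newline-delimited JSON. A rough sketch of that framing, simulated with an in-memory buffer instead of a real subprocess pipe:

```python
import io
import json

def send(stream, msg):
    """stdio transport framing: one JSON-RPC message per line."""
    stream.write(json.dumps(msg) + "\n")

def recv(stream):
    """Read one newline-delimited JSON-RPC message off the stream."""
    return json.loads(stream.readline())

# Simulate the client's side of the pipe; a real client would use the
# subprocess's stdin/stdout instead of StringIO.
wire = io.StringIO()
send(wire, {"jsonrpc": "2.0", "id": 1, "method": "tools/list", "params": {}})
wire.seek(0)

msg = recv(wire)
print(msg["method"])
```

With the HTTP transport, the same JSON-RPC objects travel in request and response bodies instead of stdin/stdout lines.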


What MCP Servers Expose

MCP servers can expose three types of capabilities, though tools are by far the most common in practice.

Tools

Tools are callable functions. They take typed inputs and return structured outputs. A tool might:

  • Search a database and return rows
  • Send an email
  • Create a calendar event
  • Read a file
  • Call a third-party API endpoint
  • Write data back to a system of record

Tools are where agents do most of their actual work. The quality of a tool’s description — its name, what it does, what its parameters mean — directly affects whether an agent will call it correctly.
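To make that concrete, compare a vague tool definition with one a model can actually reason about. Both definitions are hypothetical; the shape follows MCP's `inputSchema` convention (JSON Schema for parameters):

```python
# A vague definition the model will struggle to select or call correctly:
bad_tool = {
    "name": "proc",
    "description": "Processes data.",
    "inputSchema": {"type": "object", "properties": {"d": {"type": "string"}}},
}

# The same idea, described so an LLM knows when and how to use it:
good_tool = {
    "name": "create_calendar_event",
    "description": (
        "Create a calendar event. Use when the user asks to schedule a "
        "meeting. Times must be ISO 8601 with a timezone offset."
    ),
    "inputSchema": {
        "type": "object",
        "properties": {
            "title": {"type": "string", "description": "Short event title"},
            "start": {"type": "string", "description": "ISO 8601 start time"},
            "duration_minutes": {"type": "integer", "minimum": 5},
        },
        "required": ["title", "start"],
    },
}

print(good_tool["name"])
```

The difference is entirely in the descriptions and constraints; the protocol treats both definitions identically, but the agent does not.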

Resources

Resources are data that the server makes available for the agent to read. Unlike tools, resources aren’t functions you call with arguments — they’re more like documents or datasets the agent can pull into its context. Think: the content of a file, a database record, a page from Notion.

Resources are useful when you want the agent to have read access to structured data without needing to define a tool for every possible query.
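The request shape makes the contrast with tools clear: a resource is addressed by URI and read, not called with arguments. A sketch (the URI is hypothetical; `resources/read` is the method name the MCP spec uses):

```python
# Reading a resource: pass a URI, get content back -- no argument schema.
read_req = {
    "jsonrpc": "2.0",
    "id": 3,
    "method": "resources/read",
    "params": {"uri": "file:///notes/roadmap.md"},
}

print(read_req["params"]["uri"])
```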

Prompts

Prompts are templated message sequences the server can expose — basically reusable instructions that the host application can inject into the agent’s context. These are less commonly used than tools and resources.


How Authentication Works with MCP Servers

Authentication is one of the most important — and least well-explained — aspects of MCP servers. If your server wraps a third-party service like Gmail, GitHub, or Salesforce, it needs credentials to act on behalf of a user. How that works depends on where the server runs.

Static Credentials (Simple Case)

For many server implementations, authentication is handled at configuration time. You set environment variables (like GITHUB_TOKEN or NOTION_API_KEY) when you start the server, and the server uses those credentials for all requests.

This is simple but rigid. There’s no per-user auth, no token scoping, and no way to handle multiple users. Fine for a personal dev setup; not great for production.
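The pattern is simple enough to sketch: the server reads its secrets once at startup and fails fast if they are missing. The `GITHUB_TOKEN` variable matches the example above; the rest is an assumption about how a typical server would use it.

```python
import os

def load_credentials():
    """Static-credential pattern: read secrets once at startup, fail fast."""
    token = os.environ.get("GITHUB_TOKEN")
    if not token:
        raise RuntimeError("GITHUB_TOKEN is not set; refusing to start")
    # Every downstream request reuses this one credential -- no per-user auth.
    return {"Authorization": f"Bearer {token}"}

# Stand-in for real configuration, so the sketch runs end to end:
os.environ.setdefault("GITHUB_TOKEN", "demo-token")
print(load_credentials())
```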

OAuth 2.0 (The Right Answer for Production)

The MCP specification includes an OAuth 2.0 authorization framework for remote servers. When an agent connects to a remote MCP server that requires user-level auth, the flow works like this:

  1. The server advertises its auth requirements.
  2. The client (host application) initiates an OAuth flow and gets an authorization code.
  3. The code is exchanged for an access token.
  4. The client includes the access token in subsequent MCP requests.
  5. The server validates the token and uses it when calling the underlying service.

This is the model that makes sense for services like Gmail, Slack, or Google Drive — where every user has their own credentials and the agent should only access what that user can access.

Agent identity infrastructure is an evolving area, and OAuth is currently the closest thing to a real standard for giving agents delegated access to services. It’s not perfect — token refresh, scope management, and per-agent identity are still messy problems — but it’s what production systems use today.
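Step 3, the code-for-token exchange, uses the standard OAuth 2.0 authorization-code parameters (RFC 6749). A sketch that only builds the request body; the endpoint, client ID, and redirect URI are placeholders, not part of the MCP spec:

```python
from urllib.parse import urlencode

def token_exchange_body(code, client_id, redirect_uri):
    """Standard OAuth 2.0 authorization-code exchange parameters."""
    return urlencode({
        "grant_type": "authorization_code",
        "code": code,
        "client_id": client_id,
        "redirect_uri": redirect_uri,
    })

body = token_exchange_body("abc123", "my-agent", "https://host.example/callback")
print(body)

# Step 4: once exchanged, the access token rides along on every MCP request.
headers = {"Authorization": "Bearer <access-token>"}
```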

API Keys Passed at Runtime

Some implementations pass credentials as part of the tool invocation rather than at server startup. The agent (or the platform running it) includes the API key in the request payload. This puts more control in the client’s hands but requires the client to manage secrets securely.


The Real Limitations of MCP Servers

MCP is a useful standard, but it has genuine tradeoffs worth knowing before you build on it.

Wrapping an API Is Not Enough

The most common mistake is treating MCP server creation as “just wrap an API.” But the MCP server trap is real: if the underlying API returns data in a format that’s hard for an agent to reason about — deeply nested JSON, ambiguous field names, missing context — the MCP wrapper doesn’t fix any of that. The agent gets confused output with a nicer interface.

Good MCP servers don’t just proxy API responses. They transform and structure data so it’s genuinely agent-readable. That’s additional work that most tutorials skip over.

Token Overhead

Every tool result that comes back from an MCP server gets added to the agent’s context. If you’re calling tools that return large payloads, you burn through your context window fast. Token overhead with MCP servers is a real performance and cost issue in production, especially for coding agents that call many tools in sequence.

Good tool design limits response sizes. Return only what the agent needs for the next step, not everything the underlying API provides.
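One way to enforce that discipline is to project and truncate results on the server side, before they ever reach the agent's context. A hypothetical sketch; `MAX_RESULT_CHARS` is an illustrative budget, not a standard, and real servers would tune it per model:

```python
import json

MAX_RESULT_CHARS = 2000  # illustrative context budget

def trim_tool_result(rows, fields, max_chars=MAX_RESULT_CHARS):
    """Keep only the fields the agent needs, and cap total payload size."""
    # Projection: drop everything outside the requested fields.
    slim = [{k: r[k] for k in fields if k in r} for r in rows]
    # Truncation: drop trailing rows until the payload fits the budget.
    if len(json.dumps(slim)) > max_chars:
        while slim and len(json.dumps(slim)) > max_chars:
            slim.pop()
        slim.append({"truncated": True})
    return slim

rows = [{"id": i, "name": f"n{i}", "blob": "x" * 500} for i in range(20)]
out = trim_tool_result(rows, ["id", "name"], max_chars=200)
print(len(out))
```

The agent still gets enough to take its next step, and the `truncated` marker tells it more data exists if it needs to ask again.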

The N×M Integration Problem

If you have N agents and M tools, you potentially need N×M integrations. MCP helps standardize the interface, but it doesn’t eliminate the combinatorial problem of connecting many agents to many services. This is why solutions like the agent integration layer exist — to manage those connections at scale rather than building them one by one.

Security Risks

MCP servers that accept calls from agents need to be treated as attack surfaces. An agent acting on a user’s behalf with broad tool access can cause real damage if it has been manipulated via prompt injection. AI agent security is a topic that’s easy to ignore during development and painful to deal with in production.

Limit tool scope. Require confirmation for destructive actions. Log everything. Don’t give an agent access to tools it doesn’t need for the task.
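Those practices can be wrapped around every tool call at the server boundary. A hypothetical sketch: the tool names, the `confirm` callback, and the guard itself are assumptions for illustration, not part of the MCP spec.

```python
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("mcp-guard")

# Hypothetical tool names flagged as destructive for this deployment.
DESTRUCTIVE = {"delete_record", "send_email"}

def guarded_call(tool_name, args, execute, confirm):
    """Log every call; require confirmation before destructive tools run."""
    log.info("tool call: %s %r", tool_name, args)
    if tool_name in DESTRUCTIVE and not confirm(tool_name, args):
        return {"error": "confirmation required, call rejected"}
    return execute(tool_name, args)

# Destructive call without confirmation is rejected:
rejected = guarded_call(
    "delete_record", {"id": 1},
    execute=lambda n, a: {"ok": True},
    confirm=lambda n, a: False,
)

# Read-only call passes straight through:
allowed = guarded_call(
    "read_file", {"path": "notes.txt"},
    execute=lambda n, a: {"ok": True},
    confirm=lambda n, a: False,
)
```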


Who Is Adopting MCP — and Why It Matters

MCP started as Anthropic’s internal protocol and was open-sourced in late 2024. Adoption has moved quickly since.

Anthropic’s own products — Claude Desktop, Claude.ai, and the Claude API — all support MCP natively. The Conway Extension Format adds a proprietary layer on top of MCP for some Anthropic-specific features, but the base protocol is open.

OpenAI has adopted the same tool-calling patterns, and there’s growing cross-vendor convergence around how agents call external tools. Agent skills as an open standard is a real trend — Claude, OpenAI, and Google are all moving toward compatible tool-calling interfaces.

Slack now supports MCP natively, which means Slackbot can act as an agentic teammate — pulling data, running tasks, and interacting with external services directly from within Slack conversations.

Apple has been moving toward MCP-compatible interfaces through App Intents and Siri extensions, part of a broader shift in how Apple’s AI strategy handles agent-accessible system capabilities.

The picture that’s emerging is an ecosystem where MCP is the standard handshake between agents and tools, regardless of which LLM powers the agent or which platform it runs on.


When to Use an MCP Server (and When Not To)

MCP servers are the right choice when:

  • You need a standard, reusable interface that multiple agents can use
  • You’re connecting to a service that requires auth and you want to handle that in one place
  • You want tool discovery to be dynamic rather than hardcoded
  • You’re building on top of an ecosystem (like Claude Desktop or a platform that supports MCP) where MCP is the expected integration format

MCP servers are probably overkill when:

  • You have one agent and one tool and the integration never changes
  • You just need a quick API call and a simple function definition is enough
  • Latency is critical and you don’t want the extra hop through a server process

For many production use cases, the question isn’t “MCP vs. no MCP” but “which MCP server, and how do I design the tools inside it well?”

Understanding where MCP fits in the broader stack — alongside orchestration, memory, identity, and deployment — is part of understanding the agent infrastructure stack as a whole.


How Remy Handles Tool Access

Remy is a spec-driven development environment — you describe your application in annotated markdown and Remy compiles it into a full-stack app with a real backend, database, and auth.

When your app includes integrations with external services, Remy handles the connectivity layer without you needing to wire up individual MCP servers by hand. The platform — built on MindStudio’s infrastructure, which includes 1,000+ integrations — manages the connection between your app’s backend and the services it needs to talk to.

If you’re building an agent-powered application that needs to call Notion, Gmail, Slack, or a custom internal API, you describe what the app does in the spec. The integration layer is handled for you. You’re not writing OAuth flows or tool schemas from scratch.

This is part of what makes spec-driven development practical at the application level. You focus on what your app should do; the compiled output handles the how.

You can try Remy at mindstudio.ai/remy.


Frequently Asked Questions

What is an MCP server in simple terms?

An MCP server is a process that exposes a set of tools — callable functions with typed inputs and outputs — that an AI agent can discover and call during a task. It sits between the agent and the actual service or data source the agent needs to interact with. The agent asks “what tools do you have?” and the server responds with a list. The agent then calls whichever tools it needs.

How is an MCP server different from a regular API?

A regular API is designed for software developers to integrate with by writing code. An MCP server is designed for AI agents to discover and use at runtime. The key difference is that MCP tool definitions include natural-language descriptions that a language model can read and reason about — the agent doesn’t need to have the API hardcoded into its instructions. It discovers what’s available and decides what to call.

Do I need to build my own MCP server to use MCP?

No. There are hundreds of pre-built MCP servers available for common services: GitHub, Notion, Slack, Gmail, Google Drive, Linear, Jira, and more. If you’re connecting to a common service, there’s probably an existing server you can use. If you’re connecting to a custom internal system, you’ll need to build one — but the protocol is well-documented and there are SDKs in Python, TypeScript, and other languages.

How does authentication work with MCP servers?

For local or single-user setups, authentication is usually handled via static environment variables — API keys set when the server starts. For production, remote MCP servers use OAuth 2.0: the agent’s host platform handles the OAuth flow, gets an access token, and passes it with each request. The server validates the token and uses it to call the underlying service on behalf of the authenticated user.

Are MCP servers secure?

They can be, but security isn’t automatic. The main risks are prompt injection (an attacker manipulates the agent into calling tools it shouldn’t) and overly broad tool permissions. Best practices include: giving agents access only to tools they need, requiring user confirmation for destructive actions, validating inputs at the server level, and logging all tool calls. Progressive autonomy — starting agents with limited permissions and expanding gradually — is a useful principle here.

What’s the difference between MCP tools, resources, and prompts?

Tools are callable functions — the agent passes arguments and gets a result back. Resources are readable data — documents, records, or datasets the agent can pull into its context. Prompts are reusable message templates that a host application can inject into an agent’s context. Tools are by far the most commonly used of the three in production agent systems.


Key Takeaways

  • MCP servers are the standard interface between AI agents and external tools, APIs, and data sources — using a discoverable, agent-readable format.
  • The protocol uses JSON-RPC over stdio or HTTP, with tool discovery via tools/list and invocation via tools/call.
  • Authentication ranges from static API keys (simple) to OAuth 2.0 (right for production multi-user deployments).
  • Real limitations include token overhead from large responses, the complexity of the N×M integration problem, and security risks if tool access isn’t scoped carefully.
  • Adoption is accelerating — Claude, OpenAI tools, Slack, and Apple are all converging toward compatible agent-tool interfaces, making MCP a foundational standard worth learning now.
  • Good MCP design isn’t just wrapping an API. It means structuring responses so agents can actually reason about them.

If you’re building applications that need agents to interact with real services, try Remy — a spec-driven environment where the integration layer is handled for you.

Presented by MindStudio
