AI - Agent Protocols
As AI agents move from "cool demos" to "enterprise tools," the biggest challenge isn't intelligence—it’s interoperability.
For an agent to be useful, it needs to talk to your database, your calendar, and other agents. Without standards, developers have to write custom "glue code" for every single integration.
MCP (Model Context Protocol)
Introduced by Anthropic in late 2024, MCP is quickly becoming the "USB port" for AI agents.
- The Problem: Previously, if you wanted ChatGPT or Claude to read your Google Drive, someone had to write a specific integration. If you wanted it to read a Slack channel, that was another integration.
- The Solution: MCP is an open standard that allows developers to build a single "server" for their data that any MCP-compliant AI agent can plug into.
- Why it matters: It decouples the Model (the brain) from the Context (your data). You can swap out the AI model without having to rebuild all your data connections. It allows agents to securely browse local folders, query SQL databases, or check your Evernote through a universal interface.
A2A (Agent-to-Agent) Communication
We are moving away from a single "God-Model" toward Multi-Agent Systems (MAS). For this to work, agents need a protocol to talk to each other without human intervention.
- The Protocol: While there isn't one single "A2A" brand yet, standards like the Agent Protocol (hosted by the AI Engineer Foundation) are emerging.
- How it works: It defines a standardized API specification for interacting with AI agents. It establishes how an "Initiator Agent" sends a task to a "Participant Agent," how they share files, and how they report completion or failure.
- The "Handoff": Imagine a Travel Agent AI. It realizes it needs to book a flight, so it calls a Payment Agent. A2A protocols ensure the Payment Agent understands the request, confirms the transaction, and sends back a receipt in a format the Travel Agent can process.
Tool Use & Function Calling (The Foundation)
Before MCP, we had Function Calling (standardized largely by OpenAI’s JSON schema).
- This is the "language" agents use to trigger actions. Instead of the AI just saying "I will check the weather," the model outputs a structured piece of code (JSON) like:
get_weather(city: "New York"). - Standardization: Most model providers (Google, Meta, Mistral) have now converged on the same JSON-based schema for tool use, making it easier for developers to build "agentic" apps that work across different LLMs.
Semantic Kernels and Orchestration Frameworks
Frameworks like Microsoft’s Semantic Kernel, LangGraph, and CrewAI act as the "Operating Systems" for these protocols.
- They provide the structure for State Management (remembering what happened in step 1 while doing step 5) and Planning.
- They ensure that when an agent uses an MCP server or talks A2A, the "memory" of that interaction is preserved across the entire workflow.
The "CLI vs. MCP" debate is currently a hot topic among AI engineers. To understand why someone would claim the CLI (Command Line Interface) is better than MCP (Model Context Protocol), you have to look at the trade-off between raw power and structured control.
Here is the breakdown of the argument for CLI, the counter-argument for MCP, and why the "winner" depends on what you are building.
Why should you care?
Without these standards, we have "walled gardens"—an AI that can only work inside Microsoft Word or only inside Google Workspace.
With MCP and A2A protocols, we get the "Agentic Web":
- Portability: You can take your custom-built "Research Agent" and use it across different platforms.
- Scalability: Companies can build a library of "Micro-Agents" (one for billing, one for tech support, one for scheduling) that all play nice together.
- Security: Standardized protocols allow for standardized permissions. You can grant an agent "Read-Only MCP access" to a specific folder rather than giving it the keys to your entire computer.
The Bottom Line: We are building the "Internet of Agents." Just as HTTP allowed any computer to talk to any website, protocols like MCP are allowing any AI to work with any tool.
Is CLI better than MCP?
The Argument for CLI: "The Universal Interface"
Proponents of CLI (Command Line Interface) argue that it is the ultimate "God Mode" for an agent.
- Unmatched Power: If an agent has access to a terminal, it can do anything. It can install packages, write files, run scripts, query databases, and deploy code. It doesn't need a specific "protocol" for every tool; the tool just needs to exist in Linux.
- No Abstraction Leakage: When you use a protocol like MCP, you are adding a layer between the AI and the machine. CLI is direct. There is no "translation" error between what the AI wants to do and what the machine executes.
- Agent-Native: The most advanced agents today (like Devin or OpenDevin) are essentially "LLMs with a terminal." They don't wait for an MCP server to be built for a new tool; they just
pip installthe library they need and use it immediately. - Universality: Every server, cloud provider, and laptop in the world already speaks CLI. You don't need to convince the world to adopt a new standard.
The Argument for MCP: "The Safe & Structured Bridge"
Proponents of MCP argue that giving an AI a raw CLI is like giving a chainsaw to a toddler—it’s powerful, but it’s going to end in a mess.
- Context Efficiency (The "LLM Tax"): Terminals are noisy. If an agent runs a command that spits out 500 lines of logs, it eats up the model's "context window" (its memory). MCP serves structured data. Instead of a wall of text, MCP provides exactly the JSON or text the model needs, making it faster and cheaper to run.
- Safety & Guardrails: With a CLI, an agent can accidentally run
rm -rf /or delete a production database. MCP acts as a permission layer. You can expose a specific function (e.g., "Read Sales Data") without giving the agent access to the entire file system. - Discovery: When an agent connects to an MCP server, the server "tells" the agent exactly what it can do: "I have these 3 tools, and here are the parameters they require." In a CLI, the agent has to "guess" or read long man-pages to figure out how to use a tool.
- Non-Technical Integration: MCP can connect to things that don't have a CLI, like a Notion workspace, a Slack channel, or a specialized medical database, providing a unified way for the AI to "see" that data.
Why the claim "CLI is better" exists:
Someone claiming CLI is better is likely a Software Engineer building Autonomous Coding Agents. In that world, the agent needs to be in the terminal to compile code, run tests, and debug. For them, MCP feels like "training wheels" that slow them down.
Why MCP is winning the "Enterprise" battle:
If you are a Company wanting to let an AI agent help your HR team, you are never going to give that agent a CLI to your company servers. You will use MCP to give the agent a very specific "window" into your HR software.
The Verdict:
- CLI is for "creators": Agents that build software and manage systems.
- MCP is for "consumers": Agents that need to interact with specialized data, APIs, and business tools safely.
In the future, the best agents will likely use both: They will use a CLI to do the heavy lifting and coding, and MCP servers to "reach out" and grab data from the rest of the web.