Nowadays, AI is widely applied in various fields, especially in daily tasks for office workers and programmers. While AI excels at analyzing and understanding data, it still faces limitations in storing information, retrieving external data, and interacting with the surrounding environment.
In this article, I will introduce a method that helps AI models expand their data access capabilities and interact more flexibly with system entities.
Modern AI models are highly efficient at analyzing information, but they cannot update themselves with new knowledge. This is because information constantly changes, and sometimes, AI lacks the permission or ability to access the required data. In reality, humans also face similar limitations in accessing and processing information.
For example, an AI cannot read your personal emails, nor does it know the latest controversies surrounding the streamer ViruSs, simply because this information emerged after the AI model was trained.
Another limitation is the ability to interact with the environment. For instance, AI cannot automatically send messages or play a song on YouTube from your computer without external tools.
In short, think of AI as a human brain—it has strong analytical capabilities but also has constraints. It cannot know everything, nor can it access or manipulate everything on its own without supporting tools.
The AI community has been developing a tool, a protocol that enables AI models to communicate and utilize supporting tools for searching, expanding knowledge, and interacting more effectively with the surrounding environment.
Let’s draw a parallel to humans: the protocol that allows us to expand our knowledge and access information is language. Language acts as a natural protocol, enabling the human brain to reason, analyze, and acquire new knowledge without memorizing everything.
Similarly, AI models also need a language to interact with supporting tools. This protocol is called Model Context Protocol (MCP), which allows AI to connect with programmed tools and extend its capabilities beyond its initial dataset.
I won’t go into a rigid or overly detailed explanation, as you can find all the necessary information on the official Model Context Protocol website here: https://modelcontextprotocol.io.
Take a look at the diagram below:
Notes: The MCP host can be AI applications like Claude Desktop or Cursor AI Editor.
The MCP server is a system that extends AI models' knowledge and interaction capabilities by providing specialized tools.
For example:
With these MCP servers, AI can expand its abilities and perform tasks beyond its built-in data.
Each MCP server describes the functions it provides so that AI models can use them. When a user queries the AI model, the model will automatically determine:
This process enables AI to not only analyze information but also understand when and how to extend its capabilities through supporting tools.
Suppose I ask an AI model: “Calculate the total amount transacted through Vietcombank this month.” The AI would process the request as follows:
1️⃣ Based on the user’s query and descriptions of the MCP functions, the AI determines that it needs to use the email retrieval function.
2️⃣ The AI extracts the time range "this month" and filters emails from Vietcombank to call the appropriate function.
3️⃣ The AI receives a list of transaction-related emails from Vietcombank.
4️⃣ The AI sequentially calls a function to fetch the details of each email, analyzes them, and extracts the transaction amounts.
5️⃣ Finally, the AI sums up the amounts and returns the total to the user.
Thanks to this mechanism, AI does not just respond based on pre-existing data but also learns to search, retrieve, and process information from external sources, providing more accurate and real-world answers. 🚀
At its core, MCP servers are just regular processes. They communicate with the MCP host through one of the following two methods:
🔹 stdin/stdout: The MCP host launches an MCP server as a child process and communicates via stdin/stdout.
🔹 SSE (Server-Sent Events): The MCP host and MCP server operate as independent processes, communicating over HTTP, similar to a traditional web backend.
These two methods provide flexibility for the MCP host in connecting with MCP servers, depending on system architecture and usage needs. 🚀
In my Claude Desktop setup, MCP is configured as follows:
In Cursor AI Editor:
If you want to create your own MCP setup, check out the detailed guide here:
https://modelcontextprotocol.io/introduction.
If you want to enhance AI models’ search capabilities, refer to my article:
Build a RAG System for AI Agent.
Model Context Protocol (MCP) is a significant step forward in extending AI models’ capabilities. Through MCP, we can:
With the rapid advancement of AI technology, MCP is gradually becoming a crucial standard for building intelligent and flexible AI applications. I hope this article has given you an overview of MCP and how you can apply it to your projects.
See you in the next article! 🚀