When does MCP make sense vs CLI?

(ejholmes.github.io)

84 points | by ejholmes 2 hours ago

30 comments

  • jackfranklyn 17 minutes ago
    The token budget angle is what makes this a real architectural decision rather than a philosophical one.

    I've been using both approaches in projects and the pattern I've landed on: MCP for anything stateful (db connections, authenticated sessions, browser automation) and CLI for stateless operations where the output is predictable. The reason is simple - MCP tool definitions sit in context permanently, so you're paying tokens whether you use them or not. A CLI you can invoke on demand and forget.

    The discovery aspect is underrated though. With MCP the model knows what tools exist and what arguments they take without you writing elaborate system prompts. With CLI the model either needs to already know the tool (grep, git, curl) or you end up describing it anyway, which is basically reinventing tool definitions.

    Honestly the whole debate feels like REST vs GraphQL circa 2017. Both work, the answer depends on your constraints, and in two years we'll probably have something that obsoletes both.

  • goranmoomin 1 hour ago
    I can't believe everyone is talking about MCP vs CLI and which is superior; both are a method of tool calling, it does not matter which format the LLM uses for tool calling as long as it provides the same capabilities. CLIs might be marginably better (LLMs might have been trained on common CLIs), but MCPs have their uses (complex auth, connecting users to data sources) and in my experience if you're using any of the frontier models, it doesn't really matter which tool calling format you're using; a bespoke format also works.

    The difference that should be talked about, should be how skills allow much more efficient context management. Skills are frequently connected to CLI usage, but I don't see any reason why. For example, Amp allows skills to attach MCP servers to them – the MCP server is automatically launched when the Agent loads that skill[0]. I belive that both for MCP servers and CLIs, having them in skills is the way for efficent context, and hoping that other agents also adopt this same feature.

    [0]: https://ampcode.com/manual#mcp-servers-in-skills

    • sophiabits 5 minutes ago
      > the MCP server is automatically launched when the Agent loads that skill

      The main problem with this approach at the moment is it busts your prompt cache, because LLMs expect all tool definitions to be defined at the beginning of the context window. Input tokens are the main driver of inference costs and a lot of use cases aren't economical without prompt caching.

      Hopefully in future LLMs are trained so you can add tool definitions anywhere in the context window. Lots of use cases benefit from this, e.g. in ecommerce there's really no point providing a "clear cart" tool to the LLM upfront, it'd be nice if you could dynamically provide it after item(s) are first added.

    • goodmythical 1 hour ago
      >as long as it provides the same capabilities.

      That's fine if you definition of capabilities is wide enough to include model understanding of the provided tool and token waste in the model trying to understand the tool and token waste in the model doing things ass backwards and inflating the context because it can't see the vastly shorter path to the solution provided by the tool and...

      There is plenty of evidence to suggest that performance, success rates, and efficiency, are all impacted quite drastically by the particular combination of tool and model.

      This is evidenced by the end of your paragraph in which you admit that you are focused only on a couple (or perhaps a few) models. But even then, throw them a tool they don't understand that has the same capabilities as a tool they do understand and you're going to burn a bunch of tokens watching it try to figure the tool out.

      Tooling absolutely matters.

      • goranmoomin 1 minute ago
        > model understanding of the provided tool and token waste in the model trying to understand the tool and token waste in the model doing things ass backwards and inflating the context because it can't see the vastly shorter path to the solution provided by the tool and...

        > But even then, throw them a tool they don't understand that has the same capabilities as a tool they do understand and you're going to burn a bunch of tokens watching it try to figure the tool out.

        What I was trying to say was that this applies to both MCPs and CLIs – obviously, if you have a certain CLI tool that's represented thoroughly through the model's training dataset (i.e. grep, gh, sed, and so on), it's definitely beneficial to use CLIs (since it means less context spending, less trial-and-error to get the expected results, and so on).

        However if you have a novel thing that you want to connect to LLM-based Agents, i.e. a reverse enginnering tool, or a browser debugging protocol adapter, or your next big thing(tm), it might not really matter if you have a CLI or a MCP since LLMs are both post-trained (hence proficent) for both, and you'll have to do the trial-and-error thing anyway (since neither would represented in the training dataset).

        I would say that the MCP hype is dying out so I personally won't build a new product with MCP right now, but no need to ditch MCPs for any reason, nor do I see anything inherently deficient in the MCP protocol itself. It's just another tool-calling solution.

    • jeremyjh 54 minutes ago
      No, it really matters because of the impact it has on context tokens. Reading on GH issue with MCP burns 54k tokens just to load the spec. If you use several MCPs it adds up really fast.
      • nextaccountic 17 minutes ago
        In the front page there's a project that attempts to reduce tje boilerplate of mcp output in claude code

        Eventually I hope that models themselves become smarter and don't save the whole 54k tokens in their context window

      • ashdksnndck 48 minutes ago
        Verbosity of the output seems orthogonal to the cli vs mcp distinction? When I made mcp tools and noticed a lot of tokens being used, I changed the default to output less and added options to expose different kinds of detailed info depending what the model wants. CLI can support similar behavior.
    • vojtapol 1 hour ago
      MCP needs to be supported during the training and trained into the LLM whereas using CLI is very common in the training set already. Since MCP does not really provide any significant benefits I think good CLI tools and its use by LLMs should be the way forward.
    • avaer 1 hour ago
      MCP vs CLI is the modern version of people discussing the merits of curly braces vs significant whitespace.

      That is, I don't think we're gonna be arguing about it for very long.

  • ejholmes 20 minutes ago
    Hi friends! Author here. This blew up a bit, so some words.

    The article title and content is intentionally provocative. It’s just to get people thinking. My real views are probably a lot more balanced. I totally get there’s a space where MCP probably does actually make sense. Particularly in areas where CLI invocation would be challenging. I think we probably could have come up with something better than MCP to fill that space, but it’s still better than nothing.

    Really all I want folks to take away from this is to think “hmm, maybe a CLI would actually be better for this particular use case”. If I were to point a finger at anything in particular, it would be Datadog and Slack who have chosen to build MCP’s instead of official CLI’s that agents can use. A CLI would be infinitely better (for me).

    • csheaff 0 minutes ago
      Thank you for writing this. I've had similar thoughts myself and have been teetering back and forth between MCP and skills that invoke CLI. I'm hoping this creates a discussion that points to the right pattern.
  • juanre 1 hour ago
    Reports of MCP's demise have been greatly exaggerated, but a CLI is indeed the right choice when the interface to the LLM is not a chat in a browser window.

    For example, I built https://claweb.ai to enable agents to communicate with other agents. They run aw [1], an OSS Go CLI that manages all the details. This means they can have sync chats (not impossible with MCP, but very difficult). It also enables signing messages and (coming soon) e2ee. This would be, as far as I can tell, impossible using MCP.

    [1] https://github.com/awebai/aw

  • phpnode 1 hour ago
    I don't doubt that CLIs + skills are a good alternative to MCP in some contexts, but if you're building an app for non-developers and you need to let users connect it to arbitrary data sources there's really no sensible, safe path to using CLIs instead. MCP is going to be around for a long time, and we can expect it to get much better than it is today.
    • sigmoid10 1 hour ago
      >we can expect it to get much better than it is today

      Which is not a high bar to clear. It literally only got where it is now because execs and product people love themselves another standard, because if they get their products to support it they can write that on some excel sheet as shipped feature and pin it on their chest. Even if the standard sucks on a technical level and the spec changes all the time.

      • phpnode 47 minutes ago
        This is excessively cynical, it's a useful tool despite its shortcomings.
    • simianwords 1 hour ago
      Why? The llm can install cli through apt-get or equivalent and non developers wouldn’t need to know
      • phpnode 1 hour ago
        well I'm sure you can understand the dangers of that, and why that won't work if your app is hosted and doesn't run on users' local machines
      • oldestofsports 57 minutes ago
        What non developer would have apt installed on their device though
  • iamspoilt 1 hour ago
    As a counter argument to the kubectl example made in the article, I found the k8s MCP (https://github.com/containers/kubernetes-mcp-server) to be particularly usefuly in trying to restrict LLM access to certain tools such as exec and delete tools, something which is not doable out of box if you use the kubectl CLI (unless you use the --as or --as-group flags and don't tell the LLM what user/usergroup those are).

    I have used the kk8s MCP directly inside Github Copilot Chat in VSCode and restricted the write tools in the Configure Tools prompt. With a pseudo protocol established via this MCP and the IDE integration, I find it much safer to prompt the LLM into debugging a live K8s cluster vs. without having any such primitives.

    So I don't see why MCPs are or should be dead.

  • sebast_bake 23 minutes ago
    The opposite is true. CLI based integration does not exist in a single consumer grade ai agent product that I’m aware of. CLI is only used in products like Claude Claude and OpenClaw that are targeting technically competent users.

    For the other 99% of the population, MCP offers security guardrails and simple consistent auth. Much better than CLI for the vast majority of use cases involving non-technical people.

  • mavam 32 minutes ago
    Why choose if you can have both? You can turn any MCP into an CLI with Pete's MCPorter: https://mcporter.dev.

    Since I've just switched from buggy Claude Code to pi, I created an extension for it: https://github.com/mavam/pi-mcporter.

    There are still a few OAuth quirks, but it works well.

  • CuriouslyC 1 hour ago
    There's been an anti-MCP pro-CLI train going for a while since ~May of last year (I've been personally beating this drum since then) but I think MCP has a very real use case.

    Specifically, MCP is a great unit of encapsulation. I have a secure agent framework (https://github.com/sibyllinesoft/smith-core) where I convert MCPs to microservices via sidecar and plug them into a service mesh, it makes securing agent capabilities really easy by leveraging existing policy and management tools. Then agents can just curl everything in bash rather than needing CLIs for everything. CLIs are still slightly more token efficient but overall the simplicity and the power of the scheme is a huge win.

  • simonw 46 minutes ago
    MCP makes sense when you're not running a full container-based Unix environment for your agent to run Bash commands inside of.
  • recursivedoubts 1 hour ago
    MCP has one thing going for it as an agentic API standard: token efficiency

    The single-request-for-all-abilities model + JSON RPC is more token efficient than most alternatives. Less flexible in many ways, but given the current ReAct, etc. model of agentic AI, in which conversations grow geometrically with API responses, token efficiency is very important.

    • SOLAR_FIELDS 1 hour ago
      But the flip side of this is that the tools themselves take up a ton of token context. So if you have one mcp it’s great but there is an upper bound that you hit pretty quick of how many tools you can realistically expose to an agent without adding some intermediary lookup layer. It’s not compact enough of a spec and doesn’t have lazy loading built into it
      • harrall 1 hour ago
        Yes but I consider that just a bug in the agents that use MCP servers.

        It could just be fixed to compress the context or the protocol could be tweaked.

        Switching to CLIs is like buying a new car because you need an oil change. Sure, in this case, the user doesn’t get to control if the oil change can be done, but the issue is not the car — it’s that no one will do the relatively trivial fix.

        • dnautics 1 hour ago
          you know what you could do? You could write a skill that turns mcps on or off!
    • ako 1 hour ago
      I've been creating a cli tool with a focus on token efficiency. Dont see why cli could not be as token efficient as mcp. The cli has the option to output ascii, markdown and json.
      • recursivedoubts 1 hour ago
        I'm working on a paper on this, if you are using a hypermedia-like system for progressive revelation of functionality you are likely to find that this chatty style of API is inefficient compared with an RPC-like system. The problem is architectural rather than representational.

        I say this as a hypermedia enthusiast who was hoping to show otherwise.

      • bear3r 32 minutes ago
        the output format (ascii/json/markdown) is one piece, but the other side is input schema. mcp declares what args are valid and their types upfront, so the model can't hallucinate a flag that doesn't exist. cli tools don't expose that contract unless you parse --help output, which is fragile.
        • ako 13 minutes ago
          So far, cli --help seems to work quite well. I'm optimizing the cli to interact with the agent, e.g., commands that describe exactly what output is expected for the cli DSL, error messages that contain DSL examples that exactly describe the agent how to fix bugs, etc. Overall i think the DSL is more token efficient that a similar JSON, and easier to review for humans.
  • appsoftware 1 hour ago
    ?? I'm using my own remote MCP server with openclaw now. I do understand the use case for CLI. In his Lex Friedman interview the creator highlights some of the advantages of CLI, such as being able to grep over responses. But there are situations where remote MCP works really well, such as where OAuth is used for authentication - you can hit an endpoint on the MCP server, get redirected to authenticate and authorise scopes etc and the auth server then responds to the MCP server.
  • p_ing 1 hour ago
    Tell my business users to use CLI when they create their agents. It's just not happening. MCP is point-and-click for them.

    MCP is far from dead, at least outside of tech circles.

  • AznHisoka 1 hour ago
    In terms of what companies are actually implementing, MCP isnt dead by a long time. Number of companies with a MCP server grew 242% in the last 6 months and is actually accelerating (according to Bloomberry) [1]

    https://bloomberry.com/blog/we-analyzed-1400-mcp-servers-her...

    • lakrici88284 1 hour ago
      Companies are usually chasing last year's trend, and MCP makes for an easy "look, were adopting AI!" bullet point.
      • AznHisoka 42 minutes ago
        Right, but even if this is just a matter of "chasing a trend", it does have a network effect and makes the entire MCP ecosystem much more useful to consumers, which begets more MCP servers.
  • the_mitsuhiko 1 hour ago
    > OpenClaw doesn’t support it. Pi doesn’t support it.

    It's maybe not optimal to conclude anything from these two. The Vienna school of AI agents focuses on self extending agents and that's not really compatible with MCP. There are lots of other approaches where MCP is very entrenched and probably will stick around.

  • bikeshaving 1 hour ago
    I keep asking why the default Claude tools like Read(), Write(), Edit(), MultiEdit(), Replace() tools aren’t just Bash() with some combination of cat, sed, grep, find. Isn’t it just easier to pipe everything through the shell? We just need to figure out the permissions for it.
    • fcarraldo 1 hour ago
      Because the Tools model allows for finer grained security controls than just bash and pipe. Do you really want Claude doing `find | exec` instead of calling an API that’s designed to prevent damage?
      • arbll 1 hour ago
        It might be the wrong place to do security anyway since `bash` and other hard-to-control tools will be needed. Sandboxing is likely the only way out
      • webstrand 1 hour ago
        yeah, I would rather it did that. You run Claude in a sandbox that restricts visibility to only the files it should know about in the first place. Currently I use a mix of bwrap and syd for filtering.
    • rfw300 1 hour ago
      Making those tools first-class primitives is good for (human) UX: you see the diffs inline, you can add custom rules and hooks that trigger on certain files being edited, etc.
  • Nevin1901 1 hour ago
    This is actually the first use case where I agree with the poster. really interesting, especially for technical people using ai. why would you spend time setting up and installing an mcp server when u can give it one man page
  • ddp26 1 hour ago
    I don't understand the CLI vs MCP. In cli's like Claude Code, MCPs give a lot of additional functionality, such as status polling that is hard to get right with raw documentation on what APIs to call.
  • orange_joe 1 hour ago
    This doesn't really pay attention to token costs. If I'm making a series of statically dependent calls I want to avoid blowing up the context with information on the intermediary states. Also, I don't really want to send my users skill.md files on how to do X,Y & Z.
    • krzyk 1 hour ago
      Why? MCP and CLI is similar here.

      You need agent to find MCP and what it can be used for (context), similarly you can write what CLI use for e.g. jira.

      Rest is up to agent, it needs to list what it can do in MCP, similarly CLI with proper help text will list that.

      Regarding context those tools are exactly the same.

      • lmeyerov 1 hour ago
        This feels right in theory and wrong in practice

        When measuring speed running blue team CTFs ("Breaking BOTS" talk at Chaos Congress), I saw about a ~2x difference in speed (~= tokens) for a database usage between curl (~skills) vs mcp (~python). In theory you can rewrite the mcp into the skill as .md/.py, but at that point ... .

        Also I think some people are talking past one another in these discussions. The skill format is a folder that supports dropping in code files, so much of what MCP does can be copy-pasted into that. However, many people discussing skills mean markdown-only and letting the LLM do the rest, which would require a fancy bootstrapping period to make as smooth as the code version. I'd agree that skills, when a folder coming with code, does feel like largely obviating MCPs for solo use cases, until you consider remote MCPs & OAuth, which seem unaddressed and core in practice for wider use.

    • phpnode 1 hour ago
      the article only makes sense if you think that only developers use AI tools, and that the discovery / setup problem doesn't matter
      • trollbridge 1 hour ago
        But that's the current primary use case for AI. We aren't anywhere close to being able to sanitise input from hostile third parties enough to just let people start inputting prompts to my own system.
        • phpnode 53 minutes ago
          there's a whole world of AI tools out there that don't focus on developers. These tools often need to interact with external services in one way or another, and MCP gives those less technical users an easy way to connect e.g. Notion or Linear in a couple of clicks, with auth taken care of automatically. CLIs are never replacing that use case.
  • ako 1 hour ago
    Biggest downside of CLI for me is that it needs to run in a container. You're allowing the agent to run CLI tools, so you need to limit what it can do.
    • wolttam 1 hour ago
      It gets significantly harder to isolate the authentication details when the model has access to a shell, even in a container. The CLI tool that the model is running may need to access the environment or some credentials file, and what's to stop the model from accessing those credentials directly?

      It breaks most assumptions we have about the shell's security model.

    • tuwtuwtuwtuw 1 hour ago
      Couldn't that be solved by whitelisting specific commands?
      • wolttam 1 hour ago
        Such a mechanism would need to be implemented at `execve`, because it would be too easy for the model to stuff the command inside a script or other executable.
  • lukol 1 hour ago
    Couldn't agree more. Simple REST APIs often do the job as well. MCP felt like a vibe-coded fever dream from the start.
  • lasgawe 1 hour ago
    I don't know about this. I use AI, but I've never used or tried MCP. I've never had any problems with the current tools.
    • I_am_tiberius 1 hour ago
      That's the way my 80 year old grandpa talks.
  • dnautics 1 hour ago
    what honestly is the difference between an mcp and a skill + instructions + curl.

    Really it seems to me the difference is that an mcp could be more token-efficient, but it isn't, because you dump every mcp's instructions all the time into your context.

    Of course then again skills frequently doesn't get triggered.

    just seems like coding agent bugs/choices and protocol design?

  • rvz 1 hour ago
    MCPs were dead in the water and were completely a bad standard to begin with. The hype around never made sense.

    Not only it had lots of issues and security problems all over the place and it was designed to be complicated.

    For example, Why does your password manager need an MCP server? [0]

    But it still does not mean a CLI is any better for everything.

    [0] https://news.ycombinator.com/item?id=44528411

  • whatever1 1 hour ago
    First they came for our RAGs, now for our MCPs. What’s next ?
  • SignalStackDev 1 hour ago
    [dead]
  • aplomb1026 1 hour ago
    [dead]
  • nimbus-hn-test 1 hour ago
    [dead]
  • mudkipdev 1 hour ago
    This got renamed right in front of my eyes
  • mt42or 1 hour ago
    I remember this kind of people against Kubernetes the same exact way. Very funny.
    • tedk-42 1 hour ago
      Same clowns complaining that `npm install` downloads the entire internet.

      Now it's completely fine for an AI agent to do the same and blow up their context window.