I Connected Claude Code to Roboflow MCP — Here’s What Worked

Last weekend I wanted to build a quick vision app to detect whether parts on our workshop shelf were correctly labeled. Nothing production-grade — just a proof of concept to see if AI could save us 15 minutes of manual checking.

Usually this means: collect images, upload to Roboflow, label them, train a model, write an inference script, connect everything. Half a day if I’m fast. But Roboflow recently shipped an MCP server that lets AI coding agents do all of this directly.

I connected Claude Code to it. Here’s what happened.

What the MCP Server Actually Does

The Model Context Protocol is a standard way for AI agents to call external tools. Roboflow’s MCP server exposes their entire platform as tools — create projects, upload images, auto-label with foundation models, train, deploy.

I added one line to my Claude Code MCP config pointing to mcp.roboflow.com. Claude discovered 15+ tools automatically. No SDK installation, no API key juggling beyond pasting it once.

The surprising part was how natural the conversation felt. I said “I have a folder of shelf images, train a model to detect if labels are present.” Claude Code used the MCP tools to create a project, upload the images, run auto-labeling with a foundation model, train a small model, and return the API endpoint. All in about 8 minutes.

Where It Shined

The auto-labeling step was the biggest time-saver. Normally I’d spend 20-30 minutes manually drawing bounding boxes. The MCP server called Roboflow’s foundation model to pre-label everything, then Claude reviewed the labels and asked me to verify the tricky ones. I had to correct maybe 5 out of 40 images.

Another win: the deploy step. Usually I write a small Python script with the roboflow package, test it, debug the API key path, etc. Claude generated the inference script, set up the virtual environment, and ran a test prediction — all through MCP tool calls.

Where It Stumbled

The MCP server times out on large uploads. My folder had 60 images (about 120MB total) and the upload tool would fail silently after ~40 images. I had to split them into batches manually.

Also, Claude Code sometimes overcomplicated the workflow. For a simple shelf detection task, it wanted to set up a multi-stage pipeline with confidence thresholds and edge deployment configs. I had to explicitly say “just give me a REST API endpoint, nothing else.”

The model quality was mediocre — 82% mAP on my test set. Good enough for a PoC, but I’d need more labeled data for production. The MCP server won’t magically fix bad training data.

Would I Use It Again?

For rapid prototyping, absolutely. Going from “I wonder if this would work” to a working API endpoint in under 10 minutes is genuinely impressive. The MCP pattern also means every improvement Roboflow makes to their platform becomes instantly available to my coding agent.

For production models, I’d still use the Roboflow web UI or Python SDK directly. The MCP server is fast but imprecise — Claude sometimes picks suboptimal preprocessing steps or training parameters because it can’t see the visual output of each step.

My verdict: this is what MCP was designed for. Connect your AI coding agent to your ML platform, let it handle the boilerplate, then step in for the decisions that need human judgment. The 8 minutes I saved on setup let me spend 30 minutes improving the model quality manually — and that’s the right trade.

Discover more from Susiloharjo

Subscribe to get the latest posts sent to your email.