Mapillary MCP. Analyze street view imagery, object detections, and map features.
Works with every AI agent you already use
…and any MCP-compatible client
Just plug in your AI agents and start using Vinkius.
Mapillary connects your AI client to the world's largest street-level imagery platform. You search for images, map features (like traffic signs), and object detections across any area globally.
It gives you image metadata—GPS coordinates, capture dates, and compass angles—so you can pinpoint exactly what was captured where.
What your AI agents can do
Get detection value
Gets specific details—type, GPS coordinates, and association—for a single detected object.
Get image
Returns comprehensive metadata for one image ID, including capture time, location, and sequence ID.
Get image detections
Lists all detected objects within a specific image, providing their types, geometry, and confidence scores.
You ask your agent to find street-level photos or sequences along a specific route using coordinates.
Your agent searches for map features—like stop signs, speed limits, and road markings—in a defined area, useful for planning or auditing.
You instruct your agent to list every detected object (e.g., streetlights, vehicles) found within a specific image ID.
The server returns the full details for an image, including its capture date, precise GPS coordinates, and angle.
You retrieve granular data on a single object detection, showing its specific type, value, and exact location within the photo.
Ask AI about this MCP
Supported MCP Clients
Waiting for input…
Mapillary MCP Server: 7 Tools for Geospatial Analysis
This server provides granular access to street-level imagery data. Use the tools to search areas, retrieve image metadata, and detect specific objects or map features.
019d8454get detection value
Gets specific details—type, GPS coordinates, and association—for a single detected object.
019d8454get image
Returns comprehensive metadata for one image ID, including capture time, location, and sequence ID.
019d8454get image detections
Lists all detected objects within a specific image, providing their types, geometry, and confidence scores.
019d8454get map features
Searches for static map elements like traffic signs or road markings across an area of interest.
019d8454get sequence
Retrieves metadata and linked images for a continuous set of photos captured along a route.
019d8454search images
Finds image IDs, coordinates, and capture dates by searching within a defined geographic bounding box.
019d8454search sequences
Locates entire photo sequences (routes) that pass through a specified geographical area.
Choose How to Get Started
Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.
Build Your Own
Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.
- Import from OpenAPI, Swagger, or YAML specs
- Create Agent Skills with progressive disclosure
- Deploy to edge with MCPFusion framework
- Built in DLP, auth, and compliance on every call
- Real time usage dashboard and cost metering
- Publish to catalog or keep private
Make Your AI Do More
Start with Mapillary, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.
- Use this MCP plus 4,700+ others, all in one place
- Add new capabilities to your AI anytime you want
- Every connection is secured and compliant automatically
- Track usage and costs across all your servers
- Works with Claude, ChatGPT, Cursor, and more
- New servers added to the catalog every week
What you can do with this MCP connector
Mapillary lets your agent access the world’s street-level imagery data through natural conversation.
This server exposes several tools for geospatial analysis:
- Search Images: Pinpoint images using a geographic bounding box.
- Search Sequences: Find connected image sets (routes) that pass through an area.
- Map Features: Search for specific static elements like traffic signs or road markings by location.
- Image Detection: Get object detections and detailed values from specific images, linking the detection to its geometry and confidence score.
How Mapillary MCP Works
- 1 First, your agent uses
search_imagesorget_map_featureswith coordinates to narrow down the area of interest. - 2 Next, you pass a specific image ID or sequence ID to
get_imageorget_image_detectionsto retrieve the raw data and associated objects. - 3 Finally, your agent processes the output using
get_detection_valueorget_sequenceto extract actionable metadata—like a speed limit value or GPS path.
The bottom line is you get structured access to massive amounts of geospatial data that would otherwise require dedicated GIS software.
Who Is Mapillary MCP For?
Urban planners, environmental researchers, and civil engineers use this. They need to audit physical infrastructure—traffic signs, road markings, etc.—across large areas without manual site visits. If you're tired of relying on vague satellite views for ground truth data, this is for you.
Uses get_map_features and search_images to inventory traffic signs or analyze road markings across a neighborhood grid.
Combines get_image_detections with GPS data from get_image to build environmental models based on captured objects over time.
Uses the full suite of tools, especially object detection and sequence search, to validate perception stack performance against real-world visual evidence.
What Changes When You Connect
- Find objects across wide areas: Instead of checking one photo at a time, use
get_map_featuresto search for all traffic signs or road markings in an entire quadrant. - Understand the journey's context: Use
search_sequencesand thenget_sequenceto track how conditions change along a specific route path over time. - Deep dive into single images: If you find an image ID using
search_images, runget_image_detectionsimmediately. This lists every object the system found, giving confidence scores for each one. - Pinpoint exact object data: Don't just know a car was there; use
get_detection_valueto get that specific detection's type, GPS coordinates, and image link. - Get full photo context: Use
get_imageon any ID to pull back the original capture date, compass angle, and organization data—the whole metadata package.
Real-World Use Cases
Auditing Speed Limits
A planner needs to check if all speed limits are posted on a major street. They use get_map_features over the street's bounding box. The agent filters results for 'speed limit sign' and gets GPS coordinates and confidence scores, confirming compliance quickly.
Tracking Changes in Urban Areas
A researcher wants to know what objects were present in a park six months ago versus today. They use search_sequences to find the path, then compare the object lists from get_image_detections for corresponding image IDs.
Validating Autonomous Driving Paths
A developer simulates a driving route using coordinates and calls search_images. For key intersections, they run get_image_detections to confirm the agent's perception stack can identify everything from stop signs to pedestrian crossings.
Forensic Image Analysis
Given a specific image ID, the user needs to know exactly what was captured. They call get_image, which provides the full metadata (GPS, date) and then call get_image_detections for the list of objects.
The Tradeoffs
Searching blindly by keyword
Asking the agent: 'Show me pictures of traffic signs in London.' This is too vague and misses critical spatial data.
→
You must define a physical boundary. First, use search_images with the correct bounding box (e.g., '-0.15,51.50,-0.10,51.52'). Then, run get_map_features to specifically filter for 'traffic sign' within that area.
Treating all photos the same
Calling get_image_detections without first knowing if an image ID is valid or what its capture date was. This leads to missing context.
→
Always start by calling get_image(image_id) to confirm the metadata, location, and sequence details before attempting any deep detection calls.
Missing the route context
Only searching for images using search_images when the user actually wants a continuous path view. This gives isolated points.
→
If you need multiple connected photos, use search_sequences first to find the sequence ID. Then pass that ID to get_sequence for full context.
When It Fits, When It Doesn't
Use this server if your job requires physical ground truth data: analyzing infrastructure, detecting objects in place, or tracing routes using coordinates. You must use it when you need more than just a general map view—you need the actual image-level detections and metadata.
Don't use this if you only need to know which street is near another (use standard mapping APIs for that). Also, don't rely on it if your goal is merely generating conceptual data; you must always have specific coordinates or bounding boxes to start. If you only want a list of general points of interest without detection capabilities, other map services might suffice.
To get the most out of this, you need an orchestration layer that chains calls: search_sequences -> get_sequence -> (loop through image IDs) get_image_detections. This complex workflow is where the power lies.
Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Mapillary. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.
VINKIUS INFRASTRUCTURE
Cloud Hosted
Managed infra
V8 Isolated
Sandboxed per request
Zero-Trust Proxy
No stored credentials
DLP Enforced
Policy on every call
GDPR Compliant
EU data residency
Token Compression
~60% cost reduction
Works with Claude, ChatGPT, Cursor, and more
The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.
This server provides 7 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.
Available Capabilities
Manually checking street view data takes hours of clicking and cross-referencing tabs.
Right now, if you're auditing traffic signs or object placement, you pull up a map service. You draw a box, see hundreds of potential points, and then you have to open each one individually. You copy the coordinates into a spreadsheet, open another window to check the date, and manually verify if the sign was even visible in that specific photo.
With this MCP server, you tell your agent: 'Find all speed limit signs between these two points.' The agent runs `get_map_features` across the area. You get a structured list of results—GPS coordinates, feature type, and confidence score—all in one go.
Mapillary MCP Server: Getting object detections from visual data.
Manual processes often stop at 'I see an object.' You get a vague detection box. You don't know if it’s a streetlight, or just a pole—and you certainly don't have the exact GPS location of the base, only the bounding box.
The Mapillary MCP Server gives you that specificity. You run `get_image_detections` and immediately get the value, type, geometry, *and* confidence score for every object found in a single image. It turns vague observation into auditable data points.
Common Questions About Mapillary MCP
How do I search street view images using `search_images`? +
You pass the function your bounding box parameters: min_lon, min_lat, max_lon, max_lat. The server returns a list of image IDs, coordinates, and capture dates that fall within that specific rectangle.
What is the difference between `search_images` and `search_sequences`? +
search_images finds individual, isolated photos. search_sequences finds continuous paths or routes composed of multiple connected images; you'll get a sequence ID instead.
Can I find traffic signs using the `get_map_features` tool? +
Yes, that’s exactly what it does. You query get_map_features by area and filter for 'traffic sign' or other road markings to build an inventory.
What fields can I request when using `get_image`? +
You pass a fields parameter. You can request specific metadata like geometry, compass angle, captured_at, sequence ID, or various thumbnail URLs to get exactly the data you need.
How do I authenticate my agent to use the `get_image` tool? +
You first subscribe to the Mapillary server and provide your unique access token. Once connected, you can call get_image with specific coordinates or IDs to pull detailed metadata about any street-level photo.
What is the best way to handle high volumes of data when running a query using `search_images`? +
Be mindful of your rate limits and bounding box size. For large areas, break your search into smaller, manageable geographical sections. This prevents quota overruns and keeps results stable.
If I get a list of detections for an image using `get_image_detections`, how do I get more detail on one specific object? +
You pass the unique detection ID to the get_detection_value tool. This gives you granular information, including the object's exact type, confidence score, and precise GPS coordinates.
When using `get_map_features`, what format must I use for my search area? +
You need to provide your search area as a geographical bounding box. This is formatted as four comma-separated values: min_lon, min_lat, max_lon, and max_lat.
How do I get a Mapillary access token? +
Sign up at mapillary.com/developer and create an application to get your access token.
What is a bounding box? +
A bounding box defines a geographic area using min_longitude, min_latitude, max_longitude, max_latitude. Example: "-0.15,51.50,-0.10,51.52" covers central London.
What map features are available? +
Mapillary detects traffic signs (stop, speed limit, yield, etc.), road objects (fire hydrants, mailboxes, etc.) and road markings. Use get_map_features to search by area.
Use it with your favorite AI tools
Connect this server to Cursor, Claude, VS Code, and more.
More in this category
Tencent Youtu / 腾讯优图
China's leading computer vision platform — perform facial recognition, analysis, and liveness checks via AI.
Synthesia
Connect your AI to Synthesia. Generate corporate AI avatar videos from text prompt, explore templates, and automatically dub existing media directly from the terminal.
FileStack
Upload, transform, and analyze files with Filestack—extract text via OCR, detect objects, and manage video transcoding directly from your AI agent.
You might also like
Momence
Manage sessions, members, teachers, memberships, and bookings for your Momence-powered fitness or yoga studio through natural conversation.
EOSDA Agriculture Satellite Data
Precision agriculture satellite intelligence — access NDVI, soil moisture, and crop health via AI.
CompanyCam
Enable your AI agent to manage construction projects, photos, and jobsite documentation via the CompanyCam API.