Face++ / Megvii MCP. Analyze faces, body posture, and gestures from images.

Q: How does the comparefaces tool work?

The comparefaces tool calculates a similarity score based on facial feature vectors. It gives you a confidence percentage (e.g., 98.5%) that both images belong to the same person, which is much more reliable than a simple visual match.

Q: Can I use the detectface tool to get more than just a bounding box?

Yes. The detectface tool returns detailed attributes for each detected face, including estimates for age, gender, and emotional expression, in addition to the location data.

Q: What is the difference between searchface and getfacesetdetail?

getfacesetdetail shows you everything currently stored in the FaceSet (the roster). searchface lets you query that roster actively—for example, finding all faces matching 'male' and 'happy' within the set.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Just plug in your AI agents and start using Vinkius.

Face++ / Megvii. This platform lets you detect faces, compare identities, and analyze body gestures using AI. It handles everything from basic face detection (age, gender, emotion) to managing large face databases (FaceSets).

You can also detect human skeletons and analyze specific hand gestures from images. It’s a full vision intelligence suite for KYC, research, and content analysis.

What your AI agents can do

Add face to faceset

Adds a detected face to a specified FaceSet.

Compare faces

Compares two images to determine the likelihood that they belong to the same person.

Create faceset

Creates a new, empty FaceSet database for tracking identities.

+ 7 more capabilities included

Detecting and profiling faces

Use detect_face to find faces in an image and retrieve metadata like age, gender, and emotion.

Verifying identity similarity

The compare_faces tool calculates the confidence score that two given images belong to the same person.

Managing face databases

The create_faceset tool builds a new searchable database, and add_face_to_faceset adds detected faces to an existing set.

Analyzing human posture and structure

Use detect_body to locate human figures and skeleton_detect to map the body's skeletal keypoints.

Identifying hand gestures

The gesture_detect tool reads images to pinpoint and identify specific hand movements.

Searching within large face collections

Use search_face to query a FaceSet and find a specific face based on its attributes.

Ask AI about this MCP

Ask ChatGPT

Ask Claude

Ask Perplexity

Supported MCP Clients

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

+ other MCP clients

Free for Subscribers

Waiting for input…

AI Agent

Face++ / Megvii MCP Server: 10 Tools for Vision Analysis

Use these tools to detect faces, map skeletons, compare identities, and analyze gestures through your AI agent in a single, unified workflow.

add019d8438

add face to faceset

Adds a detected face to a specified FaceSet.

compare019d8438

compare faces

Compares two images to determine the likelihood that they belong to the same person.

create019d8438

create faceset

Creates a new, empty FaceSet database for tracking identities.

detect019d8438

detect body

Identifies and returns the bounding box coordinates for human bodies in an image.

detect019d8438

detect face

Finds and analyzes faces within an image, providing bounding boxes and attributes like age and gender.

gesture019d8438

gesture detect

Detects and identifies specific hand gestures present in an image.

get019d8438

get faceset detail

Retrieves the current contents and metadata for a specified FaceSet.

remove019d8438

remove face from faceset

Removes a specific face record from an existing FaceSet.

search019d8438

search face

Queries a FaceSet using specific criteria (e.g., attribute, coordinates) to find a matching face.

skeleton019d8438

skeleton detect

Detects and maps the human skeleton keypoints, providing detailed pose estimation.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Face++ / Megvii, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 4,700+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

What you can do with this MCP connector

This server connects your AI agent to Face++ (Megvii), a complete vision intelligence suite. You can use it to detect, profile, and analyze everything from faces and bodies to hand gestures.

Detecting and Profiling Faces: You can run detect_face to find faces in an image and get key metadata like age, gender, and emotion. Use compare_faces to check if two images show the same person by calculating a confidence score. Managing Face Databases: You'll build new searchable databases with create_faceset, and then you can add detected faces to a set using add_face_to_faceset.

To find a specific person in a large collection, run search_face against a FaceSet. You can also check what's already in a set with get_faceset_detail or take a face out with remove_face_from_faceset. Analyzing Human Posture and Structure: You can locate human figures using detect_body to get bounding box coordinates, and map the body's entire skeletal structure with skeleton_detect. Identifying Hand Gestures: The gesture_detect tool reads images to pinpoint and identify specific hand movements. Comparing and Analyzing: You can determine the likelihood of two images belonging to the same person using compare_faces.

Your AI agent handles the whole process. It sends the image or video data, and the server returns structured vision data—like face attributes or confidence scores—which your agent uses to act on the information.

How Face++ / Megvii MCP Works

1 Subscribe to the server and provide your Face++ API Key and Secret.
2 Your AI agent sends the image or video data and the specific task (e.g., 'Compare these two images') to the MCP client.
3 The client runs the appropriate tool, returning structured data like bounding boxes, confidence scores, or demographic attributes.

The bottom line is: your agent runs complex computer vision tasks using natural language, eliminating the need to use a web console.

Who Is Face++ / Megvii MCP For?

Security and compliance officers who run KYC audits, or UX researchers analyzing user behavior. If your job involves processing visual data—whether it's checking documents or watching user interactions—you need this. It moves identity verification and visual content review from manual clicking into conversational queries.

Compliance Officer

Automates Know Your Customer (KYC) audits by querying identity verification and facial data against a database.

UX Researcher

Analyzes user emotions and gestures during product testing by feeding visual data directly to their AI workspace.

Forensic Investigator

Tracks subjects across multiple images, comparing facial features and body movements to build a suspect profile.

What Changes When You Connect

Identity Confirmation: Use compare_faces to instantly calculate a similarity score between two images. You don't need to manually cross-reference documents; the agent handles the math.
Full Body Analysis: Detect human figures with detect_body and map the full structure using skeleton_detect. This lets you analyze posture or movement without running multiple, separate tools.
Centralized Identity Tracking: Build and manage dedicated identity databases using create_faceset. Then, use add_face_to_faceset to feed faces into the set and search_face to query them later.
Emotional Context: The detect_face tool goes beyond simple detection. It returns attributes like age, gender, and emotional state, giving you context right out of the box.
Gesture Recognition: Use gesture_detect to pinpoint specific hand movements. This is crucial for UX research or operational safety checks where hand signals matter.
Streamlined Compliance: By combining detect_face with search_face, you can automate KYC audits. Your agent can process an image, check for a face, and immediately search that face against a known database.

Real-World Use Cases

Validating employee credentials for access control

A security guard needs to check if an employee is authorized. Instead of manually comparing IDs, they ask their agent: 'Compare this photo to the authorized employee list.' The agent runs compare_faces against the master records, giving an immediate confidence score. The guard gets a pass/fail answer in seconds.

Analyzing crowd behavior at an event

A crowd analyst wants to track movement. They ask the agent to 'Detect all bodies and map their skeletons.' The agent runs detect_body and skeleton_detect on the video feed, providing coordinates for every person. This helps identify unusual crowd density or potential hazards.

Researching user interaction during product testing

A UX researcher needs to know if a user is frustrated. They feed the video to the agent and ask: 'What is the user's emotional state and what gestures are they making?' The agent uses detect_face and gesture_detect, providing both emotional metrics and hand signals in one response.

Building a suspect profile from fragmented evidence

An investigator has several photos. They first run detect_face on all of them. Then, they use create_faceset to build a case file, adding each face via add_face_to_faceset. Finally, they use search_face to link all the faces in the case file.

The Tradeoffs

Running detection tools individually

Running detect_face to get a bounding box, then manually taking those coordinates and feeding them into skeleton_detect as if they are compatible. This often fails because the tools don't share a common coordinate system or expected output format.

→ Don't chain the tools manually. Ask your agent to run the workflow: 'First, detect the body, then analyze the skeleton.' The agent handles the necessary data flow between detect_body and skeleton_detect.

Over-relying on single-tool output

Using only detect_face results and assuming that's enough for identity verification. This ignores potential changes in pose or lighting that would invalidate the result.

→ Always validate identity using compare_faces. This tool calculates similarity confidence, giving you a quantifiable score that accounts for pose variation, not just a simple match/no-match.

Trying to process data in batches without managing state

Calling create_faceset and then running 50 separate add_face_to_faceset calls without checking the FaceSet's current state or managing potential failures.

→ Always use get_faceset_detail after setup or before a major batch update. This confirms the FaceSet is ready and shows exactly how many faces are already registered, preventing redundant or failed adds.

When It Fits, When It Doesn't

Use this if you need to process visual data in a structured, multi-step way. Specifically, if your workflow requires comparing people (compare_faces), building a persistent record of identities (create_faceset/add_face_to_faceset), or analyzing physical movement (skeleton_detect/gesture_detect).

Don't use it if you just need simple image cropping or basic file manipulation. For those tasks, a standard file utility or simple image processing library is better. You need the high-level intelligence to detect what is in the image and how the subjects relate to each other.

If your goal is purely to check if an image has a face, detect_face is enough. But if you need to know who that face belongs to, you need the full suite: detect_face -> create_faceset -> add_face_to_faceset -> search_face.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Face++ / Megvii. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 10 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

add_face_to_faceset compare_faces create_faceset detect_body detect_face gesture_detect get_faceset_detail remove_face_from_faceset search_face skeleton_detect

Manually cross-referencing visual evidence is slow and prone to error.

In the old way, if you needed to compare two people's IDs, you'd pull up two separate photo files. You'd have to zoom in, manually check the features, and then maybe copy a timestamp or ID number into a spreadsheet. If the lighting was bad or they changed their angle, you had to restart the whole comparison, wasting time and risking human error.

With this MCP server, you just ask your agent to compare the photos. The agent runs the `compare_faces` tool, returns a single confidence percentage (e.g., 98.5%), and you get the definitive answer immediately. No manual comparison needed.

Face++ / Megvii MCP Server: Run complex vision ops from chat.

You used to need to write complex scripts that chained together separate APIs: first, a call to detect the body; then, a second call to get the skeleton keypoints from the bounding box; and finally, a third call to analyze the gestures. This was brittle and required deep knowledge of API contracts.

Now, your agent handles the whole thing. You simply ask for the body and skeleton data, and the agent executes `detect_body` and `skeleton_detect` automatically, giving you the complete structural data in one conversation thread.

Common Questions About Face++ / Megvii MCP

How does the `compare_faces` tool work? +

The compare_faces tool calculates a similarity score based on facial feature vectors. It gives you a confidence percentage (e.g., 98.5%) that both images belong to the same person, which is much more reliable than a simple visual match.

Can I use the `detect_face` tool to get more than just a bounding box? +

Yes. The detect_face tool returns detailed attributes for each detected face, including estimates for age, gender, and emotional expression, in addition to the location data.

What is the difference between `search_face` and `get_faceset_detail`? +

get_faceset_detail shows you everything currently stored in the FaceSet (the roster). search_face lets you query that roster actively—for example, finding all faces matching 'male' and 'happy' within the set.

Does the `skeleton_detect` tool work on video? +

Yes, it processes frames from video data. It detects the human skeleton keypoints, providing detailed pose estimation for movement analysis.

How do I manage a FaceSet using the `add_face_to_faceset` tool? +

You add faces by providing the FaceSet ID and the image data. The tool handles the data upload and linking. You'll need to ensure the FaceSet already exists before running this command.

What is the expected format for the `compare_faces` tool when dealing with multiple images? +

You pass a list of image URLs or bytes to the tool. The API calculates the similarity confidence score for every pair. The output gives you a direct, quantifiable percentage match for comparison.

If I run `detect_body` on a complex image, how does the system handle occlusion? +

The system detects the body structure even when parts are obscured. It provides key points and a bounding box around the visible area. You'll get the best possible analysis given the image quality.

How do I check the status or get details of an existing FaceSet using `get_faceset_detail`? +

Simply provide the FaceSet ID. The tool returns metadata, including the total number of faces and the date the set was created. This confirms the set's existence and gives you basic management info.

How do I find my Face++ API Key and Secret? +

Log in to the Face++ Console, go to [Dashboard] -> [API Key], and you will find your unique Key and Secret there. Ensure you have the necessary permissions enabled.

Can I analyze images from a local file? +

This server currently supports analysis via image_url. To use local files, you should upload them to a public or private storage and provide the resulting URL to the tools.

What attributes can be returned during face detection? +

Common attributes include gender, age, emotion, head pose, smile, face quality, and skin status. You can specify these in the return_attributes parameter of the detect_face tool.

Use it with your favorite AI tools

Connect this server to Cursor, Claude, VS Code, and more.

OpenAI Agents SDK sdk-python

Google ADK sdk-python

Pydantic AI sdk-python

Vercel AI SDK sdk-typescript