Baidu Qianfan MCP for AI. Connect Enterprise LLMs and Multimodal AI Services

Q: How do I use the chatcompletions tool with my proprietary data?

You first run getembeddings on your documents to convert them into vectors. Then, when you call chatcompletions, the agent uses those vectors as context for the model so it answers based on your private data.

Q: What if I want to check what models are available for chatcompletions?

You simply use listmodels. This tool shows all the specific AI services you have access to, letting you choose the best one for your current task.

Q: How do I know if a model is ready for production use?

Use listtrainjobs to check the status of any model training. This confirms that the model has finished its job and is stable enough for real-world integration.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

Connect to your AI in seconds.

Baidu Qianfan connects your AI agent directly to enterprise LLM services for Chinese language applications. This MCP manages chat completions, generates vector embeddings for RAG systems, creates images from text prompts, and lets you monitor model usage—all through one standard connection.

What your AI can do

Get embeddings

Converts text into a numerical vector, which allows your agent to measure the semantic similarity between documents.

List train jobs

Checks the status and history of any active or completed model training jobs.

Chat completions

Sends a message and context history to a Baidu Qianfan model to get a natural language reply.

+ 3 more capabilities included

Run Conversational AI

Start and continue conversations with powerful foundation models, maintaining context across multiple turns.

Index Text for Search

Convert any block of text into mathematical vector embeddings so your agent can perform deep semantic searches against private datasets.

Generate Visual Content

Create original images using advanced diffusion models, simply by providing a descriptive text prompt.

Manage AI Prompts

Access and retrieve standardized prompt templates to ensure your model outputs are always consistent in tone and format.

Audit Usage Metrics

Monitor token consumption and service status programmatically, giving you a clear picture of your operational costs.

Ask an AI about this

Included with Plan

Waiting for input…

AI Agent

Baidu Qianfan with 6 Tools

Use these tools to control model endpoints, generate embeddings, list datasets, check models, monitor jobs, and create images.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Baidu Qianfan on Vinkius

Get Embeddings

Converts text into a numerical vector, which allows your agent to measure the semantic similarity between documents.

List Train Jobs

Checks the status and history of any active or completed model training jobs.

Chat Completions

Sends a message and context history to a Baidu Qianfan model to get a natural...

List Datasets

Retrieves a list of all data sets you have uploaded and indexed within the platform.

List Models

Shows which specific AI model services are available for use right now.

Text To Image

Creates a unique image file based on a detailed text prompt you provide.

Security and governance baked right in.

Pick your AI client below to get set up. Just create a Vinkius account, subscribe, and you're instantly up and running. We handle the entire backend infrastructure, delivering out-of-the-box support for HTTPS Streamable, SSE, and OAuth2—zero messy routing required.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Baidu Qianfan integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "baidu-qianfan": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Baidu Qianfan tools with full Vinkius guardrails applied.

VS Code Copilot

⚡

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for GitHub Copilot: MCP Servers.

Add Server Config

Add the Vinkius endpoint configuration to your mcp-servers.json file:

"baidu-qianfan": {
  "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

Use the SSEClient in LangChain to connect to the Vinkius managed endpoint:

from langchain_mcp_adapters.client import SSEClient

# Connect to Vinkius
client = SSEClient(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")
tools = client.get_tools()

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

from crewai import Agent
from mcp_crewai import MCPTool

# Connect securely to Vinkius
vinkius_tools = MCPTool(url="https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp")

# Assign to Agent
researcher = Agent(
    role='Data Researcher',
    tools=vinkius_tools.get_all()
)

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on every call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Baidu Qianfan, then connect any of our 5,100+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,100+ others, all in one place
Add new capabilities to your AI anytime you want
Every connection is secured and compliant automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog every week

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Baidu Qianfan. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This connection provides 6 powerful capabilities that interface natively with Claude, ChatGPT, Cursor, and other compatible AI platforms. No middleware. No custom integration required.

Dealing with AI services used to mean jumping between dashboards.

Today, if you want your agent to chat with the model *and* search private documents, you have to jump through hoops. You're in one dashboard for chatting; then you leave it and go to another console just to upload data and run embeddings. Then maybe a third tool is needed just to check what models are even available.

With this MCP, all those actions are centralized. Your agent talks to the chat service, runs `get_embeddings` on your documents, and checks model status—all in one place. You just get the ability to run complex workflows without leaving your primary development environment.

The Power of Structured Model Operations

Before this MCP, managing a full AI stack meant manually tracking model versions and data schemas across multiple dashboards. If you needed to generate images based on text or wanted to compare two chat models side-by-side, it was a time-consuming mess of copy/pasting API keys.

Now, the agent handles all that plumbing. It knows how to manage prompt templates, run `text_to_image`, and track usage metrics automatically. You just get reliable, multi-faceted AI functionality built right into your workflow.

Support 24/7 support@vinkius.com ↗

Security Vinkius Trust Center ↗

SLA Service Level Agreement ↗

Report Listing Send Report ↗

What your AI can actually do with this

You can run complex workflows without leaving your agent interface. Think of it like a central control panel for high-performance AI models: trigger conversations with advanced chat completions using persistent context; generate the vector embeddings needed to power custom search indexes; or create detailed images just by typing a prompt.

The platform also lets you manage everything from model access and usage tracking to retrieving standardized prompt templates. If your work involves integrating multiple types of large language models into an application, this is what you need. Vinkius makes connecting these specialized services simple, letting your agent talk to Baidu's full suite of AI tools without needing complex API wrapper code.

It means the power of enterprise-grade Chinese LLMs is available right where you’re working.

Built · Hosted · Managed by Vinkius Baidu Qianfan MCP - LLM APIs, Embeddings & Image Generation

Server ID 019d841a-e362-73d8-9c0e-7b393e596f19

Vinkius Inspector

Compliance Grade A+

Score 100/100

Report View Report ↗

What Changes When You Connect

Control model access: Easily check which services are available using the list_models tool. This stops you from accidentally calling an endpoint that's deprecated or offline.

Power RAG systems: Generate accurate vector embeddings for any text using get_embeddings, letting your agent search private knowledge bases, not just its training data.

Consistent outputs: Manage and retrieve centralized prompt templates so all model calls—whether it’s writing code or generating a report—use the same high-quality structure.

Multi-modal capability: Beyond text, you can trigger advanced image generation tasks using text_to_image to create visual assets directly from your agent's workflow.

Deep operational visibility: Keep track of costs and usage by monitoring token consumption. This helps you budget and optimize your calls with the service status check.

See it in action

01 01

Building a Customer Support Chatbot

The agent needs to answer questions based on hundreds of internal manuals. First, it uses get_embeddings on the manual text chunks and indexes them. Then, when a user asks a question, the agent runs a conversation using chat_completions, referencing those indexed documents for accurate answers.

02 02

Creating Marketing Assets at Scale

A product manager needs 20 unique headers for an ad campaign. Instead of manually prompting an image generator, the agent loops through a list of concepts and executes text_to_image twenty times, collecting all assets in one go.

03 03

Comparing Model Performance

A development team wants to know if the new 'Speed' model is better than the old 'Turbo' model for summarization. They use list_models to confirm both are active, then run parallel tests using chat_completions on the exact same data set.

04 04

Auditing AI Costs

The finance team needs a report showing how much token usage was consumed last month. They programmatically check the model service status and consumption metrics to ensure billing is accurate before submitting the budget proposal.

The honest tradeoffs

Hardcoding API keys

Anti-pattern

A junior dev writes code that manually inserts the secret key into a function call, making it impossible to update when credentials change.

The Fix

Don't handle secrets directly. Use this MCP; Vinkius manages the connection and authentication flow for you, keeping your API keys secure and abstracted away from your core logic.

Treating all LLMs equally

Anti-pattern

Calling chat_completions without checking if the model is still supported or if it requires a specific endpoint name.

The Fix

Always check available services first. Use list_models to confirm your target model (like Ernie-4.0) is active before you send any message via chat_completions.

Manual embedding pipelines

Anti-pattern

Trying to use a general web search tool instead of generating vector embeddings for specific, proprietary documents.

The Fix

For internal knowledge only, you must generate vectors. Use get_embeddings on your document text before building any RAG pipeline so the agent searches exactly what you own.

When It Fits, When It Doesn't

Use this MCP if your core challenge is integrating multiple types of enterprise AI capabilities—chatting, image generation, and deep semantic search—into a single workflow. You need this when working with specialized or regional LLMs that require dedicated connection management.

Don't use it if you just need a simple, off-the-shelf text summarization tool for public data; then, a simpler, general API might suffice. Conversely, if your entire product revolves around building a complex knowledge graph over proprietary documents and generating accompanying visuals, this is the right choice because of its structured access to get_embeddings and text_to_image. It's built for depth and enterprise complexity.

Questions you might have

How do I use the chat_completions tool with my proprietary data? +

You first run get_embeddings on your documents to convert them into vectors. Then, when you call chat_completions, the agent uses those vectors as context for the model so it answers based on your private data.

Can I generate images using text_to_image within my workflow? +

Yes. You can trigger image generation tasks directly through this MCP, which is useful when you need to create visual content that matches the context of a conversation or article.

What if I want to check what models are available for chat_completions? +

You simply use list_models. This tool shows all the specific AI services you have access to, letting you choose the best one for your current task.

How do I know if a model is ready for production use? +

Use list_train_jobs to check the status of any model training. This confirms that the model has finished its job and is stable enough for real-world integration.

When I call `get_embeddings`, what credentials do I need for my AI agent? +

You must provide a valid API Key and Secret Key. These are generated in the Baidu Qianfan Console. Your agent uses these keys to authenticate your requests, ensuring secure access to the embedding service.

If I run `chat_completions`, what happens if my calls hit a rate limit? +

The API response will include an explicit error code detailing the rate violation. Your client should implement exponential backoff and retry logic to gracefully manage temporary service restrictions.

How do I use `list_datasets` to verify my data sources before RAG? +

The tool lists available datasets, providing IDs and names. To actually ingest or query the content, you'll need a separate workflow step that passes those listed identifiers to your retrieval pipeline.

How does `list_train_jobs` help me monitor custom model development? +

It fetches metadata about active and completed training runs. This lets you check the current status, review job IDs, and verify when a customized foundation model is ready for deployment.

Which version of Ernie Bot should I use for chat? +

For high performance and reasoning, use ernie-4.0-8k. For faster response times and cost efficiency, ernie-speed-128k or ernie-lite-8k are excellent choices.

Can I automatically generate embeddings for RAG? +

Yes! Use the get_embeddings tool with your text input. The agent will retrieve the vector representations from Baidu's embedding models, ready for indexing in your vector database.

How do I use prompt templates from the console? +

Use the list_prompt_templates tool to find your configured templates. You can then retrieve specific details using get_prompt_template to maintain consistency across your AI workflows.

Connect to your AI in seconds.

Get embeddings

List train jobs

Chat completions

Baidu Qianfan with 6 Tools

Make your AI actually useful.

Get Embeddings

List Train Jobs

Chat Completions

List Datasets

List Models

Text To Image

Security and governance baked right in.

Claude AI

Open Claude Settings

Add Custom Connector

Start a conversation

Claude Code

Open your terminal

Add the MCP Server

Start coding

Cursor

One-Click Install (Recommended)

Open Cursor Settings

Add New Server

Use in Composer

Antigravity

Configure Agent Environment

Bind the Endpoint

Execute

VS Code Copilot

One-Click Install (Recommended)

Open MCP Settings

Add Server Config

Windsurf

One-Click Install (Recommended)

Open Windsurf Settings

Add Server Endpoint

LangChain

Install Dependencies

Connect the Server

CrewAI

Define the Tool

Execute Task

Choose How to Get Started

Build Your Own

Make Your AI Do More

Works with Claude, ChatGPT, Cursor, and more

Dealing with AI services used to mean jumping between dashboards.

The Power of Structured Model Operations

What your AI can actually do with this

Here's how it actually works

Who is this actually for?

What Changes When You Connect

See it in action

Building a Customer Support Chatbot

Creating Marketing Assets at Scale

Comparing Model Performance

Auditing AI Costs

The honest tradeoffs

Hardcoding API keys

Treating all LLMs equally

Manual embedding pipelines

When It Fits, When It Doesn't

Questions you might have