4,500+ servers built on MCP Fusion
Vinkius

SenseCore Platform MCP. Control model runs, compute resources, and completions.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
See Vinkius in Action

Works with every AI agent you already use

…and any MCP-compatible client

SenseCore Platform MCP on Cursor AI Code Editor MCP Client SenseCore Platform MCP on Claude Desktop App MCP Integration SenseCore Platform MCP on OpenAI Agents SDK MCP Compatible SenseCore Platform MCP on Visual Studio Code MCP Extension Client SenseCore Platform MCP on GitHub Copilot AI Agent MCP Integration SenseCore Platform MCP on Google Gemini AI MCP Integration SenseCore Platform MCP on Lovable AI Development MCP Client SenseCore Platform MCP on Mistral AI Agents MCP Compatible SenseCore Platform MCP on Amazon AWS Bedrock MCP Support

Just plug in your AI agents and start using Vinkius.

SenseCore Platform provides 10 tools for managing industrial-grade AI infrastructure via MCP. Use it to run chat completions with SenseTime models, list and inspect available foundation models, monitor compute node availability, and track long-running inference task status directly from your agent.

What your AI agents can do

Chat completions

Sends a message through the SenseCore large language model engine.

Create assistant

Sets up and defines a new, dedicated AI assistant within your project.

Create message

Adds content to an existing conversation thread for historical context or continuation.

+ 8 more capabilities included
Orchestrate Model Conversations

Initiate chat completions using specific SenseTime foundation models while maintaining thread history and context.

Manage AI Assistant Lifecycles

Define, list, and interact with dedicated AI assistants to structure multi-step or guided conversations.

Discover Model Specs

Retrieve a complete catalog of foundation models, including technical details and versioning information.

Track Compute Utilization

Check the health metrics, latency (P99), and overall uptime for specific model services or GPU clusters.

Monitor Background Jobs

List and check the status of complex, long-running tasks like training runs or large inference batches.

Supported MCP Clients

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients
Free for Subscribers

Waiting for input…

AI Agent

SenseCore Platform MCP Server: 11 Tools for AI Infrastructure Control

Manage everything from model discovery to run status checks. Use these 11 tools to orchestrate complex SenseTime AI workflows directly from your agent.

chat019d847c

chat completions

Sends a message through the SenseCore large language model engine.

create019d847c

create assistant

Sets up and defines a new, dedicated AI assistant within your project.

create019d847c

create message

Adds content to an existing conversation thread for historical context or continuation.

create019d847c

create run

Starts executing a defined assistant against a specific message thread, initiating the core workflow.

create019d847c

create thread

Initializes a brand new, empty conversation history that can accept messages and runs.

get019d847c

get assistant details

Retrieves the full configuration settings for an existing AI assistant by its ID.

get019d847c

get run status

Checks and returns the current status (running, failed, complete) of a previously started assistant run.

list019d847c

list assistants

Lists all AI assistants that have been configured for use within your account's project.

list019d847c

list files

Retrieves a list of files that have been uploaded to the current workspace or thread context.

list019d847c

list messages

Fetches the complete message history for a given conversation thread ID.

list019d847c

list models

Gets a list of all available SenseNova foundation models and their versions.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

  • Import from OpenAPI, Swagger, or YAML specs
  • Create Agent Skills with progressive disclosure
  • Deploy to edge with MCPFusion framework
  • Built in DLP, auth, and compliance on every call
  • Real time usage dashboard and cost metering
  • Publish to catalog or keep private
Start building

Make Your AI Do More

Start with SenseCore Platform, then connect any of our 4,700+ other servers whenever your AI needs more. One click, no limits.

  • Use this MCP plus 4,700+ others, all in one place
  • Add new capabilities to your AI anytime you want
  • Every connection is secured and compliant automatically
  • Track usage and costs across all your servers
  • Works with Claude, ChatGPT, Cursor, and more
  • New servers added to the catalog every week

What you can do with this MCP connector

SenseCore Platform gives your AI agent the muscle it needs to run serious industrial-grade model workflows through MCP. You're not just connecting to an API; you're getting full control over the entire lifecycle, from defining a new assistant to tracking complex background jobs. Here’s what you can do with these tools.

Orchestrating Model Conversations

When your agent needs to talk, it uses chat_completions to send messages through SenseCore's large language model engine. You pass in a message and the specific SenseTime model you want to use; the engine handles the rest. If you need to maintain context—which you always do—you first initialize a brand new conversation using create_thread.

That gives your agent an empty history it can build on. Then, before running anything complex, you'll use list_messages to pull the entire message history for that thread ID so your agent remembers everything that was said. To add content at any point, even if you just want to inject a document reference or correct a previous turn, you call create_message, appending it directly to that existing conversation context.

When everything is set up—the thread exists, the messages are there, and the model is ready—you kick off the core action with create_run. This tool tells your agent to execute the defined assistant against the current message thread, initiating the full workflow run.

Managing AI Assistant Lifecycles

You don't just want a random chat; you need a structured process. That’s where dedicated assistants come in. You can set up and define a completely new, specialized AI assistant using create_assistant. This defines the rules for multi-step or guided conversations within your project. Once it's defined, if you need to know what kind of assistant is already running around, you run list_assistants to get a full roster of everything configured in your account.

If you want deep technical specs on a specific helper—like how many steps it has or which model it defaults to—you use get_assistant_details and pass in its ID. These tools let you build, manage, and audit the intelligence layer before any messages are even sent.

Discovering Model Specs and Files

If you don't know exactly what models you have access to, you run list_models. This gives you a complete catalog of every available SenseNova foundation model and tracks their specific versions. It’s your source of truth for what engines are running. Furthermore, if your workflow involves data—whether it’s reference documents or user uploads—you can use list_files to pull a list of all files uploaded to the current workspace or thread context, ensuring your agent knows exactly what data it's working with before it processes anything.

Monitoring and Tracking Jobs

Running an LLM isn't always instant. Sometimes you're talking about huge batches of inference or long training runs that take forever. You don't want to stare at a loading bar all day, so you use get_run_status. This checks the current status—whether it's running, if it failed, or if it completed successfully—for any complex assistant run you started earlier.

It keeps you in the loop without having to manually poll for updates.

In short, you get a complete operational view: you can define your workflow with dedicated assistants, pull the specific models and data files needed, start the conversation thread, kick off the job, and then constantly monitor its status until it's done. It’s comprehensive control over everything running on SenseCore.

How SenseCore Platform MCP Works

  1. 1 Subscribe to this server. Then, log into the SenseCore Console to get your API Key, Secret Key, Organization ID, and Project ID.
  2. 2 Insert those credentials into your AI client's configuration fields. This authenticates your agent against the SenseTime infrastructure.
  3. 3 Your agent can now call tools like list_models or chat_completions, executing operations directly against the managed resources.

The bottom line is you plug in your credentials once, and your AI client handles all the complex orchestration calls for model control and resource monitoring after that.

Who Is SenseCore Platform MCP For?

MLOps engineers who are tired of manually checking dashboards for job status or API developers building applications that need enterprise-grade compute management. This is for anyone whose code needs to reliably talk to complex, paid AI infrastructure.

Machine Learning Ops Engineer

Uses get_run_status and list_assistants to monitor batch inference jobs and ensure models deploy correctly.

Enterprise AI Developer

Calls create_assistant and chat_completions to build production-ready applications using SenseTime's specific foundation models.

Infrastructure Engineer

Uses service monitoring tools to check P99 latency and compute node availability before deploying a new model endpoint.

What Changes When You Connect

  • You don't have to guess if a job finished. Use get_run_status to check the real-time status of any long-running inference task, preventing stale data issues.
  • Stop rebuilding conversations from scratch. The combination of create_thread, list_messages, and chat_completions lets you manage full, persistent chat history automatically.
  • It's not just about chatting—it's about structure. Define specific workflows using create_assistant before running them with create_run. This makes the system repeatable.
  • Model discovery is fast. Instead of guessing model names, call list_models to get a definitive list and technical specs for every SenseNova version available.
  • Resource checks are built-in. You can monitor compute node availability and track quota usage directly via the platform's tools before your agent even tries to run.
  • Your code doesn't fail when context is missing. By using list_messages, you ensure your agent always has access to the full thread history, keeping conversations coherent.

Real-World Use Cases

01

Building a Customer Support Bot

A support bot needs memory. The agent first calls create_thread and then uses list_messages to pull the last 10 turns of chat history. It passes this context payload, along with the user's query, to chat_completions. The result is a highly contextual reply that references past details.

02

Automating Model Integration

An ML team needs to test three new models. Instead of calling endpoints individually, the agent uses list_models to validate all available options first. It then chooses a specific model (e.g., SenseChat-5) and uses get_assistant_details to confirm its configuration before running any tests.

03

Debugging Failed Batch Jobs

A large data batch job fails in the background. The agent calls create_run and then repeatedly hits get_run_status. This gives an immediate status update, telling the engineer exactly which step failed (e.g., 'Quota Exceeded') so they can fix it without manual console diving.

04

Creating a Multi-Step Workflow

A user needs to process a document through several steps. The agent first defines the entire flow using create_assistant. Then, when the user is ready, it calls create_run on that assistant template, allowing SenseCore to handle all the internal logic.

The Tradeoffs

Assuming Conversation History

Calling chat_completions directly without first running an agent or retrieving history. The model will respond generically and lose context.

Always start by establishing state. First, call create_thread to initialize the conversation container. Then, retrieve past turns using list_messages before calling chat_completions. This guarantees continuity.

Running a Job Without Knowing Status

Executing create_run and then immediately assuming success. The job might be stuck or failed, leading to bad data.

After calling create_run, you must poll the system. Use get_run_status until the status is 'complete' or 'failed'. Never trust an immediate response for long tasks.

Confusing Assistants and Threads

Trying to run a task on an arbitrary message ID instead of using the dedicated workflow tools. The system won't know what process to follow.

A conversation needs structure. Use create_assistant first to define the 'how.' Then, use that assistant ID with create_run against a create_thread. This is the correct execution path.

When It Fits, When It Doesn't

Use this MCP Server if your application's core functionality depends on managing state across multiple AI interactions or needs deep oversight into compute resources. Specifically, if you need to track job status (get_run_status), manage structured conversations (create_assistant), or monitor GPU usage/latency (via the platform tools), this is necessary.

Don't use it if you just need a simple API call—for example, if you only need to send one isolated prompt without remembering anything. In that case, a basic, single-call REST endpoint might suffice. But because your application needs enterprise reliability (i.e., knowing why a job failed or what the P99 latency was), this orchestration layer is mandatory.

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by SenseCore Platform. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS INFRASTRUCTURE

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on every call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

How we secure it →

Works with Claude, ChatGPT, Cursor, and more

The Model Context Protocol standardizes how applications expose capabilities to LLMs. Instead of operating in isolation, your AI gains direct access to external platforms, live data, and real-world actions through secure, standardized connections.

This server provides 11 capabilities that interface natively with Claude, ChatGPT, Cursor, and any MCP client. No middleware. No custom integration required.

Available Capabilities

chat_completions create_assistant create_message create_run create_thread get_assistant_details get_run_status list_assistants list_files list_messages list_models

Managing complex AI interactions shouldn't require writing three separate API calls just to remember context.

Today, if you build an agent that needs memory, you run into a mess of manual steps. You have to first check the conversation history endpoint (`list_messages`), then format those messages correctly, and finally send them all in a batch payload to the completion endpoint. It's error-prone and clutters your code.

With this MCP server, that whole sequence is encapsulated. You define the thread using `create_thread` and rely on the platform tools to maintain the state. Your agent just needs to call `chat_completions`, trusting that the underlying service handled all the history retrieval for you.

SenseCore Platform MCP Server: Structure your AI logic with `create_assistant`.

Without a defined assistant, every conversation is just a stream of messages. If your workflow requires specific tools or a predictable sequence (e.g., 'Summarize' then 'Translate'), you have to manage that flow manually in Python logic. By creating an assistant first (`create_assistant`), you build the guardrails into the platform itself. You define the process once, and then all subsequent runs are reliable and repeatable.

Common Questions About SenseCore Platform MCP

How do I check if a long-running inference job failed using get_run_status? +

You monitor the run status by repeatedly calling get_run_status with the run ID. If the status returns 'failed,' you'll also get error details explaining what caused the failure, letting you fix it fast.

Do I need to use create_assistant before chat_completions? +

No, not always. chat_completions works for simple chats. But if your conversation needs specific tools or a defined process (like data extraction), you should define it first using create_assistant.

What is the difference between create_thread and list_messages? +

create_thread makes an empty container for history. You call list_messages later to pull the actual content that has been added to that thread ID.

How do I get model specs without listing all models first? +

You use list_models to get a list of available names. Then, you can retrieve detailed specifications for any specific model name using the platform's dedicated inspection tools.

Before I can use `list_assistants`, what credentials do I need for my AI client? +

You must provide your API Key, Secret Key, and Organization ID. These are necessary authentication tokens that prove your agent has access to the SenseCore infrastructure.

If I hit rate limits while using `chat_completions`, how should my agent handle it? +

Your client must implement exponential backoff and retry logic. The API will return a specific status code (e.g., 429) when you exceed the defined request quota.

How do I manage files that are attached to a conversation using `list_files`? +

list_files retrieves all file IDs associated with your current project or thread context. You'll need the specific file ID when calling other tools, like sending messages.

I used `list_models` and saw 20 models; how do I know which one is best for my task? +

list_models only gives you names. For specific technical requirements—like parameter support or latency guarantees—you must call get_assistant_details using the model's ID.

Can I automatically list all available models in my SenseCore project? +

Yes! Use the list_models tool. Your agent will retrieve a complete list of all SenseTime foundation models and specialized variants currently active in your account.

How do I check the health status of my deployed model services? +

Use the get_service_health tool with the specific Service ID. The agent will return real-time metrics on availability, throughput, and average latency.

Can I monitor GPU resource utilization via the AI agent? +

Yes! The get_resource_usage tool retrieves granular metrics on compute node utilization and remaining quota for your specific project environment.

More in this category

You might also like

Built & Managed by Vinkius 30s setup 11 tools

We've already built the connector for SenseCore Platform. Just plug in your AI agents and start using Vinkius.

No hosting. No infrastructure. No complex setup.
All 11 tools are live and waiting. You're up and running in seconds.

Claude Claude
ChatGPT ChatGPT
Cursor Cursor
Gemini Gemini
Windsurf Windsurf
VS Code VS Code
JetBrains JetBrains
Vercel Vercel
+ other MCP clients

Vinkius gives your AI agents access to the full catalog of app connectors, all fully managed, secure, and enterprise-ready. One subscription, every tool you need.

Zero hosting required Full MCP catalog included Enterprise-grade security Auto-updated by Vinkius

Built, hosted, and secured by Vinkius. You just connect and go.